Introduction

Number of entries
New entries 1,475,502
Updated entries 12,133,305
Unchanged entries 33,843,506
Total 47,452,313
Entries with updated sequences 3,856
With a fragmented AA sequence 6,460,303
With known alternative products 0
Protein Existence (PE) Number of entries
1 Evidence at protein level 50,068
2 Evidence at transcript level 966,872
3 Inferred from homology 10,174,853
4 Predicted 36,260,520
5 Uncertain 0
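
The two breakdowns above should each add up to the reported total. A minimal sketch of that cross-check follows; the figures are copied from the tables above, and the names `status_counts`, `pe_counts` and `matches_total` are illustrative only, not part of any UniProt tool.

```python
# Counts copied from the "Number of entries" and "Protein Existence" tables.
status_counts = {
    "New entries": 1_475_502,
    "Updated entries": 12_133_305,
    "Unchanged entries": 33_843_506,
}
pe_counts = {
    "1 Evidence at protein level": 50_068,
    "2 Evidence at transcript level": 966_872,
    "3 Inferred from homology": 10_174_853,
    "4 Predicted": 36_260_520,
    "5 Uncertain": 0,
}
reported_total = 47_452_313


def matches_total(counts, total):
    """Return True when the category counts sum exactly to the reported total."""
    return sum(counts.values()) == total


print("Entry status breakdown consistent:", matches_total(status_counts, reported_total))
print("Protein existence breakdown consistent:", matches_total(pe_counts, reported_total))
```

The same check does not apply to the species counts further down, where a species can appear in more than one category.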

Taxonomic Origin


Statistics on the number of species

Number of species in
New entries 9,316
Updated entries 102,062
Unchanged entries 371,248
Total 404,743

Sequence data

The shortest sequence is C4PYW0 (2 AA); the longest is Q3ASY8 (36,805 AA).

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 0 0
Alternative products 0 0
Biophysicochemical properties 0 0
Biotechnological use 0 0
Catalytic activity 4,966,466 4,600,247
Caution 22,199,902 22,170,498
Cofactor 4,341,440 2,206,799
Developmental stage 0 0
Involvement in disease 0 0
Disruption phenotype 0 0
Domain 257,752 245,050
Enzyme regulation 102,504 102,504
Function 5,614,987 5,411,298
Induction 26,548 26,548
Mass spectrometry 0 0
Miscellaneous 118,858 116,378
Pathway 2,547,761 2,196,657
Pharmaceutical use 0 0
Polymorphism 0 0
Post-translational modification 228,329 206,297
RNA Editing 0 0
Sequence caution 0 0
Sequence similarities 15,117,555 12,706,748
Subcellular Location 0 0
Subunit structure 2,873,696 2,858,270
Tissue specificity 0 0
Toxic dose 0 0

Sequence Annotation (features)

Annotations Entries
Molecule processing 1,591,999 783,259
Chain 888,150 690,661
Initiator methionine 12,128 12,128
Peptide 39 39
Propeptide 8,706 8,706
Signal peptide 680,105 675,114
Transit peptide 2,871 2,859
Regions 6,843,200 2,280,481
Calcium binding 0 0
Coiled-coil 73,784 39,505
Compositional bias 10,057 9,887
DNA binding 50,120 48,121
Domain 759,081 589,465
Motif 235,088 155,428
Nucleotide binding 1,451,068 854,911
Repeat 78,279 19,093
Region 1,255,466 673,763
Topological domain 157,285 39,868
Transmembrane 2,697,905 514,520
Zinc finger 74,745 62,459
Sites 10,073,977 2,230,618
Active site 1,888,588 1,170,012
Metal binding 3,712,276 966,609
Binding site 3,984,781 1,038,054
Other 488,332 257,249
Amino acid modifications 411,627 335,401
Cross-link 9,807 6,792
Disulfide bond 88,226 64,262
Glycosylation 836 307
Lipidation 43,708 21,854
Modified residue 267,055 247,135
Non-standard residue 1,995 1,852
Experimental info 10,048,823 6,471,729
Mutagenesis 0 0
Non-adjacent residues 0 0
Non-terminal residue 10,011,695 6,464,481
Sequence conflict 0 0
Sequence uncertainty 37,128 31,776
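
The two columns can be combined into an average number of annotations per annotated entry. The sketch below assumes that "Annotations" counts individual annotations and "Entries" counts the entries carrying at least one of them; that reading, and the helper name `mean_per_entry`, are assumptions rather than something stated on this page. The three rows are copied from the table above.

```python
# (annotations, entries) pairs copied from the sequence annotation table.
features = {
    "Transmembrane": (2_697_905, 514_520),
    "Signal peptide": (680_105, 675_114),
    "Repeat": (78_279, 19_093),
}


def mean_per_entry(annotations, entries):
    """Average annotations per annotated entry; None when no entry is annotated."""
    if entries == 0:
        return None
    return annotations / entries


for name, (annotations, entries) in features.items():
    ratio = mean_per_entry(annotations, entries)
    print(f"{name}: {ratio:.2f} annotations per annotated entry")
```

Under that assumption, a ratio close to 1 (signal peptides) means each annotated entry usually carries a single feature of that type, while a higher ratio (transmembrane regions) means annotated entries typically carry several.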

Citation usage

Citation type Citations Entries
Submission 34,878,875 30,986,141
Journal article 22,546,096 21,064,139
Book 9,458 9,395
Thesis 19,002 18,943
Patent 1 1
Unpublished observations 0 0
Online journal article 0 0

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 569,601 427,941

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
EMBL 53,432,727 46,300,783
PIR 170,681 137,862
RefSeq 17,326,691 14,247,642
UniGene 557,198 518,640
3D structure databases
PDB 25,752 13,708
PDBsum 25,269 13,356
ProteinModelPortal 7,066,102 7,066,102
SMR 1,259,637 1,259,637
Protein-protein interaction databases
DIP 3,171 3,166
IntAct 15,595 15,595
MINT 10,048 10,047
STRING 3,112,380 3,112,275
Chemistry
BindingDB 32,434 32,434
ChEMBL 797 797
DrugBank 144 56
GuidetoPHARMACOLOGY 18 18
Protein family/group databases
Allergome 3,774 3,093
CAZy 73,600 69,164
MEROPS 201,091 201,091
MoonProt 5 5
PeroxiBase 2,552 2,544
REBASE 37,325 37,305
TCDB 6,174 6,165
mycoCLAP 438 438
PTM databases
PhosphoSite 1,125 1,125
Polymorphism and mutation databases
2D gel databases
COMPLUYEAST-2DPAGE 4 4
OGP 3 3
REPRODUCTION-2DPAGE 65 64
SWISS-2DPAGE 28 28
World-2DPAGE 669 664
Proteomic databases
MaxQB 2,993 2,993
PRIDE 361,993 361,993
PaxDb 34,316 34,316
PeptideAtlas 127 127
ProMEX 3,447 3,447
Protocols and materials databases
DNASU 41,774 41,448
Genome annotation databases
Ensembl 1,165,508 1,149,936
EnsemblBacteria 25,525,899 24,167,839
EnsemblFungi 457,300 454,778
EnsemblMetazoa 936,272 917,198
EnsemblPlants 1,109,602 1,064,442
EnsemblProtists 249,672 244,028
GeneID 5,243,211 5,145,357
KEGG 9,901,905 9,658,501
PATRIC 6,482,288 6,482,098
UCSC 55,959 55,711
VectorBase 78,240 77,723
Organism-specific databases
ArachnoServer 170 170
CGD 6,728 6,728
CTD 596,382 594,821
ConoServer 159 159
EuPathDB 353,301 353,276
FlyBase 198,079 196,620
GenoList 14,726 14,453
Gramene 221,946 221,946
H-InvDB 592 445
HGNC 47,649 47,565
LegioList 5,138 5,110
Leproma 1,272 1,270
MGI 54,035 53,669
MIM 4 4
PharmGKB 3,181 3,181
PomBase 3 3
PseudoCAP 4,493 4,487
RGD 22,337 21,218
SGD 7 7
TAIR 20,624 20,507
TubercuList 1,049 1,048
WormBase 43,191 43,067
Xenbase 24,959 24,901
ZFIN 47,519 47,448
dictyBase 7,992 7,770
euHCVdb 75,267 75,264
Phylogenomic databases
GeneTree 1,126,517 1,126,478
HOGENOM 3,565,882 3,565,844
HOVERGEN 302,045 302,036
InParanoid 2,721,672 2,721,672
KO 4,008,761 3,990,655
OMA 5,857,878 5,857,875
OrthoDB 5,087,914 5,087,914
PhylomeDB 476,717 476,717
TreeFam 587,378 587,375
eggNOG 2,728,972 2,728,937
Enzyme and pathway databases
BRENDA 9,891 9,597
BioCyc 5,128,035 5,055,504
Reactome 205,344 71,323
SABIO-RK 531 531
SignaLink 4,338 4,338
UniPathway 2,546,518 2,195,414
Other
ChiTaRS 87,183 87,022
EvolutionaryTrace 7,213 7,213
GenomeRNAi 23,255 23,255
NextBio 199,487 199,485
PMAP-CutDB 165 165
PRO 25,522 25,522
Gene expression databases
Bgee 104,021 104,021
ExpressionAtlas 286,719 286,719
Genevestigator 81,174 81,170
Ontologies
GO 82,025,175 27,594,848
Family and domain databases
Gene3D 27,763,553 21,843,337
HAMAP 4,517,680 4,458,070
InterPro 105,496,188 36,562,559
PANTHER 6,955,527 6,680,861
PIRSF 3,911,034 3,875,950
PRINTS 6,803,213 6,078,635
PROSITE 23,460,839 15,494,838
Pfam 46,283,738 33,736,350
ProDom 830,614 787,840
SMART 10,673,559 8,118,667
SUPFAM 26,369,588 21,320,692
TIGRFAMs 9,535,595 8,734,498

Web resource

No UniProtKB/TrEMBL entries have a link to a webpage of general interest on the protein.

Amino acid distribution statistics

  • 8.6% Alanine
  • 5.5% Arginine
  • 4.0% Asparagine
  • 5.3% Aspartate
  • 1.2% Cysteine
  • 3.9% Glutamine
  • 6.2% Glutamate
  • 7.0% Glycine
  • 2.2% Histidine
  • 5.8% Isoleucine
  • 9.8% Leucine
  • 5.2% Lysine
  • 2.4% Methionine
  • 3.9% Phenylalanine
  • 4.7% Proline
  • 6.8% Serine
  • 5.5% Threonine
  • 1.2% Tryptophan
  • 3.0% Tyrosine
  • 6.7% Valine

Residue classes (chart legend): Aliphatic, Acidic, Small hydroxy, Basic, Amide, Aromatic, Sulfur
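
A distribution of this kind can be obtained by counting residues across all sequences and dividing by the total number of residues. The sketch below shows one way to do that; it is not the method used to produce the figures above, the function name `residue_percentages` is illustrative, and the two toy sequences are made up for the example.

```python
from collections import Counter


def residue_percentages(sequences):
    """Return each residue's share (in percent) of all residues in the input."""
    counts = Counter()
    for seq in sequences:
        # One-letter amino acid codes; normalise case before counting.
        counts.update(seq.upper())
    total = sum(counts.values())
    if total == 0:
        return {}
    return {residue: 100.0 * n / total for residue, n in counts.items()}


# Made-up toy sequences, not taken from UniProtKB.
toy_sequences = ["MALA", "LLGA"]
percentages = residue_percentages(toy_sequences)

# Print the residues from most to least frequent.
for residue, pct in sorted(percentages.items(), key=lambda item: -item[1]):
    print(f"{pct:.1f}% {residue}")
```

Pooling all residues before dividing weights long sequences more heavily than short ones; averaging per-sequence percentages would give a different result.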

Miscellaneous Statistics

897,116 entries are encoded on a mitochondrion, and 398,822 are encoded on a plasmid.

369,277 entries are encoded on a plastid, of which 772 are encoded on apicoplasts, 322,630 on chloroplasts, 1 on organellar chromatophores, 10 on cyanelles, 1,607 on non-photosynthetic plastids and 2,565 on unspecified types of plastid.