Introduction

Number of entries
New entries 3,703,055
Updated entries 37,520,974
Unchanged entries 52,012,957
Total 93,236,986
Entries with updated sequences 2,391
With a fragmented AA sequence 9,290,163
With known alternative products 0
Protein Existence (PE) Number of entries
1 Evidence at protein level 135,226
2 Evidence at transcript level 1,097,343
3 Inferred from homology 22,623,111
4 Predicted 69,381,306
5 Uncertain 0
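
The two tables above can be cross-checked with simple arithmetic: the new, updated and unchanged entries should add up to the total, and so should the five protein existence levels. The following is a minimal sketch of that check. The figures are copied from the tables above, while the variable names and the `shares` helper are illustrative assumptions, not part of any UniProt tooling.

```python
# Figures copied from the "Number of entries" and "Protein Existence (PE)" tables.
entry_counts = {
    "New": 3_703_055,
    "Updated": 37_520_974,
    "Unchanged": 52_012_957,
}
total_entries = 93_236_986

protein_existence = {
    "Evidence at protein level": 135_226,
    "Evidence at transcript level": 1_097_343,
    "Inferred from homology": 22_623_111,
    "Predicted": 69_381_306,
    "Uncertain": 0,
}


def shares(counts, total):
    """Return each count as a percentage of total, rounded to 2 decimals."""
    return {name: round(100 * n / total, 2) for name, n in counts.items()}


# Both breakdowns should account for every entry exactly once.
assert sum(entry_counts.values()) == total_entries
assert sum(protein_existence.values()) == total_entries

pe_shares = shares(protein_existence, total_entries)
for name, pct in pe_shares.items():
    print(f"{name}: {pct}%")
```

Expressing the protein existence levels as shares of the total makes it easier to compare releases of different sizes than the raw counts alone.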

Taxonomic Origin


Statistics on the number of species

Number of species in
New entries 20,932
Updated entries 478,124
Unchanged entries 292,166
Total 590,351

Sequence data

The shortest sequence is C4PYW0 at 2 AA, while the longest sequence is A0A1V4K6M4 at 36,991 AA.

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 0 0
Alternative products 0 0
Biophysicochemical properties 0 0
Biotechnological use 0 0
Catalytic activity 10,661,721 9,656,989
Caution 48,909,585 47,757,375
Cofactor 7,398,677 0
Developmental stage 0 0
Involvement in disease 0 0
Disruption phenotype 0 0
Domain 655,795 628,818
Enzyme regulation 201,248 201,246
Function 12,073,091 11,539,985
Induction 42,873 42,873
Mass spectrometry 0 0
Miscellaneous 363,129 357,716
Pathway 5,351,897 4,848,236
Pharmaceutical use 0 0
Polymorphism 0 0
Post-translational modification 470,416 423,705
RNA Editing 0 0
Sequence caution 0 0
Sequence similarities 22,738,288 22,455,134
Subcellular Location 0 0
Subunit structure 6,302,194 6,224,675
Tissue specificity 0 0
Toxic dose 0 0

Sequence Annotation (features)

Annotations Entries
Molecule processing 11,770,122 5,898,536
Chain 5,869,579 5,867,324
Initiator methionine 22,997 22,997
Peptide 91 91
Propeptide 12,236 12,236
Signal peptide 5,865,123 5,865,114
Transit peptide 96 96
Regions 169,949,086 59,454,474
Calcium binding 213,687 105,310
Coiled-coil 6,078,412 4,084,449
Compositional bias 3,634 3,634
DNA binding 2,345,604 2,078,937
Domain 65,380,411 47,144,498
Motif 580,173 435,397
Nucleotide binding 4,824,527 3,107,225
Repeat 3,674,839 886,709
Region 3,279,238 1,722,494
Topological domain 92,272 30,757
Transmembrane 83,147,250 18,352,449
Zinc finger 328,053 257,928
Sites 26,230,668 5,782,494
Active site 5,205,764 3,216,562
Metal binding 8,781,465 2,356,547
Binding site 11,016,900 2,852,719
Other 1,226,539 703,618
Amino acid modifications 2,497,119 1,714,439
Cross-link 20,074 18,376
Disulfide bond 943,730 260,363
Glycosylation 2,405 1,420
Lipidation 16,860 15,146
Modified residue 1,510,953 1,431,756
Non-standard residue 3,097 2,906
Experimental info 14,534,388 9,343,108
Mutagenesis 0 0
Non-adjacent residues 0 0
Non-terminal residue 14,469,812 9,329,223
Sequence conflict 0 0
Sequence uncertainty 64,576 54,495

Citation usage

Citation type Citations Entries
Submission 75,669,975 65,875,471
Journal article 34,685,726 32,757,928
Book 11,310 11,245
Thesis 13,021 12,962
Patent 1 1
Unpublished observations 0 0
Online journal article 0 0

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 693,341 505,397

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
EMBL 102,768,916 90,166,734
PIR 163,147 130,903
RefSeq 42,394,886 41,444,137
UniGene 857,422 730,197
3D structure databases
DisProt 96 96
PDB 34,483 17,122
PDBsum 34,038 16,780
ProteinModelPortal 7,489,406 7,489,406
SMR 1,101,936 1,101,936
Protein-protein interaction databases
CORUM 126 126
DIP 3,245 3,244
ELM 119 119
IntAct 25,922 25,922
MINT 9,731 9,730
STRING 6,532,403 6,532,397
Chemistry
BindingDB 202 202
ChEMBL 884 884
DrugBank 617 342
GuidetoPHARMACOLOGY 4 4
SwissLipids 82 82
Protein family/group databases
Allergome 3,881 3,144
CAZy 129,390 121,089
ESTHER 74,137 73,856
MEROPS 249,450 249,449
MoonProt 3 3
PeroxiBase 2,481 2,473
REBASE 32,151 32,132
TCDB 7,933 7,918
mycoCLAP 447 447
PTM databases
PhosphoSitePlus 3,539 3,539
SwissPalm 1,218 1,218
UniCarbKB 17 17
iPTMnet 7,342 7,342
Polymorphism and mutation databases
2D gel databases
COMPLUYEAST-2DPAGE 4 4
OGP 3 3
REPRODUCTION-2DPAGE 63 62
SWISS-2DPAGE 1 1
World-2DPAGE 316 311
Proteomic databases
EPD 9,433 9,433
MaxQB 41,328 41,328
PRIDE 274,829 274,829
PaxDb 594,383 594,383
PeptideAtlas 117,331 117,331
ProMEX 2,694 2,694
TopDownProteomics 282 282
Protocols and materials databases
DNASU 41,355 40,916
Genome annotation databases
Ensembl 1,234,877 1,202,743
EnsemblBacteria 40,679,071 38,457,531
EnsemblFungi 5,489,745 5,341,625
EnsemblMetazoa 1,091,648 1,063,819
EnsemblPlants 1,721,945 1,606,222
EnsemblProtists 1,858,057 1,749,163
GeneDB 114,834 113,054
GeneID 9,774,317 9,666,700
Gramene 1,722,709 1,606,957
KEGG 14,459,094 14,053,555
PATRIC 18,229,299 18,229,216
UCSC 93,944 93,749
VectorBase 555,625 540,626
WBParaSite 854,114 845,707
Organism-specific databases
ArachnoServer 201 201
Araport 19,514 19,430
CGD 20,814 20,748
CTD 899,411 897,498
ConoServer 160 160
EuPathDB 634,840 634,690
FlyBase 222,656 221,280
GeneCards 1,539 1,518
H-InvDB 590 443
HGNC 50,584 50,489
LegioList 2,496 2,483
Leproma 1,271 1,269
MGI 60,472 60,095
MIM 4 4
MalaCards 9 9
OpenTargets 48,627 48,578
PharmGKB 3,154 3,154
PomBase 31 31
PseudoCAP 4,457 4,451
RGD 25,132 23,789
SGD 7 7
TAIR 15,743 15,665
TubercuList 1,004 1,003
WormBase 65,741 65,351
Xenbase 34,317 34,257
ZFIN 52,873 52,521
dictyBase 7,987 7,765
euHCVdb 75,267 75,264
Phylogenomic databases
GeneTree 1,190,663 1,190,536
HOGENOM 3,036,286 3,036,197
HOVERGEN 300,575 300,563
InParanoid 2,479,123 2,479,023
KO 6,199,563 6,174,305
OMA 6,449,598 6,449,589
OrthoDB 14,545,135 14,545,074
PhylomeDB 469,634 469,634
TreeFam 568,012 567,998
eggNOG 14,168,558 7,101,662
Enzyme and pathway databases
BRENDA 9,625 9,332
BioCyc 3,454,525 3,453,288
Reactome 239,090 85,526
SABIO-RK 607 607
SIGNOR 8 8
SignaLink 3,814 3,814
UniPathway 5,341,949 4,838,288
Other
ChiTaRS 86,117 85,958
EvolutionaryTrace 6,010 6,010
GenomeRNAi 30,257 30,257
PMAP-CutDB 131 131
PRO 2,214 2,214
Gene expression databases
Bgee 547,026 547,018
CollecTF 200 200
ExpressionAtlas 395,988 395,988
Genevisible 15,922 15,922
Ontologies
Family and domain databases
CDD 15,955,901 14,066,751
Gene3D 40,984,471 34,224,112
HAMAP 9,249,222 9,132,616
InterPro 230,781,334 72,141,337
PANTHER 14,211,685 13,750,185
PIRSF 7,955,799 7,890,920
PRINTS 12,465,523 11,238,953
PROSITE 46,316,330 30,778,412
Pfam 90,060,637 65,427,224
ProDom 1,407,083 1,340,687
SFLD 768,710 424,232
SMART 21,975,961 16,724,991
SUPFAM 60,126,063 47,433,564
TIGRFAMs 18,740,386 17,223,316

Web resource

0 UniProtKB/TrEMBL entries have at least one link to a webpage of general interest on the protein.

Amino acid distribution statistics

  • 9.0% Alanine
  • 5.6% Arginine
  • 3.9% Asparagine
  • 5.4% Aspartate
  • 1.2% Cysteine
  • 3.8% Glutamine
  • 6.1% Glutamate
  • 7.2% Glycine
  • 2.2% Histidine
  • 5.7% Isoleucine
  • 9.8% Leucine
  • 5.0% Lysine
  • 2.3% Methionine
  • 3.9% Phenylalanine
  • 4.8% Proline
  • 6.7% Serine
  • 5.5% Threonine
  • 1.2% Tryptophan
  • 2.9% Tyrosine
  • 6.8% Valine
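
As a quick sanity check on the distribution above, the percentages can be summed and ranked. This is a minimal sketch: the values are copied from the list, and the variable names are illustrative assumptions. The listed figures are rounded to one decimal place and cover only the 20 standard amino acids, so the sum need not be exactly 100.

```python
# Percentages copied from the amino acid distribution list.
composition = {
    "Alanine": 9.0, "Arginine": 5.6, "Asparagine": 3.9, "Aspartate": 5.4,
    "Cysteine": 1.2, "Glutamine": 3.8, "Glutamate": 6.1, "Glycine": 7.2,
    "Histidine": 2.2, "Isoleucine": 5.7, "Leucine": 9.8, "Lysine": 5.0,
    "Methionine": 2.3, "Phenylalanine": 3.9, "Proline": 4.8, "Serine": 6.7,
    "Threonine": 5.5, "Tryptophan": 1.2, "Tyrosine": 2.9, "Valine": 6.8,
}

# Round to one decimal to absorb floating-point noise in the sum.
total_pct = round(sum(composition.values()), 1)

# Rank residues from most to least frequent and keep the top three.
most_common = sorted(composition, key=composition.get, reverse=True)[:3]

print(f"Sum of listed percentages: {total_pct}%")
print("Most frequent residues:", ", ".join(most_common))
```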

Miscellaneous Statistics

1,698,213 entries are encoded on a mitochondrion, and 684,331 are encoded on a plasmid.

662,142 entries are encoded on a plastid, of which 785 are encoded on apicoplasts, 551,504 on chloroplasts, 1 on organellar chromatophores, 8 on cyanelles, 1,601 on non-photosynthetic plastids and 3,197 on unspecified types of plastid.