Introduction

Number of entries
New entries 1,427,319
Updated entries 41,193,210
Unchanged entries 48,240,376
Total 90,860,905
Entries with updated sequences 12,947
With a fragmented AA sequence 6,900,697
With known alternative products 0
Protein Existence (PE) Number of entries
1 Evidence at protein level 41,908
2 Evidence at transcript level 1,028,346
3 Inferred from homology 22,457,947
4 Predicted 67,332,704
5 Uncertain 0
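
The two tables above can be cross-checked with simple arithmetic: the new, updated and unchanged counts should add up to the total, and so should the protein existence counts. The following is a minimal sketch in Python. The variable and function names are illustrative and not part of any UniProt tool; the figures are copied from the tables above.

```python
# Entry counts copied from the tables above (names are illustrative).
entry_status = {
    "New entries": 1_427_319,
    "Updated entries": 41_193_210,
    "Unchanged entries": 48_240_376,
}
total_entries = 90_860_905

protein_existence = {
    "Evidence at protein level": 41_908,
    "Evidence at transcript level": 1_028_346,
    "Inferred from homology": 22_457_947,
    "Predicted": 67_332_704,
    "Uncertain": 0,
}


def shares(counts, total):
    """Return each count as a percentage of total, rounded to two decimals."""
    return {label: round(100 * n / total, 2) for label, n in counts.items()}


# Both breakdowns should partition the same set of entries.
status_sum = sum(entry_status.values())
pe_sum = sum(protein_existence.values())
print("status breakdown matches total:", status_sum == total_entries)
print("PE breakdown matches total:", pe_sum == total_entries)

pe_shares = shares(protein_existence, total_entries)
for label, pct in pe_shares.items():
    print(f"{label}: {pct}%")
```

Both sums match the reported total, so each breakdown accounts for every entry exactly once.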

Taxonomic Origin


Statistics on the number of species

Number of species in
New entries 7,742
Updated entries 248,261
Unchanged entries 254,149
Total 388,813

Sequence data

The shortest sequence is C4PYW0 at 2 AA, while the longest is Q3ASY8 at 36,805 AA.

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 0 0
Alternative products 0 0
Biophysicochemical properties 0 0
Biotechnological use 0 0
Catalytic activity 11,460,751 10,532,876
Caution 64,122,535 64,069,822
Cofactor 11,961,622 5,548,295
Developmental stage 0 0
Involvement in disease 0 0
Disruption phenotype 0 0
Domain 606,495 583,584
Enzyme regulation 219,620 219,620
Function 13,331,084 12,663,330
Induction 98,393 98,393
Mass spectrometry 0 0
Miscellaneous 347,601 345,375
Pathway 5,965,775 4,941,396
Pharmaceutical use 0 0
Polymorphism 0 0
Post-translational modification 468,692 421,597
RNA Editing 0 0
Sequence caution 0 0
Sequence similarities 35,676,406 27,512,209
Subcellular Location 0 0
Subunit structure 7,281,755 7,199,036
Tissue specificity 0 0
Toxic dose 0 0

Sequence Annotation (features)

Annotations Entries
Molecule processing 1,881,261 1,017,326
Chain 941,999 749,954
Initiator methionine 30,423 30,423
Peptide 133 133
Propeptide 11,908 11,908
Signal peptide 894,465 888,552
Transit peptide 2,333 2,321
Regions 18,851,552 6,284,499
Calcium binding 0 0
Coiled-coil 199,220 113,105
Compositional bias 30,855 30,684
DNA binding 170,228 159,704
Domain 2,100,079 1,681,937
Motif 624,049 400,558
Nucleotide binding 4,282,407 2,516,359
Repeat 134,076 31,135
Region 3,520,100 1,921,081
Topological domain 703,550 145,620
Transmembrane 6,911,983 1,228,270
Zinc finger 174,599 156,934
Sites 28,403,721 6,310,995
Active site 5,326,255 3,352,304
Metal binding 10,144,837 2,693,427
Binding site 11,436,438 2,906,120
Other 1,496,191 757,647
Amino acid modifications 1,215,582 988,992
Cross-link 30,844 21,935
Disulfide bond 251,053 188,865
Glycosylation 398 147
Lipidation 162,462 81,231
Modified residue 768,739 711,238
Non-standard residue 2,086 1,943
Experimental info 10,386,473 6,912,669
Mutagenesis 0 0
Non-adjacent residues 0 0
Non-terminal residue 10,350,277 6,905,509
Sequence conflict 0 0
Sequence uncertainty 36,196 30,995

Citation usage

Citation type Citations Entries
Submission 75,363,680 70,350,311
Journal article 28,616,579 27,088,203
Book 9,361 9,298
Thesis 18,978 18,919
Patent 1 1
Unpublished observations 0 0
Online journal article 0 0

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 557,108 425,207

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
EMBL 98,232,427 89,648,092
PIR 170,924 138,097
RefSeq 18,477,306 15,033,543
UniGene 557,711 519,760
3D structure databases
PDB 25,208 13,403
PDBsum 25,637 13,618
ProteinModelPortal 31,320,378 31,320,378
SMR 8,561,321 8,561,321
Protein-protein interaction databases
DIP 3,130 3,125
IntAct 16,063 16,063
MINT 10,086 10,085
STRING 3,127,409 3,127,235
Chemistry
BindingDB 89,433 89,433
ChEMBL 784 784
DrugBank 145 57
GuidetoPHARMACOLOGY 21 21
Protein family/group databases
Allergome 3,810 3,141
CAZy 73,696 69,249
MEROPS 225,697 225,696
MoonProt 7 7
PeroxiBase 2,583 2,575
PptaseDB 38 36
REBASE 48,250 48,245
TCDB 6,379 6,370
mycoCLAP 424 424
PTM databases
PhosSite 888 876
PhosphoSite 1,078 1,078
Polymorphism databases
2D gel databases
COMPLUYEAST-2DPAGE 4 4
OGP 3 3
REPRODUCTION-2DPAGE 65 64
SWISS-2DPAGE 28 28
World-2DPAGE 671 666
Proteomic databases
MaxQB 2,517 2,517
PRIDE 913,939 913,939
PaxDb 28,250 28,248
PeptideAtlas 127 127
ProMEX 3,475 3,475
Protocols and materials databases
DNASU 41,821 41,495
Genome annotation databases
Ensembl 1,164,735 1,149,227
EnsemblBacteria 68,374,382 67,153,798
EnsemblFungi 467,817 465,295
EnsemblMetazoa 917,460 901,180
EnsemblPlants 845,013 804,364
EnsemblProtists 190,998 188,504
GeneID 12,125,914 11,834,877
KEGG 10,760,344 10,531,543
PATRIC 8,243,954 8,243,757
UCSC 56,312 56,102
VectorBase 78,240 77,723
Organism-specific databases
ArachnoServer 99 99
CGD 6,737 6,737
CTD 467,158 465,930
ConoServer 159 159
EuPathDB 161,145 161,144
FlyBase 199,513 198,038
GenoList 14,727 14,454
Gramene 194,233 194,233
H-InvDB 594 447
HGNC 47,085 47,006
LegioList 5,138 5,110
Leproma 1,272 1,270
MGI 53,842 53,485
MIM 4 4
PharmGKB 3,186 3,186
PomBase 2 2
PseudoCAP 4,494 4,488
RGD 21,967 20,847
SGD 7 7
TAIR 20,938 20,821
TubercuList 1,063 1,062
WormBase 43,348 43,224
Xenbase 25,018 24,960
ZFIN 47,398 47,317
dictyBase 7,993 7,771
euHCVdb 75,267 75,264
Phylogenomic databases
GeneTree 1,124,472 1,124,434
HOGENOM 3,640,943 3,640,897
HOVERGEN 302,197 302,188
InParanoid 2,678,965 2,678,965
KO 4,609,225 4,586,091
OMA 8,487,074 8,487,072
OrthoDB 5,176,146 5,176,143
PhylomeDB 423,536 423,536
TreeFam 587,428 587,426
eggNOG 2,749,205 2,749,170
Enzyme and pathway databases
BRENDA 2,558 2,532
BioCyc 5,767,171 5,689,760
Reactome 210,280 73,504
SABIO-RK 501 501
SignaLink 4,103 4,098
UniPathway 5,951,471 4,927,092
Other
ChiTaRS 87,346 87,186
EvolutionaryTrace 7,857 7,857
GenomeRNAi 23,383 23,383
NextBio 200,129 200,122
PMAP-CutDB 199 199
PRO 26,793 26,792
Gene expression databases
Bgee 94,230 94,230
ExpressionAtlas 266,863 266,863
Genevestigator 81,646 81,642
Ontologies
GO 162,711,977 56,700,010
Family and domain databases
Gene3D 54,836,586 42,787,917
HAMAP 12,208,726 12,036,373
InterPro 212,118,982 72,064,350
PANTHER 12,308,723 11,896,327
PIRSF 9,597,394 9,520,134
PRINTS 12,695,751 11,421,911
PROSITE 44,221,150 29,954,386
Pfam 92,182,251 67,177,483
ProDom 1,740,246 1,698,516
SMART 18,978,409 14,486,741
SUPFAM 51,978,748 41,852,565
TIGRFAMs 24,740,737 22,576,151
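
Each row of the cross-reference table gives two numbers that can be combined into more readable ratios. The sketch below, in Python, takes three rows from the table and computes the average number of links per linked entry and the share of all entries that carry a link. It assumes the "Entries" column counts entries with at least one cross-reference to that database; the names `cross_refs` and `summarize` are illustrative only.

```python
# Rows copied from the cross-reference table above:
# database -> (entities linked to, entries).
# Assumption: "entries" means entries with at least one link to the database.
TOTAL_ENTRIES = 90_860_905  # total number of entries reported in the Introduction

cross_refs = {
    "GO": (162_711_977, 56_700_010),
    "InterPro": (212_118_982, 72_064_350),
    "Pfam": (92_182_251, 67_177_483),
}


def summarize(links, entries, total=TOTAL_ENTRIES):
    """Return (average links per linked entry, percent of all entries linked)."""
    return links / entries, 100 * entries / total


summary = {db: summarize(*counts) for db, counts in cross_refs.items()}
for db, (per_entry, coverage) in summary.items():
    print(f"{db}: {per_entry:.2f} links per linked entry, {coverage:.1f}% of entries")
```

The same function can be applied to any other row, since every row has the same two columns.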

Web resource

0 UniProtKB/TrEMBL entries have at least one link to a webpage of general interest on the protein.

Amino acid distribution statistics

  • 8.9% Alanine
  • 5.3% Arginine
  • 4.1% Asparagine
  • 5.4% Aspartate
  • 1.1% Cysteine
  • 3.9% Glutamine
  • 6.0% Glutamate
  • 7.2% Glycine
  • 2.2% Histidine
  • 6.1% Isoleucine
  • 9.9% Leucine
  • 5.1% Lysine
  • 2.4% Methionine
  • 3.9% Phenylalanine
  • 4.5% Proline
  • 6.3% Serine
  • 5.5% Threonine
  • 1.2% Tryptophan
  • 3.0% Tyrosine
  • 6.9% Valine

Chart legend (residue classes): aliphatic, acidic, small hydroxy, basic, amide, aromatic, sulfur.

Miscellaneous Statistics

846,387 entries are encoded on a mitochondrion, and 447,488 are encoded on a plasmid.

352,805 entries are encoded on a plastid, of which 772 are encoded on apicoplasts, 308,162 on chloroplasts, 1 on organellar chromatophores, 48 on cyanelles, 1,608 on non-photosynthetic plastids and 2,565 on unspecified types of plastid.