Introduction

Number of entries
New entries 3,013,093
Updated entries 14,439,709
Unchanged entries 46,233,255
Total 63,686,057
Entries with updated sequences 1,348
With a fragmented AA sequence 7,480,210
With known alternative products 0
Protein Existence (PE) Number of entries
1 Evidence at protein level 128,165
2 Evidence at transcript level 1,012,933
3 Inferred from homology 13,043,907
4 Predicted 49,501,052
5 Uncertain 0
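The protein existence counts above can be turned into shares of the whole release with a few lines of arithmetic. The following is a minimal sketch in Python; the counts are copied from the table, while the names `pe_counts` and `pe_shares` are illustrative and not part of any UniProt tool.

```python
# Entry counts by protein existence (PE) level, copied from the table above.
pe_counts = {
    "Evidence at protein level": 128_165,
    "Evidence at transcript level": 1_012_933,
    "Inferred from homology": 13_043_907,
    "Predicted": 49_501_052,
    "Uncertain": 0,
}


def pe_shares(counts):
    """Return each PE level's share of all entries, in percent."""
    total = sum(counts.values())
    return {level: 100 * n / total for level, n in counts.items()}


total = sum(pe_counts.values())
shares = pe_shares(pe_counts)

# Print one line per PE level with its percentage of all entries.
for level, share in shares.items():
    print(f"{level:30s} {share:6.2f}%")
```

The five PE counts add up to the 63,686,057 total reported under "Number of entries", so the shares cover the full release.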

Taxonomic Origin


Statistics on the number of species

Number of species in
New entries 6,390
Updated entries 144,628
Unchanged entries 422,801
Total 470,721

Sequence data

The shortest sequence is C4PYW0 (2 AA); the longest is A0A0Q3ZJN0 (37,363 AA).

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 0 0
Alternative products 0 0
Biophysicochemical properties 0 0
Biotechnological use 0 0
Catalytic activity 6,493,888 6,012,714
Caution 31,380,022 31,340,966
Cofactor 4,686,499 2,884,590
Developmental stage 0 0
Involvement in disease 0 0
Disruption phenotype 0 0
Domain 360,214 343,554
Enzyme regulation 105,787 105,787
Function 7,437,771 7,008,141
Induction 30,693 30,693
Mass spectrometry 0 0
Miscellaneous 210,870 210,870
Pathway 3,199,549 2,900,185
Pharmaceutical use 0 0
Polymorphism 0 0
Post-translational modification 269,709 244,958
RNA Editing 0 0
Sequence caution 0 0
Sequence similarities 20,034,223 16,914,025
Subcellular Location 0 0
Subunit structure 3,880,395 3,829,545
Tissue specificity 0 0
Toxic dose 0 0

Sequence Annotation (features)

Annotations Entries
Molecule processing 8,888,754 4,446,963
Chain 4,460,439 4,433,443
Initiator methionine 13,011 13,011
Peptide 56 56
Propeptide 6,009 6,009
Signal peptide 4,408,653 4,407,247
Transit peptide 586 586
Regions 107,493,222 39,248,763
Calcium binding 480 398
Coiled-coil 4,279,684 2,890,350
Compositional bias 12,747 12,592
DNA binding 58,449 56,490
Domain 43,238,507 30,892,622
Motif 291,968 190,884
Nucleotide binding 1,925,780 1,126,744
Repeat 102,186 25,928
Region 1,673,371 883,897
Topological domain 170,415 47,492
Transmembrane 55,640,425 12,312,481
Zinc finger 98,965 83,185
Sites 13,476,012 3,002,737
Active site 2,620,269 1,598,003
Metal binding 4,883,216 1,308,775
Binding site 5,304,557 1,415,667
Other 667,970 362,298
Amino acid modifications 602,358 512,843
Cross-link 13,855 9,806
Disulfide bond 118,984 82,649
Glycosylation 1,587 619
Lipidation 53,134 26,860
Modified residue 412,601 397,059
Non-standard residue 2,197 2,030
Experimental info 11,642,309 7,497,693
Mutagenesis 0 0
Non-adjacent residues 0 0
Non-terminal residue 11,594,763 7,488,503
Sequence conflict 0 0
Sequence uncertainty 47,546 40,171

Citation usage

Citation type Citations Entries
Submission 49,089,053 41,937,828
Journal article 27,946,207 26,111,684
Book 15,375 15,310
Thesis 11,653 11,594
Patent 1 1
Unpublished observations 0 0
Online journal article 0 0

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 629,359 443,309

For information about which journals are cited in or mapped to UniProtKB, see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
EMBL 76,072,999 62,102,062
PIR 162,438 130,221
RefSeq 32,251,033 31,478,762
UniGene 618,123 564,485
3D structure databases
PDB 29,450 15,227
PDBsum 28,952 14,881
ProteinModelPortal 7,460,884 7,460,589
SMR 903,068 903,068
Protein-protein interaction databases
DIP 3,236 3,231
IntAct 18,067 18,067
MINT 9,959 9,958
STRING 7,374,665 7,370,294
Chemistry
BindingDB 25,515 25,515
ChEMBL 782 782
DrugBank 161 71
GuidetoPHARMACOLOGY 4 4
SwissLipids 63 63
Protein family/group databases
Allergome 3,862 3,152
CAZy 68,187 64,096
ESTHER 54,716 54,613
MEROPS 189,547 189,547
MoonProt 5 5
PeroxiBase 2,474 2,466
REBASE 33,469 33,450
TCDB 7,000 6,990
mycoCLAP 451 451
PTM databases
PhosphoSite 1,070 1,070
SwissPalm 1,029 1,029
UniCarbKB 17 17
iPTMnet 4,389 4,389
Polymorphism and mutation databases
2D gel databases
COMPLUYEAST-2DPAGE 4 4
OGP 3 3
REPRODUCTION-2DPAGE 65 64
SWISS-2DPAGE 1 1
World-2DPAGE 322 317
Proteomic databases
EPD 29,382 29,382
MaxQB 6,887 6,887
PRIDE 264,475 264,459
PaxDb 678,505 678,236
PeptideAtlas 126 126
ProMEX 2,586 2,586
TopDownProteomics 240 240
Protocols and materials databases
DNASU 39,904 39,582
Genome annotation databases
Ensembl 1,201,605 1,180,833
EnsemblBacteria 28,536,542 25,307,845
EnsemblFungi 4,833,022 4,720,101
EnsemblMetazoa 940,103 919,466
EnsemblPlants 1,495,604 1,430,093
EnsemblProtists 1,580,749 1,491,439
GeneDB 56,184 55,319
GeneID 7,002,046 6,914,069
Gramene 1,495,370 1,429,350
KEGG 11,866,580 11,455,734
PATRIC 5,630,648 5,630,540
UCSC 95,791 95,601
VectorBase 78,236 77,719
WBParaSite 335,984 334,941
Organism-specific databases
ArachnoServer 204 204
CGD 26,957 23,752
CTD 696,501 694,795
ConoServer 159 159
EuPathDB 394,571 394,511
FlyBase 181,994 180,549
H-InvDB 591 444
HGNC 49,261 49,158
LegioList 2,496 2,483
Leproma 1,271 1,269
MGI 57,421 56,963
MIM 4 4
MalaCards 12 12
PharmGKB 3,173 3,173
PseudoCAP 4,476 4,470
RGD 24,701 23,415
SGD 7 7
TAIR 19,649 19,532
TubercuList 1,031 1,030
WormBase 56,047 55,879
Xenbase 25,626 25,562
ZFIN 51,499 51,000
dictyBase 7,992 7,770
euHCVdb 75,267 75,264
Phylogenomic databases
GeneTree 1,179,261 1,179,161
HOGENOM 3,078,423 3,078,373
HOVERGEN 301,631 301,620
InParanoid 2,581,456 2,581,414
KO 4,962,395 4,940,994
OMA 6,266,396 6,266,379
OrthoDB 4,642,720 4,642,713
PhylomeDB 506,569 506,569
TreeFam 581,796 581,786
eggNOG 14,667,537 7,351,719
Enzyme and pathway databases
BRENDA 9,701 9,409
BioCyc 4,464,130 4,399,633
Reactome 187,110 69,023
SABIO-RK 562 562
SignaLink 3,940 3,940
UniPathway 3,194,489 2,895,125
Other
ChiTaRS 86,611 86,451
EvolutionaryTrace 6,100 6,100
GenomeRNAi 30,644 30,644
NextBio 195,586 195,583
PMAP-CutDB 135 135
PRO 2,402 2,402
Gene expression databases
Bgee 93,954 93,821
CollecTF 202 202
ExpressionAtlas 214,966 214,965
Genevisible 16,566 16,566
Ontologies
Family and domain databases
Gene3D 37,555,384 29,555,729
HAMAP 5,932,941 5,854,647
InterPro 139,364,398 48,308,081
PANTHER 9,100,476 8,806,617
PIRSF 5,062,663 5,017,269
PRINTS 8,620,375 7,737,359
PROSITE 31,636,111 20,785,860
Pfam 61,270,931 44,602,559
ProDom 1,007,407 957,569
SMART 14,273,357 10,847,208
SUPFAM 39,046,159 31,031,190
TIGRFAMs 12,397,721 11,371,794
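
The two columns of the cross-reference table can be combined into an average number of links per linked entry, which shows where a single entry tends to carry several cross-references to the same database. The sketch below is illustrative only: it uses a handful of rows copied from the table, and the helper name `links_per_entry` is an assumption, not part of any UniProt tool.

```python
# (entities linked to, entries) for a few databases, copied from the table above.
cross_refs = {
    "InterPro": (139_364_398, 48_308_081),
    "Pfam": (61_270_931, 44_602_559),
    "eggNOG": (14_667_537, 7_351_719),
    "Reactome": (187_110, 69_023),
    "SMR": (903_068, 903_068),
}


def links_per_entry(entities, entries):
    """Average number of cross-references per entry that has at least one."""
    if entries == 0:
        raise ValueError("no entries are linked to this database")
    return entities / entries


ratios = {
    name: links_per_entry(entities, entries)
    for name, (entities, entries) in cross_refs.items()
}

# List databases from the most to the fewest links per linked entry.
for name, ratio in sorted(ratios.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:10s} {ratio:5.2f}")
```

A ratio of exactly 1 (as for SMR) means every linked entry has a single cross-reference; higher values indicate multiple links per entry.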

Web resource

No UniProtKB/TrEMBL entries link to a web page of general interest on the protein.

Amino acid distribution statistics

  • 8.9% Alanine
  • 5.6% Arginine
  • 3.9% Asparagine
  • 5.4% Aspartate
  • 1.2% Cysteine
  • 3.8% Glutamine
  • 6.1% Glutamate
  • 7.1% Glycine
  • 2.2% Histidine
  • 5.7% Isoleucine
  • 9.8% Leucine
  • 5.1% Lysine
  • 2.3% Methionine
  • 3.9% Phenylalanine
  • 4.8% Proline
  • 6.8% Serine
  • 5.5% Threonine
  • 1.3% Tryptophan
  • 2.9% Tyrosine
  • 6.8% Valine

Chart legend (residue classes): Aliphatic, Acidic, Small hydroxy, Basic, Amide, Aromatic, Sulfur
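
A distribution like the one above can be obtained by counting residues over a set of sequences and dividing by the total number of residues. The page does not state exactly how its figures were computed, so the following is only a sketch under that assumption; the function name `aa_composition` and the toy sequences are hypothetical and not taken from UniProtKB.

```python
from collections import Counter


def aa_composition(sequences):
    """Percentage of each one-letter residue code across all sequences."""
    counts = Counter()
    for seq in sequences:
        # Count every residue; upper-case so 'a' and 'A' are treated alike.
        counts.update(seq.upper())
    total = sum(counts.values())
    if total == 0:
        return {}
    return {aa: 100 * n / total for aa, n in counts.items()}


# Hypothetical toy sequences, used only to illustrate the calculation.
toy_sequences = ["MALLA", "MKLAV"]
composition = aa_composition(toy_sequences)

# Print residues from most to least frequent.
for aa, pct in sorted(composition.items(), key=lambda item: item[1], reverse=True):
    print(f"{aa} {pct:.1f}%")
```

Pooling residues across all sequences weights long proteins more heavily than averaging per-sequence percentages would; which of the two the page uses is not specified.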

Miscellaneous Statistics

1,149,921 entries are encoded on a mitochondrion, and 486,088 are encoded on a plasmid.

456,564 entries are encoded on a plastid, of which 794 are encoded on apicoplasts, 393,249 on chloroplasts, 0 on organellar chromatophores, 10 on cyanelles, 1,604 on non-photosynthetic plastids and 3,168 on unspecified types of plastid.