Introduction

Number of entries
New entries 197
Updated entries 114,616
Unchanged entries 436,572
Total 551,385
Entries with updated sequences 9
With a fragmented AA sequence 9,156
With known alternative products 24,548
Protein Existence (PE) Number of entries
1 Evidence at protein level 92,717
2 Evidence at transcript level 57,784
3 Inferred from homology 387,581
4 Predicted 11,351
5 Uncertain 1,952
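
The two tables above can be cross-checked with simple arithmetic: the release-status rows and the protein-existence rows should each add up to the stated total of 551,385. The sketch below does that check and also expresses each protein-existence level as a share of the total. The variable and function names are illustrative and do not come from any UniProt tool.

```python
# Counts copied from the tables above; all names here are illustrative only.
TOTAL = 551_385

status_counts = {"New": 197, "Updated": 114_616, "Unchanged": 436_572}

pe_counts = {
    "1 Evidence at protein level": 92_717,
    "2 Evidence at transcript level": 57_784,
    "3 Inferred from homology": 387_581,
    "4 Predicted": 11_351,
    "5 Uncertain": 1_952,
}


def shares(counts, total):
    """Return each count as a percentage of total, rounded to one decimal."""
    return {label: round(100 * n / total, 1) for label, n in counts.items()}


if __name__ == "__main__":
    # Both breakdowns partition the same set of entries.
    assert sum(status_counts.values()) == TOTAL
    assert sum(pe_counts.values()) == TOTAL
    for label, pct in shares(pe_counts, TOTAL).items():
        print(f"{label}: {pct}%")
```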

Taxonomic Origin

Swiss-Prot entries per taxonomic group


Statistics on the number of species

Number of species in
New entries 35
Updated entries 1,928
Unchanged entries 10,222
Total 10,401

Sequence data

The shortest sequence is P83570 at 2 AA, while the longest is A2ASS6 at 35,213 AA.

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 707 707
Alternative products 24,548 24,548
Biophysicochemical properties 7,040 7,040
Biotechnological use 594 592
Catalytic activity 257,794 231,420
Caution 31,765 57,344
Cofactor 211,137 551,385
Developmental stage 10,960 10,960
Involvement in disease 6,214 4,157
Disruption phenotype 10,285 10,285
Domain 44,247 38,318
Enzyme regulation 13,484 13,483
Function 446,828 428,338
Induction 18,244 18,243
Mass spectrometry 6,103 4,605
Miscellaneous 35,147 32,322
Pathway 135,508 122,771
Pharmaceutical use 99 99
Polymorphism 1,043 987
Post-translational modification 50,989 38,838
RNA Editing 627 627
Sequence caution 59,595 43,186
Sequence similarities 668,122 526,477
Subcellular Location 643,613 551,385
Subunit structure 264,806 264,669
Tissue specificity 42,771 42,771
Toxic dose 623 577

Sequence Annotation (features)

Annotations Entries
Molecule processing 650,996 551,385
Chain 558,965 544,953
Initiator methionine 18,406 18,365
Peptide 10,740 7,294
Propeptide 13,461 11,543
Signal peptide 40,330 40,320
Transit peptide 9,094 8,981
Regions 1,266,701 305,246
Calcium binding 3,987 1,677
Coiled-coil 21,457 14,796
Compositional bias 57,579 30,829
DNA binding 11,224 10,198
Domain 179,455 108,862
Motif 40,097 25,913
Nucleotide binding 139,586 79,964
Repeat 100,646 14,344
Region 178,526 84,992
Topological domain 136,065 28,033
Transmembrane 365,575 75,826
Zinc finger 30,074 13,347
Sites 922,141 198,782
Active site 157,245 96,067
Metal binding 357,263 88,654
Binding site 355,107 94,129
Other 52,526 29,304
Amino acid modifications 471,481 111,012
Cross-link 11,167 5,588
Disulfide bond 118,117 32,237
Glycosylation 112,101 28,746
Lipidation 12,596 8,089
Modified residue 217,141 68,485
Non-standard residue 359 284
Natural variations 143,744 30,867
Natural variant 143,744 30,867
Alternative sequence 51,037 21,472
Experimental info 226,681 63,704
Mutagenesis 57,391 12,979
Non-adjacent residues 2,238 777
Non-terminal residue 12,290 9,404
Sequence conflict 150,489 46,431
Sequence uncertainty 4,273 755
Secondary structure 504,033 21,549
Helix 220,810 20,762
Turn 53,124 16,827
Beta strand 230,099 19,577
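
Each row of the feature table gives a number of annotations and a number of entries. Assuming the second column counts entries that carry at least one annotation of that type, dividing the first by the second gives the average number of annotations per annotated entry. The sketch below does this for a few rows; the selection of rows and all names are illustrative.

```python
# (annotations, entries) pairs copied from the feature table above.
# Assumption: "entries" means entries with at least one such annotation.
FEATURES = {
    "Signal peptide": (40_330, 40_320),
    "Transmembrane": (365_575, 75_826),
    "Disulfide bond": (118_117, 32_237),
    "Helix": (220_810, 20_762),
}


def mean_per_annotated_entry(annotations: int, entries: int) -> float:
    """Average number of annotations among entries that have any."""
    return annotations / entries


if __name__ == "__main__":
    for name, (annotations, entries) in FEATURES.items():
        mean = mean_per_annotated_entry(annotations, entries)
        print(f"{name}: {mean:.2f} per annotated entry")
```

A ratio close to 1 (as for signal peptides) means almost every annotated entry has exactly one such feature, while larger ratios indicate features that typically occur several times per sequence.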

Citation usage

Citation type Citations Entries
Submission 192,699 167,833
Journal article 950,456 440,553
Book 1,492 1,478
Thesis 428 425
Patent 197 193
Unpublished observations 386 382
Online journal article 610 597

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 741,206 1,046,823

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
CCDS 46,595 33,600
EMBL 941,629 540,038
PIR 123,019 112,721
RefSeq 599,575 462,342
UniGene 106,330 94,449
3D structure databases
DisProt 605 602
PDB 129,685 23,482
PDBsum 129,685 23,482
ProteinModelPortal 443,936 443,936
SMR 226,015 226,015
Protein-protein interaction databases
BioGrid 48,283 47,827
DIP 16,829 16,772
IntAct 46,065 46,065
MINT 31,711 31,711
STRING 326,041 326,041
Chemistry
BindingDB 4,610 4,610
ChEMBL 6,391 6,391
DrugBank 11,784 1,909
GuidetoPHARMACOLOGY 1,889 1,889
SwissLipids 1,004 926
Protein family/group databases
Allergome 1,696 1,111
CAZy 7,889 7,098
ESTHER 2,438 2,436
MEROPS 12,941 12,941
MoonProt 63 63
PeroxiBase 771 755
REBASE 405 405
TCDB 6,017 5,986
mycoCLAP 348 344
PTM databases
DEPOD 239 239
PhosphoSite 33,546 33,546
SwissPalm 5,944 5,944
UniCarbKB 584 584
iPTMnet 45,878 45,878
Polymorphism and mutation databases
BioMuta 17,246 17,242
DMDM 16,376 16,373
dbSNP 38,630 11,717
2D gel databases
COMPLUYEAST-2DPAGE 99 98
DOSAC-COBS-2DPAGE 146 146
OGP 375 375
REPRODUCTION-2DPAGE 1,259 1,038
SWISS-2DPAGE 1,180 1,180
UCD-2DPAGE 508 499
World-2DPAGE 925 914
Proteomic databases
EPD 23,382 23,382
MaxQB 32,301 32,301
PRIDE 123,931 123,931
PaxDb 110,748 110,619
PeptideAtlas 5,159 5,159
ProMEX 436 436
TopDownProteomics 3,134 2,858
Protocols and materials databases
DNASU 18,876 18,805
Genome annotation databases
Ensembl 84,506 48,463
EnsemblBacteria 354,766 335,757
EnsemblFungi 30,234 27,797
EnsemblMetazoa 13,145 9,822
EnsemblPlants 22,068 18,793
EnsemblProtists 4,881 4,718
GeneDB 389 350
GeneID 273,620 264,846
Gramene 18,232 15,843
KEGG 500,150 466,416
PATRIC 308,217 308,182
UCSC 48,730 44,719
VectorBase 618 600
WBParaSite 29 29
Organism-specific databases
ArachnoServer 1,145 1,135
CGD 1,708 1,692
CTD 73,510 72,757
ConoServer 949 866
EchoBASE 4,161 4,161
EcoGene 4,294 4,292
EuPathDB 18,009 18,008
FlyBase 5,995 5,633
GeneCards 20,021 19,847
GeneReviews 1,156 1,153
H-InvDB 5,589 4,768
HGNC 20,010 19,862
HPA 24,700 16,208
LegioList 765 763
Leproma 672 669
MGI 16,709 16,665
MIM 19,772 14,503
MaizeGDB 506 501
MalaCards 3,773 3,771
Orphanet 6,148 3,289
PharmGKB 18,379 18,338
PomBase 5,140 5,121
PseudoCAP 1,308 1,299
RGD 7,889 7,886
SGD 6,739 6,734
TAIR 14,681 14,625
TubercuList 2,123 2,087
WormBase 5,515 4,255
Xenbase 4,450 4,444
ZFIN 2,803 2,803
dictyBase 4,207 4,092
euHCVdb 55 44
neXtProt 20,041 20,040
Phylogenomic databases
GeneTree 55,449 55,406
HOGENOM 388,758 388,758
HOVERGEN 75,777 75,777
InParanoid 135,979 135,979
KO 390,035 389,593
OMA 407,319 407,319
OrthoDB 391,069 391,069
PhylomeDB 94,782 94,782
TreeFam 44,962 44,956
eggNOG 657,125 327,981
Enzyme and pathway databases
BRENDA 12,759 11,989
BioCyc 325,557 308,260
Reactome 101,116 30,603
SABIO-RK 3,329 3,329
SIGNOR 2,940 2,940
SignaLink 3,000 3,000
UniPathway 135,206 122,480
Other
ChiTaRS 16,471 16,461
EvolutionaryTrace 16,552 16,550
GeneWiki 10,368 10,282
GenomeRNAi 21,790 21,790
PMAP-CutDB 1,461 1,461
PRO 90,090 90,090
Gene expression databases
Bgee 38,881 38,881
CleanEx 30,043 29,403
CollecTF 133 133
ExpressionAtlas 33,111 33,111
Genevisible 55,140 55,140
Ontologies
Family and domain databases
Gene3D 472,178 348,023
HAMAP 325,973 322,901
InterPro 1,938,764 531,681
PANTHER 172,482 165,641
PIRSF 104,402 103,364
PRINTS 133,775 117,996
PROSITE 452,975 291,227
Pfam 744,774 510,015
ProDom 29,139 28,958
SMART 189,217 139,696
SUPFAM 479,693 363,776
TIGRFAMs 292,183 272,032

Web resource

6,870 UniProtKB/Swiss-Prot entries have at least one link to a webpage of general interest on the protein.

Amino acid distribution statistics

  • 8.2% Alanine
  • 5.5% Arginine
  • 4.0% Asparagine
  • 5.4% Aspartate
  • 1.3% Cysteine
  • 3.9% Glutamine
  • 6.7% Glutamate
  • 7.0% Glycine
  • 2.2% Histidine
  • 5.9% Isoleucine
  • 9.6% Leucine
  • 5.8% Lysine
  • 2.4% Methionine
  • 3.8% Phenylalanine
  • 4.7% Proline
  • 6.5% Serine
  • 5.3% Threonine
  • 1.0% Tryptophan
  • 2.9% Tyrosine
  • 6.8% Valine

Chart legend (residue classes): Aliphatic, Acidic, Small hydroxy, Basic, Amide, Aromatic, Sulfur
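
The percentages listed above can be loaded into a small lookup table to rank residues or to check how close the rounded values come to 100%. This is a minimal sketch using only the figures on this page; the dictionary and function names are illustrative.

```python
# Amino acid frequencies (percent) copied from the list above.
# Values are rounded to one decimal, so they need not sum to exactly 100.
AA_PERCENT = {
    "Alanine": 8.2, "Arginine": 5.5, "Asparagine": 4.0, "Aspartate": 5.4,
    "Cysteine": 1.3, "Glutamine": 3.9, "Glutamate": 6.7, "Glycine": 7.0,
    "Histidine": 2.2, "Isoleucine": 5.9, "Leucine": 9.6, "Lysine": 5.8,
    "Methionine": 2.4, "Phenylalanine": 3.8, "Proline": 4.7, "Serine": 6.5,
    "Threonine": 5.3, "Tryptophan": 1.0, "Tyrosine": 2.9, "Valine": 6.8,
}


def ranked(freqs):
    """Return (name, percent) pairs sorted from most to least frequent."""
    return sorted(freqs.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    order = ranked(AA_PERCENT)
    print("Most frequent: ", order[0])
    print("Least frequent:", order[-1])
    print("Sum of rounded percentages:", round(sum(AA_PERCENT.values()), 1))
```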

Miscellaneous Statistics

15,963 entries are encoded on a mitochondrion, and 3,777 are encoded on a plasmid.

12,188 entries are encoded on a plastid, of which 21 are encoded on apicoplasts, 11,623 on chloroplasts, 51 on organellar chromatophores, 145 on cyanelles, 149 on non-photosynthetic plastids and 16 on unspecified types of plastid.