Introduction

Number of entries
New entries 210
Updated entries 322,773
Unchanged entries 226,232
Total 549,215
Entries with updated sequences 56
With a fragmented AA sequence 9,149
With known alternative products 24,322
Protein Existence (PE) Number of entries
1 Evidence at protein level 90,456
2 Evidence at transcript level 57,714
3 Inferred from homology 387,606
4 Predicted 11,484
5 Uncertain 1,955
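
The counts above can be cross-checked with simple arithmetic: the new, updated and unchanged entries should add up to the total, and so should the five protein existence levels. A minimal Python sketch, with the figures copied from the tables above and all variable names chosen only for illustration:

```python
# Figures copied from the tables above; the names are illustrative only.
status_counts = {"new": 210, "updated": 322_773, "unchanged": 226_232}
pe_counts = {
    "1 Evidence at protein level": 90_456,
    "2 Evidence at transcript level": 57_714,
    "3 Inferred from homology": 387_606,
    "4 Predicted": 11_484,
    "5 Uncertain": 1_955,
}
total = 549_215

# Both breakdowns should partition the same set of entries.
status_sum = sum(status_counts.values())
pe_sum = sum(pe_counts.values())
print("status breakdown matches total:", status_sum == total)
print("PE breakdown matches total:", pe_sum == total)

# Share of entries at each protein existence level, in percent.
pe_share = {level: round(100 * n / total, 1) for level, n in pe_counts.items()}
for level, pct in pe_share.items():
    print(f"{level}: {pct}%")
```

Both sums come out to 549,215, so the two breakdowns are consistent with the stated total.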

Taxonomic Origin

Swiss-Prot entries per taxonomic group


Statistics on the number of species

Number of species in
New entries 43
Updated entries 7,642
Unchanged entries 6,755
Total 10,350

Sequence data

The shortest sequence is P83570 at 2 AA, while the longest sequence is A2ASS6 at 35,213 AA.

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 666 666
Alternative products 24,322 24,322
Biophysicochemical properties 6,642 6,642
Biotechnological use 452 450
Catalytic activity 254,130 228,940
Caution 31,499 58,811
Cofactor 209,200 120,469
Developmental stage 10,660 10,660
Involvement in disease 6,064 4,076
Disruption phenotype 9,095 9,095
Domain 43,313 37,537
Enzyme regulation 13,218 13,218
Function 442,241 424,163
Induction 17,532 17,532
Mass spectrometry 5,953 4,511
Miscellaneous 34,679 31,879
Pathway 134,849 122,221
Pharmaceutical use 98 98
Polymorphism 1,048 991
Post-translational modification 49,128 37,504
RNA Editing 627 627
Sequence caution 58,736 42,644
Sequence similarities 663,334 524,326
Subcellular Location 631,294 332
Subunit structure 261,190 261,190
Tissue specificity 41,799 41,799
Toxic dose 616 570

Sequence Annotation (features)

Annotations Entries
Molecule processing 647,139 549,215
Chain 556,678 542,838
Initiator methionine 17,868 17,836
Peptide 10,602 7,227
Propeptide 13,178 11,353
Signal peptide 39,837 39,827
Transit peptide 8,976 8,863
Regions 1,246,891 300,393
Calcium binding 3,985 1,678
Coiled-coil 21,231 14,654
Compositional bias 56,899 30,412
DNA binding 11,146 10,135
Domain 176,424 106,985
Motif 38,904 25,166
Nucleotide binding 136,243 79,348
Repeat 99,849 14,415
Region 171,892 81,722
Topological domain 134,624 27,676
Transmembrane 363,434 75,331
Zinc finger 29,860 13,205
Sites 898,759 195,203
Active site 153,540 94,062
Metal binding 348,680 86,432
Binding site 345,440 91,800
Other 51,099 28,314
Amino acid modifications 444,820 108,111
Cross-link 7,645 4,195
Disulfide bond 116,363 31,786
Glycosylation 110,642 28,349
Lipidation 12,369 7,953
Modified residue 197,443 65,711
Non-standard residue 358 283
Natural variations 141,227 30,670
Natural variant 141,227 30,670
Alternative sequence 50,670 21,294
Experimental info 221,933 62,892
Mutagenesis 53,671 12,253
Non-adjacent residues 2,223 774
Non-terminal residue 12,287 9,398
Sequence conflict 149,550 46,120
Sequence uncertainty 4,202 752
Secondary structure 480,730 20,679
Helix 210,345 19,903
Turn 50,801 16,166
Beta strand 219,584 18,785
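
The two columns in the feature table can be combined into a rough density figure. The sketch below assumes that the Entries column counts entries carrying at least one feature of the given type, so that dividing Annotations by Entries gives the mean number of features per annotated entry. The function name and the selection of rows are illustrative only.

```python
# (annotations, entries) pairs copied from the feature table above.
features = {
    "Transmembrane": (363_434, 75_331),
    "Repeat": (99_849, 14_415),
    "Helix": (210_345, 19_903),
    "Signal peptide": (39_837, 39_827),
}


def mean_per_entry(annotations: int, entries: int) -> float:
    """Average number of annotations among entries that have at least one."""
    return annotations / entries


for name, (annotations, entries) in features.items():
    ratio = mean_per_entry(annotations, entries)
    print(f"{name}: {ratio:.2f} per annotated entry")
```

A ratio close to 1 (as for signal peptides) indicates a feature that occurs about once per annotated entry, while larger ratios (as for transmembrane regions or helices) indicate features that typically occur several times in the same entry.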

Citation usage

Citation type Citations Entries
Submission 192,026 167,548
Journal article 925,937 437,614
Book 1,485 1,471
Thesis 426 423
Patent 192 189
Unpublished observations 372 368
Online journal article 610 597

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 687,407 1,027,425

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
CCDS 46,124 33,465
EMBL 930,930 537,934
PIR 122,457 112,191
RefSeq 610,530 451,076
UniGene 104,204 93,046
3D structure databases
DisProt 605 602
PDB 119,227 22,382
PDBsum 119,227 22,382
ProteinModelPortal 354,392 354,392
SMR 73,628 73,628
Protein-protein interaction databases
BioGrid 42,532 42,160
DIP 16,272 16,214
IntAct 44,467 44,467
MINT 31,655 31,655
STRING 324,757 324,757
Chemistry
BindingDB 5,659 5,659
ChEMBL 6,161 6,161
DrugBank 11,317 1,795
GuidetoPHARMACOLOGY 2,121 2,121
Protein family/group databases
Allergome 1,653 1,075
CAZy 7,818 7,035
ESTHER 2,415 2,413
MEROPS 12,891 12,891
MoonProt 63 63
PeroxiBase 771 755
REBASE 388 388
TCDB 5,401 5,380
mycoCLAP 346 342
PTM databases
DEPOD 239 239
PhosphoSite 33,550 33,550
UniCarbKB 272 272
Polymorphism and mutation databases
BioMuta 17,254 17,253
DMDM 16,392 16,391
dbSNP 38,227 11,679
2D gel databases
COMPLUYEAST-2DPAGE 99 98
DOSAC-COBS-2DPAGE 148 146
OGP 375 375
REPRODUCTION-2DPAGE 1,259 1,038
SWISS-2DPAGE 1,181 1,180
UCD-2DPAGE 508 499
World-2DPAGE 923 912
Proteomic databases
MaxQB 32,200 32,200
PRIDE 123,618 123,618
PaxDb 66,812 66,811
PeptideAtlas 5,160 5,160
ProMEX 409 409
Protocols and materials databases
DNASU 18,832 18,761
Genome annotation databases
Ensembl 78,698 44,621
EnsemblBacteria 354,214 335,177
EnsemblFungi 29,664 27,406
EnsemblMetazoa 12,996 9,670
EnsemblPlants 21,227 18,079
EnsemblProtists 9,189 4,900
GeneID 279,351 269,195
KEGG 472,651 446,791
PATRIC 307,996 307,961
UCSC 60,268 44,965
VectorBase 615 597
Organism-specific databases
ArachnoServer 1,120 1,110
CGD 967 936
CTD 72,808 72,114
CYGD 5,596 5,593
ConoServer 949 866
EchoBASE 4,161 4,161
EcoGene 4,294 4,292
EuPathDB 16,762 16,762
FlyBase 5,946 5,584
GeneCards 20,044 19,863
GeneFarm 3,373 3,361
GeneReviews 1,156 1,153
GenoList 7,075 7,063
Gramene 6,367 6,367
H-InvDB 5,589 4,768
HGNC 20,010 19,856
HPA 24,713 16,220
LegioList 765 763
Leproma 671 668
MGI 16,635 16,591
MIM 19,430 14,304
MaizeGDB 506 501
Orphanet 6,148 3,289
PharmGKB 18,388 18,354
PomBase 5,138 5,119
PseudoCAP 1,301 1,292
RGD 7,850 7,847
SGD 6,737 6,732
TAIR 14,088 14,032
TubercuList 2,104 2,068
WormBase 5,311 4,142
Xenbase 4,772 4,766
ZFIN 2,791 2,791
dictyBase 4,207 4,092
euHCVdb 55 44
neXtProt 20,045 20,044
Phylogenomic databases
GeneTree 51,798 51,777
HOGENOM 387,758 387,758
HOVERGEN 75,711 75,711
InParanoid 135,536 135,536
KO 371,933 371,466
OMA 407,245 407,245
OrthoDB 390,310 390,310
PhylomeDB 94,478 94,478
TreeFam 44,849 44,844
eggNOG 431,929 431,929
Enzyme and pathway databases
BRENDA 12,691 11,924
BioCyc 325,075 307,815
Reactome 93,562 27,676
SABIO-RK 3,075 3,075
SignaLink 3,029 2,990
UniPathway 134,637 122,018
Other
ChiTaRS 16,457 16,447
EvolutionaryTrace 16,523 16,522
GeneWiki 10,368 10,282
GenomeRNAi 21,719 21,719
NextBio 71,430 71,430
PMAP-CutDB 1,461 1,461
PRO 89,832 89,832
Gene expression databases
Bgee 38,840 38,840
CleanEx 30,056 29,416
ExpressionAtlas 30,157 30,157
Genevisible 42,502 42,502
Ontologies
GO 2,680,731 520,632
Family and domain databases
Gene3D 462,560 340,544
HAMAP 324,596 321,634
InterPro 1,911,817 527,856
PANTHER 183,269 175,752
PIRSF 104,320 103,283
PRINTS 134,057 118,156
PROSITE 449,512 289,303
Pfam 725,510 504,172
ProDom 29,278 29,099
SMART 170,796 127,932
SUPFAM 442,504 341,230
TIGRFAMs 292,086 271,848

Web resource

6,866 UniProtKB/Swiss-Prot entries have at least one link to a webpage of general interest on the protein.

Amino acid distribution statistics

  • 8.2% Alanine
  • 5.5% Arginine
  • 4.0% Asparagine
  • 5.4% Aspartate
  • 1.3% Cysteine
  • 3.9% Glutamine
  • 6.7% Glutamate
  • 7.0% Glycine
  • 2.2% Histidine
  • 5.9% Isoleucine
  • 9.6% Leucine
  • 5.8% Lysine
  • 2.4% Methionine
  • 3.8% Phenylalanine
  • 4.7% Proline
  • 6.5% Serine
  • 5.3% Threonine
  • 1.0% Tryptophan
  • 2.9% Tyrosine
  • 6.8% Valine

Legend: Aliphatic, Acidic, Small hydroxy, Basic, Amide, Aromatic, Sulfur
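
Percentages like those listed above can be obtained by counting residues over all sequences and dividing by the total number of residues. The following Python sketch shows the idea on two made-up sequences; the function name and the toy data are assumptions for illustration and are not taken from UniProtKB.

```python
from collections import Counter


def composition(sequences):
    """Return {residue: percent} computed over all residues in the sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq.upper())  # count each one-letter residue code
    total = sum(counts.values())
    if total == 0:
        return {}
    return {aa: 100 * n / total for aa, n in counts.items()}


# Two made-up sequences, eight residues in total.
toy_sequences = ["MKLA", "AAGL"]
for aa, pct in sorted(composition(toy_sequences).items(), key=lambda kv: -kv[1]):
    print(f"{pct:.1f}% {aa}")
```

Applied to a real release, the input would be the sequences of all entries, and the one-letter codes would be mapped to residue names for display.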

Miscellaneous Statistics

15,878 entries are encoded on a mitochondrion, and 3,759 are encoded on a plasmid.

12,188 entries are encoded on a plastid, of which 21 are encoded on apicoplasts, 11,623 on chloroplasts, 51 on organellar chromatophores, 145 on cyanelles, 149 on non-photosynthetic plastids and 16 on unspecified types of plastid.