Introduction

Number of entries
New entries 140
Updated entries 401,468
Unchanged entries 147,400
Total 549,008
Entries with updated sequences 12
With a fragmented AA sequence 9,143
With known alternative products 24,306
Protein Existence (PE) Number of entries
1 Evidence at protein level 85,930
2 Evidence at transcript level 61,845
3 Inferred from homology 387,776
4 Predicted 11,502
5 Uncertain 1,955
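
As a quick consistency check, the five protein existence counts above can be summed and expressed as shares of the total. The sketch below uses only the figures from the table; the variable names are illustrative and not part of any UniProt tooling.

```python
# Protein existence (PE) counts copied from the table above.
pe_counts = {
    "1 Evidence at protein level": 85_930,
    "2 Evidence at transcript level": 61_845,
    "3 Inferred from homology": 387_776,
    "4 Predicted": 11_502,
    "5 Uncertain": 1_955,
}

# The sum should match the "Total" row of the entry counts.
total = sum(pe_counts.values())

# Share of each PE level, as a percentage rounded to one decimal place.
pe_shares = {level: round(100 * n / total, 1) for level, n in pe_counts.items()}

for level, share in pe_shares.items():
    print(f"{level}: {share}%")
```

The five counts add up to 549,008, the same value as the Total row.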

Taxonomic Origin

Swiss-Prot entries per taxonomic group


Statistics on the number of species

Number of species in
New entries 39
Updated entries 2,858
Unchanged entries 9,477
Total 10,343

Sequence data

The shortest sequence is P83570 at 2 AA, while the longest is A2ASS6 at 35,213 AA.

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 666 666
Alternative products 24,306 24,306
Biophysicochemical properties 6,592 6,592
Biotechnological use 447 445
Catalytic activity 254,368 229,201
Caution 31,409 58,742
Cofactor 209,095 120,301
Developmental stage 10,637 10,637
Involvement in disease 6,037 4,059
Disruption phenotype 8,949 8,949
Domain 43,213 37,452
Enzyme regulation 13,193 13,193
Function 441,841 423,794
Induction 17,446 17,446
Mass spectrometry 5,937 4,503
Miscellaneous 34,630 31,829
Pathway 134,726 122,100
Pharmaceutical use 98 98
Polymorphism 1,046 990
Post-translational modification 49,015 37,408
RNA Editing 627 627
Sequence caution 58,727 42,651
Sequence similarities 662,741 524,097
Subcellular Location 629,564 332
Subunit structure 260,233 260,233
Tissue specificity 41,702 41,702
Toxic dose 610 564

Sequence Annotation (features)

Annotations Entries
Molecule processing 646,968 549,008
Chain 556,492 542,635
Initiator methionine 18,057 18,025
Peptide 10,581 7,216
Propeptide 13,107 11,291
Signal peptide 39,764 39,754
Transit peptide 8,967 8,854
Regions 1,245,518 299,985
Calcium binding 3,985 1,678
Coiled-coil 21,223 14,648
Compositional bias 56,856 30,379
DNA binding 11,124 10,114
Domain 176,038 106,689
Motif 38,865 25,141
Nucleotide binding 135,490 79,048
Repeat 99,819 14,413
Region 171,784 81,664
Topological domain 134,799 27,686
Transmembrane 363,293 75,296
Zinc finger 29,842 13,187
Sites 896,880 194,897
Active site 153,343 93,902
Metal binding 348,180 86,354
Binding site 344,469 91,484
Other 50,888 28,164
Amino acid modifications 433,405 107,042
Cross-link 7,625 4,189
Disulfide bond 116,144 31,717
Glycosylation 110,625 28,344
Lipidation 12,365 7,951
Modified residue 186,288 64,177
Non-standard residue 358 283
Natural variations 141,151 30,653
Natural variant 141,151 30,653
Alternative sequence 50,652 21,279
Experimental info 221,457 62,803
Mutagenesis 53,262 12,167
Non-adjacent residues 2,218 772
Non-terminal residue 12,279 9,392
Sequence conflict 149,496 46,099
Sequence uncertainty 4,202 752
Secondary structure 479,411 20,623
Helix 209,731 19,849
Turn 50,689 16,123
Beta strand 218,991 18,739
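
Reading the two columns as "total annotations" and "entries carrying at least one such annotation", the ratio of the two gives the mean number of features per annotated entry. The following is a minimal sketch under that reading; the helper name `mean_per_entry` and the choice of rows are illustrative.

```python
def mean_per_entry(annotations: int, entries: int) -> float:
    """Mean number of annotations per entry that has at least one."""
    if entries <= 0:
        raise ValueError("entries must be positive")
    return annotations / entries


# (annotations, entries) pairs copied from the feature table above.
features = {
    "Transmembrane": (363_293, 75_296),
    "Helix": (209_731, 19_849),
    "Signal peptide": (39_764, 39_754),
}

density = {
    name: round(mean_per_entry(annotations, entries), 2)
    for name, (annotations, entries) in features.items()
}

for name, value in density.items():
    print(f"{name}: {value} per annotated entry")
```

A ratio close to 1 (as for signal peptides) means almost every annotated entry has a single feature of that type, while a larger ratio (as for transmembrane regions) means the feature usually occurs several times per entry.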

Citation usage

Citation type Citations Entries
Submission 191,681 167,209
Journal article 912,740 437,283
Book 1,485 1,471
Thesis 426 423
Patent 192 189
Unpublished observations 344 340
Online journal article 609 596

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 681,293 527,706

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
CCDS 46,117 33,458
EMBL 930,343 537,733
PIR 122,445 112,179
RefSeq 610,610 451,113
UniGene 103,501 92,985
3D structure databases
DisProt 605 602
PDB 117,298 22,288
PDBsum 117,298 22,288
ProteinModelPortal 442,641 442,641
SMR 225,104 225,104
Protein-protein interaction databases
BioGrid 41,868 41,503
DIP 16,271 16,213
IntAct 44,445 44,445
MINT 31,653 31,653
STRING 324,682 324,682
Chemistry
BindingDB 5,553 5,553
ChEMBL 6,161 6,161
DrugBank 11,261 1,788
GuidetoPHARMACOLOGY 2,121 2,121
Protein family/group databases
Allergome 1,652 1,074
CAZy 7,817 7,034
ESTHER 2,415 2,413
MEROPS 12,888 12,888
MoonProt 63 63
PeroxiBase 771 755
REBASE 408 408
TCDB 5,401 5,380
mycoCLAP 346 342
PTM databases
DEPOD 239 239
PhosphoSite 33,549 33,549
UniCarbKB 272 272
Polymorphism and mutation databases
BioMuta 17,254 17,253
DMDM 16,392 16,391
dbSNP 38,224 11,677
2D gel databases
COMPLUYEAST-2DPAGE 99 98
DOSAC-COBS-2DPAGE 148 146
OGP 375 375
REPRODUCTION-2DPAGE 1,259 1,038
SWISS-2DPAGE 1,181 1,180
UCD-2DPAGE 508 499
World-2DPAGE 923 912
Proteomic databases
MaxQB 31,467 31,467
PRIDE 123,598 123,598
PaxDb 66,799 66,798
PeptideAtlas 5,160 5,160
ProMEX 407 407
Protocols and materials databases
DNASU 18,831 18,760
Genome annotation databases
Ensembl 81,794 48,221
EnsemblBacteria 353,701 334,780
EnsemblFungi 18,831 18,539
EnsemblMetazoa 12,870 9,571
EnsemblPlants 20,781 17,765
EnsemblProtists 4,458 4,333
GeneID 282,117 272,124
KEGG 471,704 446,903
PATRIC 307,978 307,943
UCSC 60,258 44,954
VectorBase 615 597
Organism-specific databases
ArachnoServer 1,120 1,110
CGD 967 936
CTD 72,860 72,165
CYGD 5,596 5,593
ConoServer 949 866
EchoBASE 4,161 4,161
EcoGene 4,294 4,292
EuPathDB 16,762 16,762
FlyBase 5,956 5,582
GeneCards 20,867 19,779
GeneFarm 3,373 3,361
GeneReviews 1,156 1,153
GenoList 7,075 7,063
Gramene 6,299 6,299
H-InvDB 5,589 4,768
HGNC 20,010 19,855
HPA 24,713 16,220
LegioList 765 763
Leproma 671 668
MGI 16,633 16,589
MIM 19,363 14,260
MaizeGDB 506 501
Orphanet 6,148 3,289
PharmGKB 18,388 18,355
PomBase 5,138 5,119
PseudoCAP 1,301 1,292
RGD 7,850 7,847
SGD 6,737 6,732
TAIR 14,080 14,024
TubercuList 2,104 2,068
WormBase 5,296 4,132
Xenbase 4,771 4,765
ZFIN 2,789 2,789
dictyBase 4,207 4,092
euHCVdb 55 44
neXtProt 20,045 20,045
Phylogenomic databases
GeneTree 56,879 56,860
HOGENOM 387,693 387,693
HOVERGEN 75,705 75,705
InParanoid 135,488 135,488
KO 371,571 371,105
OMA 407,141 407,141
OrthoDB 390,276 390,276
PhylomeDB 94,461 94,461
TreeFam 44,841 44,836
eggNOG 431,851 431,851
Enzyme and pathway databases
BRENDA 12,686 11,919
BioCyc 325,047 307,789
Reactome 94,316 28,147
SABIO-RK 3,075 3,075
SignaLink 3,026 2,987
UniPathway 134,515 121,898
Other
ChiTaRS 16,457 16,447
EvolutionaryTrace 16,520 16,519
GeneWiki 10,368 10,282
GenomeRNAi 21,718 21,718
NextBio 71,405 71,405
PMAP-CutDB 1,461 1,461
PRO 89,339 89,339
Gene expression databases
Bgee 38,839 38,839
CleanEx 30,056 29,416
ExpressionAtlas 32,034 32,034
Genevisible 42,502 42,502
Ontologies
GO 2,678,965 520,478
Family and domain databases
Gene3D 464,413 342,355
HAMAP 324,578 321,616
InterPro 1,921,839 528,066
PANTHER 184,019 176,581
PIRSF 104,309 103,272
PRINTS 135,276 119,317
PROSITE 449,099 289,109
Pfam 737,565 508,740
ProDom 29,278 29,099
SMART 170,648 127,841
SUPFAM 442,601 341,372
TIGRFAMs 292,064 271,827

Web resource

6,861 UniProtKB/Swiss-Prot entries have at least one link to a webpage of general interest on the protein.

Amino acid distribution statistics

  • 8.2% Alanine
  • 5.5% Arginine
  • 4.0% Asparagine
  • 5.4% Aspartate
  • 1.3% Cysteine
  • 3.9% Glutamine
  • 6.7% Glutamate
  • 7.0% Glycine
  • 2.2% Histidine
  • 5.9% Isoleucine
  • 9.6% Leucine
  • 5.8% Lysine
  • 2.4% Methionine
  • 3.8% Phenylalanine
  • 4.7% Proline
  • 6.5% Serine
  • 5.3% Threonine
  • 1.0% Tryptophan
  • 2.9% Tyrosine
  • 6.8% Valine

Chart legend (residue classes): Aliphatic, Acidic, Small hydroxy, Basic, Amide, Aromatic, Sulfur
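
Percentages of this kind can be obtained by counting residues across a set of sequences and dividing by the total number of residues. The sketch below illustrates that idea on two made-up toy sequences. It is not the procedure used to produce the figures above, and the function name `composition` is hypothetical.

```python
from collections import Counter


def composition(sequences):
    """Return the percentage of each one-letter residue code over all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq.upper())  # count every residue letter
    total = sum(counts.values())
    if total == 0:
        return {}
    return {aa: 100 * n / total for aa, n in counts.items()}


# Toy input for illustration only; these are not real Swiss-Prot sequences.
toy_sequences = ["MKLA", "AALG"]
toy_composition = composition(toy_sequences)

# Print residues from most to least frequent.
for aa, pct in sorted(toy_composition.items(), key=lambda item: -item[1]):
    print(f"{aa}: {pct:.1f}%")
```

Because every residue is pooled before dividing, long sequences contribute more to the result than short ones. Averaging per-sequence percentages instead would weight each entry equally.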

Miscellaneous Statistics

15,863 entries are encoded on a mitochondrion, and 3,753 are encoded on a plasmid.

12,188 entries are encoded on a plastid, of which 21 are encoded on apicoplasts, 11,623 on chloroplasts, 51 on organellar chromatophores, 145 on cyanelles, 149 on non-photosynthetic plastids and 16 on unspecified types of plastid.