Introduction

Number of entries
New entries 196
Updated entries 83,117
Unchanged entries 467,427
Total 550,740
Entries with updated sequences 32
With a fragmented AA sequence 9,151
With known alternative products 24,488

Protein Existence (PE) Number of entries
1 Evidence at protein level 92,083
2 Evidence at transcript level 57,694
3 Inferred from homology 387,608
4 Predicted 11,405
5 Uncertain 1,950
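
The Protein Existence counts above can be turned into shares of the release. The following is a minimal sketch that uses only the figures quoted in the two tables; the names `pe_counts`, `total_entries` and `pe_shares` are illustrative and do not come from the source.

```python
# Protein Existence (PE) counts copied from the table above.
pe_counts = {
    "1 Evidence at protein level": 92_083,
    "2 Evidence at transcript level": 57_694,
    "3 Inferred from homology": 387_608,
    "4 Predicted": 11_405,
    "5 Uncertain": 1_950,
}
total_entries = 550_740  # "Total" row of the entry table


def pe_shares(counts, total):
    """Return each PE level's share of all entries, in percent."""
    return {level: 100 * n / total for level, n in counts.items()}


shares = pe_shares(pe_counts, total_entries)

# Every entry carries exactly one PE level in these tables,
# so the five counts should add up to the release total.
counts_match_total = sum(pe_counts.values()) == total_entries

for level, pct in shares.items():
    print(f"{level}: {pct:.1f}%")
```

With these figures the five levels add up exactly to the stated total, and entries inferred from homology make up roughly 70% of the release.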

Taxonomic Origin

Swiss-Prot entries per taxonomic group


Statistics on the number of species

Number of species in
New entries 79
Updated entries 1,980
Unchanged entries 10,164
Total 10,386

Sequence data

The shortest sequence is P83570 (2 AA); the longest is A2ASS6 (35,213 AA).

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 692 692
Alternative products 24,488 24,488
Biophysicochemical properties 6,929 6,929
Biotechnological use 480 478
Catalytic activity 257,284 230,914
Caution 31,703 59,289
Cofactor 210,878 121,567
Developmental stage 10,854 10,854
Involvement in disease 6,165 4,125
Disruption phenotype 9,856 9,856
Domain 44,021 38,118
Enzyme regulation 13,424 13,424
Function 445,858 427,455
Induction 17,969 17,969
Mass spectrometry 6,067 4,574
Miscellaneous 35,128 32,306
Pathway 135,385 122,649
Pharmaceutical use 99 99
Polymorphism 1,045 989
Post-translational modification 50,662 38,684
RNA Editing 627 627
Sequence caution 59,327 43,023
Sequence similarities 666,671 525,816
Subcellular Location 641,113 329
Subunit structure 264,027 264,027
Tissue specificity 42,464 42,464
Toxic dose 622 576

Sequence Annotation (features)

Annotations Entries
Molecule processing 650,122 550,740
  Chain 558,312 544,325
  Initiator methionine 18,382 18,341
  Peptide 10,713 7,277
  Propeptide 13,435 11,519
  Signal peptide 40,213 40,203
  Transit peptide 9,067 8,954
Regions 1,261,787 303,515
  Calcium binding 3,985 1,676
  Coiled-coil 21,377 14,747
  Compositional bias 57,425 30,726
  DNA binding 11,183 10,159
  Domain 178,546 108,114
  Motif 39,939 25,780
  Nucleotide binding 139,298 79,816
  Repeat 100,429 14,313
  Region 176,523 83,893
  Topological domain 135,603 27,915
  Transmembrane 365,109 75,721
  Zinc finger 29,965 13,256
Sites 915,529 198,026
  Active site 156,854 95,863
  Metal binding 355,556 88,383
  Binding site 350,954 93,132
  Other 52,165 29,151
Amino acid modifications 470,308 110,792
  Cross-link 11,133 5,574
  Disulfide bond 117,504 32,117
  Glycosylation 111,817 28,642
  Lipidation 12,574 8,085
  Modified residue 216,922 68,407
  Non-standard residue 358 283
Natural variations 143,402 30,813
  Natural variant 143,402 30,813
  Alternative sequence 50,907 21,420
Experimental info 225,278 63,463
  Mutagenesis 56,290 12,774
  Non-adjacent residues 2,238 776
  Non-terminal residue 12,283 9,399
  Sequence conflict 150,190 46,342
  Sequence uncertainty 4,277 756
Secondary structure 496,193 21,276
  Helix 217,245 20,493
  Turn 52,376 16,602
  Beta strand 226,572 19,318
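
Each feature row gives an annotation count and the number of entries carrying that feature, so two quick checks are possible: whether the sub-rows add up to their category row, and how many annotations an annotated entry carries on average. The sketch below does this for the Secondary structure block only, using the figures from the table; the names `features`, `children` and `per_entry` are illustrative.

```python
# (annotations, entries) pairs copied from the feature table above.
features = {
    "Secondary structure": (496_193, 21_276),
    "Helix": (217_245, 20_493),
    "Turn": (52_376, 16_602),
    "Beta strand": (226_572, 19_318),
}
children = ["Helix", "Turn", "Beta strand"]

# Annotation counts of the sub-rows should add up to the category row.
child_sum = sum(features[name][0] for name in children)
matches_total = child_sum == features["Secondary structure"][0]


def per_entry(name):
    """Average number of annotations per entry that has this feature."""
    annotations, entries = features[name]
    return annotations / entries


print(f"sub-rows sum to category total: {matches_total}")
for name in features:
    print(f"{name}: {per_entry(name):.1f} annotations per annotated entry")
```

Entry counts are not additive in the same way, because a single entry can carry helices, turns and strands at once.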

Citation usage

Citation type Citations Entries
Submission 192,276 167,537
Journal article 941,244 439,896
Book 1,492 1,478
Thesis 428 425
Patent 195 191
Unpublished observations 384 380
Online journal article 610 597

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 717,510 1,054,970

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
CCDS 46,552 33,573
EMBL 935,166 539,419
PIR 122,862 112,574
RefSeq 598,405 460,483
UniGene 105,335 93,860
3D structure databases
DisProt 605 602
PDB 126,529 23,195
PDBsum 126,529 23,195
ProteinModelPortal 443,710 443,710
SMR 225,901 225,901
Protein-protein interaction databases
BioGrid 47,411 46,966
DIP 16,828 16,771
IntAct 45,479 45,479
MINT 31,693 31,693
STRING 325,706 325,706
Chemistry
BindingDB 5,687 5,687
ChEMBL 6,162 6,162
DrugBank 11,672 1,900
GuidetoPHARMACOLOGY 1,829 1,829
SwissLipids 934 860
Protein family/group databases
Allergome 1,689 1,105
CAZy 7,878 7,087
ESTHER 2,432 2,430
MEROPS 12,913 12,913
MoonProt 63 63
PeroxiBase 771 755
REBASE 413 413
TCDB 5,959 5,930
mycoCLAP 347 343
PTM databases
DEPOD 239 239
PhosphoSite 33,544 33,544
SwissPalm 4,920 4,920
UniCarbKB 584 584
iPTMnet 35,779 35,779
Polymorphism and mutation databases
BioMuta 17,246 17,245
DMDM 16,376 16,375
dbSNP 38,615 11,717
2D gel databases
COMPLUYEAST-2DPAGE 99 98
DOSAC-COBS-2DPAGE 146 146
OGP 375 375
REPRODUCTION-2DPAGE 1,259 1,038
SWISS-2DPAGE 1,180 1,180
UCD-2DPAGE 508 499
World-2DPAGE 924 913
Proteomic databases
EPD 23,368 23,368
MaxQB 32,293 32,293
PRIDE 123,841 123,841
PaxDb 110,477 110,443
PeptideAtlas 5,160 5,160
ProMEX 430 430
TopDownProteomics 3,138 2,862
Protocols and materials databases
DNASU 18,865 18,794
Genome annotation databases
Ensembl 84,200 48,406
EnsemblBacteria 354,446 335,427
EnsemblFungi 30,040 27,675
EnsemblMetazoa 13,178 9,759
EnsemblPlants 21,794 18,571
EnsemblProtists 4,871 4,708
GeneDB 389 350
GeneID 273,238 264,244
Gramene 18,023 15,653
KEGG 494,637 460,573
PATRIC 308,181 308,146
UCSC 48,620 44,640
VectorBase 615 597
WBParaSite 1 1
Organism-specific databases
ArachnoServer 1,146 1,136
CGD 1,707 1,691
CTD 73,293 72,546
ConoServer 949 866
EchoBASE 4,161 4,161
EcoGene 4,294 4,292
EuPathDB 16,706 16,706
FlyBase 5,967 5,605
GeneCards 20,026 19,851
GeneReviews 1,156 1,153
H-InvDB 5,589 4,768
HGNC 20,010 19,861
HPA 24,700 16,208
LegioList 765 763
Leproma 672 669
MGI 16,683 16,639
MIM 19,693 14,471
MaizeGDB 506 501
MalaCards 3,775 3,773
Orphanet 6,148 3,289
PharmGKB 18,383 18,342
PomBase 5,139 5,120
PseudoCAP 1,307 1,298
RGD 7,871 7,868
SGD 6,739 6,734
TAIR 14,520 14,464
TubercuList 2,121 2,085
WormBase 5,442 4,218
Xenbase 4,773 4,767
ZFIN 2,800 2,800
dictyBase 4,207 4,092
euHCVdb 55 44
neXtProt 20,047 20,047
Phylogenomic databases
GeneTree 55,475 55,431
HOGENOM 388,468 388,468
HOVERGEN 75,758 75,758
InParanoid 135,842 135,842
KO 385,041 384,609
OMA 406,922 406,922
OrthoDB 390,840 390,840
PhylomeDB 94,586 94,586
TreeFam 44,922 44,917
eggNOG 656,358 327,606
Enzyme and pathway databases
BRENDA 12,743 11,974
BioCyc 325,450 308,162
Reactome 93,433 28,489
SABIO-RK 3,274 3,274
SignaLink 2,997 2,997
UniPathway 135,137 122,412
Other
ChiTaRS 16,468 16,458
EvolutionaryTrace 16,543 16,541
GeneWiki 10,368 10,282
GenomeRNAi 21,745 21,745
NextBio 71,579 71,579
PMAP-CutDB 1,461 1,461
PRO 88,798 88,798
Gene expression databases
Bgee 38,857 38,857
CleanEx 30,045 29,405
CollecTF 133 133
ExpressionAtlas 31,337 31,337
Genevisible 55,121 55,121
Ontologies
GO 2,744,432 522,618
Family and domain databases
Gene3D 471,676 347,683
HAMAP 325,510 322,435
InterPro 1,939,105 530,881
PANTHER 169,256 162,753
PIRSF 104,351 103,313
PRINTS 134,199 118,276
PROSITE 451,974 290,527
Pfam 744,678 509,414
ProDom 29,329 29,148
SMART 171,385 128,321
SUPFAM 478,808 363,337
TIGRFAMs 292,229 271,993

Web resource

6,889 UniProtKB/Swiss-Prot entries have at least one link to a webpage of general interest on the protein.

Amino acid distribution statistics

  • 8.2% Alanine
  • 5.5% Arginine
  • 4.0% Asparagine
  • 5.4% Aspartate
  • 1.3% Cysteine
  • 3.9% Glutamine
  • 6.7% Glutamate
  • 7.0% Glycine
  • 2.2% Histidine
  • 5.9% Isoleucine
  • 9.6% Leucine
  • 5.8% Lysine
  • 2.4% Methionine
  • 3.8% Phenylalanine
  • 4.7% Proline
  • 6.5% Serine
  • 5.3% Threonine
  • 1.0% Tryptophan
  • 2.9% Tyrosine
  • 6.8% Valine

Residue classes (chart legend): Aliphatic, Acidic, Small hydroxy, Basic, Amide, Aromatic, Sulfur

Miscellaneous Statistics

15,946 entries are encoded on a mitochondrion, and 3,773 are encoded on a plasmid.

12,188 entries are encoded on a plastid, of which 21 are encoded on apicoplasts, 11,623 on chloroplasts, 51 on organellar chromatophores, 145 on cyanelles, 149 on non-photosynthetic plastids and 16 on unspecified types of plastid.
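
The plastid breakdown can be tallied directly. The sketch below uses only the counts quoted in the sentence above; the variable names are illustrative. With these figures the listed subtypes add up to fewer entries than the stated plastid total, and the source does not explain the difference, so the code only reports the gap and makes no attempt to account for it.

```python
# Counts copied from the plastid sentence above.
plastid_total = 12_188
plastid_subtypes = {
    "apicoplast": 21,
    "chloroplast": 11_623,
    "organellar chromatophore": 51,
    "cyanelle": 145,
    "non-photosynthetic plastid": 149,
    "unspecified": 16,
}

# Sum of the listed subtypes and its distance from the stated total.
listed_total = sum(plastid_subtypes.values())
gap = plastid_total - listed_total

# Share of plastid-encoded entries that are chloroplast-encoded, in percent.
chloroplast_share = 100 * plastid_subtypes["chloroplast"] / plastid_total

print(f"listed subtypes: {listed_total:,}")
print(f"stated total:    {plastid_total:,}")
print(f"difference:      {gap:,}")
print(f"chloroplast share of stated total: {chloroplast_share:.1f}%")
```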