Release 14.0 of the UniProt Knowledgebase is composed of the UniProtKB/Swiss-Prot Protein Knowledgebase release 56.0 and the UniProtKB/TrEMBL Protein Database release 39.0.

More information on these databases can be found in the user manual, under "What is the UniProt Knowledgebase?".


UniProtKB/Swiss-Prot protein knowledgebase release 56.0 statistics

Release 56.0 of 22-Jul-08 of UniProtKB/Swiss-Prot contains 392'667 sequence entries, comprising 141'217'034 amino acids abstracted from 172'036 references.

The growth of the database is summarized below.

Release  Date (MM/YY)  Number of entries  Number of amino acids
2.0 09/86 3'939 900'163
3.0 11/86 4'160 969'641
4.0 04/87 4'387 1'036'010
5.0 09/87 5'205 1'327'683
6.0 01/88 6'102 1'653'982
7.0 04/88 6'821 1'885'771
8.0 08/88 7'724 2'224'465
9.0 11/88 8'702 2'498'140
10.0 03/89 10'008 2'952'613
11.0 07/89 10'856 3'265'966
12.0 10/89 12'305 3'797'482
13.0 01/90 13'837 4'347'336
14.0 04/90 15'409 4'914'264
15.0 08/90 16'941 5'486'399
16.0 11/90 18'364 5'986'949
17.0 02/91 20'024 6'524'504
18.0 05/91 20'772 6'792'034
19.0 08/91 21'795 7'173'785
20.0 11/91 22'654 7'500'130
21.0 03/92 23'742 7'866'596
22.0 05/92 25'044 8'375'696
23.0 08/92 26'706 9'011'391
24.0 12/92 28'154 9'545'427
25.0 04/93 29'955 10'214'020
26.0 07/93 31'808 10'875'091
27.0 10/93 33'329 11'484'420
28.0 02/94 36'000 12'496'420
29.0 06/94 38'303 13'464'008
30.0 10/94 40'292 14'147'368
31.0 02/95 43'470 15'335'248
32.0 11/95 49'340 17'385'503
33.0 02/96 52'205 18'531'384
34.0 10/96 59'021 21'210'389
35.0 11/97 69'113 25'083'768
36.0 07/98 74'019 26'840'295
37.0 12/98 77'977 28'268'293
38.0 07/99 80'000 29'085'965
39.0 05/00 86'593 31'411'114
40.0 10/01 101'602 37'315'215
41.0 02/03 122'564 44'986'459
42.0 10/03 135'850 50'046'799
43.0 03/04 146'720 54'093'154
44.0 07/04 153'871 56'608'159
45.0 10/04 163'235 59'631'787
46.0 02/05 168'297 61'443'278
47.0 05/05 181'577 65'746'672
48.0 09/05 194'317 70'391'852
49.0 02/06 207'132 75'438'310
50.0 05/06 222'289 81'585'146
51.0 10/06 241'242 88'541'632
52.0 03/07 261'513 95'638'062
53.0 05/07 269'293 98'902'758
54.0 07/07 276'256 101'466'206
55.0 02/08 356'194 127'836'513
56.0 07/08 392'667 141'217'034
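
The growth between two releases can be read directly off the table. The following is a minimal sketch, not an official UniProt tool: the helper name `parse_count` is hypothetical, and the two rows are copied from the end of the table above. It strips the apostrophe thousands separators and derives the net change and the mean sequence length for release 56.0.

```python
def parse_count(text: str) -> int:
    """Convert a count written with apostrophe separators, e.g. "392'667", to an int."""
    return int(text.replace("'", ""))


# Last two rows of the growth table: (release, date, entries, amino acids).
rows = [
    ("55.0", "02/08", "356'194", "127'836'513"),
    ("56.0", "07/08", "392'667", "141'217'034"),
]

prev_entries, curr_entries = (parse_count(row[2]) for row in rows)
prev_residues, curr_residues = (parse_count(row[3]) for row in rows)

net_entries = curr_entries - prev_entries        # net change in entry count
net_residues = curr_residues - prev_residues     # net change in amino acids
mean_length = curr_residues / curr_entries       # average entry length in release 56.0

print(net_entries)              # 36473
print(net_residues)             # 13380521
print(round(mean_length, 1))    # 359.6
```

The net change in entries need not equal the number of newly added sequences, since entries are occasionally removed from the database.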

In rare cases, UniProtKB/Swiss-Prot entries are removed. Deleted entries are almost exclusively Open Reading Frames (ORFs) that were wrongly predicted to code for proteins. When there is enough evidence that these hypothetical proteins are not real, we remove them from UniProtKB/Swiss-Prot. The document delac_sp.txt lists all accession numbers that were previously present in UniProtKB/Swiss-Prot but have since been deleted from the database.


Status of the model organisms

We have selected a number of organisms that are the target of genome sequencing and/or mapping projects and for which we intend to:

  • be as complete as possible. All sequences available at a given time should be immediately included in UniProtKB/Swiss-Prot. This also includes sequence corrections and updates;
  • provide a higher level of annotation;
  • provide cross-references to specialized database(s) that contain, among other data, some information about the genes that code for these proteins;
  • provide specific indexes and documents.

From our efforts to annotate human sequence entries as completely as possible arose the HPI project, and the bacterial model organisms became the focus of the HAMAP project. Here is the current status of the model organisms which are not covered by these two projects:

Organism  Database cross-references  Index file  Number of sequences
A.thaliana TAIR arath.txt 6'914
C.albicans None yet calbican.txt 727
C.elegans Wormpep celegans.txt 3'188
D.discoideum DictyBase dicty.txt 2'479
D.melanogaster FlyBase fly.txt 2'817
M.musculus MGD mgdtosp.txt 15'813
S.cerevisiae SGD yeast.txt 6'553
S.pombe GeneDB_SPombe pombe.txt 4'421

UniProtKB/Swiss-Prot release statistics
                    1.  INTRODUCTION
                    
                    Release 56.0 of 22-Jul-08 of UniProtKB/Swiss-Prot contains 392667 sequence entries,
                    comprising 141217034 amino acids abstracted from 172036 references. 
                    
                    36631 sequences have been added since release 55.0, the sequence data of
                    605 existing entries has been updated and the annotations of
                    356036 entries have been revised.
                    
                    Number of fragments: 8097
                    Number of additional sequences produced by alternative splicing, initiation or promoter usage, or ribosomal frameshifting: 26036
                    
                    
                    Protein existence:
                    PE 1: Evidence at protein level    60013 entries
                    PE 2: Evidence at transcript level 63043 entries
                    PE 3: Inferred from homology       255230 entries
                    PE 4: Predicted                    13153 entries
                    PE 5: Uncertain                    1228 entries
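
As a consistency check, the five protein existence (PE) counts should add up to the total number of entries. A minimal sketch, with the counts copied from the list above and variable names chosen for illustration only:

```python
# Entries per protein existence level, as listed above.
pe_counts = {
    "PE 1": 60013,   # evidence at protein level
    "PE 2": 63043,   # evidence at transcript level
    "PE 3": 255230,  # inferred from homology
    "PE 4": 13153,   # predicted
    "PE 5": 1228,    # uncertain
}

total = sum(pe_counts.values())
print(total)  # 392667, the number of entries in release 56.0

# Share of each level as a percentage of all entries, one decimal place.
shares = {level: round(100 * count / total, 1) for level, count in pe_counts.items()}
for level, share in shares.items():
    print(level, share)
```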
                    
                    
                    2.  AMINO ACID COMPOSITION
                    
                    2.1  Composition in percent for the complete database
                    
                    Ala (A) 8.13   Gln (Q) 3.95   Leu (L) 9.67   Ser (S) 6.67
                    Arg (R) 5.50   Glu (E) 6.73   Lys (K) 5.88   Thr (T) 5.35
                    Asn (N) 4.05   Gly (G) 7.04   Met (M) 2.41   Trp (W) 1.09
                    Asp (D) 5.40   His (H) 2.28   Phe (F) 3.88   Tyr (Y) 2.93
                    Cys (C) 1.42   Ile (I) 5.92   Pro (P) 4.77   Val (V) 6.82
                    
                    Asx (B) 0.000  Glx (Z) 0.000  Xaa (X) 0.00
                    
                    
                    2.2  Classification of the amino acids by their frequency
                    
                    Leu, Ala, Gly, Val, Glu, Ser, Ile, Lys, Arg, Asp, Thr, Pro, Asn, Gln,
                    Phe, Tyr, Met, His, Cys, Trp
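
The classification in section 2.2 follows from the percentages in section 2.1. A minimal sketch that sorts the twenty standard amino acids by decreasing frequency (values copied from section 2.1; the variable names are illustrative):

```python
# Composition in percent for the complete database (section 2.1).
composition = {
    "Ala": 8.13, "Gln": 3.95, "Leu": 9.67, "Ser": 6.67,
    "Arg": 5.50, "Glu": 6.73, "Lys": 5.88, "Thr": 5.35,
    "Asn": 4.05, "Gly": 7.04, "Met": 2.41, "Trp": 1.09,
    "Asp": 5.40, "His": 2.28, "Phe": 3.88, "Tyr": 2.93,
    "Cys": 1.42, "Ile": 5.92, "Pro": 4.77, "Val": 6.82,
}

# Most frequent residue first.
ranked = sorted(composition, key=composition.get, reverse=True)
print(", ".join(ranked))
```

Sorting this way puts Leu first and Trp last.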
                    
                    
                    3.  TAXONOMIC ORIGIN
                    
                    Total number of species represented in this release of UniProtKB/Swiss-Prot: 11471
                    
                    The first twenty species represent 98378 sequences: 25.1 % of the total
                    number of entries.
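
The figure for the first twenty species can be reproduced from the counts in table 3.2. A minimal sketch, assuming the first twenty rows of that table and the release total of 392667 entries:

```python
# Entry counts of the twenty most represented species (rows 1-20 of table 3.2).
top_twenty = [
    20069, 15813, 7122, 6914, 6553, 5371, 4421, 4342, 3188, 2878,
    2817, 2816, 2479, 2194, 2125, 2054, 1950, 1782, 1774, 1716,
]
total_entries = 392667

top_sum = sum(top_twenty)
share = round(100 * top_sum / total_entries, 1)

print(top_sum)  # 98378
print(share)    # 25.1
```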
                    
                    
                    3.1 Table of the frequency of occurrence of species
                    
                    Species represented 1x: 5236
                                        2x: 1694
                                        3x:  835
                                        4x:  548
                                        5x:  419
                                        6x:  320
                                        7x:  232
                                        8x:  193
                                        9x:  169
                                       10x:  107
                                   11- 20x:  516
                                   21- 50x:  351
                                   51-100x:  139
                                     >100x:  712
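
The bins of table 3.1 partition the set of species, so their counts should sum to the total given at the start of section 3. A minimal sketch with the values copied from the table (the dictionary keys are only labels):

```python
# Number of species by how many entries represent them (table 3.1).
bins = {
    "1x": 5236, "2x": 1694, "3x": 835, "4x": 548, "5x": 419,
    "6x": 320, "7x": 232, "8x": 193, "9x": 169, "10x": 107,
    "11-20x": 516, "21-50x": 351, "51-100x": 139, ">100x": 712,
}

total_species = sum(bins.values())
print(total_species)  # 11471

# Percentage of species represented by a single entry.
singleton_share = round(100 * bins["1x"] / total_species, 1)
print(singleton_share)  # 45.6
```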
                    
                    
                    3.2  Table of the most represented species
                    
                    ------  ---------  --------------------------------------------
                    Number  Frequency  Species
                    ------  ---------  --------------------------------------------
                    1      20069  Homo sapiens (Human)
                    2      15813  Mus musculus (Mouse)
                    3       7122  Rattus norvegicus (Rat)
                    4       6914  Arabidopsis thaliana (Mouse-ear cress)
                    5       6553  Saccharomyces cerevisiae (Baker's yeast)
                    6       5371  Bos taurus (Bovine)
                    7       4421  Schizosaccharomyces pombe (Fission yeast)
                    8       4342  Escherichia coli (strain K12)
                    9       3188  Caenorhabditis elegans
                    10       2878  Bacillus subtilis
                    11       2817  Drosophila melanogaster (Fruit fly)
                    12       2816  Xenopus laevis (African clawed frog)
                    13       2479  Dictyostelium discoideum (Slime mold)
                    14       2194  Danio rerio (Zebrafish) (Brachydanio rerio)
                    15       2125  Pongo abelii (Sumatran orangutan)
                    16       2054  Gallus gallus (Chicken)
                    17       1950  Escherichia coli O157:H7
                    18       1782  Methanocaldococcus jannaschii (Methanococcus jannaschii)
                    19       1774  Haemophilus influenzae
                    20       1716  Oryza sativa subsp. japonica (Rice)
                    21       1700  Salmonella typhimurium
                    22       1627  Escherichia coli O6
                    23       1625  Shigella flexneri
                    24       1445  Mycobacterium tuberculosis
                    25       1323  Sus scrofa (Pig)
                    26       1292  Salmonella typhi
                    27       1241  Pseudomonas aeruginosa
                    28       1187  Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
                    29       1183  Mycobacterium bovis
                    30       1121  Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey)
                    31        990  Synechocystis sp. (strain PCC 6803)
                    32        981  Archaeoglobus fulgidus
                    33        953  Yersinia pestis
                    34        912  Vibrio cholerae
                    35        909  Acanthamoeba polyphaga mimivirus (APMV)
                    36        888  Rhizobium meliloti (Sinorhizobium meliloti)
                    37        873  Oryctolagus cuniculus (Rabbit)
                    38        866  Salmonella paratyphi A
                    39        864  Staphylococcus aureus (strain Mu50 / ATCC 700699)
                    40        863  Staphylococcus aureus (strain N315)
                    41        835  Staphylococcus aureus (strain MW2)
                    42        835  Staphylococcus aureus (strain COL)
                    43        831  Staphylococcus aureus (strain MSSA476)
                    44        828  Staphylococcus aureus (strain MRSA252)
                    45        814  Salmonella choleraesuis
                    46        809  Yersinia pseudotuberculosis
                    47        808  Escherichia coli O6:K15:H31 (strain 536 / UPEC)
                    48        808  Shigella sonnei (strain Ss046)
                    49        765  Shigella boydii serotype 4 (strain Sb227)
                    50        764  Vibrio parahaemolyticus
                    51        763  Ashbya gossypii (Yeast) (Eremothecium gossypii)
                    52        759  Aquifex aeolicus
                    53        754  Pasteurella multocida
                    54        748  Shigella dysenteriae serotype 1 (strain Sd197)
                    55        747  Escherichia coli O9:H4 (strain HS)
                    56        747  Canis familiaris (Dog)
                    57        744  Escherichia coli (strain UTI89 / UPEC)
                    58        743  Escherichia coli O139:H28 (strain E24377A / ETEC)
                    59        736  Kluyveromyces lactis (Yeast) (Candida sphaerica)
                    60        727  Candida albicans (Yeast)
                    61        724  Erwinia carotovora subsp. atroseptica (Pectobacterium atrosepticum)
                    62        717  Neurospora crassa
                    63        711  Escherichia coli (strain ATCC 8739 / DSM 1576 / Crooks)
                    64        707  Streptomyces coelicolor
                    65        705  Vibrio vulnificus
                    66        700  Staphylococcus epidermidis (strain ATCC 35984 / RP62A)
                    67        699  Staphylococcus epidermidis (strain ATCC 12228)
                    68        694  Candida glabrata (Yeast) (Torulopsis glabrata)
                    69        692  Photorhabdus luminescens subsp. laumondii
                    70        689  Bacillus halodurans
                    71        688  Vibrio vulnificus (strain YJ016)
                    72        687  Mycoplasma pneumoniae
                    73        685  Shigella flexneri serotype 5b (strain 8401)
                    74        671  Pan troglodytes (Chimpanzee)
                    75        665  Bacillus anthracis
                    76        655  Yersinia pestis bv. Antiqua (strain Nepal516)
                    77        654  Anabaena sp. (strain PCC 7120)
                    78        650  Yersinia enterocolitica serotype O:8 / biotype 1B (strain 8081)
                    79        649  Yersinia pestis bv. Antiqua (strain Antiqua)
                    80        647  Mycobacterium leprae
                    81        639  Pseudomonas syringae pv. tomato
                    82        637  Pseudomonas putida (strain KT2440)
                    83        636  Yersinia pseudotuberculosis serotype O:1b (strain IP 31758)
                    84        630  Escherichia coli O1:K1 / APEC
                    85        627  Staphylococcus aureus (strain NCTC 8325)
                    86        620  Escherichia coli
                    87        618  Salmonella paratyphi B (strain ATCC BAA-1250 / SPB7)
                    88        617  Bradyrhizobium japonicum
                    89        613  Treponema pallidum
                    90        612  Enterobacter sp. (strain 638)
                    91        609  Zea mays (Maize)
                    92        599  Klebsiella pneumoniae subsp. pneumoniae (strain ATCC 700721 / MGH 78578)
                    93        598  Yersinia pestis (strain Pestoides F)
                    94        595  Methanobacterium thermoautotrophicum
                    95        592  Bacillus cereus (strain ATCC 14579 / DSM 31)
                    96        592  Agrobacterium tumefaciens (strain C58 / ATCC 33970)
                    97        589  Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696)
                    98        586  Ralstonia solanacearum (Pseudomonas solanacearum)
                    99        581  Shewanella oneidensis
                    100        581  Rickettsia prowazekii
                    101        580  Staphylococcus aureus (strain USA300)
                    102        579  Helicobacter pylori (Campylobacter pylori)
                    103        578  Rhizobium loti (Mesorhizobium loti)
                    104        575  Serratia proteamaculans (strain 568)
                    105        572  Buchnera aphidicola subsp. Acyrthosiphon pisum 
                    106        569  Listeria monocytogenes
                    107        567  Staphylococcus aureus (strain bovine RF122 / ET3-1)
                    108        566  Lactococcus lactis subsp. lactis (Streptococcus lactis)
                    109        562  Buchnera aphidicola subsp. Schizaphis graminum
                    110        561  Listeria innocua
                    111        560  Photobacterium profundum (Photobacterium sp. (strain SS9))
                    112        560  Helicobacter pylori J99 (Campylobacter pylori J99)
                    113        559  Neisseria meningitidis serogroup B
                    114        556  Xanthomonas campestris pv. campestris
                    115        554  Salmonella arizonae (strain ATCC BAA-731 / CDC346-86 / RSK2980)
                    116        546  Staphylococcus haemolyticus (strain JCSC1435)
                    117        541  Staphylococcus saprophyticus subsp. saprophyticus 
                    118        540  Neisseria meningitidis serogroup A
                    119        538  Brucella melitensis
                    120        535  Brucella suis
                    121        534  Bacillus cereus (strain ATCC 10987)
                    122        532  Yarrowia lipolytica (Candida lipolytica)
                    123        531  Clostridium acetobutylicum
                    124        529  Enterobacter sakazakii (strain ATCC BAA-894)
                    125        528  Caulobacter crescentus (Caulobacter vibrioides)
                    126        521  Emericella nidulans (Aspergillus nidulans)
                    127        521  Debaryomyces hansenii (Yeast) (Torulaspora hansenii)
                    128        521  Xanthomonas axonopodis pv. citri
                    129        515  Oceanobacillus iheyensis
                    130        514  Bacillus thuringiensis subsp. konkukian
                    131        509  Pseudomonas syringae pv. syringae (strain B728a)
                    132        507  Buchnera aphidicola subsp. Baizongia pistaciae
                    133        507  Streptococcus pneumoniae
                    134        504  Vibrio fischeri (strain ATCC 700601 / ES114)
                    135        503  Pseudomonas fluorescens (strain PfO-1)
                    136        502  Bacillus cereus (strain ZK / E33L)
                    137        502  Listeria monocytogenes serotype 4b (strain F2365)
                    138        501  Pseudomonas aeruginosa (strain UCBPP-PA14)
                    139        499  Xylella fastidiosa
                    140        498  Pseudomonas fluorescens (strain Pf-5 / ATCC BAA-477)
                    141        497  Thermotoga maritima
                    142        493  Bacillus licheniformis (strain DSM 13 / ATCC 14580)
                    143        493  Bordetella bronchiseptica (Alcaligenes bronchisepticus)
                    144        491  Rickettsia conorii
                    145        490  Xylella fastidiosa (strain Temecula1 / ATCC 700964)
                    146        488  Pseudomonas syringae pv. phaseolicola (strain 1448A / Race 6)
                    147        483  Mycoplasma genitalium
                    148        481  Bordetella parapertussis
                    149        481  Chromobacterium violaceum
                    150        481  Haemophilus ducreyi
                    151        480  Bordetella pertussis
                    152        478  Deinococcus radiodurans
                    153        475  Sodalis glossinidius (strain morsitans)
                    154        473  Clostridium perfringens
                    155        470  Corynebacterium glutamicum (Brevibacterium flavum)
                    156        467  Vibrio cholerae serotype O1 (strain ATCC 39541 / Ogawa 395 / O395)
                    157        464  Methanosarcina acetivorans
                    158        461  Brucella abortus
                    159        458  Haemophilus influenzae (strain 86-028NP)
                    160        456  Pyrococcus horikoshii
                    161        456  Mannheimia succiniciproducens (strain MBEL55E)
                    162        455  Pseudomonas entomophila (strain L48)
                    163        452  Pyrococcus abyssi
                    164        452  Streptomyces avermitilis
                    165        451  Xanthomonas campestris pv. campestris (strain 8004)
                    166        450  Burkholderia pseudomallei (Pseudomonas pseudomallei)
                    167        448  Pseudomonas aeruginosa (strain PA7)
                    168        448  Enterococcus faecalis (Streptococcus faecalis)
                    169        448  Halobacterium salinarium (Halobacterium halobium)
                    170        447  Bacillus clausii (strain KSM-K16)
                    171        446  Rickettsia felis (Rickettsia azadi)
                    172        444  Streptococcus pneumoniae (strain ATCC BAA-255 / R6)
                    173        444  Methanosarcina mazei (Methanosarcina frisia)
                    174        442  Shewanella sp. (strain MR-7)
                    175        441  Synechococcus elongatus (Thermosynechococcus elongatus)
                    176        441  Geobacillus kaustophilus
                    177        440  Lactobacillus plantarum
                    178        440  Vibrio harveyi (strain ATCC BAA-1116 / BB120)
                    179        439  Shewanella sp. (strain MR-4)
                    180        436  Streptococcus mutans
                    181        436  Chlamydia trachomatis
                    182        434  Thermoanaerobacter tengcongensis
                    183        434  Oryza sativa subsp. indica (Rice)
                    184        433  Rickettsia bellii (strain RML369-C)
                    185        433  Pyrococcus furiosus
                    186        432  Ovis aries (Sheep)
                    187        432  Synechococcus elongatus (strain PCC 7942) (Anacystis nidulans R2)
                    188        430  Brucella abortus (strain 2308)
                    189        429  Streptococcus pyogenes serotype M6
                    190        428  Acinetobacter sp. (strain ADP1)
                    191        427  Borrelia burgdorferi (Lyme disease spirochete)
                    192        427  Burkholderia mallei (Pseudomonas mallei)
                    193        427  Nicotiana tabacum (Common tobacco)
                    194        426  Rhodopseudomonas palustris
                    195        424  Anabaena variabilis (strain ATCC 29413 / PCC 7937)
                    196        423  Burkholderia sp. (strain 383) (Burkholderia cepacia 
                    197        422  Campylobacter jejuni
                    198        421  Xanthomonas campestris pv. vesicatoria (strain 85-10)
                    199        420  Pseudomonas putida (strain F1 / ATCC 700007)
                    200        419  Chlamydia pneumoniae (Chlamydophila pneumoniae)
                    201        416  Ralstonia eutropha (strain JMP134) (Alcaligenes eutrophus)
                    202        414  Staphylococcus aureus (strain Newman)
                    203        414  Shewanella frigidimarina (strain NCIMB 400)
                    204        414  Aspergillus fumigatus (Sartorya fumigata)
                    205        413  Shewanella sp. (strain ANA-3)
                    206        412  Xanthomonas oryzae pv. oryzae (strain MAFF 311018)
                    207        412  Pseudomonas putida (strain GB-1)
                    208        410  Methylococcus capsulatus
                    209        409  Chlamydia muridarum
                    210        409  Streptococcus pyogenes serotype M1
                    211        408  Rhizobium sp. (strain NGR234)
                    212        408  Ralstonia eutropha  (Cupriavidus necator 
                    213        407  Sulfolobus solfataricus
                    214        405  Rhodobacter sphaeroides (strain ATCC 17023 / 2.4.1 / NCIB 8253 / DSM 158)
                    215        405  Streptococcus pyogenes serotype M18
                    216        403  Rickettsia typhi
                    217        403  Streptococcus pyogenes serotype M3
                    218        402  Bacillus amyloliquefaciens (strain FZB42)
                    219        400  Shewanella baltica (strain OS185)
                    220        400  Nitrosomonas europaea
                    221        398  Gloeobacter violaceus
                    222        398  Staphylococcus aureus (strain Mu3 / ATCC 700698)
                    223        397  Hahella chejuensis (strain KCTC 2396)
                    224        397  Solanum lycopersicum (Tomato) (Lycopersicon esculentum)
                    225        395  Aeromonas hydrophila subsp. hydrophila (strain ATCC 7966 / NCIB 9240)
                    226        395  Pseudoalteromonas haloplanktis (strain TAC 125)
                    227        393  Corynebacterium efficiens
                    228        392  Dechloromonas aromatica (strain RCB)
                    229        389  Neisseria gonorrhoeae (strain ATCC 700825 / FA 1090)
                    230        389  Chlorobium tepidum
                    231        389  Shewanella sp. (strain W3-18-1)
                    232        389  Colwellia psychrerythraea (strain 34H / ATCC BAA-681) (Vibrio psychroerythus)
                    233        388  Shewanella putrefaciens (strain CN-32 / ATCC BAA-453)
                    234        387  Burkholderia xenovorans (strain LB400)
                    235        385  Pseudomonas mendocina (strain ymp)
                    236        385  Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG 1402/1) 
                    237        384  Mycobacterium paratuberculosis
                    238        384  Idiomarina loihiensis
                    239        382  Shewanella denitrificans (strain OS217 / ATCC BAA-1090 / DSM 15013)
                    240        382  Shewanella baltica (strain OS195)
                    241        381  Haemophilus influenzae (strain PittEE)
                    242        381  Synechococcus sp. (strain WH8102)
                    243        381  Pyrococcus kodakaraensis (Thermococcus kodakaraensis)
                    244        380  Burkholderia thailandensis (strain E264 / ATCC 700388 / DSM 13276 / CIP 106301)
                    245        380  Aeromonas salmonicida (strain A449)
                    246        379  Shewanella baltica (strain OS155 / ATCC BAA-1091)
                    247        377  Actinobacillus pleuropneumoniae serotype 5b (strain L20)
                    248        374  Solanum tuberosum (Potato)
                    249        374  Shewanella amazonensis (strain ATCC BAA-1098 / SB2B)
                    250        374  Burkholderia cenocepacia (strain AU 1054)
                    251        372  Prochlorococcus marinus (strain MIT 9313)
                    252        372  Azoarcus sp. (strain EbN1) (Aromatoleum aromaticum (strain EbN1))
                    253        372  Streptococcus agalactiae serotype III
                    254        371  Burkholderia pseudomallei (strain 1710b)
                    255        370  Xanthomonas oryzae pv. oryzae
                    256        369  Staphylococcus aureus (strain JH1)
                    257        369  Shewanella loihica (strain ATCC BAA-1088 / PV-4)
                    258        368  Streptococcus agalactiae serotype V
                    259        368  Coxiella burnetii
                    260        367  Methanopyrus kandleri
                    261        367  Listeria welshimeri serovar 6b (strain ATCC 35897 / DSM 20650 / SLCC5334)
                    262        365  Rhizobium etli (strain CFN 42 / ATCC 51251)
                    263        365  Bacillus cereus subsp. cytotoxis (strain NVH 391-98)
                    264        364  Prochlorococcus marinus
                    265        363  Staphylococcus aureus (strain JH9)
                    266        363  Leptospira interrogans
                    267        363  Geobacter sulfurreducens
                    268        357  Aeropyrum pernix
                    269        356  Haemophilus somnus (strain 129Pt) (Histophilus somni (strain 129Pt))
                    270        356  Nitrosococcus oceani (strain ATCC 19707 / NCIMB 11848)
                    271        355  Haemophilus influenzae (strain PittGG)
                    272        353  Leptospira interrogans serogroup Icterohaemorrhagiae serovar copenhageni
                    273        352  Burkholderia cenocepacia (strain HI2424)
                    274        352  Shewanella halifaxensis (strain HAW-EB4)
                    275        352  Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579)
                    276        351  Ralstonia metallidurans (strain CH34 / ATCC 43123 / DSM 2839)
                    277        351  Rhizobium leguminosarum bv. viciae (strain 3841)
                    278        351  Pisum sativum (Garden pea)
                    279        349  Legionella pneumophila (strain Paris)
                    280        348  Bacillus pumilus (strain SAFR-032)
                    281        348  Legionella pneumophila (strain Lens)
                    282        348  Chromohalobacter salexigens (strain DSM 3043 / ATCC BAA-138 / NCIMB 13768)
                    283        347  Sulfolobus tokodaii
                    284        346  Actinobacillus succinogenes (strain ATCC 55618 / 130Z)
                    285        345  Thiobacillus denitrificans (strain ATCC 25259)
                    286        345  Nocardia farcinica
                    287        345  Psychromonas ingrahamii (strain 37)
                    288        345  Shewanella pealeana (strain ATCC 700345 / ANG-SQ1)
                    289        345  Prochlorococcus marinus subsp. pastoris (strain CCMP1378 / MED4)
                    290        343  Glycine max (Soybean)
                    291        342  Mycobacterium tuberculosis (strain ATCC 25177 / H37Ra)
                    292        342  Neisseria meningitidis serogroup C / serotype 2a (strain ATCC 700532 / FAM18)
                    293        342  Legionella pneumophila subsp. pneumophila 
                    294        340  Saccharophagus degradans (strain 2-40 / ATCC 43961 / DSM 17024)
                    295        339  Silicibacter pomeroyi
                    296        339  Desulfovibrio vulgaris (strain Hildenborough / ATCC 29579 / NCIMB 8303)
                    297        339  Burkholderia ambifaria (strain ATCC BAA-244 / AMMD) (Burkholderia cepacia 
                    298        338  Pseudoalteromonas atlantica (strain T6c / BAA-1087)
                    299        338  Shewanella sediminis (strain HAW-EB3)
                    300        336  Macaca mulatta (Rhesus macaque)
                    301        332  Geobacillus thermodenitrificans (strain NG80-2)
                    302        331  Staphylococcus aureus (strain USA300 / TCH1516)
                    303        331  Caenorhabditis briggsae
                    304        331  Rhodopirellula baltica
                    305        330  Mycobacterium bovis (strain BCG / Pasteur 1173P2)
                    306        329  Burkholderia vietnamiensis (strain G4 / LMG 22486) (Burkholderia cepacia 
                    307        329  Lactococcus lactis subsp. cremoris (strain MG1363)
                    308        329  Nitrosospira multiformis (strain ATCC 25196 / NCIMB 11849)
                    309        329  Bordetella avium (strain 197N)
                    310        328  Pseudomonas stutzeri (strain A1501)
                    311        328  Rhodoferax ferrireducens (strain DSM 15236 / ATCC BAA-621 / T118)
                    312        327  Symbiobacterium thermophilum
                    313        326  Zymomonas mobilis
                    314        326  Fusobacterium nucleatum subsp. nucleatum
                    315        324  Burkholderia pseudomallei (strain 1106a)
                    316        322  Clostridium perfringens (strain ATCC 13124 / NCTC 8237 / Type A)
                    317        322  Thermoplasma acidophilum
                    318        321  Thermus thermophilus (strain HB27 / ATCC BAA-163 / DSM 7039)
                    319        321  Wolinella succinogenes
                    320        321  Methanococcus maripaludis
                    321        321  Rhodospirillum rubrum (strain ATCC 11170 / NCIB 8255)
                    322        320  Alcanivorax borkumensis (strain SK2 / ATCC 700651 / DSM 11573)
                    323        319  Bacillus thuringiensis (strain Al Hakam)
                    324        319  Methylobacillus flagellatus (strain KT / ATCC 51484 / DSM 6875)
                    325        319  Geobacter metallireducens (strain GS-15 / ATCC 53774 / DSM 7210)
                    326        318  Triticum aestivum (Wheat)
                    327        318  Streptococcus agalactiae serotype Ia
                    328        318  Bacteroides thetaiotaomicron
                    329        317  Rhodopseudomonas palustris (strain HaA2)
                    330        316  Corynebacterium diphtheriae
                    331        316  Pelobacter carbinolicus (strain DSM 2380 / Gra Bd 1)
                    332        315  Burkholderia pseudomallei (strain 668)
                    333        315  Rhodopseudomonas palustris (strain BisB18)
                    334        315  Sinorhizobium medicae (strain WSM419) (Ensifer medicae)
                    335        315  Azoarcus sp. (strain BH72)
                    336        314  Marinobacter aquaeolei  (Marinobacter hydrocarbonoclasticus 
                    337        313  Clostridium tetani
                    338        313  Burkholderia mallei (strain NCTC 10247)
                    339        312  Methanosarcina barkeri (strain Fusaro / DSM 804)
                    340        312  Brucella canis (strain ATCC 23365 / NCTC 10854)
                    341        312  Brucella suis (strain ATCC 23445 / NCTC 10510)
                    342        311  Hordeum vulgare (Barley)
                    343        311  Campylobacter jejuni (strain RM1221)
                    344        311  Nitrobacter winogradskyi (strain Nb-255 / ATCC 25391)
                    345        310  Thiomicrospira crunogena (strain XCL-2)
                    346        309  Streptococcus pneumoniae serotype 2 (strain D39 / NCTC 7466)
                    347        309  Alkalilimnicola ehrlichei (strain MLHE-1)
                    348        308  Burkholderia mallei (strain NCTC 10229)
                    349        308  Prochlorococcus marinus (strain NATL2A)
                    350        305  Clostridium perfringens (strain SM101 / Type A)
                    351        305  Ochrobactrum anthropi (strain ATCC 49188 / DSM 6882 / NCTC 12168)
                    352        304  Sulfolobus acidocaldarius
                    353        304  Rhodopseudomonas palustris (strain BisB5)
                    354        303  Carboxydothermus hydrogenoformans (strain Z-2901 / DSM 6008)
                    355        302  Haloarcula marismortui (Halobacterium marismortui)
                    356        302  Bacteroides fragilis
                    357        301  Nitrobacter hamburgensis (strain X14 / DSM 10229)
                    358        300  Burkholderia mallei (strain SAVP1)
                    359        300  Gluconobacter oxydans (Gluconobacter suboxydans)
                    360        300  Mesorhizobium sp. (strain BNC1)
                    361        300  Streptococcus thermophilus (strain CNRZ 1066)
                    362        298  Roseobacter denitrificans (strain ATCC 33942 / OCh 114) (Erythrobacter sp.  
                    363        298  Streptococcus thermophilus (strain ATCC BAA-250 / LMG 18311)
                    364        297  Synechococcus sp. (strain CC9902)
                    365        297  Cryptococcus neoformans (Filobasidiella neoformans)
                    366        297  Prochlorococcus marinus (strain MIT 9312)
                    367        295  Staphylococcus aureus
                    368        295  Bartonella henselae (Rochalimaea henselae)
                    369        295  Psychrobacter arcticus (strain DSM 17307 / 273-4)
                    370        294  Pyrobaculum aerophilum
                    371        294  Nitrosomonas eutropha (strain C91)
                    372        293  Cavia porcellus (Guinea pig)
                    373        293  Helicobacter hepaticus
                    374        291  Lactococcus lactis subsp. cremoris (strain SK11)
                    375        290  Streptococcus sanguinis (strain SK36)
                    376        290  Desulfotalea psychrophila
                    377        289  Streptococcus gordonii (strain Challis / ATCC 35105 / CH1 / DL1 / V288)
                    378        289  Legionella pneumophila (strain Corby)
                    379        289  Synechococcus sp. (strain JA-3-3Ab) 
                    380        289  Thermoplasma volcanium
                    381        289  Bartonella quintana (Rochalimaea quintana)
                    382        288  Synechococcus sp. (strain CC9605)
                    383        288  Synechococcus sp. (strain JA-2-3B'a(2-13)) 
                    384        287  Moorella thermoacetica (strain ATCC 39073)
                    385        286  Brucella ovis (strain ATCC 25840 / 63/290 / NCTC 10512)
                    386        286  Streptococcus pyogenes serotype M28
                    387        286  Psychrobacter cryohalolentis (strain K5)
                    388        286  Halorhodospira halophila (strain DSM 244 / SL1) (Ectothiorhodospira halophila 
                    389        285  Pseudomonas putida
                    390        284  Jannaschia sp. (strain CCS1)
                    391        284  Streptococcus pyogenes serotype M5 (strain Manfredo)
                    392        282  Rhodopseudomonas palustris (strain BisA53)
                    393        282  Haemophilus somnus (strain 2336) (Histophilus somni (strain 2336))
                    394        282  Lactobacillus sakei subsp. sakei (strain 23K)
                    395        281  Rhodobacter sphaeroides (strain ATCC 17029 / ATH 2.4.9)
                    396        280  Trichodesmium erythraeum (strain IMS101)
                    397        280  Silicibacter sp. (strain TM1040)
                    398        280  Bifidobacterium longum
                    399        279  Ustilago maydis (Smut fungus)
                    400        279  Streptococcus thermophilus (strain ATCC BAA-491 / LMD-9)
                    401        279  Wigglesworthia glossinidia brevipalpis
                    402        278  Spinacia oleracea (Spinach)
                    403        277  Campylobacter jejuni subsp. jejuni serotype O:23/36 (strain 81-176)
                    404        277  Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182)
                    405        276  Lactobacillus johnsonii
                    406        275  Campylobacter jejuni subsp. jejuni serotype O:6 (strain 81116 / NCTC 11828)
                    407        275  Porphyromonas gingivalis (Bacteroides gingivalis)
                    408        274  Equus caballus (Horse)
                    409        274  Propionibacterium acnes
                    410        272  Gorilla gorilla gorilla (Lowland gorilla)
                    411        272  Polaromonas sp. (strain JS666 / ATCC BAA-500)
                    412        272  Leifsonia xyli subsp. xyli
                    413        270  Bacteroides fragilis (strain ATCC 25285 / NCTC 9343)
                    414        269  Francisella tularensis subsp. tularensis
                    415        269  Bradyrhizobium sp. (strain ORS278)
                    416        269  Clostridium botulinum (strain Langeland / NCTC 10281 / Type F)
                    417        269  Aspergillus oryzae
                    418        268  Blochmannia floridanus
                    419        268  Rhodococcus sp. (strain RHA1)
                    420        268  Bacteriophage T4
                    421        268  Desulfovibrio desulfuricans (strain G20)
                    422        268  Acidovorax avenae subsp. citrulli (strain AAC00-1)
                    423        267  Helicobacter pylori (strain HPAG1)
                    424        267  Anaeromyxobacter dehalogenans (strain 2CP-C)
                    425        266  Magnetospirillum magneticum (strain AMB-1 / ATCC 700264)
                    426        265  Lactobacillus acidophilus
                    427        265  Clostridium novyi (strain NT)
                    428        264  Janthinobacterium sp. (strain Marseille) (Minibacterium massiliensis)
                    429        264  Mycobacterium ulcerans (strain Agy99)
                    430        264  Chlorobium chlorochromatii (strain CaD3)
                    431        263  Ureaplasma parvum (Ureaplasma urealyticum biotype 1)
                    432        263  Neisseria meningitidis serogroup C (strain 053442)
                    433        262  Rhodobacter capsulatus (Rhodopseudomonas capsulata)
                    434        262  Paracoccus denitrificans (strain Pd 1222)
                    435        262  Streptococcus pyogenes serotype M12 (strain MGAS9429)
                    436        261  Streptococcus pyogenes serotype M4 (strain MGAS10750)
                    437        260  Corynebacterium glutamicum (strain R)
                    438        260  Desulfitobacterium hafniense (strain Y51)
                    439        260  Chlamydophila caviae
                    440        258  Streptococcus pyogenes serotype M2 (strain MGAS10270)
                    441        258  Polaromonas naphthalenivorans (strain CJ2)
                    442        257  Myxococcus xanthus (strain DK 1622)
                    443        257  Clostridium beijerinckii (strain ATCC 51743 / NCIMB 8052) 
                    444        257  Francisella tularensis subsp. holarctica (strain LVS)
                    445        257  Prochlorococcus marinus (strain MIT 9301)
                    446        257  Mycobacterium smegmatis (strain ATCC 700084 / mc(2)155)
                    447        257  Synechococcus sp. (strain CC9311)
                    448        256  Thermotoga petrophila (strain RKU-1 / ATCC BAA-488 / DSM 13995)
                    449        256  Herminiimonas arsenicoxydans
                    450        256  Pelodictyon luteolum (strain DSM 273) (Chlorobium luteolum (strain DSM 273))
                    451        255  Acidovorax sp. (strain JS42)
                    452        255  Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
                    453        255  Prochlorococcus marinus (strain MIT 9515)
                    454        255  Synechococcus sp. (strain WH7803)
                    455        255  Mycobacterium avium (strain 104)
                    456        254  Clostridium botulinum (strain ATCC 19397 / Type A)
                    457        254  Vaccinia virus (strain Copenhagen) (VACV)
                    458        253  Thermobifida fusca (strain YX)
                    459        253  Corynebacterium jeikeium (strain K411)
                    460        253  Novosphingobium aromaticivorans (strain DSM 12444)
                    461        252  Prochlorococcus marinus (strain AS9601)
                    462        252  Mycobacterium vanbaalenii (strain DSM 7251 / PYR-1)
                    463        251  Mycobacterium sp. (strain MCS)
                    464        250  Lactobacillus salivarius subsp. salivarius (strain UCC118)
                    465        250  Bdellovibrio bacteriovorus
                    466        249  Rhodobacter sphaeroides (strain ATCC 17025 / ATH 2.4.3)
                    467        248  Methylibium petroleiphilum (strain PM1)
                    468        248  Clostridium kluyveri (strain ATCC 8527 / DSM 555 / NCIMB 10680)
                    469        248  Campylobacter jejuni subsp. doylei (strain ATCC BAA-1458 / RM4099 / 269.97)
                    470        247  Alkaliphilus metalliredigens (strain QYMF)
                    471        246  Blochmannia pennsylvanicus (strain BPEN)
                    472        246  Prochlorococcus marinus (strain NATL1A)
                    473        246  Marinomonas sp. (strain MWYL1)
                    474        245  Prochlorococcus marinus (strain MIT 9215)
                    475        245  Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / ORS 571)
                    476        244  Coxiella burnetii (strain Dugway 5J108-111)
                    477        244  Sulfurimonas denitrificans  (Thiomicrospira denitrificans 
                    478        244  Coxiella burnetii (strain RSA 331 / Henzerling II)
                    479        244  Streptococcus pyogenes serotype M12 (strain MGAS2096)
                    480        244  Geobacter uraniireducens (strain Rf4) (Geobacter uraniumreducens)
                    481        243  Mycobacterium sp. (strain KMS)
                    482        243  Clostridium difficile (strain 630)
                    483        242  Francisella tularensis subsp. tularensis (strain FSC 198)
                    484        241  Mycobacterium sp. (strain JLS)
                    485        241  Desulfovibrio vulgaris subsp. vulgaris (strain DP4)
                    486        240  Lactobacillus casei (strain ATCC 334)
                    487        240  Prochlorococcus marinus (strain MIT 9303)
                    488        239  Francisella tularensis subsp. novicida (strain U112)
                    489        238  Treponema denticola
                    490        237  Acaryochloris marina (strain MBIC 11017)
                    491        237  Bacillus stearothermophilus (Geobacillus stearothermophilus)
                    492        237  Francisella tularensis subsp. holarctica (strain OSU18)
                    493        236  Baumannia cicadellinicola subsp. Homalodisca coagulata
                    494        235  Clostridium botulinum (strain Hall / ATCC 3502 / NCTC 13319 / Type A)
                    495        235  Natronomonas pharaonis (strain DSM 2160 / ATCC 35678)
                    496        235  Syntrophus aciditrophicus (strain SB)
                    497        234  Sphingopyxis alaskensis (Sphingomonas alaskensis)
                    498        234  Methanococcus vannielii (strain SB / ATCC 35089 / DSM 1224)
                    499        234  Leptospira borgpetersenii serovar Hardjo-bovis (strain JB197)
                    500        233  Hyphomonas neptunium (strain ATCC 15444)
                    501        232  Pediococcus pentosaceus (strain ATCC 25745 / 183-1w)
                    502        232  Methanococcus maripaludis (strain C7 / ATCC BAA-1331)
                    503        232  Chlorobium phaeobacteroides (strain DSM 266)
                    504        231  Chlamydomonas reinhardtii
                    505        231  Verminephrobacter eiseniae (strain EF01-2)
                    506        230  Pelobacter propionicus (strain DSM 2379)
                    507        230  Alkaliphilus oremlandii (strain OhILAs) (Clostridium oremlandii (strain OhILAs))
                    508        229  Helicobacter acinonychis (strain Sheeba)
                    509        229  Methanococcus maripaludis (strain C5 / ATCC BAA-1333)
                    510        229  Maricaulis maris (strain MCS10)
                    511        229  Deinococcus geothermalis (strain DSM 11300)
                    512        226  Chlamydia trachomatis (strain A/HAR-13 / ATCC VR-571B)
                    513        226  Francisella tularensis subsp. tularensis (strain WY96-3418)
                    514        225  Protochlamydia amoebophila (strain UWE25)
                    515        224  Cricetulus griseus (Chinese hamster)
                    516        223  Desulfotomaculum reducens (strain MI-1)
                    517        223  Francisella tularensis subsp. holarctica (strain FTA)
                    518        223  Syntrophomonas wolfei subsp. wolfei (strain Goettingen)
                    519        222  Dinoroseobacter shibae (strain DFL 12)
                    520        221  Frankia sp. (strain CcI3)
                    521        221  Caulobacter sp. (strain K31)
                    522        220  Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB)
                    523        220  Lactobacillus brevis (strain ATCC 367 / JCM 1170)
                    524        219  Synechococcus sp. (strain RCC307)
                    525        219  Bartonella tribocorum (strain CIP 105476 / IBS 506)
                    526        218  Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC 11842 / DSM 20081)
                    527        218  Chlamydophila abortus
                    528        217  Felis silvestris catus (Cat)
                    529        217  Porphyra purpurea
                    530        217  Leptospira borgpetersenii serovar Hardjo-bovis (strain L550)
                    531        217  Bartonella bacilliformis (strain ATCC 35685 / KC583)
                    532        217  Methanococcoides burtonii (strain DSM 6242)
                    533        216  Dehalococcoides sp. (strain CBDB1)
                    534        215  Dehalococcoides ethenogenes (strain 195)
                    535        215  Rickettsia akari (strain Hartford)
                    536        214  Klebsiella pneumoniae
                    537        212  Granulibacter bethesdensis (strain ATCC BAA-1260 / CGDNIH1)
                    538        212  Parvibaculum lavamentivorans (strain DS-1 / DSM 13023 / NCIMB 13966)
                    539        211  Rickettsia canadensis (strain McKiel)
                    540        210  Mycobacterium gilvum (strain PYR-GCK) (Mycobacterium flavescens 
                    541        210  Francisella philomiragia subsp. philomiragia (strain ATCC 25017)
                    542        210  Anaeromyxobacter sp. (strain Fw109-5)
                    543        210  Rickettsia rickettsii (strain Sheila Smith)
                    544        210  Bacteroides vulgatus (strain ATCC 8482 / DSM 1447 / NCTC 11154)
                    545        209  Gibberella zeae (Fusarium graminearum)
                    546        209  Streptococcus suis (strain 98HAH33)
                    547        208  Nitratiruptor sp. (strain SB155-2)
                    548        208  Porphyra yezoensis
                    549        208  Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903)
                    550        207  Pelagibacter ubique
                    551        206  Magnetococcus sp. (strain MC-1)
                    552        206  Mesocricetus auratus (Golden hamster)
                    553        206  Salinibacter ruber (strain DSM 13855)
                    554        206  Prosthecochloris vibrioformis  (Chlorobium vibrioforme subsp. thiosulfatophilum  (Chlorobium phaeovibrioides 
                    555        204  Chlamydophila felis (strain Fe/C-56)
                    556        204  Lactobacillus delbrueckii subsp. bulgaricus (strain ATCC BAA-365)
                    557        204  Psychrobacter sp. (strain PRwf-1)
                    558        203  Encephalitozoon cuniculi
                    559        203  Tropheryma whipplei (strain TW08/27) (Whipple's bacillus)
                    560        202  Tropheryma whipplei (strain Twist) (Whipple's bacillus)
                    561        202  Parabacteroides distasonis (strain ATCC 8503 / DSM 20701 / NCTC 11152)
                    562        202  Lactobacillus reuteri (strain ATCC 23272 / DSM 20016 / F275)
                    563        201  Acidiphilium cryptum (strain JF-5)
                    564        201  Sphingomonas wittichii (strain RW1 / DSM 6014 / JCM 10273)
                    565        201  Vaccinia virus (strain Western Reserve / WR) (VACV)
                    566        201  Acidobacteria bacterium (strain Ellin345)
                    567        201  Rubrobacter xylanophilus (strain DSM 9941 / NBRC 16129)
                    568        200  Picrophilus torridus
                    569        200  Saccharopolyspora erythraea (strain NRRL 23338)
                    
                    
                    
                    3.3  Taxonomic distribution of the sequences
                    
                    Kingdom        sequences (% of the database)
                    Archaea           14694 (  4%)
                    Bacteria         224003 ( 57%)
                    Eukaryota        141583 ( 36%)
                    Viruses           12387 (  3%)
                    
                    
                    Within Eukaryota:
                    
                    Category            sequences (% of Eukaryota) (% of the complete database)
                    Human                  20070 ( 14%)           (  5%)
                    Other Mammalia         42975 ( 30%)           ( 11%)
                    Other Vertebrata       13982 ( 10%)           (  4%)
                    Viridiplantae          23475 ( 17%)           (  6%)
                    Fungi                  21941 ( 15%)           (  6%)
                    Insecta                 5528 (  4%)           (  1%)
                    Nematoda                3765 (  3%)           (  1%)
                    Other                   9847 (  7%)           (  3%)
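The percentage columns in the two tables of section 3.3 can be re-derived from the raw counts. The Python sketch below is illustrative only: the dictionary names and the `share` helper are hypothetical, and it assumes that the published percentages are rounded to the nearest integer.

```python
# Counts copied from the two tables in section 3.3.
KINGDOMS = {
    "Archaea": 14694,
    "Bacteria": 224003,
    "Eukaryota": 141583,
    "Viruses": 12387,
}
EUKARYOTA = {
    "Human": 20070,
    "Other Mammalia": 42975,
    "Other Vertebrata": 13982,
    "Viridiplantae": 23475,
    "Fungi": 21941,
    "Insecta": 5528,
    "Nematoda": 3765,
    "Other": 9847,
}


def share(part, whole):
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


# The four kingdoms together make up the whole database.
database_total = sum(KINGDOMS.values())

# Percentage of the database per kingdom.
kingdom_pct = {name: share(n, database_total) for name, n in KINGDOMS.items()}

# For each eukaryotic category: (% of Eukaryota, % of the complete database).
eukaryota_pct = {
    name: (share(n, KINGDOMS["Eukaryota"]), share(n, database_total))
    for name, n in EUKARYOTA.items()
}

for name, (of_euk, of_db) in eukaryota_pct.items():
    print(f"{name:<18}{EUKARYOTA[name]:>7} ({of_euk:>3}%) ({of_db:>3}%)")
```

The eukaryotic categories sum to the Eukaryota count, so the same helper serves both denominators.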
                    
                    
                    4.  SEQUENCE SIZE
                    
                    Distribution of the sequences by size (excluding fragments)
                    
                    From   To  Number             From   To   Number
                       1-  50    6645             1001-1100     2928
                      51- 100   29356             1101-1200     1993
                     101- 150   41958             1201-1300     1582
                     151- 200   41301             1301-1400     1428
                     201- 250   41103             1401-1500     1115
                     251- 300   36011             1501-1600      556
                     301- 350   35089             1601-1700      435
                     351- 400   30808             1701-1800      380
                     401- 450   25341             1801-1900      352
                     451- 500   21000             1901-2000      282
                     501- 550   14954             2001-2100      178
                     551- 600   10904             2101-2200      246
                     601- 650    9329             2201-2300      244
                     651- 700    6517             2301-2400      164
                     701- 750    5321             2401-2500      113
                     751- 800    3918             >2500          860
                     801- 850    3418
                     851- 900    3641
                     901- 950    2986
                     951-1000    2114
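As an illustration of how the size table above can be used, the following Python sketch locates the bin that contains the median sequence length. The bin boundaries and counts are copied from the table; the names `SIZE_BINS` and `median_bin` are hypothetical, and representing the open-ended ">2500" bin with an upper bound of `None` is a choice made here, not a UniProtKB convention.

```python
# (first length, last length, number of sequences); None marks the open-ended bin.
SIZE_BINS = [
    (1, 50, 6645), (51, 100, 29356), (101, 150, 41958), (151, 200, 41301),
    (201, 250, 41103), (251, 300, 36011), (301, 350, 35089), (351, 400, 30808),
    (401, 450, 25341), (451, 500, 21000), (501, 550, 14954), (551, 600, 10904),
    (601, 650, 9329), (651, 700, 6517), (701, 750, 5321), (751, 800, 3918),
    (801, 850, 3418), (851, 900, 3641), (901, 950, 2986), (951, 1000, 2114),
    (1001, 1100, 2928), (1101, 1200, 1993), (1201, 1300, 1582),
    (1301, 1400, 1428), (1401, 1500, 1115), (1501, 1600, 556),
    (1601, 1700, 435), (1701, 1800, 380), (1801, 1900, 352),
    (1901, 2000, 282), (2001, 2100, 178), (2101, 2200, 246),
    (2201, 2300, 244), (2301, 2400, 164), (2401, 2500, 113),
    (2501, None, 860),
]


def median_bin(bins):
    """Return the (low, high) bounds of the bin holding the median sequence."""
    total = sum(count for _, _, count in bins)
    running = 0
    for low, high, count in bins:
        running += count
        # The median lies in the first bin where the running count
        # reaches half of all sequences.
        if 2 * running >= total:
            return (low, high)
    raise ValueError("empty histogram")


# Number of non-fragment sequences covered by the table.
total_sequences = sum(count for _, _, count in SIZE_BINS)

print(total_sequences, median_bin(SIZE_BINS))
```

Because the table excludes fragments, the total computed here is smaller than the number of entries in the release.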
                    
                    
                    The average sequence length in UniProtKB/Swiss-Prot is 359 amino acids.
                    
                    The shortest sequence is   GWA_SEPOF (P83570):     2 amino acids.
                    The longest sequence is  TITIN_MOUSE (A2ASS6): 35213 amino acids.
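The average length quoted above can be checked against the release totals (392'667 entries, 141'217'034 amino acids). The short Python sketch below is only a sanity check; the variable names are hypothetical, and how the published figure was actually derived is not stated in this document.

```python
# Totals stated for UniProtKB/Swiss-Prot release 56.0.
TOTAL_AMINO_ACIDS = 141_217_034
TOTAL_ENTRIES = 392_667

# Exact mean length per entry.
mean_length = TOTAL_AMINO_ACIDS / TOTAL_ENTRIES

# Integer part of the mean (floor division).
truncated_mean = TOTAL_AMINO_ACIDS // TOTAL_ENTRIES

print(f"mean = {mean_length:.2f}, integer part = {truncated_mean}")
```

The published value of 359 coincides with the integer part of the mean; rounding to the nearest integer would give 360.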
                    
                    
                    5.  JOURNAL CITATIONS
                    
                    Note: the following citation statistics reflect the number of distinct
                    journal citations.
                    
                    Total number of journals cited in this release of UniProtKB/Swiss-Prot: 1930
                    
                    
                    5.1  Table of the frequency of journal citations
                    
                    Journals cited 1x:  630
                                   2x:  266
                                   3x:  133
                                   4x:  100
                                   5x:   73
                                   6x:   54
                                   7x:   44
                                   8x:   38
                                   9x:   34
                                  10x:   24
                              11- 20x:  150
                              21- 50x:  150
                              51-100x:   91
                                >100x:  143
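The frequency classes above should account for every journal counted in section 5. A minimal Python consistency check, with hypothetical variable names and the class labels taken verbatim from the table, might look like this:

```python
# Number of journals per citation-frequency class (section 5.1).
JOURNAL_FREQUENCY = {
    "1x": 630, "2x": 266, "3x": 133, "4x": 100, "5x": 73,
    "6x": 54, "7x": 44, "8x": 38, "9x": 34, "10x": 24,
    "11- 20x": 150, "21- 50x": 150, "51-100x": 91, ">100x": 143,
}

# Total number of journals cited, as stated in section 5.
STATED_TOTAL = 1930

journals_total = sum(JOURNAL_FREQUENCY.values())

# Share of journals that are cited exactly once, in percent.
cited_once_pct = 100 * JOURNAL_FREQUENCY["1x"] / journals_total

print(journals_total == STATED_TOTAL, f"{cited_once_pct:.1f}%")
```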
                    
                    
                    5.2  List of the most cited journals in UniProtKB/Swiss-Prot
                    
                    Nb    Citations   Journal name
                    --    ---------   -------------------------------------------------------------
                    1        16362   Journal of Biological Chemistry
                    2         7669   Proceedings of the National Academy of Sciences of the U.S.A.
                    3         4700   Journal of Bacteriology
                    4         4405   Gene
                    5         4201   Biochemical and Biophysical Research Communications
                    6         4182   Nucleic Acids Research
                    7         3749   FEBS Letters
                    8         3504   Biochemistry
                    9         3483   The EMBO Journal
                    10         3128   Molecular and Cellular Biology
                    11         3010   European Journal of Biochemistry
                    12         2973   Nature
                    13         2831   Biochimica et Biophysica Acta
                    14         2713   Journal of Molecular Biology
                    15         2434   Genomics
                    16         2419   Cell
                    17         2020   Biochemical Journal
                    18         1893   Science
                    19         1629   Journal of Virology
                    20         1587   Molecular Microbiology
                    21         1431   Journal of Cell Biology
                    22         1427   Plant Molecular Biology
                    23         1290   Molecular and General Genetics
                    24         1232   Virology
                    25         1208   Nature Genetics
                    26         1201   Genes and Development
                    27         1196   Human Molecular Genetics
                    28         1122   Journal of Biochemistry
                    29         1109   Plant Physiology
                    30         1108   Oncogene
                    31         1104   The American Journal of Human Genetics
                    32          985   Development
                    33          922   Journal of Immunology
                    34          907   Human Mutation
                    35          869   Genetics
                    36          850   Molecular Biology of the Cell
                    37          816   Infection and Immunity
                    38          803   Structure
                    39          772   Journal of General Virology
                    40          757   Archives of Biochemistry and Biophysics
                    41          723   Yeast
                    42          718   The Plant Cell
                    43          701   Blood
                    44          672   Microbiology
                    45          651   Molecular Cell
                    46          617   Developmental Biology
                    47          611   Journal of Cell Science
                    48          600   FEMS Microbiology Letters
                    49          598   Cancer Research
                    50          597   The Plant Journal
                    51          564   Human Genetics
                    52          564   Nature Structural Biology
                    53          533   Mechanisms of Development
                    54          525   Current Biology
                    55          511   Current Genetics
                    56          477   Applied and Environmental Microbiology
                    57          476   Journal of Neuroscience
                    58          467   Acta Crystallographica, Section D
                    59          466   Journal of Clinical Investigation
                    60          463   Protein Science
                    61          462   Neuron
                    62          460   Mammalian Genome
                    63          423   Immunogenetics
                    64          421   The Journal of Experimental Medicine
                    65          420   Toxicon
                    66          415   Molecular Endocrinology
                    67          410   Molecular and Biochemical Parasitology
                    68          408   American Journal of Physiology
                    69          379   Journal of Neurochemistry
                    70          365   Endocrinology
                    71          360   Journal of Molecular Evolution
                    72          354   DNA and Cell Biology
                    73          351   The Journal of Clinical Endocrinology and Metabolism
                    74          346   DNA Sequence
                    75          332   Molecular Biology and Evolution
                    76          315   Bioscience, Biotechnology, and Biochemistry
                    77          307   Journal of Medical Genetics
                    78          306   Brain Research. Molecular Brain Research
                    79          286   Biological Chemistry Hoppe-Seyler
                    80          280   Proteins
                    81          272   Cytogenetics and Cell Genetics
                    82          261   Comparative Biochemistry and Physiology
                    83          260   Peptides
                    84          256   Journal of Investigative Dermatology
                    85          251   Antimicrobial Agents and Chemotherapy
                    86          245   Journal of General Microbiology
                    87          245   Molecular Pharmacology
                    88          240   Biology of Reproduction
                    89          239   Plant and Cell Physiology
                    90          239   Nature Cell Biology
                    91          233   Experimental Cell Research
                    92          225   Genome Research
                    93          215   Hoppe-Seyler's Zeitschrift fur Physiologische Chemie
                    94          213   Virus Research
                    95          210   Neurology
                    96          197   Developmental Dynamics
                    97          194   Molecular Plant-Microbe Interactions
                    98          193   RNA
                    99          191   DNA Research
                    100          188   European Journal of Immunology
                    101          185   Biochimie
                    102          181   Tissue Antigens
                    103          175   Annals of Neurology
                    104          174   European Journal of Human Genetics
                    105          168   Planta
                    106          167   Journal of Human Genetics
                    107          166   Genes to Cells
                    108          163   Molecular and Cellular Endocrinology
                    109          163   Immunity
                    110          163   Developmental Cell
                    111          159   DNA
                    112          155   Molecular Phylogenetics and Evolution
                    113          154   American Journal of Medical Genetics
                    114          152   Hemoglobin
                    115          150   Archives of Microbiology
                    116          150   Eukaryotic Cell
                    117          148   The New England Journal of Medicine
                    118          147   Insect Biochemistry and Molecular Biology
                    119          146   Bioorganicheskaia Khimiia
                    120          139   Investigative Ophthalmology and Visual Science
                    121          137   Molecular Reproduction and Development
                    122          136   Diabetes
                    123          134   Glycobiology
                    124          134   Animal Genetics
                    125          132   Molecular Immunology
                    126          129   General and Comparative Endocrinology
                    127          128   Molecular and Cellular Neuroscience
                    128          125   International Journal of Cancer
                    129          121   Archives of Virology
                    130          119   Agricultural and Biological Chemistry
                    131          116   The FASEB Journal
                    132          112   British Journal of Haematology
                    133          112   EMBO Reports
                    134          111   Molecular Genetics and Metabolism
                    135          111   Clinical Genetics
                    136          110   Journal of Protein Chemistry
                    137          108   Biological Chemistry
                    138          106   Molecular Genetics and Genomics
                    139          106   Journal of Cellular Biochemistry
                    140          105   Journal of Neuroscience Research
                    141          104   Neuroscience Letters
                    142          103   Journal of Molecular Endocrinology
                    143          103   Journal of Lipid Research
                    144          100   Biochemistry and Molecular Biology International
                    
                    
                    6.  STATISTICS FOR SOME LINE TYPES
                    
                    The following table summarizes the total number of some UniProtKB/Swiss-Prot lines,
                    as well as the number of entries with at least one such line, and the
                    frequency of the lines.
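
The "Average per entry" column appears to be the total number of lines divided by the number of entries in the release (392'667 for UniProtKB/Swiss-Prot 56.0). The following Python sketch reproduces a few values from the References (RL) block under that assumption. The function name and the "<0.01" cut-off are illustrative guesses, not part of the release notes.

```python
SWISSPROT_ENTRIES = 392_667  # entries in UniProtKB/Swiss-Prot release 56.0


def average_per_entry(total_lines: int, entries: int = SWISSPROT_ENTRIES) -> str:
    """Format total_lines / entries the way the last table column appears to."""
    avg = total_lines / entries
    # Assumption: values that would round to 0.00 are shown as "<0.01".
    return "<0.01" if avg < 0.005 else f"{avg:.2f}"


print(average_per_entry(716_052))  # References (RL), all subtypes
print(average_per_entry(584_653))  # Journal
print(average_per_entry(594))      # Book citation
```

Note that the divisor is the size of the whole release, not the "Number of entries" column, which counts only entries carrying at least one such line.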
                    
                                                       Total    Number of  Average
                    Line type / subtype                number   entries    per entry
                    ---------------------------------  -------- ---------  ---------
                    
                    References (RL)                     716052              1.82
                    1     Journal                          584653    309924    1.49
                    2     Submitted to EMBL/GenBank/DDBJ   124370    114305    0.32
                    3     Submitted to other databases       5069      4680    0.01
                    4     Book citation                       594       584   <0.01
                    5     Plant Gene Register                 543       531   <0.01
                    6     Thesis                              389       387   <0.01
                    7     Unpublished observations            287       283   <0.01
                    8     Patent                              141       139   <0.01
                    9     Worm Breeder's Gazette                6         6   <0.01
                    
                    Total number of distinct authors cited in UniProtKB/Swiss-Prot: 263407.
                    
                    Comments (CC)                      1625064              4.14
                    1     SIMILARITY                       455111    369189    1.16
                    2     FUNCTION                         281813    271302    0.72
                    3     SUBCELLULAR LOCATION             225139    220959    0.57
                    4     CATALYTIC ACTIVITY               157277    143739    0.40
                    5     SUBUNIT                          154763    154763    0.39
                    6     PATHWAY                           91738     79969    0.23
                    7     COFACTOR                          65345     59933    0.17
                    8     TISSUE SPECIFICITY                29543     29543    0.08
                    9     PTM                               29031     23754    0.07
                    10    MISCELLANEOUS                     26924     24573    0.07
                    11    DOMAIN                            24285     21420    0.06
                    12    ALTERNATIVE PRODUCTS              16919     16919    0.04
                    13    SEQUENCE CAUTION                  10382     10382    0.03
                    14    INTERACTION                        9471      9471    0.02
                    15    INDUCTION                          9204      9204    0.02
                    16    DEVELOPMENTAL STAGE                7584      7584    0.02
                    17    WEB RESOURCE                       6317      5139    0.02
                    18    ENZYME REGULATION                  6276      6276    0.02
                    19    CAUTION                            5356      5249    0.01
                    20    DISEASE                            4375      3018    0.01
                    21    MASS SPECTROMETRY                  3571      2713    0.01
                    22    BIOPHYSICOCHEMICAL PROPERTIES      2236      2236    0.01
                    23    POLYMORPHISM                        718       688   <0.01
                    24    RNA EDITING                         544       544   <0.01
                    25    ALLERGEN                            447       447   <0.01
                    26    TOXIC DOSE                          379       371   <0.01
                    27    BIOTECHNOLOGY                       236       234   <0.01
                    28    PHARMACEUTICAL                       80        80   <0.01
                    
                    Features (FT)                      2470799              6.29
                    1     CHAIN                            398905    388707    1.02
                    2     TRANSMEM                         269851     55094    0.69
                    3     METAL                            179485     44921    0.46
                    4     BINDING                          127583     40359    0.32
                    5     DOMAIN                           118825     68530    0.30
                    6     CONFLICT                         108345     37614    0.28
                    7     STRAND                           106813     10124    0.27
                    8     MOD_RES                          104497     37238    0.27
                    9     TOPO_DOM                         104334     21254    0.27
                    10    HELIX                            103676     10650    0.26
                    11    ACT_SITE                          94617     56024    0.24
                    12    CARBOHYD                          86991     22400    0.22
                    13    DISULFID                          85328     21608    0.22
                    14    REPEAT                            72946     11107    0.19
                    15    NP_BIND                           70900     48718    0.18
                    16    VARIANT                           60913     12807    0.16
                    17    REGION                            60758     33829    0.15
                    18    COMPBIAS                          37679     21541    0.10
                    19    VAR_SEQ                           35538     15081    0.09
                    20    SIGNAL                            29376     29366    0.07
                    21    MOTIF                             25878     16782    0.07
                    22    TURN                              25694      8574    0.07
                    23    SITE                              25012     14433    0.06
                    24    ZN_FING                           24643      9992    0.06
                    25    MUTAGEN                           24327      5884    0.06
                    26    COILED                            14972      9908    0.04
                    27    INIT_MET                          12351     12351    0.03
                    28    NON_TER                           10933      8359    0.03
                    29    LIPID                              9425      6043    0.02
                    30    PROPEP                             9382      7808    0.02
                    31    DNA_BIND                           8799      8132    0.02
                    32    PEPTIDE                            7590      4655    0.02
                    33    TRANSIT                            5616      5533    0.01
                    34    CA_BIND                            3347      1388    0.01
                    35    CROSSLNK                           3031      2096    0.01
                    36    NON_CONS                           1432       581   <0.01
                    37    UNSURE                              667       223   <0.01
                    38    NON_STD                             340       266   <0.01
                    
                    Cross-references (DR)              6953477             17.71
                    1     InterPro                         954161    365778    2.43
                    2     EMBL                             678519    383792    1.73
                    3     GO                               659518    261671    1.68
                    4     Pfam                             505996    353567    1.29
                    5     PROSITE                          356139    223270    0.91
                    6     RefSeq                           355438    325205    0.91
                    7     GeneID                           341505    324984    0.87
                    8     KEGG                             300528    280532    0.77
                    9     GenomeReviews                    256332    238769    0.65
                    10    HAMAP                            205008    204908    0.52
                    11    HOGENOM                          198402    198399    0.51
                    12    TIGRFAMs                         185636    173805    0.47
                    13    Gene3D                           180250    149065    0.46
                    14    BioCyc                           145962    139440    0.37
                    15    PANTHER                          143138    132190    0.36
                    16    PRINTS                           123838    101263    0.32
                    17    NMPDR                            117067    117064    0.30
                    18    PIR                              110967    101279    0.28
                    19    ProDom                           109281    106447    0.28
                    20    SMART                            104656     79501    0.27
                    21    HSSP                              83910     83910    0.21
                    22    UniGene                           78433     72798    0.20
                    23    HOVERGEN                          75109     75109    0.19
                    24    Ensembl                           66712     65180    0.17
                    25    PIRSF                             58210     58210    0.15
                    26    ArrayExpress                      53103     53103    0.14
                    27    PDBsum                            52185     13136    0.13
                    28    PDB                               52185     13136    0.13
                    29    SMR                               49807     49807    0.13
                    30    GermOnline                        41973     41363    0.11
                    31    TIGR                              31613     30912    0.08
                    32    CleanEx                           30182     29548    0.08
                    33    HGNC                              18843     18702    0.05
                    34    LinkHub                           18105     18105    0.05
                    35    IntAct                            16471     16471    0.04
                    36    PhosphoSite                       15991     15991    0.04
                    37    PharmGKB                          15825     15815    0.04
                    38    MGI                               15680     15629    0.04
                    39    MIM                               15171     12072    0.04
                    40    H-InvDB                           11260      9566    0.03
                    41    DIP                                9000      8950    0.02
                    42    MEROPS                             7206      6910    0.02
                    43    RGD                                6999      6994    0.02
                    44    TAIR                               6998      6884    0.02
                    45    SGD                                6640      6538    0.02
                    46    CYGD                               6628      6523    0.02
                    47    HPA                                5789      4704    0.01
                    48    DrugBank                           5326      1627    0.01
                    49    PeptideAtlas                       5168      5168    0.01
                    50    GeneDB_Spombe                      4460      4419    0.01
                    51    EcoGene                            4331      4328    0.01
                    52    EchoBASE                           4159      4124    0.01
                    53    WormPep                            3884      3180    0.01
                    54    FlyBase                            3692      3564    0.01
                    55    Gramene                            3681      3681    0.01
                    56    WormBase                           3578      3494    0.01
                    57    Reactome                           3416      2069    0.01
                    58    SubtiList                          2819      2818    0.01
                    59    Orphanet                           2633      1673    0.01
                    60    dictyBase                          2568      2478    0.01
                    61    GeneFarm                           2252      2231    0.01
                    62    ZFIN                               2105      2089    0.01
                    63    StyGene                            1653      1649   <0.01
                    64    TubercuList                        1473      1437   <0.01
                    65    SWISS-2DPAGE                       1182      1182   <0.01
                    66    PseudoCAP                          1180      1171   <0.01
                    67    ListiList                          1131      1123   <0.01
                    68    REPRODUCTION-2DPAGE                1029       941   <0.01
                    69    AGD                                 769       763   <0.01
                    70    LegioList                           699       697   <0.01
                    71    PhotoList                           692       692   <0.01
                    72    Leproma                             650       647   <0.01
                    73    PeroxiBase                          503       492   <0.01
                    74    World-2DPAGE                        495       495   <0.01
                    75    CGD                                 471       471   <0.01
                    76    MaizeGDB                            468       463   <0.01
                    77    ProMEX                              423       423   <0.01
                    78    DisProt                             397       394   <0.01
                    79    OGP                                 378       378   <0.01
                    80    SagaList                            373       372   <0.01
                    81    REBASE                              351       343   <0.01
                    82    ECO2DBASE                           351       299   <0.01
                    83    GlycoSuiteDB                        282       282   <0.01
                    84    BuruList                            264       264   <0.01
                    85    PHCI-2DPAGE                         244       244   <0.01
                    86    VectorBase                          236       229   <0.01
                    87    BindingDB                           210       210   <0.01
                    88    MypuList                            198       198   <0.01
                    89    DOSAC-COBS-2DPAGE                   150       150   <0.01
                    90    Aarhus/Ghent-2DPAGE                 126        96   <0.01
                    91    Siena-2DPAGE                        102       102   <0.01
                    92    HSC-2DPAGE                           85        85   <0.01
                    93    2DBase-Ecoli                         84        84   <0.01
                    94    PhosSite                             73        73   <0.01
                    95    Cornea-2DPAGE                        67        67   <0.01
                    96    COMPLUYEAST-2DPAGE                   59        59   <0.01
                    97    euHCVdb                              55        44   <0.01
                    98    PMMA-2DPAGE                          52        52   <0.01
                    99    PptaseDB                             31        31   <0.01
                    100   Rat-heart-2DPAGE                     28        28   <0.01
                    101   ANU-2DPAGE                           22        22   <0.01
                    
                    Number of explicitly cross-referenced databases: 102
                    Number of implicitly cross-referenced databases:  23
                    
                    
                    7.  MISCELLANEOUS STATISTICS
                    
                    Total number of distinct authors cited in UniProtKB/Swiss-Prot: 254724
                    
                    Total number of entries encoded on a Mitochondrion: 4375
                    Total number of entries encoded on a Plasmid: 3430
                    Total number of entries encoded on a Plastid: 9853
                    Total number of entries encoded on a Plastid; Apicoplast: 16
                    Total number of entries encoded on a Plastid; Chloroplast: 9444
                    Total number of entries encoded on a Plastid; Cyanelle: 145
                    Total number of entries encoded on a Plastid; Non-photosynthetic plastid: 118
                    
                    Number of fragments: 8097
                    Number of additional sequences produced by alternative splicing, initiation or promoter usage: 26284
                    
                    
                

UniProtKB/TrEMBL protein database release 39.0 statistics

                    
                    1.  INTRODUCTION
                    
                    Release 39.0 of 22-Jul-2008 of UniProtKB/TrEMBL contains 6'070'085 sequence entries
                    comprising 624'149'168 amino acids.
                    
                    815'041 sequences have been added since release 38, an increase of 15%.
                    The sequence data of 6'451 existing entries has been updated and the
                    annotations of 5'255'044 entries have been revised.
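
The 15% figure is consistent with the number of added sequences measured against the size of the previous release. A short Python sketch of that arithmetic, assuming no entries were removed between releases 38 and 39 (variable names are illustrative):

```python
total_entries = 6_070_085  # entries in UniProtKB/TrEMBL release 39.0
added = 815_041            # sequences added since release 38

# Assumption: release 38 size = current size minus additions (no deletions).
previous_entries = total_entries - added
growth_percent = 100 * added / previous_entries

print(previous_entries)
print(f"{growth_percent:.1f}%")
```

Under this assumption the previous release held 5'255'044 entries, which is the same as the number of entries whose annotations are reported as revised.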
                    
                    
                    
                    2.  AMINO ACID COMPOSITION
                    
                    2.1  Composition in percent for the complete database
                    
                    Ala (A) 8.57   Gln (Q) 3.89   Leu (L) 9.85   Ser (S) 6.77
                    Arg (R) 5.53   Glu (E) 6.06   Lys (K) 5.23   Thr (T) 5.60
                    Asn (N) 4.19   Gly (G) 7.07   Met (M) 2.42   Trp (W) 1.34
                    Asp (D) 5.26   His (H) 2.20   Phe (F) 4.04   Tyr (Y) 3.03
                    Cys (C) 1.33   Ile (I) 5.96   Pro (P) 4.81   Val (V) 6.66
                    
                    Asx (B) 0.000  Glx (Z) 0.000  Xaa (X) 0.07
                    
                    
                    2.2  Classification of the amino acids by their frequency
                    
                    Leu, Ala, Gly, Ser, Val, Glu, Ile, Thr, Arg, Asp, Lys, Pro, Asn, Phe,
                    Gln, Tyr, Met, His, Trp, Cys
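
The ordering in 2.2 follows directly from the percentages in 2.1. As a minimal Python sketch, sorting the twenty standard amino acids by their composition values (copied from the table above) gives the same sequence:

```python
# Composition in percent, copied from section 2.1 (B, Z and X omitted).
composition = {
    "Ala": 8.57, "Gln": 3.89, "Leu": 9.85, "Ser": 6.77,
    "Arg": 5.53, "Glu": 6.06, "Lys": 5.23, "Thr": 5.60,
    "Asn": 4.19, "Gly": 7.07, "Met": 2.42, "Trp": 1.34,
    "Asp": 5.26, "His": 2.20, "Phe": 4.04, "Tyr": 3.03,
    "Cys": 1.33, "Ile": 5.96, "Pro": 4.81, "Val": 6.66,
}

# Most frequent first; there are no ties in this release.
ranked = sorted(composition, key=composition.get, reverse=True)
print(", ".join(ranked))
```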
                    
                    
                    3.  TAXONOMIC ORIGIN
                    
                    Total number of species represented in this release of UniProtKB/TrEMBL: 170489
                    
                    The first twenty species represent 954002 sequences:  15.7 % of the
                    total number of entries.
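
The two figures above can be checked against the first twenty rows of the table in 3.2. A small Python sketch, with the counts copied from that table and the release total taken from the introduction:

```python
# Frequencies of the twenty most represented species (section 3.2, rows 1-20).
top20_counts = [
    238675, 95231, 54861, 54323, 50188, 44675, 44524, 42163, 39808, 39254,
    35653, 28243, 28067, 27250, 24942, 24842, 20534, 20490, 20180, 20099,
]
total_entries = 6_070_085  # entries in UniProtKB/TrEMBL release 39.0

top20_total = sum(top20_counts)
share_percent = 100 * top20_total / total_entries

print(top20_total)
print(f"{share_percent:.1f} %")
```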
                    
                    
                    3.1 Table of the frequency of occurrence of species
                    
                    Species represented 1x: 78172
                                        2x: 30985
                                        3x: 16319
                                        4x:  9281
                                        5x:  5325
                                        6x:  3971
                                        7x:  2943
                                        8x:  2395
                                        9x:  1875
                                       10x:  2217
                                   11- 20x:  9774
                                   21- 50x:  3492
                                   51-100x:  1416
                                     >100x:  2324
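
As a consistency check, the bins of this table should add up to the total number of species given at the start of section 3. A minimal Python sketch (the dictionary keys are just labels chosen for illustration):

```python
# Number of species by how many sequences represent them (section 3.1).
species_by_occurrence = {
    "1x": 78172, "2x": 30985, "3x": 16319, "4x": 9281, "5x": 5325,
    "6x": 3971, "7x": 2943, "8x": 2395, "9x": 1875, "10x": 2217,
    "11-20x": 9774, "21-50x": 3492, "51-100x": 1416, ">100x": 2324,
}
total_species = 170_489  # stated at the start of section 3

bin_sum = sum(species_by_occurrence.values())
singleton_share = 100 * species_by_occurrence["1x"] / total_species

print(bin_sum == total_species)
print(f"{singleton_share:.0f}% of species are represented by a single sequence")
```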
                    
                    
                    
                    3.2  Table of the most represented species
                    
                    ------  ---------  --------------------------------------------
                    Number  Frequency  Species
                    ------  ---------  --------------------------------------------
                    1     238675  Human immunodeficiency virus 1
                    2      95231  Oryza sativa subsp. japonica (Rice)
                    3      54861  Homo sapiens (Human)
                    4      54323  Vitis vinifera (Grape)
                    5      50188  Trichomonas vaginalis G3
                    6      44675  Mus musculus (Mouse)
                    7      44524  Arabidopsis thaliana (Mouse-ear cress)
                    8      42163  Hepatitis C virus
                    9      39808  Paramecium tetraurelia
                    10      39254  Oryza sativa subsp. indica (Rice)
                    11      35653  Physcomitrella patens subsp. patens
                    12      28243  Drosophila melanogaster (Fruit fly)
                    13      28067  Tetraodon nigroviridis (Green puffer)
                    14      27250  uncultured bacterium
                    15      24942  Danio rerio (Zebrafish) (Brachydanio rerio)
                    16      24842  Nematostella vectensis (Starlet sea anemone)
                    17      20534  Caenorhabditis elegans
                    18      20490  Trypanosoma cruzi
                    19      20180  Culex quinquefasciatus (Southern house mosquito)
                    20      20099  Hepatitis B virus (HBV)
                    21      19172  Caenorhabditis briggsae
                    22      17883  Laccaria bicolor (strain S238N-H82) (Bicoloured deceiver) 
                    23      16803  Aedes aegypti (Yellowfever mosquito)
                    24      16685  Tetrahymena thermophila SB210
                    25      16302  Botryotinia fuckeliana (strain B05.10) (Noble rot fungus) (Botrytis cinerea)
                    26      15880  Phaeosphaeria nodorum (Septoria nodorum)
                    27      14718  Chlamydomonas reinhardtii
                    28      14679  Plasmodium chabaudi
                    29      14325  Sclerotinia sclerotiorum (strain ATCC 18683 / 1980 / Ss-1) (White mold) 
                    30      14158  Anopheles gambiae (African malaria mosquito)
                    31      14036  Aspergillus niger
                    32      13492  Coprinopsis cinerea (strain Okayama-7 / 130 / FGSC 9003) (Inky cap fungus) 
                    33      12757  Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea)
                    34      12419  Xenopus laevis (African clawed frog)
                    35      12062  Pyrenophora tritici-repentis Pt-1C-BFP
                    36      11941  Aspergillus oryzae
                    37      11788  Plasmodium berghei
                    38      11698  Dictyostelium discoideum (Slime mold)
                    39      11570  Brugia malayi (Filarial nematode worm)
                    40      10914  Chaetomium globosum (Soil fungus)
                    41      10714  Podospora anserina
                    42      10426  Neurospora crassa
                    43      10323  Coccidioides immitis
                    44      10318  Hepatitis C virus subtype 1b
                    45      10267  Aspergillus terreus (strain NIH 2624)
                    46      10262  Neosartorya fischeri  (Aspergillus fischerianus 
                    47      10040  Escherichia coli
                    48       9990  Drosophila pseudoobscura (Fruit fly)
                    49       9905  Aspergillus fumigatus (strain CEA10 / CBS 144.89 / FGSC A1163) 
                    50       9896  Bos taurus (Bovine)
                    51       9834  Schistosoma japonicum (Blood fluke)
                    52       9799  Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
                    53       9673  Cryptococcus neoformans (Filobasidiella neoformans)
                    54       9650  Aspergillus fumigatus (Sartorya fumigata)
                    55       9469  Trypanosoma brucei
                    56       9456  Emericella nidulans (Aspergillus nidulans)
                    57       9287  Candida albicans (Yeast)
                    58       9227  Monosiga brevicollis (Choanoflagellate)
                    59       9203  Ajellomyces capsulata (strain NAm1 / WU24) (Darling's disease fungus) 
                    60       9201  Sorangium cellulosum (strain So ce56) (Polyangium cellulosum (strain So ce56))
                    61       8983  Aspergillus clavatus
                    62       8826  Rhodococcus sp. (strain RHA1)
                    63       8781  Rattus norvegicus (Rat)
                    64       8607  Entamoeba dispar SAW760
                    65       8603  Methylobacterium nodulans ORS 2060
                    66       8513  Stigmatella aurantiaca DW4/3-1
                    67       8475  Simian immunodeficiency virus (isolate CPZ GAB1) (SIV-cpz) 
                    68       8437  Plesiocystis pacifica SIR-1
                    69       8398  Helicobacter pylori (Campylobacter pylori)
                    70       8249  Microscilla marina ATCC 23134
                    71       8205  Burkholderia xenovorans (strain LB400)
                    72       8129  Bradyrhizobium japonicum
                    73       8027  Leishmania infantum
                    74       7970  Ostreococcus tauri
                    75       7935  Acaryochloris marina (strain MBIC 11017)
                    76       7887  Leishmania braziliensis
                    77       7810  Plasmodium yoelii yoelii
                    78       7642  Pseudomonas aeruginosa
                    79       7575  Solibacter usitatus (strain Ellin6076)
                    80       7514  Plasmodium vivax
                    81       7503  Streptomyces coelicolor
                    82       7501  Rhizobium leguminosarum bv. trifolii WSM1325
                    83       7463  Burkholderia phymatum (strain DSM 17167 / STM815)
                    84       7463  Plasmodium falciparum
                    85       7401  Ostreococcus lucimarinus (strain CCE9901)
                    86       7349  Burkholderia pseudomallei 305
                    87       7293  Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182)
                    88       7292  Burkholderia sp. (strain 383) (Burkholderia cepacia 
                    89       7274  Clostridium bolteae ATCC BAA-613
                    90       7267  Streptomyces avermitilis
                    91       7221  Burkholderia multivorans (strain ATCC 17616 / 249)
                    92       7197  Burkholderia phytofirmans (strain DSM 17436 / PsJN)
                    93       7136  Rhizobium loti (Mesorhizobium loti)
                    94       7132  Frankia sp. (strain EAN1pec)
                    95       7124  Burkholderia ambifaria MEX-5
                    96       7122  Leishmania major
                    97       7081  Burkholderia vietnamiensis (strain G4 / LMG 22486) (Burkholderia cepacia 
                    98       7061  Myxococcus xanthus (strain DK 1622)
                    99       7005  Streptomyces griseus subsp. griseus (strain JCM 4626 / NBRC 13350)
                    100       6981  Burkholderia cenocepacia (strain MC0-3)
                    
                    
                    3.3  Taxonomic distribution of the sequences
                    
                    
                    Kingdom        sequences (% of the database)
                    Archaea          117313 (  2%)
                    Bacteria        3404071 ( 56%)
                    Eukaryota       1895580 ( 31%)
                    Viruses          648091 ( 11%)
                    Other              5029 ( <1%)
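
The percentages in this table can be reproduced by dividing each count by the release total and rounding to a whole percent. The Python sketch below assumes that rounding rule and assumes that shares which would round to zero are printed as "<1%". The helper name is hypothetical.

```python
kingdoms = {
    "Archaea": 117313,
    "Bacteria": 3404071,
    "Eukaryota": 1895580,
    "Viruses": 648091,
    "Other": 5029,
}
total_entries = 6_070_085  # entries in UniProtKB/TrEMBL release 39.0


def percent_label(count: int, total: int) -> str:
    """Whole-percent share; assumed to print '<1%' when it would round to 0."""
    pct = 100 * count / total
    return "<1%" if pct < 0.5 else f"{round(pct)}%"


for name, count in kingdoms.items():
    print(f"{name:<10} {count:>8} ({percent_label(count, total_entries)})")
```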
                    
                    
                    
                    Within Eukaryota:
                    
                    
                    Category            sequences (% of Eukaryota) (% of the complete database)
                    Human                  54862 (  3%)           (  1%)
                    Other Mammalia        135932 (  7%)           (  2%)
                    Other Vertebrata      211928 ( 11%)           (  3%)
                    Viridiplantae         484409 ( 26%)           (  8%)
                    Fungi                 361583 ( 19%)           (  6%)
                    Insecta               190201 ( 10%)           (  3%)
                    Nematoda               56736 (  3%)           (  1%)
                    Other                 399929 ( 21%)           (  7%)
                    
                    
                    
                    4.  SEQUENCE SIZE
                    
                    Distribution of the sequences by size (excluding fragments)
                    
                    From   To  Number             From   To   Number
                    1-  50  208566             1001-1100    40261
                    51- 100  637056             1101-1200    27366
                    101- 150  747961             1201-1300    18801
                    151- 200  699159             1301-1400    12801
                    201- 250  674548             1401-1500    10254
                    251- 300  573450             1501-1600     7455
                    301- 350  535981             1601-1700     5842
                    351- 400  410656             1701-1800     4650
                    401- 450  347643             1801-1900     3538
                    451- 500  284281             1901-2000     3059
                    501- 550  190417             2001-2100     2455
                    551- 600  141809             2101-2200     2451
                    601- 650  103668             2201-2300     1886
                    651- 700   83899             2301-2400     1594
                    701- 750   71389             2401-2500     1362
                    751- 800   62475             >2500        11470
                    801- 850   46437
                    851- 900   42027
                    901- 950   29461
                    951-1000   23945
                    
                    
                    
                    
                    The average sequence length in UniProtKB/TrEMBL is 322 amino acids.
                    
                    The shortest sequence is Q16047_HUMAN:     4 amino acids.
                    The longest sequence is  Q3ASY8_CHLCH: 36805 amino acids.
                    
                    
                    
                    5.  STATISTICS FOR SOME LINE TYPES
                    
                    The following table summarizes the total number of some UniProtKB/TrEMBL lines,
                    as well as the number of entries with at least one such line, and the
                    frequency of the lines.
                    
                                                       Total    Number of  Average
                    Line type / subtype                number   entries    per entry
                    ---------------------------------  -------- ---------  ---------
                    
                    References (RL)                    7640409              1.26
                    Submitted to EMBL/GenBank/DDBJ  4154948   3514648    0.68
                    Journal                         3352515   3093695    0.55
                    Thesis                             6880      6824   <0.01
                    Book citation                      4356      4312   <0.01
                    Submitted to other databases       3480      3473   <0.01
                    Other                            118230    116766    0.02
                    
                    Comments (CC)                      4393499              0.72
                    SIMILARITY                      1358723   1235686    0.22
                    CAUTION                         1317857   1317857    0.22
                    CATALYTIC ACTIVITY               449239    383227    0.07
                    FUNCTION                         442937    425981    0.07
                    SUBCELLULAR LOCATION             362423    362395    0.06
                    PATHWAY                          163108    149477    0.03
                    SUBUNIT                          149694    148693    0.02
                    COFACTOR                         138898    136672    0.02
                    MISCELLANEOUS                      5726      5726   <0.01
                    INTERACTION                        4295      4295   <0.01
                    DOMAIN                              599       599   <0.01
                    
                    Features (FT)                      2546720              0.42
                    NON_TER                         2098331   1246419    0.35
                    CHAIN                            276320    220896    0.05
                    SIGNAL                           171508    171508    0.03
                    TRANSIT                             561       561   <0.01
                    
                    Cross-references (DR)             56036818              9.23
                    GO                             11116789   3570918    1.83
                    InterPro                        9220668   4167387    1.52
                    EMBL                            6851513   6062637    1.13
                    Pfam                            5198497   3844857    0.86
                    RefSeq                          2971744   2875731    0.49
                    GeneID                          2957505   2869408    0.49
                    PROSITE                         2842142   1869467    0.47
                    KEGG                            1860398   1795530    0.31
                    Gene3D                          1757519   1506810    0.29
                    GenomeReviews                   1596334   1546434    0.26
                    PRINTS                          1089660    917737    0.18
                    HOGENOM                         1061239   1061235    0.17
                    SMART                           1008925    792054    0.17
                    NMPDR                            957022    957011    0.16
                    TIGRFAMs                         939056    858853    0.15
                    PANTHER                          905430    859383    0.15
                    ProDom                           700180    668524    0.12
                    SMR                              494328    494243    0.08
                    HOVERGEN                         316989    316798    0.05
                    BioCyc                           304168    291467    0.05
                    UniGene                          275255    251204    0.05
                    PIRSF                            262205    262205    0.04
                    HSSP                             261663    261371    0.04
                    TIGR                             198869    191592    0.03
                    PIR                              182023    149002    0.03
                    Ensembl                          157982    151190    0.03
                    ArrayExpress                     100469    100437    0.02
                    Gramene                           69959     69959    0.01
                    euHCVdb                           47728     47728    0.01
                    MGI                               40387     40202    0.01
                    FlyBase                           34972     34832    0.01
                    HGNC                              29172     29143   <0.01
                    VectorBase                        29057     28725   <0.01
                    MEROPS                            26390     25729   <0.01
                    TAIR                              19447     19396   <0.01
                    WormPep                           19423     19320   <0.01
                    WormBase                          19414     19320   <0.01
                    ZFIN                              16063     16056   <0.01
                    LinkHub                           12019     12019   <0.01
                    dictyBase                         10181     10179   <0.01
                    CGD                                6987      6987   <0.01
                    RGD                                5924      4066   <0.01
                    PDBsum                             5860      3334   <0.01
                    PDB                                5860      3334   <0.01
                    IntAct                             5461      5460   <0.01
                    LegioList                          5399      5369   <0.01
                    ListiList                          4684      4667   <0.01
                    PseudoCAP                          4390      4387   <0.01
                    PhotoList                          3988      3864   <0.01
                    BuruList                           3976      3942   <0.01
                    AGD                                3925      3925   <0.01
                    REBASE                             3685      3660   <0.01
                    TubercuList                        2517      2511   <0.01
                    DIP                                2276      2271   <0.01
                    PeroxiBase                         2082      2076   <0.01
                    SagaList                           1721      1627   <0.01
                    PhosphoSite                        1404      1404   <0.01
                    Leproma                             957       956   <0.01
                    MypuList                            584       580   <0.01
                    GeneDB_Spombe                       515       510   <0.01
                    ProMEX                              483       483   <0.01
                    World-2DPAGE                        418       418   <0.01
                    SGD                                 327       327   <0.01
                    PeptideAtlas                        194       194   <0.01
                    PharmGKB                            121       121   <0.01
                    PHCI-2DPAGE                         103       103   <0.01
                    Reactome                             67        62   <0.01
                    ANU-2DPAGE                           59        59   <0.01
                    SWISS-2DPAGE                         29        29   <0.01
                    REPRODUCTION-2DPAGE                  16        16   <0.01
                    CYGD                                 16        16   <0.01
                    PMMA-2DPAGE                           3         3   <0.01
                    Siena-2DPAGE                          2         2   <0.01
                    COMPLUYEAST-2DPAGE                    1         1   <0.01
                    
                    Number of explicitly cross-referenced databases: 102
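
The "Average per entry" column above appears to be the total number of lines divided by the number of all UniProtKB/TrEMBL entries. A complementary figure, the number of lines per entry that has at least one such line, can be derived from the first two numeric columns. The following Python sketch is illustrative only: the regular expression and the function names parse_row and lines_per_annotated_entry are assumptions made for this example and are not part of any UniProt tool. It expects rows in which the subtype name is separated from the numbers by whitespace.

```python
import re

# Assumed row layout: subtype name, total lines, entries with such a line,
# average per entry (possibly written as "<0.01").
ROW = re.compile(
    r"^\s*(?P<name>\S.*?)\s+(?P<total>\d+)\s+(?P<entries>\d+)\s+(?P<avg><?\d+\.\d+)\s*$"
)


def parse_row(line):
    """Return (name, total, entries) for a subtype row, or None otherwise.

    Category rows such as "References (RL)" carry no entry count and
    therefore do not match.
    """
    m = ROW.match(line)
    if m is None:
        return None
    return m.group("name"), int(m.group("total")), int(m.group("entries"))


def lines_per_annotated_entry(line):
    """Return (name, total / entries) for a subtype row."""
    name, total, entries = parse_row(line)
    return name, total / entries


# Three rows copied from the table above.
sample = [
    "GO                             11116789   3570918    1.83",
    "InterPro                        9220668   4167387    1.52",
    "Thesis                             6880      6824   <0.01",
]

for row in sample:
    name, ratio = lines_per_annotated_entry(row)
    print(f"{name}: {ratio:.2f} lines per annotated entry")
```

A ratio close to 1 means that annotated entries usually carry a single line of that type, while a larger ratio, as for GO, means that annotated entries tend to carry several.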
                    
                    
                    6.  MISCELLANEOUS STATISTICS
                    
                    Total number of distinct authors cited in UniProtKB/TrEMBL: 266582
                    
                    Total number of entries encoded on a Mitochondrion: 213912
                    Total number of entries encoded on a Plasmid: 97022
                    Total number of entries encoded on a Plastid: 4959
                    Total number of entries encoded on a Plastid; Apicoplast: 264
                    Total number of entries encoded on a Plastid; Chloroplast: 74165
                    Total number of entries encoded on a Plastid; Cyanelle: 7
                    Total number of entries encoded on a Plastid; Non-photosynthetic plastid: 237
                    
                    Number of fragments: 1245645
                    
                

Submissions and Updates

We welcome feedback from our users. We would especially appreciate your notifying us if you find that sequences belonging to your field of expertise are missing from the database. We also would like to be notified about annotations to be updated, if, for example, the function of a protein has been clarified or if new information about post-translational modifications has become available.

Submit new sequence data, updates and corrections at http://www.uniprot.org/support/submissions.shtml

For all queries regarding submissions to UniProtKB and to submit new protein sequence data, please contact:

UniProt Knowledgebase
The EMBL Outstation - The European Bioinformatics Institute
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
United Kingdom

Telephone: (+44 1223) 494 462
Telefax: (+44 1223) 494 468
E-mail:


Download information

Minor releases (every 3 weeks)

The latest data of the UniProt Knowledgebase is available in various formats (flatfile, XML or FASTA) at http://www.uniprot.org/database/download.shtml. The data is further supplemented by a file containing the sequences of all additional alternative isoforms annotated in UniProtKB/Swiss-Prot. This data set is documented in the file ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete/README.varsplic

Major releases

For users who wish to download the UniProt Knowledgebase only occasionally, we distribute the latest major release (updated 3 times per year) in flatfile format. Previous releases of UniProtKB/Swiss-Prot and UniProtKB/TrEMBL are archived under ftp://ftp.uniprot.org/pub/databases/uniprot/previous_major_releases. The UniProt Knowledgebase major release is also available on DVD from the EBI.


Contact

EMBL Outstation
European Bioinformatics Institute (EBI)
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
United Kingdom

Telephone: (+44 1223) 494 444
Fax: (+44 1223) 494 468
Electronic mail address: /
WWW server: http://www.ebi.ac.uk/


SIB Swiss Institute of Bioinformatics
Centre Medical Universitaire
1, rue Michel Servet
1211 Geneva 4
Switzerland

Telephone: (+41 22) 379 50 50
Fax: (+41 22) 379 58 58
Electronic mail address:
WWW server: http://www.expasy.org/


Protein Information Resource (PIR)
Georgetown University Medical Center
3300 Whitehaven St., Suite 1200
Washington, DC 20008
United States of America

Telephone: (+1 202) 687 1039
Fax: (+1 202) 687 0057
Electronic mail address:
WWW server: http://pir.georgetown.edu

Citation

If you want to cite UniProt in a publication, please use the following reference:

The UniProt Consortium
"The Universal Protein Resource (UniProt)"
Nucleic Acids Res. 36:D190-D195(2008) doi:10.1093/nar/gkm895