Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
         UniProtKB/Swiss-Prot protein knowledgebase release 2010_11 statistics
                    
                    
                    1.  INTRODUCTION
                    
                    Release 2010_11 of 02-Nov-10 of UniProtKB/Swiss-Prot contains 522019 sequence entries,
                    comprising 184241293 amino acids abstracted from 192744 references. 
                    
                    1012 sequences have been added since release 2010_10, the sequence data of
                    195 existing entries has been updated and the annotations of
                    174873 entries have been revised.
                    
                    Number of fragments: 8819
                    Number of additional sequences produced by alternative splicing, initiation or promoter usage, or ribosomal frameshifting: 29562
                    
                    
                    Protein existence (PE):           entries     %
                    
                    1: Evidence at protein level        71345   13.7%
                    2: Evidence at transcript level     67601   12.9%
                    3: Inferred from homology          367123   70.3%
                    4: Predicted                        14290    2.7%
                    5: Uncertain                         1660    0.3%
                    
                    The growth of the database is summarized below.
                    
                    
                    
                    
                    2.  TAXONOMIC ORIGIN
                    
                    Total number of species represented in this release of UniProtKB/Swiss-Prot: 12265
                    
                    The first twenty species represent 108879 sequences:  20.9 % of the total
                    number of entries.
                    
                    
                    2.1 Table of the frequency of occurrence of species
                    
                    Species represented 1x: 5274
                    2x: 1748
                    3x:  910
                    4x:  604
                    5x:  435
                    6x:  352
                    7x:  251
                    8x:  210
                    9x:  191
                    10x:  113
                    11- 20x:  596
                    21- 50x:  383
                    51-100x:  176
                    >100x: 1022
                    
                    
                    2.2  Table of the most represented species
                    
                    ------  ---------  --------------------------------------------
                    Number  Frequency  Species
                    ------  ---------  --------------------------------------------
                    1      20259  Homo sapiens (Human)
                    2      16320  Mus musculus (Mouse)
                    3       9590  Arabidopsis thaliana (Mouse-ear cress)
                    4       7551  Rattus norvegicus (Rat)
                    5       6579  Saccharomyces cerevisiae (Baker's yeast)
                    6       5795  Bos taurus (Bovine)
                    7       4976  Schizosaccharomyces pombe (Fission yeast)
                    8       4429  Escherichia coli (strain K12)
                    9       4254  Bacillus subtilis
                    10       4252  Dictyostelium discoideum (Slime mold)
                    11       3309  Caenorhabditis elegans
                    12       3273  Xenopus laevis (African clawed frog)
                    13       3091  Drosophila melanogaster (Fruit fly)
                    14       2683  Danio rerio (Zebrafish) (Brachydanio rerio)
                    15       2581  Oryza sativa subsp. japonica (Rice)
                    16       2210  Pongo abelii (Sumatran orangutan)
                    17       2179  Gallus gallus (Chicken)
                    18       1993  Escherichia coli O157:H7
                    19       1782  Methanocaldococcus jannaschii (Methanococcus jannaschii)
                    20       1773  Salmonella typhimurium
                    21       1773  Haemophilus influenzae
                    22       1714  Mycobacterium tuberculosis
                    23       1669  Shigella flexneri
                    24       1667  Escherichia coli O6
                    25       1566  Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
                    26       1375  Sus scrofa (Pig)
                    27       1342  Salmonella typhi
                    28       1283  Pseudomonas aeruginosa
                    29       1223  Mycobacterium bovis
                    30       1161  Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey)
                    31       1018  Synechocystis sp. (strain PCC 6803)
                    32        997  Yersinia pestis
                    33        993  Archaeoglobus fulgidus
                    34        942  Vibrio cholerae
                    35        929  Salmonella paratyphi A
                    36        924  Staphylococcus aureus (strain N315)
                    37        923  Staphylococcus aureus (strain Mu50 / ATCC 700699)
                    38        913  Rhizobium meliloti (Sinorhizobium meliloti)
                    39        909  Acanthamoeba polyphaga mimivirus (APMV)
                    40        897  Staphylococcus aureus (strain COL)
                    41        895  Staphylococcus aureus (strain MW2)
                    42        889  Staphylococcus aureus (strain MSSA476)
                    43        886  Staphylococcus aureus (strain MRSA252)
                    44        882  Oryctolagus cuniculus (Rabbit)
                    45        881  Salmonella choleraesuis
                    46        879  Escherichia coli O6:K15:H31 (strain 536 / UPEC)
                    47        871  Shigella sonnei (strain Ss046)
                    48        864  Yersinia pseudotuberculosis
                    49        845  Ashbya gossypii (Yeast) (Eremothecium gossypii)
                    50        835  Escherichia coli O9:H4 (strain HS)
                    51        829  Escherichia coli O139:H28 (strain E24377A / ETEC)
                    52        825  Shigella boydii serotype 4 (strain Sb227)
                    53        820  Kluyveromyces lactis (Yeast) (Candida sphaerica)
                    54        817  Escherichia coli (strain UTI89 / UPEC)
                    55        814  Escherichia coli (strain ATCC 8739 / DSM 1576 / Crooks)
                    56        813  Candida albicans (Yeast)
                    57        804  Shigella dysenteriae serotype 1 (strain Sd197)
                    58        794  Vibrio parahaemolyticus
                    59        785  Escherichia coli (strain SMS-3-5 / SECEC)
                    60        781  Neurospora crassa
                    61        778  Candida glabrata (Yeast) (Torulopsis glabrata)
                    62        778  Erwinia carotovora subsp. atroseptica (Pectobacterium atrosepticum)
                    63        777  Pasteurella multocida
                    64        773  Aquifex aeolicus
                    65        765  Canis familiaris (Dog) (Canis lupus familiaris)
                    66        765  Escherichia coli (strain K12 / DH10B)
                    67        759  Escherichia coli O127:H6 (strain E2348/69 / EPEC)
                    68        759  Escherichia coli (strain K12 / MC4100 / BW2952)
                    69        757  Escherichia coli O17:K52:H18 (strain UMN026 / ExPEC)
                    70        757  Escherichia coli (strain 55989 / EAEC)
                    71        757  Staphylococcus epidermidis (strain ATCC 35984 / RP62A)
                    72        756  Escherichia coli O8 (strain IAI1)
                    73        756  Staphylococcus epidermidis (strain ATCC 12228)
                    74        751  Escherichia coli O45:K1 (strain S88 / ExPEC)
                    75        750  Escherichia coli (strain SE11)
                    76        750  Shigella flexneri serotype 5b (strain 8401)
                    77        748  Escherichia coli O7:K1 (strain IAI39 / ExPEC)
                    78        746  Streptomyces coelicolor
                    79        742  Escherichia coli O157:H7 (strain EC4115 / EHEC)
                    80        738  Photorhabdus luminescens subsp. laumondii
                    81        731  Bacillus halodurans
                    82        731  Vibrio vulnificus
                    83        726  Escherichia coli O81 (strain ED1a)
                    84        725  Yersinia enterocolitica serotype O:8 / biotype 1B (strain 8081)
                    85        724  Bacillus anthracis
                    86        720  Salmonella enteritidis PT4 (strain P125109)
                    87        716  Emericella nidulans (Aspergillus nidulans)
                    88        715  Vibrio vulnificus (strain YJ016)
                    89        715  Salmonella paratyphi B (strain ATCC BAA-1250 / SPB7)
                    90        713  Yersinia pestis bv. Antiqua (strain Nepal516)
                    91        713  Salmonella paratyphi A (strain AKU_12601)
                    92        712  Staphylococcus aureus (strain NCTC 8325)
                    93        712  Yersinia pseudotuberculosis serotype O:1b (strain IP 31758)
                    94        711  Salmonella agona (strain SL483)
                    95        711  Salmonella newport (strain SL254)
                    96        710  Salmonella heidelberg (strain SL476)
                    97        709  Yersinia pestis bv. Antiqua (strain Antiqua)
                    98        709  Salmonella schwarzengrund (strain CVM19633)
                    99        706  Escherichia coli O1:K1 / APEC
                    100        700  Salmonella dublin (strain CT_02021853)
                    101        699  Enterobacter sp. (strain 638)
                    102        698  Klebsiella pneumoniae subsp. pneumoniae (strain ATCC 700721 / MGH 78578)
                    103        697  Shigella boydii serotype 18 (strain CDC 3083-94 / BS512)
                    104        687  Mycoplasma pneumoniae
                    105        686  Pan troglodytes (Chimpanzee)
                    106        686  Escherichia fergusonii (strain ATCC 35469 / DSM 13698 / CDC 0568-73)
                    107        686  Klebsiella pneumoniae (strain 342)
                    108        684  Pseudomonas syringae pv. tomato
                    109        682  Salmonella gallinarum (strain 287/91 / NCTC 13346)
                    110        678  Anabaena sp. (strain PCC 7120)
                    111        671  Pseudomonas putida (strain KT2440)
                    112        668  Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696)
                    113        667  Mycobacterium leprae
                    114        666  Staphylococcus aureus (strain USA300)
                    115        666  Yersinia pestis (strain Pestoides F)
                    116        658  Rhizobium sp. (strain NGR234)
                    117        656  Zea mays (Maize)
                    118        655  Serratia proteamaculans (strain 568)
                    119        645  Bradyrhizobium japonicum
                    120        643  Escherichia coli
                    121        642  Staphylococcus aureus (strain bovine RF122 / ET3-1)
                    122        638  Bacillus cereus (strain ATCC 14579 / DSM 31)
                    123        637  Yersinia pseudotuberculosis serotype O:3 (strain YPIII)
                    124        635  Salmonella arizonae (strain ATCC BAA-731 / CDC346-86 / RSK2980)
                    125        633  Yersinia pseudotuberculosis serotype IB (strain PB1/+)
                    126        621  Shewanella oneidensis
                    127        619  Agrobacterium tumefaciens (strain C58 / ATCC 33970)
                    128        615  Treponema pallidum
                    129        614  Ralstonia solanacearum (Pseudomonas solanacearum)
                    130        610  Debaryomyces hansenii (Yeast) (Torulaspora hansenii)
                    131        609  Staphylococcus haemolyticus (strain JCSC1435)
                    132        608  Enterobacter sakazakii (strain ATCC BAA-894)
                    133        603  Rhizobium loti (Mesorhizobium loti)
                    134        603  Yarrowia lipolytica (Candida lipolytica)
                    135        602  Methanobacterium thermoautotrophicum
                    136        602  Staphylococcus saprophyticus subsp. saprophyticus 
                    137        600  Salmonella paratyphi C (strain RKS4594)
                    138        598  Yersinia pestis bv. Antiqua (strain Angola)
                    139        596  Photobacterium profundum (Photobacterium sp. (strain SS9))
                    140        596  Listeria monocytogenes
                    141        590  Bacillus cereus (strain ATCC 10987)
                    142        590  Xanthomonas campestris pv. campestris
                    143        588  Listeria innocua
                    144        585  Rickettsia prowazekii
                    145        584  Helicobacter pylori (Campylobacter pylori)
                    146        584  Pectobacterium carotovorum subsp. carotovorum (strain PC1)
                    147        581  Lactococcus lactis subsp. lactis (Streptococcus lactis)
                    148        581  Aspergillus fumigatus (Sartorya fumigata)
                    149        579  Neisseria meningitidis serogroup B
                    150        576  Brucella suis
                    151        572  Brucella melitensis
                    152        572  Buchnera aphidicola subsp. Acyrthosiphon pisum 
                    153        569  Bacillus thuringiensis subsp. konkukian
                    154        565  Helicobacter pylori J99 (Campylobacter pylori J99)
                    155        562  Buchnera aphidicola subsp. Schizaphis graminum
                    156        560  Bacillus cereus (strain ZK / E33L)
                    157        560  Pseudomonas syringae pv. syringae (strain B728a)
                    158        557  Bacillus licheniformis (strain DSM 13 / ATCC 14580)
                    159        557  Pseudomonas aeruginosa (strain UCBPP-PA14)
                    160        556  Neisseria meningitidis serogroup A
                    161        555  Xanthomonas axonopodis pv. citri (Citrus canker)
                    162        553  Vibrio fischeri (strain ATCC 700601 / ES114)
                    163        551  Pseudomonas fluorescens (strain Pf0-1)
                    164        549  Oceanobacillus iheyensis
                    165        547  Clostridium acetobutylicum
                    166        545  Caulobacter crescentus (Caulobacter vibrioides)
                    167        545  Pseudomonas fluorescens (strain Pf-5 / ATCC BAA-477)
                    168        538  Pseudomonas syringae pv. phaseolicola (strain 1448A / Race 6)
                    169        535  Caenorhabditis briggsae
                    170        529  Listeria monocytogenes serotype 4b (strain F2365)
                    171        524  Erwinia tasmaniensis (strain DSM 17950 / Et1/99)
                    172        522  Sodalis glossinidius (strain morsitans)
                    173        522  Xylella fastidiosa
                    174        521  Bordetella bronchiseptica (Alcaligenes bronchisepticus)
                    175        521  Oryza sativa subsp. indica (Rice)
                    176        519  Streptococcus pneumoniae
                    177        513  Chromobacterium violaceum
                    178        512  Xylella fastidiosa (strain Temecula1 / ATCC 700964)
                    179        511  Thermotoga maritima
                    180        509  Vibrio cholerae serotype O1 (strain ATCC 39541 / Ogawa 395 / O395)
                    181        507  Bordetella parapertussis
                    182        507  Buchnera aphidicola subsp. Baizongia pistaciae (strain Bp)
                    183        507  Pseudomonas aeruginosa (strain PA7)
                    184        506  Bordetella pertussis
                    185        505  Staphylococcus aureus (strain Newman)
                    186        504  Haemophilus ducreyi
                    187        504  Geobacillus kaustophilus
                    188        500  Pseudomonas entomophila (strain L48)
                    189        498  Deinococcus radiodurans
                    190        498  Brucella abortus
                    191        497  Rickettsia conorii
                    192        496  Bacillus clausii (strain KSM-K16)
                    193        492  Haemophilus influenzae (strain 86-028NP)
                    194        492  Streptomyces avermitilis
                    195        491  Corynebacterium glutamicum (Brevibacterium flavum)
                    196        490  Xanthomonas campestris pv. campestris (strain 8004)
                    197        490  Vibrio harveyi (strain ATCC BAA-1116 / BB120)
                    198        490  Clostridium perfringens
                    199        488  Bacillus amyloliquefaciens (strain FZB42)
                    200        487  Burkholderia pseudomallei (Pseudomonas pseudomallei)
                    201        487  Shewanella sp. (strain MR-7)
                    202        484  Pseudomonas aeruginosa (strain LESB58)
                    203        484  Staphylococcus aureus (strain Mu3 / ATCC 700698)
                    204        484  Shewanella sp. (strain MR-4)
                    205        483  Mannheimia succiniciproducens (strain MBEL55E)
                    206        483  Mycoplasma genitalium
                    207        481  Proteus mirabilis (strain HI4320)
                    208        481  Methanosarcina acetivorans
                    209        475  Synechococcus elongatus (strain PCC 7942) (Anacystis nidulans R2)
                    210        473  Pseudomonas putida (strain F1 / ATCC 700007)
                    211        472  Burkholderia sp. (strain 383) (Burkholderia cepacia 
                    212        472  Brucella abortus (strain 2308)
                    213        472  Thermosynechococcus elongatus (strain BP-1)
                    214        469  Acinetobacter sp. (strain ADP1)
                    215        469  Enterococcus faecalis (Streptococcus faecalis)
                    216        466  Pyrococcus horikoshii
                    217        465  Rhodopseudomonas palustris
                    218        465  Xanthomonas campestris pv. vesicatoria (strain 85-10)
                    219        465  Pseudomonas putida (strain GB-1)
                    220        464  Shewanella frigidimarina (strain NCIMB 400)
                    221        462  Anabaena variabilis (strain ATCC 29413 / PCC 7937)
                    222        462  Shewanella sp. (strain ANA-3)
                    223        461  Burkholderia mallei (Pseudomonas mallei)
                    224        461  Lactobacillus plantarum
                    225        460  Ralstonia eutropha  (Cupriavidus necator 
                    226        459  Methanosarcina mazei (Methanosarcina frisia)
                    227        458  Pyrococcus abyssi
                    228        457  Streptococcus pneumoniae (strain ATCC BAA-255 / R6)
                    229        457  Ralstonia eutropha (strain JMP134) (Alcaligenes eutrophus)
                    230        456  Aeromonas hydrophila subsp. hydrophila (strain ATCC 7966 / NCIB 9240)
                    231        455  Staphylococcus aureus (strain JH1)
                    232        454  Halobacterium salinarium (Halobacterium halobium)
                    233        453  Rickettsia felis (Rickettsia azadi)
                    234        453  Xanthomonas oryzae pv. oryzae (strain MAFF 311018)
                    235        452  Shewanella baltica (strain OS185)
                    236        452  Pseudomonas putida (strain W619)
                    237        449  Staphylococcus aureus (strain JH9)
                    238        449  Streptococcus mutans
                    239        448  Methylococcus capsulatus
                    240        448  Thermoanaerobacter tengcongensis
                    241        447  Ovis aries (Sheep)
                    242        447  Aeromonas salmonicida (strain A449)
                    243        446  Mycobacterium paratuberculosis
                    244        446  Vibrio fischeri (strain MJ11)
                    245        446  Rhodobacter sphaeroides (strain ATCC 17023 / 2.4.1 / NCIB 8253 / DSM 158)
                    246        444  Hahella chejuensis (strain KCTC 2396)
                    247        444  Pseudomonas mendocina (strain ymp)
                    248        443  Dechloromonas aromatica (strain RCB)
                    249        441  Streptococcus pyogenes serotype M6
                    250        441  Pyrococcus furiosus
                    
                    
                    
                    2.3  Taxonomic distribution of the sequences
                    
                    
                    
                    Kingdom        sequences (% of the database)
                    Archaea           18369 (  4%)
                    Bacteria         324655 ( 62%)
                    Eukaryota        164011 ( 31%)
                    Viruses           14984 (  3%)
                    
                    
                    Within Eukaryota:
                    
                    
                    
                    Category            sequences (% of Eukaryota) (% of the complete database)
                    Human                  20260 ( 12%)           (  4%)
                    Other Mammalia         44898 ( 27%)           (  9%)
                    Other Vertebrata       16355 ( 10%)           (  3%)
                    Viridiplantae          29945 ( 18%)           (  6%)
                    Fungi                  26953 ( 16%)           (  5%)
                    Insecta                 8118 (  5%)           (  2%)
                    Nematoda                4129 (  3%)           (  1%)
                    Other                  13353 (  8%)           (  3%)
                    
                    
                    
                    3.  SEQUENCE SIZE
                    
                    Repartition of the sequences by size (excluding fragments)
                    
                    From   To  Number             From   To   Number
                    1-  50    8529             1001-1100     3552
                    51- 100   40209             1101-1200     2460
                    101- 150   56133             1201-1300     1930
                    151- 200   56203             1301-1400     1795
                    201- 250   55084             1401-1500     1431
                    251- 300   48328             1501-1600      638
                    301- 350   48817             1601-1700      514
                    351- 400   41765             1701-1800      429
                    401- 450   34370             1801-1900      395
                    451- 500   27556             1901-2000      325
                    501- 550   19508             2001-2100      202
                    551- 600   13917             2101-2200      267
                    601- 650   11679             2201-2300      278
                    651- 700    8321             2301-2400      167
                    701- 750    6920             2401-2500      129
                    751- 800    4963             >2500         1022
                    801- 850    4305
                    851- 900    4826
                    901- 950    3670
                    951-1000    2563
                    
                    
                    
                    
                    The average sequence length in UniProtKB/Swiss-Prot is 352 amino acids.
                    
                    The shortest sequence is   GWA_SEPOF (P83570):     2 amino acids.
                    The longest sequence is  TITIN_MOUSE (A2ASS6): 35213 amino acids.
                    
                    
                    4.  JOURNAL CITATIONS
                    
                    Note: the following citation statistics reflect the number of distinct
                    journal citations.
                    
                    Total number of journals cited in this release of UniProtKB/Swiss-Prot: 2094
                    
                    
                    4.1 Table of the frequency of journal citations
                    
                    Journals cited 1x:  674
                    2x:  285
                    3x:  145
                    4x:  104
                    5x:   90
                    6x:   66
                    7x:   35
                    8x:   36
                    9x:   32
                    10x:   32
                    11- 20x:  167
                    21- 50x:  168
                    51-100x:  100
                    >100x:  160
                    
                    
                    4.2  List of the most cited journals in UniProtKB/Swiss-Prot
                    
                    Nb    Citations   Journal name
                    --    ---------   -------------------------------------------------------------
                    1        18263   Journal of Biological Chemistry
                    2         8440   Proceedings of the National Academy of Sciences of the U.S.A.
                    3         5070   Journal of Bacteriology
                    4         4569   Biochemical and Biophysical Research Communications
                    5         4513   Gene
                    6         4314   Nucleic Acids Research
                    7         4011   FEBS Letters
                    8         3934   Biochemistry
                    9         3791   The EMBO Journal
                    10         3466   Molecular and Cellular Biology
                    11         3280   Nature
                    12         3118   European Journal of Biochemistry
                    13         3115   Journal of Molecular Biology
                    14         3001   Biochimica et Biophysica Acta
                    15         2732   Cell
                    16         2481   Genomics
                    17         2202   Biochemical Journal
                    18         2160   Science
                    19         2073   Journal of Virology
                    20         1792   Molecular Microbiology
                    21         1601   Journal of Cell Biology
                    22         1510   Plant Molecular Biology
                    23         1398   Genes and Development
                    24         1380   Plant Physiology
                    25         1365   Virology
                    26         1339   Human Molecular Genetics
                    27         1334   Nature Genetics
                    28         1306   Molecular and General Genetics
                    29         1291   The American Journal of Human Genetics
                    30         1215   Oncogene
                    31         1191   Development
                    32         1171   Journal of Biochemistry
                    33         1094   Human Mutation
                    34         1047   Molecular Biology of the Cell
                    35         1023   Journal of Immunology
                    36          997   Genetics
                    37          937   The Plant Cell
                    38          904   Structure
                    39          891   Infection and Immunity
                    40          884   Journal of General Virology
                    41          867   Molecular Cell
                    42          836   Archives of Biochemistry and Biophysics
                    43          808   Blood
                    44          796   The Plant Journal
                    45          763   Microbiology
                    46          759   Yeast
                    47          751   Journal of Cell Science
                    48          736   Developmental Biology
                    49          679   Cancer Research
                    50          665   Current Biology
                    51          663   FEMS Microbiology Letters
                    52          598   Mechanisms of Development
                    53          597   Nature Structural Biology
                    54          595   Human Genetics
                    55          567   Acta Crystallographica, Section D
                    56          558   Protein Science
                    57          549   Applied and Environmental Microbiology
                    58          542   Journal of Neuroscience
                    59          527   Toxicon
                    60          524   Current Genetics
                    61          511   Neuron
                    62          509   Journal of Clinical Investigation
                    63          472   Mammalian Genome
                    64          467   American Journal of Physiology
                    65          452   The Journal of Experimental Medicine
                    66          451   Immunogenetics
                    67          446   Molecular Endocrinology
                    68          422   Molecular and Biochemical Parasitology
                    69          414   Journal of Neurochemistry
                    70          411   The Journal of Clinical Endocrinology and Metabolism
                    71          390   Endocrinology
                    72          379   Proteins
                    73          379   Journal of Molecular Evolution
                    74          370   Bioscience, Biotechnology, and Biochemistry
                    75          367   DNA and Cell Biology
                    76          359   Molecular Biology and Evolution
                    77          357   DNA Sequence
                    78          351   Journal of Medical Genetics
                    79          331   Plant and Cell Physiology
                    80          322   Nature Cell Biology
                    81          321   Tissue Antigens
                    82          315   Brain Research. Molecular Brain Research
                    83          307   Peptides
                    84          306   Experimental Cell Research
                    85          298   Comparative Biochemistry and Physiology
                    86          289   Biological Chemistry Hoppe-Seyler
                    87          285   Antimicrobial Agents and Chemotherapy
                    88          280   Journal of Investigative Dermatology
                    89          276   Cytogenetics and Cell Genetics
                    90          270   Molecular Pharmacology
                    91          262   Biology of Reproduction
                    92          259   Developmental Cell
                    93          250   Genome Research
                    94          248   Journal of General Microbiology
                    95          246   Neurology
                    96          243   Developmental Dynamics
                    97          242   RNA
                    98          237   Virus Research
                    99          218   Planta
                    100          215   Hoppe-Seyler's Zeitschrift fur Physiologische Chemie
                    101          211   Molecular Plant-Microbe Interactions
                    102          207   European Journal of Immunology
                    103          206   Biochimie
                    104          206   DNA Research
                    105          203   Annals of Neurology
                    106          203   Genes to Cells
                    107          197   Immunity
                    108          195   European Journal of Human Genetics
                    109          194   The New England Journal of Medicine
                    110          193   Eukaryotic cell
                    111          192   Nature Structural and Molecular Biology
                    112          188   The FEBS Journal
                    113          187   Journal of Human Genetics
                    114          175   Molecular and Cellular Endocrinology
                    115          175   EMBO Reports
                    116          171   Investigative Ophthalmology and Visual Science
                    117          169   The FASEB Journal
                    118          166   Archives of Microbiology
                    119          165   American Journal of Medical Genetics
                    120          163   Molecular Phylogenetics and Evolution
                    121          162   Insect Biochemistry and Molecular Biology
                    122          159   DNA
                    123          157   Molecular Immunology
                    124          153   Molecular Reproduction and Development
                    125          153   Diabetes
                    126          153   Hemoglobin
                    127          153   Archives of Virology
                    128          152   Bioorganicheskaia Khimiia
                    129          150   Glycobiology
                    130          146   Clinical Genetics
                    131          142   International Journal of Cancer
                    132          142   Journal of the American Chemical Society
                    133          140   Journal of Cellular Biochemistry
                    134          140   PLoS ONE
                    135          138   Molecular Genetics and Metabolism
                    136          137   General and Comparative Endocrinology
                    137          137   Animal Genetics
                    138          137   Molecular and Cellular Neuroscience
                    139          132   BMC Genomics
                    140          131   Nature Immunology
                    141          131   Biological Chemistry
                    142          130   British Journal of Haematology
                    143          129   Molecular Genetics and Genomics
                    144          127   American Journal of Medical Genetics. Part A
                    145          123   Journal of Lipid Research
                    146          122   Circulation Research
                    147          122   Agricultural and Biological Chemistry
                    148          120   Proteomics
                    149          118   Journal of Medicinal Chemistry
                    150          118   Protein Expression and Purification
                    
                    
                    5.  STATISTICS FOR SOME LINE TYPES
                    
                    The following table summarizes the total number of some UniProtKB/Swiss-Prot lines,
                    as well as the number of entries with at least one such line, and the
                    frequency of the lines.
                    
                    Total    Number of  Average
                    Line type / subtype                number   entries    per entry
                    ------------------------------------  -------- ---------  ---------
                    
                    References (RL)                       944524                 1.81                                         
                    Journal                            745842     392882      1.43       1                                 
                    Submitted to EMBL/GenBank/DDBJ     185898     171828      0.36       2                                 
                    Submitted to other databases        10701       9263      0.02       3                                 
                    Book citation                         646        632     <0.01       4                                 
                    Plant Gene Register                   565        553     <0.01       5                                 
                    Thesis                                403        400     <0.01       6                                 
                    Unpublished observations              293        289     <0.01       7                                 
                    Patent                                170        168     <0.01       8                                 
                    Worm Breeder's Gazette                  6          6     <0.01       9                                 
                    
                    Total number of distinct authors cited in UniProtKB/Swiss-Prot: 294614
                    
                    Total    Number of  Average
                    Line type / subtype                number   entries    per entry  Rank
                    ------------------------------------  -------- ---------  ---------  ----
                    Comments (CC)                        2252147                 4.31                                         
                    ALLERGEN                              473        473     <0.01      26                                 
                    ALTERNATIVE PRODUCTS                19111      19111      0.04      13                                 
                    BIOPHYSICOCHEMICAL PROPERTIES        3281       3281      0.01      22                                 
                    BIOTECHNOLOGY                         279        277     <0.01      28                                 
                    CATALYTIC ACTIVITY                 226672     206860      0.43       4                                 
                    CAUTION                              7191       7048      0.01      19                                 
                    COFACTOR                           101156      92913      0.19       7                                 
                    DEVELOPMENTAL STAGE                  9054       9054      0.02      17                                 
                    DISEASE                              4369       2962      0.01      21                                 
                    DISRUPTION PHENOTYPE                 3090       3090      0.01      23                                 
                    DOMAIN                              33065      29170      0.06      11                                 
                    ENZYME REGULATION                    9326       9326      0.02      16                                 
                    FUNCTION                           391165     374953      0.75       2                                 
                    INDUCTION                           12357      12357      0.02      15                                 
                    INTERACTION                         12853      12853      0.02      14                                 
                    MASS SPECTROMETRY                    4614       3502      0.01      20                                 
                    MISCELLANEOUS                       30543      28179      0.06      12                                 
                    PATHWAY                            128166     117117      0.25       6                                 
                    PHARMACEUTICAL                         84         84     <0.01      29                                 
                    POLYMORPHISM                          796        761     <0.01      24                                 
                    PTM                                 36853      29698      0.07       9                                 
                    RNA EDITING                           613        613     <0.01      25                                 
                    SEQUENCE CAUTION                    38321      38321      0.07       8                                 
                    SIMILARITY                         608772     497355      1.17       1                                 
                    SUBCELLULAR LOCATION               302809     297636      0.58       3                                 
                    SUBUNIT                            224105     224105      0.43       5                                 
                    TISSUE SPECIFICITY                  34121      34121      0.07      10                                 
                    TOXIC DOSE                            455        444     <0.01      27                                 
                    WEB RESOURCE                         8453       6738      0.02      18                                 
                    
                    Total number of comment topics: 29
                    
                    
                    Total    Number of  Average
                    Line type / subtype                number   entries    per entry  Rank
                    ------------------------------------  -------- ---------  ---------  ----
                    Features (FT)                        3295288                 6.31                                         
                    ACT_SITE                           130217      78403      0.25       9                                 
                    BINDING                            214356      59707      0.41       4                                 
                    CA_BIND                              3760       1549      0.01      35                                 
                    CARBOHYD                           101065      25660      0.19      13                                 
                    CHAIN                              528544     516724      1.01       1                                 
                    COILED                              18630      12696      0.04      26                                 
                    COMPBIAS                            50210      26251      0.10      18                                 
                    CONFLICT                           117891      41367      0.23      11                                 
                    CROSSLNK                             5921       3545      0.01      34                                 
                    DISULFID                            97968      26307      0.19      14                                 
                    DNA_BIND                            10995      10123      0.02      30                                 
                    DOMAIN                             146377      87375      0.28       6                                 
                    HELIX                              131852      13774      0.25       8                                 
                    INIT_MET                            14883      14883      0.03      27                                 
                    INTRAMEM                             1565        731     <0.01      38                                 
                    LIPID                               10658       6781      0.02      31                                 
                    METAL                              280848      69230      0.54       3                                 
                    MOD_RES                            181588      60116      0.35       5                                 
                    MOTIF                               32712      21003      0.06      22                                 
                    MUTAGEN                             32668       7762      0.06      23                                 
                    NON_CONS                             1903        722     <0.01      37                                 
                    NON_STD                               351        276     <0.01      39                                 
                    NON_TER                             11937       9084      0.02      28                                 
                    NP_BIND                            107779      68664      0.21      12                                 
                    PEPTIDE                              9110       6014      0.02      32                                 
                    PROPEP                              11406       9752      0.02      29                                 
                    REGION                              96845      52447      0.19      15                                 
                    REPEAT                              89639      13229      0.17      16                                 
                    SIGNAL                              35342      35332      0.07      21                                 
                    SITE                                38173      22682      0.07      20                                 
                    STRAND                             132001      12873      0.25       7                                 
                    TOPO_DOM                           118042      24274      0.23      10                                 
                    TRANSIT                              6999       6912      0.01      33                                 
                    TRANSMEM                           341276      69889      0.65       2                                 
                    TURN                                31372      10870      0.06      24                                 
                    UNSURE                               2240        440     <0.01      36                                 
                    VAR_SEQ                             39513      16984      0.08      19                                 
                    VARIANT                             80124      16533      0.15      17                                 
                    ZN_FING                             28528      12424      0.05      25                                 
                    
                    Total number of feature keys: 39
                    
                    
                    
                    Total    Number of  Average
                    Line type / subtype                number   entries    per entry  Rank      Category
                    ------------------------------------  -------- ---------  ---------  ----      -------------------------------------------
                    Cross-references (DR)               13881413                26.59                                                           
                    2DBase-Ecoli                           85         85     <0.01     119      2D gel databases                             
                    Aarhus/Ghent-2DPAGE                   126         96     <0.01     116      2D gel databases                             
                    AGD                                   851        845     <0.01      95      Organism-specific databases                  
                    ANU-2DPAGE                             23         23     <0.01     125      2D gel databases                             
                    ArachnoServer                         549        545     <0.01     101      Organism-specific databases                  
                    ArrayExpress                        58319      58319      0.11      40      Gene expression databases                    
                    Bgee                                39503      39491      0.08      46      Gene expression databases                    
                    BindingDB                             297        297     <0.01     111      Other                                        
                    BioCyc                             251758     243154      0.48      19      Enzyme and pathway databases                 
                    BRENDA                              65209      62409      0.12      38      Enzyme and pathway databases                 
                    CAZy                                 7230       6487      0.01      69      Protein family/group databases               
                    CGD                                   575        567     <0.01     100      Organism-specific databases                  
                    CleanEx                             30178      29528      0.06      48      Gene expression databases                    
                    COMPLUYEAST-2DPAGE                    101        100     <0.01     118      2D gel databases                             
                    ConoServer                            613        587     <0.01      99      Organism-specific databases                  
                    Cornea-2DPAGE                          67         67     <0.01     120      2D gel databases                             
                    CTD                                 65566      64780      0.13      37      Organism-specific databases                  
                    CYGD                                 6638       6542      0.01      72      Organism-specific databases                  
                    dictyBase                            4134       4134      0.01      84      Organism-specific databases                  
                    DIP                                 12388      12267      0.02      63      Protein-protein interaction databases        
                    DisProt                               397        394     <0.01     107      3D structure databases                       
                    DOSAC-COBS-2DPAGE                     149        147     <0.01     115      2D gel databases                             
                    DrugBank                             5317       1626      0.01      74      Other                                        
                    EchoBASE                             4167       4163      0.01      83      Organism-specific databases                  
                    ECO2DBASE                             352        300     <0.01     110      2D gel databases                             
                    EcoGene                              4396       4394      0.01      80      Organism-specific databases                  
                    eggNOG                             217959     217959      0.42      20      Phylogenomic databases                       
                    EMBL                               868848     511926      1.66       3      Sequence databases                           
                    Ensembl                             73548      57397      0.14      33      Genome annotation databases                  
                    EnsemblBacteria                     97613      84473      0.19      28      Genome annotation databases                  
                    EnsemblFungi                        14665      14523      0.03      59      Genome annotation databases                  
                    EnsemblMetazoa                      12784       8398      0.02      62      Genome annotation databases                  
                    EnsemblPlants                       13603      12098      0.03      60      Genome annotation databases                  
                    EnsemblProtists                      4323       4206      0.01      82      Genome annotation databases                  
                    euHCVdb                                55         44     <0.01     121      Organism-specific databases                  
                    EuPathDB                              258        258     <0.01     113      Organism-specific databases                  
                    FlyBase                              5717       5343      0.01      73      Organism-specific databases                  
                    Gene3D                             278669     219828      0.53      18      Family and domain databases                  
                    GeneCards                           20449      19812      0.04      52      Organism-specific databases                  
                    GeneDB_Spombe                        4978       4934      0.01      76      Organism-specific databases                  
                    GeneFarm                             2720       2706      0.01      87      Organism-specific databases                  
                    GeneID                             464829     445314      0.89       6      Genome annotation databases                  
                    Genevestigator                      65064      65064      0.12      39      Gene expression databases                    
                    GenoList                             7048       7036      0.01      70      Organism-specific databases                  
                    GenomeReviews                      379990     359885      0.73      10      Genome annotation databases                  
                    GermOnline                          41926      41307      0.08      45      Gene expression databases                    
                    GlycoSuiteDB                          280        280     <0.01     112      PTM databases                                
                    GO                                2110807     488696      4.04       1      Ontologies                                   
                    Gramene                              4482       4482      0.01      79      Organism-specific databases                  
                    H-InvDB                             13210      12312      0.03      61      Organism-specific databases                  
                    HAMAP                              308457     308313      0.59      16      Family and domain databases                  
                    HGNC                                19715      19537      0.04      54      Organism-specific databases                  
                    HOGENOM                            361592     361592      0.69      12      Phylogenomic databases                       
                    HOVERGEN                            74572      74572      0.14      32      Phylogenomic databases                       
                    HPA                                 11299       8338      0.02      64      Organism-specific databases                  
                    HSSP                                29321      29321      0.06      49      3D structure databases                       
                    InParanoid                          66727      66727      0.13      36      Phylogenomic databases                       
                    IntAct                              24340      24338      0.05      51      Protein-protein interaction databases        
                    InterPro                          1685030     497816      3.23       2      Family and domain databases                  
                    IPI                                 89849      64346      0.17      30      Sequence databases                           
                    KEGG                               436707     415949      0.84       8      Genome annotation databases                  
                    LegioList                             761        759     <0.01      96      Organism-specific databases                  
                    Leproma                               670        667     <0.01      98      Organism-specific databases                  
                    MaizeGDB                              470        466     <0.01     103      Organism-specific databases                  
                    MEROPS                              10280       9947      0.02      65      Protein family/group databases               
                    MGI                                 16221      16175      0.03      57      Organism-specific databases                  
                    MIM                                 16388      12863      0.03      56      Organism-specific databases                  
                    MINT                                17491      17491      0.03      55      Protein-protein interaction databases        
                    NextBio                             48844      48842      0.09      43      Other                                        
                    NMPDR                              131027     131022      0.25      25      Genome annotation databases                  
                    OGP                                   377        377     <0.01     108      2D gel databases                             
                    OMA                                369741     369741      0.71      11      Phylogenomic databases                       
                    Orphanet                             3540       2241      0.01      85      Organism-specific databases                  
                    OrthoDB                             56805      56805      0.11      41      Phylogenomic databases                       
                    PANTHER                            186370     170933      0.36      22      Family and domain databases                  
                    Pathway_Interaction_DB               4567       1665      0.01      78      Enzyme and pathway databases                 
                    PDB                                 70720      16268      0.14      35      3D structure databases                       
                    PDBsum                              70720      16268      0.14      34      3D structure databases                       
                    PeptideAtlas                         5168       5168      0.01      75      Proteomic databases                          
                    PeroxiBase                            737        725     <0.01      97      Protein family/group databases               
                    Pfam                               694648     486458      1.33       4      Family and domain databases                  
                    PharmGKB                            15779      15768      0.03      58      Organism-specific databases                  
                    PHCI-2DPAGE                           247        247     <0.01     114      2D gel databases                             
                    PhosphoSite                         20432      20432      0.04      53      PTM databases                                
                    PhosSite                              352        352     <0.01     109      PTM databases                                
                    PhylomeDB                          122249     122249      0.23      26      Phylogenomic databases                       
                    PIR                                115796     105806      0.22      27      Sequence databases                           
                    PIRSF                               84484      84484      0.16      31      Family and domain databases                  
                    PMAP-CutDB                           1394       1394     <0.01      90      Other                                        
                    PMMA-2DPAGE                            52         52     <0.01     122      2D gel databases                             
                    PptaseDB                               34         34     <0.01     123      Protein family/group databases               
                    PRIDE                               54132      54132      0.10      42      Proteomic databases                          
                    PRINTS                             137177     118789      0.26      24      Family and domain databases                  
                    ProDom                              27674      27495      0.05      50      Family and domain databases                  
                    ProMEX                                459        459     <0.01     104      Proteomic databases                          
                    PROSITE                            463427     294991      0.89       7      Family and domain databases                  
                    ProtClustDB                        325333     325333      0.62      14      Phylogenomic databases                       
                    ProteinModelPortal                 412564     412564      0.79       9      3D structure databases                       
                    PseudoCAP                            1222       1213     <0.01      92      Organism-specific databases                  
                    Rat-heart-2DPAGE                       28         28     <0.01     124      2D gel databases                             
                    Reactome                             7842       4675      0.02      67      Enzyme and pathway databases                 
                    REBASE                                441        400     <0.01     106      Protein family/group databases               
                    RefSeq                             487240     445619      0.93       5      Sequence databases                           
                    REPRODUCTION-2DPAGE                  1255       1034     <0.01      91      2D gel databases                             
                    RGD                                  7455       7451      0.01      68      Organism-specific databases                  
                    SGD                                  6638       6558      0.01      71      Organism-specific databases                  
                    Siena-2DPAGE                          102        102     <0.01     117      2D gel databases                             
                    SMART                              154505     117856      0.30      23      Family and domain databases                  
                    SMR                                348143     348143      0.67      13      3D structure databases                       
                    STRING                             205604     205599      0.39      21      Protein-protein interaction databases        
                    SUPFAM                             313989     250771      0.60      15      Family and domain databases                  
                    SWISS-2DPAGE                         1184       1183     <0.01      93      2D gel databases                             
                    TAIR                                 9660       9560      0.02      66      Organism-specific databases                  
                    TCDB                                 3450       3443      0.01      86      Protein family/group databases               
                    TIGR                                34171      33402      0.07      47      Genome annotation databases                  
                    TIGRFAMs                           284622     264837      0.55      17      Family and domain databases                  
                    TubercuList                          1734       1698     <0.01      89      Organism-specific databases                  
                    UCD-2DPAGE                            511        502     <0.01     102      2D gel databases                             
                    UCSC                                48573      39591      0.09      44      Genome annotation databases                  
                    UniGene                             92527      84267      0.18      29      Sequence databases                           
                    VectorBase                            444        430     <0.01     105      Genome annotation databases                  
                    World-2DPAGE                          915        904     <0.01      94      2D gel databases                             
                    WormBase                             4598       3780      0.01      77      Organism-specific databases                  
                    Xenbase                              4379       4312      0.01      81      Organism-specific databases                  
                    ZFIN                                 2629       2618      0.01      88      Organism-specific databases                  
                    
                    Total number of cross-referenced databases: 125
                    
                    6.  AMINO ACID COMPOSITION
                    
                    6.1  Composition in percent for the complete database
                    
                    Ala (A) 8.27   Gln (Q) 3.93   Leu (L) 9.67   Ser (S) 6.52
                    Arg (R) 5.53   Glu (E) 6.76   Lys (K) 5.85   Thr (T) 5.33
                    Asn (N) 4.05   Gly (G) 7.09   Met (M) 2.42   Trp (W) 1.08
                    Asp (D) 5.45   His (H) 2.27   Phe (F) 3.86   Tyr (Y) 2.91
                    Cys (C) 1.36   Ile (I) 5.98   Pro (P) 4.69   Val (V) 6.87
                    
                    Asx (B) 0.000  Glx (Z) 0.000  Xaa (X) 0.00
                    
                    
                    
                    Legend: gray = aliphatic, red = acidic, green = small hydroxy,
                    blue = basic, black = aromatic, white = amide, yellow = sulfur
                    
                    
                    6.2  Classification of the amino acids by their frequency
                    
                    Leu, Ala, Gly, Val, Glu, Ser, Ile, Lys, Arg, Asp, Thr, Pro, Asn, Gln,
                    Phe, Tyr, Met, His, Cys, Trp
                    
                    
                    7.  MISCELLANEOUS STATISTICS
                    
                    4448 entries are encoded on a mitochondrion, and 3588 are encoded on a plasmid.
                    
                    12180 entries are encoded on a plastid, 
                    of which 21 are encoded on apicoplasts, 
                    11620 on chloroplasts, 
                    46 on organellar chromatophores,
                    145 on cyanelles, 
                    149 on non-photosynthetic plastids and 
                    199 on unspecified types of plastid.
                    
                    Number of entries with at least one sequence correction: 69667