Table 1 Overview of the data and data sources used in this study

From: HaMStR: Profile hidden markov model based search for orthologs in ESTs

Proteins  
InParanoid a Homo sapiens
  Monodelphis domestica
  Ciona intestinalis
  Drosophila melanogaster
  Caenorhabditis elegans
  Saccharomyces cerevisiae
  Debaryomyces hansenii
  Kluyveromyces lactis
  Candida glabrata
  Yarrowia lipolytica
Broad Institute b Uncinocarpus reesii
  Stagonospora nodorum
  Chaetomium globosum
  Clavispora lusitaniae (Candida lusitaniae)
  Pichia guilliermondii (Candida guilliermondii)
  Candida tropicalis
  Candida albicans
UniProt c Aspergillus fumigatus
  Aspergillus terreus
  Aspergillus oryzae
  Ashbya gossypii
Genoscope d Podospora anserina
ESTs  
dbEST e Ustilago maydis (39308)
  Cryptococcus neoformans (59041)
  Phanerochaete chrysosporium (13189)
  Coprinopsis cinerea (15715)
  Schizosaccharomyces pombe (8123)
  Ajellomyces capsulatus (26389)
  Neurospora crassa (20089)
  Trichoderma atroviride (1656)
  Trichoderma asperellum (1882)
  Trichoderma harzianum (12165)
  Fusarium oxysporum (9248)
  Botrytis cinerea (10982)
  Sclerotinia sclerotiorum (1494)
  Fusarium graminearum (6678)
TGI f Coccidioides immitis (9312)
  Aspergillus nidulans (13100)
  Fusarium verticillioides (11126)
  Magnaporthe grisea (20890)
  1. a http://inparanoid.sbc.su.se
  2. b http://www.broadinstitute.org/science/data#
  3. c ftp://ftp.ebi.ac.uk/pub/databases/integr8/fasta/proteomes
  4. d http://podospora.igmors.u-psud.fr/
  5. e http://www.ncbi.nlm.nih.gov/dbEST. The numbers of ESTs per organism are given in parentheses.
  6. f http://compbio.dfci.harvard.edu/tgi. The numbers of tentative consensus sequences are given in parentheses.