A phylogenomic profile of globins
© Vinogradov et al; licensee BioMed Central Ltd. 2006
Received: 15 December 2005
Accepted: 07 April 2006
Published: 07 April 2006
Globins occur in all three kingdoms of life: they can be classified into single-domain globins and chimeric globins. The latter comprise the flavohemoglobins with a C-terminal FAD-binding domain and the gene-regulating globin coupled sensors, with variable C-terminal domains. The single-domain globins encompass sequences related to chimeric globins and «truncated» hemoglobins with a 2-over-2 instead of the canonical 3-over-3 α-helical fold.
A census of globins in 26 archaeal, 245 bacterial and 49 eukaryote genomes was carried out. Only ~25% of archaea have globins, including globin coupled sensors, related single domain globins and 2-over-2 globins. From one to seven globins per genome were found in ~65% of the bacterial genomes: the presence and number of globins are positively correlated with genome size. Globins appear to be mostly absent in Bacteroidetes/Chlorobi, Chlamydia, Lactobacillales, Mollicutes, Rickettsiales, Pastorellales and Spirochaetes. Single domain globins occur in metazoans and flavohemoglobins are found in fungi, diplomonads and mycetozoans. Although red algae have single domain globins, including 2-over-2 globins, the green algae and ciliates have only 2-over-2 globins. Plants have symbiotic and nonsymbiotic single domain hemoglobins and 2-over-2 hemoglobins. Over 90% of eukaryotes have globins: the nematode Caenorhabditis has the most putative globins, ~33. No globins occur in the parasitic, unicellular eukaryotes such as Encephalitozoon, Entamoeba, Plasmodium and Trypanosoma.
Although Bacteria have all three types of globins, Archaeado not have flavohemoglobins and Eukaryotes lack globin coupled sensors. Since the hemoglobins in organisms other than animals are enzymes or sensors, it is likely that the evolution of an oxygen transport function accompanied the emergence of multicellular animals.
The number of globin families has grown markedly since the 1970's, when they were limited to the α- and β-globins and Mbs, mostly of vertebrates, and the SHbs (legHbs) of legume plants [1–3]. The ensuing years brought to light several new globins: SHbs in plants other than legumes, NsHbs in a wide variety of plants [4–6], and FHbs, chimeric proteins (~400aa) comprising an N-terminal globin domain and a C-terminal FAD- and NAD-binding domain, related to the ferredoxin-NADP+ reductases, found in E. coli  and in yeasts [8, 9]. Concurrently, "truncated" globins, sequences shorter than normal (<130aa), were discovered in protozoa , cyanobacteria [11–13], a nemertean , and bacteria [15, 16]. Globins longer than normal (>160aa), similar to the "truncated" Hbs, were observed in a green alga [17, 18] and in plants . The crystal structures of several "truncated" Hbs showed them to have a novel 2-over-2 α-helical fold instead of the canonical 3-over-3 α-helical fold, with an abbreviated A helix, a decreased CE interhelical region and most of the F helix occurring as a loop [20–22]. Consequently, we advocate using 2/2 Hb instead of "truncated", to indicate the distinctive secondary structure of this group of globins and to bring order to the chaotic terminology in the existing databases, where "truncated", "cyanobacterial", "protozoan" and "2-over-2" are all in current use.
The utilization of molecular biological techniques allowed the detection of globins in organisms where their presence was unsuspected, such as the nematode Caenorhabditis elegans and in other nematodes [23–30], the dipteran Drosophila melanogaster  and the urochordate Ciona intestinalis . The recent, rapid accumulation of genomic information has resulted in a substantial increase in newly recognized globins. This list includes the Ngbs [33, 34] and Cygbs [35–37], which are believed to occur in all vertebrates from humans to birds and fish , GbE, the eye-specific globin in the domestic chicken Gallus gallus, related to Cygb , and GbX, a new and fifth type of globin gene in fish and amphibians, thought to have been lost in the higher vertebrates . FHbs were found in a variety of bacterial groups [16, 41–43] together with SDgbs, which align with the FHb globin domains [42, 43]. Furthermore, globin coupled sensor proteins, chimeric two-domain gene regulators (~300 to >700aa) comprising an N-terminal globin domain, were discovered in an archaean and several bacterial groups [44–47]. Lastly, "protoglobins" (~195aa) related to the former, were found in archaea and in several bacteria  and proposed to represent the "ancestral" globin.
It should be emphasized, that the presence of putative globins inferred using SUPERFAMILY and blastp and psiblast searches of the GenBank database, does not require having a completed genome. On the other hand, the absence of a globin is certain only when a complete genome is searched and no sequence is found.
Number of bacterial genomes with one or more classes of. 2-over-2 Hbs
Coexistence of FHbs with SDgbs in bacterial genomes.
Coexistence of FHbs/SDgbs with 2-over-2 Hbs in bacterial genomes.
FHb or SDgb only
With class 1
With class 2
With class 3
With class 1+2
With class 1+3
With class 2+3
Alignment of bacterial 2/2 Hb sequences indicates that they can be divided into three separate classes [16, 53]. The length of the 2/2 Hbs varies from 118aa for the cyanobacterium Nostoc (ZP_00112318) to 260aa in Streptomyces avermitilis (NP_824492), with the majority (>60) being <140aa. Examination of coexistence of the three classes of 2/2 Hbs (Table 1) shows an interesting trend: although only the Alphaproteobacterium Hyphomonas neptunium has both 1 and 3 and two have all three classes, 14 genomes have classes 1 and 2 and 18 have classes 2 and 3. These results suggest that the class 1 and 3 2/2 Hbs are derived from the class 2 2/2 Hbs, as pointed out very recently by Vuletich and Lecomte .
The finding of an FHb in E. coli  led to the discovery of some 30 bacterial FHbs [16, 42, 43, 54]; currently, the number is over 70 (Supplementary Data Table 3). R. Poole and his group  and Frey and Kallio  were the first to point out the similarity between the FHb globin domains, the first bacterial Hb to be sequenced, that of Vitreoscilla stercoraria , and several other SD bacterial globins. We now have a total of 25 SDgbs in 22 genomes. The crystal structures of the P. aeruginosa (PDB: 1tu9)  and Vitreoscilla stercoraria (PDB: 2vhb) , show them to be very similar to the globin domains of the FHbs from E. coli (PDB: 1gvh)  and Ralstonia (Alcaligenes) eutropha (PDB: 1cqx) (59]. It is appropriate to note here that Ralstonia has undergone two name alterations recently, to Wautersia eutropha and now to Cupriavidus necato r . Although 22 genomes have SDgbs, three of them have two different globins: Bradyrhizobium, Rhodopseudomonas and Novosphingobium. Table 2 shows that FHbs and SDgbs coexist in only 6 genomes: Chromobacterium, Photobacterium, Pseudomonas aeruginosa, Rhodopirellula, Thermobifida and Vibrio parahaemolyticus. Table 3 shows the statistics of occurrence of 2/2 Hbs with the FHbs/SDgbs; the latter tend to occur with class 2 and with both class 2 and 3 2/2 Hbs.
M. Alam and his group have identified 27 GCSs [46, 47] and 3 Pgbs from bacteria – Chloroflexus aurantiacus ZP_00359040 (227aa), Thermobifida fusca ZP_00293478 (197aa) and Thermosynechococcus elongatus NP_682779 (194aa) . Blastp searches found an additional two dozen GCSs, and Pgbs in Thermus thermophilus YP_005074 (203aa) and the actinobacterium Rubrobacter xylanophilus ZP_00200180 (196aa) [49, 61]. The recent crystal structure of B. subtilis GCS shows it to have a 3/3 fold .
The genomes with the largest number of globins, 5 to 7, are all from the Alpha- and Betaproteobacteria; they do include however, redundant sequences. The following have all three lineages of globins:Burkholderia fungorum (9.67 Mbp), Chromobacterium violaceum (4.75 Mbp), Novosphingobium aromaticivorans (4.21 Mbp) and Silicibacter sp.TM1040 (4.14 Mbp), each with 5 globins, the three Bordetella species (4.09–5.34 Mbp), Exiguobacterum sp.255–1 5 (2.89 Mbp), Shewanella baltica (5.01 Mbp), Sinorhizobium meliloti and Thermobifida fusca (3.64 Mbp), each with 4 globins, and Azotobacter vinelandii (5.42 Mbp), all the Bacillus species (except B. licheniformis and B. stearothermophilus) (4.20–5.50 Mbp) and Geobacillus kaustophilus (3.59 Mbp), each with 3 globins. At the other end of the scale, the smallest genome (1.31 Mbp) to have a globin, a 2/2 Hb1, is that of the marine Alphaproteobacterium Pelagibacter ubiques (Rickettsiales), followed by Aquifex aeolicus (1.59 Mbp), and the Epsilonbacterium Campylobacter jejuni (1.64 Mbp), with one (SDgb) and two (2/2Hb and SDgb) globins, respectively.
The bacterial divisions represented by 10 or more genomes, rank in the following order of having globins: Betaproteobacteria – 92% (22/24) > Actinobacteria – 85% (22/26) > Gammaproteobacteria – 76% (38/50) > Alphaproteobacteria – 69% (24/35) > Cyanobacteria – 58% (7/12) > Firmicutes – 49% (19/39). Overall, 161 of the 245 (~65%) bacterial genomes have globins and the number of globins varies from 1 to 7.
Number of globins found in bacterial genomes and the corresponding mean genome sizes.
No. of globins
No of completed genomes
Mean genome size, Mbp
1.9 ± 1.1
3.9 ± 1.5
4.2 ± 1.2
4.7 ± 1.2
6.5 ± 1.9
The globins identified in eukaryote genomes are listed in Supplemental Data Tables 4–6 [see Additional File 1]. The globins listed for the vertebrate genomes, Danio (Brachydanio) rerio, Fugu (Takifugu) rubripes, Gallus gallus, Homo sapiens, Mus musculus, Rattus norvegicus and Xenopus tropicalis, include the familiar α- and β-globins and Mb, as well as the recently discovered Ngbs [33, 34] and Cygbs [35–37]. A recent addition to human α-globins is μ-globin (gi|51510893|NP_001003938, 141aa) which appears to be very similar to the avian α-D globin . In addition, an eye-specific globin, GbE, was found in Gallus gallus  and a new globin, GbX, was discovered in the fish, bird and amphibian genomes . Curiously, no Mb-like sequence was found in either Xenopus tropicalis or X. laevis. The zebrafish Danio rerio has 6 embryonic globins , three α and three β- only 5 are listed in the GenBank. The pufferfish Fugu rubripes, appears to have four α-globin chains ; only three of them are listed in the GenBank.
The four globins found in the genome of the urochordate Ciona intestinalis (sea squirt), have been identified earlier . This important finding corrects the erroneous claim of total absence of globins in the report of the genome .
At least one putative globin (gi|72169631|XP_795670, 175aa) was found in the recently completed genome assembly of the echinoderm Strongylocentrotus purpuratus (sea urchin). Used as query in blastp searches it hits the GlbXs and Cygbs of fish, and other vertebrate Cygbs, before recognizing the coelomic globin (gi|729576;P80018) of a fellow echinoderm, the holothurian Caudina arenicola.
There appear to be 4 putative globins in Anopheles gambiae: gi|55246163| EAA03862.3 (150aa), gi|31201271|XP_309583 (192aa), gi|31198253|XP_308074 (215aa) and gi|57914327|XP_555006 (182aa). Their alignment shows that the first three are closely related, sharing essentially identical globin domains: gi|55246163 has an 18aa deletion in the BC helix and the CD corner, while gi|31198253 lacks part of the G and all of the H helices. Thus, it is likely that gi|31201271 and gi|57914327 are the only stable globins. The C-terminal 115aa globin domain of gi|31198253 is reminiscent of the Mb gene sequence from the antarctic icefish Champsocephalus esox (gi|24266940|AAN52371, 103aa) that does not appear to be expressed . It remains to be determined whether the N-terminal domain of gi|31198253 stabilizes it enough to be expressed.
At least 33 putative globins and globin domains were identified in Caenorhabditis elegans, together with their orthologs in C. briggsae : they are listed in Supplemental Data Table 5 [see Additional File 1]; their alignment is provided in Supplemental Data Fig. 2 [see Additional File 2]. Several C. elegans/C. briggsae orthologs have unusually long interhelical inserts between the G and H helices: 40aa in CE01012/CBP03870) (216aa), 21aa in CE01528 (CBP00622) (266aa), 22aa in CE34964/CBP23619 (196aa) and 21aa in CE34658/CBP07299 (230aa). Although three pairs, CE17437/CBP02580, CE04582/CBP03989 and CE35828/CBP14482, have N-terminal globin domains, several have C-terminal domains: CE03523/CBP15914, CE04843/CBP01576, CE05316/CBP02907, CE12774/CBP21478), CE36044/CBP02078), CE29586/CBP02293, CE30683/CBP02390 and CE31132/CBP15597. The N-terminal portion of the latter is identified as a G protein-coupled receptor-like domain with 7 putative transmembrane helices (identified using TMHMM Server V.2.0). The nonglobin domains in the remaining proteins could not be identified via blastp searches.
Although globin genes were thought to be absent in the genome of Drosophila melanogaster , the presence of at least one (CG9734) has been demonstrated unequivocally by Burmester and Hankeln [31, 70]. There appear to be two more putative globins, CG15180 (209aa) and CG14675 (195aa); both provide acceptable alignments. Three putative globins are also found in D. pseudoobscura, EAL28410 (152aa), EAL28093 (641aa, 40–150) and EAL 28094 (130aa) , and appear to be orthologs of D. melanogaster CG9734, CG15180 and CG14675, respectively. Although the D. pseudoobscura EAL28093 and EAL28094 are scored by FUGUE as certainly globins (Z ~10), the former lacks an appropriate F8 His residue and the latter appears to be missing the A helix. Psiblast searches show that the ortholog pairs CG15180/EAL28093 and CG14675/EAL28094) recognize each other but not the CG9734/EAL28410 pair, and also recognize the Anopheles globins (ENSANG19788, 22287 and 26474), together with distant recognition of echinoderm globins and vertebrate β-globins. The CG9734/EAL28410 pair hits the other known insect globins (Gasterophilus, Chironomus species, Kiefferulus, Tokunagayusurika), the vertebrate Cygbs and the arthropod multidomain Hbs.
Three of the four Arabidopsis thaliana sequences, GLBs 1–3 have been identified earlier [19, 72]: GLB1 and GLB2 are NsHbs and GLB3 is a 2/2Hb. A 693aa protein (gi|7486404|T04457) was found to have an N-terminal 2/2Hb domain, with the first 138aa (up to position H13), identical to GLB3. Although the ~500aa C-terminal portion does not correspond to any known protein domain(s), a blastp search using it as query found an identical sequence in the N-terminal portion of the 2154aa protein TEBICHI (gi|62241195|BAD93700). Both proteins share the same region on Arabidopsis chromosome 4 and TEBICHI has a helicase domain at 490–890aa and a polymerase domain at 1700–2150aa; furthermore, this N-terminal portion is missing in animal homologues of TEB protein, MUS308/POLQ (Dr. Soichi Inagaki, personal communication).
The genome of Oryza sativa contains the four NsHbs identified by Arredondo-Peter et al. [73, 74], a 172aa 2/2Hb (gi|50725383|BAD32857) and a 145aa globin (gi|50932383|XP_475719). A blastp search identified the latter as another NsHbs.
Several putative globins occur in the genome of the green alga Chlamydomonas reinhardtii: 160981 (C_240146) (136aa), 160982 (C_240147) (147aa), 157690 (C_1780015) (231aa), and two larger sequences, 168934 (C_60169) (476aa) with one N-terminal globin-like domain, and 153190 (C_100138) (837aa) with two consecutive N-terminal globin domains. Two globins were found in the genome of the diatom Phaeodactylum tricornutum: PTMM3909 and 05212. Two globins occur in the genome of another diatom Thalassiosira pseudonana, Scaffold_137 (158aa) and Scaffold_18 (160aa), and a single 185aa globin, CMR319C, in the genome of the red alga Cyanidioschyzon merolae. Blastp searches identified all the C. reinhardtii globins, the Phaeodactylum PTMM 05212 and the Thalassiosira Scaffold_18 as 2/2 Hbs, and the Cyanidioschyzon CMR319C,Phaeodactylum PTMM 3909 and Thalassiosir a Scaffold_137 as SDgbs, closely related to the bacterial and fungal FHbs and the bacterial SDgbs.
The globins present in the fungi are listed in Supplemental Data Table 6 [see Additional File 1]. All the recently completed fungal genomes, which belong overwhelmingly to the Ascomycota (17 of 19), have FHbs; 14 have two to four FHbs, probably due to genome duplication . In addition to Saccharomyces cerevisiae NP_014165 (426aa, 154–302) which was known to have a central globin domain , we found an additional 12 FHbs to have globin domains within the central portion of their sequences. All belong to the Saccharomycotina: Candida albicans EAK92722 (563aa, 298–463), Candida glabrata XP_448033 (432aa, 124–267), Eremothecium (Ashbya) gossypii NP_982746 (436aa, 184–339), Kluyveromyces lactis XP_453939 (430aa, 181–323), Kluyveromyces waltii Kwal_22190 (421aa, 171–314) Kwal_4395 (460aa, 202–352) and Kwal_24852 (543, 173–315), Saccharomyces bayanus ORFP:20532(424aa, 171–303), Saccharomyces mikatae ORFP:18051 (410aa, 135–290), Saccharomyces paradoxus ORFP:18484 (426aa, 155–306) and Yarrowia lipolytica XP_502881 (463aa, 186–325) and XP_499869 (471aa, 194–333). Although the globin domains align well with the other FHbs, the N- and C-terminal portions do not recognize the C-terminal moieties of the other fungal and bacterial FHbs or any other known protein, as found earlier for the S. cerevisiae FHb . Surprisingly, the percent identities between the CDFHbs and the fungal and bacterial FHb globin domains is about 40% and 30%, respectively.
No globin genes appear to exist in the genomes of the Apicomplexans Entamoeba histolytica and Plasmodium falciparum, as well as the Microsporidian Encephalitozoon cuniculi and the Euglenazoan Trypanosoma brucei. Although a Plasmodium bergei globin-like protein of 228aa (Q86QI8) is listed in the GenBank, it is probably an artefact, since when used as a query in blastp searches, it hits only vertebrate β- and α-globins.
Compared to Archaea and Bacteria, the fraction of eukaryote genomes with globins is much higher, over 90%. Although all vertebrates have Hb and Mb and probably Ngb and Cygb as well , the Antarctic icefish belonging to the family Channichthyidae do not express Hb . Furthermore, at least 6 of the 16 icefish species also do not express Mb . The lack of Hb is due apparently to deletion of the β-globin gene , which occurred during the last 10 to 16 Myr, the approximate date of divergence of the Notothenioid lineage including the Channichthyidae . In contrst, the lack of Mb appears to be due to errors in Mb gene transcription . It is not known whether the icefish have Ngbs and/or Cygbs. In the remaining two chordate phyla, the urochordate Ciona has 4 globins (see above), which when used as queries in psiblast searches, recognize the vertebrate globins and the eukaryote and bacterial FHbs and SDgbs. The report of a Hb in the notochord of the cephalochordate amphioxus (Branchiostoma californiense) , suggests that globins are present in all chordates. Among the remaining deuterostome phyla, intracellular Hbs have been reported in two of the 5 classes of echinoderms [84, 85], and we find a putative globin in the recent assembly of the genome from the sea urchin Strongylocentrotus purpuratus, which belongs to another class. No globins have been reported so far in hemichordates.
Among the remaining metazoans, direct visual observation of Hbs is highly episodic among the metazoans, and varies from total absence of Hb in porifera and cnidaria, to frequent presence in some nematode and platyhelminth groups . In the largest metazoan group, the insects, of which about 900,000 species have been described, and which outnumber the combined total of all other animal species, with estimates ranging from 2 × 106 to 100 × 106 , globins are so far known to be present only in Dipterans, such as Gasterophilus, Drosophila and Chironomus. It is evident that much work needs to be done before we have any clear idea of globin distribution in metazoans other than deuterostomes.
Among the lower eukaryotes, our census suggests that globins could be ubiquitous in fungi (FHbs), mycetozoa (FHbs), diplomonads (FHbs), ciliates (2/2 Hbs), stramenopiles (SDgb, 2/2Hb), rhodophytes (SDgb), chlorophytes (2/2Hb) and plants (SHbs, NsHbs, 2/2 Hbs). In contrast, the genomes of pathogenic microsporidians, entamoebae, apicomplexans anad trypanosomes are devoid of globins. In view of a very large potential diversity of small, bacterial-sized eukaryotes revealed by recent, culture-independent surveys [87, 88], much remains to be done.
Since our census indicates that GCSs do not occur in eukaryotes, it is necessary to consider the heme-regulated eukaryotic initiation factor 2α kinase (~630aa), which was found recently to have two heme-binding domains, of which the N-terminal domain was considered to be globin-like [89, 90] Although it was demonstrated that His78 and His123 were the two residues involved in heme binding , we find that the N-terminal 138aa of the rabbit protein (gi|462439|P33279|E2AK1 RABIT) is not recognized as a globin in CD or FUGUE searches. A blastp search of the GenBank database found at least 6 mammalian counterparts; clearly, a structure is required to determine whether these sequences are globins.
We have discussed elsewhere  the possible evolutionary scenarios for the emergence of the proposed three lineages of globins in two structural classes: (1) the 3-over-3 SD globins and FHbs, (2) the 3-over-3 GCS/Pgbs, and (3), the 2-over-2 SDgbs. Here, we would like to reiterate our proposal that all metazoan and plant globins originated from a SDgb related to the present day bacterial and algal SDgbs and the N-terminal globin domains of fungal and bacterial FHbs. This proposal is based on two results, one of which, is the clustering of all metazoan and plant sequences with the bacterial and eukaryote FHbs and SDgbs and separate from the branches encompassing 2/2 Hbs and GCSs/Pgbs, in the Bayesian phylogenetic tree obtained earlier  and in the one shown in Fig. 5. Another key result, is that bacterial SDgbs and eukaryote FHb globin domains used as queries in iterated psiblast searches, recognize (i.e. have E values substantially lower than threshold) vertebrate globins, particularly Ngbs, and other metazoan globin groups, ahead of the 2/2 Hbs and GCSs/Pgbs . Conversely, vertebrate Ngbs as a group, recognize the bacterial SDgbs ahead of other vertebrate globins. Furthermore, the tree shown in Fig. 5 is in very good agreement with recent models of vertebrate globin evolution [36, 62, 94], wherein duplication of the ancestral globin gene resulted in neuroglobin and cellular globin genes, with subsequent duplication of the latter into cellular and Hb gene loci, followed by additional duplications of the cellular globin locus into Mb and Cygb and of the Hb gene into the α- and β-globin genes.
The first attempt at defining the molecular phylogeny of globins  was a tree based on 245 sequences, all from eukaryotes. A subsequent study by Moens et al.  based on 700 sequences, including bacterial ones, proposed that all globins «evolved from a family of ancestral, ca. 17 kDa hemeproteins, which displayed the globin fold and functioned as redox proteins». The present survey of extant globins in the three kingdoms of life provides some interesting additional details: we now know that there are three lineages of globins, two with a 3/3 α-helical fold and one with the 2/2 fold, and that only the latter occurs at present in all three kingdoms of life. Furthermore, it appears likely that formation of chimeric proteins containing globin domains occurred, as separate events, prior to the emergence of the three kingdoms of life. There are no obvious explanations for the absence of FHbs in Archaea and of GCSs in eukaryotes. Although it is not possible at present to decide which of the two folds originated from the other, the occurrence of 2/2 Hbs in all three kingdoms of life would suggest that it is the ancestral fold. A recent computational study of plant Hb folding showed that one of the folding modules of rice 3/3 NsHb overlaps the 2/2 fold of Mycobacterium tuberculosis HbO (class 2) suggesting that it is an ancient structural feature of globins .
The pesence of multiple globins in all three kingdoms of life raises the question of their function, paticularly among the unicellular organisms. Although a complete discussion is not possible here, some general observations can be made, limited to organisms other than Animalia. The bacterial FHbs appear to have a role primarily in the detoxification of NO, via two NADH dependent activities, an NO dioxygenase activity under aerobic conditions, or an NO reductase activity under anaerobic conditions [42, 43, 98–100]. Furthermore, the recent demonstration that the E. coli FHb binds lipids and is an efficient alkylhydroperoxide reductase, suggests that it may be involved in the repair of lipid membranes damaged by oxidative/nitrosative stress [101–103]. The function(s) of bacterial SDgbs appear to be somewhat similar to the FHbs. Although the function of Vitreoscilla Hb is to increase the effective intracellular O2concentration under microaerobic conditions [42, 104], that of Campylobacter SDgb was shown recently to function only in NO scavenging and detoxification and not provide resistance against superoxide or peroxides . In eukaryotes, such as yeasts and Dictyostelium, FHbs also provide protection against NO [43, 106, 78], as well as enhancement of respiration, directly by functioning as an O2 buffer and indirectly, by reducing the NO concentration in the mitochondria which otherwise inhibits respiration [107–110]. In pathogenic microorganisms, FHb provides protection from human macrophage NO-mediated killing and promotes the virulence of bacteria, e.g. Salmonella , and of yeasts, Candida  and Cryptococcus neoformans, a worldwide pathogen causing pulmonary infection in animals and humans [113, 114]. It is interesting to note, as suggested by one of the reviewers, that since most of the Archaea are not pathogenic, they may have dispensed with the FHbs and their protective role in bacteria.
In plants, the SHbs have been shown to be required for establishing a low oxygen concentration for the effective functioning of the bacterial nitrogenase necessary for nitrogen fixation . Although the role of 2/2 Hbs in plants is completely unknown, the NsHbs, induced in plant cells upon exposure to low oxygen concentrations, are thought to play a role in a metabolic pathway which also involves nitric oxide and which provides an alternative type of respiration to the mitochondrial electron transport under hypoxic conditions [116, 117].
The GCSs represent one of the four known families of heme-based sensors , and can be subdivided into two groups, the HemATs and the gene regulators. The former are aerotactic heme sensors, with a C-terminal domain that is related to chemotaxis methyl-accepting proteins. Although Bacillus subtilis HemAT elicits an aerophilic response, that from the archaean Halobacterium salinarum provides an aerophobic response [44, 45]. The gene regulators have C-terminal domains, some of which may regulate second messengers and others that have unknown functions [47,48.59,118]. Although little is known about the specific function of the Pgbs, they have a Cys at position E19, similar to globins of the annelids living in sulfide-rich environments, including deep sea hydrothermal vents and marine sediments, where this residue has been implicated in sulfide binding [119, 120].
Recent studies of Mycobacterium tuberculosis 2/2 Hbs, HbN (2/2 Hb1) and HbO (2/2Hb2), indicate that the former functions in NO detoxification, while HbO, which differs substantially from HbN in structure , is expressed in association with cell membranes and significantly enhances respiration, suggesting an interaction with the electron transport chain [122–126]. It remains to be seen whether these findings apply to other bacterial 2/2 Hbs. Although nothing is known about the function of the 2/2Hb3s, it is safe to assume that it must differ from the other two classes, since 32 bacteria have two and Mycobacterium avium ssp. tuberculosis and Methylococcus capsulatus have all three classes of 2/2 Hbs.
Finally, it is appropriate to mention here the case of the extracellular Hb from the nematode Ascaris suum, an octamer of two-domain globin chains, long known for its high oxygen affinity , whose function appears to be that of an NO reductase , similar to that of fungal and bacterial FHbs. Interestingly FUGUE searches using FHb and SDgb sequences as queries, invariably include the Ascaris Hb domain 1 structure (1ash) among the highest scoring globin structures. This case provides an additional bit of evidence in support of our proposal that the FHbs and related SDgbs from bacteria, algae and unicellular eukaryotes, including fungi, are part of one of the three globin lineages, and the one from which originated all metazoan globins and all plant SHbs and NsHbs .
The phylogenomic profile derived from our survey of genomes from the three kingdoms of life presented here, delineates the present day limits of the occurrence of the three lineages of globins, and provides a clear view of the work that remains to be done. It appears likely that in contrast to archaea, where ~20% of the known genomes have globins, the majority of bacteria will be shown to have globins. Based on the prevailing opinion that all plants have globins, it is likely that this will also hold true for the unicellular, photosynthesizing eukaryotes. Globin occurrence in other unicellular eukaryotes is likely to be episodic, just as in the case of nonvertebrate metazoans.
Another obvious conclusion is that globins are mostly enzymes and less frequently sensors, and that transport of oxygen is a function that developed relatively recently, accompanying the emergence of multicellular organisms. There are at least two more known instances of evolution from enzyme to transporter. One, is the convergent evolution of indoleamine dioxygenase into a muscle heme protein with Mb-like oxygen binding properties, in gastropod molluscs . Another are hemocyanins, the copper containing respiratory proteins in molluscs and arthropods, which have evolved from phenoloxidases, prior to the divergence of Protostomes and Deuterostomes . The recent finding of hemerythrins, similar to the nonheme iron respiratory protein of Sipunculids, Brachiopods and Priapulids, in the methanotrophic Gammaproteobacterium Methylococcus capsulatus, and in other prokaryotes , and also in the hydrothermal vent annelid Riftia pachyptila (X. Bailly et al., unpublished observations), where they may have an enzymatic function, suggests yet another possible instance.
Identification of globin sequences
Putative globins and globin domains were identified in the genomes of 49 eukaryotes, 26 archaea and 245 bacteria, listed in Supplementary Data Table 1, using two approaches. In one, we examined the gene assignments based on a library of hidden Markov models , listed on the SUPERFAMILY site http://supfam.mrc-lmb.cam.ac.uk, discarded sequences shorter than 100aa and checked the alignments for the presence of His at Mb-fold position F8. In the other, we performed blastp and tblastn (version 9.2.2) searches with pairwise alignment , of completed and unfinished genomes in the GenBank, using the NCBI Entrez retrieval system http://www.ncbi.nlm.nih.gov/BLAST/, including the genomes of the rhodophyte (red alga) Cyanidioschyzon merolae http://merolae.biol.s.u-tokyo.ac.jp, chlorophyte (green algae) Chlamydomonas reinhardtii http://www.biology.duke.edu/chlamy/, and the diatoms Phaeodactylum tricornutum http://avesthagen.szbowler.com and Thalassiosira pseudonana http://genome.jgi-psf.org/thaps1/thaps1.home.html. Blastp searches use the Expect value (E) to assess the matches between the query sequence and each of the sequences in a database. Thus, E = 0.1 signifies that the probability of finding by chance, another match with the query sequence having the same score, is 1 in 10. We define recognition to be a hit with E < 0.005, the default threshold, and with the pairwise alignment fulfilling the following two criteria: proper alignment of the F8 His residues and of helices BC through G. It should be noted that blastp searches often misalign the E helices when the E7 residues are different, e.g. His and Q; however, the rest of the alignment is unaffected.
In cases where the identification of a putative globin was uncertain, searches employing CD-Search v.2.02 http://www.ncbi.nlm.nih.gov, PFAM http://www.sanger.uc.uk and FUGUE http://www-cryst.bioc.cam.ac.uk were used to determine whether the borderline sequence should be accepted as a globin.
Alignment of sequences
The putative globin sequences were aligned manually, using the procedure employed earlier in the alignment of over 700 globins , based on the myoglobin fold [138, 139], the pattern of predominantly hydrophobic residues at 37 conserved, solvent-inaccessible positions with mean solvent-accessible areas of <15Å2 , including 33 intra-helical residues defining helices A through H, A8, A11, A12, A15, B6, B9, B10, B13, B14, C4, E4, E7, E8, E11, E12, E15, E18, E19, F1, F4, G5, G8, G11, G12, G13, G15, G16, H7, H8, H11, H12, H15, and H19, the three inter-helical residues at CD1, CD4 and FG4, and the invariant His at F8. Only amino acids which occur at the 33 intrahelical positions in the foregoing alignment were allowed in the alignment of the putative globin sequences. Although earlier alignments by Kapp et al.  and by Moens et al.  had indicated that there were two invariant residues in globins, F8His and CD1Phe, the 2/2Hb family can accommodate other hydrophobic residues, such as Tyr/Met/Leu/Ile/Val at the CD1 position as well as Ala/Ser/Thr/Leu at the distal E7 position, in addition to His and Gln [16, 20–22]. Hence, in our alignments, we required a His at the proximal F8 position, a residue at the distal E7 position in the order of preference His>Gln>Leu~Thr>Ala~Val~Ser~Tyr, a hydrophobic residue at position CD1 in the order of preference Phe>Tyr>Leu>Met>Ile>Val. At position C4, usually a Pro, we accepted Ala or Ser or Thr but not a charged residue. At the interhelical position CD4, we sought a hydrophobic residue, when available, and at position FG4 we placed a hydrophobic residue in the order of preference Ile>Leu~Val>Met>Phe~Tyr. Furthermore, we avoided deletions in any of the helical regions and placed no limit on the number of residues within the interhelical regions.
Bayesian phylogenetic trees were obtained employing MrBayes Version 3.1.1 ; four chains were run simultaneously for 2000000 generations and trees were sampled every 100 generations generating a total of 20000 trees. PAUP version 4.0b10  was used for viewing and editing. The JTT transition matrix  was used as the stochastic model of amino acid substitution.
- FHb – flavohemoglobin:
chimeric proteins (~400aa) comprising a 3-over-3 N-terminal globin and a C-terminal flavin reductase domain
- GCS – globin-coupled sensors:
chimeric proteins (~300 to >700aa) comprising a 3-over-3 N-terminal globin domain and a variable C-terminal portion
3-over-3 symbiotic Hbs of legumes and other plants
3-over-3 nonsymbiotic plant Hbs
- Pgb – protoglobin:
single domain 3-over-3 globin related to the N-terminal domain of GCSs
3-over-3 single domain globin (~140aa) related to the N-terminal of FHbs
all globins that have the canonical 3-over-3 α-helical fold
- 2/2 Hbs – "truncated" Hbs that are not necessarily shorter than ~140aa:
which have the 2-over-2 α-helical fold.
This work was supported by grants from the Fund for Scientific Research Flanders (G.0331.04), the European Commission (QLG3-CT-2002-01548) and the Consejo Nacional de Ciencia y Tecnología (project no. 42873-Q), México. SD is a postdoctoral fellow of the Fund for Scientific Research Flanders (FWO).
- Appleby C: Leghemoglobin and rhizobium respiration. Ann Rev Plant Physiol. 1984, 345: 443-478.Google Scholar
- Vinogradov SN, Walz DA, Pohajdak B, Moens L, Kapp O, Suzuki T, Trotman C: Adventitious variability? The amino acid sequences of nonvertebrate globins. Comp Biochem Physiol. 1993, B106: 1-26.Google Scholar
- Weber RE, Vinogradov SN: Nonvertebrate hemoglobins: functions and molecular adaptations. Physiol Rev. 2001, 81: 569-628.PubMedGoogle Scholar
- Arredondo-Peter R, Hargrove MS, Moran J, Sarath G, Klucas RV: Plant hemoglobins. Plant Physiol. 1998, 118: 1121-1125. 10.1104/pp.118.4.1121.PubMed CentralPubMedGoogle Scholar
- Ross E, Lira-Ruan V, Arredondo-Peter R, Klucas R, Sarath G: Recent insights into plant hemoglobins. Rev Plant Biochem Biotechnol. 2002, 1: 173-189.Google Scholar
- Dordas C, Rivoal J, Hill RD: Plant haemoglobins, nitric oxide and hypoxic stress. Ann Bot (Lond). 2003, 91: 173-178. 10.1093/aob/mcf115.Google Scholar
- Vasudevan S, Armarego W, Shaw D, Lilley P, Dixon N, Poole RK: Isolation and nucleotide sequence of the hmp gene that encodes a haemoglobin-like protein in Escherichia coli K-12. Mol Gen Genet. 1991, 226: 49-58. 10.1007/BF00273586.PubMedGoogle Scholar
- Iwaasa H, Takagi T, Shikama K: Amino acid sequence of yeast hemoglobin. A twodomain structure. J Mol Biol. 1992, 227: 948-954. 10.1016/0022-2836(92)90236-D.PubMedGoogle Scholar
- Zhu H, Riggs A: Yeast flavohemoglobin is an ancient protein related to globins and a reductase family. Proc Natl Acad Sci USA . 1992, 89: 5015-5019.PubMed CentralPubMedGoogle Scholar
- Takagi T: Hemoglobins from single-celled organisms. Curr Opinion Struct Biol. 1993, 3: 413-418. 10.1016/S0959-440X(05)80115-0.Google Scholar
- Potts M, Angeloni S, Ebel R, Bassam D: Myoglobin in a cyanobacterium. Science. 1992, 256: 1690-1692.PubMedGoogle Scholar
- Kaneko T, Tabata S: Complete genome structure of the unicellular cyanobacterium Synechocystis sp. PCC6803. Plant Cell Physio. 1997, 38: 1171-1176.Google Scholar
- Scott N, Falzone C, Vuletich D, Zhao J, Bryant V, Lecomte JT: Truncated hemoglobin from the cyanobacterium Synechococcus sp. PCC 7002: evidence for hexacoordination and covalent adduct formation in the ferric recombinant protein. Biochemistry. 2002, 41: 6902-6910. 10.1021/bi025609m.PubMedGoogle Scholar
- Vandergon T, Riggs C, Gorr T, Colacino J, Riggs A: The mini-hemoglobins in neural and body wall tissue of the nemertean worm Cerebratulus lacteus. J Biol Chem. 1998, 273: 16998-17011. 10.1074/jbc.273.27.16998.PubMedGoogle Scholar
- Couture M, Yeh S, Wittenberg BA, Wittenberg JB, Ouellet Y, Rousseau D, Guertin M: A cooperative oxygen-binding hemoglobin from Mycobacterium tuberculosis. Proc Natl Acad Sci USA. 1999, 96: 1223-11228. 10.1073/pnas.96.20.11223.Google Scholar
- Wittenberg JB, Bolognesi M, Wittenberg BA, Guertin M: Truncated hemoglobins: a new family of hemoglobins widely distributed in bacteria, unicellular eukaryotes, and plants. J Biol Chem. 2002, 277: 871-874. 10.1074/jbc.R100058200.PubMedGoogle Scholar
- Couture M, Chamberland H, St-Pierre B, Lafontaine J, Guertin M: Nuclear genes encoding chloroplast hemoglobins in the unicellular green alga Chlamydomonas eugametos. Mol Gen Genet. 1994, 243: 185-197.PubMedGoogle Scholar
- Couture M, Das T, Lee H, Peisach J, Rousseau D, Wittenberg BA, Wittenberg JB, Guertin M: Chlamydomonas chloroplast ferrous hemoglobin. Heme pocket structure and reactions with ligands. J Biol Chem. 1999, 274: 6898-6910. 10.1074/jbc.274.11.6898.PubMedGoogle Scholar
- Watts R, Hunt P, Hvitved A, Hargrove M, Peacock W, Dennis E: A hemoglobin from plants homologous to truncated hemoglobins of microorganisms. Proc Natl Acad Sci USA. 2001, 98: 10119-101124. 10.1073/pnas.191349198.PubMed CentralPubMedGoogle Scholar
- Pesce A, Couture M, Dewilde S, Guertin M, Yamauchi K, Ascenzi P, Moens L, Bolognesi M: A novel two-over-two alpha-helical sandwich fold is characteristic of the truncated hemoglobin family. EMBO J. 2000, 19: 2424-2434. 10.1093/emboj/19.11.2424.PubMed CentralPubMedGoogle Scholar
- Milani M, Pesce A, Ouellet Y, Ascenzi P, Guertin M, Bolognesi M: Mycobacterium tuberculosi s haemoglobin N displays a protein tunnel suited for O2 diffusion to the haem. EMBO J. 2001, 20: 3902-3909. 10.1093/emboj/20.15.3902.PubMed CentralPubMedGoogle Scholar
- Milani M, Savard P, Oullet H, Ascenzi P, Guertin M, Bolognesi M: A TyrCD1/TrpG8 hydrogen bond network and a TyrB10-TyrCD1 covalent link shape the heme distal site of Mycobacterium tuberculosis hemoglobin O. Proc Natl Acad Sci USA. 2003, 100: 5766-5771. 10.1073/pnas.1037676100.PubMed CentralPubMedGoogle Scholar
- Blaxter ML: Nemoglobins: divergent nematode globins. Parasitology Today. 1993, 9: 353-360. 10.1016/0169-4758(93)90082-Q.PubMedGoogle Scholar
- Kloek A, Sherman D, Goldberg DE: Novel gene structure and evolutionary context of Caenorhabditis elegans globin. Gene. 1993, 129: 215-221. 10.1016/0378-1119(93)90271-4.PubMedGoogle Scholar
- Kloek A, McCarter J, Setterquist R, Schedl T, Goldberg DE: Caenorhabditis globin genes: rapid intronic divergence contrasts with conservation of silent exonic sites. J Mol Evol. 1996, 43: 101-108. 10.1007/BF02337354.PubMedGoogle Scholar
- Blaxter ML, Ingram L, Tweedie S: Sequence, expression and evolution of the globins of the parasitic nematode Nippostrongylus brasiliensis. Mol Biochem Parasitol. 1994, 68: 1-14. 10.1016/0166-6851(94)00127-8.PubMedGoogle Scholar
- Mansell J, Timms K, Tate W, Moens L, Trotman CN: Expression of a globin gene in Caenorhabditis elegans. Biochem Mol Biol Int. 1993, 30: 643-647.PubMedGoogle Scholar
- Vanfleteren JR, Van de Peer Y, Blaxter M, Tweedie S, Trotman C, Lu L, Van Hauwaert M, Moens L: Molecular genealogy of some nematode taxa based on cytochrome c and globin amino acid sequences. Mol Phylogenet Evol. 1994, 3: 92-101. 10.1006/mpev.1994.1012.PubMedGoogle Scholar
- Neuwald A, Liu J, Lipman D, Lawrence C: Extracting protein alignment models from the sequence database. Nucl Acids Re. 1997, 25: 1665-1677. 10.1093/nar/25.9.1665.Google Scholar
- Hoogewijs D, Geuens E, Dewilde S, Moens L, Vierstraete A, Vinogradov SN, Vanfleteren JR: Genome-wide analysis of the globin gene family of C. elegans. IUBMB Life. 2004, 56: 697-702.PubMedGoogle Scholar
- Burmester T, Hankeln T: A globin gene of Drosophila melanogaster. Mol Biol Evol. 1999, 16: 1809-1811.PubMedGoogle Scholar
- Ebner B, Burmester T, Hankeln T: Globin genes are present in Ciona intestinalis. Mol Biol Evo. 2003, 20: 1521-1525. 10.1093/molbev/msg164.Google Scholar
- Burmester T, Weich B, Reinhardt S, Hankeln T: A vertebrate globin expressed in the brain. Nature. 2000, 407: 520-523. 10.1038/35035093.PubMedGoogle Scholar
- Trent J, Watts R, Hargrove M: Human neuroglobin, a hexacoordinate hemoglobin the reversibly binds oxygen. J Biol Chem. 2001, 276: 30106-30110. 10.1074/jbc.C100300200.PubMedGoogle Scholar
- Kawada N, Kristensen D, Asahina K, Nakatani K, Minamiyama Y, Seki S, Yoshizato K: Characterization of a stellate cell-activation associated protein (STAP) with peroxidase activity found in rat hepatic stellate cells. J Biol Chem. 2001, 276: 25318-25323. 10.1074/jbc.M102630200.PubMedGoogle Scholar
- Burmester T, Ebner B, Weich B, Hankeln T: Cytoglobin: a novel globin type ubiquitously expressed in vertebrate tissues. Mol Biol Evol. 2002, 19: 416-421.PubMedGoogle Scholar
- Trent J, Hargrove MS: A ubiquitously expressed human hexacoordinate hemoglobin. J Biol Chem. 2002, 277: 19538-19545. 10.1074/jbc.M201934200.PubMedGoogle Scholar
- Hankeln T, Ebner B, Fuchs C, Gerlach F, Haberkamp M, Laufs TL, Roesner A, Schmidt M, Weich B, Wystub S, Saaler-Reinhardt S, Reuss S, Bolognesi M, De Sanctis D, Marden MC, Kiger L, Moens L, Dewilde S, Nevo E, Avivi A, Weber RE, Fago A, Burmester T: Neuroglobin and cytoglobin in search of their role in the vertebrate globin family. J Inorg Biochem. 2005, 99: 110-119. 10.1016/j.jinorgbio.2004.11.009.PubMedGoogle Scholar
- Kugelstadt D, Haberkamp M, Hankeln T, Burmester T: Neuroglobin, cytoglobin, and a novel, eye-specific globin from chicken. Biochem Biophys Res Commun. 2004, 325: 719-225. 10.1016/j.bbrc.2004.10.080.PubMedGoogle Scholar
- Roesner A, Fuchs C, Hankeln T, Burmester T: A globin gene of ancient evolutionary origin in lower vertebrates: evidence for two distinct globin families in animals. Mol Biol Evol. 2005, 22: 12-20. 10.1093/molbev/msh258.PubMedGoogle Scholar
- Poole RK, Hughes MN: New functions for the ancient globin family: bacterial responses to nitric oxide and nitrosative stress. Mol Microbiol. 2000, 36: 775-783. 10.1046/j.1365-2958.2000.01889.x.PubMedGoogle Scholar
- Frey AD, Kallio PT: Bacterial hemoglobins and flavohemoglobins: versatile proteins and their impact on microbiology and biotechnology. FEMS Microbiol Rev. 2003, 27: 525-545. 10.1016/S0168-6445(03)00056-1.PubMedGoogle Scholar
- Wu G, Wainwright L, Poole RK: Microbial globins. Adv Microb Physiol. 2003, 47: 255-310.PubMedGoogle Scholar
- Hou S, Larsen R, Boudko D, Riley C, Karatan E, Zimmer M, Ordal G, Alam M: Myoglobin-like aerotaxis transducers in Archaea and Bacteria. Nature. 2000, 403: 540-544. 10.1038/35000570.PubMedGoogle Scholar
- Yu H, Saw J, Hou S, Larsen R, Watts K, Johnson M, Zimmer M, Ordal G, Taylor B, Alam M: Aerotactic responses in bacteria to photoreleased oxygen. FEMS Microbiol Lett. 2002, 217: 237-242.PubMedGoogle Scholar
- Hou S, Freitas T, Larsen R, Piatibratov M, Sivozhelezev V, Yamamoto A, Meleshkevitch E, Zimmer M, Ordal G, Alam M: Globin-coupled sensors: a class of heme-containing sensors in Archaea and Bacteria. Proc Natl Acad Sci USA. 2001, 98: 9353-9358. 10.1073/pnas.161185598.PubMed CentralPubMedGoogle Scholar
- Freitas T, Hou S, Alam M: The diversity of globin-coupled sensors. FEBS Lett. 2003, 552: 99-104. 10.1016/S0014-5793(03)00923-2.PubMedGoogle Scholar
- Freitas T, Hou S, Dioum E, Saito J, Newhouse J, Gonzalez G, Gilles-Gonzalez MA, Alam M: Ancestral hemoglobins in Archaea. Proc Natl Acad Sci USA. 2004, 101: 6675-6680. 10.1073/pnas.0308657101.PubMed CentralPubMedGoogle Scholar
- Vinogradov SN, Hoogewijs D, Bailly X, Arredondo-Peter R, Guertin M, Gough J, Dewilde S, Moens L, Vanfleteren JR: Three globin lineages belonging to two structural classes in genomes from the three kingdoms of life. Proc Natl Acad Sc USA. 2005, 102: 11385-11389. 10.1073/pnas.0502103102.Google Scholar
- Baldauf SL, Bjattacharya D, Cockrill J, Hugenholtz D, Pawlowski J, Simpson A: The tree of life. An overview. Assembling the Tree of Life. Edited by: Cracraft J, Donaghue MJ. 2004, Oxford University Press, Oxford, UK, 43-75.Google Scholar
- Zhang W, Phillips GN: Structure of the oxygen sensor in Bacillus subtilis. signal transduction of chemotaxis by control of symmetry. Structure. 2003, 11: 1097-1108. 10.1016/S0969-2126(03)00169-2.PubMedGoogle Scholar
- Schleper C, Jurgens G, Jonuscheit M: Genomic studies of uncultivated Archaea. Nature Rev Microbiol. 2005, 3: 479-488. 10.1038/nrmicro1159.Google Scholar
- Vuletich D, Lecomte JTJ: A phylogenetic and structural analysis of truncated hemoglobins. J Mol Evol. 2006, 62: 196-210. 10.1007/s00239-005-0077-4.PubMedGoogle Scholar
- Lira-Ruan V, Sarath G, Klucas RV, Arredondo-Peter R: In silico analysis of a flavohemoglobin from Sinorhizobium meliloti strain 1021. Microbiol Res. 2003, 158: 215-227. 10.1078/0944-5013-00200.PubMedGoogle Scholar
- Wakabayashi S, Matsubara H, Webster D: Primary sequence of a dimeric bacterial haemoglobin from Vitreoscilla. Nature. 322: 481-483. 10.1038/322481a0.
- Kim Y, Joachimiak A, Skarina T, Bochkarev A, Savchenko A, Edwards A: Crystal structure of Pa3967 from Pseudomonas aeruginosa Pao1, a hypothetical protein which is highly homologous to human hemoglobin in structure. 2004, unpublishedGoogle Scholar
- Tarricone C, Galizzi A, Coda A, Ascenzi P, Bolognesi M: Unusual structure of the oxygen-binding site in the dimeric bacterial hemoglobin from Vitreoscilla sp. Structure. 1997, 5: 497-507. 10.1016/S0969-2126(97)00206-2.PubMedGoogle Scholar
- Ilari A, Bonamore A, Farina A, Johnson K, Boffi A: The X-ray structure of ferric Escherichia coli flavohemoglobin reveals an unexpected geometry of the distal heme pocket. J Biol Chem. 2002, 277: 23725-23732. 10.1074/jbc.M202228200.PubMedGoogle Scholar
- Ermler U, Siddiqui R, Cramm R, Friedrich B: Crystal structure of the flavohemoglobin from Alcaligenes eutrophus at 1.75 A resolution. EMBO J. 1995, 14: 6067-6077.PubMed CentralPubMedGoogle Scholar
- Vandamme P, Coenye T: Taxonomy of the genus Cupriavidus : a tale of lost and found. Int J Syst Evol Microbiol. 2004, 54: 2285-2289. 10.1099/ijs.0.63247-0.PubMedGoogle Scholar
- Freitas T, Saito J, Hou S, Alam M: Globin-coupled sensors, protoglobins, and the last universal common ancestor. J Inorg Biochem. 2005, 99: 23-33. 10.1016/j.jinorgbio.2004.10.024.PubMedGoogle Scholar
- Zhang W, Phillips GN: Structure of the oxygen sensor in Bacillus subtilis. signal transduction of chemotaxis by control of symmetry. Structure. 2003, 11: 1097-1108. 10.1016/S0969-2126(03)00169-2.PubMedGoogle Scholar
- Moran NA: Tracing the evolution of gene loss in obligate bacterial symbionts. Curr Opinion Microbiol. 2003, 6: 512-518. 10.1016/j.mib.2003.08.001.Google Scholar
- Goh S-H, Lee Y, Bhanu N, Cam M, Desper R, Martin B, Moharram R, Gherman R, Miller JL: A newly discovered human α-globin gene. Blood. 2005, 106: 1466-1472. 10.1182/blood-2005-03-0948.PubMed CentralPubMedGoogle Scholar
- Brownlie A, Hersey C, Oates A, Paw B, Falick B, Witkowska H, Flint J, Higgs D, Jessen J, Bahary N, Zhu H, Lin S, Zon L: Characterization of embryonic globin genes of the zebrafish. Dev Biol. 2003, 255: 48-61. 10.1016/S0012-1606(02)00041-6.PubMedGoogle Scholar
- Gillemans N, McMorrow T, Tewari R, Wai A, Burgtorf C, Drabek D, Ventress N, Langeveld A, Higgs D, Tan-Un K, Grosveld F, Philipsen S: Functional and comparative analysis of globin loci in pufferfish and humans. Blood. 2003, 101: 2842-2849. 10.1182/blood-2002-09-2850.PubMedGoogle Scholar
- Dehal P, Satou Y, Campbell RK, Chapman J, Degnan B, De Tomaso A, Davidson B, Di Gregorio A, Gelpke M, Goodstein DM, Harafuji N, Hastings KE, Ho I, Hotta K, Huang W, Kawashima T, Lemaire P, Martinez D, Meinertzhagen IA, Necula S, Nonaka M, Putnam N, Rash S, Saiga H, Satake M, Terry A, Yamada L, Wang HG, Awazu S, Azumi K, Boore J, Branno M, Chin-Bow S, DeSantis R, Doyle S, Francino P, Keys DN, Haga S, Hayashi H, Hino K, Imai KS, Inaba K, Kano S, Kobayashi K, Kobayashi M, Lee BI, Makabe KW, Manohar C, Matassi G, Medina M, Mochizuki Y, Mount S, Morishita T, Miura S, Nakayama A, Nishizaka S, Nomoto H, Ohta F, Oishi K, Rigoutsos I, Sano M, Sasaki A, Sasakura Y, Shoguchi E, Shin-i T, Spagnuolo A, Stainier D, Suzuki MM, Tassy O, Takatori N, Tokuoka M, Yagi K, Yoshizaki F, Wada S, Zhang C, Hyatt PD, Larimer F, Detter C, Doggett N, Glavina T, Hawkins T, Richardson P, Lucas S, Kohara Y, Levine M, Satoh N, Rokhsar DS: The draft genome of Ciona intestinalis: insights into chordate and vertebrate origins. Science. 2002, 298: 2157-2167. 10.1126/science.1080049.PubMedGoogle Scholar
- Small D, Moylan T, Vayda M, Sidell BD: The myoglobin gene of the Antarctic icefish, Chaenocephalus aceratus, contains a duplicated TATAAAA sequence that interferes with transcription. J Exp Biol. 206: 131-139. 10.1242/jeb.00067.
- Rubin GM, Yandell MD, Wortman JR, Gabor Miklos GL, Nelson CR, Hariharan IK, Fortini ME, Li PW, Apweiler R, Fleischmann W, Cherry JM, Henikoff S, Skupski MP, Misra S, Ashburner M, Birney E, Boguski MS, Brody T, Brokstein P, Celniker SE, Chervitz SA, Coates D, Cravchik A, Gabrielian A, Galle RF, Gelbart WM, George RA, Goldstein LS, Gong F, Guan P, Harris NL, Hay BA, Hoskins RA, Li J, Li Z, Hynes RO, Jones SJ, Kuehl PM, Lemaitre B, Littleton JT, Morrison DK, Mungall C, O'Farrell PH, Pickeral OK, Shue C, Vosshall LB, Zhang J, Zhao Q, Zheng XH, Lewis S: Comparative genomics of the eukaryotes. Science. 287: 2204-2215. 10.1126/science.287.5461.2204.
- Hankeln T, Jaenicke V, Kiger L, Dewilde S, Ungerechts G, Schmidt M, Urban J, Marden M, Moens L, Burmester T: Characterization of Drosophila hemoglobin. Evidence for hemoglobin-mediated respiration in insects. J Biol Chem. 2002, 277: 29012-29017. 10.1074/jbc.M204009200.PubMedGoogle Scholar
- Richards S, Liu Y, Bettencourt B, Hradecky P: Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene and cis-element evolution. Genome Res. 2005, 15: 1-18. 10.1101/gr.3059305.PubMed CentralPubMedGoogle Scholar
- Trevaskis B, Watts R, Andersson C, Llewellyn D, Hargrove MS, Olson JS, Dennis ES, Peacock WJ: Two hemoglobin genes in Arabidopsis thaliana : the evolutionary origins of leghemoglobins. Proc Natl Acad Sci USA. 1997, 94: 12230-12234. 10.1073/pnas.94.22.12230.PubMed CentralPubMedGoogle Scholar
- Arredondo-Peter R, Hargrove M, Sarath G, Moran J, Lohrman J, Olson JS, Klucas RV: Rice haemoglobins. Gene cloning, analysis, and O-binding kinetics of a recombinant protein synthesized in Escherichia coli. Plant Physiol. 1997, 115: 1259-1266. 10.1104/pp.115.3.1259.PubMed CentralPubMedGoogle Scholar
- Lira-Ruan V, Ross E, Sarath G, Klucas RV, Arredondo-Peter R: Mapping and analysis of a hemoglobin gene family from Oryza sativa. Plant Physiol Biochem. 2002, 40: 199-202. 10.1016/S0981-9428(02)01365-7.Google Scholar
- Dujon B, Sherman D, Fischer G, Durrens P, Casaregola S: Genome evolution in yeasts. Nature. 2004, 430: 35-44. 10.1038/nature02579.PubMedGoogle Scholar
- Sartori G, Aldegheri L, Mazzotta G, Lanfranchi G, Tournu H, Brown A, Carignani G: Characterization of a new hemoprotein in the yeast Saccharomyces cerevisiae. J Biol Chem. 1999, 274: 5032-5037. 10.1074/jbc.274.8.5032.PubMedGoogle Scholar
- Andersson J, Sjogren A, Davis L, Embley T, Roger RJ: Phylogenetic analyses of diplomonad genes reveal frequent lateral gene transfers affecting eukaryotes. Curr Biol. 2003, 31: 94-102. 10.1016/S0960-9822(03)00003-4.Google Scholar
- Iijima M, Shimizu H, Tanaka Y, Urushihara H: Identification and characterization of two flavohemoglobin genes in Dictyostelium discoideum. Cell Struct Funct. 2000, 25: 47-55. 10.1247/csf.25.47.PubMedGoogle Scholar
- Hemmingsen EA: Respiratory and cardiovascular adaptation in hemoglobin-free fish: resolved and unresolved problems. Biology of Antarctic Fish. Edited by: di Prisco G, Maresca B, Tota B. 1991, Springer-Verlag, New York, 191-203.Google Scholar
- Vayda M, Small D, Yuan M, Costello L, Sidell B: Conservation of the myoglobin gene among Antarctic notothenioid fishes. Mol Mar Biol Biotechnol. 1997, 6: 207-216.PubMedGoogle Scholar
- diPrisco G, Cocca E, Parker S, Detrich H: Tracking the evolutionary loss of hemoglobin expression by the white-blooded Antarctic icefishes. Gene. 2002, 295: 185-191. 10.1016/S0378-1119(02)00691-1.Google Scholar
- Bargelloni L, Lecointre G: Four years of notothenioid systematics: a molecular perspective. Fishes of Antarctica. Edited by: di Prisco G, Pisano E, Clarke A. 1998, Springer-Verlag Italia, Milan, Italy, 259-273.Google Scholar
- Bishop J, Vandergon T, Green D, Doeller J, Kraus D: A high-affinity hemoglobin is expressed in the notochord of amphioxus, Branchiostoma californiense. Biol Bull. 1998, 195: 255-259.Google Scholar
- Baker S, Terwilliger NB: Hemoglobin structure and function in the rat-tailed sea cucumber Paracaudina chilensis. Biol Bull. 1993, 185: 115-122.Google Scholar
- Christensen A, Colacino J, Bonaventura C: Functional and biochemical properties of the hemoglobins of the burrowing brittle star Hemipholis elongata Say (Echinodermata, Ophiuroidea). Biol Bull . 2003, 205: 54-65.PubMedGoogle Scholar
- Nielsen E, Mound L: Global diversity of insects: the problem of estimating numbers. Nature and Human Society: The Quest for a Sustainable World. Edited by: Raven PH. 1997, National Academy Press, Washington, DC, 213-222.Google Scholar
- Moreira D, Lopez-Garcia P: The molecular ecology of microbial eukaryotes unveils a hidden world. Trends Microbiol. 2002, 10: 31-38. 10.1016/S0966-842X(01)02257-0.PubMedGoogle Scholar
- Falkowski P, Katz M, Knoll A, Quigg A, Raven J, Schofield O, Taylor F: The evolution of modern eukaryotic phytoplankton. Science. 2004, 305: 354-360. 10.1126/science.1095964.PubMedGoogle Scholar
- Uma S, Matts R, Guo Y, White S, Chen JJ: The N-terminal region of the heme-regulated eIF2alpha kinase is an autonomous heme binding domain. Eur J Biochem. 2000, 267: 498-506. 10.1046/j.1432-1327.2000.01021.x.PubMedGoogle Scholar
- Rafie-Kolpin M, Chefalo P, Hussain Z, Hahn J, Uma S, Matts R, Chen JJ: Two heme-binding domains of heme-regulated eukaryotic initiation factor-2alpha kinase. N terminus and kinase insertion. J Biol Chem . 2000, 275: 5171-5178. 10.1074/jbc.275.7.5171.PubMedGoogle Scholar
- Inuzuka T, Yun B, Ishikawa H, Takahashi S, Hori H, Matts R, Ishimori K, Morishima I: Identification of crucial histidines for heme binding in the N-terminal domain of the heme-regulated eIF2alpha kinase. J Biol Chem. 2004, 279: 6778-6782. 10.1074/jbc.C300464200.PubMedGoogle Scholar
- LaCount M, Zhang E, Chen Y, Han K, Whitton M, Lincoln D, Woodin S, Lebioda L: The crystal structure and amino acid sequence of dehaloperoxidase from Amphitrite ornat a indicate common ancestry with globins. J Biol Chem. 2000, 27: 18712-18716. 10.1074/jbc.M001194200.Google Scholar
- Hourdez S, Lallier F, De Cian M, Green B, Weber R, Toulmond A: Gas transfer system in Alvinella pompejana: functional properties of intracellular and extracellular hemoglobins. Physiol Biochem Zool. 2000, 73: 365-373. 10.1086/316755.PubMedGoogle Scholar
- Hardison RC: Organization, evolution and regulation of the globin genes. Disorders of Hemoglobin. Edited by: Steinberg MH, Forget BG, Higgs DR, Nagel RL. 2001, Cambridge University Press, Cambridgge, UK, 95-116.Google Scholar
- Goodman M, Pedwaydon J, Czelusniak J, Suzuki T, Gotoh T, Moens L, Shishikura F, Walz DA, Vinogradov SN: An evolutionary tree for invertebrate globin sequences. J Mol Evol. 1988, 27: 236-249. 10.1007/BF02100080.PubMedGoogle Scholar
- Moens L, Vanfleteren J, Van de Peer Y, Peeters K, Kapp O, Czeluzniak J, Goodman M, Blaxter M, Vinogradov SN: Globins in nonvertebrate species: dispersal by horizontal gene transfer and evolution of the structure-function relationships. Mol Biol Evol. 1996, 13: 324-333.PubMedGoogle Scholar
- Nakajima S, Álvarez-Salgado E, Kikuchi T, Arredondo-Peter R: Prediction of folding pathway and kinetics among plant hemoglobins using an average distance map method. Proteins Struct Funct Bioinfor. 2005, 61: 500-506. 10.1002/prot.20658.Google Scholar
- Farres J, Rechsteiner M, Herold S, Frey AD, Kallio PT: Ligand binding properties of bacterial hemoglobins and flavohemoglobins. Biochemistry. 2005, 44: 4125-4134. 10.1021/bi047389d.PubMedGoogle Scholar
- Gardner PM: Nitric oxide dioxygenase function and mechanism of flavohemoglobins, hemoglobins, myoglobins and their associated reductases. J Inorg Biochem. 2005, 99: 247-266. 10.1016/j.jinorgbio.2004.10.003.PubMedGoogle Scholar
- Poole RK: Nitric oxide and nitrosative stress tolerance in bacteria. Biochem Soc Trans. 2005, 33: 176-180. 10.1042/BST0330176.PubMedGoogle Scholar
- Bonamore A, Gentili P, Ilari A, Schinina M, Boffi A: Escherichia coli flavohemoglobin is an efficient alkylhydroperoxide reductase. J BiolChem. 2003, 278: 22272-22277.Google Scholar
- Bonamore A, Farina A, Gattoni M, Schinina M, Bellelli A, Boffi A: Interaction with membrane lipids and heme ligand binding properties of Escherichia coli flavohemoglobin. Biochemistry. 2003, 42: 5792-5801. 10.1021/bi0206311.PubMedGoogle Scholar
- D'Angelo P, Lucarelli D, della Longa S, Benfatto M, Hazemann J, Feis A, Smulevich G, Ilari A, Bonamore A, Boffi A: Unusual heme iron-lipid acyl chain coordination in Escherichia coli flavohemoglobin. Biophys J. 2004, 86: 3882-3892. 10.1529/biophysj.103.034876.PubMed CentralPubMedGoogle Scholar
- Frey AD, Kallio PT: Nitric oxide detoxification – a new era for bacterial globins in biotechnology?. Trends Biotechnol. 2005, 23: 69-73. 10.1016/j.tibtech.2004.12.002.PubMedGoogle Scholar
- Elvers K, Wu G, Gilberthorpe N, Poole RK, Park SF: Role of an inducible single-domain hemoglobin in mediating resistance to nitric oxide and nitrosative stress in Campylobacter jejuni and Campylobacter coli. J Bacteriol. 2004, 186: 5332-5341. 10.1128/JB.186.16.5332-5341.2004.PubMed CentralPubMedGoogle Scholar
- Wu G, Wainwright L, Membrillo-Hernández J, Poole RK: Bacterial hemoglobins: old proteins with "new" functions? Roles of respiratory and nitric oxide metabolism. Respiration in Archea and Bacteria, Diversity of Prokaryotic Electron Transport Carriers. Edited by: Zanoni D. 2004, Academic Press, London, UK, 1: 255-310.Google Scholar
- Buisson N, Labbe-Bois R: Flavohemoglobin expression and function in Saccharomyces cerevisia e. No relationship with respiration and complex response to oxidative stress. J Biol Chem. 1998, 273: 9527-9533. 10.1074/jbc.273.16.9527.PubMedGoogle Scholar
- Elvers K, Turner S, Wainwright L, Marsden G, Hinds J, Cole J, Poole RK, Penn C, Park SF: NssR, a member of the Crp-Fnr superfamily from Campylobacter jejuni, regulates a nitrosative stress-responsive regulon that includes both a single-domain and a truncated haemoglobin. Mol Microbiol. 2005, 57: 735-750. 10.1111/j.1365-2958.2005.04723.x.PubMedGoogle Scholar
- Liu L, Zeng M, Hausladen A, Heitman J, Stamler J: Protection from nitrosative stress by yeast flavohemoglobin. Proc Natl Acad Sci USA. 2000, 97: 4672-4676. 10.1073/pnas.090083597.PubMed CentralPubMedGoogle Scholar
- Shikama K, Matsuoka A: Structure-function relationships in unusual nonvertebrate globins. Crit Rev Biochem Mol Biol. 2004, 39: 217-259. 10.1080/10409230490514008.PubMedGoogle Scholar
- Stevanin T, Poole RK, Demoncheaux E, Read R: Flavohemoglobin Hmp protects Salmonella enterica serovar typhimurium from nitric oxide-related killing by human macrophages. Infect Immun. 2000, 70: 4399-4405. 10.1128/IAI.70.8.4399-4405.2002.Google Scholar
- Ullmann B, Myers H, Chiranand W, Lazzell A, Zhao Q, Vega L, Lopez-Ribot J, Gardner PR, Gustin M: Inducible defense mechanism against nitric oxide in Candida albicans. Eukaryote Cell. 2004, 3: 715-723. 10.1128/EC.3.3.715-723.2004.Google Scholar
- de Jesus-Berrios M, Liu L, Nussbaum J, Cox G, Stamler J, Heitman J: Enzymes that counteract nitrosative stress promote fungal virulence. Curr Biol. 2003, 13: 1963-1968. 10.1016/j.cub.2003.10.029.PubMedGoogle Scholar
- Idnurm A, Reedy J, Nussbaum J, Heitman J: Cryptococcus neoformans virulence gene discovery through insertional mutagenesis. Eukaryote Cell. 2004, 3: 420-429. 10.1128/EC.3.2.420-429.2004.Google Scholar
- Ott T, van Dongen J, Gunther C, Krusell L, Desbrosses G, Vigeolas H, Bock V, Czechowski T, Geigenberger P, Udvardi MK: Symbiotic leghemoglobins are crucial for nitrogen fixation in legume root nodules but not for general plant growth and development. Curr Biol. 2005, 15: 531-5. 10.1016/j.cub.2005.01.042.PubMedGoogle Scholar
- Kundu S, Trent J, Hargrove M: Plants, humans and hemoglobins. Trends Plant Sci. 2003, 8: 387-393. 10.1016/S1360-1385(03)00163-8.PubMedGoogle Scholar
- Igamberdiev A, Baron K, Manac'h-Little N, Stoimenova M, Hill RD: The haemoglobin/nitric oxide cycle: involvement in floooding stress and effects of hormone signaling. Ann Bot. 2005, 96: 557-564. 10.1093/aob/mci210.PubMed CentralPubMedGoogle Scholar
- Gilles-Gonzalez MA, Gonzalez G: Heme-based sensors: defining characteristics, recent developments, and regulatory hypotheses. J Inorg Biochem. 2005, 99: 1-22. 10.1016/j.jinorgbio.2004.11.006.PubMedGoogle Scholar
- Bailly X, Jollivet D, Vanin S, Deutsch J, Zal F, Lallier F, Toulmond A: Evolution of the sulfide-binding function within the globin multigenic family of the deep-sea hydrothermal vent tubeworm Riftia pachyptila. Mol Biol Evol. 2002, 19: 1421-1433.PubMedGoogle Scholar
- Bailly X, Leroy R, Carney S, Collin O, Zal F, Toulmond A, Jollivet D: The loss of the hemoglobin H2 S-binding function in annelids from sulfide-free habitats reveals molecular adaptation driven by Darwinian positive selection. Proc Natl Acad Sci USA. 2003, 100: 5885-5890. 10.1073/pnas.1037686100.PubMed CentralPubMedGoogle Scholar
- Lecomte JTL, Vuletich D, Lesk AM: Structural divergence and distant relationships in proteins: evolution of the globins. Curr Opinion Struct Biol. 2005, 15: 290-301. 10.1016/j.sbi.2005.05.008.Google Scholar
- Ouellet H, Ouellet Y, Richard C, Labarre M, Wittenberg BA, Wittenberg JB, Guertin M: Truncated hemoglobin HbN protects Mycobacterium bovis from nitric oxide. Proc Natl Acad Sci USA. 2002, 99: 5902-5907. 10.1073/pnas.092017799.PubMed CentralPubMedGoogle Scholar
- Pathania R, Navani N, Rajamohan G, Dikshit KL: Mycobacterium tuberculosis hemoglobin HbO associates with membranes and stimulates cellular respiration of recombinant Escherichia coli. J Biol Chem. 2002, 277: 15293-15302. 10.1074/jbc.M111478200.PubMedGoogle Scholar
- Pathania R, Navani N, Gardner AM, Gardner PR, Dikshit KL: Nitric oxide scavenging and detoxification by the Mycobacterium tuberculosis haemoglobin. Mol Microbiol. 2002, 45: 1303-1314. 10.1046/j.1365-2958.2002.03095.x.PubMedGoogle Scholar
- Milani M, Pesce A, Ouellet H, Guertin M, Bolognesi M: Truncated hemoglobins and nitric oxide action. IUBMB Life. 2003, 55: 623-627.PubMedGoogle Scholar
- Mukai M, Savard P, Ouellet H, Guertin M, Yeh S: Unique ligand-protein interactions in a new truncated hemoglobin from Mycobacterium tuberculosis. Biochemistry. 2002, 41: 3897-3905. 10.1021/bi0156409.PubMedGoogle Scholar
- Goldberg DE: Oxygen-avid hemoglobin of Ascaris. Chem Rev. 1999, 99: 3371-3378. 10.1021/cr970152l.PubMedGoogle Scholar
- Minning D, Gow A, Bonaventura J, Braun R, Dewhirst M, Goldberg D, Stamler J: Ascaris haemoglobin is a nitric oxide-activated 'deoxygenase'. Nature. 1999, 401: 497-502. 10.1038/46822.PubMedGoogle Scholar
- Suzuki T, Imai K: Evolution of myoglobin. Cell Mol Life Sci. 1998, 54: 979-1004. 10.1007/s000180050227.PubMedGoogle Scholar
- Immesberger A, Burmester T: Putative phenoloxidases in the tunicate Ciona intestinali s and the origin of the arthropod hemocyanin superfamily. J Comp Physiol. 2004, B174: 169-180.Google Scholar
- Karlsen O, Ramsevik L, Bruseth L, Larsen Ø, Brenner A, Berven F, Jensen H, Lillhaug J: Characterization of a prokaryotic haemerythrin from the methanotrophic bacterium Methylococcus capsulatu s (Bath). FEBS J. 2005, 272: 2428-2440. 10.1111/j.1742-4658.2005.04663.x.PubMedGoogle Scholar
- Gough J, Karplus K, Hughey R, Chothia C: Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol. 2001, 313: 903-919. 10.1006/jmbi.2001.5080.PubMedGoogle Scholar
- Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralPubMedGoogle Scholar
- Marchler-Bauer A, Bryant S: CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 2004, 32: W327-331.PubMed CentralPubMedGoogle Scholar
- Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy S, Griffiths-Jones S, Howe K, Marshall M, Sonnhammer E: The Pfam protein families database. Nucleic Acids Res. 2002, 30: 276-280. 10.1093/nar/30.1.276.PubMed CentralPubMedGoogle Scholar
- Shi J, Blundell T, Mizuguchi K: FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties. J Mol Biol. 2001, 310: 243-257. 10.1006/jmbi.2001.4762.PubMedGoogle Scholar
- Kapp O, Moens L, Vanfleteren J, Trotman C, Suzuki T, Vinogradov SN: Alignment of 700 globin sequences: extent of amino acid substitution and its correlation in volume. Protein Sci. 1995, 4: 2179-2190.PubMed CentralPubMedGoogle Scholar
- Lesk AM, Chothia C: How different amino acid sequences determine similar protein structures. The structure and evolutionary dynamics of the globins. J Mol Biol. 1980, 136: 225-270. 10.1016/0022-2836(80)90373-3.PubMedGoogle Scholar
- Bashford D, Chothia C, Lesk AM: Determinants of a protein fold. Unique features of the globin amino acid sequences. J Mol Biol. 1987, 196: 199-216. 10.1016/0022-2836(87)90521-3.PubMedGoogle Scholar
- Gerstein M, Sonnhammer ELL, Chothia C: Volume changes in protein evolution. J Mol Biol. 1994, 236: 1067-1078. 10.1016/0022-2836(94)90012-4.PubMedGoogle Scholar
- Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.PubMedGoogle Scholar
- Swofford D: Phylogenetic Analysis Using Parsimony. Ver.4.0b 10. 2001, Sinauer AssociatesGoogle Scholar
- Jones DT, Taylor WR, Thornton JM: The rapid generation of mutation data matrices from protein sequences. Cabios. 1992, 8: 275-282.PubMedGoogle Scholar
- Singleton P, Sainsbury D: Dictionary of Microbiology and Molecular Biology. 2001, J. Wiley & Sons, Chichester, UK, 3Google Scholar
- Singleton P: Bacteria in Biology, Biotechnology and Medicine. 2004, J. Wiley & Sons, Chichester, UK, 6Google Scholar
- Jaillon O, Aury J-M, Brune F: Genome duplication in the teleost fish Tetraodon nigroviridis reveal the early veertebrate proto-karyotype. Nature. 2004, 431: 946-975. 10.1038/nature03025.PubMedGoogle Scholar
- Kellis M, Patterson N, Endrizzi M, Birren B, Landers ES: Sequencing and comparison of yeast species to identify genes and regulatory element. Nature. 2003, 423: 241-254. 10.1038/nature01644.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.