Phylogenetic analysis of eIF4E-family members
© Joshi et al; licensee BioMed Central Ltd. 2005
Received: 27 May 2005
Accepted: 28 September 2005
Published: 28 September 2005
Translation initiation in eukaryotes involves the recruitment of mRNA to the ribosome which is controlled by the translation factor eIF4E. eIF4E binds to the 5'-m7Gppp cap-structure of mRNA. Three dimensional structures of eIF4Es bound to cap-analogues resemble 'cupped-hands' in which the cap-structure is sandwiched between two conserved Trp residues (Trp-56 and Trp-102 of H. sapiens eIF4E). A third conserved Trp residue (Trp-166 of H. sapiens eIF4E) recognizes the 7-methyl moiety of the cap-structure. Assessment of GenBank NR and dbEST databases reveals that many organisms encode a number of proteins with homology to eIF4E. Little is understood about the relationships of these structurally related proteins to each other.
By combining sequence data deposited in the Genbank databases, we have identified sequences encoding 411 eIF4E-family members from 230 species. These sequences have been deposited into an internet-accessible database designed for sequence comparisons of eIF4E-family members. Most members can be grouped into one of three classes. Class I members carry Trp residues equivalent to Trp-43 and Trp-56 of H. sapiens eIF4E and appear to be present in all eukaryotes. Class II members, possess Trp→Tyr/Phe/Leu and Trp→Tyr/Phe substitutions relative to Trp-43 and Trp-56 of H. sapiens eIF4E, and can be identified in Metazoa, Viridiplantae, and Fungi. Class III members possess a Trp residue equivalent to Trp-43 of H. sapiens eIF4E but carry a Trp→Cys/Tyr substitution relative to Trp-56 of H. sapiens eIF4E, and can be identified in Coelomata and Cnidaria. Some eIF4E-family members from Protista show extension or compaction relative to prototypical eIF4E-family members.
The expansion of sequenced cDNAs and genomic DNAs from all eukaryotic kingdoms has revealed a variety of proteins related in structure to eIF4E. Evolutionarily it seems that a single early eIF4E gene has undergone multiple gene duplications generating multiple structural classes, such that it is no longer possible to predict function from the primary amino acid sequence of an eIF4E-family member. The variety of eIF4E-family members provides a source of alternatives on the eIF4E structural theme that will benefit structure/function analyses and therapeutic drug design.
The recruitment of mRNAs to the ribosomal apparatus is a key step in the regulation of translation initiation. For the majority of eukaryotic mRNAs, recruitment is dependent upon the activity of the translation initiation factor eIF4E. eIF4E binds to the 5'-m7GTP-cap structure of mRNAs and to the initiation factor eIF4G (reviewed [1–3]). Through the interaction of the eIF4E:eIF4G complex with ribosome bound factor eIF3, the 40 S ribosomal subunit is positioned at the 5'-end of the mRNA. Subsequently, the 40 S ribosomal subunit scans the mRNA (5'-3') for the translational start codon prior to 60 S binding and formation of the first peptide bond.
The crystal structures of eIF4Es from Mus musculus and Homo sapiens and the solution structure of eIF4E from Saccharomyces cerevisiae, in each case bound to cap-analogues, show that each consists of an eight-stranded β-sheet supported by three α-helices forming the palm and back of a 'cupped' hand [4–6]. Two conserved aromatic Trp residues (Trp-56 and Trp-102 for H. sapiens eIF4E) grasp the aromatic guanine residue of the cap-structure through 'π'-bond interactions [4, 5]. Similar interactions of aromatic amino acid residues with the guanine nucleotide of cap-analogues are seen in the structures of other cap-binding proteins which appear to have evolved independently such as the vaccinia virus 2'-O-methyltransferase, VP39, and of the nuclear cap-binding protein subunit, CBP20, suggesting a common evolutionary theme for methylguanosine/nucleotide-interaction . Hydrogen bonds to the guanine base from a Glu residue (Glu-103 of H. sapiens eIF4E) and the adjacent peptide bond stabilize the interaction of the cap-analogue to the protein. A third Trp residue (Trp-166 of H. sapiens eIF4E) interacts with the N7-methyl moiety of the cap-structure. Sequence comparisons of mammalian eIF4E with eIF4Es from plants and S. cerevisiae, coupled with deletion analyses of eIF4Es from S. cerevisiae and Danio rerio, suggest that the N- and C-termini of eIF4E are dispensable for translation and that the core of eIF4E represented by ~170 amino acids (from His-37 to His-200 in H. sapiens eIF4E) is sufficient for binding to the cap-structure and to eIF4G and 4E-BPs [8, 9]. However, the N- and C-termini may be involved in the regulation of eIF4E-activity [10, 11] or affect the stability of the protein.
eIF4E-activity is regulated by the actions of eIF4E-binding proteins or 4E-BPs which share sequence similarity with the eIF4E-binding domain within the N-terminal region of eIF4G (reviewed ). 4E-BPs act as competitive inhibitors of eIF4E-eIF4G interaction [13–15]. Crystal structures of mouse eIF4E bound to 4E-BPs and fragments of eIF4G show that both proteins interact with eIF4E via a common mechanism involving a sequence with the consensus YxxxxLΦ (where Φ is a hydrophobic residue) . Hyper-phosphorylation of 4E-BPs occurs following stimulation of the Akt/FRAP/TOR signal transduction pathway and results in a reduced affinity for eIF4E [17–19].
Studies of M. musculus eIF4E bound to either a fragment of eIF4G or 4E-BP1 have revealed that His-37, Pro-38, Val-69, Trp-73, Leu-131, Glu-132, and Leu-135 (numbers for H. sapiens eIF4E) interact with the eIF4E-binding regions within eIF4G and 4E-BPs . Val-69 and Trp-73 are within a conserved sequence of the consensus (S/T)V(e/d)(e/d)FW (where the acidic residues are not completely conserved). Substitution of a non-aromatic amino acid for Trp-73 has been shown to disrupt the ability of eIF4E to interact with eIF4G and 4E-BPs [20, 21]. Substitution of a Gly residue in place of Val-69 creates an eIF4E variant that binds still binds 4E-BP1 but has a reduced capacity to interact with both eIF4G and 4E-BP2 .
eIF4E is ubiquitously expressed and is generally isolated from cell extracts using m7GTP-affinity matrices. Use of such matrices led to the conclusion that in mammalian cells, eIF4E was represented by a single polypeptide of ~25 kDa. Similar chromatographic resolution of proteins from plant cell extracts suggested that plants differ from mammalian cells in that they contain two different but related proteins termed plant eIF4E and eIF(iso)4E, or p26 and p28 (in reference to their apparent molecular weights as judged by SDS-PAGE) [22, 23]. A gene encoding eIF4E from S. cerevisiae was isolated and shown by southern analyses and gene disruption studies to be the sole eIF4E gene in that organism , a conclusion confirmed by the availability of the sequence of the complete genome. S. cerevisiae lacking a functional eIF4E-gene can be rescued by exogenous expression of mammalian eIF4E showing that S. cerevisiae and mammalian eIF4Es are structurally and functionally comparable . Overall, these findings suggested that, with the exception of plants, organisms contain a single gene that encodes eIF4E.
Growing evidence from genome/EST sequencing projects has revealed that many organisms contain multiple genes encoding proteins that have sequence similarity to recognized, or prototypical, eIF4E proteins such as mammalian eIF4E (reviewed in [25–27]). Consequently, the translation factor eIF4E and its relatives comprise a family of structurally related proteins within a particular organism. To distinguish the recognized vertebrate eIF4E from its relatives, vertebrate eIF4E has since been renamed eIF4E-1  (or eIF4E-1A ). The functions of the eIF4E-related proteins are not yet understood. Some may act as translation factors and stimulate global mRNA recruitment, or specifically the recruitment of a subset of mRNAs [29, 30]. Others may possess only partial activities when compared to prototypical eIF4Es [26, 31] and thus act as inhibitors of mRNA recruitment. Lack of detection of these eIF4E related proteins in fractions derived from cell extracts resolved by m7GTP-affinity chromatography may reflect a variety of underlying causes. The proteins may be expressed ordinarily at low levels, at specific developmental times, or in restricted and untested tissues [9, 26, 30, 32]. Alternatively, they may be unresolved from eIF4E-1 in fractionation by standard polyacrylamide gel electrophoresis. Conversely, they may fail to interact stably with , or recognize structures that differ from, the m7GTP-cap-structure [28, 33, 34] preventing isolation using standard m7GTP-affinity resins.
At the time of writing, BLAST search of the NCBI GenBank NR database using the amino acid sequence of M. musculus eIF4E-1 as a probe recovers <100 unique cDNA sequences of eIF4E-family members with expected values below 14. Only some of these are recognized as eIF4E-family members in the GenBank database. Additional eIF4E-family member sequences can be uncovered from genomic sequences, although these predicted sequences are subject to errors arising from less than adequate predictions of intron/exon boundaries. Through mining of the GenBank dbEST database and assembling sequences of overlapping cDNA fragments to derive consensus cDNA sequences, as well as performing reiterative searches, we have been able to extend the number of identified complete or partial cDNA sequences encoding eIF4E-family members to 379 (derived from 204 taxonomic species). A further 32 eIF4E-family members from 26 additional species can be predicted from the genomic sequences of organisms known to lack or contain few introns in genes transcribed from RNA polymerase II promoters. The sequences of identified eIF4E-family members have been deposited in an internet-accessible database designed for sequence analyses of eIF4E-family members . Analyses of the sequences suggest that the eIF4E-structure has been duplicated numerous times during evolution producing new forms of the protein that may serve other tasks or regulate the activities of the prototypical translation factor.
Results and discussion
Definition of the amino acid core of an eIF4E-family member
Acquisition of nucleotide sequences encoding putative eIF4E-family members
In order to obtain nucleotide coding sequences representing an accurate description of the repertoire of functional genes encoding eIF4E-family members within an organism it was decided that some evidence should support the expression of the eIF4E-family member for which sequence is obtained. Consequently, for the most part, sequences of expressed sequence tags (ESTs) were acquired. In general, direct use of genome sequences was avoided due to the possibility of including pseudogenes and the possible inaccuracy with which intron/exon boundaries can be predicted. However, where sufficient EST data to verify nucleotide sequences encoding the core region of an eIF4E-family member was absent, it was considered reasonable to use genome sequences for confirmation. Furthermore, the use of genome sequences was considered valid for organisms whose genomes are known to lack, or contain few, introns in genes transcribed by RNA polymerase II such as some Protista and yeasts. In such cases, genome sequences were used only if sequences indicated that indeed no introns were present in the gene representing an eIF4E-family member and that expressed cDNAs could be identified in the same, or closely related organisms.
Overall statistics of the dataset of nucleotide sequences encoding eIF4E-family members
Databank from which sequences were acquired
Number of sequences
Number of species
Number of eIF4E-family members
Identification and verification of nucleotide sequences encoding eIF4E-family members
Region within the coding sequences of an eIF4E-family member
Number of eIF4E-family members
Complete coding sequence
Sequence encoding the entire core region3
120 (142 >90%)
Total dataset (any region identified)
Nucleotide sequences encoding an eIF4E-family member from a particular species were aligned to produce complete or partial consensus cDNA sequences. In many cases the initiation and/or stop codons could not be accurately identified because of a lack of overlapping clones to remove sequence errors or a lack of clones representing full length cDNA products. However, sufficient sequence information was usually available to identify the complete core regions. Analyses of consensus sequences encoding 220 representative eIF4E-family members from 118 species of eukaryotes are presented.
The eIF4E-family of proteins
Dissection of dataset with respect to taxonomic divisions
No. of eIF4E-family members identified1
No. of species represented2
Class I eIF4E-family members
Evidence supporting the presence of two distinct Class I sub-group 2 eIF4E-family members represented by viridiplantae eIF4E and eIF(iso)4E, can be found in species from the viridiplantae classes Liliopsida and Eudicotyledons. Sequences representing either one of eIF4E or eIF(iso)4E can be identified in other viridiplantae classes suggesting that the genes arose from an earlier duplication event. The two forms are closely related in sequence (Figure 4A and 4B) and each possesses all the activities attributed to mammalian eIF4E-1 in vitro [23, 37]. However, expression of A. thaliana eIF(iso)4E in S. cerevisiae lacking a functional endogenous eIF4E-gene results in slower growth relative to similar expression of A. thaliana eIF4E . In addition, levels of expression of A. thaliana eIF4E and eIF(iso)4E differ in various A. thaliana tissues; eIF4E is expressed ubiquitously (with the exception of tissues in the zone of specialization of the root); eIF(iso)4E is expressed more abundantly in developing tissues . Subtle differences in their relative activities can be inferred from the requirements for each in potyvirus infected cells. Both plant eIF4E and eIF(iso)4E bind to potyviral genome-linked proteins (Vpgs) . However, strains of A. thaliana lacking eIF(iso)4E, or of P. sativum carrying variants of eIF4E which lack cap-binding ability lose susceptibility to potyvirus infection [40, 41].
In certain plant species from Eudicotyledons, Liliopsida and Coniferospida, multiple forms of eIF4E and eIF(iso)4E can be found. Instances of apparent gene duplication can be seen in species such as Zea mays and Triticum aestivum with respect to eIF4E (p26) and eIF(iso)4E (p28). However, there is no evidence to support the hypothesis that these duplications occurred prior to speciation. In contrast, evidence suggesting gene duplication prior to speciation can be found (compare eIF4E-family member names and residues shaded in cyan in Figure 4A and 4B) in the Solanaceae, in which Lycopersicon esculentum (tomato) and Solanum tuberosum (potato) both have two forms of eIF4E(p26) (A and B), and in Salicaceae which have two forms of eIF(iso)4E(p28) (A and B).
Evidence for gene duplication of Class I eIF4E-family member genes eIF4E-1 can also be found in Metazoa with respect to Chordata, Insecta and Nematoda. Three eIF4E-1 sub-family members can be identified from the zebrafish Danio rerio, termed eIF4E-1A, eIF4E-1B  and eIF4E-1C (Figure 5A and 5B). Orthologues of the gene encoding eIF4E-1B, but not that of eIF4E-1C, can be found in almost all species above Actinoptergyii for which sequence has been acquired. Both eIF4E-1A and eIF4E-1B possess similar levels of identity when compared to mammalian eIF4E-1s and possess all known residues required for interaction with the cap-structure, eIF4G, and 4E-BPs. eIF4E-1A, like mammalian eIF4E-1, is expressed ubiquitously and can restore the growth of S. cerevisiae lacking a functional eIF4E-gene . It can bind to the cap-structure, eIF4G, and 4E-BP in vitro . Conversely, eIF4E-1B is expressed only during early embryogenesis and in the gonads and muscles of adult fish, and is unable to complement yeast lacking eIF4E. Furthermore, eIF4E-1B is unable to bind to the cap-structure, eIF4G, or 4E-BP in vitro.
Drosophila melanogaster has a total of six genes, all of which are expressed, encoding Class I eIF4E-family members eIF4E-1a-f (also termed eIF4E-1, 4, 5, 3, 7, and 6, respectively ). There is also evidence of a seventh Class I eIF4E-family member (termed eIF4E-2 in ) that arises from alternate splicing of primary transcripts and shares the same core sequence as eIF4E-1a. Four of the genes share exon/intron structure in their carboxy-terminal regions and form a cluster in the genome. All Class I eIF4Es from D. melanogaster bind to cap-analogue. Furthermore, all of them, except eIF4E-1f (eIF4E-6 in ), which has a truncated carboxy-terminal domain, are able to interact with D. melanogaster eIF4G or 4E-BP. The expression of each has been shown to vary throughout the life cycle of the fly. Examination of both expressed sequences and partial or complete genome sequences of insect species has not revealed a similar repertoire of Class I eIF4E-family members outside the genus of Drosophila (data not shown).
An alignment of Class I eIF4E-family members from nematodes is presented in Figure 6A. Although only a single Class I eIF4E orthologue can be found in Ascaris suum, many nematodes express more than one Class I family member (Figure 6B). Four of the five C. elegans eIF4E-family members (termed IFEs for i nitiation f actor of e legans), are Class I members. With respect to activities, IFE-3 corresponds to mammalian eIF4E-1 and binds only to mono-methylated cap-structures. However, in nematodes, a proportion of the mRNAs possesses a tri-methyl-cap arising from the post-transcriptional addition of a tri-methyl-cap containing spliced leader RNA (SLRNA) to the 5' end of a transcribed mRNA [43, 44]. The translation of such trans-spliced mRNAs in C. elegans is thought to be mediated by IFE-1, 2, and 5 since they, unlike IFE-3, interact with both mono- and tri-methylated cap-structures [28, 33]. IFE-1, 2, and 5 possess more similarity to IFE-3 in sequence than to Class I family members from other phyla of Metazoa suggesting they arose from gene-duplications of a progenitor IFE-3 (Figure 2 and 6B). Evidence in support of this hypothesis comes from recent studies of the only identified IFE-3-like protein from the nematode Ascaris suum. A. suum IFE-3 (also termed eIF4E-3) can bind and stimulate the translation of mRNAs possessing mono- or tri-methylated cap structures  in vitro. Furthermore, identified sequences from some nematodes, such as the parasitic Haemanchus contortus suggest that they express single form of eIF4E similar to IFE-3 and a single form related to IFE-1, -2 or -5. No direct relatives of IFE-1, -2 and -5-like proteins have been found in any taxonomic group other than Nematoda.
Multiple Class I eIF4E-family members can also be found in the fission yeast Schizosaccharomyces pombe. S. pombe expresses two eIF4E-family members that share 52 % identity termed eIF4E1 and eIF4E2 . Unlike eIF4E1, expression of eIF4E2 is not essential. However, levels of eIF4E1 and eIF4E2 vary with growth temperature and at higher temperatures eIF4E2 is more abundant than eIF4E1. Both eIF4E1 and eIF4E2 can bind the cap-structure with similar affinity in vitro but eIF4E1 has a 100-fold greater affinity for S. pombe eIF4G than eIF4E2 despite the fact that both share all known amino acids required for interaction with eIF4G. No evidence supporting the expression of two genes directly related to those for S. pombe eIF4E1 and eIF4E2 has been found.
Class II eIF4E-family members
Studies have shown the Class II eIF4E-related proteins eIF4E-2A (H. sapiens and M. musculus; also termed eIF4E-2, 4EHP or 4E-LP), IFE-4, (C. elegans), D. melanogaster eIF4E-2 (eIF4E-8 in ), and nCBP (A. thaliana), like the mammalian translation factor eIF4E-1, all bind the m7GTP-cap structure [26, 29, 31, 42]. nCBP from A. thaliana differs from H. sapiens and Mus musculus eIF4E-2A and D. melanogaster eIF4E-2 in that A. thaliana nCBP can interact with eIF4G and participate in productive translation , whereas H. sapiens and M. musculus eIF4E-2A and D. melanogaster eIF4E-2 cannot [26, 42]. Consistent with this observation, M. musculus eIF4E-2A and D. melanogaster eIF4E-2 cannot substitute for S. cerevisiae eIF4E in a strain lacking a functional eIF4E-gene [26, 42]. Recent studies have shown that H. sapiens and M. musculus eIF4E-2A can interact with 4E-BPs but to a lesser degree than mammalian eIF4E-1 [26, 47]. Although mammalian eIF4E-2A mRNA appears to be expressed in all tissues, the levels of M. musculus eIF4E-2A protein are ~10-fold lower than eIF4E-1 .
Given that metazoan eIF4E-2 cannot itself partake in protein synthesis due to its inability to interact with eIF4G, growing evidence suggests a regulatory role for metazoan eIF4E-2 family members. Like mammalian eIF4E-2A, D. melanogaster eIF4E-2 (eIF4E-8 in ) appears to expressed at much lower levels than the major Class I form, eIF4E-1a (eIF4E-1 in ), although it is present at all stages of the life cycle. Mutants of D. melanogaster that express a markedly reduced level of eIF4E-2 show defects in anterior-posterior axis formation during early embryogenesis . Development of the anterior-posterior axis in D. melanogaster embryos is dependent on the distribution of the maternal effect genes which include bicoid and caudal. In the oocyte, caudal mRNA is evenly distributed, whereas bicoid mRNA is restricted to the anterior of the cell. Translation of bicoid mRNA is activated upon fertilization resulting in a gradient of bicoid protein decreasing toward the posterior of the embryo. Through interaction of bicoid with a region within the 3' UTR of caudal mRNA, the translation of caudal mRNA is inhibited resulting in an opposing gradient of caudal expression. Evidence suggests that D. melanogaster eIF4E-2 binds specifically to a region of bicoid resembling the eIF4E-binding region within eIF4G. This suggests that inhibition of caudal mRNA translation is due to sequestration of the caudal mRNA into a inactive 'circular' complex with which eIF4E-1 and ribosomes cannot interact. Such a mechanism of translational regulation through eIF4E-2 may not be restricted to D. melanogaster. The nematode Class II representative, IFE-4, is expressed in C. elegans in pharyngeal and tail neurons, body wall muscle, spermatheca and vulva, suggesting a special use . Reduction of IFE-4 expression by RNA-interference or introduction of a null mutation produces a pleiotropic phenotype that includes an egg laying defect. Microarray analyses of mRNAs translated in the absence of IFE-4 expression suggest that IFE-4 is required for translation of a subset of mRNAs [28, 30]. In mammals, expression from the H. sapiens eIF4E-2A gene (EIF4EL3) is upregulated following conversion of primary solid tumors to associated metastases  further suggesting a regulatory role for this protein.
Evidence for the expression of two distinct sub-forms of Class II eIF4E-family members can be recognized in the Actinopterygii (D. rerio, O. mykiss, and T. rubripes) and Amphibia (A. mexicanum, X. laevis and X. tropicalis) (see species containing both eIF4E-2A and eIF4E-2B in Figure 2, and data not shown). The actinopterygian and amphibian sub-forms termed eIF4E-2A represent orthologues of mammalian eIF4E-2A. The sub-forms termed eIF4E-2B are ~85–90 % identical within the core regions to eIF4E-2A sub-forms from the same species. Examination of the genomes of H. sapiens and M. musculus fails to reveal genes corresponding to a mammalian eIF4E-2B sub-form.
Class III eIF4E-family members
Class III eIF4E-family members from Vertebrata possess a non-aromatic Cys residue at the position equivalent to Trp-56 of H. sapiens eIF4E-1. Since Trp-56 together with Trp-102 of H. sapiens eIF4E-1 partakes in π-bond stacking interactions with the guanine base of the cap-structure, the ability of vertebrate eIF4E-3 to interact with the cap structure would be unexpected. However, studies with M. musculus eIF4E-3 have shown that the protein does interact with the cap-structure in vitro suggesting that stacking of only one aromatic residue is sufficient for cap-interaction . However, the interaction of M. musculus eIF4E-3 with the cap-structure is weaker than that of either mammalian eIF4E-1 or eIF4E-2. Furthermore, M. musculus eIF4E-3 is less able to distinguish between 7-methylated and non-methylated GTP. M. musculus eIF4E-3 can interact with eIF4G but not with 4E-BPs suggesting that it may participate in translation. However, M. musculus eIF4E-3 is unable to rescue the growth of S. cerevisiae lacking a functional eIF4E-gene. The weaker interaction of M. musculus eIF4E-3 with the cap-structure relative to eIF4E-1 suggests that this protein may be involved in sequestration of eIF4G resulting in inhibition of cap-dependent translation. The distribution of eIF4E-3 in adult mice differs from that of both eIF4E-1 and eIF4E-2, with highest levels of expression in skeletal muscle.
eIF4E-family members from some species of Protista show extension or compaction relative to Class I, II, and III eIF4E-family members
eIF4E-family members of the Alveolata, Stramenopiles, and Haptophyta (presented as those of sub-group 8 in Figure 2) possess ~20 % identity and ~40 % similarity with respect to the core regions of eIF4E-family members from Class I, II and III (Figure 3B, and data not shown). Members of sub-group 8 possess either a Trp or Tyr residue equivalent to Trp-56 of H. sapiens eIF4E-1 and a Trp at the residue equivalent to H. sapiens Trp-43 (Figure 9A). Consequently, sub-group 8 members have Class I, Class II (fungal), or Class III-like signatures. However, members of sub-group 8 also possess extended stretches of 12–15 amino acids between residues equivalent to Trp-73 and Trp-102 of H. sapiens eIF4E-1, and 4–9 amino acids between residues equivalent to Trp-102 and Trp-166. Such stretches, the purpose of which are not known, are not seen in any family members of Class I, II and III and suggest that sub-group 8 members have specialized functions relative to those of Class I, II and III eIF4Es. Consequently, subgroup 8 members could be considered a fourth Class of eIF4E-family member.
A clue as to the possible role of extended sequences in the basic structure of eIF4E arises from the studies of eIF4E-family members from the trypanosome Leishmania major. Four intron-less eIF4E-family member genes can be identified from the known genomic sequences of L. major (data not shown). The sequences of two of these members (Leish4E-1 and Leish4E-2) are also presented in Figure 9A. Leish4E-1 and Leish4E-2 possess a Class I-like signature (Trp-residues at the equivalent of H. sapiens eIF4E-1 residues 43 and 56). Like family members of sub-group 8, both Leish4E-1 and Leish4E-2 contain extended amino acid stretches between structural units of the core, although the positions or lengths of the extensions differ from those found in sub-group 8 eIF4Es. In Trypanosomatidae, polycistronic pre-mRNA transcripts are processed to generate monocistronic RNAs which are further modified by the addition of capped spliced leader (SL) RNA. Unlike the tri-methylated SLRNAs from nematodes, the SLRNA of Trypanosomatidae possess mono-methylated cap-structures  that are further modified on the first four transcribed nucleotides by addition of 2'-O methyl groups resulting in all mRNAs containing so-called cap-4 structures . The presence of cap-4 and/or the nucleotide sequence of the SLRNA is thought to be required for efficient recruitment of trans-spliced mRNAs into polysomes . Studies in vitro have shown that L. major, LeishIF4E-1, which contains two areas of extended sequence between the structural units of the core, binds both m7GTP and the cap-4 structure with similar affinities . This is in contrast to mammalian eIF4E-1 which possesses a 5-fold greater affinity for m7GTP compared to cap-4. Although trans-splicing has not been demonstrated in Alveolata, Stramenopiles, or Haptophyta, the presence of extensions within the core regions may signify that these eIF4E-family members, like Leish4E-1, recognize more complex cap-structures, specific modifications, or other sequences close to the cap-structure of a mRNA. This is of particular interest because many representatives of these sub-kingdoms are parasites or infectious agents.
Although reliant only on genomic data, specialization of a different nature can be seen in eIF4E-family members from the microsporidian, Encephalitozoon cuniculi, and of the algal endosymbiont of the cryptophyte Guillardia theta (Figure 9B). The diminutive genome of the G. theta algal endosymbiont (~0.55 Mb ) has undergone extreme compaction relative to the genomes of other Rhodophyta such as Cyanidioschyzon merolae which has a genome of ~16.5 Mb . The E. cuniculi genome is also highly compacted at ~2.9 Mb . Consistent with compaction, predicted Class I-like eIF4E-family members from both possess short N-and/or C-termini relative to eIF4E-family members from other species.
Trp-73 of H. sapiens eIF4E-1 has been shown to be involved in the interaction of eIF4E-1 with eIF4G and 4E-BPs. With the exception of D. melanogaster eIF4E-1d, all members of the three defined non-protist structural classes of eIF4E-family members possess a Trp-residue equivalent to Trp-73 of H. sapiens eIF4E-1. Examination of the eIF4G-binding regions of eIF4E-family members from both E. cuniculi and the G. theta nucleomorph reveals differences relative to H. sapiens eIF4E-1. In both cases, residues equivalent to Trp-73 of H. sapiens eIF4E-1 are substituted by a non-aromatic Leu residue. As discussed earlier, such a substitution in H. sapiens eIF4E-1 has been shown to impair the ability of eIF4E-1 to interact with either eIF4G or with 4E-BPs [20, 21]. In the case of the G. theta algal endosymbiont this variation may not be remarkable since the genome of the nucleomorph appears to lack any identifiable sequence encoding eIF4G-like or 4E-BP-like proteins. Since the genome of the G. theta endosymbiont has been so severely compacted, it is hypothesized that the genes encoding complex cellular functions in the endosymbiont, such as protein synthesis, are limited to the minimal set needed to accomplish the function. Although the endosymbiont has its own mRNAs with 5'-caps and poly(A) tails, elongation and release factors, its genome only encodes a subset of translational initiation factors: eIF1, eIF1A, eIF4A, eIF2 (all subunits, although the alpha subunit is truncated), eIF4E, eIF6 and poly(A) binding protein. The genes for many initiation factors thought to be essential for cannot be identified including eIF4B, eIF5, or the scaffold protein eIF3 (any subunit).
eIF4E, a translational initiation factor found only in eukaryotes has a unique alpha/beta fold that is considered to have no homologues outside the eukaryotes, as determined by sequence comparison or structural analyses . The expansion of sequenced cDNAs and genomic DNAs from organisms of all taxonomic kingdoms has significantly altered our picture of eIF4E which must now be considered to be one member of a structurally related family of eIF4E-like proteins.
Evolutionarily it seems that a single early eIF4E gene has undergone multiple gene duplications generating multiple structural classes. The functions of each member of each structural class remain to be completely understood. However, it is no longer possible to predict eIF4E-function from the primary amino acid sequence of an eIF4E-family member, as exemplified by the functional diversity of examples mentioned here and recently reviewed elsewhere . The ancestral gene of eIF4E has also provided a blueprint for the generation of related proteins with specialized functions found only in certain taxonomic groups. The C. elegans IFE-1, -2 and -5 appear to result from gene duplications that occurred within early nematodes to give rise to a specialized sub-class that recognizes alternate cap structures. No direct relatives of these tri-methyl-cap binding proteins can be found in any other phyla. The extended eIF4Es of certain protists, like L. major have evolved independently to fulfill a similar function. These eIF4E variants seem likely to provide a rich source of variations on the eIF4E structural theme that will provide unique opportunities for structure/function studies and therapeutic drug design. As more sequence data becomes available and more eIF4E-family members are tested for their activities both in vitro and in vivo our understanding of the origins and functions of individual members will advance. The data provided here have been deposited in an internet accessible database for online access to assembled sequences encoding eIF4E-family members . The site has been developed to allow easy searches, as well as sequence comparisons and other analyses.
Acquisition of cDNA sequences encoding eIF4E-family members
The nucleotide sequences encoding M. musculus eIF4E-1, eIF4E-2, and eIF4E-3, T. aestivum eIF4E and eIF(iso)4E, A. thaliana nCBP, C. elegans IFE-1, 2, 3, 4, and 5, and S. cerevisiae eIF4E were used to probe GenBank (NR), and dbEST databases for homologous cDNA sequences from other species through use of the BLAST 2.0 software package. In an iterative process, the retrieved sequences were used to re-probe the databanks to obtain further sequences of overlapping cDNA fragments from the same organism or to obtain related sequences from additional species. For budding yeasts and some protists, which lack, or possess few numbers of, introns in genes transcribed from RNA polymerase II promoters, genomic sequences encoding eIF4Es were also acquired from the GenBank database. The expression of many of these genomic sequences have been verified by the presence of at least one EST sequence for each.
Derivation of consensus cDNA sequences encoding eIF4E-family members
Overlapping nucleotide sequences encoding an eIF4E-family member from a particular species were aligned to produce complete or partial consensus cDNA sequences. In most cases, sequences within 3'-UTRs were used to verify that EST sequences described the same eIF4E-family member. However, due to usage of alternative splicing and/or of alternative polyadenylation sites, 3'-UTRs sometimes differed in sequence. Sequences were not considered verified unless a minimum of two sequences representing overlapping cDNA fragments confirmed the assignment of a single nucleotide. Where multiple nucleotides assignments for a particular base in a consensus sequence were supported by multiple sequences these variations were considered to represent polymorphisms or variations in strains. To allow for such variations, information about the strain used for development of the cDNA library from which an EST was recovered (provided by some submitters to GenBank databanks) was utilized to select a subset of sequences. Where strain information was not available, or where strain information still suggested multiple possibilities for an assignment, the assignment of a base supported by the majority EST sequences was chosen. In almost all cases these variations led to either no change at the amino acid level due to codon-degeneracy, or a single amino acid variation. Furthermore, such variations were either confined to regions encoding segments N-terminal of the core of the eIF4E-family member or had no affect on the prediction of amino acids in positions related to those known to be involved in the interaction of mammalian eIF4E-1 with the cap-structure, eIF4G, and 4E-BPs.
Alignments and analyses of sequences representing eIF4E-family members
Since alignments of mammalian eIF4E-1s, plant eIF4E and eIF(iso)4E and S. cerevisiae eIF4E suggest the presence of an evolutionarily conserved core region and because deletion analyses of S. cerevisiae eIF4E and D. rerio eIF4E-1A suggest that the N-termini and C-termini are dispensable with respect to cap-binding, eIF4G and 4E-BP interaction, sequences representing the core region of an eIF4E-family member were used for analyses unless otherwise stated. Amino acid alignments were performed using ClustalW (version 1.8) software and adjusted as necessary . Alignments of the nucleotide sequences representing the core regions of eIF4E-family members (~480–510 nucleotides per member) were generated by reverse translation of amino acid alignments, and substitution of the factual nucleotide sequences. Concensus phylogenetic trees were generated from nucleotide alignments using the "neighbor-joining" and "boot-strapping" algorithms within Mega 3.0 . In some cases, only a part of the nucleotide sequence representing the core region of an eIF4E-family member could be identified. Where indicated, partial sequences were included in the analyses. Unless stated, names assigned to eIF4E-family members were based on designations previously applied by investigators in the field of translational control.
Analyses of the consensus cDNA sequences of 220 representative eIF4E-family members from 118 species are presented. The predicted amino acid sequences, accession numbers of all sequences used to derive consensus cDNA sequences, and full names of all the species from which they were derived, are supplied in the form of Additional File 1. Sequence alignments and derived concensus cDNA sequences corresponding to all eIF4E-family members identified are available through the 'eIF4E-family member database .
This work was supported by NSF grant MCB-0134013 to R. J. and B. J. K. L. was funded in part by REU awards from NSF and in part by the NOAA-EPP-funded Living Marine Resources Cooperative Science Center.
- Gingras AC, Raught B, Sonenberg N: eIF4 initiation factors: effectors of mRNA recruitment to ribosomes and regulators of translation. Annu Rev Biochem. 1999, 68: 913-963. 10.1146/annurev.biochem.68.1.913.View ArticlePubMedGoogle Scholar
- Hershey JWB, Merrick WC: Pathway and mechanism of initiation of protein synthesis. Translational control of gene expression. Edited by: Sonenberg N, Hershey JWB, Mathews MB. 2000, Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY, 33-88.Google Scholar
- von der Haar T, Gross JD, Wagner G, McCarthy JE: The mRNA cap-binding protein eIF4E in post-transcriptional gene expression. Nat Struct Mol Biol. 2004, 11: 503-511. 10.1038/nsmb779.View ArticlePubMedGoogle Scholar
- Marcotrigiano J, Gingras AC, Sonenberg N, Burley SK: Cocrystal structure of the messenger RNA 5' cap-binding protein (eIF4E) bound to 7-methyl-GDP. Cell. 1997, 89: 951-961. 10.1016/S0092-8674(00)80280-9.View ArticlePubMedGoogle Scholar
- Matsuo H, Li H, McGuire AM, Fletcher CM, Gingras AC, Sonenberg N, Wagner G: Structure of translation factor eIF4E bound to m7GDP and interaction with 4E-binding protein. Nat Struct Biol. 1997, 4: 717-724. 10.1038/nsb0997-717.View ArticlePubMedGoogle Scholar
- Tomoo K, Shen X, Okabe K, Nozoe Y, Fukuhara S, Morino S, Ishida T, Taniguchi T, Hasegawa H, Terashima A, Sasaki M, Katsuya Y, Kitamura K, Miyoshi H, Ishikawa M, Miura K: Crystal structures of 7-methylguanosine 5'-triphosphate (m(7)GTP)- and P(1)-7-methylguanosine-P(3)-adenosine-5', 5'-triphosphate (m(7)GpppA)-bound human full length eukaryotic initiation factor 4E: biological importance of the C-terminal flexible region. Biochem J. 2002, 362: 539-544. 10.1042/0264-6021:3620539.PubMed CentralView ArticlePubMedGoogle Scholar
- Hsu PC, Hodel MR, Thomas JW, Taylor LJ, Hagedorn CH, Hodel AE: Structural requirements for the specific recognition of an m7G mRNA Cap. Biochemistry. 2000, 39: 13730-13736. 10.1021/bi000623p.View ArticlePubMedGoogle Scholar
- Vasilescu S, Ptushkina M, Linz B, Muller PP, McCarthy JE: Mutants of eukaryotic initiation factor eIF-4E with altered mRNA cap binding specificity reprogram mRNA selection by ribosomes in Saccharomyces cerevisiae. J Biol Chem. 1996, 271: 7030-7037. 10.1074/jbc.271.12.7030.View ArticlePubMedGoogle Scholar
- Robalino J, Joshi B, Fahrenkrug SC, Jagus R: Two zebrafish eIF4E family members are functionally divergent and are differentially expressed. J Biol Chem. 2004, 279: 10532-10541. 10.1074/jbc.M313688200.View ArticlePubMedGoogle Scholar
- Gross JD, Moerke NJ, von der Haar T, Lugovskoy AA, Sachs AB, McCarthy JE, Wagner G: Ribosome loading onto the mRNA cap is driven by conformational coupling between eIF4G and eIF4E. Cell. 2003, 115: 739-750. 10.1016/S0092-8674(03)00975-9.View ArticlePubMedGoogle Scholar
- Scheper GC, van Kollenburg B, Hu J, Luo Y, Goss DJ, Proud CG: Phosphorylation of eukaryotic initiation factor 4E markedly reduces its affinity for capped mRNA. J Biol Chem. 2002, 277: 3303-3309. 10.1074/jbc.M103607200.View ArticlePubMedGoogle Scholar
- Richter JD, Sonenberg N: Regulation of cap-dependent translation by eIF4E inhibitory proteins. Nature. 2005, 433: 477-480. 10.1038/nature03205.View ArticlePubMedGoogle Scholar
- Pause A, Belsham GJ, Gingras AC, Donze O, Lin T-A, Lawrence JCJ, Sonenberg N: Insulin-dependent stimulation of protein synthesis via a novel regulator of cap function. Nature. 1994, 371: 762-767. 10.1038/371762a0.View ArticlePubMedGoogle Scholar
- Lin T-A, Kong X, Haystead TAJ, Pause A, Belsham G, Sonenberg N, Lawrence JCJ: PHAS-I as a link between MAP-kinase and translational initiation. Science. 1994, 266: 544-547.View ArticleGoogle Scholar
- Poulin F, Gingras AC, Olsen H, Chevalier S, Sonenberg N: 4E-BP3, a new member of the eukaryotic initiation factor 4E-binding protein family. J Biol Chem. 1998, 273: 14002-14007. 10.1074/jbc.273.22.14002.View ArticlePubMedGoogle Scholar
- Marcotrigiano J, Gingras AC, Sonenberg N, Burley SK: Cap-dependent translation initiation in eukaryotes is regulated by a molecular mimic of eIF4G. Mol Cell. 1999, 3: 707-716. 10.1016/S1097-2765(01)80003-4.View ArticlePubMedGoogle Scholar
- Takata M, Ogawa W, Kitamura T, Hino Y, Kuroda S, Kotani K, Klip A, Gingras AC, Sonenberg N, Kasuga M: Requirement for Akt (protein kinase B) in insulin-induced activation of glycogen synthase and phosphorylation of 4E-BP1 (PHAS-1). J Biol Chem. 1999, 274: 20611-20608. 10.1074/jbc.274.29.20611.View ArticlePubMedGoogle Scholar
- Gingras AC, Kennedy SG, O'Leary MA, Sonenberg N, Hay N: 4E-BP1, a repressor of mRNA translation, is phosphorylated and inactivated by the Akt(PKB) signaling pathway. Genes Dev. 1998, 12: 502-513.PubMed CentralView ArticlePubMedGoogle Scholar
- Yang D, Brunn GJ, Lawrence JC: Mutational analysis of sites in the translational regulator, PHAS-I, that are selectively phosphorylated by mTOR. FEBS Lett. 1999, 453: 387-390. 10.1016/S0014-5793(99)00762-0.View ArticlePubMedGoogle Scholar
- Pyronnet S, Imataka H, Gingras AC, Fukunaga R, Hunter T, Sonenberg N: Human eukaryotic translation initiation factor 4G (eIF4G) recruits mnk1 to phosphorylate eIF4E. EMBO J. 1999, 18: 270-279. 10.1093/emboj/18.1.270.PubMed CentralView ArticlePubMedGoogle Scholar
- Ptushkina M, von der Haar T, Karim MM, Hughes JM, McCarthy JE: Repressor binding to a dorsal regulatory site traps human eIF4E in a high cap-affinity state. EMBO J. 1999, 18: 4068-4075. 10.1093/emboj/18.14.4068.PubMed CentralView ArticlePubMedGoogle Scholar
- Lax S, Fritz W, Browning K, Ravel J: Isolation and characterization of factors from wheat germ that exhibit eukaryotic initiation factor 4B activity and overcome 7-methylguanosine 5'-triphosphate inhibition of polypeptide synthesis. Proc Natl Acad Sci. 1985, 82: 330-333.PubMed CentralView ArticlePubMedGoogle Scholar
- Browning KS, Lax SR, Ravel JM: Identification of two messenger RNA cap binding proteins in wheat germ. J Biol Chem. 1987, 262: 11228-11232.PubMedGoogle Scholar
- Altmann M, Muller PP, Pelletier J, Sonenberg N, Trachsel H: A mammalian translation initiation factor can substitute for its yeast homologue in vivo. J Biol Chem. 1989, 264: 12145-12147.PubMedGoogle Scholar
- Lasko P: The Drosophila melanogaster genome: translation factors and RNA binding proteins. J Cell Biol. 2000, 150: F51-56. 10.1083/jcb.150.2.F51.View ArticlePubMedGoogle Scholar
- Joshi B, Cameron A, Jagus R: Characterization of mammalian eIF4E family members. Eur J Biochem. 2004, 271: 2189-2203. 10.1111/j.1432-1033.2004.04149.x.View ArticlePubMedGoogle Scholar
- Hernandez G, Vazquez-Pianzola P: Functional diversity of the eukaryotic translation initiation factors belonging to eIF4 families. Mech Dev. 2005, 122: 865-876. 10.1016/j.mod.2005.04.002.View ArticlePubMedGoogle Scholar
- Keiper BD, Lamphear BJ, Deshpande AM, Jankowska-Anyszka M, Aamodt EJ, Blumenthal T, Rhoads RE: Functional characterization of five eIF4E isoforms in Caenorhabditis elegans. J Biol Chem. 2000, 275: 10590-10596. 10.1074/jbc.275.14.10590.View ArticlePubMedGoogle Scholar
- Ruud KA, Kuhlow C, Goss DJ, Browning KS: Identification and characterization of a novel cap-binding protein from Arabidopsis thaliana. J Biol Chem. 1998, 24: 10325-10330. 10.1074/jbc.273.17.10325.View ArticleGoogle Scholar
- Dinkova TD, Keiper BD, Korneeva NL, Aamodt EJ, Rhoads RE: Translation of a small subset of Caenorhabditis elegans mRNAs is dependent on a specific eukaryotic translation initiation factor 4E isoform. Mol Cell Biol. 2005, 25: 100-113. 10.1128/MCB.25.1.100-113.2005.PubMed CentralView ArticlePubMedGoogle Scholar
- Rom E, Kim HC, Gingras AC, Marcotrigiano J, Favre D, Olsen H, Burley SK, Sonenberg N: Cloning and characterization of 4EHP, a novel mammalian eIF4E-related cap-binding protein. J Biol Chem. 1998, 273: 13104-13109. 10.1074/jbc.273.21.13104.View ArticlePubMedGoogle Scholar
- Amiri A, Keiper BD, Kawasaki I, Fan Y, Kohara Y, Rhoads RE, Strome S: An isoform of eIF4E is a component of germ granules and is required for spermatogenesis in C. elegans. Development. 2001, 128: 3899-3912.PubMed CentralPubMedGoogle Scholar
- Jankowska-Anyszka M, Lamphear BJ, Aamodt EJ, Harrington T, Darzynkiewicz E, Stolarski R, Rhoads RE: Multiple isoforms of eukaryotic protein synthesis initiation factor 4E in Caenorhabditis elegans can distinguish between mono- and trimethylated mRNA cap structures. J Biol Chem. 1998, 273: 10538-10542. 10.1074/jbc.273.17.10538.View ArticlePubMedGoogle Scholar
- Yoffe Y, Zuberek J, Lewdorowicz M, Zeira Z, Keasar C, Orr-Dahan I, M J-A, Stepinsk iJ, Darzynkiewicz E, Shapira M: Cap-binding activity of an eIF4E homolog from Leishmania. RNA. 2004, 10: 1764-1775. 10.1261/rna.7520404.PubMed CentralView ArticlePubMedGoogle Scholar
- The eIF4E-Family Member Database. [http://umbicc3-215.umbi.umd.edu]
- Zuberek J, Jemielity J, Jablonowska A, Stepinsk J, Dadlez M, Stolarski R, Darzynkiewicz E: Influence of electric charge variation at residues 209 and 159 on the interaction of eIF4E with the mRNA 5' terminus. Biochemistry. 2004, 43: 5370-5379. 10.1021/bi030266t.View ArticlePubMedGoogle Scholar
- Aliyeva E, Metz AM, Browning KS: Sequences of two expressed sequence tags (EST) from rice encoding different cap-binding proteins. Gene. 1996, 180: 221-223. 10.1016/S0378-1119(96)00418-0.View ArticlePubMedGoogle Scholar
- Rodriguez CM, Freire MA, Camilleri C, Robaglia C: The Arabidopsis thaliana cDNAs coding for eIF4E and eIF(iso)4E are not functionally equivalent for yeast complementation and are differentially expressed during plant development. Plant J. 1998, 13: 465-473. 10.1046/j.1365-313X.1998.00047.x.View ArticlePubMedGoogle Scholar
- Leonard SPD, Wittmann S, Daigneault N, Fortin MG, Laliberte JF: Complex formation between potyvirus VPg and translation eukaryotic initiation factor 4E correlates with virus infectivity. J Virol. 2000, 74: 7730-7737. 10.1128/JVI.74.17.7730-7737.2000.PubMed CentralView ArticlePubMedGoogle Scholar
- Duprat A, Caranta C, Revers F, Menand B, Browning KS, Robaglia C: The Arabidopsis eukaryotic initiation factor (iso)4E is dispensable for plant growth but required for susceptibility to potyviruses. Plant J. 2002, 32: 927-934. 10.1046/j.1365-313X.2002.01481.x.View ArticlePubMedGoogle Scholar
- Lellis AD, Kasschau KD, Whitham SA, Carrington JC: Loss-of-susceptibility mutants of Arabidopsis thaliana reveal an essential role for eIF(iso)4E during potyvirus infection. Curr Biol. 2002, 12: 1046-1051. 10.1016/S0960-9822(02)00898-9.View ArticlePubMedGoogle Scholar
- Hernandez G, Altmann M, Sierra JM, Urlaub H, Diez Del Corral R, Schwartz P, Rivera-Pomar R: Functional analysis of seven genes encoding eight translation initiation factor 4E (eIF4E) isoforms in Drosophila. Mech Dev. 2005, 122: 529-543. 10.1016/j.mod.2004.11.011.View ArticlePubMedGoogle Scholar
- Blumenthal T: Trans-splicing and polycistronic transcription in Caenorhabditis elegans. Trends Genet. 1995, 11: 132-136. 10.1016/S0168-9525(00)89026-5.View ArticlePubMedGoogle Scholar
- Stover NA, Steele RE: Trans-spliced leader addition to mRNAs in a cnidarian. Proc Natl Acad Sci. 2001, 98: 5693-5698. 10.1073/pnas.101049998.PubMed CentralView ArticlePubMedGoogle Scholar
- Lall S, Friedman CC, Jankowska-Anyszka M, Stepinski J, Darzynkiewicz E, Davis RE: Contribution of trans-splicing, 5'-leader length, cap-poly(A) synergism, and initiation factors to nematode translation in an Ascaris suum embryo cell-free system. J Biol Chem. 2004, 279: 45573-45585. 10.1074/jbc.M407475200.View ArticlePubMedGoogle Scholar
- Ptushkina M, Berthelot K, von der Haar T, Geffers L, Warwicker J, McCarthy JE: A second eIF4E protein in Schizosaccharomyces pombe has distinct eIF4G-binding properties. Nucleic Acids Res. 2001, 29: 4561-4569. 10.1093/nar/29.22.4561.PubMed CentralView ArticlePubMedGoogle Scholar
- Tee AR, Tee JA, Blenis J: Characterizing the interaction of the mammalian eIF4E-related protein 4EHP with 4E-BP1. FEBS Lett. 2004, 564: 58-62. 10.1016/S0014-5793(04)00313-8.View ArticlePubMedGoogle Scholar
- Cho PF, Poulin F, Cho-Park YA, Cho-Park IB, Chicoine JD, Lasko P, Sonenberg N: A new paradigm for translational control: inhibition via 5'-3' mRNA tethering by Bicoid and the eIF4E cognate 4EHP. Cell. 2005, 121: 411-423. 10.1016/j.cell.2005.02.024.View ArticlePubMedGoogle Scholar
- Ramaswamy S, Ross KN, Lander ES, Golub TR: A molecular signature of metastasis in primary solid tumors. Nat Genet. 2003, 33: 49-54. 10.1038/ng1060.View ArticlePubMedGoogle Scholar
- Perry KL, Watkins KP, Agabian N: Trypanosome mRNAs have unusual "cap 4" structures acquired by addition of a spliced leader. Proc Natl Acad Sci. 1987, 84: 8190-8194.PubMed CentralView ArticlePubMedGoogle Scholar
- Zeiner GM, Sturm NR, Campbell DA: The Leishmania tarentolae spliced leader contains determinants for association with polysomes. J Biol Chem. 2003, 278: 38269-38275. 10.1074/jbc.M304295200.View ArticlePubMedGoogle Scholar
- Douglas SE, Penny SL: The plastid genome of the cryptophyte alga, Guillardia theta : complete sequence and conserved synteny groups confirm its common ancestry with red algae. J Mol Evol. 1999, 48: 236-244.View ArticlePubMedGoogle Scholar
- Matsuzaki M, Misumi O, Shin-I T, Maruyama S, Takahara M, Miyagishima SY, Mori T, Nishida K, Yagisawa F, Nishida K, Yoshida Y, Nishimura Y, Nakao S, Kobayashi T, Momoyama Y, Higashiyama T, Minoda A, Sano M, Nomoto H, Oishi K, Hayashi H, Ohta F, Nishizaka S, Haga S, Miura S, Morishita T, Kabeya Y, Terasawa K, Suzuki Y, Ishii Y, Asakawa S, Takano H, Ohta N, Kuroiwa H, Tanaka K, Shimizu N, Sugano S, Sato N, Nozaki H, Ogasawara N, Kohara Y, Kuroiwa T: Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D. Nature. 2004, 428: 653-657. 10.1038/nature02398.View ArticlePubMedGoogle Scholar
- Katinka MD, Duprat S, Cornillot E, Metenier G, Thomarat F, Prensier G, Barbe V, Peyretaillade E, Brottier P, Wincker P, Delbac F, El Alaoui H, Peyret P, Saurin W, Gouy M, Weissenbach J, Vivares CP: Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi. Nature. 2001, 414: 450-453. 10.1038/35106579.View ArticlePubMedGoogle Scholar
- Aravind L, Koonin EV: Eukaryote-specific domains in translation initiation factors: implications for translation regulation and evolution of the translation system. Genome Res. 2000, 10: 1172-1184. 10.1101/gr.10.8.1172.PubMed CentralView ArticlePubMedGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680.PubMed CentralView ArticlePubMedGoogle Scholar
- Kumar S, Tamura K, Nei M: MEGA: Molecular Evolutionary Genetics Analysis software for microcomputers. Comput Appl Biosci. 1994, 10: 189-191.PubMedGoogle Scholar
- Dayhoff MO, Schwartz RM, Orcutt BC: A model of evolutionary change in proteins. matrices for detecting distant relationships. Atlas of protein sequence and structure. Edited by: Dayhoff MO. 1978, National Biomedical Research foundation: Washington, D.C, 345-358.Google Scholar
- Bangs JD, Crain PF, Hashizume T, McCloskey JA, Boothroyol JC: Mass spectrometry of mRNA cap 4 from trypanosomatids reveals two novel nucleosides. J Biol Chem. 1992, 267: 9805-9815.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.