Gene make-up: rapid and massive intron gains after horizontal transfer of a bacterial α-amylase gene to Basidiomycetes
© Da Lage et al.; licensee BioMed Central Ltd. 2013
Received: 7 September 2012
Accepted: 30 January 2013
Published: 13 February 2013
Increasing genome data show that introns, a hallmark of eukaryotes, already existed at a high density in the last common ancestor of extant eukaryotes. However, intron content is highly variable among species. The tempo of intron gains and losses has been irregular and several factors may explain why some genomes are intron-poor whereas other are intron-rich.
We studied the dynamics of intron gains and losses in an α-amylase gene, whose product breaks down starch and other polysaccharides. It was transferred from an Actinobacterium to an ancestor of Agaricomycotina. This gene underwent further duplications in several species. The results indicate a high rate of intron insertions soon after the gene settled in the fungal genome. A number of these oldest introns, regularly scattered along the gene, remained conserved. Subsequent gains and losses were lineage dependent, with a majority of losses. Moreover, a few species exhibited a high number of both specific intron gains and losses in recent periods. There was little sequence conservation around insertion sites, then probably little information for splicing, whereas splicing sites, inside introns, showed typical and conserved patterns. There was little variation of intron size.
Since most Basidiomycetes have intron-rich genomes and this richness was ancestral in Fungi, long before the transfer event, we suggest that the new gene was shaped to comply with requirements of the splicing machinery, such as short exon and intron sizes, in order to be correctly processed.
KeywordsGlycosyl hydrolase Lateral gene transfer Fungi Gene duplication Intron gain Protosplice
The ongoing debate on the origin and evolution of spliceosomal introns in eukaryotes has shifted in the last few years on the origin of variations in intron density in genomes, and correlatively, on the relative rates of gain and loss of introns. Indeed, whole genome sequencing of a variety of eukaryote species has revealed an impressive diversity of intron contents. There are intron-poor species, mostly unicellular, such as Saccharomyces cerevisiae, Guillardia theta, Encephalitozoon cuniculi. Intron-rich species are often multicellular, for example vertebrates, the worm Caenorhabditis elegans, the fungus Phanerochaete chrysosporium, the sea squirt Ciona intestinalis. Intron-rich unicellular organisms also exist, like the green alga Chlamydomonas[1, 2]. Several studies have concluded that the last eukaryote common ancestor (LECA) had a mild to high intron density (e.g. [1, 3–7]. However, it seems that the subsequent history, during lineages diversification, has been quite diverse, with massive losses in some lineages, bursts of intron gains followed by either stases or losses, or reset of intron positions in others [1, 2, 4, 7–12]. Several possible reasons have been proposed to explain the contrasted current situation: low population sizes allowing fixation of mildly deleterious introns , variable balance between different mechanisms of DNA repair , selection for optimal exon size, due to spliceosome requiremements , nonsense mediated decay (NMD) . Many studies have shown a large excess of intron losses relative to gains, especially when related species were compared [9, 17–22]. Comparisons of a single gene among more or less related species have also suggested that intron losses outnumbered intron gains since the split of the species studied from their common ancestor. Moreover, repeated, independent loss of the same intron (in the same position) was often noticed [23–27]). In contrast, until recently, clear recent intron gains had not been frequently identified (e.g. [28, 29]). Some gain cases inferred from a given data set appeared, after further sampling, to be recurrent losses . However, some cases of gains outnumbering losses were reported in fungi . Indeed, recent population genomic studies and increasing sequence data show that gains are still occurring [31, 32]. We still have little knowledge of the real tempo of intron gains and losses during evolution along a lineage and the factors that influence it. Dynamics of intron gains and losses in the course of evolution is an attracting issue, given its biological significance. A method for addressing this issue is to survey eukaryote genes horizontally transferred recently from bacteria, which are devoid of spliceosomal introns . Recent transfers followed by intron insertions may give insights into the pace and dynamics of gains, provided that the HGT could be dated.
In vertebrates and possibly other intron-rich genomes, it has been shown that exons exceeding a certain size may be misrecognized by the splicing machinery [15, 33], or prone to premature termination codons, due to the unability for NMD to act upon . We hypothesize that in such intron-rich genomes, intronless genes stemming from horizontal transfer from bacteria should be quickly invaded by introns to shorten the exon size. The NMD hypothesis also posits that introns should be inserted regularly along the gene. Indeed, a study of HGT genes in fungal genomes, mostly Ascomycetes, showed a correlation between intron densities in transgenes of bacterial origin and the recipient genomes . Here we studied an α-amylase gene, previously identified in a Basidiomycete, the white rot Phanerochaete chrysosporium, that was transferred from an Actinobacterium to Agaricomycotina. Alpha-amylases often form multigene families, and most Basidiomycetes already harbor at least one fungal-type α-amylase gene (Carbohydrate Active Enzymes database http://www.cazy.org). Basidiomycetes are ancestrally intron-rich . In this new gene of bacterial origin, we have identified intron gains and losses that occurred since the gene settled in the fungal genome and we estimated the rates of gains and losses, and some characteristics of the introns inserted.
The sequence jgi|Phchr1|7087| from Phanerochaete chrysosporium, already reported to encode an animal-type α-amylase  was used as a query for BLASTP search in GenBank nr and GenBank Fungal Genomes (http://blast.ncbi.nlm.nih.gov), and BLASTP search implemented in the Mycocosm data base at the Joint Genome Institute (http://genome.jgi-psf.org/programs/fungi/, ). The putative retrieved orthologs were then aligned using MAFFT  implemented in the Geneious software (Biomatters Ltd.), and manually corrected for erroneous intron-exon structures when necessary. Those errors were detected when large unique amino acid insertions or deletions were evidenced in the alignment. In these cases, when available and if necessary, expressed sequence tags (EST) were used to confirm intron positions and boundaries. The query sequence contained a C-terminal carbohydrate binding module of the CBM20 family. A number, but not all retrieved sequences possessed a terminal CBM20 domain of variable length, always containing introns. Because it was not present in every sequence, the CBM20 was no longer considered and the alignment was truncated to the C-terminal end of the core protein. Intron positions were mapped onto the alignment according to the annotations of the genomes, mainly those deciphered at the JGI. From this protein alignment, after curation of the alignment with Gblocks  leaving 398 positions (83%) available, a gene tree was built using PhyML , at the http://www.phylogeny.fr web server . After testing various models with MEGA5 , we used the WAG substitution matrix with a gamma distribution of substitution rate across sites (the shape parameter α was estimated from the data with four rate categories). The robustness of the nodes was estimated by 100 bootstrap replicates.
A few species were also investigated experimentally using polymerase chain reaction. DNA samples were supplied by the Hibbett Laboratory at Clark University or purchased from the Centraalbureau voor Schimmelcultures at the Institute of the Royal Netherlands Academy of Arts and Sciences. The primers and experimental conditions are given in Additional file 1: Table S1. Only partial sequence data were obtained from the following species related to P. chrysosporium: Phlebia radiata FPL6140, P. albomellea CBS 275.92, Grifola frondosa MO11 (accession numbers JX310736-JX310738).
In order to infer the antiquity of the α-amylase gene transfer from a bacterium, and the times of intron insertions, we estimated the ages of nodes in a species tree. A fungal species tree was established by compilation of recent literature, which included the species of interest for our study, but also Ascomycetes ([43–51] and especially ), and unpublished data kindly shared by D. S. Hibbett and by the Joint Genome Institute (Binder et al. in preparation) for solving uncertain relationships. We performed a Bayesian analysis with the BEAST program . An alignment was performed for 54 fungal species, using protein sequences of elongation factor 1-alpha, RNA polymerase II largest and second large subunits (EF1α, LSU1 and LSU2, Additional file 2: Table S2) aligned separately using MAFFT , then concatenated. After curation for badly aligned regions with Gblocks , 1671 amino acid positions remained. The tree made from the alignment was constrained to match the established species tree topology. We estimated divergence times using BEAST v1.7.1 , assuming a relaxed uncorrelated lognormal molecular clock model, a Yule speciation process for tree prior, and a WAG + Γ substitution model. The analysis was run for 12 million generations, saving a tree every 1,000th generation. The resulting log file was inspected with Tracer v1.5  to verify that the sample size was large enough to give good estimations of posterior distributions. We found that the steady state had been already reached after two millions generations. After removing the first 2,000 trees as burn-in, the remaining 10,000 sampled trees were analyzed with TreeAnnotator v1.7.1  to estimate the 95% highest posterior intervals of the divergence times. Fossil calibration was possible at two nodes : divergence between Ascomycetes and Basidiomycetes was set to 600 Ma , and divergence between Eurotiomycetes and Sordariomycetes was set to 410 Ma, the age of the oldest likely Sordariomycete [56, 57].
In order to show the occurrence of HGT and its origin, a general gene tree of glycosyl hydrolases of the GH13 family, which have a broad activity range , from various organisms was built from a structural alignment as described in ref. , adding the sequences studied here.
Gains and losses of introns were inferred in a weighted (Dollo) parsimony framework, considering parallel losses much more frequent than parallel gains , as in . Using Mesquite v. 2.75 , we tried parsimony and ML scenarii for intron gains and losses directly on the gene tree. Because numerous gene duplications and gene losses occurred, we tried to reconcile the gene and species trees using Notung 2.6 . The program MALIN  infers the evolution of exon-intron structure in protein-coding orthologs. It could not be used, though, because orthology relationships among the genes could not be solved in most cases (see Results). Finally, the loss and gain events were mapped onto the species tree, not the gene tree. The average rates of intron gains and losses per million year and per branch were computed by considering that events occurred evenly along a branch. For example, if three losses occurred along a branch 12 Ma in length, the loss rate was 3/12 per Ma. Then, the rates for all branches present at a given time were summed and averaged.
Gene transfer from a bacterium
Intron richness, gains and losses
Intron sizes and insertion sites
In eukaryote genomes, an excess of phase 0 introns was often observed [67, 68], including in Fungi . We did not find such a bias, but rather a slight excess of phase 1 introns (28/64), however not significantly different from a 1:1:1 distribution (χ2 = 3.28 n.s.), not counting the putative slided introns. Results were similar for the 17 oldest intron positions.
We noticed no spatial preference for intron insertion (homogeneity test ; the alignment was divided in ten parts of equal length, p = 0.71, n.s.), except that there were no introns in the putative signal peptides. The NMD hypothesis suggests that introns colonizing an empty gene would be prone to regular spacing . We checked whether the 17 ancestral introns were inserted at random or showed a regular pattern along the coding sequence. There was no over-regularity compared to random spacing for these oldest intron positions, as estimated by simulating 10,000 genes with 17 random insertions (p = 0.16). All the extant genes from our data set were checked as well. Overall, there may be some trend towards regular spacing of introns, since about half of the genes showed intron spacing significantly more regular than expected by chance (Additional file 8: Table S4).
The case of Bjerkandera adusta. Intron sliding
Bjerkandera adusta is a close relative of Phanerochaete chrysosporium. It was not included in the analysis shown on Figure 3 and Figure 4 because, intriguingly, none of its three gene copies is close to the ones of P. chrysosporium or its other relatives. In addition, two copies (Bjead1|55696|, Bjead1|141648|) share highest sequence similarity and three intron positions (1, 20, 24) exclusively with remote species such as Stereum hirsutum (Russulales). This illustrates the complicated gene history and might raise the possibility of HGT among fungi. It is also worth noting two occurrences of intron sliding in those copies of B. adusta at positions 4 and 7 (marked by asterisks in Figure 2). Intron 4 is absent from most genes. Thus, one can hypothesize an independent gain, rather than displacement of a preexisting intron. In contrast, in the case of the widespread intron 7, although it is absent from the closest sequences Punst1|74571|, Stehi1|83072| and Stehi1|159685|, one could more likely infer an intron displacement, one base pair apart (phase 0 vs. phase 1). This new phase 0 intron is located at the same position as the widespread intron 1 of animals . Intron sliding was also found in Piriformospora indica (ancestral position 23).
The case of Stereum hirsutumStehi1|78757|
We found four copies in Stereum hirsutum. Stehi1|78757| was the most diverged sequence among all our data set. Strikingly, most of its 21 intron positions were different from the positions found in the other genes (Figure 2). This pattern could be explained by an independent gene insertion from an intronless donor, such as a bacterium or a retrotranscript from an existing copy, followed by de novo intron colonization. Whatever the origin ot this gene copy, it highlights the high intron density in a gene of likely recent origin. Note that two possible cases of parallel intron gains at positions 2 and 4 involve positions found in this copy. This reinforces the hypothesis of true parallel gains. The specific introns of Stehi1|78757| account for one third of the whole number of intron positions.
For studying the dynamics of intron gains in eukaryote genes, it is worth using primitively intronless genes, originating from either bacteria or retro-elements. A few such studies were published recently [73, 74]. In a previous study , we had investigated the dynamics of introns in the α-amylase genes of bilaterian animals, likely of bacterial origin . The putative HGT event was about twice as old as the one studied here. We had retrieved at most three likely ancestral intron positions and only a minority of positions were shared by several phyla, so that it was not possible to infer the pace of intron colonization after the gene insertion. In contrast, in this study, we have shown that a gene of bacterial origin, transferred horizontally into a fungus, was quickly split by numerous introns about 300–400 million years ago. The donor was an actinobacterium, and it is likely that this kind of transfer happened several times independently. Indeed, the related α-amylase gene found in P. graminis and M. laricis-populina is most likely the result of a different transfer event. The position of these sequences in the gene tree (Additional file 5: Figure S2) is clearly not related to the sequences studied here. Moreover, the sequences of these two species share 12 intron positions, ten of which are different from the 64 positions identified in our study, and two positions are common with the "outlier" gene Stehi1|78757|. A similar situation was observed in the nad7 gene, transferred twice independently from mitochondrion in Opisthokonts and in Chlamydomonas reinhardtii. We have no evidence that the carbohydrate binding module CBM20 was co-transferred with the core enzyme gene. CBM20 domains exist in bacteria, but they are common in fungal glycosyl hydrolases too [75, 76], e.g. in the GH15 family, and may have been recruited later in the HGT α-amylase gene through domain shuffling.
The time of the HGT is uncertain, given the scarce fossil data that can be used for time calibration. Therefore, the dates computed here are indicative. It seems however clear that at least 17 introns were inserted within a rather short period, before the divergence of Dacryopinax sp. The origin of these introns could obviously not be retrieved, given the long time elapsed. Even for more recent introns, e.g. specific introns found in P. strigosozonata or S. hirsutum, it was not possible to identify any donor sequence. Indeed, among the various mechanisms proposed for intron gains [2, 32, 77, 78], it has been shown that new introns may be created by random insertion of any DNA fragment or nucleotide filling during DNA repair after double strand break [14, 31, 77, 79], which thus form novel sequences. Comparisons between closely related species, such as P. chrysosporium and P. carnosa also shows the fast divergence of introns.
Inferring gain and loss events along lineages was made difficult by the lack of congruence between the gene tree and the species tree. Relationships between genes were not clear, bootstrap supports were often low. As mentioned above, hidden paralogy was suspected is several cases. Only substantial additional data from other species could help in solving this problem, which led to overestimate the loss rates at some periods. Gains were generally easily mapped, except a few cases where parallel gains were proposed. Parallel losses occurred much more frequently than gains, even not counting the possibly misleading hidden paralogies. Intron 7 was lost five times, intron 34 at least eight times. This is consistent with many reports showing that parallel losses are common relative to parallel intron gains (e.g. [60, 80]). Correlatively, we have shown that, after the initial burst of gains in the empty gene, the rate of gains dropped and the rate of losses increased up to a large excess of loss over gain, as already observed [80, 81]. The high activity of specific gains and losses in a few terminal branches of our data set, P. strigosozonata and S. hirsutum, remains unexplained, especially as there is no such activity in their relatives (G. trabeum and H. annosum, respectively). This could be related to the occurrence of several copies, four and three, respectively. High rates of intron gains and losses were reported in paralogous genes . In S. hirsutum, a lot of specific gains occurred in a particular gene copy, Stehi1|78757|, which has probably a quite different history. It is unclear whether this copy originated in an independent HGT from a related bacterium or stems from a processed cDNA. In the latter case, there should be some sequence similarity with the parent gene, which was not found in the extant gene copies of this species. This gene must have been acquired much more recently than the gene shared by most Agaricomycotina. And yet, it is very intron-rich. This point adds relevance to our hypothesis that primitively intronless genes in intron-rich genomes are prone to be quickly provided with interrupting sequences. Rapid acquisition of introns was also observed in mitochondrial-derived genes, assumed to be primitively intronless  and in mammalian "domesticated genes" stemming from tranposable elements .
The HGT α-amylase gene could be a suitable model for studying the evolution of information content around intron sites. However, we found a low level of information at positions −2 to +2 surrounding intron sites, contrasting with our results in animals , where the classical AG/G protosplice consensus  was majoritary. This may indicate an absence of insertional sequence preference. As in our previous study, we noticed an even weaker level of information around empty sites. Information was not significantly stronger around older introns than recent introns, but this result suffers from a low number of data and a high variance for recent insertions. As underlined by Rogozin and colleagues , evolution towards the protosplice consensus may be a slow process, and our gene may be too recent. The increase of information after loss of old introns is surprising, because if intron neighborhood is involved in intron recognition and splicing, which is well established, one would rather expect a relaxation of constraints after intron loss, thus unbiased base composition.
In contrast, information at both 5' and 3' splicing sites was strong and typical of fungal introns , suggesting that, whereas exonic neighborhood may be not crucial for splicing, intronic splicing sequences are important for proper intron recognition. Another important feature for efficient splicing may be a short intron size. Indeed, we have shown the low variability of size in our data set, whatever the species and the intron position. This could be indicative of a functional constraint. This is consistent with , who have shown that short intron sizes contributed importantly to intron detection in Basidiomycetes.
Altogether, our data suggest that several features were important to confer to the transferred gene suitable characteristics regarding splicing efficiency: short introns; shortening the exons to a small size through multiple intron gains, although exon sizes were more variable than intron sizes; and rather regular intron spacing along the entire gene, perhaps for efficient nonsense mediated decay . It is not clear whether intron gains were positively selected. It has been proposed that introns colonized eukaryotic genomes by random fixation in low population size species, while they were mildly deleterious [13, 83, 84]. However, in our case, the HGT gene settled in a genome that was probably already intron-rich , endowed with a spliceosome adapted to cope with intron-rich genes. The potential deleterious effect of inserting an intron might have been balanced by the advantage of splitting the gene in smaller pieces. Therefore, one can assume that there was a rather strong selective pressure for either gene loss, or gene "make-up" to look like other fungal genes. The ecological advantage of getting new abilities for polysaccharide degradation by gene capture may explain that this gene was acquired and made active several times independently. Indeed, acquisition of bacterial GH or other degrading enzymes by fungi has been shown to be advantageous .
Our results need now to be generalized by investigating other genes recently transferred from bacteria, in both intron-rich and intron-poor genomes, in order to confirm whether introns colonized intronless genes rapidly, with a density related to the genome average.
Horizontal gene transfer
Carbohydrate binding module
Most of the sequence data were produced by the US Department of Energy Joint Genome Institute (http://www.jgi.doe.gov/) in collaboration with the user community. We are grateful to the JGI team and Igor Grigoriev for sharing unpublished data. We warmly thank David Hibbett and Ricardo Garcia-Sandoval at Clark University for their help, especially by giving DNA samples and sharing unpublished results. We thank Dan Cullen for advice regarding P. chrysosporium. We thank Julien Fumey for computer simulations. This work was funded by regular funding of the CNRS. SJ was supported by the grant No. 2/0148/11 from the Slovak grant agency VEGA.
- Jeffares DC, Mourier T, Penny D: The biology of intron gain and loss. Trends Genet. 2006, 22 (1): 16-22. 10.1016/j.tig.2005.10.006.PubMedView ArticleGoogle Scholar
- Roy SW, Gilbert W: The evolution of spliceosomal introns: patterns, puzzles and progress. Nature Rev Genet. 2006, 7: 211-221.PubMedGoogle Scholar
- Roy SW: Intron-rich ancestors. Trends Genet. 2006, 22 (9): 468-471. 10.1016/j.tig.2006.07.002.PubMedView ArticleGoogle Scholar
- Stajich JE, Dietrich FS, Roy SW: Comparative genomic analysis of fungal genomes reveals intron-rich ancestors. Genome Biol. 2007, 8 (10): R233-View ArticleGoogle Scholar
- Csűrös M, Rogozin IB, Koonin EV: Extremely intron-rich genes in the Alveolate ancestors inferred with a flexible Maximum-Likelihood approach. Mol Biol Evol. 2008, 25 (5): 903-911. 10.1093/molbev/msn039.PubMedView ArticleGoogle Scholar
- Roy SW, Gilbert W: Complex early genes. Proc Natl Acad Sci USA. 2005, 102 (6): 1986-1991. 10.1073/pnas.0408355101.PubMed CentralPubMedView ArticleGoogle Scholar
- Csuros M, Rogozin IB, Koonin EV: A detailed history of intron-rich eukaryotic ancestors inferred from a global survey of 100 complete genomes. PLoS Comput Biol. 2011, 7 (9): e1002150-10.1371/journal.pcbi.1002150.PubMed CentralPubMedView ArticleGoogle Scholar
- Rogozin IB, Wolf YI, Sorokin AV, Mirkin BG, Koonin EV: Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution. Curr Biol. 2003, 13: 1512-1517. 10.1016/S0960-9822(03)00558-X.PubMedView ArticleGoogle Scholar
- Roy SW, Penny D: Patterns of intron loss and gain in plants: intron loss-dominated evolution and genome-wide comparison of O. sativa and A. thaliana. Mol Biol Evol. 2007, 24 (1): 171-181.PubMedView ArticleGoogle Scholar
- Teich R, Grauvogel C, Petersen J: Intron distribution in Plantae: 500 million years of stasis during land plant evolution. Gene. 2007, 394: 96-104. 10.1016/j.gene.2007.02.011.PubMedView ArticleGoogle Scholar
- Edvardsen RB, Lerat E, Maeland AD, Flat M, Tewari R, Jensen MF, Lehrach H, Reinhardt R, Seo H-C, Chourrout D: Hypervariable and highly divergent intron-exon organizations in the Chordate Oikopleura dioica. J Molec Evol. 2004, 59: 448-457. 10.1007/s00239-004-2636-5.PubMedView ArticleGoogle Scholar
- Denoeud F, Henriet S, Mungpakdee S, Aury J-M, Da Silva C, Brinkmann H, Mikhaleva J, Olsen LC, Jubin C, Canestro C, et al: Plasticity of animal genome architecture unmasked by rapid evolution of a pelagic tunicate. Science. 2010, 300: 1381-1385.View ArticleGoogle Scholar
- Lynch M: The origins of eukaryotic gene structure. Mol Biol Evol. 2006, 23 (2): 450-468.PubMedView ArticleGoogle Scholar
- Farlow A, Meduri E, Schlötterer C: DNA double-strand break repair and the evolution of intron density. Trends Genet. 2011, 27 (1): 1-6. 10.1016/j.tig.2010.10.004.PubMed CentralPubMedView ArticleGoogle Scholar
- Berget SM: Exon recognition in vertebrate splicing. J Biol Chem. 1995, 270 (6): 2411-2414.PubMedView ArticleGoogle Scholar
- Lynch M, Kewalramani A: Messenger RNA surveillance and the evolutionary proliferation of introns. Mol Biol Evol. 2003, 20 (4): 563-571. 10.1093/molbev/msg068.PubMedView ArticleGoogle Scholar
- Cho S, Jin S-W, Cohen A, Ellis RE: A phylogeny of Caenorhabditis reveals frequent loss of introns during nematode evolution. Genome Res. 2004, 14: 1207-1220. 10.1101/gr.2639304.PubMed CentralPubMedView ArticleGoogle Scholar
- Coulombe-Huntington J: Intron loss and gain in Eukaryotes. 2008, Montreal: McGill UniversityGoogle Scholar
- Coulombe-Huntington J, Majewski J: Intron loss and gain in Drosophila. Mol Biol Evol. 2007, 24 (12): 2842-2850.PubMedView ArticleGoogle Scholar
- Roy SW, Fedorov A, Gilbert W: Large-scale comparison of intron positions in mammalian genes shows intron loss but not gain. Proc Natl Acad Sci USA. 2003, 100 (12): 7158-7162. 10.1073/pnas.1232297100.PubMed CentralPubMedView ArticleGoogle Scholar
- Roy SW, Irimia M, Penny D: Very little intron gain in Entamoeba histolytica genes laterally transferred from Prokaryotes. Mol Biol Evol. 2006, 23 (10): 1824-1827. 10.1093/molbev/msl061.PubMedView ArticleGoogle Scholar
- Roy SW: Smoke without fire: most reported cases on intron gain in Nematodes instead reflect intron losses. Mol Biol Evol. 2006, 23 (12): 2259-2262. 10.1093/molbev/msl098.PubMedView ArticleGoogle Scholar
- Da Lage J-L, Wegnez M, Cariou M-L: Distribution and evolution of introns in Drosophila amylase genes. J Molec Evol. 1996, 43: 334-347. 10.1007/BF02339008.PubMedView ArticleGoogle Scholar
- Krzywinski J, Besanski NJ: Frequent intron loss in the White gene: a cautionary tale for phylogeneticists. Mol Biol Evol. 2002, 19 (3): 362-366. 10.1093/oxfordjournals.molbev.a004091.PubMedView ArticleGoogle Scholar
- Wada H, Kobayashi M, Sato R, Satoh N, Miyasaka H, Shirayama Y: Dynamic insertion-deletion of introns in deuterostome EF-1 alpha genes. J Molec Evol. 2002, 54 (1): 118-128. 10.1007/s00239-001-0024-y.PubMedView ArticleGoogle Scholar
- Maczkowiak F, Da Lage J-L: Origin and evolution of the Amyrel gene in the α-amylase family of Diptera. Genetica. 2006, 128: 145-158. 10.1007/s10709-005-5578-y.PubMedView ArticleGoogle Scholar
- Da Lage J-L, Maczowiak F, Cariou M-L: Phylogenetic distribution of intron positions in alpha-amylase genes of Bilateria suggests numerous gains and losses. PLoS One. 2011, 6 (5): e19673-10.1371/journal.pone.0019673.PubMed CentralPubMedView ArticleGoogle Scholar
- Bhattacharya D, Lutzoni F, Reeb V, Simon D, Nason J, Fernandez F: Widespread occurrence of spliceosomal introns in the rDNA genes of Ascomycetes. Mol Biol Evol. 2000, 17 (12): 1971-1984. 10.1093/oxfordjournals.molbev.a026298.PubMedView ArticleGoogle Scholar
- Flakowski J, Bolivar I, Fahrni J, Pawlowski J: Tempo and mode of spliceosomal intron evolution in actin of Foraminifera. J Molec Evol. 2006, 63: 30-41. 10.1007/s00239-005-0061-z.PubMedView ArticleGoogle Scholar
- Nielsen CB, Friedman B, Birren B, Burge CB, Galagan JE: Patterns of intron gain and loss in Fungi. PLoS Biol. 2004, 2 (12): e422-10.1371/journal.pbio.0020422.PubMed CentralPubMedView ArticleGoogle Scholar
- Li W, Tucker AE, Sung W, Thomas WK, Lynch M: Extensive, recent intron gains in Daphnia populations. Science. 2009, 326: 1260-1262. 10.1126/science.1179302.PubMedView ArticleGoogle Scholar
- Torriani SFF, Stukenbrock EH, Brunner PC, Donald BAM, Croll D: Evidence for extensive recent intron transposition in closely related Fungi. Curr Biol. 2011, 21 (23): 2017-2022. 10.1016/j.cub.2011.10.041.PubMedView ArticleGoogle Scholar
- Veis A: Amelogenin gene splice products: potential signaling molecules. Cell Mol Life Sci. 2003, 60 (1): 38-55. 10.1007/s000180300003.PubMedView ArticleGoogle Scholar
- Marcet-Houben M, Gabaldón T: Acquisition of prokaryotic genes by fungal genomes. Trends Genet. 2010, 26 (1): 5-8. 10.1016/j.tig.2009.11.007.PubMedView ArticleGoogle Scholar
- Da Lage J-L, Danchin EGJ, Casane D: Where do animal α-amylases come from? An interkingdom trip. FEBS Lett. 2007, 581: 3927-3935. 10.1016/j.febslet.2007.07.019.PubMedView ArticleGoogle Scholar
- Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B: The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucl Ac Res. 2009, 37: D233-D238. 10.1093/nar/gkn663.View ArticleGoogle Scholar
- Grigoriev IV, Nordberg H, Shabalov I, Aerts A, Cantor M, Goodstein D, Kuo A, Minovitsky S, Nikitin R, Ohm RA, et al: The genome portal of the department of energy joint genome Institute. Nucl Ac Res. 2012, 40 (1): D26-D32. 10.1093/nar/gkr947.View ArticleGoogle Scholar
- Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30 (14): 3059-3066. 10.1093/nar/gkf436.PubMed CentralPubMedView ArticleGoogle Scholar
- Talavera G, Castresana J: Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007, 56: 564-577. 10.1080/10635150701472164.PubMedView ArticleGoogle Scholar
- Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704. 10.1080/10635150390235520.PubMedView ArticleGoogle Scholar
- Dereeper A, Guignon V, Blanc G, Audic S, Buffet S, Chevenet F, Dufayard J-F, Guindon S, Lefort V, Lescot M, et al: Phylogeny.fr: robust phylogenetic analysis for the non-specialist. Nucl Ac Res. 2008, 36 (Web Server Issue): W465-W469.View ArticleGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.PubMed CentralPubMedView ArticleGoogle Scholar
- Binder M, Hibbett DS, Larsson K-H, Larsson E, Langer E, Langer G: The phylogenetic distribution of resupinate forms across the major clades of mushroom-forming fungi (Homobasidiomycetes). Syst Biodiv. 2005, 3 (2): 113-157. 10.1017/S1477200005001623.View ArticleGoogle Scholar
- James TY, Kauff F, Schoch CL, Matheny PB, Hofstetter V, Cox CJ, Celio G, Gueidan C, Fraker E, Miadlikowska J, et al: Reconstructing the early evolution of Fungi using a six-gene phylogeny. Nature. 2006, 443 (7113): 818-822. 10.1038/nature05110.PubMedView ArticleGoogle Scholar
- Matheny PB, Curtis JM, Hoffstetter V, Aime MC, Moncalvo J-M, Ge Z-W, Yang Z-L, Slot JC, Ammirati JF, Baroni TJ, et al: Major clades of Agaricales: a multilocus phylogenetic overview. Mycologia. 2006, 98 (6): 982-995. 10.3852/mycologia.98.6.982.PubMedView ArticleGoogle Scholar
- Fitzpatrick DA, Logue ME, Stajich JE, Butler G: A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis. BMC Evol Biol. 2006, 6: 99-10.1186/1471-2148-6-99.PubMed CentralPubMedView ArticleGoogle Scholar
- Spatafora JW, Sung G, Johnson D, Hesse C, O'Rourke B, Serdani M, Spotts R, Lutzoni F, Hofstetter V, Miadlikowska J, et al: A five-gene phylogeny of Pezizomycotina. Mycologia. 2006, 98 (6): 1018-1028. 10.3852/mycologia.98.6.1018.PubMedView ArticleGoogle Scholar
- Matheny PB, Wang Z, Binder M, Curtis JM, Lim YW, Nilsson RH, Hughes KW, Hofstetter V, Ammirati JF, Schoch CL, et al: Contributions of rpb2 and tef1 to the phylogeny of mushrooms and allies (Basidiomycota, Fungi). Mol Phylogenet Evol. 2007, 43: 430-451. 10.1016/j.ympev.2006.08.024.PubMedView ArticleGoogle Scholar
- Marcet-Houben M, Gabaldón T: The tree versus the Forest: the fungal tree of life and the topological diversity within the yeast phylome. PLoS One. 2009, 4 (9): e4357-PubMed CentralPubMedView ArticleGoogle Scholar
- Garcia-Sandoval R, Wang Z, Binder M, Hibbett D: Molecular phylogenetics of the Gloeophyllales and relative ages of clades of Agaromycotina producing a brown rot. Mycologia. 2011, 103 (3): 510-524. 10.3852/10-209.PubMedView ArticleGoogle Scholar
- Skrede I, Engh IB, Binder M, Carlsen T, Kauserud H, Bendiksby M: Evolutionary history of Serpulaceae (Basidiomyota): molecular phylogeny, historical biogeography and evidence for a single transition of nutritional mode. BMC Evol Biol. 2011, 11 (30):
- Floudas D, Binder M, Riley R, Barry K, Blanchette RA, Henrissat B, Martinez AT, Otillar R, Spatafora JW, Yadav JS, et al: The Paleozoic origin of enzymatic lignin decomposition reconstructed from 31 fungal genomes. Science. 2012, 336 (6089): 1715-1719. 10.1126/science.1221748.PubMedView ArticleGoogle Scholar
- Drummond AJ, Suchard MA, Xie D, Rambaut A: Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012, 29 (8): 1969-1973. 10.1093/molbev/mss075.PubMed CentralPubMedView ArticleGoogle Scholar
- Rambaut A, Drummond AJ: Tracer v1.4. 2007, Available from http://beast.bio.ed.ac.uk/TracerGoogle Scholar
- Redecker D, Kodner R, Graham LE: Glomalean fungi from the Ordovician. Science. 2000, 289: 1920-1921. 10.1126/science.289.5486.1920.PubMedView ArticleGoogle Scholar
- Taylor TN, Hass H, Kerp H: The oldest fossil ascomycetes. Nature. 1999, 399: 648-649. 10.1038/21349.PubMedView ArticleGoogle Scholar
- Taylor TN, Hass H, Kerp H, Krings M, Hanlin RT: Perithecial ascomycetes from the 400 million year old Rhynie chert: an example of ancestral polymorphism. Mycologia. 2005, 97 (1): 269-285. 10.3852/mycologia.97.1.269.PubMedView ArticleGoogle Scholar
- Stam MR, Danchin EGJ, Rancurel C, Coutinho PM, Henrissat B: Dividing the large glycoside hydrolase family 13 into subfamilies: towards improved functional annotations of α-amylase-related proteins. Prot Engineer Design Sel. 2006, 19 (12): 555-562. 10.1093/protein/gzl044.View ArticleGoogle Scholar
- Da Lage J-L, Feller G, Janeček Š: Horizontal gene transfer from Eukarya to Bacteria and domain shuffling: the α-amylase model. Cell Mol Life Sci. 2004, 61: 97-109. 10.1007/s00018-003-3334-y.PubMedView ArticleGoogle Scholar
- Rogozin IB, Sverdlov AV, Babenko VN, Koonin EV: Analysis of evolution of exon-intron structure of eukaryotic genes. Brief Bioinform. 2005, 6 (2): 118-134. 10.1093/bib/6.2.118.PubMedView ArticleGoogle Scholar
- Maddison WP, Maddison DR: Mesquite: a modular system for evolutionary analysis. Version 2.75. 2011Google Scholar
- Vernot B, Stolzer M, Goldman A, Durand D: Reconciliation with non-binary species trees. J Comput Biol. 2008, 15 (8): 981-1006. 10.1089/cmb.2008.0092.PubMed CentralPubMedView ArticleGoogle Scholar
- Csurös M: Malin: maximum likelihood analysis of intron evolution in eukaryotes. Bioinformatics. 2008, 24 (13): 1538-1539. 10.1093/bioinformatics/btn226.PubMed CentralPubMedView ArticleGoogle Scholar
- Henrissat B, Davies G: Structural and sequence-based classification of glycoside hydrolases. Curr Op Struc Biol. 1997, 7 (5): 637-644. 10.1016/S0959-440X(97)80072-3.View ArticleGoogle Scholar
- Chen W, Xie T, Shao Y, Chen F: Phylogenomic relationships between amylolytic enzymes from 85 strains of Fungi. PLoS One. 2012, 7 (11): e49679-10.1371/journal.pone.0049679.PubMed CentralPubMedView ArticleGoogle Scholar
- Richards TA: Genome evolution: horizontal movements in the Fungi. Curr Biol. 2011, 21 (4): R166-10.1016/j.cub.2011.01.028.PubMedView ArticleGoogle Scholar
- Fedorov A, Suboch G, Bujakov M, Fedorova L: Analysis of nonuniformity in intron phase distribution. Nucl Ac Res. 1992, 20 (10): 2552-2557.View ArticleGoogle Scholar
- Lynch M: Intron evolution as a population genetic process. Proc Natl Acad Sci USA. 2002, 99 (9): 6118-6123. 10.1073/pnas.092595699.PubMed CentralPubMedView ArticleGoogle Scholar
- Dibb NJ, Newman AJ: Evidence that introns arose at proto-splice sites. EMBO J. 1989, 8: 2015-2021.PubMed CentralPubMedGoogle Scholar
- Sverdlov AV, Rogozin IB, Babenko VN, Koonin EV: Reconstruction of ancestral protosplice sites. Curr Biol. 2004, 14: 1505-1508. 10.1016/j.cub.2004.08.027.PubMedView ArticleGoogle Scholar
- Iwata H, Gotoh O: Comparative analysis of information content relevant to recognition of introns in many species. BMC Genomics. 2011, 12: 45-10.1186/1471-2164-12-45.PubMed CentralPubMedView ArticleGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14: 1188-1190. 10.1101/gr.849004.PubMed CentralPubMedView ArticleGoogle Scholar
- Ahmadinejad N, Dagan T, Gruenheit N, Martin W, Gabaldon T: Evolution of spliceosomal introns following endosymbiotic gene transfer. BMC Evol Biol. 2010, 10: 57-10.1186/1471-2148-10-57.PubMed CentralPubMedView ArticleGoogle Scholar
- Kordiš D, Kokošar J: What can domesticated genes tell us about the intron gain in Mammals. Int J Evol Biol. 2012: 278981-
- Janeček Š, Svensson B, MacGregor EA: Relation between domain evolution, specificity, and taxonomy of the α-amylase family members containing a C-terminal starch-binding domain. Eur J Biochem. 2003, 270: 635-645. 10.1046/j.1432-1033.2003.03404.x.PubMedView ArticleGoogle Scholar
- Rodriguez-Sanoja R, Oviedo N, Sanchez S: Microbial starch-binding domain. Curr Opin Microbiol. 2005, 8: 260-267. 10.1016/j.mib.2005.04.013.PubMedView ArticleGoogle Scholar
- Yenerall P, Krupa B, Zhou L: Mechanismes of intron gain and loss in Drosophila. BMC Evol Biol. 2011, 11: 364-10.1186/1471-2148-11-364.PubMed CentralPubMedView ArticleGoogle Scholar
- Cohen NE, Shen R, Carmel L: The role of reverse transcriptase in intron gain and loss mechanisms. Mol Biol Evol. 2012, 29 (1): 179-186. 10.1093/molbev/msr192.PubMedView ArticleGoogle Scholar
- Ragg H: Intron creation and DNA repair. Cell Mol Life Sci. 2010, 68 (2): 235-242.PubMedView ArticleGoogle Scholar
- Rogozin IB, Carmel L, Csuros M, Koonin EV: Origin and evolution of spliceosomal introns. Biol Direct. 2012, 7: 11-10.1186/1745-6150-7-11.PubMed CentralPubMedView ArticleGoogle Scholar
- Carmel L, Wolf YI, Rogozin IB, Koonin EV: Three distinct modes of intron dynamics in the evolution of eukaryotes. Genome Res. 2007, 17: 1034-1044. 10.1101/gr.6438607.PubMed CentralPubMedView ArticleGoogle Scholar
- Babenko VN, Rogozin IB, Mekhedov SL, Koonin EV: Prevalence of intron gain over intron loss in the evolution of paralogous gene families. Nucl Ac Res. 2004, 32 (12): 3724-3733. 10.1093/nar/gkh686.View ArticleGoogle Scholar
- Lynch M, Richardson AO: The evolution of spliceosomal introns. Curr Opin Genet Dev. 2002, 12: 701-710. 10.1016/S0959-437X(02)00360-X.PubMedView ArticleGoogle Scholar
- Lynch M, Conery JS: The origins of genome complexity. Science. 2003, 302: 1401-1404. 10.1126/science.1089370.PubMedView ArticleGoogle Scholar
- Garcia-Vallvé S, Romeu A, Palau J: Horizontal gene transfer of glycosyl hydrolases of the rumen fungi. Mol Biol Evol. 2000, 17 (3): 352-361. 10.1093/oxfordjournals.molbev.a026315.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.