Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster
BMC Evolutionary Biology volume 7, Article number: 111 (2007)
The biosynthesis of aflatoxin (AF) involves over 20 enzymatic reactions in a complex polyketide pathway that converts acetate and malonate to the intermediates sterigmatocystin (ST) and O-methylsterigmatocystin (OMST), the respective penultimate and ultimate precursors of AF. Although these precursors are chemically and structurally very similar, their accumulation differs at the species level for Aspergilli. Notable examples are A. nidulans that synthesizes only ST, A. flavus that makes predominantly AF, and A. parasiticus that generally produces either AF or OMST. Whether these differences are important in the evolutionary/ecological processes of species adaptation and diversification is unknown. Equally unknown are the specific genomic mechanisms responsible for ordering and clustering of genes in the AF pathway of Aspergillus.
To elucidate the mechanisms that have driven formation of these clusters, we performed systematic searches of aflatoxin cluster homologs across five Aspergillus genomes. We found a high level of gene duplication and identified seven modules consisting of highly correlated gene pairs (aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL). With the exception of A. nomius, contrasts of mean Ka/Ks values across all cluster genes showed significant differences in selective pressure between section Flavi and non-section Flavi species. A. nomius mean Ka/Ks values were more similar to partial clusters in A. fumigatus and A. terreus. Overall, mean Ka/Ks values were significantly higher for section Flavi than for non-section Flavi species.
Our results implicate several genomic mechanisms in the evolution of ST, OMST and AF cluster genes. Gene modules may arise from duplications of a single gene, whereby the function of the pre-duplication gene is retained in the copy (aflF/aflE) or the copies may partition the ancestral function (aflA/aflB). In some gene modules, the duplicated copy may simply augment/supplement a specific pathway function (aflR/aflS and aflX/aflY) or the duplicated copy may evolve a completely new function (aflT/aflQ and aflC/aflW). Gene modules that are contiguous in one species and noncontiguous in others point to possible rearrangements of cluster genes in the evolution of these species. Significantly higher mean Ka/Ks values in section Flavi compared to non-section Flavi species indicate increased positive selection acting in the evolution of genes in OMST and AF gene clusters.
Filamentous fungi produce a wide variety of economically important secondary metabolites (extrolites). An extrolite is any outwardly directed chemical compound that is excreted or accumulated in the cell wall of a living organism. Many of these extrolite compounds are beneficial, such as antibiotics, food grade pigments, enzymes, vitamins, lipids, and various pharmaceuticals; however, others, such as mycotoxins, have deleterious effects. Mycotoxins are some of the most toxic natural substances known and have been estimated to contaminate up to 25% of the world's food production. Although mycotoxins are widespread, the evolutionary/ecological basis for their production is largely unknown. There are several classes of mycotoxins, based on structural and chemical properties, including polyketides (e.g. sterigmatocystin and aflatoxins), cyclic peptides, alkaloids, sesquiterpenoids (e.g. trichothecenes) and epipolythiodioxopiperazines (e.g. gliotoxin). The aflatoxin (AF) pathway is one of the most intensively studied and well characterized of the polyketide pathways. Aflatoxins are a family of toxic and carcinogenic metabolites that are responsible for contamination of agricultural crops, resulting in staggering losses to the agricultural industry and untold impact on human health worldwide [7, 8].
Aflatoxin-producing fungi primarily belong to Aspergillus section Flavi, which includes A. flavus and A. parasiticus, the species most responsible for aflatoxin contamination of oil-rich crops such as corn, peanuts, cottonseed, and tree nuts . There are four major classes of AF, depending on the presence of the characteristic polyketide dihydro- (B1 and G1) or tetrahydro- (B2 and G2) bisfuran rings  (Figure 1). A. flavus produces aflatoxins B1 and B2 and often another mycotoxin, cyclopiazonic acid (CPA) [11, 12]. Isolates differ considerably in the amount of aflatoxins produced, and populations of A. flavus vary in proportions of strains that produce both aflatoxins and CPA, aflatoxins alone, CPA alone, and neither mycotoxin . Divergence within A. flavus has allowed for further classification of two phenotypic groups based on the morphology of the sclerotia, which are either large (L) or small (S) with a diameter of greater than or less than 400 μm, respectively . Geiser et al. [13, 14] subdivided A. flavus into two groups based on RFLPs of nuclear-coding genes and DNA sequences. Group I contains both L and S strains that produce aflatoxins B1 and B2, whereas Group II comprises only S strains that often produce B and G aflatoxins and represents, at least in part, an unnamed taxon. A. parasiticus primarily infects peanuts and is uncommon in aerial crops such as corn and cottonseed . The species produces both B and G aflatoxins at generally high concentrations and nonaflatoxigenic isolates are uncommon; CPA is not produced . Nonaflatoxigenic isolates of A. parasiticus instead often accumulate O-methylsterigmatocystin (OMST), an immediate precursor to aflatoxin B1 . Section Flavi species other than A. flavus and A. parasiticus are mostly of minor importance to agriculture and include A. nomius, A. bombycis, and the unnamed taxon, all of which produce aflatoxins B1, B2, G1, and G2, and A. pseudotamarii, which produces aflatoxins B1 and B2 [15, 16].
To better understand aflatoxin production in the Aspergilli, the organization, function and regulation of genes involved in AF biosynthesis have been a focus of study [17, 18]. The genes in AF biosynthesis are clustered in a 70-kb DNA region and encode at least 23 coregulated transcripts under the control of the regulatory gene aflR [19, 20]. In both the AF and sterigmatocystin (ST) gene clusters, aflR is a positive regulatory gene required for the transcriptional activation of most, if not all, pathway genes. As shown in Figure 1, ST is produced by several fungal species, including A. nidulans, a model genetic system that has been used to identify the genes involved in ST biosynthesis. The ST and OMST precursors are environmentally stable mycotoxins and are chemically and structurally similar to AF. The accumulation of particular extrolites of the AF biosynthetic family often differs at the species level for Aspergilli. For instance, A. nidulans synthesizes only ST, while strains of A. ochraceoroseus have been shown to accumulate ST and AF (Figure 1). In comparison, Aspergillus species in section Flavi, including A. flavus, A. parasiticus, A. bombycis, A. nomius, and A. pseudotamarii, predominantly synthesize AF. These section Flavi species have an identical cluster configuration, whereas gene order in A. ochraceoroseus is more similar to the ST cluster in A. nidulans, indicating that gene order does not determine whether ST or AF is synthesized. The recent availability of the complete genome of A. flavus as well as other Aspergillus species [24–26] will allow us to further assess the role of gene duplication, recruitment and reorganization in the evolution of this important pathway.
To date eight Aspergillus genomes have been sequenced, including the model organism A. nidulans  and species of industrial (A. niger , A. oryzae ), medical (A. fumigatus , A. terreus , A. fischerianus , A. clavatus ) and agricultural (A. flavus ) importance. All genomes contain eight chromosomes but vary in their overall size and in the number of predicted genes. For example, the genomes of A. oryzae (37.2 Mb, 12,319 predicted genes ) and A. flavus (36.3 Mb, 13,091 predicted genes ) are very similar and approximately 20% larger than the genomes of A. fumigatus (28.8 Mb, 10,114 predicted genes ), A. nidulans (30.1 Mb, 10,701 predicted genes ) and A. terreus (29.2 Mb, 10,406 predicted genes ). Preliminary comparative genome analyses reveal large non-syntenous regions resulting from insertions or deletions in subtelomeric sequences, intra-molecular recombinations, variation in the number of repeated elements, tandem repeats, and gene duplicates . The proximity of the AF gene cluster to the telomere in A. flavus, and the enrichment of secondary metabolite genes in subtelomeric regions in the Aspergilli in general, may facilitate the rapid reorganization and evolution of these genes in a species-specific fashion. This may explain the specificity of AF pathway extrolite profiles (chemotypes) for specific Aspergillus taxa.
The biological significance of AF chemotypes, like that of the majority of fungal secondary extrolites, is unclear. Numerous intriguing ideas regarding the function of AF pathway gene products have been offered and studies indicate that the role of these compounds in the survival of Aspergillus spp. may be extremely diverse [35, 36]. Aflatoxins are not essential to the growth of Aspergilli under certain conditions and are not required for successful competition in AF-producing strains [35, 37]. However, there may be an association between the biosynthesis of AF and developmental processes governing sporulation. Several studies have demonstrated that chemical inhibitors, mutations, and various environmental stimuli that suppress the synthesis of AF also affect or inhibit sporulation in Aspergillus spp. [36, 38]. Although we do not fully understand the biological significance of AF extrolites, the fact that AF and ST clusters are under strong purifying selection  indicates that clustering is actively maintained to counteract degradation by random neutral processes. In this study, we show that gene duplication and modularity as well as positive selection are responsible for the ordering and clustering of genes in the AF pathway of Aspergillus.
AF homologs and gene modules in Aspergillus
We used the predicted polypeptide sequences in the A. parasiticus AF gene cluster as our reference sequences in TBLASTN and TBLASTX comparisons of the A. nidulans, A. fumigatus, A. flavus, A. terreus, and A. oryzae genome databases. The genomes for A. nidulans, A. fumigatus, A. flavus, A. terreus and A. oryzae provide 13X, 11X, 10X, 11X and 9X sequence coverage, respectively [24–26, 34]. Table 1 summarizes, for the two best homologs across all five Aspergillus genomes, the map location (chromosome or contig), E-value, percent coverage, and gene orientation, that is, the direction of transcription according to whether the top (+) or bottom (-) strand is transcribed. The total number of putative duplicates for each cluster gene is plotted in Figure 2A.
In general, there is conservation of gene order and direction of transcription for specific groups of two or more AF pathway genes. We tested the hypothesis that genes showing a similar pattern of copy number across species have been duplicated together in groups that we term 'gene modules'. If the average copy number was less than two across all five genomes then we also considered the proximity of genes in inferring gene modules. Correlated genes that are not genomically proximate reflect historical modules that have undergone recent reorganization. The dendrogram in Figure 2B shows that gene copy number for groups of two or three AF cluster genes is significantly correlated (P < 0.05; 0.8 < r^2 < 1). These highly correlated genes or modules, which may function as distinct biological units in AF biosynthesis, are color coded in Figure 3.
We identified seven putative gene modules across the five Aspergillus genomes. Not all genes in modules are syntenic across all genomes. There is conservation in gene order and direction of transcription for 1) all genes in the A. parasiticus, A. flavus and A. oryzae AF gene clusters, 2) modules with two genes (e.g., aflR/aflS, aflA/aflB) in the A. nidulans ST cluster and the A. parasiticus, A. flavus and A. oryzae AF clusters, and 3) at least two cluster genes (aflA/aflB) in the A. fumigatus and A. terreus genomes (Figure 3). Syntenic partial clusters of five genes (aflC, aflS, aflR, aflX and aflY) were identified in A. fumigatus and A. terreus. Both the A. fumigatus partial cluster and the A. nidulans ST cluster reside on chromosome 4, while the A. parasiticus, A. flavus and A. oryzae AF gene clusters are located near the telomere of chromosome 3. From these data alone, the phylogenetic relationships among A. fumigatus, A. terreus, A. nidulans and section Flavi species cannot be fully resolved, but the observed synteny in the partial clusters of A. fumigatus and A. terreus may indicate that similar evolutionary mechanisms have influenced the evolution of these clusters. Gene modules that are contiguous only in the AF clusters of certain species may arise from gene reorganization that reunites previously separated genes. A striking example is aflG/aflL, which is contiguous only in the cluster of section Flavi species, suggesting either recruitment from other genomic locations or reorganization of cluster genes from an ST ancestor (Figure 3). Population genetic analyses of molecular sequence variation in the aflatoxin gene cluster of A. parasiticus support the latter hypothesis. The genes in the other putative modules, aflF/aflE, aflT/aflQ, and aflC/aflW, are separated by more than 35 kb in ST and AF gene clusters.
There was no evidence of partial clustering of two or more gene modules residing outside the AF and ST clusters. Thus, we focused on the gene module itself and examined the orientation and separation of genes in modules residing outside the cluster (Table 1). Our definition of a gene module is independent of the physical proximity of genes. Even gene modules that are syntenic in all species clusters vary in their degree of synteny when residing outside of the cluster. For example, in A. flavus, the two aflA/aflB gene modules that map to chromosome 3 but reside outside the cluster are nonsyntenous. In one module, the aflA and aflB genes are separated by 30 kb and in the other module by approximately 40 kb. Other gene modules residing outside the cluster show a high degree of synteny. For example, a copy of aflF/aflE on chromosome 7 of A. nidulans (not shown in Table 1) is contiguous and aflF and aflE are separated by less than 1 kb, comparable to the distance separating contiguous gene modules in the cluster. In some cases the orientation of genes in modules residing outside the cluster in one species matches the configuration of genes in a different species. For example, a copy of the aflX/aflY module on chromosome 8 of A. nidulans (Table 1) has the same order and gene orientation as aflX/aflY found in the AF clusters of section Flavi species (both genes negatively transcribed). This conservation further supports the vertical transmission of these modules.
Initially we observed conserved syntenic relationships among AF gene clusters that mirrored phylogenetic species groupings. For example, within section Flavi, all species show high conservation in gene order and direction of transcription. A second grouping that includes A. fumigatus and A. terreus has conserved partial clusters. The apparent outlier, A. nidulans, shares gene modules with both groups as well as local rearrangements of modules, giving rise to a unique cluster configuration that is intermediate in size to partial and full gene clusters. Indeed, if cluster configuration is indicative of higher-order phylogenetic relationships among these species, then molecular variation in cluster genes would be expected to track with the underlying phylogeny and could potentially also be linked to evolutionary/ecological processes of species adaptation and diversification.
The impact of positive (adaptive) or negative (purifying) selection on putative orthologs in full or partial AF clusters in Aspergillus was determined by calculating the ratio of amino acid (Ka) to synonymous (Ks) substitutions using GenomeHistory . The magnitude of the Ka/Ks ratio provides evidence of genes under strong functional constraints (Ka/Ks < 1) or undergoing adaptive evolution (Ka/Ks > 1). We considered a linear model that parameterizes the selective pressure (Ka/Ks) on gene clusters in terms of variation across all cluster genes and species. Contrasts between section Flavi and non-section Flavi species showed significant differences in mean Ka/Ks values (t = -6.78, P < 0.0001), and mean Ka/Ks values were significantly higher for section Flavi species than for non-section Flavi species (Figure 4). With the exception of A. nomius, pairwise contrasts among section Flavi species indicated no significant differences in mean Ka/Ks values for A. parasiticus, the A. parasiticus partial cluster duplication, A. flavus and A. oryzae. Similarly, there were no significant differences in mean Ka/Ks values among non-section Flavi species; however, mean Ka/Ks values for A. nomius were more similar to Ka/Ks values of partial clusters in A. fumigatus and A. terreus than to the A. nidulans cluster (t = 3.13, P < 0.01).
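The interpretation of the Ka/Ks ratio described above can be sketched in a few lines of Python. This is only an illustration of the thresholds, not the GenomeHistory implementation; the function names are hypothetical, and treating a ratio of exactly 1 as neutral and a zero Ks as undefined are conventional assumptions rather than statements from the text.

```python
from typing import Optional


def ka_ks_ratio(ka: float, ks: float) -> Optional[float]:
    """Return Ka/Ks, or None when Ks is zero and the ratio is undefined."""
    if ks <= 0:
        return None
    return ka / ks


def selection_regime(ratio: float) -> str:
    """Map a Ka/Ks ratio onto the regimes named in the text."""
    if ratio < 1:
        return "purifying"   # functional constraint (Ka/Ks < 1)
    if ratio > 1:
        return "positive"    # adaptive evolution (Ka/Ks > 1)
    return "neutral"         # assumed convention for Ka/Ks == 1
```

A single ratio per gene is a coarse summary, which is why the analysis compares mean values across genes and species rather than classifying genes one at a time.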
Our systematic genomic searches for duplicated AF cluster homologs followed by correlation analysis revealed seven putative gene modules: aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL. Not all the genes in these modules are contiguous across all five Aspergillus species. The strong correlation observed among noncontiguous members of gene modules that are sometimes separated by more than 30 kb is consistent with vertical transmission but argues against horizontal transfer, which would require a simultaneous transfer of unlinked copies to all species, a highly unlikely event. Further evidence in support of vertical transmission is the report of putative homologs of AF genes in the pine needle pathogen, Dothistroma septosporum (previously known as D. pini; [42, 43]) and in the plant pathogen, Cercospora nicotianae . Among the putative AF orthologs identified in D. septosporum, the gene with the highest percent amino acid identity, dotA, shows 80% similarity to aflM of A. parasiticus [42, 43]. In C. nicotianae, the CRG1 N-terminus zinc finger motif is homologous to the zinc finger domains of various regulatory proteins, including aflR of Aspergillus species . The existence of aflM and aflR homologs in two ascomycete classes (Dothideomycetes and Eurotiomycetes) further argues against horizontal gene transfer and suggests that high sequence identity is the result of descent from a common ancestor and strong purifying selection.
It has been long proposed that metabolic gene clusters may be transferred horizontally between organisms [45, 46]; however, direct experimental evidence that horizontal gene transfer maintains clustering in fungi is lacking. The phylogenetic evidence in support of horizontal gene transfer is also weak. In fact, phylogenetic analysis of polyketide synthases among fungal species indicates that gene duplications and losses can explain the data equally well and there is no need to invoke horizontal gene transfer . Our comparative analyses suggest that intra-genomic reorganization followed by vertical descent and gene loss is a more plausible mechanism and may explain the variation in chemotype profiles for different Aspergillus species. For example, A. nomius and A. bombycis produce both B and G aflatoxins whereas A. flavus synthesizes predominantly B aflatoxins. Species producing only B aflatoxins may have evolved due to the loss of genes required for the synthesis of G aflatoxins . Specifically, aflU, which is missing or nonfunctional in A. flavus isolates, may be important in G aflatoxin production since the disruption of aflU in A. parasiticus results in the production of only B aflatoxins . Indeed, the location of the AF cluster in the telomeric region of A. nidulans, A. flavus and A. oryzae would facilitate gene loss as well as recombination, DNA inversions, partial deletions, translocations and other genomic rearrangements [39, 48–50].
Comparative analysis of complete and partial AF clusters across five Aspergillus species revealed a striking modular organization of pathway genes. We hypothesize that gene modules that are contiguous in one species and noncontiguous in others are the result of rearrangements in an ancestral species. For example, four cluster genes separate aflG and aflL in A. nidulans whereas aflG and aflL are contiguous in section Flavi gene clusters. If aflG and aflL underwent reorganization in the evolution of section Flavi species from an ancestor with a cluster configuration similar to A. nidulans, this suggests that the arrangement of aflG and aflL in the cluster does not determine whether ST or AF is synthesized. Indeed, A. ochraceoroseus has a cluster configuration very similar to A. nidulans and can synthesize both ST and AF . Furthermore, gene modules need not be contiguous or clustered to remain functional. For example, an aflR duplicate that resides outside the cluster in some A. parasiticus strains has been reported to regulate AF biosynthesis , and aflR in the cluster can control the expression of other genes within the genome . In contrast, aflD is not expressed at native levels when moved outside of the A. parasiticus cluster, indicating that clustering does play an important role in regulating the expression of some AF biosynthetic genes .
Several hypotheses have been proposed to explain clustering in fungal genomes. Clustering can be a means of optimizing coregulation of genes, although clustering is not a prerequisite for coregulation as evidenced by the discovery of global regulatory genes of secondary metabolite clusters in Aspergillus spp. [54, 55]; conversely, regulatory genes contained within gene clusters can control the expression of other genes outside of the clusters . Selection acting on the cluster itself has also been invoked to explain the presence of gene clusters. In this case, the selection is independent of the selective advantage that the products of the pathway confer on the host organism . This "selfish cluster" hypothesis postulates that horizontal gene transfer is an important mechanism for propagating and maintaining gene clusters in eukaryotes, reminiscent of the "selfish operon" hypothesis proposed in prokaryotes . Other hypotheses postulate coadaptation and possibly gene duplication and differentiation as driving forces in gene cluster evolution .
Several mechanisms may have been important in the evolution and retention of AF gene modules. Gene modules may have arisen from duplications of a single gene whereby the copy retained the function of the pre-duplication gene, as observed with the nor reductase genes, aflF/aflE . Alternatively, gene modules may have undergone subfunctionalization in which copies partition the ancestral function, as with the fatty acid synthases, aflA/aflB [57, 58]. Other gene modules comprise genes that augment a specific pathway function, as exemplified by aflR/aflS, the pathway-specific transcription activator and enhancer , and aflX/aflY, the genes required for the conversion of versicolorin A to demethylsterigmatocystin . The functional relationships among genes in noncontiguous modules aflT/aflQ and aflC/aflW are unknown but could include neofunctionalization, an adaptive process in which a completely new function has evolved for the duplicated copy. In addition to these localized gene duplication events, we cannot rule out a whole-genome duplication in an Aspergillus ancestor; conclusive evidence for this will require further analysis of gene duplicates among several genomes .
Adaptive processes may extend beyond gene modules to entire clusters of genes. We hypothesize that gene cluster evolution was driven by selection for new chemotypes, in this case, OMST and AF from an ST ancestor. If AF gene clusters evolved by the reorganization and recruitment of additional genes in an ST ancestor, then partial clusters synthesizing intermediate compounds might represent the earliest or ancestral clusters. Are the partial clusters identified in A. fumigatus and A. terreus functional and are they the building blocks for larger clusters? Phylogenetic studies with sufficient taxon sampling suggest that A. fumigatus and A. terreus are ancestral to section Flavi [24, 62]. Both A. fumigatus and A. terreus have the aflA/aflB gene modules and partial clusters of five genes: aflC, aflS, aflR, aflX and aflY. It has been speculated that a partial cluster consisting of aflC, aflR, aflS, aflA, and aflB would have allowed an Aspergillus ancestor to stabilize the polyketide to an anthraquinone . Anthraquinones are colorful polycyclic aromatic hydrocarbons that accumulate in spores and may aid in their dispersal via arthropods and protection from predation . Spore dispersal would impart increasing selective pressures on fungi to synthesize an arsenal of polyketide derivatives to facilitate the colonization of diverse and sometimes hostile environments. Indeed, our estimates of mean Ka/Ks values were significantly higher in section Flavi than in non-section Flavi species, indicating increased positive selection acting on genes in OMST and AF clusters relative to the ST cluster in A. nidulans and partial clusters in A. fumigatus and A. terreus.
Overall Ka/Ks ratios for AF homologs were less than one for both section Flavi and non-section Flavi species, indicating an ongoing process of purifying selection acting to eliminate mutations that have deleterious effects on chemotype biosynthesis. Our estimates of Ka/Ks were consistent with values reported by Ehrlich and coworkers in AF and ST clusters . Within section Flavi, our micro-evolutionary analyses in A. parasiticus  suggest that the most recent common ancestor (MRCA) either produced high levels of G1 relative to B1 or was an OMST producer. Since no species is known to produce only G aflatoxins, a more likely hypothesis is that the MRCA of section Flavi was a B and G aflatoxin producer and that selection has been acting on the G1/B1 ratio. One possible MRCA is A. nomius, a clear outgroup to section Flavi species that produces both B and G aflatoxins [63, 64]. Another possibility is the unnamed taxon, which can also synthesize B and G aflatoxins . The differences in aflatoxins produced by different species most likely represent a complex process that involves purifying and positive selection acting on a B and G producing ancestor; specific demographic, environmental and/or evolutionary processes in populations that maintain or break down AF gene clusters; and the actions of specific genes that are involved in AF pathway regulation  or other global regulatory genes of secondary metabolite clusters [54, 55]. If the AF cluster arose from rearrangements of gene and/or gene modules in an ancestral Aspergillus species, then the signature of cluster reorganization may still be evident in descendent species. Preliminary analysis of molecular variation in the aflatoxin gene cluster of A. parasiticus  provides evidence for cluster reorganization from an ST ancestor, as well as evidence for recombination, balancing selection and chemotype-specific adaptation.
Based on correlation and cluster analyses of AF gene cluster duplicates across five Aspergillus species, we inferred seven gene modules: aflA/aflB, aflR/aflS, aflX/aflY, aflF/aflE, aflT/aflQ, aflC/aflW, and aflG/aflL. Our definition of a module includes the possibility that genes may become separated after their duplication, and we hypothesize that differences in gene order between AF and ST clusters may be the result of gene reorganization in an ST ancestor. Gene duplication and vertical transmission appear to be the driving forces in the evolution and retention of AF gene modules across all five Aspergillus species. Gene modules may arise from duplications of a single gene, whereby the copy retains the function of the pre-duplication gene (aflF/aflE) or partitions the ancestral function (aflA/aflB). Alternatively, the duplicated copy may simply augment or supplement a specific pathway function (aflR/aflS and aflX/aflY) or evolve a completely new function, as exemplified by aflT/aflQ and aflC/aflW. Significantly higher mean Ka/Ks values in section Flavi compared to non-section Flavi species are evidence of adaptation and increased positive selection acting on genes in OMST and AF clusters relative to the ST cluster in A. nidulans and partial clusters in A. fumigatus and A. terreus. Whether patterns of gene duplication and modularity in the aflatoxin gene cluster are further influenced by evolutionary processes in populations that maintain or break down AF gene clusters is unknown and is an important area of further research.
AF homologs in Aspergillus
Genes were considered orthologous if they satisfied the following criteria: 1) at least two genes were syntenic, 2) the genes were the best reciprocal TBLASTN and TBLASTX hits with an E-value less than 10^-8, and 3) the genes showed amino acid similarities of approximately 40% or greater and at least 70% of the amino acids could be aligned to the reference sequence. Results from BLAST searches were further parsed to determine if cluster genes were single copy or duplicated. The total number of putative gene copies within each genome was determined using the above criteria with two exceptions: 1) reciprocal BLAST hits were not performed, and 2) an E-value less than 10^-20 was used when there was more than one copy to decrease the number of false positives.
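The criteria above can be expressed as a small filter. The sketch below is a hedged illustration in Python: the `Hit` record layout and function names are hypothetical, and two points are interpretations rather than statements from the text, namely that synteny is not required of each counted copy and that the single best candidate is kept even if it misses the stricter threshold.

```python
from dataclasses import dataclass
from typing import List

# Thresholds taken from the criteria stated in the text.
ORTHOLOG_MAX_EVALUE = 1e-8
DUPLICATE_MAX_EVALUE = 1e-20
MIN_SIMILARITY_PCT = 40.0
MIN_ALIGNED_PCT = 70.0


@dataclass
class Hit:
    """One BLAST hit against a reference protein (hypothetical record layout)."""
    gene: str
    evalue: float
    similarity_pct: float   # amino acid similarity to the reference
    aligned_pct: float      # share of reference residues that could be aligned
    reciprocal_best: bool   # best reciprocal TBLASTN/TBLASTX hit
    syntenic: bool          # syntenic with at least one other cluster gene


def passes_alignment(hit: Hit) -> bool:
    """Criterion 3: similarity and aligned fraction."""
    return (hit.similarity_pct >= MIN_SIMILARITY_PCT
            and hit.aligned_pct >= MIN_ALIGNED_PCT)


def is_ortholog(hit: Hit) -> bool:
    """All three ortholog criteria must hold."""
    return (hit.syntenic
            and hit.reciprocal_best
            and hit.evalue < ORTHOLOG_MAX_EVALUE
            and passes_alignment(hit))


def count_copies(hits: List[Hit]) -> int:
    """Count putative copies of one gene in one genome.

    No reciprocal test is applied; when several candidates exist, the
    stricter E-value is used. Keeping at least one copy is an assumption.
    """
    candidates = [h for h in hits
                  if h.evalue < ORTHOLOG_MAX_EVALUE and passes_alignment(h)]
    if len(candidates) > 1:
        strict = [h for h in candidates if h.evalue < DUPLICATE_MAX_EVALUE]
        return max(len(strict), 1)
    return len(candidates)
```

Keeping the thresholds as named constants makes it easy to see how sensitive the copy counts are to the E-value cutoff.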
We identified as modules any group of two AF cluster genes that are highly correlated (P < 0.05; 0.8 < r^2 < 1) across the five Aspergillus genomes. We assessed correlation and clustering using Kendall's coefficient of concordance implemented in the R statistical package. This was followed by a series of F-tests to test the null hypothesis of no relationship between each pair of highly correlated genes. Significance thresholds were Bonferroni-corrected by dividing by the total number of tests performed.
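The general idea of screening gene pairs by copy-number correlation, and of dividing the significance threshold by the number of tests, can be sketched as follows. This is not the analysis performed in the study: it substitutes a plain Pearson r^2 for Kendall's coefficient of concordance and the F-tests, and the gene names and copy numbers are made up for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # constant copy number: correlation undefined, treated as 0
    return sxy / sqrt(sxx * syy)


def candidate_modules(copy_numbers, min_r2=0.8):
    """Return gene pairs whose copy numbers are positively correlated
    with r^2 above min_r2 across genomes."""
    pairs = []
    for a, b in combinations(sorted(copy_numbers), 2):
        r = pearson_r(copy_numbers[a], copy_numbers[b])
        if r > 0 and r * r > min_r2:
            pairs.append((a, b, r * r))
    return pairs


def bonferroni_alpha(alpha, n_tests):
    """Per-test significance threshold after Bonferroni correction."""
    return alpha / n_tests


# Hypothetical copy numbers across five genomes (not data from the study).
copies = {
    "geneA": [1, 3, 2, 1, 2],
    "geneB": [1, 3, 2, 1, 2],
    "geneC": [2, 1, 1, 3, 1],
}
pairs = candidate_modules(copies)
threshold = bonferroni_alpha(0.05, n_tests=3)  # three pairwise tests
```

With only five genomes per gene, any such correlation rests on very few points, which is one reason the correction for multiple tests matters.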
Phylogenetic studies support a basal placement of A. nidulans and A. terreus relative to A. fumigatus and section Flavi species [24, 62]. Because all species in section Flavi share a recent common ancestor and are related to non-section Flavi species by an underlying phylogeny, we cannot assume independence among species with respect to their Ka/Ks values. We therefore tested whether there was a difference in mean Ka/Ks values between AF cluster homologs in section Flavi versus non-section Flavi species by constructing a linear model to account for variation between genes. This model can be written as Ka/Ks = mean of all Ka/Ks values + gene effect + species effect + error.
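In symbols, the model described above could be written as follows. The notation is introduced here for illustration only and does not appear in the original: \omega_{ij} denotes the Ka/Ks value for gene i in species j, \mu the mean of all Ka/Ks values, g_i the gene effect, s_j the species effect, and \varepsilon_{ij} the error term.

```latex
\omega_{ij} = \mu + g_i + s_j + \varepsilon_{ij}
```

Under this additive form, a contrast among the species effects s_j compares groups of species after the variation between genes has been accounted for.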
We tested the null hypothesis that there is no difference in mean Ka/Ks between species in section Flavi and non-section Flavi by computing and testing species contrasts. For example, a contrast of the form c(-3,5,5,-3,-3,-3,-3,5), where the species order is A. flavus, A. fumigatus, A. nidulans, A. nomius, A. oryzae, A. parasiticus partial cluster, A. parasiticus, and A. terreus, would compare the mean Ka/Ks of the section Flavi species with the mean Ka/Ks of the non-section Flavi species. In the above contrast, all five entries for section Flavi are assigned the same numerical value (-3) and the three non-section Flavi species are given a different number (5) such that the contrast sums to zero (-3 × 5 + 5 × 3 = 0). Contrasts were computed using the fit.contrast function implemented by Gregory R. Warnes in the gmodels package in R. The function returns a matrix containing the estimated regression coefficients, standard errors, t-values and two-sided P-values. A significant test result may indicate a difference in selective constraints on amino acid substitutions or adaptive evolution between the two species groups.
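The construction of the contrast vector can be checked with a short Python sketch. It is not a reimplementation of fit.contrast, which works on the fitted model and also returns standard errors and P-values; the helper names are hypothetical, and the sketch only shows how the weights arise and that they sum to zero.

```python
# Species order as given in the text.
SPECIES = [
    "A. flavus", "A. fumigatus", "A. nidulans", "A. nomius", "A. oryzae",
    "A. parasiticus partial cluster", "A. parasiticus", "A. terreus",
]

# Entries treated as section Flavi in the contrast described in the text.
SECTION_FLAVI = {
    "A. flavus", "A. nomius", "A. oryzae",
    "A. parasiticus partial cluster", "A. parasiticus",
}


def group_contrast(levels, group):
    """Weight each member of `group` by minus the size of the other group,
    and each non-member by the size of `group`, so the weights sum to zero."""
    n_in = sum(1 for s in levels if s in group)
    n_out = len(levels) - n_in
    return [-n_out if s in group else n_in for s in levels]


def apply_contrast(weights, values):
    """Weighted sum of per-level values under the contrast."""
    return sum(w * v for w, v in zip(weights, values))


contrast = group_contrast(SPECIES, SECTION_FLAVI)
```

Applied to per-species means, the weighted sum equals 15 times the difference between the non-section Flavi group mean and the section Flavi group mean, so a value of zero corresponds to the null hypothesis of no difference between groups.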
Frisvad JC, Samson RA: Polyphasic taxonomy of Penicillium subgenus Penicillium. A guide to identification of food and air-borne terverticillate Penicillia and their mycotoxins. Studies in Mycology. 2004, 49: 1-173.
Adrio JL, Demain AL: Fungal biotechnology. Int Microbiol. 2003, 6 (3): 191-199. 10.1007/s10123-003-0133-0.
Bennett JW, Klich M: Mycotoxins. Clin Microbiol Rev. 2003, 16 (3): 497-516. 10.1128/CMR.16.3.497-516.2003.
Payne GA, Brown MP: Genetics and physiology of aflatoxin biosynthesis. Annual Review of Phytopathology. 1998, 36: 329-362. 10.1146/annurev.phyto.36.1.329.
Desjardins AE, Hohn TM, McCormick SP: Trichothecene biosynthesis in Fusarium species: chemistry, genetics, and significance. Microbiol Rev. 1993, 57 (3): 595-604.
Gardiner DM, Waring P, Howlett BJ: The epipolythiodioxopiperazine (ETP) class of fungal toxins: distribution, mode of action, functions and biosynthesis. Microbiology. 2005, 151 (Pt 4): 1021-1032. 10.1099/mic.0.27847-0.
Robens J, Cardwell KF: The costs of mycotoxin management in the United States. Aflatoxin and Food Safety. Edited by: Abbas HK. 2005, Boca Raton, CRC Taylor & Francis, 1-12.
Wang JS, Tang L: Epidemiology of aflatoxin exposure and human liver cancer. Aflatoxin and Food Safety. Edited by: Abbas HK. 2005, Boca Raton, CRC Taylor & Francis, 195-211.
Horn BW: Ecology and population biology of aflatoxigenic fungi in soil. Aflatoxin and Food Safety. Edited by: Abbas HK. 2005, Boca Raton, CRC Taylor & Francis, 95-116.
Ehrlich KC, Chang P-K, Yu J, Cotty PJ: Aflatoxin biosynthesis cluster gene cypA is required for G aflatoxin formation. Appl Environ Microbiol. 2004, 70 (11): 6518-6524. 10.1128/AEM.70.11.6518-6524.2004.
Horn BW, Dorner JW: Regional differences in production of aflatoxin B1 and cyclopiazonic acid by soil isolates of Aspergillus flavus along a transect within the United States. Appl Environ Microbiol. 1999, 65 (4): 1444-1449.
Horn BW, Greene RL, Sobolev VS, Dorner JW, Powell JH, Layton RC: Association of morphology and mycotoxin production with vegetative compatibility groups in Aspergillus flavus, A. parasiticus, and A. tamarii. Mycologia. 1996, 88 (4): 574-587. 10.2307/3761151.
Geiser DM, Pitt JI, Taylor JW: Cryptic speciation and recombination in the aflatoxin-producing fungus Aspergillus flavus. Proceedings of the National Academy of Sciences of the United States of America. 1998, 95 (1): 388-393. 10.1073/pnas.95.1.388.
Geiser DM, Dorner JW, Horn BW, Taylor JW: The phylogenetics of mycotoxin and sclerotium production in Aspergillus flavus and Aspergillus oryzae. Fungal Genet Biol. 2000, 31 (3): 169-179. 10.1006/fgbi.2000.1215.
Frisvad JC, Skouboe P, Samson RA: Taxonomic comparison of three different groups of aflatoxin producers and a new efficient producer of aflatoxin B1, sterigmatocystin and 3-O-methylsterigmatocystin, Aspergillus rambellii sp. nov. Syst Appl Microbiol. 2005, 28 (5): 442-453. 10.1016/j.syapm.2005.02.012.
Cary JW, Ehrlich KC: Aflatoxigenicity in Aspergillus: molecular genetics, phylogenetic relationships and evolutionary implications. Mycopathologia. 2006, 162 (3): 167-177. 10.1007/s11046-006-0051-8.
Yu J, Chang P-K, Ehrlich KC, Cary JW, Bhatnagar D, Cleveland TE, Payne GA, Linz JE, Woloshuk CP, Bennett JW: Clustered pathway genes in aflatoxin biosynthesis. Appl Environ Microbiol. 2004, 70 (3): 1253-1262. 10.1128/AEM.70.3.1253-1262.2004.
Yabe K, Nakajima H: Enzyme reactions and genes in aflatoxin biosynthesis. Appl Microbiol Biotechnol. 2004, 64 (6): 745-755. 10.1007/s00253-004-1566-x.
Chang P-K, Cary JW, Bhatnagar D, Cleveland TE, Bennett JW, Linz JE, Woloshuk CP, Payne GA: Cloning of the Aspergillus parasiticus apa-2 gene associated with the regulation of aflatoxin biosynthesis. Appl Environ Microbiol. 1993, 59 (10): 3273-3279.
Payne GA, Nystrom GJ, Bhatnagar D, Cleveland TE, Woloshuk CP: Cloning of the afl-2 gene involved in aflatoxin biosynthesis from Aspergillus flavus. Appl Environ Microbiol. 1993, 59 (1): 156-162.
Yu JH, Butchko RA, Fernandes M, Keller NP, Leonard TJ, Adams TH: Conservation of structure and function of the aflatoxin regulatory gene aflR from Aspergillus nidulans and A. flavus. Curr Genet. 1996, 29 (6): 549-555.
Brown DW, Yu JH, Kelkar HS, Fernandes M, Nesbitt TC, Keller NP, Adams TH, Leonard TJ: Twenty-five coregulated transcripts define a sterigmatocystin gene cluster in Aspergillus nidulans. Proc Natl Acad Sci U S A. 1996, 93 (4): 1418-1422. 10.1073/pnas.93.4.1418.
Cary JW, Klich MA, Beltz SB: Characterization of aflatoxin-producing fungi outside of Aspergillus section Flavi. Mycologia. 2005, 97 (2): 425-432.
Galagan JE, Calvo SE, Cuomo C, Ma LJ, Wortman JR, Batzoglou S, Lee SI, Basturkmen M, Spevak CC, Clutterbuck J, Kapitonov V, Jurka J, Scazzocchio C, Farman M, Butler J, Purcell S, Harris S, Braus GH, Draht O, Busch S, D'Enfert C, Bouchier C, Goldman GH, Bell-Pedersen D, Griffiths-Jones S, Doonan JH, Yu J, Vienken K, Pain A, Freitag M, Selker EU, Archer DB, Penalva MA, Oakley BR, Momany M, Tanaka T, Kumagai T, Asai K, Machida M, Nierman WC, Denning DW, Caddick M, Hynes M, Paoletti M, Fischer R, Miller B, Dyer P, Sachs MS, Osmani SA, Birren BW: Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae. Nature. 2005, 438 (7071): 1105-1115. 10.1038/nature04341.
Nierman WC, Pain A, Anderson MJ, Wortman JR, Kim HS, Arroyo J, Berriman M, Abe K, Archer DB, Bermejo C, Bennett J, Bowyer P, Chen D, Collins M, Coulsen R, Davies R, Dyer PS, Farman M, Fedorova N, Fedorova N, Feldblyum TV, Fischer R, Fosker N, Fraser A, Garcia JL, Garcia MJ, Goble A, Goldman GH, Gomi K, Griffith-Jones S, Gwilliam R, Haas B, Haas H, Harris D, Horiuchi H, Huang J, Humphray S, Jimenez J, Keller N, Khouri H, Kitamoto K, Kobayashi T, Konzack S, Kulkarni R, Kumagai T, Lafton A, Latge JP, Li W, Lord A, Lu C, Majoros WH, May GS, Miller BL, Mohamoud Y, Molina M, Monod M, Mouyna I, Mulligan S, Murphy L, O'Neil S, Paulsen I, Penalva MA, Pertea M, Price C, Pritchard BL, Quail MA, Rabbinowitsch E, Rawlins N, Rajandream MA, Reichard U, Renauld H, Robson GD, Rodriguez de Cordoba S, Rodriguez-Pena JM, Ronning CM, Rutter S, Salzberg SL, Sanchez M, Sanchez-Ferrero JC, Saunders D, Seeger K, Squares R, Squares S, Takeuchi M, Tekaia F, Turner G, Vazquez de Aldana CR, Weidman J, White O, Woodward J, Yu JH, Fraser C, Galagan JE, Asai K, Machida M, Hall N, Barrell B, Denning DW: Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus. Nature. 2005, 438 (7071): 1151-1156. 10.1038/nature04332.
Machida M, Asai K, Sano M, Tanaka T, Kumagai T, Terai G, Kusumoto K, Arima T, Akita O, Kashiwagi Y, Abe K, Gomi K, Horiuchi H, Kitamoto K, Kobayashi T, Takeuchi M, Denning DW, Galagan JE, Nierman WC, Yu J, Archer DB, Bennett JW, Bhatnagar D, Cleveland TE, Fedorova ND, Gotoh O, Horikawa H, Hosoyama A, Ichinomiya M, Igarashi R, Iwashita K, Juvvadi PR, Kato M, Kato Y, Kin T, Kokubun A, Maeda H, Maeyama N, Maruyama J, Nagasaki H, Nakajima T, Oda K, Okada K, Paulsen I, Sakamoto K, Sawano T, Takahashi M, Takase K, Terabayashi Y, Wortman JR, Yamada O, Yamagata Y, Anazawa H, Hata Y, Koide Y, Komori T, Koyama Y, Minetoki T, Suharnan S, Tanaka A, Isono K, Kuhara S, Ogasawara N, Kikuchi H: Genome sequencing and analysis of Aspergillus oryzae. Nature. 2005, 438 (7071): 1157-1161. 10.1038/nature04300.
Aspergillus nidulans Database – Broad Institute. [http://www.broad.mit.edu/annotation/genome/aspergillus_nidulans/Home.html]
Aspergillus niger v1.0. [http://genome.jgi-psf.org/Aspni1/Aspni1.home.html]
DOGAN - Database of the Genomes Analyzed at NITE (National Institute of Technology and Evaluation). [http://www.bio.nite.go.jp/dogan/MicroTop?GENOME_ID=ao]
Aspergillus fumigatus Genome Project. [http://www.tigr.org/tdb/e2k1/afu1/]
Aspergillus terreus Database – Broad Institute. [http://www.broad.mit.edu/annotation/genome/aspergillus_terreus/Home.html]
Neosartorya fischeri Genome Project (TIGR). [http://www.tigr.org/tdb/e2k1/nfa1/intro.shtml]
Aspergillus clavatus Genome Project (TIGR). [http://www.tigr.org/tdb/e2k1/acla1/intro.shtml]
Aspergillus flavus Genome Database. [http://www.aspergillusflavus.org/]
Bhatnagar D, Ehrlich KC, Cleveland TE: Molecular genetic analysis and regulation of aflatoxin biosynthesis. Appl Microbiol Biotechnol. 2003, 61 (2): 83-93.
Yu J, Bhatnagar D, Ehrlich KC: Aflatoxin biosynthesis. Rev Iberoam Micol. 2002, 19 (4): 191-200.
Horn BW, Greene RL, Dorner JW: Inhibition of aflatoxin B1 production by Aspergillus parasiticus using nonaflatoxigenic strains: role of vegetative compatibility. Biological Control. 2000, 17: 147-154. 10.1006/bcon.1999.0798.
Calvo AM, Wilson RA, Bok JW, Keller NP: Relationship between secondary metabolism and fungal development. Microbiol Mol Biol Rev. 2002, 66 (3): 447-459. 10.1128/MMBR.66.3.447-459.2002.
Ehrlich KC, Yu J, Cotty PJ: Aflatoxin biosynthesis gene clusters and flanking regions. J Appl Microbiol. 2005, 99 (3): 518-527. 10.1111/j.1365-2672.2005.02637.x.
Carbone I, Jakobek JL, Ramirez-Prado JH, Horn BW: Recombination, balancing selection and adaptive evolution in the aflatoxin gene cluster of Aspergillus parasiticus. Mol Ecol. 2007
Conant GC, Wagner A: GenomeHistory: a software tool and its application to fully sequenced genomes. Nucleic Acids Res. 2002, 30 (15): 3378-3386. 10.1093/nar/gkf449.
Bradshaw RE, Bhatnagar D, Ganley RJ, Gillman CJ, Monahan BJ, Seconi JM: Dothistroma pini, a forest pathogen, contains homologs of aflatoxin biosynthetic pathway genes. Appl Environ Microbiol. 2002, 68 (6): 2885-2892. 10.1128/AEM.68.6.2885-2892.2002.
Bradshaw RE, Zhang S: Biosynthesis of dothistromin. Mycopathologia. 2006, 162 (3): 201-213. 10.1007/s11046-006-0054-5.
Chung KR, Daub ME, Kuchler K, Schuller C: The CRG1 gene required for resistance to the singlet oxygen-generating cercosporin toxin in Cercospora nicotianae encodes a putative fungal transcription factor. Biochem Biophys Res Commun. 2003, 302 (2): 302-310. 10.1016/S0006-291X(03)00171-2.
Walton JD: Horizontal gene transfer and the evolution of secondary metabolite gene clusters in fungi: an hypothesis. Fungal Genet Biol. 2000, 30 (3): 167-171. 10.1006/fgbi.2000.1224.
Andersson JO: Lateral gene transfer in eukaryotes. Cell Mol Life Sci. 2005, 62 (11): 1182-1197. 10.1007/s00018-005-4539-z.
Kroken S, Glass NL, Taylor JW, Yoder OC, Turgeon BG: Phylogenomic analysis of type I polyketide synthase genes in pathogenic and saprobic ascomycetes. Proc Natl Acad Sci U S A. 2003, 100 (26): 15670-15675. 10.1073/pnas.2532165100.
Kusumoto K, Nogata Y, Ohta H: Directed deletions in the aflatoxin biosynthesis gene homolog cluster of Aspergillus oryzae. Curr Genet. 2000, 37 (2): 104-111. 10.1007/s002940050016.
Chang P-K, Horn BW, Dorner JW: Sequence breakpoints in the aflatoxin biosynthesis gene cluster and flanking regions in nonaflatoxigenic Aspergillus flavus isolates. Fungal Genetics and Biology. 2005, 42 (11): 914-923. 10.1016/j.fgb.2005.07.004.
Wong S, Wolfe KH: Birth of a metabolic gene cluster in yeast by adaptive gene relocation. Nat Genet. 2005, 37 (7): 777-782. 10.1038/ng1584.
Cary JW, Dyer JM, Ehrlich KC, Wright MS, Liang SH, Linz JE: Molecular and functional characterization of a second copy of the aflatoxin regulatory gene, aflR-2, from Aspergillus parasiticus. Biochim Biophys Acta. 2002, 1576 (3): 316-323.
Price MS, Yu J, Nierman WC, Kim HS, Pritchard B, Jacobus CA, Bhatnagar D, Cleveland TE, Payne GA: The aflatoxin pathway regulator AflR induces gene transcription inside and outside of the aflatoxin biosynthetic cluster. FEMS Microbiol Lett. 2006, 255 (2): 275-279. 10.1111/j.1574-6968.2005.00084.x.
Chiou CH, Miller M, Wilson DL, Trail F, Linz JE: Chromosomal location plays a role in regulation of aflatoxin gene expression in Aspergillus parasiticus. Appl Environ Microbiol. 2002, 68 (1): 306-315. 10.1128/AEM.68.1.306-315.2002.
Yu JH, Keller N: Regulation of secondary metabolism in filamentous fungi. Annu Rev Phytopathol. 2005, 43: 437-458. 10.1146/annurev.phyto.43.040204.140214.
Duran RM, Cary JW, Calvo AM: Production of cyclopiazonic acid, aflatrem, and aflatoxin by Aspergillus flavus is regulated by veA, a gene necessary for sclerotial formation. Appl Microbiol Biotechnol. 2007, 73 (5): 1158-1168. 10.1007/s00253-006-0581-5.
Lawrence JG, Roth JR: Selfish operons: horizontal transfer may drive the evolution of gene clusters. Genetics. 1996, 143 (4): 1843-1860.
Hitchman TS, Schmidt EW, Trail F, Rarick MD, Linz JE, Townsend CA: Hexanoate synthase, a specialized type I fatty acid synthase in aflatoxin B1 biosynthesis. Bioorg Chem. 2001, 29 (5): 293-307. 10.1006/bioo.2001.1216.
Watanabe CM, Townsend CA: Initial characterization of a type I fatty acid synthase and polyketide synthase multienzyme complex NorS in the biosynthesis of aflatoxin B1. Chem Biol. 2002, 9 (9): 981-988. 10.1016/S1074-5521(02)00213-2.
Chang P-K: The Aspergillus parasiticus protein AFLJ interacts with the aflatoxin pathway-specific regulator AFLR. Mol Genet Genomics. 2003, 268 (6): 711-719.
Cary JW, Ehrlich KC, Bland JM, Montalbano BG: The aflatoxin biosynthesis cluster gene, aflX, encodes an oxidoreductase involved in conversion of versicolorin A to demethylsterigmatocystin. Appl Environ Microbiol. 2006, 72 (2): 1096-1101. 10.1128/AEM.72.2.1096-1101.2006.
Wolfe KH, Shields DC: Molecular evidence for an ancient duplication of the entire yeast genome. Nature. 1997, 387 (6634): 708-713. 10.1038/42711.
Tamura M, Kawahara K, Sugiyama J: Molecular phylogeny of Aspergillus and associated teleomorphs in the Trichocomaceae (Eurotiales). Integration of modern taxonomic methods for Penicillium and Aspergillus classification. Edited by: Samson RA, Pitt JI. 2000, Amsterdam, Harwood Academic; Marston, 510-
Ehrlich KC, Montalbano BG, Cotty PJ: Sequence comparison of aflR from different Aspergillus species provides evidence for variability in regulation of aflatoxin production. Fungal Genet Biol. 2003, 38 (1): 63-74. 10.1016/S1087-1845(02)00509-1.
Peterson SW, Ito Y, Horn BW, Goto T: Aspergillus bombycis, a new aflatoxigenic species and genetic variation in its sibling species, A. nomius. Mycologia. 2001, 93: 689-703. 10.2307/3761823.
The R Project for Statistical Computing. [http://www.R-project.org]
Franzblau AN: A primer of statistics for non-statisticians. 1958, New York, Harcourt, Brace, 150 p.
Venables WN, Ripley BD: Modern applied statistics with S. Statistics and computing. 4th edition. 2002, New York, Springer, xi, 495 p.
Kurtzman CP, Horn BW, Hesseltine CW: Aspergillus nomius, a new aflatoxin-producing species related to Aspergillus flavus and Aspergillus tamarii. Antonie Van Leeuwenhoek. 1987, 53 (3): 147-158. 10.1007/BF00393843.
Ito Y, Peterson SW, Wicklow DT, Goto T: Aspergillus pseudotamarii, a new aflatoxin producing species in Aspergillus section Flavi. Mycological Research. 2001, 105 (2): 233-239. 10.1017/S0953756200003385.
Galagan JE, Henn MR, Ma LJ, Cuomo CA, Birren B: Genomics of the fungal kingdom: insights into eukaryotic biology. Genome Res. 2005, 15 (12): 1620-1631. 10.1101/gr.3767105.
Chang P-K, Yu J: Characterization of a partial duplication of the aflatoxin gene cluster in Aspergillus parasiticus ATCC 56775. Appl Microbiol Biotechnol. 2002, 58 (5): 632-636. 10.1007/s00253-002-0935-6.
We thank Doug Brown (Center for Integrated Fungal Research) for bioinformatics support, Dr. Elie Hajj Moussa (Lebanese University) for preliminary insights on macro-scale patterns, and David Aylor (Bioinformatics Research Center, NC State University) for help in developing the correlation tests and linear models in R. This work was supported in part by the University of North Carolina General Administration under an award for High Performance Computing (HPC) and Computational Sciences. Estimates of Ka/Ks using GenomeHistory were performed on HPC resources provided by the NC State Information Technology Division with support from the Office of the Provost and Office of Research and Graduate Studies. This work was funded by the North Carolina Cooperative State Research, Education, and Extension Service, grant numbers 2004-35400-14429, 2005-34500-15893, 2006-35604-16666, and by the National Research Initiative of the USDA Cooperative State Research, Education and Extension Service, grant number 2005-35319-16126 to I. C.
IC and JHRP conceived the study and contributed equally to the acquisition, statistical analysis and interpretation of data. JLJ and BWH were involved in drafting the manuscript and revising it critically for important intellectual content. All authors read and approved the final manuscript.
Ignazio Carbone and Jorge H Ramirez-Prado contributed equally to this work.
Carbone, I., Ramirez-Prado, J.H., Jakobek, J.L. et al. Gene duplication, modularity and adaptation in the evolution of the aflatoxin gene cluster. BMC Evol Biol 7, 111 (2007) doi:10.1186/1471-2148-7-111
- Gene Module
- Most Recent Common Ancestor
- Partial Cluster
- Secondary Metabolite Cluster