- Research article
- Open Access
A multi gene sequence-based phylogeny of the Musaceae (banana) family
© Christelová et al; licensee BioMed Central Ltd. 2011
- Received: 6 October 2010
- Accepted: 16 April 2011
- Published: 16 April 2011
The classification of the Musaceae (banana) family species and their phylogenetic inter-relationships remain controversial, in part due to limited nucleotide information to complement the morphological and physiological characters. In this work the evolutionary relationships within the Musaceae family were studied using 13 species and DNA sequences obtained from a set of 19 unlinked nuclear genes.
The 19 gene sequences represented a sample of ~16 kb of genome sequence (~73% intronic). The sequence data were also used to obtain estimates for the divergence times of the Musaceae genera and Musa sections. Nucleotide variation within the sample confirmed the close relationship of Australimusa and Callimusa sections and showed that Eumusa and Rhodochlamys sections are not reciprocally monophyletic, which supports the previous claims for the merger between the two latter sections. Divergence time analysis supported the previous dating of the Musaceae crown age to the Cretaceous/Tertiary boundary (~ 69 Mya), and the evolution of Musa to ~50 Mya. The first estimates for the divergence times of the four Musa sections were also obtained.
The gene sequence-based phylogeny presented here provides a substantial insight into the course of speciation within the Musaceae. An understanding of the main phylogenetic relationships between banana species will help to fine-tune the taxonomy of Musaceae.
- Internal Transcribe Spacer
- Markov Chain Monte Carlo
- Maximum Parsimony Analysis
- Basic Chromosome Number
- Bayesian Inference Analysis
The global annual production of bananas and plantains (Musa spp.) amounts to > 120 Mt , making this species one of the world's most important fruit crops. As well as their prominence as a dessert fruit, they provide a vital source of carbohydrates to many inhabitants of the humid tropics. Musa production, like that of all crop species, is endangered by a range of pests and diseases, affecting both the yield and quality of the fruit. While the large-scale commercial plantations can secure production by frequent applications of fungicide and pesticide, this form of crop management is increasingly recognized as environmentally irresponsible. Meanwhile, smallholders, who together account for at least 85% of world production, can seldom afford the expense of chemical control, and their crop remains vulnerable to diseases and pests. Improvement of cultivated banana via breeding is hampered by the absence of sexual reproduction and narrow genetic basis. As a result, attention has turned to non-cultivated wild relatives as sources of new genes for banana improvement. This, underlines a renewed interest to analyze and conserve genetic diversity within Musa spp., which in turn has raised a number of questions related to their taxonomy.
The banana family (Musaceae) has been assigned to the order Zingiberales in the clade commelinids in the monocots  and has been conventionally divided into the three genera Musa, Ensete and Musella. The genus Musa is characterized by a set of morphological descriptors, and has a basic chromosome number (x) of 9, 10 or 11. The genus has been sub-divided into the four sections Eumusa (x = 11; comprising most of the cultivated species), Rhodochlamys (x = 11), Australimusa (x = 10) and Callimusa (x = 9, 10) [3, 4]. More recently, Argent  added a fifth section, Ingentimusa (x = 7), containing just a single species M. ingens. However, since this one species (x = 7) grows within the Australimusa region (New Guinea), its section-status is not evident when compared to M. beccarii (x = 9), which grows in the Callimusa region (Borneo) and remains classified as a Callimusa.
With the application of DNA-based tools, this conventionally-based taxonomy has become increasingly difficult to justify. Thus, based on RFLP genotyping, Gawel et al.  proposed a merger between Eumusa and Rhodochlamys, a suggestion consistent with nuclear genome sizes and the distribution of rDNA loci , as well as with the phylogenetic analysis based on the ITS and organellar DNA . Jarret and Gawel  further proposed combining Australimusa and Callimusa into a single section, a suggestion supported by AFLP genotypes acquired by Wong et al. . However, the results of AFLP genotyping led Ude et al.  to argue that the conventional taxonomy of Musa was in fact tenable.
The ease of DNA sequencing has revolutionized phylogenetic methodology. The most frequent targets for this type of analysis have been extra-nuclear DNA i.e. chloroplast and mitochondrial genes [12–16] and the internal transcribed spacers (ITS) separating the tandem organized ribosomal genes in the 45S rDNA locus [17–19]. The prevalently uniparental mode of inheritance of the chloroplast and mitochondrion limits to some extent the usefulness of extra-nuclear sequences, and moreover, it has been established that this DNA tends to evolve more slowly than do the nuclear genes, which presents difficulties in employing it for phylogenetic purposes . Concerted evolution , a bias due to analyzing a single locus and hidden paralogy all militate against relying solely on ITS variation for molecular systematics and evolutionary analysis [22, 23].
Single and low copy nuclear gene sequences are thought to provide a higher level of discrimination than either extra-nuclear genes or ribosomal spacers [24–26]. The lower frequency of informative sites within these sequences can, however, prevent their use for the resolution of phylogeny both at lower taxonomic levels and among rapidly diversifying lineages. The greater resolving power of low copy nuclear sequence has been recently demonstrated in rice . Low copy nuclear genes also suffer less homoplasy than does ITS  and are seldom subjected to concerted evolution. Intronic sequence is particularly useful, since the level of selection pressure on its non-coding DNA is relaxed . The major drawback to the use of low copy sequence is the need to distinguish between paralogs and orthologs. As yet in the Musaceae family, however, all published sequence-based phylogenetic studies have targeted extra-nuclear and/or ribosomal DNA sequence.
The phylogeny of the Musaceae remains controversial. Typing via organellar and ribosomal DNA has been employed by Boonruangrod et al. [29, 30]. Li et al.  and Liu et al.  applied sequence analysis of ribosomal ITS coupled with the chloroplast gene evidence. More generally, evolutionary relationships within the monocotyledonous species [32–34] and in the Zingiberales in particular [35, 36], have produced date estimates for the divergence of the Musaceae (61-110 Mya) and the genus Musa (51 Mya). Based on a study of genome duplication, Paterson et al.  suggested that the divergence of Musa occurred 142 Mya, although this estimate was conceded to require further sequence information before it could be accepted. Clearly, a more robust picture of banana phylogeny and divergence time requires a systematic sampling of gene sequences distributed throughout the genome. Thus, we set out to clarify main frame of evolutionary relationships within the Musaceae, and to date the divergence of particular Musa sections, using a set of single or low copy nuclear gene sequences.
A priori taxonomic status of the panel of 13 Musaceae entries
Centre (ITC) code
acuminata [Colla] *
acuminata [Colla] *
zebrina [Van Houtte ex. Planch.]
balbisiana [Colla] #
Pisang Klutuk Wulung
balbisiana [Colla] #
Musa maclayi Hung Si
Target gene selection and primer design
Identity and sequence details of the set of 19 genes targeted for phylogenetic purposes
Candidate gene designation
Original MusaEST (NCBI accession number) a
Corresponding O. sativachromosome
Homologous region on the O. sativachromosome (bp) b
Amplified successfully from the outgroup species (S. nicolai)
Intron fraction in the final alignment (%)
Aligned sequence length (bp) c
Mean GC content (%) c
11000940 - 11002303
Stomatal cytokinesis defective protein
22165522 - 22168085
Electron transport protein SCO1/SenC family protein
3229271 - 3231294
Putative non-phototropic hypocotyl 3 (NPH3)
21628715 - 21631333
Endoribonuclease dicer homolog
1177218 - 1178862
28468384 - 28470215
22349032 - 22350073
2717353 - 2718249
Protein of unknown function; DUF89 family protein
12551039 - 12553284
T-complex protein 1, eta subunit (TCP-1-eta)
28301038 - 28302942
NAD+ synthase domain containing protein
3637535 - 3641458
Ribosomal protein s6 RPS6-2
25669518 - 25671532
mRNA capping enzyme, large subunit family protein
4670797 - 4673706
Methylcrotonyl-CoA carboxylase beta chain
20354213 - 20356265
13652033 - 13655072
Succinoaminoimidazole-carboximide ribonucleotide synthetase family protein
17688901 - 17690929
Methionine aminopeptidase 1
18996256 - 18997164
Initiation factor 2B family protein
12576657 - 12577797
DNA polymerase delta catalytic subunit
4368226 - 4369480
Gene fragment amplification, cloning and seqeuncing
A standard amplification protocol was applied to each of the 19 primer pairs. Each reaction contained 40 ng template, with the PCR program composed of an initial denaturation step (94°C/5 min), followed by 35 cycles of 94°C/30 s, 57°C/30 s and 72°C/35 s, and ending with an extension step of 72°C/10 min. Amplicons were treated with exonuclease/alkaline phosphatase (ExoSAP-IT®, USB, Cleveland, OH, USA) and then either sequenced directly, or first cloned into the TOPO vector (Invitrogen, Carlsbad, USA) before sequencing. Cycle sequencing was performed on three independent amplicons per gene target, using a BigDye® Terminator v3.1 Cycle Sequencing kit (Applied Biosystems, Foster City, USA), following the manufacturer's instructions. Sequencing reaction products were purified using a CleanSEQ kit (Agencourt Bioscience Corp., Beckman Coulter, Beverly, USA), and then separated on an ABI 3730Xl DNA analyzer (Applied Biosystems). All the resulting sequences have been deposited within GenBank [GenBank: HM118565-HM118820]. Raw sequence data were assembled and edited using DNA Baser v2 software . Consensus sequences were aligned by ClustalW  using default parameters, as implemented in the MEGA4 software package . Multiple DNA sequence alignments were inspected and any ambiguously aligned segments were removed prior to phylogenetic analysis.
Evolutionary models as selected by MrModeltest v2.3 software for each of the individual gene fragments and for the combined datasets using AIC criteria
data set A
data set B
Systematic bias and congruence testing
The incongruence length difference (ILD) test  (implemented in PAUP* v4.0b as the partition homogeneity test) was applied to estimate the level of potential incongruence in the data. The data set was partitioned into individual genes and analyzed under heuristic search with 1000 replicates. A χ2 test for base composition homogeneity across taxa was conducted in TREE-PUZZLE v5.2  software. The level of nucleotide substitution saturation was evaluated in DAMBE  software by plotting transitions and transversions against pairwise genetic distance. ML mapping using the quartet puzzling method  was applied to investigate whether the phylogenetic information content of the data was sufficient for inference purposes. ML mapping was also performed within TREE-PUZZLE v5.2 software with all possible quartets, applying the corresponding evolutionary model and exact model parameter estimation settings.
Dating of nodes
BEAST software v1.4.8 software was used to estimate the divergence times for the major Musaceae clades. This approach has the advantage of simultaneous estimation of substitution model parameters, topology, branch lengths and fossil-based date calibration, using the Bayesian inference and MCMC method. Calibration was based on the carbon dating of Ensete oregonense fossil seeds, given as 43 Mya according to Manchester and Kress . The analysis was conducted over four independent MCMC runs, each consisting of 1,000,000 generations under the relaxed clock model, with an uncorrelated lognormal distribution. The fossil calibration was set as the most recent common ancestor (tMRCA) parametric tree prior. The results were retrieved after combining the individual MCMC runs' tree files and the maximum clade credibility tree was constructed after the initial 25% burn-in generations were discarded.
Taxon and gene sampling
The amount of available sequence information for Musa species is confined at present and hence the development of low-copy gene markers for phylogenetic studies in this species has been laborious and time consuming. Despite this, we were able to develop 19 markers from gene regions. Only single- or low copy genes were selected with expected random distribution in the genome of Musa to make sure that unlinked loci are compared. As the genome sequence of Musa is not yet available, the selection of random distributed loci assumed colinearity with the rice genome [52, 53].
The 19 gene-based markers [GenBank: HM118565-HM118820] developed and used in the present study represent until now by far the largest set of gene markers ever used in the Musaceae. Ideally, a phylogenetic study should comprise all taxa and a high number of unlinked DNA markers. However, from practical reasons these numbers are reduced and, in fact, may not be necessary. While some authors argue that incomplete taxon sampling has a negative impact on the phylogenetic accuracy [54, 55], other authors do not support this view and prefer increasing the number of nucleotide characters sampled over the number of taxa in order to reveal the correct phylogeny without a major distortion of accuracy of the main evolution relationships [56–58]. Here, we favored the latter approach with partial taxon sampling of representatives [stratified sampling; ], rather than analyzing a few genomic loci on a large set of species. However, if felt necessary, the marker set developed in this work can be easily applied in other species and subspecies of Musaceae.
Sequence data characterization and systematic bias testing
The 19 gene fragments covered a length of 16,012 bp, of which 26.9% was exonic. The genic sequences were treated independently as a single-gene data and in two matrixed-modes according to the ability to amplify the genes from the outgroup species S. nicolai (see Table 2 for details); namely the dataset A (containing all 19 gene sequences from 13 genotypes, excl. S. nicolai) and the dataset B (containing sequences of 9 genes from all 14 genotypes, incl. the outgroup species S. nicolai). Dataset A (all 19 fragments from the 13 Musaceae entries) was based on 16,012 bp of sequence, of which 1,056 bases were informative, while dataset B (nine gene fragments from the Musaceae entries plus S. nicolai) was based on 7,404 bp of sequence, which included 492 informative sites. The χ2 test used to detect heterogeneity in base composition indicates that there was no significant variation in the AT/GC content among species for individual genes (P = 0.382-1.000). The overall reduced proportion of GC in most of the sequences (see Table 2) may be an artifact of the deliberate maximization of intronic sequence in the sample, since plant intronic sequence has an AT bias . The GC content of the intronic fraction was 34.6%, compared to 45.0% in the exonic fraction.
The constancy of the evolutionary rate was verified using a relative rate test, which revealed some heterogeneity in the sequences (data not shown). However, after a re-analysis based on RY-coded (purines/pyrimidines) sequence, which ignores transitions by focusing on the slower evolving transversions , the topologies generated were similar to those obtained from the full nucleotide sequence data. This implied that the rate heterogeneity was not large enough to significantly bias the deduced phylogenies.
Phylogenetic reconstruction based on individual gene fragments
Results of the likelihood-mapping based on the quartet puzzling algorithm
data set A
data set B
Based on the ILD analysis, the individual gene fragment partitions were highly incongruent (P < 0.001) and thus not directly combinable. However, it has been suggested that the ILD test should not be used as an exclusive measure of data partition combinability , as it is known to be susceptible to both types I [false positives; ] and II [false negatives; ] error. When Rokas et al.  combined sequence data derived from a set of different genes, conflicting signals from individual gene sequences were resolved and the resulting phylogeny was strongly supported. The joint use of a set of gene sequences for phylogenetic inference depends largely on nucleotide composition bias and substitution saturation . Since the χ2 test applied to the Musaceae sequence data indicated the absence of any base composition bias, and substitution saturation of the aligned sequences could be excluded (Figure 1), the combined set of gene fragment sequences was then used for phylogenetic reconstruction.
Phylogenetic reconstruction based on the combined sequence data
MP analysis of dataset A yielded a single fully resolved most parsimonious tree (length = 2333; CI = 0.8678 excluding non-informative characters; RI = 0.9337; RC = 0.8648) with significantly high level of bootstrap support for each of the individual branches (Figure 2). The internal branches among the M. acuminata accessions and the Rhodochlamys species, as well as within the Australimusa/Callimusa clade were dichotomous. The ML analysis supported an identical tree topology with high bootstrap support values. Although the BI analysis also produced a fully resolved tree with a high posterior probability for all nodes (Additional File 3), the monophyly of Ensete and Musella at the genus level was not supported. Due to the lack of an outgroup for dataset A, E. ventricosum was used as a surrogate, a choice which probably accounted for the MP and ML-based phylogenies. The fact that these phylogenies were likely artefactual was confirmed by the use of the midpoint rooting method, which generated the same topology as emerged from the BI analysis and from dataset B (see below).
In order to assess how much phylogenetic information was contributed by the coding and non-coding fractions, the exonic and intronic sequences were analyzed separately. This was possible given that substitution saturation was not reached in either partition (Figure 1). As expected, the intronic sequence outnumbered the exonic, both in terms of the frequency of variable bases (15.2% vs 7.1%) and of parsimony informativeness (7.9% vs 3.3%). The phylogenies reconstructed by ML, MP and BI analysis consisted of a single tree with strong statistical branch support. The trees' topology was identical to that of combined dataset. Thus, the inclusion of non-coding sequence did not introduce erroneous phylogenetic signals, but rather enhanced the robustness of the phylogenetic reconstruction.
Taxonomic implications of the sequence-based phylogeny
The final topology (Figure 3) confirmed the Musaceae family in general, and the Musa genus in particular, to be monophyletic. The monotypic genus Musella appeared as a sister species to the E. ventricosum. The validity of Musella as a genus has been questioned in previous studies and a merger between Musella and Ensete species has been suggested . On the contrary, the recent study of Li et al.  based on ITS and chloroplast loci did not come to a similar definite conclusion and underlined a need for sampling more molecular markers in order to provide the answer. Although more representatives of both of the genera would be necessary to elucidate this issue, the large set of phylogenetic markers presented here provides an excellent tool for addressing this question in future studies.
For many years, Musa has been divided into four sections, on the basis of morphological descriptors and basic chromosome number . However, it is important to quote Cheesman's flexible view: "The groups have deliberately been called sections rather than subgenera in an attempt to avoid the implication that they are of equal rank. I am inclined to regard the division between Eumusa and Rhodochlamys as unessential, though it is convenient to maintain as long as it remains as well marked in the field as it is at present. On the other hand the seed of Callimusa almost justifies its segregation as a distinct genus, and would do so were not Australimusa intermediate in some characters between it and Eumusa" . Recently, several DNA sequence-based analyses have indeed questioned the validity of some of the four sections. In particular, Eumusa and Rhodochlamys representatives have been in some cases demonstrated to be more closely related to one another than to their sectional relatives, as was shown for some Australimusa and Callimusa species [6, 7, 9, 10].
The present data indicate a close relationship between the species of Rhodochlamys and M. acuminata (Eumusa). The position of M. ornata within the A-genome group of Eumusa section (Figure 3) agrees with the findings of other authors [7, 10, 31, 70], and indicates that Rhodochlamys and Eumusa are not reciprocally monophyletic. Various Eumusa × Rhodochlamys hybrids have been observed, and are likely to be numerous in the monsoon region of SE Asia . Although the current molecular data in relation to the morphological observation indicate that the claims for merging of Rhodochlamys and Eumusa [6, 8, 10] were justified, final resolution of this issue will require a better representation of species within both sections. The new set of phylogenetic markers developed in this study can be applied easily in future to analyze in detail phylogenetic relationships between and within Musaceae taxa.
In contrast to the clustering of M. balbisiana with M. textilis (section Australimusa), as reported by Liu et al. , the present data identified a clearly separated group of M. balbisiana entries within clade I, suggesting that this species is phylogenetically quite distinct from other Eumusa species. The distance between M. acuminata and M. balbisiana appears to be greater than between it and the Rhodochlamys species (Figure 3), as has also been noted by others [8, 11, 31]; these relationships are consistent with conclusions based on cytogenetic and hybridization studies [72, 73]. The clear separation between M. balbisiana and M. acuminata is particularly interesting given that almost all varieties of edible (polyploid) banana are thought to have evolved from natural hybrids between these two species .
Based on the gene fragment sequences, M. textilis fell, as expected, into the Australimusa section within Clade II (Figure 3), which also includes the Callimusa species. The two representatives of the section Callimusa included in this study differ in the basic chromosome number (Table 1), reflecting the noted controversy of Callimusa as a natural section [9, 10, 74]. M. beccarii and M. coccinea did not form a strictly separated Callimusa cluster; instead, their close relationship to Australimusa species was apparent (Figure 3). The only representative of Fe'i bananas (parthenocarpic edible types distributed throughout Pacific islands) in this study appears to be most closely related to M. maclayi, in line with Simmonds , who considered M. maclayi to be a wild progenitor of the Fe'i banana.
Estimation of time of divergence
Estimates of divergence time for species within the Musaceae family
to Figure 3)
HPD range a
Calibration point (Ensete oregonense fossil record)
M. coccinea -remaining
Eumusa B genome-remaining
M. manni (Rhodochlamys)-remaining
Eumusa A genome
M. beccarii (Callimusa)-remaining
Fe'i - M. maclayi
Despite the fact that the estimated age of the Musaceae family (69 Mya) is much younger than the 110 Mya postulated by Kress and Specht , the two estimates for the age of the Musa genus (50.7 Mya and 51.4 Mya) are indistinguishable. As the Musaceae are over-represented in our sample (as compared to other Zingiberales families), the current estimate probably represents a minimal age for the radiation of the Musaceae. The present data can be used to date the speciation events within both Australimusa/Callimusa and Rhodochlamys/Eumusa to some 28 Mya (Figure 3, nodes C, D). Within the Clade I, the B genome lineage (M. balbisiana species) was the first to diverge, followed by the M. mannii lineage, representing the Rhodochlamys section, at 20 Mya. Speciation within the A genome lineage (M. acuminata species) began 11.4 Mya. The minimum age of M. ornata, which appears to belong to the A genome group within Eumusa section, is estimated to be 8.7 Mya (Figure 3; node I).
Although M. mannii is an "imperfectly understood small species up to 1.3 m high with purplish-red bracts that do not curl back" , it undoubtedly belongs to section Rhodochlamys, which is confined to the monsoon-affected areas of Southeast Asia. Its characteristic dry-season die-back is presumably an adaptation to drought, and contrasts with the behavior of the Eumusa species endemic to the same geographical region, which survive the dry season, although often in poor condition . The monsoon regime was established following the formation of the Himalayas and the Tibetan plateau, and is thought to have stabilized in its current form around 20-25 Mya . The estimated divergence date of M. mannii (20 Mya, Table 5) could therefore reflect an adaptation to climate change. The later divergence time of the other Rhodochlamys member, M. ornata, could be explained by its probable derivation from a hybrid between M. velutina (section Rhodochlamys) and M. flaviflora, belonging to a taxon intermediate between Rhodochlamys and Eumusa .
The speciation of the Callimusa species can be dated between 8.8 and 28.7 Mya, while the divergence of the Australimusa species occurred ~5 Mya (Figure 3, nodes H, J). The relatively recent emergence of the section Australimusa is consistent with its perception as an evolutionarily rather young group . Shepherd  determined that the "species" within this section behave genetically as a single species, which he therefore designated Musa textilis Née. The current phylogeny (Figure 3) supports this view, implying that M. textilis could well be the founding species of the entire section. Numerical taxonomy has placed M. textilis equidistant from the four Musa sections . In this context it is worth noting that robust and sterile diploid hybrids ('Canton') between M. textilis (x = 10) and 'Pacol' (a form of M. balbisiana, x = 11) are common in The Philippines.
The divergence of M. coccinea appears to be rather older than that of the members of the Australimusa section (Table 5). Unsuccessful attempts to cross two Callimusa species M. coccinea and M. borneensis led Shepherd  to suggest that they differentiated from one another long before the evolution of the Australimusa species. The seed morphology of Callimusa species is very different from that of any of the other Musa sections, being cylindrical, barrel- or top-shaped, and marked externally by a transverse line or groove. When ripe, they develop a large, empty chalazal (perisperm) chamber above the groove [10, 77]. Although the molecular data alone indicate the paraphyletic position Callimusa to Australimusa entries (Figure 3), given the above mentioned morphological aspects and the flexibility of the term "section" by Cheesman  we believe that merging the two Musa sections with x = 10, as proposed by Wong et al.  and indicated by Li et al. , is not tenable.
The gene sequence-based phylogeny presented here provides a substantial insight into the course of speciation within the Musaceae. The data tend to sustain the close relationship of Rhodochlamys and Eumusa species, supporting the possibility of merging the two sections into a single one. A greater number of species sampled could generate an improved classification, and could help in clarifying the relationship between the species Rhodochlamys and M. acuminata, as well as to confirm the generic status of Musella and Ensete. Based on the largest amount of nucleotide characters for Musaceae obtained to date, this study provides the first estimates of divergence times for individual Musa sections and genome groups within the Musaceae. Although limited by the number of species sampled from individual sections and subgroups, we provide a plausible reconstruction of speciation events within the Musaceae, a family which has given rise to one of mankind's major crops.
We thank our colleagues Marie Seifertová, MSc. and Ms. Radka Tušková for their excellent technical assistance. We are grateful to Ir. Ines Van den Houwe for providing much of the plant material, François Côte for the gift of in vitro rooted plants of M. balbisiana 'PKW' and Dr. Martin Dančák for supplying leaves of S. nicolai. This research was jointly supported by the Academy of Sciences of the Czech Republic (grant award IAA600380703), Internal Grant Agency of Palacký University (grant award no. Prf-2010-001) and by the Ministry of Education, Youth and Sports of the Czech Republic and the European Regional Development Fund (Operational Programme Research and Development for Innovations No. CZ.1.05/2.1.00/01.0007).
- FAOStat. [http://faostat.fao.org/default.aspx]
- Angiosperm Phylogeny Group: An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Bot J Linn Soc. 2009, 161: 105-121. 10.1111/j.1095-8339.2009.00996.x.View ArticleGoogle Scholar
- Cheesman EE: Classification of the bananas II. The genus Musa L. Kew Bulletin. 1947, 2: 106-117. 10.2307/4109207.View ArticleGoogle Scholar
- Simmonds NW, Shepherd K: The taxonomy and origins of cultivated bananas. Bot J Linn Soc. 1955, 55: 302-312. 10.1111/j.1095-8339.1955.tb00015.x.View ArticleGoogle Scholar
- Argent GCG: The wild bananas of Papua New Guinea. Notes R Bot Gard Edinburgh. 1976, 35: 77-114.Google Scholar
- Gawel NJ, Jarret RL, Whittemore AP: Restriction fragment length polymorphism (RFLP)-based phylogenetic analysis of Musa. Theor Appl Genet. 1992, 84: 286-290. 10.1007/BF00229484.PubMedGoogle Scholar
- Bartoš J, Alkhimova O, Doleželová M, De Langhe E, Doležel J: Nuclear genome size and genomic distribution of ribosomal DNA in Musa and Ensete (Musaceae): taxonomic implications. Cytogenet Genome Res. 2005, 109: 50-57.View ArticlePubMedGoogle Scholar
- Li LF, Häkkinen M, Yuan YM, Hao G, Ge XJ: Molecular phylogeny and systematics of the banana family (Musaceae) inferred from multiple nuclear and chloroplast DNA fragments, with a special reference to the genus Musa. Mol Phylogenet Evol. 2010, 57: 1-10. 10.1016/j.ympev.2010.06.021.View ArticlePubMedGoogle Scholar
- Jarret RL, Gawel NJ: Molecular markers, genetic diversity and systematics. Bananas and plantains. Edited by: Gowen S. 1995, London: Chapman and Hall, 67-83.Google Scholar
- Wong C, Kiew R, Argent G, Set O, Lee SK, Gan YY: Assessment of the validity of the sections in Musa (Musaceae) using AFLP. Ann Bot. 2002, 90: 231-238. 10.1093/aob/mcf170.View ArticlePubMedPubMed CentralGoogle Scholar
- Ude G, Pillay M, Nwakanma D, Tenkouano A: Analysis of genetic diversity and sectional relationships in Musa using AFLP markers. Theor Appl Genet. 2002, 104: 1239-1245. 10.1007/s00122-001-0802-3.View ArticlePubMedGoogle Scholar
- Olmstead RG, Palmer JD: A chloroplast DNA phylogeny of the Solanaceae: Subfamilial relationships and character evolution. Ann Missouri Bot Gard. 1992, 79: 346-360. 10.2307/2399773.View ArticleGoogle Scholar
- Doyle JJ, Doyle JL, Ballenger JA, Dickson EE, Kajita T, Ohashi H: A phylogeny of the chloroplast gene rbcL in the Leguminosae: Taxonomic correlations and insights into the evolution of nodulation. Am J Bot. 1997, 84: 541-554. 10.2307/2446030.View ArticlePubMedGoogle Scholar
- Beckert S, Steinhauser S, Muhle H, Knoop V: A molecular phylogeny of bryophytes based on nucleotide sequence of the mitochondrial nad5 gene. Plant Sys Evol. 1999, 218: 179-192. 10.1007/BF01089226.View ArticleGoogle Scholar
- Graham SW, Olmstead RG: Utility of 17 chloroplast genes for inferring the phylogeny of the basal angiosperms. Am J Bot. 2000, 87: 1712-1730. 10.2307/2656749.View ArticlePubMedGoogle Scholar
- Swangpol S, Volkaert H, Sotto RC, Seelanan T: Utility of selected non-coding chloroplast DNA sequences for lineage assessment of Musa interspecific hybrids. J Biochem Mol Biol. 2007, 40: 577-587.View ArticlePubMedGoogle Scholar
- Baldwin BG: Phylogenetic utility of the internal transcribed spacers of nuclear ribosomal DNA in plants: An example from the Compositae. Mol Phylogenet Evol. 1992, 1: 3-16. 10.1016/1055-7903(92)90030-K.View ArticlePubMedGoogle Scholar
- Compton JA, Culham A, Gibbings JG, Jury SL: Phylogeny of Actaea including Cimicifuga (Ranunculaceae) inferred from nrDNA ITS sequence variation. Biochem Syst Ecol. 1998, 26: 185-197. 10.1016/S0305-1978(97)00102-6.View ArticleGoogle Scholar
- Kress WJ, Prince LM, Williams KJ: The phylogeny and a new classification of the gingers (Zingiberaceae): Evidence from molecular data. Am J Bot. 2002, 89: 1682-1696. 10.3732/ajb.89.10.1682.View ArticlePubMedGoogle Scholar
- Eyre-Walker A, Gaut BS: Correlated rates of synonymous site evolution across plant genomes. Mol Biol Evol. 1997, 14: 455-460.View ArticlePubMedGoogle Scholar
- Dover G: Concerted evolution, molecular drive and natural selection. Curr Biol. 1994, 4: 1165-1166. 10.1016/S0960-9822(00)00265-7.View ArticlePubMedGoogle Scholar
- Álvarez I, Wendel JF: Ribosomal ITS sequences and plant phylogenetic inference. Mol Phylogenet Evol. 2003, 29: 417-434.View ArticlePubMedGoogle Scholar
- Feliner GN, Roselló JA: Better the devil we know? Guidelines for insightful utilization of nrDNA ITS species-level evolutionary studies in plants. Mol Phylogenet Evol. 2007, 44: 911-919. 10.1016/j.ympev.2007.01.013.View ArticleGoogle Scholar
- Small RL, Ryburn JA, Cronn RC, Seelanan T, Wendel JF: The tortoise and the hare: Choosing between noncoding plastome and nuclear ADH sequences for phylogeny reconstruction in a recently diverged plant group. Am J Bot. 1998, 85: 1301-1315. 10.2307/2446640.View ArticlePubMedGoogle Scholar
- Bailey CD, Doyle JJ: Potential phylogenetic utility of the low-copy nuclear gene pistillata in dicotyledonous plants: Comparison to nrDNA ITS and trnL intron in Sphaerocardamum and other Brassicaceae. Mol Phylogenet Evol. 1999, 13: 20-30. 10.1006/mpev.1999.0627.View ArticlePubMedGoogle Scholar
- Schulte K, Barfuss MHJ, Zizka G: Phylogeny of Bromelioideae (Bromeliaceae) inferred from nuclear and plastid DNA loci reveals the evolution of the tank habit within the subfamily. Mol Phylogenet Evol. 2009, 51: 327-339. 10.1016/j.ympev.2009.02.003.View ArticlePubMedGoogle Scholar
- Zou XH, Zhang FM, Zhang JG, Zang LL, Tang L, Wang J, Sang T, Ge S: Analysis of 142 genes resolves the rapid diversification of the rice genus. Genome Biol. 2008, 9: R49-10.1186/gb-2008-9-3-r49.View ArticlePubMedPubMed CentralGoogle Scholar
- Whittall JB, Medina-Marino A, Zimmer EA, Hodges SA: Generating single-copy nuclear gene data for a recent adaptive radiation. Mol Phylogenet Evol. 2006, 39: 124-134. 10.1016/j.ympev.2005.10.010.View ArticlePubMedGoogle Scholar
- Boonruangrod R, Desai D, Fluch S, Berenyi M, Burg K: Identification of cytoplasmic ancestor gene-pools of Musa acuminata Colla and Musa balbisiana Colla and their hybrids by chloroplast and mitochondrial haplotyping. Theor Appl Genet. 2008, 118: 43-55. 10.1007/s00122-008-0875-3.View ArticlePubMedGoogle Scholar
- Boonruangrod R, Fluch S, Burg K: Elucidation of origin of the present day hybrid banana cultivars using the 5'ETS rDNA sequence information. Mol Breeding. 2009, 24: 77-91. 10.1007/s11032-009-9273-z.View ArticleGoogle Scholar
- Liu AZ, Kress WJ, Li DZ: Phylogenetic analyses of the banana family (Musaceae) based on nuclear ribosomal (ITS) and chloroplast (trnL-F) evidence. Taxon. 2010, 59: 20-28.Google Scholar
- Bremer K: Early Cretaceous lineages of monocot flowering plants. Proc Natl Acad Sci USA. 2000, 97: 4707-4711. 10.1073/pnas.080421597.View ArticlePubMedPubMed CentralGoogle Scholar
- Janssen T, Bremer K: The age of major monocot groups inferred from 800+ rbcL sequences. Bot J Linn Soc. 2004, 146: 385-398. 10.1111/j.1095-8339.2004.00345.x.View ArticleGoogle Scholar
- Anderson CL, Janssen T: Monocots. The timetree of life. Edited by: Hedges SB, Kumar S. 2009, New York: Oxford University Press, 203-212.Google Scholar
- Kress WJ, Prince LM, Hahn WJ, Zimmer E: Unraveling the evolutionary radiation of the families of the Zingiberales using morphological and molecular evidence. Syst Biol. 2001, 50: 926-944. 10.1080/106351501753462885.View ArticlePubMedGoogle Scholar
- Kress WJ, Specht CD: The evolutionary and biogeographic origin and diversification of the tropical monocot order Zingiberales. Aliso. 2006, 22: 619-630.Google Scholar
- Paterson AH, Bowers JE, Chapman BA: Ancient polyploidization predating the divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci USA. 2004, 101: 9903-9908. 10.1073/pnas.0307901101.View ArticlePubMedPubMed CentralGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.View ArticlePubMedGoogle Scholar
- Lessa EP: Rapid surveying of DNA sequence variation in natural populations. Mol Biol Evol. 1992, 9: 323-330.PubMedGoogle Scholar
- DNA Baser sequence assembly software. [http://www.dnabaser.com/]
- Thompson JD, Higgins DG, Gibson TJ: Clustal W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.View ArticlePubMedPubMed CentralGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.View ArticlePubMedGoogle Scholar
- Swofford DL: PAUP*, Phylogenetic Analysis Using Parsimony (*and Other Methods) v4.0b10. 2003, Sunderland: Sinauer AssociatesGoogle Scholar
- Nylander JAA: MrModeltest v2. Program distributed by the author. 2004, Uppsala: Evolutionary Biology CentreGoogle Scholar
- Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.View ArticlePubMedPubMed CentralGoogle Scholar
- Molecular evolution, phylogenetics and epidemiology software. [http://tree.bio.ed.ac.uk/software/figtree/]
- Farris JS, Källersjö M, Kluge AG, Bult C: Testing significance of incongruence. Cladistics. 1994, 10: 315-319. 10.1111/j.1096-0031.1994.tb00181.x.View ArticleGoogle Scholar
- Schmidt HA, Strimmer K, Vingron M, von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002, 18: 502-504. 10.1093/bioinformatics/18.3.502.View ArticlePubMedGoogle Scholar
- Xia X, Xie Z: DAMBE: Software package for data analysis in molecular biology and evolution. J Hered. 2001, 92: 371-373. 10.1093/jhered/92.4.371.View ArticlePubMedGoogle Scholar
- Strimmer K, Von Haeseler A: Likelihood mapping: A simple method to visualize phylogenetic content of a sequence alignment. Proc Natl Acad Sci USA. 1997, 94: 6815-6819. 10.1073/pnas.94.13.6815.View ArticlePubMedPubMed CentralGoogle Scholar
- Manchester SR, Kress WJ: Fossil bananas (Musaceae): Ensete oregonense sp. nov. from the Eocene of western North America and its phytogeographic significance. Am J Bot. 1993, 80: 1264-1272. 10.2307/2445709.View ArticleGoogle Scholar
- Cheung F, Town CD: A BAC end view of the Musa acuminata genome. BMC Plant Biol. 2007, 7: 29-10.1186/1471-2229-7-29.View ArticlePubMedPubMed CentralGoogle Scholar
- Lescot M, Piffanelli P, Ciampi AY, Ruiz M, Blanc G, Leebens-Mack J, Da Silva FR, Santos CMR, D'Hont A, Garsmeur O, Vilarinhos AD, Kanamori H, Matsumoto T, Ronning CM, Cheung F, Haas BJ, Althoff R, Arbogast T, Hine E, Pappas GJ, Sasaki T, Souza MT, Miller RNG, Glaszmann JC, Town CD: Insights into the Musa genome: Syntenic relationships to rice and between Musa species. BMC Genomics. 2008, 9: 58-10.1186/1471-2164-9-58.View ArticlePubMedPubMed CentralGoogle Scholar
- Zwickl DJ, Hillis DM: Increased taxon sampling greatly reduces phylogenetic error. Syst Biol. 2002, 51: 588-598. 10.1080/10635150290102339.View ArticlePubMedGoogle Scholar
- Hillis DM, Pollock DD, McGuire JA, Zwickl DJ: Is sparse taxon sampling a problem for phylogenetic inference?. Syst Biol. 2003, 52: 124-126. 10.1080/10635150390132911.View ArticlePubMedPubMed CentralGoogle Scholar
- Poe S, Swofford DL: Taxon sampling revisited. Nature. 1999, 398: 299-300. 10.1038/18592.View ArticlePubMedGoogle Scholar
- Rosenberg MS, Kumar S: Incomplete taxon sampling is not a problem for phylogenetic inference. Proc Natl Acad Sci USA. 2001, 98: 10751-10756. 10.1073/pnas.191248498.View ArticlePubMedPubMed CentralGoogle Scholar
- Rosenberg MS, Kumar S: Taxon sampling, bioinformatics and phylogenomics. Syst Biol. 2003, 52: 119-124. 10.1080/10635150390132894.View ArticlePubMedPubMed CentralGoogle Scholar
- Hillis DM: Taxonomic sampling, phylogenetic accuracy, and investigator bias. Syst Biol. 1998, 47: 3-8. 10.1080/106351598260987.View ArticlePubMedGoogle Scholar
- Lorković ZJ, Wieczorek DA, Lambermon MHL, Filipowicz W: Pre-mRNA splicing in higher plants. Trends Plant Sci. 2000, 5: 160-167.View ArticlePubMedGoogle Scholar
- Jeffroy O, Brinkmann H, Delsuc F, Philippe H: Phylogenomics: the beginning of incongruence?. Trends Genet. 2006, 22: 225-231. 10.1016/j.tig.2006.02.003.View ArticlePubMedGoogle Scholar
- Phillips MJ, Delsuc F, Penny D: Genome-scale phylogeny and the detection of systematic biases. Mol Biol Evol. 2004, 21: 1455-1458. 10.1093/molbev/msh137.View ArticlePubMedGoogle Scholar
- Akaike H: A new look at the statistical model identification. IEEE Trans Autom Control. 1974, 19: 716-723. 10.1109/TAC.1974.1100705.View ArticleGoogle Scholar
- Posada D, Buckley TR: Model selection and model averaging in phylogenetics: Advantages of Akaike information criterion and Bayesian approaches over likelihood ratio tests. Syst Biol. 2004, 53: 793-808. 10.1080/10635150490522304.View ArticlePubMedGoogle Scholar
- Yoder AD, Irwin JA, Payseur BA: Failure of the ILD to determine data combinability for slow loris phylogeny. Syst Biol. 2001, 50: 408-424. 10.1080/106351501300318003.View ArticlePubMedGoogle Scholar
- Planet PJ: Tree disagreement: Measuring and testing incongruence in phylogenies. J Biomed Inform. 2006, 39: 86-102. 10.1016/j.jbi.2005.08.008.View ArticlePubMedGoogle Scholar
- Ramírez MJ: Further problems with the incongruence length difference test: ''hypercongruence'' effect and multiple comparisons. Cladistics. 2006, 22: 289-295.View ArticleGoogle Scholar
- Rokas A, Williams BL, King N, Carroll SB: Genome-scale approaches to resolving incongruence in molecular phylogenies. Nature. 2003, 425: 798-804. 10.1038/nature02053.View ArticlePubMedGoogle Scholar
- Ruangsuttapha S, Eimert K, Schröder MB, Silayoi B, Denduangboripant J, Kanchanapoom K: Molecular phylogeny of banana cultivars from Thailand based on HAT-RAPD markers. Genet Resour Crop Evol. 2007, 54: 1565-1572. 10.1007/s10722-006-9169-2.View ArticleGoogle Scholar
- Hřibová E, Čížková J, Christelová P, Taudien S, De Langhe E, Doležel J: The ITS1-5.8S-ITS2 sequence region in the Musaceae: structure, diversity and use in molecular phylogeny. Plos One. 6: e17863-Google Scholar
- Simmonds NW: Botanical results of the banana collecting expeditions, 1954-5. Kew Bulletin. 1956, 11: 463-489. 10.2307/4109131.View ArticleGoogle Scholar
- Simmonds NW: Isolation in Musa, sections Eumusa and Rhodochlamys. Evolution. 1954, 8: 65-74. 10.2307/2405666.View ArticleGoogle Scholar
- Shepherd K: Cytogenetics of the genus Musa. 1999, Montpellier: INIBAPGoogle Scholar
- Wong C, Kiew R, Ohn S, Lamb A, Lee SK, Gan LH, Gan YY: Sectional placement of three Bornean species of Musa (Musaceae) based on AFLP. Gardens' Bulletin Singapore. 2001, 53: 327-341.Google Scholar
- Argent GCG: Musaceae. The European Garden Flora. Volume II. Monocotyledons (Part 2): Juncaceae to Orchidaceae. Edited by: Walters SM, Brady A, Brickell CD, Cullen J, Green PS, Lewis J, Matthews VA, Webb, DA, Yeo PF, Alexander JCM. 1984, New York: Cambridge University Press, 117-119.Google Scholar
- Harris N: The elevation history of the Tibetan Plateau and its implications for the Asian monsoon. Palaeogeogr Palaeoclimatol Palaeoecol. 2006, 241: 4-15. 10.1016/j.palaeo.2006.07.009.View ArticleGoogle Scholar
- Simmonds NW: The evolution of the bananas. 1962, London: LongmanGoogle Scholar
- De Langhe E: Diversity in the genus Musa: its significance and its potential. Acta Hort. 2000, 540: 81-88.View ArticleGoogle Scholar
- Shepherd K: Observations on Musa taxonomy. Identification of genetic diversity in the genus Musa: Proceedings of an international workshop held at los Baños, Philippines, 5-10 September 1988. Edited by: Jarret RL. 1990, Montpellier: INIBAP, 158-165.Google Scholar
- Tamura K, Nei M: Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993, 10: 512-526.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.