Positive selection and ancient duplications in the evolution of class B floral homeotic genes of orchids and grasses
© Mondragón-Palomino et al; licensee BioMed Central Ltd. 2009
Received: 30 November 2008
Accepted: 21 April 2009
Published: 21 April 2009
Positive selection is recognized as the prevalence of nonsynonymous over synonymous substitutions in a gene. Models of the functional evolution of duplicated genes consider neofunctionalization as key to the retention of paralogues. For instance, duplicate transcription factors are specifically retained in plant and animal genomes and both positive selection and transcriptional divergence appear to have played a role in their diversification. However, the relative impact of these two factors has not been systematically evaluated. Class B MADS-box genes, comprising DEF-like and GLO-like genes, encode developmental transcription factors essential for establishment of perianth and male organ identity in the flowers of angiosperms. Here, we contrast the role of positive selection and the known divergence in expression patterns of genes encoding class B-like MADS-box transcription factors from monocots, with emphasis on the family Orchidaceae and the order Poales. Although in the monocots these two groups are highly diverse and have a strongly canalized floral morphology, there is no information on the role of positive selection in the evolution of their distinctive flower morphologies. Published research shows that in Poales, class B-like genes are expressed in stamens and in lodicules, the perianth organs whose identity might also be specified by class B-like genes, like the identity of the inner tepals of their lily-like relatives. In orchids, however, the number and pattern of expression of class B-like genes have greatly diverged.
The DEF-like genes from Orchidaceae form four well-supported, ancient clades of orthologues. In contrast, orchid GLO-like genes form a single clade of ancient orthologues and recent paralogues. DEF-like genes from orchid clade 2 (OMADS3-like genes) are under less stringent purifying selection than the other orchid DEF-like and GLO-like genes. In comparison with orchids, purifying selection was less stringent in DEF-like and GLO-like genes from Poales. Most importantly, positive selection took place before the major organ reduction and losses in the floral axis that eventually yielded the zygomorphic grass floret.
In DEF-like genes of Poales, positive selection on the region mediating interactions with other proteins or DNA could have triggered the evolution of the regulatory mechanisms behind the development of grass-specific reproductive structures. Orchidaceae show a different trend, where gene duplication and transcriptional divergence appear to have played a major role in the canalization and modularization of perianth development.
One important goal of contemporary biology is to understand how changes in developmental processes generate evolutionary novelties at the morphological level. The growing field of evolutionary developmental biology ('evo-devo'), approaches this question by determining how changes in the number, sequence and expression of developmental regulatory genes bring about formation of new structures. In plants and animals, these developmental regulatory factors have expanded during evolution (e.g. by gene and genome duplication) to form large and diverse gene families linked by complex genetic and physical interactions [1–3].
Mutations in transcriptional regulators of development often do not significantly affect the complete organism because their function is generally confined to a single category of organs or modules . Thus, it has been hypothesized that developmental transcription factors are more likely to evolve new functions and so coordinate the development of viable morphological novelty . The importance of duplication and diversification of genes encoding transcription factors (e.g. Hox genes) is substantiated by genomic analyses showing that these kinds of genes are specifically retained in plant  and animal genomes [6, 7]. Additionally, these genes show diverging patterns of expression, unequal rates of substitution and positive selection [6–11].
Positive selection is also involved in the diversification of several groups of plant developmental transcription factors [12–17]. Recent research has focused on those encoded by members of the MIKC-type MADS-box gene family because of their key role in the development and evolutionary diversification of the angiosperm flower [18–22]. Thus, characterizing their patterns of molecular evolution is essential to understanding their function and the mechanisms of morphological evolution. Because different functional classes of MADS-box genes form distinct clades [23–25], their phylogeny is an important aid to identify and test hypotheses explaining the different selective regimes that are generally considered to drive their evolution.
The plant-specific proteins encoded by MIKC-type MADS-box genes have an unique and highly-conserved domain structure that includes MADS- (M-), intervening (I-), keratin-like (K-) and C-terminal (C-) domains . The MADS-domain is mostly involved in DNA-binding and, together with the I-domain, mediates the formation of dimers. The K-domain plays an important role in protein – protein interaction during both dimerization and the formation of multimeric complexes. The C-terminal domain is the most variable region. In some cases it is involved in transcription activation, but it may also be involved in multimeric complex formation (reviewed in ).
The ABCDE model of flower development (reviewed in ), describes the genetic interactions of the five major classes of floral homeotic selector genes termed class A, B, C, D and E genes, almost all of which are MIKC-type genes. Each of these gene classes determines the identity of different floral organs: Class A and E genes specify sepals; class A, B and E genes determine petals; the combination of class B, C and E genes specifies stamens (male reproductive organs); class C and E genes determine carpels (female reproductive organs); and class D genes determine ovules.
Of special interest to our study are class B MADS-box genes encoding transcription factors, key to the specification of petal and stamen identity [27–32]. A gene duplication event that preceded the origin of extant angiosperms gave rise to DEF- and GLO-like genes, the two major lineages of class B genes in angiosperms [33–35]. The regulatory role of class B genes in some model plants such as Arabidopsis thaliana and Antirrhinum majus involves obligatory heterodimerization of proteins from the DEF and GLO lineages [27, 29, 36]. Moreover, these heterodimers form higher order complexes with other classes of MADS-domain proteins [37–39].
Recent analyses of the molecular evolution of class B-like MADS-box genes in angiosperms detected positive selection after two key duplication events that generated first DEF- and GLO-like genes and later euAP3-type and TM6-type genes, which are the major sublineages of DEF-like genes . The analysis of Hernández-Hernández et al. showed that during evolution, positive selection probably modified the central property of protein complex formation because most of the selected sites belong to the K-domain mediating protein – protein interactions in the complexes of MADS-domain transcription factors . Thus, the evolutionary emergence and divergence of DEF- and GLO-like genes after duplication enabled the formation of obligate heterodimeric complexes involved in the determination of floral organ identity, while the evolution of the class B gene lineages of euAP3-type and TM6-type genes may be associated with the morphological canalization of the core eudicot flower [22, 40].
Flowers of many monocots are actinomorphic, with two trimerous whorls of highly similar petaloid organs called tepals. In contrast, at least three kinds of organ identity exist in the zygomorphic orchid perianth: in the first floral whorl there are three outer tepals (T1–T3; often also termed 'sepals'). In the second whorl there are two lateral inner tepals (t1, t2; 'petals') and a median inner tepal (t3) called lip or labellum [41, 42]. The orchid family is divided into five monophyletic subfamilies that successively diverged from each other: Apostasioideae, Vanilloideae, Cypripedioideae, Orchidoideae and Epidendroideae [43, 44]. In each subfamily the three types of perianth organs have specific features, though the lip typically adopts the widest range of morphologies. In Apostasia, one of two genera of the Apostasioideae, the lip resembles the other tepals , while in flowers of Cypripedioideae the lip is "deeply pouched and inflated" .
In contrast, the reproductive organs of the typical grass floret are surrounded (from inner to outer) by two lodicules, palea and lemma following a zygomorphic organization. One or more florets are surrounded by two glumes, thus constituting a spikelet, the characteristic reproductive unit of grass inflorescences [47, 48]. Class B gene expression and corresponding homeotic transformations in mutant plants suggest that the lodicules are probably homologous with eudicot petals, even though lodicules are small, often glandular-looking organs that appear very different from typical petals [49, 50]. The homologies of palea, lemma and glumes to organs of other angiosperms, such as sepals or prophylls remain controversial .
Many monocot species have several copies of DEF- and GLO-like genes which are expressed differently from their orthologues first characterized in Antirrhinum majus and Arabidopsis thaliana [35, 52]. For example, the development of petaloid tepals in the outer whorl of Tulipa gesneriana and other petaloid monocots is probably determined by the heterotopic expression of DEF- and GLO-like genes [53–56]. Although this research suggests there are alternative ways in which class B proteins regulate the genes encoding them and are associated with novel morphologies, it is still far from clear how MADS-box gene duplication and transcriptional divergence is associated with developmental flexibility and the evolutionary diversification of floral morphology. An opportunity to address this question comes from recent studies in putative class B genes from Orchidaceae (orchids; order Asparagales), which indicate that the high degree of perianth diversity of this family might be associated to duplication of MADS-box genes [57–61]. Specifically, studies on the orchid species Habenaria radiata, Dendrobium crumenatum, Phalaenopsis equestris and the hybrid Oncidium "Gower Ramsey", have indicated that the petaloid character of the outer tepals of orchid flowers is due to heterotopic expression of class B genes HrDEF, DcOAP3A, PeMADS2 and PeMADS5, and OMADS3, respectively [57, 58, 60, 61]. In particular, the analyses of Phalaenopsis equestris and Dendrobium crumenatum indicated that the specific combination of duplicate gene expression in each whorl is associated with development of three distinct groups of organs: the outer and inner lateral tepals and the exceptionally diverse orchid lip [58, 60].
Recently, we linked these distinct patterns of expression and organ identity determination with preliminary data on the molecular phylogeny of orchid DEF-like genes [42, 62]. We argued, that the orchid perianth evolved from a lily-like ancestor as a result of duplication of DEF-like genes and subsequent regulatory changes that brought about differential expression of the paralogues. Our model predicts that the extant diversity of the orchid perianth results from changes in expression of some of these four DEF-like genes, or from changes in the downstream targets of the proteins that they encode . These two scenarios are not mutually exclusive and the pattern of selection on these genes could help to distinguish between them. We hypothesize that the occurrence of distinct patterns of molecular evolution in each clade of orchid DEF-like genes may substantiate the hypothesis that each of them is associated to distinct protein- or DNA-binding capabilities.
The DEF- and GLO-like genes from the order Poales provide a useful point of comparison for testing this hypothesis because they are essential for specification of stamens and the grass-specific lodicules [49, 50] and are expressed in homologous whorls of the grasses and their closest relatives. This indicates that the morphological differentiation of grasses from tepaloid monocots is possibly the result of changes in the downstream targets of class B transcription factors .
Here, we present an in-depth phylogenetic analysis of putative class B DEF- and GLO-like genes from the monocots. In this study, we substantially widen the sample of DEF- and GLO-like genes, incorporating new sequences from four out of five orchid subfamilies: Vanilloideae, Cypripedioideae, Orchidoideae and Epidendroideae, as well as Hypoxis, a member of the Asparagales frequently employed as outgroup in phylogenetic analyses of the Orchidaceae. The molecular phylogenies that we generate are essential for testing hypothesis on the selective regimes that affected class B-like MADS-box genes during the evolution of the orchid perianth. We compare the molecular evolution and expansion of DEF- and GLO-like genes in the Orchidaceae and order Poales to understand the processes of duplication and natural selection behind the genes associated with the development of the perianths and stamens of the two largest groups of the monocots.
Ancient and conserved paralogy in DEF-like genes, but not in GLO-like genes from orchids
We isolated three to four different cDNA sequences of DEF-like genes from each of the orchid species Vanilla planifolia (Vanilloideae), Phragmipedium longifolium (Cypripedioideae), Spiranthes odorata (Orchidoideae) and Gongora galeata (Epidendroideae), whereas in Hypoxis villosa (Hypoxidaceae), we found only two different sequences. In contrast, in almost all the orchid species analyzed only a single GLO-like sequence was identified, with the exception of Habenaria radiata, where two GLO-like genes have been isolated . In Hypoxis villosa two different sequences were found (Table S1 in Additional file 1).
The molecular phylogeny of all monocot DEF-like sequences in GenBank (Figure 1) shows strong support for the existence of four orchid-specific clades of orthologous genes that we designate according to the first reported sequence in the literature [57, 58]: PeMADS2-like genes (clade 1), OMADS3-like genes (clade 2), PeMADS3-like genes (clade 3) and PeMADS4-like genes (clade 4). The phylogenetic reconstruction indicates that clades 1 and 2, as well as clade 3 and 4, are sister clades (Figure 1). The topology of each clade reproduces the phylogeny of the Orchidaceae, indicating that the genes within these clades are orthologues (different genes that originated in speciation events) . The phylogenetic analysis indicates that the sequences in each of the four orchid DEF-like clades have features specific to each clade and suggest that, following gene duplication, they might have acquired different functions. In particular, the C-domain of the corresponding proteins is a region possibly involved in protein – protein interactions in multimeric complexes . This region experienced a remarkable level of clade-specific diversification (Additional file 2), whereas in the M-, I- and K-domains most of the substitutions involve amino acids of similar chemical properties. In the positions encoding the MIK-region, nucleotide identity ranges from 68 to 92%, but as expected, it drops to a range between 46 to 90% in the region encoding the C-terminal domain of sequences in clades 1, 3 and 4. This reduced identity is particularly pronounced in clade 2, where it ranges from 28 to 57% as a result of a large number of substitutions (e.g. PeMADS5 and SpodoDEF2) or early stop codons (e.g. OMADS3 and GogalDEF2) that eliminated the positions encoding for the otherwise highly conserved C-terminal 'PI-derived' and 'paleoAP3' motifs (Additional file 2). Finally, the monophyletic status of the individual clades of DEF-like genes in Orchidaceae (Asparagales) and in the orders Liliales, Poales and Commelinales is supported with posterior probabilities of at least 99% (nodes indicated with arrows, Figure 1). However, the phylogenetic relationships between these clades are undefined by these data.
Each region of the alignments is distinctly variable and thus has different proportions of phylogenetic informative positions. For example, this is reflected in the higher support levels and phylogenetic resolution in the tree obtained with the MIK-region of DEF-like genes (Figure 1), compared with the one estimated only with the C-terminal domain or with both regions together (Additional files 4 and 5). This indicates that different domains of these transcription factors are subject to different selective regimes. In contrast, the high degree of conservation in all regions of the GLO-like genes is reflected on the relatively similar phylogenetic reconstruction obtained with complete sequences (Figure 2), or individual regions (Additional files 6 and 7).
The apparent lack of family-wide duplications in the GLO-like genes within Orchidaceae contrasts with the two well-supported (PP = 0.92 and 1.0, respectively) clades formed by the other Asparagales sequences (Figure 2). According to their species composition, we estimate that these two clades A1 and A2 at least go back to the origin of the Hypoxidaceae and are present in the relatively advanced family Asparagaceae (Figure 2). Most importantly, clade A1 is clearly associated with the corresponding clade of the family Orchidaceae (PP = 0.89).
The grasses belong to an order (Poales) that also shows a well-supported and group-specific internal duplication of their corresponding GLO-like genes [63, 69]. Furthermore, the clades corresponding to the Liliales, Zingiberales, Arecales and Alismatales are also monophyletic (supported with PPs ≥ 0.91) and contain species-specific gene duplications (Figure 2).
DEF-like genes in clades 1 and 2 of Orchidaceae evolve under different rates of substitution
For every pair of sequences tested, the Maximum Likelihood-based (Relative Rate Test) RRT in HyPhy employs a (Likelihood Ratio Test) LRT to evaluate whether the data fit the null hypothesis of a molecular clock where the branch lengths of a phylogeny are equal or an alternative hypothesis where branch lengths are different.
After adjusting all the results of the RRTs for multiple comparisons, we found that 128 of 231 pairs of orchid DEF-like genes rejected the null hypothesis of a constant rate of substitution (P ≤ 0.027). In order to compare the rate differences between the pairs of DEF-like sequences that rejected the null hypothesis, we grouped them according to the clade where they belong (Figure 1) and estimated the median of the relative rates of synonymous and nonsynonymous substitution of these groups (Additional file 8). In the case of nonsynonymous substitutions, the most significant comparisons involve species from orchid Clades 1 and 2 with rate between 0.10 and 0.30 substitutions/nucleotide (Additional file 8A). This means that the genes from clade 2 have a rate of nonsynonymous substitutions about three times higher than those of sequences in clades 3 and 4 (Additional file 8A). On the other hand, the relative rate of synonymous substitution fluctuates independently of the clades compared (Additional file 8B).
In contrast, all 45 pairwise comparisons of orchid GLO-like genes failed to reject the null hypothesis (P ≥ 0.05) and have a median rate of nonsynonymous and synonymous substitution of 0.023 and 0.534 substitutions per nucleotide, respectively. Similarly, the RRT with DEF- and GLO-like genes from Poales did not reject the null hypothesis of constant rates of substitution.
Purifying selection affected orchid DEF- and GLO-like genes differently
Models of codon substitution employed on the present analysis.
Distribution of ω along sites follows a β distribution, no selection
Alignments of DEF- and GLO-like genes without basal angiosperm outgroup.
β distribution where ω > 1. Implements BEBb.
p0, p, q, ωs, where s = number of site categories
one ω value for all branches
Alignments of DEF- and GLO-like genes without basal angiosperm outgroup.
n different ω values on n specified branches
Different ω values corresponding to each of the user-specified groups of branches
Clades and sites
nearly neutral evolution
Clades 1 and 2: 0 < ω0 < 1, ω1 = 1
Alignment of orchid DEF-like genes from clades 1 and 2.
In site classes 2 and 3 selective pressure varies in different parts of the phylogeny. Implements BEBb.
Clade 1: 0 < ω0 < 1, ω1 = 1, ω2
Alignment of orchid DEF-like genes from clades 3 and 4.
Clade 2: 0 < ω0 < 1, ω1 = 1, ω3
Alignment of Poales GLO-like genes from clades 1 and 2.
Proportions: p0, p1 c
Assumes several site classes with independently estimated ω.
ω0, ω1, ω2
Same datasets analyzed with M1a vs. MC.
Proportions: p0, p1,
In site class 2, selective pressure is different on each clade.
Clade 1: ω0, ω1, ω2 Clade 1
Clade 2: ω0, ω1, ω2 Clade 2
Proportions: p0, p1 d
Branches and sites
Neutral or purifying selection on individual codons along specific clades.
Site class Background Foreground
Alignments of DEF- and GLO-like genes without basal angiosperm outgroup. Specific branches of the phylogenies were tested in separate analyses.
0 < ω0 < 1
0 < ω0 < 1
ω1 = 1
ω1 = 1
0 < ω0 < 1
ω2 = 1
ω1 = 1
ω2 = 1
Proportions: p0, p1 d
Tests for positive selection on individual codons along specific clades. Implements BEBb. Only foreground clades experience positive selection.
Site class Background Foreground
0 < ω0 < 1
0 < ω0 < 1
ω1 = 1
ω1 = 1
0 < ω0 < 1
ω1 = 1
Proportions: p0, p1 d
In the case of DEF-like genes, we tested five hypotheses where H0 assumes that ω adopted the same value along all branches of the phylogeny. Using LRT we compared this scenario with four relevant alternative hypothesis: H1, where a specific ω is estimated for the branches of clade 1 (C1); H2, where clade 2 (C2) has a different ω to the rest of the branches; H3, where orchid clades C1, C2 and C3 have estimated ω values different to the rest of the branches and H4, where the branches of each of the four clades of the orchid DEF-like genes and the single clade of Poales DEF-like genes (P1) have ω estimates significantly different from the rest of the branches (Figure 3).
Hypotheses H0 and H3, which assume no significant differences between all or some of the groups of branches tested, are significantly (≤ 0.0001) rejected in favor of H4, where specific ω values were estimated for the different groups of DEF-like genes from Orchidaceae and Poales (Figure 3). Most importantly, in hypothesis H4 where ω is estimated for each group of branches, these values converge and indicate that although all clades tested are under strong purifying selection, the group of branches in orchid clade C2 has an ω value of 0.1978, which is more than twice those estimated for other groups of branches (Figure 3).
Comparisons of H0 vs H1 and H7 rejected the scenario where ω is the same in all branches, thus favoring the alternative hypotheses where the orchid clade has ω = 0.043 (P ≤ 0.0001) (Figure 4). Like the DEF-like sequences, all clades of GLO-like genes are under strong purifying selection, but this is especially pronounced in the case of the single orchid clade (Figure 4).
Parameter estimates and LRT of M3 vs. MD and M1a vs. MC in DEF-like genes from Orchidaceae-specific clades 1 and 2 1 .
Estimate of parameters
k = 2
ω0 = 0.04544, f0 = 0.63768
ω1 = 0.42599, f1 = 0.36232
k = 3
ω0 = 0.04544, f0 = 0.63768
ω1 = 0.42599, f1 = 0.36232
ω2 = 38.73814, f2 = 0.00000
k = 2
ω0 = 0.04102, f0 = 0.60628
ω1C1 = 0.26333, ω1C2 = 0.59513, f1= 0.39372
k = 3
ω 0 = 1.28841, f 0 = 0.02038
ω 1 = 0.03309, f 1 = 0.55669
ω 2C1 = 0.21766, ω 2C2 = 0.50952, f 2 = 0.42294
ω0= 0.09066, f0 = 0.75319
ω1 = 1.00000, f1 = 0.24681
ω 0 = 0.03251, f 0 = 0.55437
ω 1 = 1.00000, f 1 = 0.02680
ω 2C1 = 0.21104, ω 2C2 = 0.50104, f 2 = 0.41883
Parameter estimates and LRT of M3 vs MD and M1a vs MC in DEF-like genes from Orchidaceae-specific clades 3 and 4 1 .
Estimate of parameters
k = 2
ω0 = 0.02182, f0 = 0.76519
ω1 = 0.23442, f1 = 0.23481
k = 3
ω0 = 0.02182, f0 = 0.09116
ω1 = 0.02182, f1= 0.67403
ω2 = 0.23442, f2 = 0.23481
k = 2
ω0 = 0.02175, f0= 0.76410
ω1C4 = 0.24611, ω1C3 = 0.22536, f1= 0.23590
k = 3
ω0 = 0.23556, f0 = 0.23301
ω1 = 0.01603, f1 = 0.00000
ω2C4 = 0.02665, ω2C3 = 0.01863, f2 = 0.76699
ω0 = 0.05538, f0 = 0.95421
ω1 = 1.00000, f1 = 0.04579
ω 0 = 0.21797, f 0 = 0.23733
ω 1 = 1.00000, f 1 = 0.00687
ω 2C4 = 0.02599, ω 2C3 = 0.01796, f 2 = 0.75580
Parameter estimates and LRT of M3 vs. MD and M1a vs. MC in GLO-like genes from Poales-specific clades 1 and 2 1 .
Estimate of parameters
k = 2
ω0 = 0.01674, f0 = 0.58033
ω1 = 0.22754, f1 = 0.41967
k = 3
ω0 = 0.01674, f0 = 0.58033
ω1 = 0.22754, f1 = 0.41967
ω2 = 32.14375, f2 = 0.00000
k = 2
ω0 = 0.22754, f0 = 0.41968
ω1P1 = 0.01605, ω1P2 = 0.01734, f1 = 0.58032
k = 3
ω0 = 48.10696, f0 = 0.00000
ω1 = 0.01695, f1 = 0.58176
ω2P1 = 0.26322, ω2P2 = 0.20131, f2 = 0.41824
ω0 = 0.08103, f0 = 0.96439
ω1 = 1.00000, f1 = 0.03561
ω 0 = 0.21164, f 0 = 0.42529
ω 1 = 1.00000, f 1 = 0.01051
ω 2P1 = 0.01484, ω 2P2 = 0.01608, f 2 = 0.56420
A similar set of branch-site tests used to study the divergence of the two clades of Poales GLO-like genes (Table 4) showed that MC, the model where shifts in ω take place after gene duplication, fits the data significantly better than other hypotheses (P < 0.0001). This result confirms that although P1 and P2 are under purifying selection (ω0= 0.211) as previously shown with M0 vs M2, ω is especially stringent when considering the clades individually (ω2P1 = 0.0148 and ω2P2 = 0.01608, respectively) (Table 4).
Positive selection preceded diversification of DEF-like genes in Poales
Parameter estimates and LRT of MA1 vs. MA in DEF-like genes from Poales.
Estimate of parameters
ω0 = 0.08303, f0 = 0.89319
ω1 = 1.00000, f1 = 0.03486
ω2a fore =936.29115, ω2a back =0.08303, f2a = 0.06925
ω2b fore =936.29115, ω2b back =1.00000, f2b = 0.00270
45 L 0.612
50 S 0.703
54 L 0.531
55 S 0.996**
77 I 0.556
82 A 0.588
86 K 0.583
89 N 0.991**
92 N 0.517
103 K 0.689
116 I 0.573
117 K 0.976*
126 L 0.625
132 L 0.993**
152 V 0.604
159 H 0.535
164 R 0.899←
167 E 0.742
170 N 0.931←
175 E 0.639
177 Y 0.996**
181 L 0.610
ω0 = 0.08196, f0 = 0.78512
ω1 = 1.00000, f1 = 0.03088
ω2a fore =1.00000, ω2a back =0.08196, f2a = 0.17704
ω2b fore =1.00000, ω2b back =1.00000, f2b = 0.00696
2δ = 8.433394 df = 1 P = 0.0037
ω0 = 0.08281, f0 = 0.86235
ω1 = 1.00000, f1 = 0.03283
ω2a fore =4.43236, ω2a back =0.08281, f2a = 0.10097
ω2b fore =4.43236, ω2b back =1.00000, f2b = 0.00384
14 S 0.813
42 E 0.747
44 S 0.846
50 S 0.993**
54 L 0.571
57 F 0.799
64 T 0.787
75 S 0.992**
78 N 0.706
81 S 0.788
103 K 0.893
128 E 0.892
142 N 0.821
154 N 0.736
171 Y 0.801
ω0 = 0.08278, f0 = 0.76740
ω1 = 1.00000, f1 = 0.02922
ω2a fore =1.00000, ω2a back =0.08278, f2a = 0.19592
ω2b fore =1.00000, ω2b back =1.00000, f2b = 0.00746
2δ = 2.204866 df = 1 P = 0.1376
ω0 = 0.08393, f0 = 0.95687
ω1 = 1.00000, f1 = 0.03593
ω2a fore =8.40943, ω2a back =0.08393, f2a = 0.00694
ω2b fore =8.40943, ω2b back =1.00000, f2b = 0.00026
33 T 0.517
183 L 0.954*
ω0 = 0.08401, f0 = 0.95276
ω1 = 1.00000, f1 = 0.03571
ω2a fore =1.00000, ω2a back =0.08401, f2a = 0.01111
ω2b fore =1.00000, ω2b back =1.00000, f2b = 0.00042
2δ = 0.93905 df = 1 P = 0.3325
Furthermore, we tracked the occurrence of positive selection along the branches representing the divergence of DEF-like genes in Poales following the emergence of E. elephas and S. angusifolia (Table 5, Figure 3). Although in these analyses the foreground branches have ω > 1 and sites under positive selection were identified, the LRT does not reject the null model MA1.
Similarly, we tested for positive selection in branches that in Orchidaceae precede and follow the duplications of DEF-like clades 1, 2, 3 and 4 (Figure 3, Table S3 in Additional file 1). The LRTs of these analyses did not reject the null hypothesis where over 95% of the codon sites are under purifying selection (ω0 = 0.084).
Further analyses in different nodes of the evolution of GLO-like genes in the Orchidaceae and Poales did not reject the null hypothesis where most of the sites have ω0 = 0.112 (Figure 4 and Table S4 in Additional file 1).
The phylogeny of MADS-box genes indicates that functional classes A, B, C+D and E of floral homeotic genes are grouped in distinct clades [23–25], suggesting that duplication and functional diversification of MADS-box genes contributed significantly to the evolution of the flower structure [23, 33, 72, 73]. Thus, studying the phylogeny of MADS-box genes is a powerful tool to better understand the evolution of plant morphology [23, 33, 73–75]. Here we investigate the molecular evolution of putative class B genes from monocots, with special emphasis on DEF- and GLO-like genes from orchids and Poales. We have chosen these genes because they most likely specify the identity of both perianth organs and stamens throughout the angiosperms, and they are thus essential for understanding important aspects of the morphological divergence of the orchids and grasses from the rest of the monocots [41, 42, 63, 76, 77]. Specifically, we determined whether shifts in the rates and patterns of nucleotide substitution of duplicated class B genes are correlated with their subsequent functional divergence and thus with the evolution of novel perianth structures in orchids and grasses.
Ancient duplication of DEF-like genes possibly facilitated the diversification of the orchid flower morphology
The phylogenetic analyses presented here strongly support the existence of four ancient, orchid-specific clades of DEF-like genes. These four clades and their internal branches are generally supported with PP values > 0.99. Our data strongly suggest that the sequences within each clade are orthologues, because they reproduce the systematic relationships reported for the four most derived subfamilies of the Orchidaceae: (Vanilloideae(Cypripedioideae(Orchidoideae, Epidendroideae))) [44, 78]. This indicates that the clades are the result of ancient gene duplication events that at least precede the origin of the subfamily Vanilloideae, which according to recent estimates, emerged between 71 to 62 MYA . The ancient origin, conservation and unique expression (see below) of these distinct groups of orchid-specific genes suggest that they have key roles in determining the organ identities behind the characteristic orchid floral morphology [42, 62]. Our data show that clades 1 and 2, as well as clades 3 and 4, are sister to each other. Although clades 1 and 2 seem most closely related to some other DEF-like genes in the Asparagales (Figure 1), the data available do not allow unequivocal reconstruction of the deeper phylogenetic relationships between orchid paralogous groups and the rest of the monocot sequences.
Exceptionally, we found that members of subfamily Orchidaceae have two lineages of GLO-like genes (Figure 2). We argue that these two lineages may be the result of a subfamily-specific duplication because exhaustive RACE on several cDNA pools from species in subfamilies Vanilloideae (Vanilla planifolia), Cypripedioideae (Phragmipedium longifolium) and Epidendroideae (Gongora galeata) yielded only a single GLO-like gene. The fact that GLO-like genes from Cypripedioideae and Epidendroideae seem to associate with different lineages of Orchidoideae (Figure 2), may be the result of differential gain and loss of duplicate genes during the evolution of these subfamilies.
The phylogeny of DEF-like genes provides the evolutionary context needed to interpret functional information on these sequences and suggests a particular model of perianth organ determination and evolution [42, 62]. Mapping the known expression patterns onto the phylogenies of orchid DEF-like sequences shows that genes belonging to the same clade have the same or very similar expression domains . These clade-specific expression patterns suggest that the duplication events that gave rise to the ancestors of these clades were followed by transcriptional and functional differentiation of each paralogue.
Specifically, in addition to expression in the gynostemium, genes from both clades 1 and 2 are expressed in the outer and inner tepals, whereas the expression of genes in clades 3 and 4 is limited to the inner tepals or to the lip, respectively . Considering both the expression patterns of each clade and the events of gene duplication that generated them, it seems likely that after the first gene duplication, the ancestor of genes in clades 3 and 4 produced the differences between inner (expression "on") and outer tepals (expression "off"). The resulting morphology might still exist in the basal genus Apostasia (subfamily Apostasioideae) whose flowers do not yet possess elaborate lips . Similarly, the distinction between lateral inner tepals and the lip emerged after a second gene duplication affecting the ancestor of clades 3 and 4, followed by changes in the cis-regulatory regions of the duplicated genes that resulted in differential expression and determination of lip identity by clade 4 genes . Validating this scenario involves further characterization of the patterns of expression of DEF- and GLO-like genes in the basal and relatively species-poor families Apostasioideae, Vanilloideae and Cypripedioideae.
Although genes in clade 1 and 2 are expressed in all the perianth organs, it is unlikely that they have a completely redundant role in the determination of perianth organ identity. According to our phylogenetic reconstruction, these clades were already present in the Vanilloideae, an orchid subfamily that emerged at least 62 million years ago . Retention of duplicate genes for such a long time is alone a strong argument for functional diversification. Furthermore, our analyses show that the members in these clades have substantial differences in their C-terminal domains, distinct rates of nonsynonymous substitution and significant differences in their respective patterns of purifying selection (Figures 1, 3, Tables 2, 3, Additional file 2). Specifically, in the sequences of clade 2 DEF-like genes, a relatively high proportion of non-synonymous substitutions followed the duplication that generated this clade and eventually caused the truncations in the open reading frames that characterize the sequences analyzed here. Despite the high level of divergence of clade 2 orchid DEF-like genes, our detailed characterization of ω along the phylogeny of all orchid class B genes did not indicate specific branches or sites where positive selection took place. Our estimation of ω along sites, branches and specific clades documents a scenario of prevalent purifying selection (Figures 3, 4, Tables 2 to 4) that agrees with previous analyses of class B genes and other floral organ identity genes [13, 14, 22, 80]. However, we cannot completely rule out the occurrence of positive selection, since its effects on few sites during a brief evolutionary period can be masked by ensuing and continuous purifying selection [81, 82].
The uniform expression of GLO-like genes in the perianth organs of orchids [59–61], indicates that although these genes are probably essential for proper flower development, they may not play a role in determining the different organ identities in the orchid perianth . However, further work is needed to determine whether the subfamily Orchidoideae-specific gene duplication (Figure 2) is associated with differential patterns of expression and function and thus with the specific morphology of this subfamily.
Similarly to DEF-like sequences, all clades of GLO-like genes were under strong purifying selection, but this is especially pronounced in the case of the single clade of genes from orchids (Figure 4). We think the differences in the ω ratio reflect the distinct selective constraints affecting duplicated class B genes. Specifically, the ratios corresponding to the generally monogenic clades of the Orchidaceae (ω = 0.043) and Liliales (ω = 0.0907) are lower than those of the rest of the Asparagales and the Poales, where there are two GLO-like loci (ω = 0.1206 and ω = 0.157, respectively). We reasoned that in orchids, and to a lesser extent in GLO-like sequences from Liliales, the lower rate of substitution reflects the stable co-evolutionary interaction between the product of a single GLO-like gene with several DEF-like interaction partners. Also, duplicated loci in clades A1 and A2 from the rest of the Asparagales, as well as P1 and P2 from the Poales, may be completely or partially redundant and thus under less stringent purifying selection than their single-copy homologues in other species (Figure 4).
The fact that each of the clades of DEF- and GLO-like sequences replicates the systematic relationships of the four most advanced subfamilies of the Orchidaceae [44, 78] suggests that these sequences may be useful for reconstructing the phylogeny of the Orchidaceae, provided the analysis exclusively involves orthologous sequences.
Positive selection in DEF-like genes from Poales preceded the evolution of the grass floret
The order Poales contains 18 families, of which three are represented in our analyses of DEF-like genes: Restionaceae (Elegia elephas), Joinvilleaceae (Joinvillea ascendens) and Poaceae with the rest of the species (Streptochaeta angustifolia, Oryza sativa, Triticum aestivum, Hordeum vulgare, Zea mays). Recent phylogenetic analyses showed that the early-diverging lineage of the Restionaceae is the sister group of a clade containing Joinvilleaceae and the more derived Poaceae; the latter comprising most of the species of Poales [47, 65]. Previous studies  sampled species representing the morphological transition from the typical monocot flower to the grass floret. Specifically, the flowers of the basal Elegia elephas and Joinvillea ascendens are actinomorphic, possessing two trimerous whorls of tepals and three or six stamens, respectively . In contrast, the floret of the early-diverging grass Streptochaeta angustifolia has 12 bracts. The trimerous arrangement of the six bracts VII to XII have been interpreted as the first and second perianth whorls . Most importantly, the expression of class B gene SaAP3 in the last three bracts and in the six stamens suggests that these bracts are second whorl organs, possibly a transitional form preceding the evolution of actual lodicules [63, 83]. Moreover, the study of Whipple et al (2007) suggests that class B genes control the identity of second whorl organs in a broader sense than only petal identity. The morphologies outlined above contrast with the grass floret found in the rest of the Poaceae species analyzed here. Specifically, the floret is formed by one lemma and one palea subtending a flower formed by two or three lodicules, and the male and female reproductive organs. The grass floret is also a zygomorphic structure like the orchid flower, mostly due to frequent differential suppression of stamens from different whorls [84, 85] and suppression of the adaxial lodicule in most derived grasses (e.g. Hordeum, Oryza) [48, 84].
In this context, the evidence for positive selection in the branch that represents the divergence of Poales DEF-like genes from the rest of the monocot genes (Figure 3, Table 5) suggests that during the morphological divergence of grasses a series of nonsynonymous substitutions took place before the emergence of the characteristic grass floret. Positive selection probably continued along the branches following the divergence from Elegia elephas and Streptochaeta angustifolia (Table 5), but the corresponding signal eventually became masked by positions under purifying selection that probably encode amino acids essential for a grass-specific network of class B gene targets. The facts that class B transcription factors SILKY1 (DEF-like) and ZMM16 (GLO-like) from maize share conserved heterodimerization specificity with Arabidopsis thaliana class B proteins APETALA3 and PISTILLATA in vitro and rescue the corresponding null mutants makes it conceivable that the residues mediating the interaction between these two sets of class B proteins already existed in the most recent common ancestor of monocots and eudicots . This not only suggests that the mechanisms of organ identity determination for second whorl organs were established early during the evolution of the angiosperms, but also that the subsequent morphological divergence in the grasses is probably associated with the lineage-specific substitutions in the K-domain that we detected. Substitutions in this domain may have the potential for changing higher order complex formation of class B proteins and thus their binding specificity to downstream target genes, eventually enabling them to coordinate the development of novel perianth structures.
Methods for detecting positive natural selection, like the ones we employed here, are powerful tools to generate and experimentally verify interesting hypothesis on the evolution of new gene functions and phenotypes. Recently, molecular adaptation of polygalacturonase inhibitor protein, TRIM5α, feruloyl esterase A and salicylic acid methyltransferase has been tested on proteins encoded by genes where nucleotides under positive selection were modified via site-directed mutagenesis or by domain-swapping experiments generating genes encoding chimeric proteins [86–89]. In the future, similar approaches could be employed to assess the functional consequences of positive selection in DEF-like transcription factors from Poales. For example, hypothesis on the transcriptional activity of these proteins could be evaluated by substituting sites under positive selection with the ones found in present-day orchids or the ones inferred to have existed in their common ancestor. The interactions of chimeric or mutagenized DEF-like proteins from Poales with other MADS-domain transcription factors and DNA could be assayed in vitro via yeast-two-hybrid and electrophoretic mobility shift assays , respectively, or in planta employing Fluorescence Resonance Energy Transfer (FRET) .
Functional consequences of C-terminal deletions in clade 2 DEF-like proteins of orchids
Previous phylogenetic analyses of the DEF- and GLO-like proteins identified specific motifs characteristic for certain plant lineages [33, 35]. Similarly to what we observed in the different lineages of orchid DEF-like genes, most of these characteristic motifs evolved in the C-terminal domain of the encoded proteins. This domain is highly variable and in class B proteins it might be involved in the formation of multimeric transcription factor complexes [39, 92]. The evolutionary importance of these differences in the C-terminal domain is supported by their long evolutionary conservation (Additional file 2). In particular, the family specific C-terminal deletions observed in orchid DEF-like proteins from clade 2 are exceptional if one considers that in all published DEF-like proteins, C-terminal deletions are only species specific (M. Mondragón-Palomino, unpublished results). The importance of these orchid-specific deletions is highlighted by recent findings on the occurrence of frameshift mutations in the regions encoding C-terminal motifs of the members of the clades of DEF- and GLO-like genes [93, 94]. For example, the study by Kramer et al indicates that the C-terminal motif 'euAP3' resulted from a translational frameshift caused by a single nucleotide deletion in the ancestral motif 'paleoAP3'. Because the paleoAP3 motif is found in DEF-like proteins throughout the angiosperms (e.g. in TM6-like proteins of eudicots), whereas the euAP3 motif is found only in DEF-like proteins of higher eudicots, Lamb and Irish , suggested that there is a causal relationship between duplication of DEF-like genes, mutations in the C-domain, functional differentiation of DEF-like genes and the emergence of specific floral morphologies. Although the previous evidence argues for the functional importance of the C-terminal deletions in the orchid clade 2 proteins, there is surprisingly little experimental evidence for a function of the C-terminal domain, not only in orchids but also in other species. Specifically, recent independent studies with truncated DEF-like proteins from Arabidopsis thaliana and Chloranthus spicatus suggest that the C-terminal domain is not essential for floral identity [96, 97]. Interestingly, yeast-two hybrid experiments involving OMADS3 from orchid Oncidium "Gower Ramsey", a clade 2 protein lacking the conserved C-terminal motifs PI-derived and paleoAP3 (Additional file 2), showed that this protein forms homodimers , opening up the possibility of novel regulatory functions for the proteins of clade 2. However, results derived from experiments using DEF-like proteins with truncated C-terminal domains of other monocots do not lend support to a particular role of this domain in dimerization [68, 90].
The 'orchid code' suggests that the diversification of the orchid perianth started with changes in the regulatory regions of the duplicated class B genes, which were soon followed by changes that led to the recognition of different target genes . The conserved clade-specific motifs that characterize each clade may be the result of diversifying selection taking place early and transiently during the divergence of the ancestral paralogues, but possibly after some changes that affected the domains of expression had already occurred. Subsequently, purifying selection may have been stabilizing the network of protein – protein and protein – DNA interactions involving four DEF-like proteins.
Our analyses suggest that diversification of the K-domain in DEF-like genes of Poales may have triggered the initial changes in transcription factor interactions that coordinate the development of the lodicules (Figure 6). Our comparative analysis of class B genes in the Orchidaceae and Poales, as well as published information on their pattern of expression, suggests that in these species-rich groups positive selection and transcriptional divergence have had different influences on the evolution of morphological diversification. In a wider context, the presented results suggest the preservation and functional diversification of paralogous genes involves case-specific differential divergence of coding and regulatory sequences (Figure 6).
Pre-anthesis flower buds of Hypoxis villosa (Hypoxidaceae) and orchids Vanilla planifolia (Vanilloideae),Phragmipedium longifolium (Cypripedioideae), Spiranthes odorata (Orchidoideae) and Gongora galeata (Epidendroideae) were collected from the living collections of the Halle (Halle an der Saale), Wilhelma (Stuttgart) and Heidelberg Botanical Gardens. They were preserved and transported in liquid nitrogen or RNAlater (Sigma) and stored at -80°C. The choice of the orchid species was based on their subfamily membership, the availability of blooming individuals and the number of plants present in living collections. In this study, we did not include material from the subfamily Apostasioideae because there were no specimens available in the living orchid collections outside of their natural range of distribution in Southeast Asia. We included H. villosa because this species and the rest of the family Hypoxidaceae (Asparagales) show important similarities to the orchids [98, 99] and members of the Hypoxidaceae are frequently used as point of comparison in recent molecular phylogenies of the orchids.
Bud material was used for total RNA isolation with Biomol's reagent, following the protocol of the manufacturer. Total RNA was employed for cDNA synthesis by using a poly-T primer with Fermentas MuLV Reverse Transcriptase. MADS-box gene specific sequences were isolated by 3' RACE from at least two different cDNA pools from each species with primer pair RQVT2 (5'-CGR CAR GTG ACS TTC TSC AAR CG-3') and AB07 (5'-GAC TCG AGT CGA CAT CTG-3') under conditions specified by the manufacturer for Taq polymerase (Fermentas). PCR products of about 700 bp were cloned into vectors pGEMT (Promega) or pJET1 (Fermentas) following the protocols of the manufacturers. The ligation products where electroporated into E. coli XL1 Blue, and 50 to 100 positive clones from each ligation were selected and sequenced in both directions multiple times with vector-specific primers on an ABI 3730xl DNA Analyzer using Big Dye Terminator chemistry. The resulting sequences were assembled and managed with the program SEQUENCHER (4.5, Gene Codes Corporation). The level of identity and phylogenetic association of these sequences with previously identified monocot DEF- and GLO-like sequences was determined with different strategies using BLAST  to all plant sequences in GenBank. The sequences that where unequivocally identified as those of DEF- or GLO-like transcripts were employed to generate specific primers to isolate the 5' end of the sequence with the 5'-RACE kit (Roche). Assembly of the 5' and 3' partial sequences allowed us to design specific primers to isolate the corresponding full-length sequences.
We assembled two datasets by retrieving with BLAST and keyword searches all monocot DEF- or GLO-like nucleotide and amino acid sequences deposited in NCBI's GenBank until September 20th 2007 (Table S1 in Additional file 1). The amino acid sequences were aligned using the program T-coffee [101, 102] to the corresponding DEF- and GLO-like conceptual amino acid translations of the cDNA sequences that we isolated from orchids. In these alignments we included as outgroup representatives the following sequences from basal angiosperms: KjAP3 (Kadsura japonica),AP3-1 (Illicium floridanum),IhAP3-1 (Illicium henryi),BsAP3 (Brasenia schreberi),NjAP3-3 (Nuphar japonica),EfAP3 (Euryale ferox),NymAP3 (Nymphaea sp.), NtAP3_1 (Nymphaea tetragona),AP3-1 (Nuphar variegata) and AmAP3 (Amborella trichopoda) for the phylogeny of DEF-like sequences. Similarly, for the phylogeny of GLO-like genes we employed the sequences of KjPI (Kadsura japonica), PI (Illicium floridanum), IhPI-1 (Illicium henryi), BsPI (Brasenia schreberi), CcPI (Cabomba caroliniana), NtPI (Nuphar tetragona), EfPI (Euryale ferox), NjPI-1, NjPI-2 (Nuphar japonica) and AtPI (Amborella trichopoda). In an initial step, the alignments were assessed with GBLOCKS  and T-coffee. Subsequently, after manual improvement using GenDoc , we used amino-acid sequence alignments as a template to align the corresponding nucleotide sequences. The nucleotide alignment of DEF-like genes contained 61 sequences and was 762 bp long, whereas the GLO-like matrix contained 74 sequences and was 696 bp long.
Phylogenetic reconstruction and analysis
We analyzed the alignments of DEF- and GLO-like sequences on their complete length or separated in the positions encoding the MIK region and the C-terminal domain, as defined in . The program MODELTEST (3.7)  was employed to determine which model of nucleotide substitution fits better to each alignment according to the corrected Akaike Information Criterion . The parameters of the best model were employed to reconstruct the phylogenies of DEF- and GLO-like genes with Mr. Bayes (3.1.2) . We performed preliminary runs of Bayesian Inference to determine the point where the Likelihood estimates resulting from each generation converges on a single value. Based on this, we performed all analyses of Bayesian inference for 2,000,000 generations, sampling every 100th and with a burn-in of 3,000 generations. Finally, we obtained a consensus tree with the rest of the results and used the posterior probabilities (PP) to estimate the statistical support for each node.
Determining the substitution saturation of the dataset
We determined the level of substitution saturation of DEF- and GLO-like sequences by plotting their genetic distance versus the number of transitions and transversions. Because the number of transitions relative to that of transversions decreases with increasing divergence, the point of substitution saturation corresponds with the distance value where the plots diverge from linearity and reach a plateau . The pairwise genetic distances of both DEF- and GLO-like genes were obtained in PAUP 4 beta 10 (Swofford 2002) using the nucleotide substitution model GTR+I+G which best fitted the data as estimated by analysis with MODELTEST (see previous section). The pairwise number of transitions and transversions corresponding to each dataset was obtained using DAMBE 4.5.56  and the plots were drawn with MicroSoft Excel.
Relative Rate Test
We employed the Maximum Likelihood (ML) pairwise Relative Rate Test (RRT) as implemented in the program HyPhy (v. 1.0)  to estimate the relative rate of substitution between the orchid and Poales DEF- and GLO-like sequences and their corresponding outgroup sequence TGDEFA and TGGLO from Tulipa gesneriana (Liliaceae), respectively. We chose these as representatives of the outgroup because the phylogenetic analysis presented in this paper showed that their relationship to the sequences from the rest of the monocots is well supported and their level of nucleotide substitution is not saturated.
The Maximum Likelihood-based RRT in HyPhy employs a Likelihood Ratio Test (LRT) to evaluate whether the sequence data fits the null hypothesis of a molecular clock (equal rates) represented by a tree of three taxa where all parameters are constrained to be equal along its branches, or an alternative hypothesis where such parameters are free to adopt different values. In order to infer the parameters of these phylogenetic hypotheses for each pair of sequences, we used the Muse-Gaut model  of codon substitution with nucleotide equilibrium frequencies based on their position in the codon (in Hyphy nomenclature MG94W9). The resulting parameter estimates were compared through series of Likelihood Ratio Tests that involved all pairs of sequences in relation to the corresponding outgroups. There is no simple procedure to reduce the effect of multiple comparisons in this series of P-values because the RRTs share the same outgroup and thus are not independent. Alternatively, the developers of HyPhy recommend adjusting the P-values of the LRTs for False Discovery Rate by implementing the Benjamini-Hochberg procedure (S. Kosakovsky-Pond, Hyphy on-line discussion forum). The False Discovery Rate (FDR) is the proportion of false positive results among all significant results. With the Benjamini-Hochberg correction we enforced a FDR = 0.05 by ranking all n P-values from smallest to largest, then dividing them by n and multiplying the result by 0.05 . The LRT comparisons that rejected the null hypothesis of neutrality are those where the original P-value was smaller than the corrected value.
For consistency with analyses described in the following section, the RRT analysis excludes sequences from basal angiosperms and all positions with indels.
Analysis of evolutionary patterns of divergence
The ratio (ω) of the rate of nonsynonymous substitutions at nonsynonymous sites (dN) to synonymous (dS) substitutions at synonymous sites was estimated to figure out whether the coding region of a gene is under negative (purifying) selection (ω < 1), positive selection (ω > 1) or evolves neutrally (ω = 1). Variations of ω along the evolutionary history of a gene family indicate changes in both the associated selective regime and functional constraints. Because different selective regimes might affect only some branches or sites during relatively short evolutionary time , we analyzed the heterogeneity of selective pressures in DEF- and GLO-like genes with the program codeml from the PAML package [112, 113], by comparing five pairs of models (abbreviated M) that describe the pattern of codon substitution at the levels of: (a) codon sites (M7 vs M8), (b) specific branches in a phylogeny (M0 vs M2) and (c) sites from specific clades (M1a vs MC and M3 vs MD) or (d) sites from specific branches (MA and MA1). Table 1 summarizes the main features and parameters of each model of codon substitution and the datasets analyzed with them. Each of these models serves to estimate ω and other parameters. Comparing the likelihood of these estimates via a Likelihood Ratio Test (LRT) determines whether a model that considers positive selection fits the data better than one assuming neutral or purifying selection. A detailed description of each test is provided elsewhere [70, 112, 114].
We investigated the occurrence of positive selection along codon sites by comparing the model M8 with model M7 . Then, we characterized the variation in selection pressure among branches of DEF- and GLO-like genes with a series of tests comparing the null hypothesis M0 "one ω ratio for all branches", with different alternative hypothesis based on M2 "different ω ratios", where independent ω values are estimated for specific branches  (Table 1).
We implemented clade and site tests of models M1a vs MC and M3 vs MD, to determine whether functional constraints differ significantly between clades after gene duplication in the two paralogous groups of DEF-like genes in Orchidaceae (Clades 1 and 2) or (Clades 3 and 4) (Figure 1), or the duplicate GLO-like genes from Poales (Figure 2). Clade models MC and MD assume that there were variations in the selective pressures among the amino acids encoded by a gene and that some of these sites also experienced changes in selective regimes at different points of their evolutionary history, such as changes occurring after gene duplication  (Table 1).
With branch-site models A and A1 , we tested for the occurrence of positive selection on individual codons along specific branch groups of DEF- and GLO-like genes from the orchids and Poales. Model A assumes that the branches in the phylogeny are divided a priori in foreground and background clades, where only the former may have experienced positive selection (Table 1). In the null model MA1 ω2 = 1, thus comparing model MA with MA1 is a direct test of positive selection on the foreground clades.
The relative fit of the parameters estimated by each of these models of codon substitution is represented by a maximum likelihood value that can be compared between nested hypotheses by a Likelihood Ratio Test (LRT). The LRT statistics are assumed to be χ2 distributed with degrees of freedom equal to the difference in the number of parameters between models. The application of these models requires a phylogenetic assumption, but since the process of ML inference with program codeml is computationally intensive, for most of the tests we employed unrooted trees inferred as previously described from a smaller dataset. This dataset does not include sequences of basal angiosperms and sequences with large sections of indels (gaps) because we did not want to consider them as additional character states. The alignment of DEF-like genes is 747 bp long and contains 41 sequences, while the alignment of GLO-like genes is 666 bp long and has 61 sequences. The tests involving clade-sites models required the use of three alignments that only included members of clades 1 and 2, 3 and 4 from the Orchidaceae DEF-like genes or sequences from clades 1 and 2 of the Poales, respectively (Table 1). The phylogenies corresponding to these alignments were constructed as described previously.
To detect variations in parameter estimation, we performed and compared the analyses of each model at least twice. As with the RRTs with HyPhy, all analyses with codeml were conducted without considering positions with indels. In addition to the tests performed with PAML, we carried out an independent estimation of ω in DEF-like genes with the on-line implementation of the Hyphy package http://www.datamonkey.org. Specifically, we employed the FEL (Fixed Effects Likelihood) method to estimate the rates of nonsynonymous and synonymous substitution across codons with the same alignment and phylogeny previously used to test M0 with PAML.
We thank the Botanical Gardens of Halle an der Saale, Stuttgart, Heidelberg (HEID) and Jena for permission to obtain plant material from their living collections. Many thanks to Domenica Schnabelrauch and Bettina Kästner for technical assistance and to Paula Rudall, Richard Bateman, Maria Anisimova, Rainer Melzer, Lydia Gramzow and three anonymous reviewer for insightful comments and suggestions on different versions of this manuscript.
This work was funded by the Deutsche Akademische Austausch Dienst (A/04/21600) to MMP and the Volkswagen Stiftung (I/81 901) to MMP and GT.
- Meyerowitz EM: Plants compared to animals: The broadest comparative study of development. Science. 2002, 295: 1482-1485. 10.1126/science.1066609.PubMedGoogle Scholar
- Kellogg EA: Evolution of developmental traits. Curr Opin Plant Biol. 2004, 7: 92-98. 10.1016/j.pbi.2003.11.004.PubMedGoogle Scholar
- De Robertis EM: Evo-Devo:Variations on ancestral themes. Cell. 2008, 132: 185-195. 10.1016/j.cell.2008.01.003.PubMed CentralPubMedGoogle Scholar
- Doebley J, Lukens L: Transcriptional regulators and the evolution of plant form. Plant Cell. 1998, 10: 1075-1082. 10.1105/tpc.10.7.1075.PubMed CentralPubMedGoogle Scholar
- Blanc G, Wolfe KH: Functional divergence of duplicated genes formed by polypoidy during Arabidopsis evolution. Plant Cell. 2004, 16: 1679-1691. 10.1105/tpc.021410.PubMed CentralPubMedGoogle Scholar
- Peer Van de Y, Taylor JS, Braasch I, Meyer A: The ghost of selection past: rates of evolution and functional divergence of anciently duplicated genes. J Mol Evol. 2001, 53: 436-446. 10.1007/s002390010233.PubMedGoogle Scholar
- Shiu SH, Byrnes JK, Pan R, Zhang P, Li WH: Role of positive selection in the retention of duplicate genes in mammalian genomes. Proc Natl Acad Sci USA. 2006, 103: 2232-2236. 10.1073/pnas.0510388103.PubMed CentralPubMedGoogle Scholar
- Fares MA, Bezemer D, Moya A, Marín I: Selection on coding regions determined Hox7 genes evolution. Mol Biol Evol. 2003, 20: 2104-2112. 10.1093/molbev/msg222.PubMedGoogle Scholar
- Steinke D, Salzburger W, Braasch I, Meyer A: Many genes in fish have species-specific asymmetric rates of molecular evolution. BMC Genomics. 2006, 7: 20-10.1186/1471-2164-7-20.PubMed CentralPubMedGoogle Scholar
- Crow KD, Stadler PF, Lynch VJ, Amemiya C, Wagner GP: The "Fish-Specific" Hox cluster duplication is coincident with the origin of teleosts. Mol Biol Evol. 2006, 23: 121-136. 10.1093/molbev/msj020.PubMedGoogle Scholar
- Lynch VJ, Roth JJ, Wagner GP: Adaptive evolution of Hox-gene homeodomains after cluster duplications. BMC Evol Biol. 2006, 6: 86-10.1186/1471-2148-6-86.PubMed CentralPubMedGoogle Scholar
- Olsen KM, Womack A, Garrett AR, Suddith JI, Purugganan MD: Contrasting evolutionary forces in the Arabidopsis thaliana floral developmental pathway. Genetics. 2002, 160: 1641-1650.PubMed CentralPubMedGoogle Scholar
- Lukens L, Doebley J: Molecular evolution of the teosinte branched gene among maize and related grasses. Mol Biol Evol. 2001, 18 (4): 627-638.PubMedGoogle Scholar
- Ree RH, Citerne HL, Lavin M, Cronk QCB: Heterogenous selection on LEGCYC paralogs in relation to flower morphology and the phylogeny of Lupinus (Leguminosae). Mol Biol Evol. 2004, 21: 321-331. 10.1093/molbev/msh022.PubMedGoogle Scholar
- Yang X, Tuskan GA, Cheng ZM: Divergence of the Dof gene families in poplar, Arabidopsis, and rice suggests multiple modes of gene evolution after duplication. Plant Physiol. 2006, 142: 820-830. 10.1104/pp.106.083642.PubMed CentralPubMedGoogle Scholar
- Yang Z, Wang X, Gu S, Hu Z, Xu H, Xu C: Comparative study of SBP-box gene family in Arabidopsis and rice. Gene. 2007, 407: 1-11. 10.1016/j.gene.2007.02.034.PubMedGoogle Scholar
- Li J, Clegg MT, Jiang T: Excess non-synonymous substitutions suggest that positive selection episodes occurred during the evolution of DNA-binding domains in the Arabidopsis R2R3-MYB gene family. Plant Mol Biol. 2003, 52: 627-642. 10.1023/A:1024875232511.Google Scholar
- Purugganan MD, Boyles AL, Suddith JI: Variation and selection at the CAULIFLOWER floral homeotic gene accompanying the evolution of domesticated Brassica oleracea. Genetics. 2000, 155: 855-862.PubMed CentralPubMedGoogle Scholar
- Barrier M, Robichaux RH, Purugganan MD: Accelerated regulatory gene evolution in an adaptive radiation. Proc Natl Acad Sci USA. 2001, 98: 10208-10213. 10.1073/pnas.181257698.PubMed CentralPubMedGoogle Scholar
- Moore RC, Grant SR, Purugganan MD: Molecular population genetics of redundant floral-regulatory genes in Arabidopsis thaliana. Mol Biol Evol. 2005, 22: 91-103. 10.1093/molbev/msh261.PubMedGoogle Scholar
- Martinez-Castilla LP, Alvarez-Buylla ER: Adaptive evolution in the Arabidopsis MADS-box gene family inferred from its complete resolved phylogeny. Proc Natl Acad Sci USA. 2003, 100: 13407-13412. 10.1073/pnas.1835864100.PubMed CentralPubMedGoogle Scholar
- Hernández-Hernández T, Martínez-Castilla LP, Alvarez-Buylla ER: Functional diversification of B MADS-box homeotic regulators of flower development: Adaptive evolution in protein-protein interaction domains after major gene duplication events. Mol Biol Evol. 2007, 24: 465-481. 10.1093/molbev/msl182.PubMedGoogle Scholar
- Theissen G, Kim JT, Saedler H: Classification and phylogeny of the MADS-box multigene family suggest defined roles of MADS-box gene subfamilies in the morphological evolution of eukaryotes. J Mol Evol. 1996, 43: 484-516. 10.1007/BF02337521.PubMedGoogle Scholar
- Johansen B, Pedersen LB, Skipper M, Frederiksen S: MADS-box gene evolution-structure and transcription patterns. Mol Phyl Evol. 2002, 23: 458-480. 10.1016/S1055-7903(02)00032-5.Google Scholar
- Becker A, Theissen G: The major clades of MADS-box genes and their role in the development and evolution of flowering plants. Mol Phyl Evol. 2003, 29: 464-489. 10.1016/S1055-7903(03)00207-0.Google Scholar
- Kaufmann K, Melzer R, Theissen G: MIKC-type MADS-domain proteins: structural modularity, protein interactions and network evolution in land plants. Gene. 2005, 347: 183-198. 10.1016/j.gene.2004.12.014.PubMedGoogle Scholar
- Schwarz-Sommer Z, Hue I, Huijser P, Flor PJ, Hansen R, Tetens P, Lönning W-E, Saedler H, Sommer H: Characterization of the Antirrhinum floral homeotic MADS-box gene deficiens: evidence for DNA binding and autoregulation of its persistent expression throughout flower development. EMBO J. 1992, 11: 251-263.PubMed CentralPubMedGoogle Scholar
- Jack T, Brockman LL, Meyerowitz E: The homeotic gene APETALA3 of Arabidopsis thaliana encodes a MADS box and is expressed in petals and stamens. Cell. 1992, 68: 683-697. 10.1016/0092-8674(92)90144-2.PubMedGoogle Scholar
- Trobner W, Ramirez L, Motte P, Hue I, Huijser P, Lonnig WE, Saedler H, Sommer H, Schwarz-Sommer Z: GLOBOSA : a homeotic gene which interacts with DEFICIENS in the control of Antirrhinum floral organogenesis. EMBO J. 1992, 11: 4693-4704.PubMed CentralPubMedGoogle Scholar
- Jack T, Fox GL, Meyerowitz E: Arabidopsis homeotic gene APETALA3 ectopic expression: Transcriptional and posttranscriptional regulation determine floral organ identity. Cell. 1994, 76: 703-716. 10.1016/0092-8674(94)90509-6.PubMedGoogle Scholar
- Goto K, Meyerowitz EM: Function and regulation of the Arabidopsis floral homeotic gene PISTILLATA. Genes Dev. 1994, 8: 1548-1560. 10.1101/gad.8.13.1548.PubMedGoogle Scholar
- Krizek BA, Meyerowitz EM: The Arabidopsis homeotic genes APETALA3 and PISTILLATA are sufficient to provide the B class organ identity function. Development. 1996, 122: 11-22.PubMedGoogle Scholar
- Kramer EM, Dorit RL, Irish VF: Molecular evolution of genes controlling petal and stamen development: Duplication and divergence within the APETALA3 and PISTILLATA MADS-box gene lineages. Genetics. 1998, 149: 765-783.PubMed CentralPubMedGoogle Scholar
- Kramer EM, Irish VF: Evolution of genetic mechanisms controlling petal development. Nature. 1999, 399: 144-148. 10.1038/20172.PubMedGoogle Scholar
- Kim S, Yoo M-J, Albert VA, Farris JS, Soltis PS, Soltis DE: Phylogeny and diversification of B-function MADS-box genes in angiosperms: evolutionary and functional implications of a 260-million-year-old duplication. Am J Bot. 2004, 91: 2102-2118. 10.3732/ajb.91.12.2102.PubMedGoogle Scholar
- Sommer H, Beltran JP, Huijser P, Pape H, Lonnig WE, Saedler H, Schwarz-Sommer Z: Deficiens, a homeotic gene involved in the control of flower morphogenesis in Antirrhinum majus: the protein shows homology to transcription factors. EMBO J. 1990, 9: 605-613.PubMed CentralPubMedGoogle Scholar
- Egea-Cortines M, Saedler H, Sommer H: Ternary complex formation between the MADS-box proteins SQUAMOSA, DEFICIENS and GLOBOSA is involved in the control of floral architecture in Antirrhinum majus. EMBO J. 1999, 18: 5370-5379. 10.1093/emboj/18.19.5370.PubMed CentralPubMedGoogle Scholar
- Causier B, Cook H, Davies B: An Antirrhinum ternary complex factor specifically interacts with C-function and SEPALLATA-like MADS-box factors. Plant Mol Biol. 2003, 52: 1051-1062. 10.1023/A:1025426016267.PubMedGoogle Scholar
- Honma T, Goto K: Complexes of MADS-box proteins are sufficient to convert leaves into floral organs. Nature. 2001, 409: 525-529. 10.1038/35054083.PubMedGoogle Scholar
- Winter K-U, Saedler H, Theissen G: On the origin of class B floral homeotic genes: functional substitution and dominant inhibition in Arabidopsis by expression of an orthologue from the gymnosperm. Plant J. 2002, 31: 457-475. 10.1046/j.1365-313X.2002.01375.x.PubMedGoogle Scholar
- Rudall PJ, Bateman RM: Roles of synorganisation, zygomorphy and heterotopy in floral evolution: The gynostemium and labellum of orchids and other lilioid monocots. Biol Rev. 2002, 77: 403-441. 10.1017/S1464793102005936.PubMedGoogle Scholar
- Mondragón-Palomino M, Theißen G: MADS about the evolution of orchid flowers. Trends Plant Sci. 2008, 13: 51-59.PubMedGoogle Scholar
- Freudenstein JV, Berg Van den C, Goldman DH, Kores PJ, Molvray M, Chase MW: An expanded plastid DNA phylogeny of orchidaceae and analysis of Jackknife branch support strategy. Amer J of Bot. 2004, 91: 149-157. 10.3732/ajb.91.1.149.Google Scholar
- Cameron KM: A comparison and combination of plastid atpB and rbcL gene sequences for inferring phylogenetic relationships within Orchidaceae. Monocots: Comparative Biology and Evolution Excluding Poales. Edited by: Columbus JT, Friar EA, Porter JM, Prince LM, Simpson MG. 2006, Claremont: Rancho Santa Ana Botanical Garden, I: 447-464.Google Scholar
- Kocyan A, Endress PK: Floral structure and development of Apostasia and Neuwiedia (Apostasioideae) and their relationships to other orchids. Int J Plant Sci. 2001, 162: 847-867. 10.1086/320781.Google Scholar
- Pridgeon AM, Cribb PJ, Chase MW, Rasmussen F, eds: General Introduction, Apostasioideae, Cypripedioideae. 1999, Oxford: Oxford University Press
- Linder HP, Rudall PJ: Evolutionary history of poales. Ann Rev Ecol Evol Syst. 2005, 36: 107-124. 10.1146/annurev.ecolsys.36.102403.135635.Google Scholar
- Yamaguchi T, Hirano HY: Function and diversification of MADS-box genes in Rice. ScientificWorldJournal. 2006, 6: 1923-1932. 10.1100/tsw.2006.320.PubMedGoogle Scholar
- Ambrose BA, Lerner DR, Ciceri P, Padilla CM, Yanofsky MF, Schmidt RJ: Molecular and genetic analyses of the silky1 gene reveal conservation in floral organ specification between eudicots and monocots. Mol Cell. 2000, 5: 569-579. 10.1016/S1097-2765(00)80450-5.PubMedGoogle Scholar
- Nagasawa N, Miyoshi M, Sano Y, Satoh H, Hirano H, Sakai H, Nagato Y: SUPERWOMAN1 and DROOPING LEAF genes control floral organ identity in rice. Development. 2003, 130: 705-718. 10.1242/dev.00294.PubMedGoogle Scholar
- Erbar C: Current opinions in flower development and evo-devo approach in plant phylogeny. Plant Syst Evol. 2007, 269: 107-132. 10.1007/s00606-007-0579-1.Google Scholar
- Zahn LM, Leebens-Mack J, DePamphilis CW, Ma H, Theissen G: To B or not to B a flower: the role of DEFICIENS and GLOBOSA orthologs in the evolution of the angiosperms. J Hered. 2005, 96: 225-240. 10.1093/jhered/esi033.PubMedGoogle Scholar
- van Tunen AJ, Eikelboom W, Angenent G: Floral organogenesis in Tulipa. Flow Newsl. 1993, 16: 33-38.Google Scholar
- Kanno A, Saeki H, Kameya T, Saedler H, Theissen G: Heterotopic expression of class B floral homeotic gene supports a modified ABC model for tulip (Tulipa gesneriana). Plant Mol Biol. 2003, 52: 831-841. 10.1023/A:1025070827979.PubMedGoogle Scholar
- Nakamura T, Fukuda T, Nakano M, Hasebe M, Kameya T, Kanno A: The modified ABC model explains the development of the petaloid perianth of Agapanthus praecox ssp. orientalis (Agapanthaceae) flowers. Plant Mol Biol. 2005, 58: 435-445. 10.1007/s11103-005-5218-z.PubMedGoogle Scholar
- Nakada M, Komatsu M, Ochiai T, Ohtsu K, Nakazono M, Nishizawa NK, Ko N, Nishiyama R, Kameya T, Kanno A: Isolation of MaDEF from Muscari armeniacum and analysis of its expression using laser microdissection. Plant Sci. 2006, 170: 143-150. 10.1016/j.plantsci.2005.08.021.Google Scholar
- Hsu HF, Yang CH: An Orchid (Oncidium Gower Ramsey) AP3-like MADS gene regulates floral formation and initiation. Plant Cell Physiol. 2002, 43: 1198-1209. 10.1093/pcp/pcf143.PubMedGoogle Scholar
- Tsai WC, Kuoh CS, Chuang MH, Chen WH, Chen HH: Four DEF-like MADS box genes displayed distinct floral morphogenetic roles in Phalaenopsis orchid. Plant Cell Physiol. 2004, 46: 831-844. 10.1093/pcp/pch095.Google Scholar
- Tsai WC, Lee PF, Chen HI, Hsiao YY, Wei WJ, Pan ZJ, Chuang MH, Kuoh CS, Chen WH, Chen HH: PeMADS6, a GLOBOSA/PISTILLATA-like gene in Phalaenopsis equestris involved in petaloid formation, and correlated with flower longevity and ovary development. Plant Cell Physiol. 2005, 46: 1125-1139. 10.1093/pcp/pci125.PubMedGoogle Scholar
- Xu Y, Teo LL, Zhou J, Kumar PP, Yu H: Floral organ identity genes in the orchid Dendrobium crumenatum. Plant J. 2006, 46: 54-68. 10.1111/j.1365-313X.2006.02669.x.PubMedGoogle Scholar
- Kim S-Y, Yun P-Y, Fukuda T, Ochiai T, Yokoyama J, Kameya T, Kanno A: Expression of a DEFICIENS-like gene correlates with the differentiation between sepal and petal in the orchid, Habenaria radiata (Orchidaceae). Plant Sci. 2007, 172: 319-326. 10.1016/j.plantsci.2006.09.009.Google Scholar
- Mondragón-Palomino M, Theißen G: Why are orchid flowers so diverse? Reduction of evolutionary constraints by paralogues of class B floral homeotic genes. Ann Bot (Lond). 2009Google Scholar
- Whipple CJ, Zanis MJ, Kellogg EA, Schmidt RJ: Conservation of B class gene expression in the second whorl of a basal grass and outgroups links the origin of loducules and petals. Proc Natl Acad Sci USA. 2007, 104: 1081-1086. 10.1073/pnas.0606434104.PubMed CentralPubMedGoogle Scholar
- Chase MW, Cameron KM, Barrett RL, Freundenstein JV: DNA data and orchidaceae systematics: a new phylogenetic classification. Orchid conservation. Edited by: Dixon KW, Kell SP, Barrett RL, Cribb PJ. 2003, Kota Kinabalu, Sabah: Natural History Publications, 69-89.Google Scholar
- Chase MW, Fay MF, Devey DS, Maurin O, Ronsted N, Davies TJ, Pillon Y, Petersen G, Seberg O, Tamura MN, et al: Multigene analyses of monocot relationships: A summary. Monocots: Comparative Biology and Evolution Excluding Poales. Edited by: Columbus JT, Friar EA, Porter JM, Prince LM, Simpson MG. 2006, Claremont: Rancho Santa Ana Botanical Garden, I: 63-75.Google Scholar
- Graham SW, Zgurski JM, McPherson MA, Cherniawsky DM, Saarela JM, Horne EFC, Smith SY, Wong WA, O'Brien H, Biron VL, et al: Robust inference of monocot deep phylogeny using and expanded multigene plastid data set. Monocots: Comparative Biology and Evolution Excluding Poales. Edited by: Columbus JT, Friar EA, Porter JM, Prince LM, Simpson MG. 2006, Claremont: Rancho Santa Ana Botanical Garden, 1: 3-21.Google Scholar
- Davis JI, Stevenson DW, Petersen G, Seberg O, Campbell LM, Freudenstein JV, Goldman DH, Hardy CR, Michelangeli FA, Simmons MP, Specht CD, Vergara-Silva F, Gandolfo M: A phylogeny of the monocots, as inferred from rbcL and atpA sequence variation, and a comparison of methods for calculating Jackknife and Bootstrap values. Syst Bot. 2004, 29: 467-510. 10.1600/0363644041744365.Google Scholar
- Tzeng TY, Liu HC, Yang CH: The C-terminal sequence of LMADS1 is essential for the formation of homodimers for B function proteins. J Biol Chem. 2004, 279: 10747-10755. 10.1074/jbc.M311646200.PubMedGoogle Scholar
- Münster T, Wingen LU, Faigl W, Werth S, Saedler H, Theissen G: Characterization of three GLOBOSA-like MADS-box genes from maize: evidence for ancient paralogy in one class of floral homeotic B-function genes of grasses. Gene. 2001, 262: 1-13. 10.1016/S0378-1119(00)00556-4.PubMedGoogle Scholar
- Bielawski JP, Yang Z: A maximum likelihood method for detecting functional divergence at the individual codon sites, with application to gene family evolution. J Mol Evol. 2004, 59: 121-132. 10.1007/s00239-004-2597-8.PubMedGoogle Scholar
- Yang Y, Fanning L, Jack T: The K domain mediates heterodimerization of the Arabidopsis floral organ identity proteins, APETALA3 and PISTILLATA. Plant J. 2003, 33: 47-59. 10.1046/j.0960-7412.2003.01473.x.PubMedGoogle Scholar
- Ferrario S, Immink RG, Angenent GC: Conservation and diversity in flower land. Curr Opin Plant Biol. 2004, 7: 84-91. 10.1016/j.pbi.2003.11.003.PubMedGoogle Scholar
- Theissen G, Becker A, Di Rosa A, Kanno A, Kim JT, Munster T, Winter K-U, Saedler H: A short history of MADS-box genes in plants. Plant Mol Biol. 2000, 42: 115-149. 10.1023/A:1006332105728.PubMedGoogle Scholar
- Purugganan MD, Rounsley SD, Schmidt RJ, Yanfosky MF: Molecular evolution of flower development: Diversification of the plant MADS-box regulatory gene family. Genetics. 1995, 140: 345-356.PubMed CentralPubMedGoogle Scholar
- Parenicova L, de Folter S, Kieffer M, Horner DS, Favalli C, Busscher J, Cook HE, Ingram RM, Kater MM, Davies B, Angenent GC, Colombo L: Molecular and phylogenetic analyses of the complete MADS-box transcription factor family in Arabidopsis : new openings to the MADS world. Plant Cell. 2003, 15: 1538-1551. 10.1105/tpc.011544.PubMed CentralPubMedGoogle Scholar
- Whipple CJ, Ciceri P, Padilla CM, Ambrose BA, Bandong SL, Schmidt R: Conservation of B-class floral homeotic gene function between maize and Arabidopsis. Development. 2004, 131: 6083-6091. 10.1242/dev.01523.PubMedGoogle Scholar
- Bateman RM, Rudall PJ: The good, the bad and the ugly: Using naturally occurring terata to distinguish the possible from the impossible in orchid floral evolution. Monocots: Comparative Biology and Evolution Excluding Poales. Edited by: Columbus JT, Friar EA, Porter JM, Prince LM, Simpson MG. 2006, Claremont: Rancho Santa Ana Botanical Garden, I: 481-496.Google Scholar
- Cameron KM: Molecular systematics of Orchidaceae: A literature review and an example using five plastid genes. Proceedings of the 17th World Orchid Conference: 24 April – 1 May 2005; Shah Alam. 2005, Malaysia: Natural History Publications, 80-96.Google Scholar
- Ramirez SR, Gravendeel B, Singer RB, Marshall CR, Pierce NE: Dating the origin of the Orchidaceae from a fossil orchid with its pollinator. Nature. 2007, 448: 1042-1045. 10.1038/nature06039.PubMedGoogle Scholar
- Aceto S, Montieri S, Sica M, Gaudio L: Molecular evolution of the OrcPI locus in natural populations of Mediterranean orchids. Gene. 2007, 392: 299-305. 10.1016/j.gene.2007.01.005.PubMedGoogle Scholar
- Gillespie JH: The causes of Molecular Evolution. 1991, Oxford: Oxford University PressGoogle Scholar
- Anisimova M, Bielawski JP, Yang Z: Accuracy and power of bayes prediction of amino acid sites under positive selection. Mol Biol Evol. 2002, 19 (6): 950-958.PubMedGoogle Scholar
- Sajo MG, Longhi-Wagner HM, Rudall PJ: Reproductive morphology of the early-divergent grass Streptochaeta and its bearing on the homologies of the grass spikelet. Plant Syst Evol. 2008, 275: 245-255. 10.1007/s00606-008-0080-5.Google Scholar
- Rudall PJ, Bateman RM: Evolution of zygomorphy in monocot flowers: iterative patterns and developmental constraints. New Phytol. 2004, 162: 25-44. 10.1111/j.1469-8137.2004.01032.x.Google Scholar
- Rudall PJ, Stuppy W, Cunniff J, Kellogg EA, Briggs BG: Evolution of reproductive structures in grasses (Poaceae) inferred by sister-group comparisons with their putative closest living relatives, Ecdeiocoleaceae. Am J Bot. 2005, 92: 1432-1443. 10.3732/ajb.92.9.1432.PubMedGoogle Scholar
- Sawyer SL, Wu LI, Emerman M, Malik H: Positive selection of primate TRIM5α identifies a critical species-specific retroviral restriction domain. Proc Natl Acad Sci USA. 2005, 102: 2832-2837. 10.1073/pnas.0409853102.PubMed CentralPubMedGoogle Scholar
- Bishop JG: Directed mutagenesis confirms the functional importance of positively selected sites in polygalacturonase inhibitor protein. Mol Biol Evol. 2005, 22: 1531-1534. 10.1093/molbev/msi146.PubMedGoogle Scholar
- Levasseur A, Gouret P, Lesage-Meessen L, Asther M, Asther M, Record E, Pontarotti P: Tracking the connection between evolutionary and functional shifts using the fungal lipase/feruloyl esterase A family. BMC Evol Biol. 2006, 6:Google Scholar
- Barkman TJ, Martins TR, Sutton E, Stout JT: Positive selection for single amino acid change promotes substrate discrimination of a plant volatile-producing enzyme. Mol Bio Evol. 2007, 24: 1320-1329. 10.1093/molbev/msm053.Google Scholar
- Winter K-U, Weiser C, Kaufmann K, Bohne A, Kirchner C, Kanno A, Saedler H, Theissen G: Evolution of class B floral homeotic proteins: obligate heterodimerization originated from homodimerization. Mol Biol Evol. 2002, 19: 587-596.PubMedGoogle Scholar
- Tonaco IA, Borst JW, de Vries SC, Angenent GC, Immink RG: In vivo imaging of MADS-box transcription factor interactions. J Exp Bot. 2006, 57: 33-42. 10.1093/jxb/erj011.PubMedGoogle Scholar
- Immink RG, Ferrario S, Busscher-Lange J, Kooiker M, Busscher M, Angenent GC: Analysis of the petunia MADS-box transcription factors. Mol Genet Genomics. 2003, 268: 598-606.PubMedGoogle Scholar
- Vandenbussche M, Theissen G, Peer Van de Y, Gerats T: Structural diversification and neo-functionalization during floral MADS-box gene evolution by C-terminal frameshift mutations. Nucl Acids Res. 2003, 31: 4401-4409. 10.1093/nar/gkg642.PubMed CentralPubMedGoogle Scholar
- Kramer EM, Su HJ, Wu CC, Hu JM: A simplified explanation for the frameshift mutation that created a novel C-terminal motif in the APETALA3 gene lineage. BMC Evol Biol. 2006, 6: 30-10.1186/1471-2148-6-30.PubMed CentralPubMedGoogle Scholar
- Lamb RS, Irish VF: Functional divergence within the APETALA3/PISTILLATA floral homeotic gene lineages. Proc Natl Acad Sci USA. 2003, 100: 6558-6563. 10.1073/pnas.0631708100.PubMed CentralPubMedGoogle Scholar
- Piwarzyk E, Yang Y, Jack T: Conserved C-terminal motifs of the Arabidopsis proteins APETALA3 and PISTILLATA are dispensable for floral organ identity function. Plant Physiol. 2007, 1495-1505. 10.1104/pp.107.105346. 145
- Su K, Zhao S, Shan H, Kong H, Lu W, Theissen G, Chen Z, Meng Z: The MIK region rather than the C-terminal domain of AP3-like class B floral homeotic proteins determines functional specificity in the development and evolution of petals. New Phytol. 2008, 178: 544-558. 10.1111/j.1469-8137.2008.02382.x.PubMedGoogle Scholar
- Kocyan A, Endress PK: Floral structure and development and systematic aspects of some 'lower' Asparagales. Plant Syst Evol. 2001, 187-216. 10.1007/s006060170011. 229
- Rudall PJ, Chase MW, Cutler DF, Rusby J, De Brujin AY: Anatomical and molecular systematics of Asteliaceae and Hypoxidaceae. Bot J Linn Soc. 1998, 127: 1-42. 10.1111/j.1095-8339.1998.tb02086.x.Google Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.PubMedGoogle Scholar
- Notredame C, Higgins DG, Heringa J: T-Coffee: A novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000, 302: 205-217. 10.1006/jmbi.2000.4042.PubMedGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucl Acids Res. 2004, 1792-1797. 10.1093/nar/gkh340. 32
- Talavera G, Castresana J: Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Systematic Biology. 2007, 56: 564-577. 10.1080/10635150701472164.PubMedGoogle Scholar
- Nicholas KB, Nicholas HB, Deerfield DW: GeneDoc: Analysis and Visualization of Genetic Variation. EMBnet Newsletter. 1997, 4: 14-Google Scholar
- Posada D, Crandall KA: Modeltest: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.PubMedGoogle Scholar
- Posada D, Buckley TR: Model selection and model averaging in phylogenetics: advantages of the AIC and Bayesian approaches over likelihood ratio tests. Syst Biology. 2004, 53: 793-808. 10.1080/10635150490522304.Google Scholar
- Altekar G, Dwarkadas S, Huelsenbeck JP, Ronquist F: Parallel Metropolis-coupled Markov chain Monte Carlo for Bayesian phylogenetic inference. Bioinformatics. 2004, 20: 407-415. 10.1093/bioinformatics/btg427.PubMedGoogle Scholar
- Xia X, Xie Z: DAMBE: Data analysis in molecular biology and evolution. J Heredity. 2001, 92: 371-373. 10.1093/jhered/92.4.371.Google Scholar
- Kosakovsky Pond SL, Frost SDW, Muse SV: HyPhy: Hypothesis testing using phylogenies. Bioinformatics. 2005, 21: 676-679. 10.1093/bioinformatics/bti079.Google Scholar
- Muse SV, Gaut BS: A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. Mol Biol Evol. 1994, 11: 715-724.PubMedGoogle Scholar
- Benjamini Y, Hochberg Y: Controlling the false discovery rate: A practical and powerful approach to multiple testing. J Royal Stat Soc Series B. 1995, 57: 289-300.Google Scholar
- Yang Z, Nielsen R, Goldman N, Pedersen AMK: Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics. 2000, 155: 431-449.PubMed CentralPubMedGoogle Scholar
- Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. CABIOS. 1997, 13: 555-556.PubMedGoogle Scholar
- Yang Z: PAML:Phylogenetic Analysis by Maximum Likelihood. 2005, 3.15Google Scholar
- Yang Z: Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol Biol Evol. 1998, 15: 568-573.PubMedGoogle Scholar
- Zhang J, Nielsen R, Yang Z: Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol. 2005, 22: 2472-2479. 10.1093/molbev/msi237.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.