The evolutionary genetics of highly divergent alleles of the mimicry locus in Papilio dardanus
© Thompson et al.; licensee BioMed Central Ltd. 2014
Received: 25 February 2014
Accepted: 19 June 2014
Published: 31 August 2014
The phylogenetic history of genes underlying phenotypic diversity can offer insight into the evolutionary origin of adaptive traits. This is especially true where single genes have large phenotypic effects, for example in determining polymorphic mimicry in butterflies. Here, we characterise the evolutionary history of two candidate genes for the mimicry switch in the polymorphic Batesian mimic Papilio dardanus coding for the transcription factors engrailed and invected.
We show that phased haplotypes associated with the dominant morphs f. poultoni and f. planemoides are phylogenetically highly divergent, in particular at non-synonymous sites. Some non-synonymous changes are shared between the divergent alleles suggesting either convergence or a shared ancestry. Gene trees for invected do not show this pattern. Despite their great divergence, all engrailed alleles of P. dardanus were monophyletic with respect to alleles of closely related species. Phylogenetic analyses therefore reveal no evidence for introgression from other species. A McDonald-Kreitman test conducted on a population sample from South Africa confirms a significant excess of intraspecific non-synonymous diversity in P. dardanus engrailed, suggesting long-term balanced polymorphism at this locus.
The divergence between engrailed haplotypes suggests an evolutionary history distorted by selection with multiple changes reflecting recurrent selective sweeps. The high level of intraspecific polymorphism observed is characteristic of balancing selection on this locus, as expected if the gene engrailed is under phenotypic selection for the maintenance of multiple mimetic morphs. Non-synonymous changes in key functional portions of a major transcription factor are likely to be deleterious but if maintained in a dominant allele at low frequency, heterozygosity would reduce the associated genetic load.
KeywordsMimicry Balanced polymorphism Supergene Engrailed Phylogenetics Molecular evolution
A key aim of evolutionary biology is to understand the processes that give rise to novel traits. What is the nature of the genetic changes underlying adaptation? How are new alleles introduced into a population, and how are they maintained in the face of varying types of selection? One way of solving these questions is to study polymorphic species – the genetic control of a polymorphism within species can be used to test hypotheses regarding modification of developmental processes at broader taxic scales . To find the genes responsible, we can study nucleotide variation and search for characteristic signatures that selection will leave in the diversity of alleles in a species . Such ‘signatures of selection’ can corroborate the role of the genes in determining the polymorphism. The identity of these genes will contribute to our understanding of the processes generating and maintaining evolutionary diversity.
Another key question underlying studies of phenotypic evolution is how complex phenotypes come under precise genetic control. Locally polymorphic species pose a particular challenge: multiple phenotypic optima are occupied by individuals from a single population requiring precise determination of the various discrete phenotypes despite mating between individuals of different morphs. Such situations can lead to the evolution of supergenes  where a single genetic locus comes to determine the polymorphism. Whatever the exact genomic architecture of the supergene, it seems likely that switching between alternate phenotypes is accomplished by differential regulation of effector genes. The ‘cis-regulatory’ hypothesis for morphological adaptation states that the regulatory regions of genes can evolve at a higher rate (and with fewer constraints) than the protein-coding regions of genes [4, 5]. This hypothesis predicts that morphological evolution will tend to arise through regulatory changes, often in cis-regulatory control of conserved genes involved in developmental processes .
Lepidopteran wing patterns are some of the best known examples of adaptive colouration and include some textbook examples of natural selection in action, including industrial melanism in Biston betularia and Müllerian mimicry in Heliconius butterflies [7–9]. These systems have revealed that wing pattern diversification is controlled by a small number of genes with alleles of large effect [10–12]. When the genomic regions containing these genes are analysed phylogenetically, they often display an evolutionary history that is discordant with that of the rest of the genome, producing topologies that group similar phenotypes together irrespective of species boundaries and geographic structure [13, 14]. In some cases these patterns have resulted from adaptive introgression, or collateral evolution by allele sharing [10, 15]. Reconstructing phylogenies of genomic regions that control phenotypic diversity can therefore be a powerful method for verifying their involvement in generating and maintaining phenotypic diversity, and in increasing our understanding of the processes giving rise to novel and adaptive phenotypes (e.g. ). Recent advances in unpicking the supergene underlying Batesian mimicry in Papilio polytes offers insight into the nature and function of supergenes. In this case, a single coding region, doublesex, was found to determine female polymorphism, possibly through differential expression of isoforms of doublesex. High levels of synonymous and non-synonymous polymorphism were found in the doublesex coding region and alternative alleles were found to be highly divergent.
The H locus has been mapped to a 13.9 cM linkage group and shown to co-segregate with the gene coding for the transcription factor invected and possibly its paralogue engrailed that maps to the same genomic region . More recent work has used a comparative genomic approach to physically map this candidate region and carry out further recombination and SNP-association analyses to test for differences among morphs in this region . Around 24 genes co-segregate with H and are broadly syntenic with previously sequenced lepidopteran genomes. Despite the lack of resolution in positional cloning of H using pedigree information, this study furthermore demonstrated that the forms f. cenea, f. poultoni and f. planemoides showed significant association with SNP variants in engrailed (see Figure 1). None of the other 23 genes showed these SNP associations with all three phenotypes, leading to the conclusion that engrailed alone is the prime candidate for the mimicry switch locus, H. Additionally, linkage disequilibrium in the region was found to be relatively low, but was high for SNPs within engrailed, as would be expected if this locus is a supergene harbouring multiple allelic sites which determine alternative phenotypes and which need to be maintained without recombination to avoid the formation of maladaptive intermediates. Finally, the comparative genomics analysis also revealed evidence for balancing selection based on the non-neutral distribution of SNP variation in the engrailed coding region .
These results indicate that engrailed and possibly invected are strong candidates for the P. dardanus mimicry switch. These genes are highly conserved developmental regulatory genes present in all hexapods . If these genes do indeed determine the wing pattern, nucleotide variation will be affected by selection on the phenotype and therefore may have an evolutionary history discordant with that of unlinked markers, which were shown to follow a predominantly geographic structure . Specifically, phylogenetic analysis of mitochondrial markers in P. dardanus has revealed deeply diverging lineages comprising Eastern and Western mainland African clades and an Indian Ocean clade, but no comparable separation has been detected in nuclear markers including the invected gene and two linked AFLP loci . We additionally predict that similar phenotypes will share alleles at H regardless of their geographic origin; individuals of a given morph are therefore expected to be phylogenetically closely related at loci determining the phenotype (i.e. the H locus).
Here we use gene phylogenies for coding regions of engrailed and invected to study the phylogenetic history of these regions within P. dardanus and across the genus Papilio. Previous work has also demonstrated that P. dardanus forms hybrids in the wild with other Papilio species  and gene flow from other species may be a significant source of evolutionary novelty . We therefore test the monophyly of P. dardanus alleles and the possibility that divergent alleles have undergone introgression, while also establishing a baseline of divergence in these loci beyond the P. dardanus clade. A phylogenetic approach will also be useful for the analysis of engrailed and invected alleles within the P. dardanus lineage to characterise the kind of nucleotide divergence distinguishing the various morphs. Given the known LD in the engrailed coding region the phase of SNP variation can be computationally inferred to obtain distinct alleles whose variation may be tree like and amenable to phylogenetic inference. Variation at these loci within P. dardanus is then further investigated, relative to population divergence in mitochondrial markers. Finally, tests for selection are applied to reveal associations of haplotypes in engrailed and invected with wing pattern and to give insight into the molecular evolution of this region.
Materials and methods
DNA was extracted from legs of fresh or frozen specimens using a Qiagen DNeasy kit. For dried museum collections material (either pinned or stored in envelopes) the protocols of Thomsen et al.  were followed. Briefly, this involves removal of a leg of the butterfly, incubation overnight in a buffer with protease and a purification step using a Qiagen QIAquick PCR cleanup kit. Primer sequences [25, 28, 30, 31] are provided in Additional file 2: Table S2. PCR products were bi-directionally Sanger sequenced using ABI technology. Sequences were checked and edited using Geneious (Version 6.1  and aligned using the MAFFT  plugin for Geneious with default settings. Numbers of variable sites (including parsimony-informative, synonymous and non-synonymous sites) were calculated for each alignment in SITES .
To improve the accuracy of haplotype inference, allelic phase data was directly obtained for 2 individuals per morph through cloning of engrailed-derived PCR amplicons with Invitrogen TOPO TA vector and Invitrogen chemically competent E. coli grown on kanamycin/LB agar. PCR was performed on picked colonies directly prior to Sanger sequencing (sequences submitted to Genbank, accessions KJ507618-KJ507653). In addition, haplotypic phase data was obtained by sequencing H-linked PCR amplicons for 8 specimens (Additional file 3: Table S3) on a Illumina MiSeq (200 bp PE with v2 500 cycle kit, Nextera library prep at an NHM in-house sequencing facility). Resulting reads were trimmed in Geneious version 6.1  using default settings and mapped to a BAC reference (Genbank accession FM995623.2) using Geneious read-mapper (, settings: 15% gaps per read, gap size 50, no words >20x, word length 14, index length 12. 30% mismatches per read, maximum ambiguity 4-fold, iterated 5 times). Paired-end information was used to improve the quality of the mapping, using only matches mapping nearby and ignoring multiple matches. The mapping resulted in coverage for each targeted region in excess of 100x, files of reads mapping to the BAC reference have been submitted to SRA (study PRJEB5625, ERP005044). The resulting pileups were used to call haplotypes using GATK read-backed phasing (GATK version 2.3-9, [36, 37]). Only those haplotypes obtained through cloning or Illumina amplicon sequencing were used to represent P. dardanus in the genus-level phylogenies (note that this sample includes all studied morphs). Individuals of P. dardanus f. lamborni were excluded from the present study due to the existence of a duplication of the engrailed/invected genome region in this morph , potentially confounding phylogenetic or haplotype-level analyses.
Nucleotide substitution models were obtained from jModeltest (version 2.1, [38, 39], testing 11 substitution schemes, each with 4 categories of rate variation among sites) and using the corrected AIC criterion (AICc) to find the best fitting model. Tree searches were performed under the selected model using PhyML (version 20120412, ) with support values assessed using 200 replicates and other settings set to default. Tree statistics (tree length, consistency index, retention index and homoplasy index) were calculated in PAUP* 4.0 .
Haplotype analysiswithin P. dardanus
Sequenced haplotypes from cloned PCR products and Illumina amplicon sequencing were added to P. dardanus genotypic data to assist in inferring phase in the remaining specimens using the program PHASE (version 2.1.1, [41, 42]), with settings 400 iterations, thinning interval = 4 and a burn-in of 150. Haplotypes that were not inferred with certainty (P = 1.000) were discarded. A list of samples for which engrailed and invected haplotypes were successfully inferred is presented in Additional file 4: Table S4. Unrooted phylogenetic trees of P. dardanus engrailed and invected inferred haplotypes were produced using PhyML as described above. The branch-lengths of these trees were rescaled to reflect the numbers of synonymous and non-synonymous substitutions in HyPhy (version 2.1.2, ). Monophyly of the morph-associated engrailed alleles was assessed using the Shimodaira-Hasegawa test for tree selection based on constrained trees (monophyly constraint) that keep target alleles as monophyletic against the unconstrained tree using PAUP* 4.0b10 ; heuristic search performed under likelihood criterion with 10 random stepwise addition replicates and tree bisection-reconnection branch swapping, using nucleotide substitution model given by jModeltest).
McDonald-Kreitman tests were performed in DnaSP (version 5.10.1, ) using 405 bp amplicon engrailed haplotypes inferred with PHASE from a population sample of 35 wild-caught specimens of P. dardanus subspecies cenea (Additional file 5: Table S5). The McDonald-Kreitman test requires a sufficiently divergent outgroup, such that there is little or no shared polymorphism, so we used a specimen of P. rex, rather than the more closely related P. phorcas or P. constantinus.
Phylogeny of engrailed and invected exons across the genusPapilio
Outside of the P. dardanus clade, the overall topology is largely consistent with previous findings [45, 46] with the included subgenera, principally Princeps, Druryia and Menelaides recovered as monophyletic. Within the subgenus Menelaides, the species P. memnon and P. rumanzovia are not reciprocally monophyletic with one specimen of each species in an unresolved position at the base of this clade. Consistent with previous analyses  our tree places P. rex within subgenus Princeps; P. rex is therefore corroborated as sister to the clade (P. dardanus, P. constantinus, P. phorcas).
The dataset for invected consisted of 81 terminals (26 species) within Papilio and an alignment of 332 bps. The resulting tree also reveals a monophyletic P. dardanus group, with P. rex as sister, and is composed of highly divergent alleles that separate the monophyletic P. dardanus from its closest relatives P. phorcas and P. constantinus. The divergences within P. dardanus are more uniform than in the engrailed locus and there is no association of haplotypes with particular phenotypes.
Intraspecific variation in P. dardanus engrailed haplotypes
The haplotypes can also be labelled according to the population from which they were sampled (Figure 5B). All of the haplotypes from Western Africa (subspecies P. d. dardanus) can be found within a single clade, however this lineage also contains a few Eastern African haplotypes from P. d. polytrophus. Subspecies P. d. meseres is treated separately as this has been suggested to be a ‘transitional’ or hybrid race where the Eastern and Western lineages meet around Lake Victoria [47–49]. Consistent with this hypothesis, individuals from this region are found to possess haplotypes of both Eastern and Western groups (Figure 5B). Finally, the f. planemoides-associated allele is found in individuals from both Eastern and Western African populations, possibly demonstrating a history of this allele independent of the population-level biogeography . In summary, there is some geographic structure within our sampling of engrailed alleles, but allelic divergence within P. dardanus is deeper than any geographic structure.
P. dardanus invected haplotypes
Nature of changes
Segregating and fixed synonymous and non-synonymous differences in the comparison P . dardanus cenea - P. rex
P. dardanus cenea – P. rex
We show here that phased haplotypes of engrailed associated with wing pattern forms are highly divergent phylogenetically. It has recently been shown that sets of SNPs across engrailed show strong association with the wing patterns of f. poultoni and f. planemoides and that this region shows elevated linkage disequilibrium (Figure 1; ). The presence of multiple linked SNPs is a prerequisite for the recognition of phylogenetically distinct lineages, and we show here that these morph-associated SNPs define a small number of morph-associated haplotypes. Each individual of these two morphs possesses at least one of these haplotypes, which occupy exceptionally long branches in the haplotype tree, even by comparison to the corresponding sequences across distantly related species of Papilio (Figure 3). The divergence of the morph-associated haplotypes is contributed disproportionately by non-synonymous changes. The allele diversity does not follow the strong geographic structure suggested by the mtDNA, which splits the populations in eastern and western branches ( and Additional file 6: Figure S1) in roughly the same way as genitalic characters , although other nuclear markers also show only weak subdivision ( and unpublished). Taken together, these findings strongly suggest that the genomic region around the first exon of engrailed diverges from patterns of neutral variation at phylogenetic and genetic levels, which further supports a role of this region for controlling wing pattern variation in P. dardanus.
The phylogenetic analysis revealed several details about the evolutionary history of engrailed and the adjacent invected loci. First, the higher-level analyses of the genus Papilio (Figures 3 and 4 respectively) did not yield many surprises; expected subclades were largely recovered for both loci, albeit with generally poor support due to the short (<800 bp) fragment length and high homoplasy (Additional file 7: Table S6). Hence it can be concluded that the engrailed and invected loci are useful to track the lineage history, and are not generally distorted by selection outside of P. dardanus. Another key result is that P. dardanus is recovered as monophyletic in the engrailed phylogeny, despite the inclusion of the divergent haplotypes, arguing against an origin for divergent alleles in other species. The intraspecific trees of P. dardanus alleles were characterised by a largely unresolved polytomy at the base in both loci, with very little internal structure related to geography, confirming the conclusions of  based on a more limited sample. Additionally, there is no obvious phylogenetic structure beyond the divergent alleles associated with f. poultoni and f. planemoides in the sequenced region of engrailed (Figure 5). Hence, aside from these divergent alleles, the phylogenies corroborate the findings from the SNP associations , that did not relate the recessive morphs to particular SNPs.
The existence of multiple fixed differences in the two divergent alleles is likely a result of recurrent fixation of novel mutations. The two groups of morph-associated alleles may share some changes due to a common ancestry or alternatively, if these non-synonymous changes have a direct functional role, this might represent convergence between the two alleles. Indeed, all of the changes that unite the f. poultoni and f. planemoides alleles are non-synonymous, consistent with the second hypothesis. However, the observation of an elevated level of non-synonymous intraspecific polymorphism in P. dardanus engrailed is not limited to the obvious cases of the morph-associated alleles. The results of the McDonald-Kreitman test provide strong evidence for non-neutral evolution in this gene. This has been observed with the SNP variation in the highly polymorphic Kenyan population used for the initial association studies , and is confirmed here for a second population from a different region (South Africa) that does not even include the most divergent alleles. The engrailed locus in P. dardanus therefore shows a pattern of evolution that differs from the remainder of the genus Papilio and also from the adjacent invected locus, in that non-synonymous changes have accumulated at a much increased evolutionary rate.
It is highly surprising to find such rapid coding sequence evolution in a gene that is so highly conserved across the arthropods. In particular, one of the changes unique to f. planemoides occurs in the conserved EH1 domain of the Engrailed protein, changing the core motif from FSISNIL in all other P. dardanus to YSISNIL which confers a change in the otherwise conserved core of FxIxxIL . Given this conserved sequence it seems likely that this change may affect binding of Engrailed to the transcriptional repressor Groucho [52, 53]. Whether or not the coding changes in the two divergent alleles affect the function of the Engrailed protein remains an open question. Similarly, we cannot be certain whether these variant sites are themselves the focus of selection, or whether their fixation is the result of hitchhiking due to linkage with other changes. In the latter case, we hypothesise that favourable selection due to a novel mimetic resemblance outweighs the negative effects of a linked genetic load; selection coefficients for mimicry are likely to be very high .
Theory predicts that alleles with a high linked genetic load, as might be conferred by the amino acid changes in the EH1 domain, are likely to be under selection to increase in dominance: linked genetic load is ‘sheltered’ in rare dominant alleles as individuals will nearly always be heterozygous for these alleles . In the Batesian mimicry system of P. dardanus, the equilibrium frequency is the abundance of the mimic relative to their model at which fitness is maximised, without predators associating a pattern with palatability rather than toxicity . The equilibrium allele frequency will therefore be on average lower for a dominant allele as compared to alleles further down the dominance hierarchy. Hence, we speculate that whether the coding sequence changes at engrailed are in any way functional or not, their likely negative pleiotropic effects may be shielded from selection by the recessive-morph alleles at engrailed. This shielding of deleterious mutations through heterozygosity may therefore explain why it is especially the dominant morphs which show such a large excess of non-synonymous mutations.
Finally, we may ask what these results contribute to our understanding of the evolution of the supergene that was hypothesized to underlie the phenotypic variation in P. dardanus. The evolution of linkage disequilibrium to maintain co-adapted alleles is a core facet of classical supergene theory [57, 58] and other supergene systems have been shown to possess complex genomic architectures that may act to reduce recombination between supergene alleles [12, 59–61]. Here we show that the genomic signature of selection evident from the phylogenetic trees and increased rates of non-synonymous changes does not extend beyond engrailed; we find little evidence for morph-associated variants at linked genes such as invected. A similar pattern was seen in the SNP analysis , along with the low level of linkage disequilibrium observed elsewhere in the H region. The high level of diversity uncovered in a population sample of P. dardanus is indicative of negative frequency-dependent selection, as would be expected for a locus underlying polymorphic Batesian mimicry, as selection against over-abundant phenotypes results in a balanced polymorphism, which can be detected by a signature of increased diversity. The existence of multiple fixed changes in these alleles could be the result of recurrent selective sweeps fixing the variants, with either the coding region sites under selection themselves or hitchhiked to fixation as the result of selection on linked variants. The changes might also reflect the build-up of genetic load due to a reduction of recombination as predicted by classic supergene theories of polymorphic mimicry. The previous observation of low linkage disequilibrium beyond the engrailed locus  means that this predicted effect only affects a single gene which, however, spans a large genomic region of ~70 kb .
The findings of high levels of non-synonymous diversity, potentially affecting protein structure, are similar to patterns observed in the doublesex coding region in P. polytes. Both the P. dardanus engrailed locus and P. polytes doublesex have alternative alleles associated with different female mimetic morphs and these alternative alleles differentiated by multiple changes with a high ratio of non-synonymous to synonymous SNPs in the coding regions. These two mimetic butterflies have evolved similar systems of Batesian mimetic polymorphism through a complex sequence of changes in single developmentally important genes, although the genes involved are very different.
The gene engrailed is a well-supported candidate for the mimicry switch locus in P. dardanus. Following the comparative genomics work of , we here investigated the evolutionary genetics of the engrailed and adjacent invected genes using phased alleles within the P. dardanus lineage and also provided the wider geographic and phylogenetic context by including additional populations of P. dardanus and other members of the genus Papilio. We also expanded the tests for non-neutral variation to a geographically distinct population, along with the use of a less divergent outgroup. These analyses revealed that dominant morphs are associated with highly divergent haplotypes of this gene in P. dardanus, with a large proportion of the divergence occurring at non-synonymous sites, but not in other species. In addition, there is no evidence for introgression from other species to explain the high level of divergence from other P. dardanus haplotypes. Furthermore, the high levels of non-synonymous polymorphism observed in P. dardanus engrailed are consistent with long-term balancing selection, mirroring similar findings in P. polytes. The study provides new insights into the fascinating evolution of mimetic polymorphisms in the genus Papilio and shows that, while the polymorphism is generated by different genomic regions, the evolutionary processes that build up the phenotypic diversity at the genome level are similar between species of Papilio.
Availability of supporting data
DNA sequences are in Genbank under accession numbers given in a supplemental table.
We are grateful to Steve Collins of the Afrotropical Butterfly Research Institute, Nairobi for the provision of samples and assistance with fieldwork, Butterfly Conservation Ghana, the Lepidopterists Society of Africa, Bennie and Andre Coetzer, Dr Oskar Brattstrom and Erik van Bergen for assistance with fieldwork and Andrew Knapp and Alessandra Pollara for assistance with preparation of samples for sequencing. Funded by NE/F006225/1 of the Natural Environment Research Council of the UK. MJTN was funded through a NERC Postdoctoral Fellowship (NE/I021578/1).
- Joron M, Jiggins CD, Papanicolaou A, McMillan WO: Heliconius wing patterns: an evo-devo model for understanding phenotypic diversity. Heredity. 2006, 97: 157-167.PubMedView Article
- Nielsen R: Molecular signatures of natural selection. Annu Rev Genet. 2005, 39: 197-218.PubMedView Article
- Thompson MJ, Jiggins CD: Supergenes and their role in evolution. Heredity. 2014, 113 (1): 1-8.PubMedView Article
- Carroll SB: Evolution at two levels: on genes and form. PLoS Biol. 2005, 3: e245-PubMedPubMed CentralView Article
- Rebeiz M, Pool JE, Kassner VA, Aquadro CF, Carroll SB: Stepwise modification of a modular enhancer underlies adaptation in a Drosophila population. Science. 2009, 326: 1663-7.PubMedPubMed CentralView Article
- Prud’homme B, Gompel N, Carroll SB: Emerging principles of regulatory evolution. Proc Natl Acad Sci U S A. 2007, 104 (Suppl 1): 8605-12.PubMedPubMed CentralView Article
- van’t Hof AE, Edmonds N, Dalíková M, Marec F, Saccheri IJ: Industrial melanism in British peppered moths has a singular and recent mutational origin. Science. 2011, 332: 958-960.PubMedView Article
- Counterman B, Araujo-Perez F, Hines HM, Baxter SW, Morrison CM, Lindstrom DP, Papa R, Ferguson LC, Joron M, Ffrench-Constant RH, Smith CP, Nielsen DM, Chen R, Jiggins CD, Reed RD, Halder G, Mallet J, McMillan WO: Genomic hotspots for adaptation: the population genetics of Müllerian mimicry in Heliconius erato. PLoS Genet. 2010, 6: e1000796-PubMedPubMed CentralView Article
- Baxter SW, Nadeau NJ, Maroja LS, Wilkinson P, Counterman B, Dawson A, Beltrán M, Perez-Espona S, Chamberlain NL, Ferguson LC, Clark R, Davidson C, Glithero R, Mallet J, McMillan WO, Kronforst MR, Joron M, Ffrench-Constant RH, Jiggins CD: Genomic hotspots for adaptation: the population genetics of Müllerian mimicry in the Heliconius melpomene clade. PLoS Genet. 2010, 6: e1000794-PubMedPubMed CentralView Article
- The Heliconius Genome Consortium:: Butterfly genome reveals promiscuous exchange of mimicry adaptations among species. Nature. 2012, 487: 94-8.
- Naisbit RE, Jiggins CD, Mallet J: Mimicry: developmental genes that contribute to speciation. Evol Dev. 2007, 5: 269-80.View Article
- Kunte K, Zhang W, Tenger-Trolander A, Palmer DH, Martin A, Reed RD, Mullen SP, Kronforst MR: Doublesex is a mimicry supergene. Nature. 2014, 507: 229-32.PubMedView Article
- Hines HM, Counterman BA, Papa R, Albuquerque de Moura P, Cardoso MZ, Linares M, Mallet J, Reed RD, Jiggins CD, Kronforst MR, McMillan WO: Wing patterning gene redefines the mimetic history of Heliconius butterflies. Proc Natl Acad Sci U S A. 2011, 108: 19666-71.PubMedPubMed CentralView Article
- Pardo-Diaz C, Salazar C, Baxter SW, Merot C, Figueiredo-Ready W, Joron M, McMillan WO, Jiggins CD: Adaptive introgression across species boundaries in Heliconius butterflies. PLoS Genet. 2012, 8: e1002752-PubMedPubMed CentralView Article
- Stern DL: The genetic causes of convergent evolution. Nat Rev Genet. 2013, 14: 751-764.PubMedView Article
- Colosimo PF, Hosemann KE, Balabhadra S, Villarreal G, Dickson M, Grimwood J, Schmutz J, Myers RM, Schluter D, Kingsley DM: Widespread parallel evolution in sticklebacks by repeated fixation of Ectodysplasin alleles. Science. 2005, 307: 1928-33.PubMedView Article
- Nijhout HF: Polymorphic mimicry in Papilio dardanus: mosaic dominance, big effects, and origins. Evol Dev. 2003, 5: 579-92.PubMedView Article
- Thompson MJ, Timmermans MJTN: Characterising the phenotypic diversity of Papilio dardanus wing patterns using an extensive museum collection. Plos one. 2014, 9 (5): e96815-doi:10.1371/journal.pone.0096815PubMedPubMed CentralView Article
- Clarke CA, Sheppard PM: The genetics of Papilio dardanus Brown. I. Race cenea from South Africa. Genetics. 1959, 44: 1347-1358.PubMedPubMed Central
- Clarke CA, Sheppard PM: The genetics of Papilio dardanus Brown. II. Races dardanus, polytrophus, meseres, and tibullus. Genetics. 1960, 45: 439-457.PubMedPubMed Central
- Clarke CA, Sheppard PM: The genetics of Papilio dardanus Brown. III. Race antinorii from Abyssinia and race meriones from Madagascar. Genetics. 1960, 45: 683-698.PubMedPubMed Central
- Clarke CA, Sheppard PM: The genetics of Papilio dardanus Brown. IV. Data on race ochracea, race flavicornis, and further information on races polytrophus and dardanus. Genetics. 1962, 47: 909-920.PubMedPubMed Central
- Clarke CA, Sheppard PM: The genetics of some mimetic forms of Papilio dardanus Brown, and Papilio glaucus Linn. J Genet. 1959, 56: 236-259.View Article
- Clark R, Brown SM, Collins SC, Jiggins CD, Heckel DG, Vogler AP: Colour pattern specification in the mocker swallowtail Papilio dardanus: the transcription factor invected is a candidate for the mimicry locus H. Proc R Soc B Biol Sci. 2008, 275: 1181-1188.View Article
- Timmermans MJTN, Baxter SW, Clark R, Heckel DGG, Vogel H, Collins S, Papanicolaou A, Fukova I, Joron M, Thompson MJ, Jiggins CD, ffrench-Constant RH, Vogler AP: Comparative genomics of the mimicry switch in Papilio dardanus. Proc R Soc B Biol Sci. 2014, 281 (1787): 20140465-doi:10.1098/rspb.2014.0465View Article
- Peel AD, Telford MJ, Akam M: The evolution of hexapod engrailed-family genes: evidence for conservation and concerted evolution. Proc R Soc B Biol Sci. 2006, 273: 1733-42.View Article
- Clark R, Vogler AP: A phylogenetic framework for wing pattern evolution in the mimetic mocker swallowtail Papilio dardanus. Mol Ecol. 2009, 18: 3872-84.PubMedView Article
- Thompson MJ, Vane-Wright RI, Timmermans MJTN: Hybrid origins: DNA techniques confirm that Papilio nandina is a species hybrid (Papilionidae). J Lepid Soc. 2011, 65: 199-201.
- Thomsen PF, Elias S, Gilbert TM, Haile J, Munch K, Kuzmina S, Froese DG, Sher A, Holdaway RN, Willerslev E: Non-destructive sampling of ancient insect DNA. PLoS One. 2009, 4: e5048-PubMedPubMed CentralView Article
- Kronforst MR: Primers for the amplification of nuclear introns in Heliconius butterflies. Mol Ecol Notes. 2005, 5 (1): 158-162.View Article
- Barraclough TG, Hogan JE, Vogler AP: Testing whether ecological factors promote cladogenesis in a group of tiger beetles (Coleoptera: Cicindelidae). Proc R Soc B Biol Sci. 1999, 266: 1061-1067.View Article
- Drummond A, Ashton B, Buxton S, Cheung M, Cooper A, Duran C, Field M, Heled J, Kearse M, Markowitz S, Moir R, Stones-Havas S, Sturrock S, Thierer T, Wilson A: Geneious v5.3. 2010, Available from http://www.geneious.com
- Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30: 3059-66.PubMedPubMed CentralView Article
- Hey J, Wakeley J: A coalescent estimator of the population recombination rate. Genetics. 1997, 145: 833-846.PubMedPubMed Central
- Sturrock S, Meintjes P: The Geneious 6.0.3 Read Mapper.
- McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA: The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20: 1297-303.PubMedPubMed CentralView Article
- DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM, Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ: A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011, 43: 491-8.PubMedPubMed CentralView Article
- Posada D: jModelTest: phylogenetic model averaging. Mol Biol Evol. 2008, 25: 1253-6.PubMedView Article
- Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704.PubMedView Article
- Swofford DL: Phylogenetic Analysis Using Parsimony (*and Other Methods). 2003
- Stephens M, Smith NJ, Donnelly P: A new statistical method for haplotype reconstruction from population data. Am J Hum Genet. 2001, 68: 978-89.PubMedPubMed CentralView Article
- Stephens M, Scheet P: Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation. Am J Hum Genet. 2005, 76: 449-62.PubMedPubMed CentralView Article
- Kosakovsky Pond SL, Frost SDW, Muse SV: HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005, 21: 676-9.View Article
- Librado P, Rozas J: DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25: 1451-2.PubMedView Article
- Zakharov EV, Caterino MS, Sperling FAH: Molecular phylogeny, historical biogeography, and divergence time estimates for swallowtail butterflies of the genus Papilio (Lepidoptera: Papilionidae). Syst Biol. 2004, 53: 193-215.PubMedView Article
- Caterino MS, Sperling FAH: Papilio phylogeny based on mitochondrial cytochrome oxidase I and II genes. Mol Phylogenet Evol. 1999, 11: 122-37.PubMedView Article
- Poulton EB: The most interesting butterfly in the world. J East African Nat Hist Soc. 1924, 20: 4-22.
- Ford EB: The genetics of Papilio dardanus Brown (Lep.). Trans R Entomol Soc London. 1936, 85: 435-466.View Article
- Turner JRG: Geographical variation and evolution in the males of the butterfly Papilio dardanus Brown (Lepidoptera: Papilionidae). Trans R Entomol Soc London. 1963, 115: 239-259.View Article
- McDonald JH, Kreitman M: Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991, 351: 652-654.PubMedView Article
- Copley RR: The EH1 motif in metazoan transcription factors. BMC Genomics. 2005, 6: 169-PubMedPubMed CentralView Article
- Smith ST, Jaynes JB: A conserved region of engrailed, shared among all en-, gsc-, Nk1-, Nk2-and msh-class homeoproteins, mediates active transcriptional repression in vivo. Development. 1996, 122: 3141-PubMedPubMed Central
- Jiménez G, Paroush Z, Ish-Horowicz D: Groucho acts as a corepressor for a subset of negative regulators, including Hairy and Engrailed. Genes Dev. 1997, 11: 3072-3082.PubMedPubMed CentralView Article
- Kapan DD: Three-butterfly system provides a test of Müllerian mimicry. Nature. 2001, 409: 18-20.View Article
- Llaurens V, Billiard S, Castric V, Vekemans X: Evolution of dominance in sporophytic self-incompatibility systems: I. Genetic load and coevolution of levels of dominance in pollen and pistil. Evolution. 2009, 63: 2427-37.PubMedView Article
- Clarke B: Frequency-dependent selection for the dominance of rare polymorphic genes. Evolution. 1964, 18: 364-369.View Article
- Darlington CD, Mather K: Elements of Genetics. 1949, London: George Allen & Unwin Ltd
- Charlesworth D, Charlesworth B: Mimicry: the hunting of the supergene. Curr Biol. 2011, 21: R846-8.PubMedView Article
- Joron M, Frezal L, Jones RT, Chamberlain NL, Lee SF, Haag CR, Whibley AC, Becuwe M, Baxter SW, Ferguson LC, Wilkinson P, Salazar C, Davidson C, Clark R, Quail MA, Beasley H, Glithero R, Lloyd C, Sims S, Jones MC, Rogers J, Jiggins CD, Ffrench-Constant RH: Chromosomal rearrangements maintain a polymorphic supergene controlling butterfly mimicry. Nature. 2011, 477: 203-6.PubMedPubMed CentralView Article
- Huynh LY, Maney DL, Thomas JW: Chromosome-wide linkage disequilibrium caused by an inversion polymorphism in the white-throated sparrow (Zonotrichia albicollis). Heredity. 2010, 106: 537-546.PubMedPubMed CentralView Article
- Wang J, Wurm Y, Nipitwattanaphon M, Riba-Grognuz O, Huang Y-C, Shoemaker D, Keller L: A Y-like social chromosome causes alternative colony organization in fire ants. Nature. 2013, 493: 664-8.PubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.