- Research article
- Open Access
Scent of a break-up: phylogeography and reproductive trait divergences in the red-tailed bumblebee (Bombus lapidarius)
BMC Evolutionary Biologyvolume 13, Article number: 263 (2013)
The Pleistocene climatic oscillations are considered as a major driving force of intraspecific divergence and speciation. During Ice Ages, populations isolated in allopatric glacial refugia can experience differentiation in reproductive traits through divergence in selection regimes. This phenomenon may lead to reproductive isolation and dramatically accentuates the consequences of the climatic oscillations on species. Alternatively, when reproductive isolation is incomplete and populations are expanding again, further mating between the formerly isolated populations can result in the formation of a hybrid zone, genetic introgression or reinforcement speciation through reproductive trait displacements. Therefore changes in reproductive traits driven by population movements during climatic oscillations can act as an important force in promoting pre-zygotic isolation. Notwithstanding, divergence of reproductive traits has not been approached in the context of climatic oscillations. Here we investigate the impact of population movements driven by climatic oscillations on a reproductive trait of a bumblebee species (Bombus lapidarius). We characterise the pattern of variation and differentiation across the species distribution (i) with five genes (nuclear and mitochondrial), and (ii) in the chemical composition of male marking secretions (MMS), a key trait for mate attraction in bumblebees.
Our results provide evidence that populations have experienced a genetic allopatric differentiation, in at least three main refugia (the Balkans, Centre-Eastern Europe, and Southern Italy) during Quaternary glaciations. The comparative chemical analyses show that populations from the Southern Italian refugium have experienced MMS differentiation and an incipient speciation process from another refugium. The meeting of Southern Italian populations with other populations as a result of range expansion at a secondary contact zone seems to have led to a reinforcement process on local MMS patterns.
This study suggests that population movement during Quaternary climatic oscillations can lead to divergence in reproductive traits by allopatric differentiation during Ice Ages and by reinforcement during post-glacial recolonization.
Historic events such as climatic and topographic changes have triggered current patterns of biodiversity [1, 2]. The Pleistocene glaciations, in particular, are considered as a major driving force of intraspecific divergence, diversification and speciation (e.g. [3–5]). The Pleistocene epoch experienced important climatic oscillations  that have deeply modified the geographic range of temperate species within the Palaearctic region (e.g. ). During the Ice Ages, changes in species distribution occured as the ice sheets advanced to reach climatically more favourable regions, called refugia [8, 9]. At the end of each Ice Age, some populations expanded from these refugia to recolonize previously glaciated areas [5, 10].
Geographic isolation between conspecific populations (e.g. between refugia) is recognised to play an important role in allopatric speciation (e.g. [11, 12]). When gene flow has been limited, or absent, among populations, different or identical alleles can become fixed depending upon the forces of natural selection and/or genetic drift [13, 14]. Thus genetic divergence occurs over a long period, imparting reproductive isolation and possibly leading to speciation . When reproductive isolation is incomplete and formerly isolated lineages meet as a result of range expansion, further mating between the populations produces hybrids, resulting in the formation of a hybrid zone (e.g. [15, 16]), genetic introgression (e.g. ) or reinforcement speciation through pre-zygotic isolation mechanisms (e.g. [18–20]).
Sexual selection is well recognized for its important role in promoting speciation through its variation among allopatric populations [20–23], either through the local features of a mating signal , or different signals in different environments that confer local adaptations [20, 25]. Divergence in mating preferences that parallel divergence in mating signals give rise to discrimination against foreign mates, and can ultimately result in reproductive isolation and speciation ([24–27] but see also ). This phenomenon may dramatically increase the consequences of climatic oscillations on species. Notwithstanding, the divergence in reproductive traits has not been approached in the context of climatic oscillations and migration of populations. Previous studies have mainly focused on the consequences of climatic oscillations on genetic traits (e.g. [10, 15, 16]) while the consequences on reproductive traits and comprehensive integrative analyses of reproductive traits, geographical variation, and genetic differentiation are lacking.
Here we investigated the phylogeography, population structure, and geographical variation of a reproductive trait (male marking secretion, MMS) in the red-tailed bumblebee, Bombus lapidarius. Bumblebees are an excellent system for exploring historical biogeographic patterns of Holarctic groups and the evolution of the courtship signal (e.g. [23, 29]). These bees live in the coldest areas inhabited by insects and have been able to recolonize areas depopulated by Ice Ages in the last three million years . Their early historical biogeography (from Eocene-Oligocene to Miocene) has already been explored  while their recent history (from Pliocene to Holocene) has received comparatively far less attention (e.g. ). B. lapidarius is a widespread temperate West-Palaearctic bumblebee that includes several subspecies which ex-hibit different colour patterns (Figure 1A, ): (i) B. lapidarius lapidarius in the European plains, Balkans and West Anatolia, (ii) B. lapidarius decipiens in the Iberian and South Italian peninsulas, (iii) B. lapidarius caucasicus and B. lapidarius eriophorus in N.E. Anatolia and the Caucasus, and (iv) B. lapidarius atlanticus in the Moroccan Atlas.
The courtship signal of bumblebee males includes both behavioural and chemical features (see  for a review). Here, we focus on the most commonly studied trait, the male marking secretions (MMS). Most bumblebee males patrol along paths where they scent-mark objects with their MMS that attract conspecific virgin females . The MMS consist of a complex mixture (mainly aliphatic compounds) with several major components and with intraspecific variation (e.g. [33, 34]). MMS are produced de novo by cephalic labial glands  from saturated fatty-acids by the action of species-specific esterases, desaturases and reductases in cephalic labial glands [35, 36].
In this study, we used a phylogeographic approach based on five genes (nuclear and mitochondrial) along with comparative chemical analyses of MMS to determine the consequences of historic events on the chemical composition of male courtship signals. Our purposes were (i) to infer the phylogeographic pattern of this widespread European pollinator, (ii) to detect a putative divergence of MMS across its distribution range, and (iii) to compare the phylogeographic pattern and the reproductive trait variation pattern.
Genetic divergence and phylogeography of the red-tailed bumblebee
Patterns of sequence variation
The final molecular dataset spanned 3933 aligned nucleotides: 1056 bp from COI (184 parsimony informative sites [PIS]), 465 bp from Cytb (37 PIS), 791 bp from EF-1α F2 copy containing a ~200 bp intron (9 PIS), 711 bp from LWRh (2 PIS), and 910 bp from PEPCK (11 PIS). Alignments did not require additional indels. LWRh was monomorphic across all B. lapidarius populations while other loci were polymorphic (Figure 1C). Therefore, we excluded the LWRh from the following results. In total, 16 unique haplotypes were observed for COI, 7 for Cytb, 6 for EF-1α, and 16 for PEPCK among 244 European and East Turkish individuals (Figure 1C). For geographic groups, haplotype diversity ranged from 0 to 0.8 and nucleotide diversity ranged from 0 to 0.006 (Additional file 1, Figure 2). Our results displayed endemic haplotypes or haplogroups from (Figures 1C, 2): (i) Gotland Island, Great Britain, Ireland, Southern Italy, East Turkey and Spain for COI; (ii) Great Britain, East Turkey, and Southern Italy for Cytb; (iii) Southern Italy, East Turkey, and Spain for PEPCK.
Phylogeny and haplotype relationships
All phylogenetic analyses and MJM networks performed on the same genetic dataset led to quite similar topologies (Figure 1C, see also supplementary trees [TreeBASE: ID S14299]). Phylogenetic analyses based on EF-1α did not show genetic divergence between geographic groups and failed to support strongly the relationship with outgroups. In the same way, despite the distinctive haplotypes of some geographic groups, phylogeny based on PEPCK failed to resolve relationships between haplogroups while trees based on COI and Cytb resolved the relationship between haplogroups. Phylogenetic analyses based on nuclear genes (EF-1α + PEPCK) failed to resolve the relationship between genetic groups while analyses based on mitochondrial genes (COI + Cytb) and all genes combined (COI dominated the phylogenetic signal) resolved the relationship between genetic groups. Phylogeny based on all genes combined showed a large-scale structuring in main groups that have a predominantly non-overlapping geographic distribution (Figure 1D). B. lapidarius is split into three major lineages: an East Turkey lineage (L1), a Southern Italy and SE European lineage (L2), and a large group including all other European populations (L3) (Figure 1D). The lineage L2 included two sub-lineages: a Southern Italian lineage (L2a) and a SE European lineage (L2b). Inside L3, three distinct geographic groups were distinguished: a Gotland group (UG), a British Isles group (UB), and a Spanish group (US) but they were poorly supported. However there is evidence for divergence between islands, peninsulas and other populations and for maintenance of distinct genetic structure over time. Due to the large genetic divergence of the East Turkey lineage (L1), its very restricted geographic distribution, and the lack of MMS data, we chose to exclude these samples from our analyses of population structure.
Based on the COI data, the estimated divergence between East Turkish (L1) and European populations (L2 + L3) was within a range of 0.49 - 9.56 mya, with a mean of 2.81 mya and median of 1.32 mya (ESS: 222.99) while estimated divergence between European haplogroups (L2 versus L3) was within a range of 0.03-0.96 mya, with a mean of 0.28 mya and median of 0.13 mya (ESS: 231.21). Correspondingly, average simple pairwise divergence (i) between L1 and L2 + L3 was 9.68% or ~ 4.9 mya based on the 2% divergence per million years clock commonly applied to insect mitochondrial DNA , and (ii) between L2 and L3 was 1.01% or ~ 0.51 mya.
Geographic structuring among COI, Cytb and PEPCK haplotypes within European populations was significantly supported by AMOVA results (global ΦST statistic = 0.87 (COI), 0.79 (Cytb), 0.61 (PEPCK), p-value < 0.01). These results suggested a potentially low level of gene flow between several adjacent populations. EF-1α and LWRh datasets were discarded from SAMOVA due to their poor genetic structuring (global ΦST statistic = 0.35, p-value < 0.01 for EF-1 α and p-value > 0.01 for LWRh). In SAMOVA, the ΦCT value increased asymptotically with increasing number of groups, levelling out at: seven groups for COI (ΦCT = 0.87, ΦST = 0.93, ΦSC = 0.60) with local ΦCT maximum at five groups (ΦCT = 0.85, ΦST = 0.94, ΦSC = 0.57) and three groups (ΦCT = 0.81, ΦST = 0.94, ΦSC = 0.66), five groups for Cytb (ΦCT = 0.89, ΦST = 0.90, ΦSC = 0.06), nine groups for PEPCK (ΦCT = 0.82, ΦST = 0.80, ΦSC = -0.09) but above five groups (ΦCT = 0.78, ΦST = 0.81, ΦSC = 0.13), additional groups represented single sampling sites making little biological sense. All SAMOVA clustering (based on COI, Cytb and PEPCK) split the Southern Italian populations and most of the SE European populations from other populations while only PEPCK clustering split Spanish populations from other populations (Additional file 2). Clustering based on mitochondrial genes split some insular populations (Additional file 2). These SAMOVA results were quite congruent to geographic groups observed in MJN and phylogenetic results.
Inter-individual genetic distance and genetic variability
The interpolation map based on multi-loci genetic distances showed larger local divergence in DNA sequences in sympatric areas between L2b and L3, and L2a and L3 (Central-Eastern Europe and Central Italy, respectively, Figure 3A). Nucleotide and haplotype diversities calculated for each geographic group (Additional file 1, Figure 2) showed: (i) a higher diversity for COI in South Italy and Central-Eastern Europe (h from 0.521 to 0.8, π from 0.0048 to 0.0063), (ii) higher diversities for Cytb in Central-Eastern Europe (h from 0.467 to 0.667, π from 0.001 to 0.00171), (iii) a higher diversity for EF1α in Central-Eastern Europe and Italy (h from 0.511 to 0.733, π from 0.00161 to 0.00188), (iv) higher diversities for PEPCK in Central-Eastern Europe, South Italy and Spain (h from 0.456 to 0.8, π from 0.00055 to 0.00132). Interpolation maps based on averaged genetic diversity for all markers confirmed the trend of a larger diversity in South Italy and Central-Eastern Europe (Figure 3B).
MMS variation of the red-tailed bumblebee
55 compounds were detected in the MMS of B. lapidarius (Additional file 3, Additional file 4). The five main compounds detected were aliphatic compounds: hexadec-9-en-1-ol (C16H32O), hexadecane-1-ol (C16H34O), hexadecenoic acid (Δ9 and Δ7, C16H30O2), hexadecenyl hexadecenoate (C32H60O2), and hexadecyl hexadecenoate (C32H62O2). Hexadec-7-enoic acid was only detected in South Italian populations while hexadec-9-enoic acid was detected in other populations. The North Italian populations displayed specific compounds hexadecenal and hexadecanal. The MMS compositions of B. lapidarius lapidarius were similar to the previous studies [32, 38, 39] except for two minor components (tetradecanol and ethyl hexadecenoate).
The nMDS (stress value = 0.13) and perMANOVA analyses separated the European populations into two main groups: Southern Italian populations (group G1) and all other European populations (group G2) (perMANOVA: DF = 1, F = 49.93, p-value < 0.01; Figure 4A). G1 corresponded to the lineage L2a while G2 corresponded to the lineages L2b + L3 (Figure 4B). The IndVal method reveals 18 significant indicator compounds for G1 and seven for G2.
Inter-individual MMS distance and MMS variability
The interpolation map based on the MMS distance showed larger local divergence in MMS composition in the Alps, Central Italy, Eastern Europe, and Western France (Figure 3C). The MMS variability interpolation map (Figure 3D) revealed (i) the lowest variability in Central Italy, (ii) low variability in West Pyrenees and Carpathian region, and (iii) high variability in NE Europe, the Alps and Central Europe. Comparison of the MMS variability showed significant differences between all geographic groups (AMOVA of multivariate dispersion p-value < 0.01, permutation test of multivariate dispersion p–value < 0.01) and between pairwise geographic groups confirmed these observations (Additional file 5).
Comparative statistical analyses between geographic origin, colour pattern, genetic and MMS
Mantel tests showed significant positive low correlations between (i) genetic distances versus geographic distances (Mantel’s r = 0.18, p-value < 0.01), (ii) MMS distances versus geographic distances (Mantel’s r = 0.13, p-value < 0.01), (iii) colour distances versus geographic distances (Mantel’s r = 0.32, p-value < 0.01), (iv) genetic distances versus MMS distances (Mantel’s r = 0.18, p-value < 0.01), (v) genetic distances versus colour distances (Mantel’s r = 0.41, p-value < 0.01), and (vi) MMS distances versus colour distances (Mantel’s r = 0.19, p-value < 0.01). Partial Mantel tests showed significant positive low correlations between (i) genetic distance vs. MMS distance (Mantel’s r = 0.17, p-value < 0.01), (ii) genetic distances versus colour distances (Mantel’s r = 0.38, p-value < 0.01), (iii) MMS distances versus colour distances (Mantel’s r = 0.16, p-value < 0.01).
Our correlation analysis showed no linear correlation between genetic diversity and MMS variability (p-value = 0.25).
Phylogeography of the red-tailed bumblebee
Phylogeographic structure is usually consistent with long-term isolation in multiple refugia during climatic oscillations (e.g. ). In this way, the Quaternary glacial events have been shown to explain the intraspecific genetic pattern of many European species [4, 15, 16]. During Ice Ages, the ranges of most European temperate species contracted to the southern peninsulas of Iberia, Italy and Balkans (classic theory: e.g. ). Glaciers, European mountains or repeated marine-flooding during the Pleistocene have constituted major vicariant barriers between these southern refugia leading to allopatric genetic divergence in many species (e.g. [16, 40]). Interglacial and postglacial recolonizations of Europe have arisen from these Mediterranean glacial refugia and principally from the Balkans [16, 41]. Therefore, it is to be expected that populations of temperate species surviving in Mediterranean refugia display higher genetic variability in the southern peninsulas .
Here, our results (trees, networks, Mantel tests, and SAMOVA) detect an intraspecific phylogeographic structure within B. lapidarius with three major lineages (L1, L2, and L3; Figure 1; Additional file 2). The roughly estimated molecular clock analyses date divergences between European lineages (L2 and L3) to the Quaternary while the divergence of East Turkish populations takes place earlier (Pliocene). Therefore, the divergence of European lineages can be considered as a consequence of allopatric differentiation during Ice Ages.
A previous study based on morphology has hypothesized that B. lapidarius has survived up to the last Ice Age in Iberia, Italy, Balkans, and Turkey prior to a post-glacial recolonization of Europe from the Balkans . Our results suggest an alternative scenario of the recent history of B. lapidarius (Figure 5).
The four lineages seem to arise from four regions: East Turkey, South Italy, the Balkan region, and Central-Eastern Europe (C.E. Europe). (i) East Turkey hosts largely differentiated and long-term divergent populations (Figure 1; see molecular clock results). These populations do not contribute to the European genetic pool. (ii) South Italy has acted as a refugium as shown by its large genetic diversity (Figure 3B) and its endemic haplotypes gathered in one distinct lineage L2a (Figures 1, 2 and 3B, Additional file 1). (iii) The Balkan region has acted as a refugium according to its specific haplotypes gathered in one distinct lineage L2b (Figure 1, Additional file 1). The low genetic diversity of the Balkan region could be due to repeated bottlenecking of populations in this region and/or a prolonged bottleneck during past glacial oscillations (e.g. [4, 43]). (iv) C.E. Europe has a large genetic diversity (Figure 3B) and many distinct haplotypes gathered in one specific lineage L3 (Figures 1, 2 and 3B, Additional file 1). Even if the genetic diversity is increased by admixture with an allopatric lineage L2b (Figures 1, 3A; see in other species ), C.E. Europe hosts the largest diversity of specific haplotypes (Figures 1, 2, 3B). Therefore, this region seems to have been a major glacial refuge for B. lapidarius. This evidence fits well with the theory of cryptic refugia in Central, Northern or Eastern Europe, an alternative to the classic theory, observed in other temperate species (review in ).
The status of Iberia remains confused according to the slight differentiation/low diversity in mitochondrial markers and the higher differentiation/high diversity in nuclear markers of Iberian populations (similar to South Italian, East Turkish or South East European populations; see Figure 2). The poorly resolved relationship with the lineage L3 (Figure 1) makes it impossible to know if Iberia acted as a refugium (with a strong bottleneck) or has been secondarily colonized by populations from C.E. Europe.
European refugia have probably been isolated by Alpine glaciers, the Alps, Carpathians, Dinaric Alps or repeated marine-flooding of the Crati-Sibari plain (North Calabria, ) during the Pleistocene period leading to allopatric divergences of European lineages. In contrast, the frequent drops in the level of the Adriatic Sea during the Quaternary Ice Ages  probably allowed gene flow between the Balkans and Italian Peninsula leading to a close relationships between these lineages as observed in rodents (e.g. ) and in turtles (e.g. ). According to our results, most of Europe has been recolonized from the C.E. Europe at the end of the last Ice Age (Figure 1). The C.E. European lineage has experienced allopatric differentiation (Figure 1C-D; SAMOVA results) probably by reduction of the gene flow  resulting from natural sea barriers separating the British Isles and Gotland. The Balkan lineage also dispersed from its refugium and largely mixed with the C.E. European lineage in Central Europe (Figures 1, 3A), a common suture zone in many species (review in ). In contrast the South Italian lineage did not contribute to the postglacial colonization, and geographically slightly overlapped the C.E. European lineage in a narrow area in Central Italy (a similar contact zone exists for amphibians and reptiles [51, 52]). This was most likely caused by the invasion of North Italy by C.E. European lineage and a competitive exclusion process [53, 54] between lineages in this region.
Reproductive traits are generally assumed to be mainly shaped by (i) intraspecific interactions to maximize encounter rates among conspecific mates (sexual selection; [55, 56]) and (ii) interspecific interactions to maintain isolation barriers and decrease the likelihood of hybridization events among syntopic sister species [21, 57, 58]. This results in a stabilizing selection on reproductive traits to promote a species-specific signal (e.g. [21, 58]). Across species distribution, local reproductive trait variations (e.g. in moths: , in flies: , in solitary bees: , in birds: ), promoted by selection for specific optimized reproductive traits [58, 62], can appear due to changes in factors that affect communication systems: (i) mutation of genes involved in reproductive traits (e.g. [62, 63]), (ii) intraspecific interactions like local preferences of the receivers [23, 64], (iii) the presence/abundance of species with a similar courtship signal which would result in selection for releasers with the most distinct, optimized reproductive traits (e.g. ). These changes in factors that affect communication can happen through allopatric differentiation as observed in insular bumblebees where genetic divergence (fostered by geographic isolation) and local specific sexual selection (female preferences, e.g. ) lead to MMS divergence from other populations . This reproductive trait differentiation can persist in secondary sympatry if the differentiation has led to the establishment of a reproductive (pre-zygotic) isolation barrier (allopatric hypothesis, see ). Alternatively, reproductive trait divergence can happen when formerly isolated lineages meet as a result of range expansion at a secondary contact zone (reinforcement hypothesis; see [25, 66, 67]). Reinforcement leads to local adaptations in secondary contact zones (e.g. [68, 69]).
Our statistical analyses (Figure 4, perMANOVA results) support MMS quantitative as well as qualitative differentiations of Southern Italian populations from other European populations of B. lapidarius. These qualitative changes (e.g. from hexadec-9-enoic acid in Southern Italian individuals to hexadec-7-enoic acid in other individuals) have probably arisen from enzyme change (e.g. from Δ9-desaturase to Δ7-desaturase acting on palmitic acid) by mutations of enzyme genes (e.g. ) or by activation of a non-functional enzyme gene transcript present in a common ancestor, as observed in moths .
The MMS differentiation is poorly explained by geographic distance but displays an obvious geographic pattern (Figure 4, Mantel tests results). The MMS divergence is also weakly correlated with the genetic distance but strongly matches the divergence of the lineage L2a (Figure 4B; Mantel tests results). This suggests that the South Italian MMS differentiation is related to the divergence of the South Italian lineage. Moreover, the lack of gene flow without an obvious geographic barrier between Northern and Southern Italy (Figures 1, 2, 3A, but see [30, 42] for putative hybrid records) suggests a reproductive isolation fostered by species intrinsic isolation mechanisms, probably by MMS differentiation leading to pre-zygotic isolation barriers between populations. Therefore we expect that the MMS divergence has appeared subsequent to (i) allopatric divergence during Ice Ages and persistence during current interglacial period (allopatric hypothesis) or (ii) a reinforcement process during the range expansion and the admixture with the C.E. lineage (reinforcement hypothesis). The Southern Italian MMS differentiation seems due to allopatric divergence during glacial isolation rather than reinforcement in secondary contact zone. This is because the Southern Italian MMS pattern is spread over the whole area of the Southern Italian lineage while reinforcement should mainly lead to local adaptations in secondary contact zones as observed in flies .
Our statistical analyses demonstrate that the intrapopulational MMS variability is not correlated to the intrapopulational genetic diversity. The geographic areas of high/low intrapopulational MMS variability do not match with areas of high/low intrapopulational genetic diversity (Figure 3B-D). The MMS variability map shows the lowest intrapopulational MMS variability in the contact zone between C.E. European and Southern Italian lineages. We hypothesise that this local decrease of intrapopulational MMS variability could be due to the intensification of the stabilizing selection on MMS for local specific optimized reproductive traits further to new interaction between closely related taxa (lineage L2a and L3) during the postglacial range expansion. This local reinforcement process is also supported by the divergence of Northern Italian populations of C.E. European lineage from other C.E. European lineage populations in MMS composition which increases the MMS divergence with Southern Italian populations (Additional file 3).
The comparisons of the phylogeographic patterns, MMS pattern distribution and MMS variability distribution suggest that the MMS divergence leading to pre-zygotic isolation between Southern Italian and C.E. European lineages has been fostered by past allopatry between refugia and persists in the current interglacial period (allopatric hypothesis). The post-glacial range expansion of these lineages differentiated in MMS seems to have led to a reinforcement process in the secondary contact zone. Further studies in contact zones (e.g. testing the fitness of hybrids, bioassays) are needed to check this hypothesis.
Colour pattern variation
The genetic distances and lineages only partially reflect the colour patterns  (Figure 1, see also Mantel test results). The colour pattern B. lapidarius caucasicus fits with L1 while other colour forms (B. lapidarius decipiens and B. lapidarius lapidarius) spread in L2 and L3 (Figure 1). The MMS distance is also poorly correlated to the colour pattern distance. The MMS and colour patterns are only congruent for the South Italy individuals (Figure 4C). This incongruence between colour patterns, MMS and phylogeny should be a consequence of local Müllerian mimicry processes (e.g. convergence of Spanish B. lapidarius to the Pyrenean colour pattern ) as observed in other bumblebees [71, 72].
The consequence of genetic and MMS divergences between populations fostered by species range contraction during climatic oscillations can range from simple regional variations (e.g. ), to a speciation processes between allopatric refugia (e.g. [16, 74]). Several pieces of evidence strongly suggest an incipient speciation between Southern Italian and other European lineages of B. lapidarius contrary to the current literature (e.g. ): (i) the concordance of genetic divergence in tree topologies derived from mitochondrial and nuclear genes [4, 75], (ii) the persistence of the genetic divergence through time despite the current secondary contact zone , (iii) the divergences in chemical reproductive traits (e.g. qualitative changes or double-bond position shifts) which can lead to discrimination by receivers and to pre-mating isolation  as observed in moths (e.g. ) and in bumblebees . Therefore, Southern Italy could be a speciation centre for B. lapidarius as observed in amphibians [52, 77, 78] or in reptiles (e.g. [48, 51, 79]). In the same way, the large mitochondrial and nuclear divergences observed in East Turkish B. lapidarius strongly suggest that they are a different species (MMS data not available). In contrast, the speciation process has not happened in other lineages according to current gene flow and the lack of MMS and morphological differentiations. However, these speciation hypotheses need to be checked with further integrative taxonomy, including morphologic, chemical, ethologic and genetic criteria (e.g. ) based on an adapted species concept  in order to assess the taxonomic status within the B. lapidarius complex.
Quaternary climatic oscillations seem to have led to the European B. lapidarius population contraction in three main refugia. These refugia were localized inside Mediterranean peninsulas (Southern Italy and the Balkan) and in Central-Eastern Europe. Inside the Southern Italian refugium, populations have experienced allopatric MMS differentiation and an incipient speciation process from other lineages. At the end of Ice Age, the range expansion of lineages have led to secondary contact zone between the C.E. Europe lineage and Southern Italy lineage where a reinforcement process on local MMS patterns seems to take place.
This study suggests that the population movement during Quaternary climatic oscillations can lead to divergence in reproductive traits by allopatric differentiation during Ice Age and by reinforcement during post-glacial recolonization.
259 males of B. lapidarius covering its whole European distribution were collected (Figure 1A, Additional file 6). Males were killed by freezing at -20°C. The MMS were extracted in 400 μl of hexane from dissected cephalic labial glands or entire cut heads . Bodies without heads were preserved in 99% ethanol for DNA extraction. All samples were stored at -40°C until analyses. Populations from the Caucasus (B. lapidarius eriophorus) and Morocco (B. lapidarius atlanticus) were not collected. Closely related species were also collected as outgroup  (see Additional file 6).
We sequenced five genes commonly used to study interspecific and intraspecific relationships (e.g. [80, 83, 84]: mitochondrial cytochrome oxidase 1 (COI), cytochrome b (Cytb), nuclear protein-coding genes long-wavelength rhodoposin copy 1 (LWhH), elongation factor-1 alpha F2 copy (EF-1α), and phosphoenolpyruvate carboxykinase (PEPCK). We used a multi-gene approach to check congruence of phylogenetic and phylogeographic patterns inferred from independent genes.
DNA preparation, amplification and sequencing
Total DNA was extracted using a QIAGEN DNeasy® Tissue Kit (Quiagen Inc., Valencia, CA). Legs were removed from the specimen, crushed using liquid nitrogen, and digested (four hours in proteinase K at 56°C). Polymerase chain reaction (PCR) amplifications were carried out for all genes and all samples using primer pair Apl2013/ApH2931  for COI, CB1/CB2  for Cytb, LWRhF/LWRhR  for LWRh, F2-ForH/F2-RevH2  for EF-1α, and FHv4/RHv4  for PEPCK. PCR amplifications were carried out by initial denaturing for three minutes at 94°C, 35 (COI, LWRh, EF-1α) or 40 (Cytb and PEPCK) cycles of one minute denaturing at 94°C, one minute annealing at 51°C (COI), 50°C (Cytb), 60°C (LWRh), 54°C (EF-1α) or 48.5°C (PEPCK), two minutes elongation at 72°C and a final extension for ten minutes at 72°C. Voucher specimens and PCR products used in molecular investigation were deposited at the University of Mons (Belgium). Genes were sequenced with an ABI 3730XL sequencer (Applied Biosystems, Foster City, CA, USA), with ABI 3730 DNA analyzer or by GENOSCOPE (Centre National de Séquençage; Evry, France). Both strands of each PCR product were sequenced. Consensus sequences were computed with CodonCode Aligner 3.0.1. There is no uncertainty in the consensus sequences. The bumblebee origin of each sequence was checked with BLAST 2.2.20 . The alignment was performed by MAFFT ver.6. (using FFT-NS-2 algorithms, default parameters) . The data matrix was computed on Mesquite 2.74 (build 486) . Translation to proteins (using the Drosophila mitochondrial DNA genetic code or Universal genetic code) was performed on Mesquite. Sequences were deposited in GenBank [GenBank: KC915396 to KC916646] (Additional file 6).
We performed phylogeographic and phylogenetic analyses to investigate the relationships between haplotypes and to infer the phylogenetic lineages. A test of saturation was applied to each gene fragment in PAUP* 4.0b 10 .
Haplotype networks were computed using the me-dian-joining method (MJM, ) in Network 126.96.36.199 (http://www.fluxus-engineering.com) for each gene. The median-joining method uses a maximum parsimony approach to search for all the shortest phylogenetic trees for a given data set . To reconstruct the network, we weighted transversions twice as high as transitions.
Phylogenetic analyses were carried out using distance (neighbour joining [NJ]), maximum likelihood (ML) and Bayesian (MB) methods. Trees were rooted with species from outgroup. For each approach, we analysed (i) each genes individually, (ii) all mitochondrial genes combined (COI + Cytb), (iii) all nuclear genes combined (EF-1α + LWRh + PEPCK), and (iv) all genes combined. We assessed the level of incongruence in phylogenetic reconstructions among genes by comparing congruence and incongruence of well-supported clades across NJ, ML and MB trees of each gene. Overall, the gene trees were not strongly conflicting with each other and well-supported clades were similar across all trees (see supplementary trees at [TreeBASE ID: S14299]). We did not perform the incongruence length difference test  because its utility in evaluating homogeneity test has been questioned (e.g. ).
NJ analyses were performed using PAUP*. We used jModeltest  using the Akaike information criteria corrected for small sample sizes (AICc, ) to determine the best fitting substitution models (GTR + G). The robustness of the NJ trees was assessed by bootstrap resampling (10,000 random replications, ).
ML analyses were conducted in GARLI 2.0 . Each gene was partitioned to explore the best substitution model: (i) EF-1α into two exons and one intron, (ii) LWRh into three exons and two introns, (iii) PEPCK into two exons and two introns, and (iv) COI, Cytb and each nuclear exon by base positions (1st, 2nd and 3rd). The best fitting substitution models were chosen with jModeltest using the AICc for each dataset. The chosen models were: (i) for COI: TIM1 + G (1st), F81 (2nd) and TPM1uf + G (3rd); (ii) for Cytb: TrN + I + G (1st), F81 (2nd) and HKY + G (3rd); (iii) for EF-1α exon 1: F81 (1st and 2nd) and HKY (3rd); (iv) for EF-1α intron: HKY; (v) for EF-1α exon 2: F81 (1st), K80 (2nd) and TPM3 (3rd); (vi) for LWRh exon 1: F81 (1st and 2nd) and K80 (3rd); (vii) for LWRh introns 1 and 2: F81; (viii) for LWRh exon 2: F81 (1st), JC (2nd) and K80 (3rd): (ix) for LWRh exon 3: JC (1st, 2nd and 3rd); (x) for PEPCK introns 1 and 2: HKY; (xi) for PEPCK exon 1: TrN (1st), JC (2nd) and TrNef (3rd); (xii) for PEPCK exon 2: F81 + I (1st ), K80 (2nd) and JC (3rd). A random starting tree and the automated stopping criterion (stop when the ln score remained constant for 20,000 consecutive generations) were used. Ten independent runs in GARLI were carried out; the topology and –ln(L) were identical among replicates. The highest likelihood of one of those runs was retained. Statistical confidence in nodes was evaluated using 10,000 non-parametric bootstrap replicates  using the automated stopping criteria set at 10,000 generations. Topologies with bootstrap values ≥70% were considered well supported .
MB analyses were carried out using MrBayes 3.1.2 . The model selection process was the same as that for ML analyses. Selected models which are not implemented in MrBayes were substituted by the closest overparameterized model . The TIM1, TIM1uf, and TrN were replaced by the GTR model whereas the TPM3 and TrNef were replaced by the SYM model. The proportion of invariant sites (I) and gamma distributed rates (G) defined in jModeltest were conserved in all models. Five independent analyses were carried out (30 million generations, four chains with mixed-models, default priors, saving trees every 100 generations). The analyses were stopped after checking convergence between runs using the average standard deviation of split frequencies and by plotting likelihood values across generations using Tracer 1.4 . The first three million generations were discarded as burn-in. The phylogeny and posterior probabilities were then estimated from the remaining trees and a majority-rule 50% consensus tree was constructed. Topologies with posterior probabilities ≥ 0.95 were considered as well supported .
Estimating divergence time
Following the approach of Duennes et al. , we analyzed the COI dataset in BEAST v1.7.2  to roughly estimate the time to most recent common ancestor of B. lapidarius haplogroups. Using the GTR + G model selected by jModeltest, we ran Markov chain Monte Carlo simulations with the coalescent constant population size tree model and the strict clock model. Prior, the relaxed uncorrelated lognormal molecular clock model  was implemented to assess the clock-like nature of the data. The sampled marginal posterior probability (PP) distribution of both standard deviation and coefficient of variation of substitution rates among tree branches included 0 in the 95% HPD, indicating that there was no strong evidence for substantial rate heterogeneity among lineages . We specified a range of possible substitution rates which include extreme rate of insect mitochondrial genes recorded in the literature (see ) using a flat prior ranging from 1×10-9 to 1×10-7 substitutions site-1 and year-1. Simulations were run for 30 million generations, sampling every 1000 generations. Four independent runs were assessed in Tracer v1.4.1  to confirm convergence, determine burn-in, and examine the effective sample size of all posterior parameters. Log files from each run were combined in LogCombiner v1.6.1  for final parameter estimates.
Identifying population structure
Population structure was assessed for each gene by estimating “global” ΦST statistic on populations (i.e. only AMOVA ΦST estimator when considering only one group of populations ) using ARLEQUIN 3.5  with 100,000 permutations. When global ΦST statistic was significant for one gene, we analyzed the population structure using the spatial AMOVA procedure (SAMOVA)  implemented in SPADS 1.0 . This procedure assigns populations to groups based on geographical vicinity and sequence similarity. The most likely structure corresponds to the partition of populations that maximized among-group variation measured by the AMOVA ΦST statistic . We performed ten independent repetitions of 10,000 simulated annealing steps for K values varying from two to 45.
Estimating genetic diversity
Genetic diversity was estimated with the number of different haplotypes, the haplotype diversity (h, ), and the nucleotide diversity (π, ) using DnaSP  within defined geographic groups and for each gene separately. The defined geographic groups gathered samples from the same geographic regions (see Additional file 6).
Cartography of inter-individual genetic distance and intra-population genetic diversity
We used an extension of the method developed by Miller  to represent spatial variations of genetic patterns (distance and diversity). The method of Miller  is based on a connectivity network (e.g. a Delaunay triangulation) built from the sampling localities. Inter-individual genetic distances are first estimated and assigned to landscape coordinates at midpoints of each Delaunay triangulation edge. An interpolation procedure (inverse distance-weighted interpolation, [115, 116]) is then used to infer genetic distances at locations on a uniformly spaced grid. Here, we also used this method to represent the spatial variation of any kind of distance and measures of diversity. The interpolation procedure of measures of diversity is based on (diversity) values directly estimated at each sampling point rather than on (distance) values assigned at midpoints of each edge of a connectivity network. Interpolation surfaces were generated with the MATLAB (The MathWorks, Inc) functions GDisPAL for “genetic distance patterns across landscapes” and GDivPAL for “genetic diversity patterns across landscapes” . All generated surfaces were finally superimposed on a map of the distribution area of the geographic region studied. We generated two surfaces based (i) inter-individual genetic distance measured as DNA sequence mismatches averaged over the four polymorphic loci (i.e. averaged p-distances over loci), and (ii) on genetic diversity measured with the nucleotide diversity estimated within each geographic group (see Additional file 1) and averaged over the four polymorphic loci Both the inter-individual genetic distance and nucleotide diversity were computed with SPADS . The geographic coordinates of geographic groups were the barycenter between all sampling places included in geographic groups. All surfaces were generated using a distance weighting parameter a = 2 and distance surfaces were both based on a Delaunay triangulation connectivity network. For the interpolation parameter a, we selected a value that allowed to generate interpolation surfaces displaying local details. As advised by Miller et al , we performed the distance interpolations using (i) residual distances derived from the linear regression of genetic vs. geographical distances  and (ii) MMS vs. geographic distances in order to account for potential correlation between genetic and geographic distances.
Chemical analyses of MMS
We analyzed the MMS of 226 individuals (Additional file 6). The composition of MMS was determined by gas chromatography-mass spectrometry (GC/MS) on a Finigan GCQ with a DB-5 ms non-polar capillary column (5% phenyl (methyl) polysiloxane stationary phase; 30 m × 0.25 mm × 0.25 μm) and an ion trap in electron impact mode “full scan (300-600)”. We used a splitless injection mode (220°C) and helium as carrier gas (50 cm/s). The temperature programme of the column was set to 70°C for two minutes and then increased at a rate of 10°C/min to 320°C. The temperature was then held at 320°C for five minutes. Compounds were identified in Xcalibur™ using their mass spectra compared to those at National Institute of Standards and Technology library (NIST, U.S.A) using NIST MS Search 2.0. The double bond positions were determined i) from mass spectra of dimethyl disulphide adducts of unsaturated components (reaction time: four hours) and ii) by chemical ionization with acetonitrile as a reaction gas. The products were analyzed by GC/MS using the same temperature programme as for the original extracts. An ion trap GC/MS instrument (Varian Saturn, 2000) was used for chemical ionization. The compound relative quantification of MMS was estimated using a gas chromatograph Shimadzu GC-2010 with a SLB-5 ms non-polar capillary column 5% diphenyl/95% dimethyl siloxane; 30 m × 0.25 mm × 0.25 μm) and a flame ionization detector. The chromatographic conditions were the same as above. The absolute amounts of compounds were quantified using GCsolution Postrun (Shimadzu Corporation) with automatic peak detection and noise measurement. Relative amounts (RA in%) of compounds in each sample were calculated by dividing the absolute amounts of compounds by the total absolute amount of compounds in each sample. We did not use any correction factor to calculate the RA of individual compounds. All compounds for which RA were recorded as less than 0.1% for all specimens were discarded . The data matrix for each species was elaborated with the relative proportion of each compound for each individual  using GCAligner 1.0  before final check.
Comparative statistical analyses based on MMS
All statistical analyses were performed using R  to detected MMS differentiations between populations. Data consisting of the relative proportion of all compounds were transformed (log (x-1)) to reduce the large difference of abundance between highly and slightly concentrated compounds and standardized (mean = 0, standard deviation = 1) to reduce the sample concentration effect . We compared the profile for each sample (ungrouped) with non-metric multidimensional scaling (nMDS) ordination using a Bray-Curtis similarity matrix, three dimensions and 50 runs (R-package ecodist, ). When we detected MMS differentiations between population groups in nMDS, we assessed results by performing permutation-based version of the multivariate analysis of variance (perMANOVA) using the Bray-Curtis similarity matrix and 10,000 permutations (R-package vegan, ). Like conventional analyses of variances, the perMANOVA calculates an F statistic by taking the ratio of among group sums of squares to within group sums of squares. The perMANOVA is robust to violations of multivariate normality but requires homogeneity of variances. When we tested more than two groups and the returned p-value was significant (p < 0.01), multiple pairwise comparisons were conducted and p-values were adjusted using Bonferroni’s correction to avoid increases of type error I due to multiple testing. Prior to perMANOVA we checked its assumption (similar variance homogeneity between groups) by performing a distance-based test for multivariate homogeneity of group dispersions (MHGH) for a one-way ANOVA design . The procedure first calculates the Euclidean distances between MMS composition and respective geographic group centroids on Pearson r correlation matrices (R- package vegan, ). To test if one group was more variable than the other, the magnitudes of these distances were then compared between groups using ordinary ANOVA. Moreover, a permutation test was run to generate a permutation distribution of F under the null hypothesis of no difference in dispersion between groups (R- package vegan, ).
Specific MMS compounds
To determine compounds specific to groups defined by nMDS and confirmed by perMANOVA, we used the indicator value (IndVal) method . A high value is obtained when the compound is specific and regular to a particular group compared to the whole set of observations. The statistical significance of a compound as an indicator at the 0.05 level was evaluated using a randomization procedure.
Chemical variability within populations
We compared variability of MMS between each geographic group (pairwise analyses) using a distance-based test for MHGH for a one-way ANOVA design .
Cartography of inter-individual MMS distance and intra-population MMS variability
We used the extended Miller’s  method (i.e. GDisPAL and GDivPAL functions, see above) to represent spatial variations of MMS patterns (distance and variability). We generated two different surfaces: (i) on MMS distance (Bray-Curtis similarity matrix with dataset transformed and standardized), (ii) on MMS variability (the mean Euclidean distance between MMS composition of each sample and its respective geographic group centroids (computed on Bray-Curtis similarity matrix with dataset transformed and standardized)). MMS variability was estimated within each geographic group (see Additional file 1). The geographic coordinates of geographic groups were the same as genetic analyses. All surfaces were generated using a distance weighting parameter a = 2 and distance surfaces were both based on a Delaunay triangulation connectivity network. Furthermore, we performed the distance interpolations using residual distances derived from the linear regression of MMS versus. geographical distances in order to account for potential correlation between MMS and geographical distances.
Comparative statistical analyses between geographic origin, colour patterns, genetic and MMS
We investigated correlations between genetic divergence, colour patterns, MMS differentiation, and geographic distance by performing six Mantel tests  with 10,000 random permutations in R (R-package vegan): (i) genetic distances versus geographic distances, (ii) MMS distances versus geographic distances, (iii) colour distances versus geographic distances, (iv) genetic distances versus MMS distances, (v) genetic distances versus colour distances, and (vi) MMS distances versus colour distances. Geographic distances between all samples were computed with Geographic Distance Matrix Generator version 1.2.3 . Genetic distances were calculated in the same way as for the cartography of inter-individual genetic distance (see above) and were thus based on DNA sequence mismatches averaged over the four polymorphic loci (i.e. averaged p-distances over loci) computed with SPADS . The MMS distance matrix was the individual-by-individual Bray-Curtis distance matrix based on MMS compounds. The colour distance matrix compared the colour pattern between all individuals and was generated as followed: when two individuals shared the same colour pattern, their colour distance was set to “0” and otherwise it was set to “1”.
We also performed three partial Mantel tests based on 10,000 random permutations controlling for spatial autocorrelation between (i) genetic distances versus MMS distances, (ii) genetic distances versus colour distances, and (iii) MMS distances versus colour distances.
We used a generalized linear model  to estimate the potential linear correlation between genetic diversity and MMS variability. To estimate diversities we gathered samples from the same geographic groups (Additional file 6) and excluded geographic groups with only one sample. Genetic diversity used was the nucleotide diversity estimated within each geographic group and averaged across loci. MMS variability was the mean Euclidean distance between MMS composition of each sample and its respective geographic group centroids (computed on Bray-Curtis similarity matrix with dataset transformed and standardized).
The genetic datasets supporting the results of this article are available at TreeBase http://purl.org/phylo/treebase/phylows/study/TB2:S14299. Genbank accession numbers are listed in the Additional file 6 - Table of sampling. The MMS dataset supporting the results of this article is included within the article and its Additional file 3.
Analysis of molecular variance
- C.E. Europe:
Central and Eastern Europe
East Turkey lineage
Southern Italy (L2a) and South Eastern European (L2b) lineages
C.E. European lineage
Nuclear protein-coding genes long-wavelength rhodoposin copy 1
Multivariate homogeneity of group dispersions
Male marking secretion
Permutation-based version of the multivariate analysis of variance
Wallace A: The cause of an ice age. Nature. 1896, 53: 220-221.
Molengraaff G, Weber M: On the relation between the pleistocene glacial period and the origin of the Sunda Sea (Java- and South China-Sea), and its influence on the distribution of coralreefs and on the land- and freshwater fauna. Proceeding Sect Sci. 1921, 23: 395-439.
Reinig WF: Die Holarktis. Ein Beitrag zur diluvialen und alluvialen Geschichte der zirkumpolaren Faunen- und Florengebiete. 1937, Jena, Germany: Verlag von Gustav Fischer
Avise JC: Phylogeography: the history and formation of species. 2000, Cambridge, MA: Harvard University Press
Hewitt GM: Genetic consequences of climatic oscillations in the quaternary. Philos Trans R Soc B. 2004, 359: 183-195. 10.1098/rstb.2003.1388.
Andersen BG, Borns HWJ: The ice age world: an introduction to quaternary history and research with emphasis on North America and Northern Europe during the last 2.5 million. 1994, Oxford, UK: Oxford University Press
Bennett KD: Evolution and ecology: the pace of life. 1997, Cambridge, UK: Cambridge University Press
Zagwijn WH: The beginning of the ice age in Europe and its major subdivisions. Quat Sci Rev. 1992, 11: 583-591. 10.1016/0277-3791(92)90015-Z.
Stewart JR, Lister AM, Barnes I, Dalén L: Refugia revisited: individualistic responses of species in space and time. Proc R Soc B Biol Sci. 2010, 277: 661-671. 10.1098/rspb.2009.1272.
Taberlet P: Biodiversity at the intraspecific level: the comparative phylogeographic approach. J Biotechnol Genome Anal Chang Face Biotechnol. 1998, 64: 91-100.
Coyne JA, Orr HA: Speciation. 2004, Sunderland, MA: Sinauer
Sobel JM, Chen GF, Watt LR, Schemske DW: The biology of speciation. Evolution. 2010, 64: 295-315. 10.1111/j.1558-5646.2009.00877.x.
Blondel J: Biogéographie: approche écologique et évolutive. 1995, Paris: Mason
Mayr E: Systematics and the origin of species. 1942, New York: Columbia University Press
Hewitt GM: Some genetic consequences of ice ages, and their role in divergence and speciation. Biol J Linn Soc. 1996, 58: 247-276.
Hewitt GM: Speciation, hybrid zones and phylogeography - or seeing genes in space and time. Mol Ecol. 2001, 10: 537-549.
Excoffier L, Foll M, Petit RJ: Genetic consequences of range expansions. Annu Rev Ecol Evol Syst. 2009, 40: 481-501. 10.1146/annurev.ecolsys.39.110707.173414.
Bush GL: Sympatric speciation in animals: new wine in old bottles. Trends Ecol Evol. 1994, 9: 285-288. 10.1016/0169-5347(94)90031-0.
Foster SA: The geography of behaviour: an evolutionary perspective. Trends Ecol Evol. 1999, 14: 190-195. 10.1016/S0169-5347(98)01577-8.
Boughman JW: How sensory drive can promote speciation. Trends Ecol Evol. 2002, 17: 571-577. 10.1016/S0169-5347(02)02595-8.
Paterson HEH: Evolution and the recognition concept of species. 1993, Baltimore, MD: The Johns Hopkins University Press
Gee JM: IEG and behavioral response to conspecific and heterospecific calls in hybridizing avian species, California and Gambel’s quail. Integr Comp Biol. 2005, 45: 1000-
Lecocq T, Vereecken NJ, Michez D, Dellicour S, Lhomme P, Valterová I, Rasplus J-Y, Rasmont P: Patterns of genetic and reproductive traits differentiation in mainland vs. Corsican populations of bumblebees. PLoS One. 2013, 8: e65642-10.1371/journal.pone.0065642.
Ptacek MB: The role of mating preferences in shaping interspecific divergence in mating signals in vertebrates. Behav Process. 2000, 51: 111-134. 10.1016/S0376-6357(00)00123-6.
Nosil P, Vines TH, Funk DJ: Perspective: reproductive isolation caused by natural selection against immigrants from divergent habitats. Evolution. 2005, 59: 705-719.
West-Eberhard MJ: Sexual selection, social competition, and speciation. Q Rev Biol. 1983, 58: 155-183. 10.1086/413215.
Panhuis TM, Butlin R, Zuk M, Tregenza T: Sexual selection and speciation. Trends Ecol Evol. 2001, 16: 364-371. 10.1016/S0169-5347(01)02160-7.
Vereecken NJ, Mant J, Schiestl FP: Population differentiation in female sex pheromone and male preferences in a solitary bee. Behav Ecol Sociobiol. 2007, 61: 811-821. 10.1007/s00265-006-0312-z.
Hines HM: Historical biogeography, divergence times, and diversification patterns of bumble bees (Hymenoptera: Apidae: Bombus). Syst Biol. 2008, 57: 58-75. 10.1080/10635150801898912.
Reinig WF: On the variation of Bombus lapidarius L. and its cuckoo, Psithyrus rupestris Fabr., with notes on mimetic similarity. J Genet. 1935, 30: 321-356. 10.1007/BF02982243.
Baer B: Bumblebees as model organisms to study male sexual selection in social insects. Behav Ecol Sociobiol. 2003, 54: 521-533. 10.1007/s00265-003-0673-5.
Bergström G, Svensson BG, Appelgren M, Groth I: Complexity of bumble bee marking pheromones: biochemical, ecological and systematical interpretations. Syst Assoc. 1981, 19: 175-183.
Žáček P, Kalinová B, Šobotník J, Hovorka O, Ptácek V, Coppée A, Verheggen F, Valterová I: Comparison of age-dependent quantitative changes in the male labial gland secretion of Bombus terrestris and Bombus lucorum. J Chem Ecol. 2009, 35: 698-705. 10.1007/s10886-009-9650-4.
Coppée A: Bombus terrestris (L. 1758): A complex species or a species complex? - Intraspecific pheromonal and genetic variations of Bombus terrestris (L.), Impacts on the speciation. PhD thesis. 2010, Université de Mons, Laboratoire de Zoologie
Žáček P, Prchalová-Hornákov D, Tykva R, Kindl J, Vogel H, Svatoš A, Pichová I, Valterová I: De Novo biosynthesis of sexual pheromone in the labial gland of bumblebee males. ChemBioChem. 2013, 14: 361-371. 10.1002/cbic.201200684.
Luxová A, Valterová I, Stránský K, Hovorka O, Svatoš A: Biosynthetic studies on marking pheromones of bumblebee males. Chemoecology. 2003, 13: 81-87.
Brower AVZ: Rapid morphological radiation and convergence among races of the butterfly Heliconius erato inferred from patterns of mitochondrial DNA evolution. Proc Natl Acad Sci USA. 1994, 91: 6491-6495. 10.1073/pnas.91.14.6491.
Svensson BG, Bergström G: Volatile marking secretions from the labial gland of north European pyrobombus D. T. males (Hymenoptera, Apidae). Insectes Soc. 1977, 24: 213-224. 10.1007/BF02227172.
Bergman P: Chemical communication in bumblebee premating behaviour. PhD thesis. 1997, Göteborg University, Department of Chemical Ecology
Cucci L: Raised marine terraces in the northern Calabrian Arc (southern Italy): a 600 kyr-long geological record of regional uplift. Ann Geophys. 2004, 47: 1391-1406.
Hewitt GM: Post-glacial re-colonization of European biota. Biol J Linn Soc. 1999, 68: 87-112. 10.1111/j.1095-8312.1999.tb01160.x.
Reinig WF: Bastardierungszonen und mischpopulationen bei Hummeln (Bombus) und Schmarotzerhummeln (Psithyrus) (Hymenopt., Apidae). Mittr Mue Entomol Ges. 1969, 59: 1-89.
Petit RJ, Aguinagalde I, De Beaulieu J-L, Bittkau C, Brewer S, Cheddadi R, Ennos R, Fineschi S, Grivet D, Lascoux M, Mohanty A, Müller-Starck G, Demesure-Musch B, Palmé A, Martín JP, Rendell S, Vendramin GG: Glacial refugia: hotspots but not melting pots of genetic diversity. Science. 2003, 300: 1563-1565. 10.1126/science.1083264.
Stewart JR, Lister AM: Cryptic northern refugia and the origins of the modern biota. Trends Ecol Evol. 2001, 16: 608-613. 10.1016/S0169-5347(01)02338-2.
Martini I, Sagri M, Colella A: Neogene-Quaternary basins of the inner Apennines and Calabria arc. Anat an Orogen Apennines Adjac Mediterr basins. Edited by: Martini I. 2001, Dordrecht, Netherlands: Kluwer Academic Publishers, 375-400.
Strahler A, Strahler A: Elements of physical geography. 1989, New York, NY: John Wiley & Sons, 3
Michaux JR, Libois R, Filippucci M-G: So close and so different: comparative phylogeography of two small mammal species, the yellow-necked fieldmouse (Apodemus flavicollis) and the woodmouse (Apodemus sylvaticus) in the western Palearctic region. Heredity. 2005, 94: 52-63. 10.1038/sj.hdy.6800561.
Fritz U, D’Angelo S, Pennisi MG, LoValvo M: Variation of Sicilian pond turtles, Emys trinacris - what makes a species cryptic?. Amphibia-Reptilia. 2006, 27: 513-529. 10.1163/156853806778877095.
Mac Arthur R, Wilson E: The theory of island biogeography. 1967, Princeton, N.J: Princeton University Press
Hewitt GM: The genetic legacy of the quaternary ice ages. Nature. 2000, 405: 907-913. 10.1038/35016000.
Joger U, Fritz U, Guicking D, Kalyabina-Hauf S, Nagy ZT, Wink M: Phylogeography of western Palaearctic reptiles - spatial and temporal speciation patterns. Zool Anz. 2007, 246: 293-313. 10.1016/j.jcz.2007.09.002.
Mattoccia M, Marta S, Romano A, Sbordoni V: Phylogeography of an Italian endemic salamander (genus Salamandrina): glacial refugia, postglacial expansions, and secondary contact. Biol J Linn Soc. 2011, 104: 903-922. 10.1111/j.1095-8312.2011.01747.x.
Gause GF: Experimental studies on the struggle for existence: 1. Mixed population of two species of yeast. J Exp Biol. 1932, 9: 389-402.
Hardin G: The competitive exclusion principle. Science. 1960, 131: 1292-1297. 10.1126/science.131.3409.1292.
Andersson M: Sexual selection. 1994, Princeton, USA: Princeton University Press
Wyatt TD: Pheromones and animal behaviour: communication by smell and taste. 2003, Cambridge, UK: Cambridge University Press
Loftus-Hills JJ, Littlejohn MJ: Reinforcement and reproductive character displacement in Gastrophryne carolinensis and G. olivacea (Anura: Microhylidae): a reexamination. Evolution. 1992, 46: 896-906. 10.2307/2409744.
Symonds MRE, Moussalli A, Elgar MA: The evolution of sex pheromones in an ecologically diverse genus of flies. Biol J Linn Soc. 2009, 97: 594-603. 10.1111/j.1095-8312.2009.01245.x.
Clearwater JR, Foster SP, Muggleston SJ, Dugdale JS, Priesner E: Intraspecific variation and interspecific differences in sex pheromones of sibling species in Ctenopseustis obliquana complex. J Chem Ecol. 1991, 17: 413-429. 10.1007/BF00994342.
Blyth JE, Lachaise D, Ritchie MG: Divergence in multiple courtship song traits between Drosophila santomea and D. yakuba. Ethology. 2008, 114: 728-736. 10.1111/j.1439-0310.2008.01519.x.
Förschler MI, Kalko EKV: Geographical differentiation, acoustic adaptation and species boundaries in mainland citril finches and insular Corsican finches, superspecies Carduelis citrinella. J Biogeogr. 2007, 34: 1591-1600. 10.1111/j.1365-2699.2007.01722.x.
Löfstedt C: Moth pheromone genetics and evolution. Philos Trans R Soc B. 1993, 340: 167-177. 10.1098/rstb.1993.0055.
Takanashi T, Huang Y, Takahasi KR, Hoshizaki S, Tatsuki S, Ishikawa Y: Genetic analysis and population survey of sex pheromone variation in the adzuki bean borer moth, Ostrinia scapulalis. Biol J Linn Soc. 2005, 84: 143-160.
Roelofs WL, Liu W, Hao G, Jiao H, Rooney AP, Linn CE: Evolution of moth sex pheromones via ancestral genes. Proc Natl Acad Sci USA. 2002, 99: 13621-13626. 10.1073/pnas.152445399.
Templeton AR: Mechanisms of speciation: a population genetics approach. Annu Rev Ecol Syst. 1981, 12: 23-48. 10.1146/annurev.es.12.110181.000323.
Dobzhansky T: Speciation as a stage in evolutionary divergence. Am Nat. 1940, 74: 312-321. 10.1086/280899.
Noor MAF: Reinforcement and other consequences of sympatry. Heredity. 1999, 83: 503-508. 10.1038/sj.hdy.6886320.
Higgie M, Chenoweth S, Blows MW: Natural selection and the reinforcement of mate recognition. Science. 2000, 290: 519-521. 10.1126/science.290.5491.519.
Lukhtanov VA: Dobzhansky’s rule and reinforcement of prezygotic reproductive isolation in zones of secondary contact. Biol Bull Rev. 2011, 1: 2-12. 10.1134/S2079086411010051.
Delmas R: Contribution à l’étude de la faune française des Bombinae (Hymenoptera, Apoidea, Bombidae). Ann Soc Entomol Fr. 1976, 12: 247-290.
Williams PH: The distribution of bumblebee colour patterns worldwide: possible significance for thermoregulation, crypsis, and warning mimicry. Biol J Linn Soc. 2007, 92: 97-118. 10.1111/j.1095-8312.2007.00878.x.
Hines HM, Williams PH: Mimetic colour pattern evolution in the highly polymorphic Bombus trifasciatus (Hymenoptera: Apidae) species complex and its comimics. Zool J Linn Soc. 2012, 166: 805-826. 10.1111/j.1096-3642.2012.00861.x.
Tòth M, Löfstedt C, Blair BW, Cabello T, Farag AI, Hansson BS, Kovalev BG, Maini S, Nesterov EA, Pajor I, Sazonov AP, Shamshev IV, Subchev M, Szöcs G: Attraction of male turnip moths Agrotis segetum (Lepidoptera: Noctuidae) to sex pheromone components and their mixtures at 11 sites in Europe, Asia, and Africa. J Chem Ecol. 1992, 18: 1337-1347. 10.1007/BF00994360.
Santucci F, Emerson BC, Hewitt GM: Mitochondrial DNA phylogeography of European hedgehogs. Mol Ecol. 1998, 7: 1163-1172. 10.1046/j.1365-294x.1998.00436.x.
Avise JC: What is the field of biogeography, and where is it going?. Taxon. 2004, 53: 893-898. 10.2307/4135555.
Coppée A, Mathy T, Cammaerts M-C, Verheggen FJ, Terzo M, Iserbyt S, Valterová I, Rasmont P: Age-dependent attractivity of males’ sexual pheromones in Bombus terrestris (L.) [Hymenoptera, Apidae]. Chemoecology. 2011, 21: 75-82. 10.1007/s00049-011-0070-x.
Canestrelli D, Nascetti G: Phylogeography of the pool frog Rana (Pelophylax) lessonae in the Italian peninsula and Sicily: multiple refugia, glacial expansions and nuclear-mitochondrial discordance. J Biogeogr. 2008, 35: 1923-1936. 10.1111/j.1365-2699.2008.01946.x.
Canestrelli D, Sacco F, Nascetti G: On glacial refugia, genetic diversity, and microevolutionary processes: deep phylogeographical structure in the endemic newt Lissotriton italicus. Biol J Linn Soc. 2012, 105: 42-55. 10.1111/j.1095-8312.2011.01767.x.
Nagy ZT, Joger U, Guicking D, Amann T, Wink M: Phylogeography of the European whip snake, Coluber (Hierophis) viridiflavus Lacépède, 1789, inferred from nucleotide sequences of the mitochondrial cytochrome b gene and ISSR genomic fingerprinting. Biota. 2002, 3: 109-118.
Lecocq T, Lhomme P, Michez D, Dellicour S, Valterová I, Rasmont P: Molecular and chemical characters to evaluate species status of two cuckoo bumblebees: Bombus barbutellus and Bombus maxillosus (Hymenoptera, Apidae, Bombini). Syst Entomol. 2011, 36: 453-469. 10.1111/j.1365-3113.2011.00576.x.
De Queiroz K: Species concepts and species delimitation. Syst Biol. 2007, 56: 879-886. 10.1080/10635150701701083.
De Meulemeester T, Gerbaux P, Boulvin M, Coppée A, Rasmont P: A simplified protocol for bumble bee species identification by cephalic secretion analysis. Insectes Soc. 2011, 58: 227-236. 10.1007/s00040-011-0146-1.
Cameron SA, Hines HM, Williams PH: A comprehensive phylogeny of the bumble bees (Bombus). Biol J Linn Soc. 2007, 91: 161-188. 10.1111/j.1095-8312.2007.00784.x.
Pedersen BV: European bumblebees (Hymenoptera: Bombini) - phylogenetic relationships inferred from DNA sequences. Insect Syst Evol. 2002, 33: 361-386. 10.1163/187631202X00208.
Simon C, Frati F, Beckenbach A, Crespi B, Liu H, Flook P: Evolution, weighting, and phylogenetic utility of mitochondrial gene sequences and a compilation of conserved polymerase chain reaction primers. Ann Entomol Soc Am. 1994, 87: 652-701.
Mardulyn P, Cameron SA: The major opsin in bees (Insecta: Hymenoptera): a promising nuclear gene for higher level phylogenetics. Mol Phylogenet Evol. 1999, 12: 168-176. 10.1006/mpev.1998.0606.
Hines HM, Cameron SA, Williams PH: Molecular phylogeny of the bumble bee subgenus Pyrobombus (Hymenoptera: Apidae: Bombus) with insights into gene utility for lower-level analysis. Invertebr Syst. 2006, 20: 289-303. 10.1071/IS05028.
Zhang Z, Schwartz S, Wagner L, Miller W: A greedy algorithm for aligning DNA sequences. J Comput Biol. 2000, 7: 203-214. 10.1089/10665270050081478.
Katoh K, Misawa K, Kuma K-I, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30: 3059-3066. 10.1093/nar/gkf436.
Maddison W, Maddison D: Mesquite: a modular system for evolutionary analysis. 2007, Version 2.75. 2011. Available from http://mesquiteproject.org
Swofford DL: PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. 2001, Sunderland: Sinauer Associates
Bandelt H-J, Forster P, Röhl A: Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999, 16: 37-48. 10.1093/oxfordjournals.molbev.a026036.
Farris JM, Källersjo M, Kulge AG, Bult AC: Testing significance of incongruence. Cladistics. 1994, 10: 315-319. 10.1111/j.1096-0031.1994.tb00181.x.
Barker FK, Lutzoni FM: The utility of the incongruence length difference test. Syst Biol. 2002, 51: 625-637. 10.1080/10635150290102302.
Posada D: jModelTest: phylogenetic model averaging. Mol Biol Evol. 2008, 25: 1253-1256. 10.1093/molbev/msn083.
Hurvich CM, Tsai C-L: Regression and time series model selection in small samples. Biometrika. 1989, 76: 297-307. 10.1093/biomet/76.2.297.
Felsenstein J: Phylogenies and the comparative method. Am Nat. 1985, 125: 1-15. 10.1086/284325.
Zwickl DJ: Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criteria. PhD thesis. 2006, University of Texas, School of Biological Sciences
Hillis DM, Bull JJ: An empirical test of bootstrapping as a method for assessing confidence in phylogenetic analysis. Syst Biol. 1993, 42: 182-192.
Ronquist F, Huelsenbeck JP: MrBayes 3: bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.
Huelsenbeck JP, Rannala B: Frequentist properties of bayesian posterior probabilities of phylogenetic trees under simple and complex substitution models. Syst Biol. 2004, 53: 904-913. 10.1080/10635150490522629.
Rambaut A, Drummond AJ: Tracer version 1.4. http://beast.bio.ed.ac.uk/Tracer,
Wilcox TP, Zwickl DJ, Heath TA, Hillis DM: Phylogenetic relationships of the dwarf boas and a comparison of bayesian and bootstrap measures of phylogenetic support. Mol Phylogenet Evol. 2002, 25: 361-371. 10.1016/S1055-7903(02)00244-0.
Duennes MA, Lozier JD, Hines HM, Cameron SA: Geographical patterns of genetic divergence in the widespread Mesoamerican bumble bee Bombus ephippiatus (Hymenoptera: Apidae). Mol Phylogenet Evol. 2012, 64: 219-231. 10.1016/j.ympev.2012.03.018.
Drummond AJ, Rambaut A: BEAST: bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.
Drummond AJ, Ho SYW, Phillips MJ, Rambaut A: Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006, 4: 699-710.
Excoffier L, Smouse PE, Quattro JM: Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992, 131: 479-491.
Excoffier L, Lischer HEL: Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010, 10: 564-567. 10.1111/j.1755-0998.2010.02847.x.
Dupanloup I, Schneider S, Excoffier L: A simulated annealing approach to define the genetic structure of populations. Mol Ecol. 2002, 11: 2571-2581. 10.1046/j.1365-294X.2002.01650.x.
Dellicour S, Mardulyn P: SPADS 1.0: a toolbox to perform spatial analyses on DNA sequence datasets. Mol Ecol Resour. 2013, in press
Nei M, Tajima F: Maximum likelihood estimation of the number of nucleotide substitutions from restriction sites data. Genetics. 1983, 105: 207-217.
Nei M: Phylogeography: the history and formation of species. 1987, New York, NY: Columbia University Press
Rozas J, Rozas R: DnaSP version 2.0: a novel software package for extensive molecular population genetics analysis. Comput Appl Biosci. 1997, 13: 307-311.
Miller MP: Alleles In Space (AIS): computer software for the joint analysis of interindividual spatial and genetic information. J Hered. 2005, 96: 722-724. 10.1093/jhered/esi119.
Watson DF, Philips GM: A refinement of inverse distance weighted interpolation. Geo-processing. 1985, 2: 315-327.
Watson DF: Contouring: a guide to the analysis and display of spatial data. 1992, New York, NY: Pergamon Press
Manni F, Guérard E, Heyer E: Geographic patterns of (genetic, morphologic, linguistic) variation: how barriers can be detected by using Monmonier’s algorithm. Hum Biol. 2004, 76: 173-190. 10.1353/hub.2004.0034.
Terzo M, Valterová I, Rasmont P: Atypical secretions of the male cephalic labial glands in bumblebees: the case of Bombus (Rhodobombus) mesomelas Gerstaecker(Hymenoptera, Apidae). Chem Biodivers. 2007, 4: 1466-1471. 10.1002/cbdv.200790124.
Dellicour S, Lecocq T: GCALIGNER 1.0: an aligment program to compute a multiple sample comparison data matrix from large eco-chemical datasets obtained by gas chromatography. J Sep Sci. 2013, 36: 3206-3209.
R Development Core Team: R: A language and environment for statistical computing. 2012, Vienna, Austria
Goslee SC, Urban DL: The ecodist package for dissimilarity-based analysis of ecological data. J Stat Softw. 2007, 22: 1-19.
Oksanen FJ, Blanchet G, Kindt R, Legendre P, Minchin PR, O’Hara RB, Simpson GL, Solymos P, Stevens MHH, Wagner H: R package version 2.0-2. Vegan: community ecology package. 2011, http://CRAN.R-project.org/package=vegan,
Anderson MJ: Distance-based tests for homogeneity of multivariate dispersions. Biometrics. 2006, 62: 245-253. 10.1111/j.1541-0420.2005.00440.x.
Dufrêne M, Legendre P: Species assemblages and indicator species: the need for a flexible asymmetrical approach. Ecol Monogr. 1997, 67: 345-366.
Mantel N: The detection of disease clustering and a generalized regression approach. Cancer Res. 1967, 27: 209-220.
Ersts PJ: Geographic distance matrix generator(version 1.2.3). 2012, American Museum of Natural History, Center of Biodiversity and Conservation, Available from http://biodiversityinformatics.amnh.org/open_source/gdmg
Dobson AJ: An introduction to generalized linear models. 2001, Boca Raton, Florida: CRC Press, 2
We are grateful to P. Mardulyn for his hospitality at the Université Libre de Bruxelles (Brussels, Belgium) for the molecular analyses. We gratefully acknowledge B. Cederberg (Uppsala, Sweden), A. Cetkovic (Belgrade, Serbia), M. Cornalba (Pavia, Italy), T. De Meulemeester (Leiden, Netherlands), M. Franzen (Halle, Germany), A. Jenic (Ljubljana, Slovenia), A. Lachaud (Nantes, France), R. Moerman (Brussels, Belgium), O. Ponchau (Mons, Belgium), and M. Terzo (Mons, Belgium) for providing samples or for their help and support in sampling. The authors acknowledge the Fonds pour la recherche dans l’industrie et l’agriculture (FRIA) and Fonds pour la Formation à la Recherche Fondamentale et Collective (FNRS, FRFC 2.4613.10) for financial support. TL, PL and SD are FRIA grant holders. The research leading to these results has received funding from the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 244090, STEP Project (Status and Trends of European Pollinators, http://www.step-project.net). MV is funded by the FNRS. Partial financial support (IV) was provided by the Academy of Sciences of the Czech Republic (subvention for development of research organization RVO: 61388963). This project was supported by the network Bibliothèque du Vivant funded by the CNRS, the Muséum National d'Histoire Naturelle (MNHN) and the Institut National de la Recherche Agronomique (INRA), and technically supported by the Genoscope. Special thanks are also due to N. Brasero (Mons, Belgium) and S. Lambert (Mons, Belgium) for helpful comments and discussions on earlier versions of this manuscript. We also thank three anonymous referees for their constructive and useful comments. Authors acknowledge D. Baldock (Milford, UK), the Centre de Langues Vivantes (University of Mons), David Notton (the Natural History Museum, London, UK), Stuart P.M. Roberts (University of Reading, Reading, UK) for correcting the English. Computational resources have been provided by the Interuniversity Scientific Computing Facility center (iSCF) of the University of Namur and the High Performance Computing Centre of the University of Brussels (HPC cluster “Hydra”).
The authors declare that they have no competing interests.
TL and SD conceived the study and interpreted the results. The experiments were designed by TL. TL, SD, MV, and IV analysed the data. TL, SD, DM, PL, IV, J-YR, and PR contributed to materials/analysis tools. TL wrote the manuscript. All authors finalised the manuscript. All authors read and approved the final manuscript.