Open Access

Mitogenomic phylogenetic analyses of the Delphinidae with an emphasis on the Globicephalinae

  • Julia T Vilstrup1,
  • Simon YW Ho2,
  • Andrew D Foote1,
  • Phillip A Morin3,
  • Danielle Kreb4,
  • Michael Krützen5,
  • Guido J Parra6, 7,
  • Kelly M Robertson3,
  • Renaud de Stephanis8, 9,
  • Philippe Verborgh8,
  • Eske Willerslev1,
  • Ludovic Orlando1 and
  • M Thomas P Gilbert1Email author
BMC Evolutionary Biology201111:65

DOI: 10.1186/1471-2148-11-65

Received: 18 November 2010

Accepted: 10 March 2011

Published: 10 March 2011

Abstract

Background

Previous DNA-based phylogenetic studies of the Delphinidae family suggest it has undergone rapid diversification, as characterised by unresolved and poorly supported taxonomic relationships (polytomies) for some of the species within this group. Using an increased amount of sequence data we test between alternative hypotheses of soft polytomies caused by rapid speciation, slow evolutionary rate and/or insufficient sequence data, and hard polytomies caused by simultaneous speciation within this family. Combining the mitogenome sequences of five new and 12 previously published species within the Delphinidae, we used Bayesian and maximum-likelihood methods to estimate the phylogeny from partitioned and unpartitioned mitogenome sequences. Further ad hoc tests were then conducted to estimate the support for alternative topologies.

Results

We found high support for all the relationships within our reconstructed phylogenies, and topologies were consistent between the Bayesian and maximum-likelihood trees inferred from partitioned and unpartitioned data. Resolved relationships included the placement of the killer whale (Orcinus orca) as sister taxon to the rest of the Globicephalinae subfamily, placement of the Risso's dolphin (Grampus griseus) within the Globicephalinae subfamily, removal of the white-beaked dolphin (Lagenorhynchus albirostris) from the Delphininae subfamily and the placement of the rough-toothed dolphin (Steno bredanensis) as sister taxon to the rest of the Delphininae subfamily rather than within the Globicephalinae subfamily. The additional testing of alternative topologies allowed us to reject all other putative relationships, with the exception that we were unable to reject the hypothesis that the relationship between L. albirostris and the Globicephalinae and Delphininae subfamilies was polytomic.

Conclusion

Despite their rapid diversification, the increased sequence data yielded by mitogenomes enables the resolution of a strongly supported, bifurcating phylogeny, and a chronology of the divergences within the Delphinidae family. This highlights the benefits and potential application of large mitogenome datasets to resolve long-standing phylogenetic uncertainties.

Background

The mitochondrial genome is typically non-recombining, has a relatively high substitution rate, and has a smaller effective population size than the nuclear genome [1]. These properties can increase the probability of congruence between the mitochondrial gene tree and the species tree, helping to resolve relationships between recently divergent taxa [2]. Under some conditions, however, phylogenetic analyses of mitochondrial DNA (mtDNA) sequence data can fail to resolve the relationships among taxa into a fully bifurcating tree. Theoretical and empirical studies suggest that greater phylogenetic resolution and bootstrap support for inter-specific nodes should be achievable by increasing the amount of sequence data [37]. Additionally, including sequence data from more than one gene will reduce the influence of any variation between genes in phylogenetic signal due to selection or the effects of stochastic lineage sorting [8].

However, not all polytomic relationships are 'soft', and some cannot be resolved by adding sequence data [9]. In some cases even complete mitochondrial genomes have failed to resolve multifurcating relationships (e.g., [10]). Although unlikely [11], a 'hard' molecular polytomy could be the result of a true simultaneous speciation event into multiple daughter species [9, 1214]. Such events could occur when multiple populations of an ancestral species become simultaneously isolated during periods of rapid environmental change [12]. Simultaneous adaptive radiation into multiple species could also occur in sympatry due to intra-specific competition and assortative mating if species occupy a narrow niche width [15].

The use of high-throughput sequencing and pooled tagging methods [16] has the potential to generate large amounts of manifold-coverage sequence data for large numbers of samples quickly and at a relatively low cost, allowing improved resolution of phylogenetic relationships to be achieved routinely (e.g., [17]). Such an approach can allow the differentiation between rapid and simultaneous cladogenesis and, when combined with additional tests, can determine if a molecular polytomy reflects a true species polytomy (e.g., [1820]).

Previous phylogenetic studies using a variety of markers have suggested an episodic rapid rate of speciation within the Delphinidae family [2128]. This has resulted in several long-standing uncertainties in the phylogenetic relationships within this group. Analysis of sequences of the cytochrome b gene (1,140 bp) [21] led to a number of suggested taxonomic revisions from previous classifications based on morphology [29]. However, a number of these revisions had poor support and required further analysis, e.g., the inclusion of Grampus griseus (Risso's dolphin), and removal of Orcinus orca (killer whale) from the Globicephalinae subfamily, and the grouping of O. orca with the genus Orcaella into a proposed Orcininae subfamily. The genus Lagenorhynchus was found to be polyphyletic, with L. albirostris and L. acutus removed from the remaining four congeners and not found to be closely related to each other [21]. There was high support for paraphyly of the genera Tursiops and Stenella [21], which has been subsequently strengthened by mitogenomic and multi-locus analyses [25, 28]. More recent studies, which included both mtDNA and nuclear DNA (nuDNA), suggested an additional revision of the placement of Steno bredanensis (rough-toothed dolphin) with Orcaella [24, 26]; however, another multi-locus study grouped S. bredanensis with the Sotalia genus, consistent with previous classifications [27]. These more recent analyses using both nuDNA and mtDNA [24, 26, 27] have failed to confirm all of the revisions suggested by LeDuc et al. [21] and have not produced consistent estimates of relationships within this family. For example the ordering of the branches containing the species O. orca and L. albirostris, and the subfamilies Delphininae, Globicephilinae, and Lissodelphininae has differed considerably among studies [24, 26, 27]. In addition to being highly variable, the placement of L. albirostris has typically been one of the most weakly supported [21, 26, 27].

Here we estimate the phylogeny of Delphinidae using complete mitogenomes generated using high-throughput sequencing. This is the first time complete mitogenome sequences have been published for five of the species, and mitogenome sequences from a further three species have previously only been used as outgroup species for an intra-specific phylogenetic study on the killer whale [17]. In total, mitogenome sequences from 17 of the 37 extant species within Delphinidae were included in the analyses. This case study on a rapidly radiating group is one of the first to test the power of large sequence datasets produced by parallel-tagged high-throughput sequencing in combination with improved analytical techniques to address long-standing phylogenetic uncertainties. Specifically we test three revisions suggested by LeDuc et al. [21] regarding the placement of O. orca, G. griseus, and L. albirostris, and a further suggested revision regarding the placement of S. bredanensis [24]. We ultimately test the hypothesis that these uncertainties result from a true species polytomy caused by simultaneous speciation.

Results and Discussion

Sequencing

In total 18 mitogenome sequences were generated for this study, including multiple representatives per species. Eight of these sequences were incomplete (spanning between 10,681 and 16,672 bp), Table 1; Additional file 1), however at least 1 complete genome was sequenced for each species. The 10 complete sequences had on average 20× coverage of the whole mitogenome of approximately 16,445 bp. In combination with previously published mtDNA genome sequences we were able to use this dataset to reconstruct the most complete and highly resolved mitogenome phylogeny of Delphinidae to date (Figure 1).
Table 1

Samples used in study, including outgroup and Delphininae sequences from Genbank (accession numbers are given), and multiple specimens per species used in some analyses.

Species

Common name

Genbank Acc. No.

Source

Lagenorhynchus albirostris

White beaked dolphin

AJ554061

Ref. [46]

Feresa attenuata

Pygmy killer whale

JF289171

This study

Feresa attenuata

Pygmy killer whale

JF289172

This study

Peponocephala electra

Melon-headed whale

JF289175

This study

Peponocephala electra

Melon-headed whale

JF289176

This study

Globicephala macrorhynchus

Short-finned pilot whale

HM060334

Ref. [17]

Globicephala macrorhynchus*

Short-finned pilot whale

JF339975

This study

Globicephala macrorhynchus*

Short-finned pilot whale

JF339976

This study

Globicephala macrorhynchus

Short-finned pilot whale

JF339974

This study

Globicephala melas*

Long-finned pilot whale

JF339973

This study

Globicephala melas

Long-finned pilot whale

HM060333

Ref. [17]

Globicephala melas

Long-finned pilot whale

JF339972

This study

Pseudorca crassidens

False killer whale

JF289173

This study

Pseudorca crassidens

False killer whale

JF289174

This study

Pseudorca crassidens

False killer whale

HM060332

Ref. [17]

Grampus griseus

Risso's dolphin

EU557095

Ref. [28]

Orcaella brevirostris

Irrawaddy dolphin

JF289177

This study

Orcaella heinsohni*

Australian snubfin dolphin

JF339978

This study

Orcaella heinsohni*

Australian snubfin dolphin

JF339979

This study

Orcaella heinsohni*

Australian snubfin dolphin

JF339980

This study

Orcaella heinsohni*

Australian snubfin dolphin

JF339981

This study

Orcaella heinsohni

Australian snubfin dolphin

JF339977

This study

Orcinus orca

Killer whale

GU187186

Ref. [17]

Orcinus orca

Killer whale

GU187180

Ref. [17]

Steno bredanensis*

Rough-toothed dolphin

JF339982

This study

Stenella attenuata

Pantropical spotted dolphin

EU557096

Ref. [28]

Sousa chinensis

Indopacific humpbacked dolphin

EU557091

Ref. [28]

Tursiops truncatus

Common bottlenose dolphin

EU557093

Ref. [28]

Stenella coeruleoalba

Striped dolphin

EU557097

Ref. [28]

Delphinus capensis

Long-beaked common dolphin

EU557094

Ref. [28]

Tursiops aduncus

Indian Ocean bottlenose dolphin

EU557092

Ref. [28]

Monodon monoceros

Narwhal

AJ554062

Ref. [46]

Phocoena phocoena

Harbor porpoise

AJ554063

Ref. [46]

Inia geoffrensis

Amazon river dolphin

AJ554059

Ref. [46]

Lipotes vexillifer

Yangtze river dolphin

AY789529

Ref. [47]

Asterisks indicate incomplete sequences (see Additional file 1).

https://static-content.springer.com/image/art%3A10.1186%2F1471-2148-11-65/MediaObjects/12862_2010_Article_1670_Fig1_HTML.jpg
Figure 1

Bayesian phylogenetic reconstruction of selected taxa within Delphinidae based on analysis of 21 partitioned mitogenome sequences under the uncorrelated lognormal relaxed clock model. Node labels indicate posterior probabilities and node bars the 95% HPD of the estimated node age. Outgroup taxa used to root the tree include L. vexillifer, I. geoffrensis, M. monoceros, and P. phocoena, and are not shown. Illustrations are provided by U. Gorter, (not to scale).

Phylogenetic analyses

Phylogenetic estimates were consistent across Bayesian and maximum-likelihood methods used in this study, including partitioned and unpartitioned analyses on the full data set, and a subset consisting of a single representative per species. With one exception we were able to reject alternative topologies (Table 2). Based on Bayes factors, we found strong support for allowing different partitions of the mitogenome alignment to have distinct evolutionary models in the phylogenetic analysis (see Additional file 2). Although the harmonic-mean estimators of Bayes factors such as those implemented in the software packages used here can be unreliable [30, 31], the magnitude of our estimated Bayes factors (see Additional file 2) make it unlikely that we have been misled into selecting a suboptimal partitioning strategy for this alignment. In the partitioned analysis, the mean substitution rate across the 12 protein-coding genes and 2 rRNA genes was 9.86 × 10-3 substitutions/site/My, with a 95% highest posterior density (HPD) interval of 8.45 × 10-3 - 1.13 × 10-2 substitutions/site/My. The coefficient of variations of rates was 0.527 (95% HPD interval 0.405 - 0.648), indicating the presence of substantial rate variation among lineages. The fastest data partition (third codon sites) evolved at about 16.6 times the rate of the slowest data partition (second codon sites).
Table 2

The p-values of the approximately unbiased test and weighted or unweighted Kishino-Hasegawa and Shimodaira-Hasegawa tests are provided as well as the bootstrap and Bayesian posterior probabilities of the selected topology.

Test

rank

item

obs

au

np

bp

pp

kh

sh

wkh

wsh

A.

1

A1

-119.3

1.000

1.000

1.000

1.000

1.000

1.000

1.000

1.000

 

2

A3

119.3

<0.001

<0.001

0

<0.0001

0

0

0

0

 

3

A2

121.3

<0.001

<0.001

0

<0.0001

0

0

0

0

B.

1

B1

-5.0

0.902

0.882

0.883

0.993

0.876

0.926

0.876

0.974

 

2

B3

5

0.01

0.01

0.117

0.007

0.124

0.501

0.124

0.237

 

3

B2

178.4

<0.001

<0.001

0

<0.001

0

0

0

0

C.

1

C1

-554.2

1.000

1.000

1.000

1.000

1.000

1.000

1.000

1.000

 

2

C2

554.2

<0.001

<0.001

0

<0.001

0

0

0

0

D.

1

D2

-291.8

1.000

1.000

1.000

1.000

1.000

1.000

1.000

1.000

 

2

D1

291.8

<0.001

<0.001

0

<0.001

0

0

0

0

Tested alternative topologies are shown in Figure 2. Obs = observed log-likelihood difference to the best topology; au = approximately unbiased; np = bootstrap probability of the topology (i.e., the probability that the given topology has the largest likelihood in 10 scaled sets of 10,000 bootstrap replicates); bp = np with 10 non-scaled sets of 10,000 bootstrap replicates; pp = Bayesian posterior probabilities of the model; kh = Kishino-Hasegawa; sh = Shimodeira-Hasegawa; wkh = weighted KH; wsh = weighted SH.

Our analyses strongly support the placement by LeDuc et al. [21] of Grampus griseus within the Globicephalinae subfamily, a result that now finds wide support from a range of different markers [24, 26, 27]. In contrast to LeDuc et al.'s [21] revisions we were able to reject the suggested grouping of Orcinus orca with Orcaella: the proposed Orcininae subfamily. Instead we find that O. orca is sister taxon to the rest of the Globicephalinae subfamily, which also contains the Orcaella genus. We were also able to reject the placement of Steno bredanensis in Globicephalinae as proposed by Caballero et al. [24] and supported by McGowen et al. [26]. Traditionally, and based on morphology alone, S. bredanensis has been placed in the subfamily Stenoninae with the genera Sousa and Sotalia [29]. Sousa chinensis has since been moved to the Delphininae subfamily and our analyses suggest that S. bredanensis is sister taxon to the rest of the species within this clade. Membership of these subfamilies appears to be supported by shared derived morphological characteristics, however, a thorough cladistic morphological analysis at the genus level remains lacking.

Our analyses suggest Lagenorhynchus albirostris and the Delphinidae family are sister taxa, but neither approximately unbiased nor other topological tests (weighted or unweighted Kishino-Hasegawa and Shimodaira-Hasegawa tests) were able to reject an alternative topology (Table 2). However, maximum-likelihood bootstrap values and Bayesian posterior probabilities supported L. albirostris as the sister taxon of the Delphinidae family (88.3% and 0.993 respectively), leaving only marginal support for the two alternative topologies tested (Figure 2). The positioning of L. albirostris and the three major Delphinidae subfamilies, Globicephalinae, Delphininae, and Lissodelphininae (not represented in our analyses), has been inconsistent among published studies [e.g. [21, 22, 24, 26, 27]]. Such phylogenetic uncertainty suggests that this may represent a true species polytomy. However, unlike previous studies, our positioning of L. albirostris had high support (Figure 1), and the support of approximately unbiased tests for an alternative topology was only marginally significant (Table 2). Periods of rapid environmental change can lead to rapid or even simultaneous speciation events, which would result in a hard polytomy [12]. Our time-calibrated mitogenomic phylogeny placed the splitting of the three major subfamilies and L. albirostris during the late Miocene (11.6-5.3 Mya; Figure 1), a period of fluctuating temperature and sea level [32]. This dating is consistent with the period during which Steeman et al. [27] detected a significant increase in net diversification rates within Delphinidae. Therefore, we suggest that a rapid radiation during the period of extreme environmental fluctuations in the late Miocene best explains the lower support for the phylogenetic positioning of L. albirostris within Delphinidae.
https://static-content.springer.com/image/art%3A10.1186%2F1471-2148-11-65/MediaObjects/12862_2010_Article_1670_Fig2_HTML.jpg
Figure 2

The alternative topologies tested (Table 2). The letters in the alphanumeric label indicates the taxon whose position was tested (also in bold): A = Orcinus orca; B = Lagenorhynchus albirostris; C = Grampus griseus; D = Steno bredanensis, while the numbers correspond to item numbers in Table 2.

Consistent with geographical distribution, morphological differences, and a high number of fixed differences in the mtDNA control region [33], we found a deep phylogenetic divergence between Orcaella brevirostris and O. heinsohni (Figure 1). This divergence is dated to the Pliocene (5.3-2.6 Mya; Figure 1), a period characterised by elevated cyclical fluctuations in sea temperatures and sea levels with an overall trend of cooling temperatures and an increasing west-to-east sea surface temperature (SST) difference across the equatorial Pacific [3437]. Our data also indicate a rapid radiation of the other extant species within the Globicephalinae subfamily during this period of climatic variation.

Caveats and recommended future work

The inclusion of nuDNA loci in some of the recent studies [24, 26, 27] may explain the incongruencies noted above. The mitogenome is a single, maternally inherited, haploid locus and might not wholly reflect the underlying pattern of population divergence and lineage formation (e.g., [38]). Nuclear genes can have greater resolving power for deep-level phylogenetic inference [39] and, when combined with whole mitogenomes, they have also been able to improve resolution in cases of apparent incomplete lineage sorting due to rapid radiations [40]. Historical hybridization is another potential source of incongruence between mtDNA and nuDNA trees [2, 12]. This is a particularly important factor to consider if the Biological Species Concept rather than the Phylogenetic Species Concept is being used to define species boundaries, as reproductive isolation is a key criterion for species status under the former [41].

A recent study based on the control regions of the two pilot whale species, Globicephala melas and G. macrorhynchus, found no support for reciprocal monophyly [42]. It was suggested that this could be due to incomplete lineage sorting or a lack of sufficient data. We found complete lineage sorting with high monophyletic support for all species for which we had multiple representatives, including the two pilot whale species (Figure 1). However, we had small sample numbers from a limited geographical range for all taxa. Including single or low numbers of representatives of each species assumes that the individuals sequenced and included in the phylogenetic analyses are representative of a reciprocally monophyletic clade [41]. However, increased sampling may reveal instances of paraphyly or polyphyly [41]. Increased sampling and mitogenome sequencing, such as conducted on the killer whale [17], is needed to confirm complete lineage sorting for the remaining species included in this study. Increased taxon sampling, such as the inclusion of the species within the Lissodelphininae, may also further reduce phylogenetic error [7, 43, 44].

Conclusions

We have used complete mtDNA genome sequences to resolve the phylogenetic relationships within the Delphinidae with high support. Future studies could apply similar methods to resolve para- and polyphyletic genera, e.g., the four Lagenorhynchus species within the subfamily Lissodelphininae [45]. Our study further supports previous empirical and theoretical studies [37], that increasing sequence data can improve phylogenetic resolution, even in species known to have undergone rapid radiation in the recent past, and can help to discriminate between hard and soft polytomies. However, some clades will remain unresolved, most likely due to simultaneous speciation events [12]. With current high-throughput sequencing techniques, animal mitogenomes can be sequenced relatively quickly and at low cost, and are an attractive candidate for future phylogenetic analyses, particularly if combined with current analytical tools that can aid in the reconstruction of challenging phylogenies.

Methods

Sample collection, DNA extraction, amplification and sequencing

Epidermal tissue was obtained by remote biopsying of free-ranging animals [46] and stored in 20% dimethyl sulphoxide (DMSO) saturated with salt [47].

Genomic DNA was extracted from epidermis using the Qiagen DNeasy (Qiagen DNeasy, Valencia, CA, USA) kit following the manufacturer's guidelines. The mitochondrial genome was amplified in 3-5 overlapping amplicons (dependent on DNA preservation) using previously published long-range PCR primers [17] and primers designed specifically for this study (see Additional file 3). Additional sequencing primers were also designed for gap filling using conventional Sanger sequencing at the commercial service offered by Macrogen (Seoul, South Korea). Each 25 μl PCR contained 1 μl extracted DNA, 1× PCR buffer, 2 mM MgSO4, 0.4 μM of each primer, 100 nM mixed dNTPs and 0.1 μl High Fidelity Platinum Taq (Invitrogen, Carlsbad, California). PCR amplifications were performed using an MJ Thermocycler with a 4 minute activation step at 94°C, followed by 35 cycles of 94°C for 30 seconds, 62°C for 30 seconds, 68°C for 6 minutes 30 seconds, followed by a final extension period of 72°C for 7 minutes. For amplifications under 6 kb in size the extension time was reduced to 3-5 minutes.

The amplified PCR products were purified using an Invitek PCRapace purification kit (PCRapace, Invitek, Berlin, Germany) and quantified using a NanoDrop spectrophotometer (NanoDrop Products, Wilmington, DE) to determine DNA concentration used to balance and pool amplicons in equimolar ratios. Length of fragment, ratio of fragment lengths per individual, and DNA concentration was taken into account when balancing the samples. Samples were either individually tagged according to Meyer et al. [16] and built into shotgun sequencing libraries following the manufacturer's instructions (454 Life Sciences, Branford, CT), or grouped into sets of 8-10, where within each sample set individual libraries were made to contain a different 10 bp multiplexing identifier (MID) tag, allowing libraries to be combined prior to emulsion PCR. Sequencing libraries were quantified by qPCR [16] and pooled at equimolar concentrations. Library pools were divided among regions on GS FLX sequencing runs, using either LR70 or Titanium chemistry (454 Life Sciences). Sequencing data was parsed into individual extractions and identifier tags were removed using a custom tag-removal Perl script (M. Rasmussen, unpublished, University of Copenhagen).

Phylogenetic analyses

Sequences were assembled using gsMapper (Roche Applied Science, Indianapolis, IN, USA) and aligned by eye using SE-AL v2.0a11 (A. Rambaut, University of Oxford), while Geneious (Biomatters Ltd., Auckland, New Zealand) and Sequencher v4.8 (Genes Code Corporation, Ann Arbor, MI) were used to check coverage and sequence reliability. Conspicuous indels, base position differences, and differences in homopolymeric regions were double-checked and sequences with higher coverage were generally given preference. In order to ensure that the data was not affected by the erroneous incorporation of nuclear pseudogenes (numts) we visually assessed the recovered sequences for the presence of stop codons or frame-shift mutations in the aligned protein-coding genes. We observed no evidence that numts might be present in the data. This may be explained by a combination of (a) the general difficulty with PCR amplifying long amplicons, requiring relatively high levels of template for successful amplication, and thus (b) the fact that mtDNA template copy numbers are much higher than those for nuDNA templates, leading to preferential mtDNA over nuDNA amplification.

A total of 35 mitogenome sequences were used in the analyses, representing 21 species (Table 1). Of these, 18 mitogenomes were amplified and sequenced for this study and 17 mitogenomes attained from Genbank [17, 30, 48, 49], which included 4 outgroup sequences (narwhal, Monodon monoceros; harbor porpoise, Phocoena phocoena; Yangtze river dolphin, Lipotes vexillifer; Amazon river dolphin, Inia geoffrensis). The 18 generated sequences consist of 5 species (pygmy killer whale, Feresa attenuata; melon-headed whale, Peponocephala electra; Irrawaddy dolphin, Orcaella brevirostris; Australian snubfin dolphin, Orcaella heinsohni; rough-toothed dolphin, Steno bredanensis) whose mitogenomes had not been previously sequenced prior to this study.

A single representative mitogenome from each of the 21 species was used for the initial Bayesian phylogenetic analysis and divergence date estimation. This step was taken so that a speciation prior could be used for the tree topology and node times. Sequences were aligned and the 2 rRNA and 12 protein-encoding genes (excluding ND6) were used to form a data set comprising 13,958 sites. Stop codons were removed from all genes and the control region was excluded from analysis due to saturation, repetitive sequences, and alignment ambiguities. The resulting alignment was divided into four partitions: first, second and third codon sites of the protein-coding genes (3,792 bp per partition), and rRNA genes (2,582 bp). A comparison of Bayesian information criterion values in Modelgenerator [50] were used to find the optimal time-reversible substitution model for each partition. This criterion has been found to perform well in relation to other criteria used in evolutionary model selection [51]. The selected models were GTR+I+G for first and third codon sites, HKY+I+G for second codon sites, and TN93+I+G for the rRNA partition. In all cases, rate variation among sites was modelled using a gamma distribution with six categories [52]. There was little variation in the base frequencies across taxa (see Additional file 4).

The Bayesian phylogenetic analysis was performed using BEAST v1.6 [53]. An uncorrelated lognormal relaxed-clock model was used to allow rate variation among branches [54]. A Bayes-factor analysis indicated that this model received decisive support in comparison to a strict-clock model. The four data partitions shared the same relaxed clock but were allowed to have different relative rates. An exponential prior with a mean of 1/3 was used for the standard deviation of the lognormal distribution of rates, and a Yule prior was specified for the tree topology and relative divergence times. To enable the estimation of absolute divergence times in the tree, four calibrations based primarily on fossil calibrations, along with estimated divergence dates from published studies [27, 28, 55] (see Additional file 5), were incorporated into the analysis. The calibrations were implemented in the form of uniform prior distributions for the ages of the four nodes, and monophyly was enforced on the clades defined by these four nodes.

Posterior distributions of parameters, including the tree topology and divergence times, were estimated by Markov chain Monte Carlo (MCMC) sampling. Samples were drawn every 5,000 MCMC steps over a total of 50,000,000 steps. The first 10% of samples were discarded as burn-in. Convergence to the stationary distribution and acceptable mixing were investigated using the diagnostic software Tracer v1.5 (Rambaut and Drummond, 2007, University of Oxford). From the set of posterior samples, the tree with the highest product of clade credibilities was identified and the branch lengths were rescaled to match mean posterior estimates.

Additional phylogenetic analyses were performed in order to examine the effect of data partitioning. First, the analysis was repeated without data partitioning, so that the protein-coding genes and the rRNA genes shared the same substitution model and mean evolutionary rate. Second, analyses were performed on three data sets in which sites were randomly assigned to the four data partitions mentioned earlier. Randomisation of sites (sampling without replacement) was performed using the Java application SiteSampler v1.1 [56] Support for the different partitioning schemes was examined by assessing Bayes factors, calculated using a harmonic-mean estimator in the software Tracer v1.6 [57, 58].

A second set of phylogenetic analyses was performed on the full dataset (35 mitogenomes), including multiple representatives per species, coding missing data as N and using data partitioning for each codon position, rRNA, and control region. For each partition, the best model of molecular evolution that was compatible with models implemented in MrBayes 3.12 [59] was selected using the Bayesian information criterion. The selected models were TN93+I+G for the first and second codon sites, HKY+I+G for third codon sites and the control region, and HKY+I+G for the rRNA genes. Similar models were recovered after RY-recoding a hypervariable part of the control region (nucleotides 15556-15588 according to the Globicephala macrorhynchus, Genbank accession number HM060334, reference mitogenome). Bayesian phylogenetic analysis was performed using the 5-partition datasets (with and without RY-recoding of the control region) using MrBayes 3.12 [59]. Posterior distributions of parameters were estimated using two independent MCMC analyses, each comprising one cold and three heated chains. Samples from the posterior were drawn every 1,000 steps over a total of 10 million steps, which appeared to be sufficient to keep the average standard deviation of split frequencies below the critical value of 0.01. The first 25% of samples were discarded as burn-in. A majority-rule consensus tree was constructed from the posterior sample of trees. A supplemental set of analyses was performed with MrBayes 3.12 and PhyML 3.0 [60] using an unpartitioned dataset under a GTR+I+G model as selected using ModelTest [61] and Akaike Information Criterion, with and without RY-recoding of the hypervariable segment of the control region. All analyses yielded identical tree topologies and similar node support values. For maximum-likelihood analyses, the strength of the phylogenetic signal was assessed via non-parametric bootstrapping with 250 pseudo-replicates. In addition, using Consel 0.1 k [62] and site-wise likelihood values recovered from PhyML analyses, levels of statistical support for alternative topologies were evaluated from the p-values of approximately unbiased tests, weighted or unweighted unilateral Kishino-Hasegawa and Shimodeira-Hasegawa tests, and bootstrap and Bayesian posterior probabilities for the selected topologies (Table 2). All trees were drawn using Dendroscope [63].

Declarations

Acknowledgements

The authors thank the Danish Council for Independent Research: Natural Sciences, and the Danish National Research Foundation and the Systematics Research Fund from the Linnean Society for generous funding. We thank Bill Perrin, Sebastián Duchene Garzon, and Eric Archer, four anonymous reviewers, and the editors for critical input and comments on the manuscript. We also thank Morten Rasmussen for bioinformatics support, and we thank Uko Gorter for whale illustrations.

Authors’ Affiliations

(1)
Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen
(2)
School of Biological Sciences, University of Sydney
(3)
Southwest Fisheries Science Center, NOAA Fisheries
(4)
Yayasan Konservasi RASI
(5)
Evolutionary Genetics Group, Anthropological Institute and Museum, University of Zurich
(6)
School of Biological Sciences, Flinders University, Lincoln Marine Science Centre
(7)
South Australian Research and Development Institute (Aquatic Sciences)
(8)
CIRCE, Conservation Information and Research on Cetaceans
(9)
Departamento de Biologia de la Conservación, Estación Biologica de Donana, CSIC

References

  1. Brown WM, Prager EM, Wang A, Wilson AC: Mitochondrial DNA sequences of primates: tempo and mode of evolution. J Mol Evol. 1982, 18: 225-239. 10.1007/BF01734101.View ArticlePubMed
  2. Moore WS: Inferring phylogenies from mtDNA variation: Mitochondrial-gene trees versus nuclear-gene trees. Evolution. 1995, 49: 718-726. 10.2307/2410325.View Article
  3. Cummings MP, Otto SP, Wakeley J: Sampling properties of DNA sequence data in phylogenetic analysis. Mol Biol Evol. 1995, 12: 814-822.PubMed
  4. Saitou N, Nei M: The number of nucleotides required to determine the branching order of three species with special reference to the human-chimpanzee-gorilla divergence. J Mol Evol. 1986, 24: 189-204. 10.1007/BF02099966.View ArticlePubMed
  5. Ruvolo M, Disotell TR, Allard MW, Brown WM, Honeycutt RL: Resolution of the African homonid trichotomy by use of a mitochondrial gene sequence. Proc Natl Acad Sci USA. 1991, 88: 1570-1574. 10.1073/pnas.88.4.1570.View ArticlePubMedPubMed Central
  6. DeFilippis VR, Moore WS: Resolution of phylogenetic relationships among recently evolved species as a function of amount of DNA sequence: An empirical study based on woodpeckers (Aves: Picidae). Mol Phyl Evol. 2000, 16: 143-160. 10.1006/mpev.2000.0780.View Article
  7. Rokas A, Carroll SB: More genes or more taxa? The relative contribution of gene number and taxon number to phylogenetic accuracy. Mol Biol Evol. 2005, 22: 1337-1344. 10.1093/molbev/msi121.View ArticlePubMed
  8. Rohland N, Malaspinas AS, Pollack JL, Slatkin M, Matheus P, Hofreiter M: Proboscidean mitogenomics: Chronology and mode of elephant evolution using mastodon as outgroup. PLoS Biol. 2007, 5: e207-10.1371/journal.pbio.0050207.View ArticlePubMedPubMed Central
  9. Maddison W: Reconstructing character evolution on polytomous cladograms. Cladistics. 1989, 5: 365-377. 10.1111/j.1096-0031.1989.tb00569.x.View Article
  10. Willerslev E, Gilbert MTP, Binladen J, et al: Analysis of complete mitochondrial genomes from extinct and extant rhinoceroses reveals lack of phylogenetic resolution. BMC Evol Biol. 2009, 9: 95-10.1186/1471-2148-9-95.View ArticlePubMedPubMed Central
  11. Hudson RR: Gene genealogies and the coalescent process. Oxford Surv Evol Biol. 1990, 7: 1-44.
  12. Hoelzer GA, Melnick DJ: Patterns of speciation and limits to phylogenetic resolution. Trends Ecol Evol. 1994, 9: 104-107. 10.1016/0169-5347(94)90207-0.View ArticlePubMed
  13. Jackman TR, Larson A, de Queiroz K, Losos JB: Phylogenetic relationships and tempo of early diversification in Anolis lizards. Syst Biol. 1999, 48: 254-285. 10.1080/106351599260283.View Article
  14. Walsh HE, Kidd MG, Moum T, Friesen VL: Polytomies and the power of phylogenetic inference. Evolution. 1999, 53: 932-937. 10.2307/2640732.View Article
  15. Bolnick DI: Multi-species outcomes in a common model of sympatric speciation. J Theor Biol. 2006, 241: 734-744.View ArticlePubMed
  16. Meyer M, Stenzel U, Hofreiter M: Parallel tagged sequencing on the 454 platform. Nature Protocols. 2008, 3: 267-278. 10.1038/nprot.2007.520.View ArticlePubMed
  17. Morin PA, Archer FI, Foote AD, et al: Complete mitochondrial genome phylogeographic analysis of killer whales (Orcinus orca) indicates multiple species. Genome Res. 2010, 20: 908-916. 10.1101/gr.102954.109.View ArticlePubMedPubMed Central
  18. Shen YY, Liang L, Sun YB, Yue BS, Yang XJ, Murphy RW, Zhang YP: A mitogenomic perspective on the ancient, rapid radiation in the Galliformes with an emphasis on the Phasianidae. BMC Evol Biol. 2010, 10: 132-10.1186/1471-2148-10-132.View ArticlePubMedPubMed Central
  19. Shimodaira H, Hasegawa M: Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol Biol Evol. 1999, 16: 1114-1116.View Article
  20. Slowinski JB: Molecular polytomies. Mol Phyl Evol. 2001, 19: 114-120. 10.1006/mpev.2000.0897.View Article
  21. LeDuc RG, Perrin WF, Dizon AE: Phylogenetic relationships among the Delphinid cetaceans based on full cytochrome b sequences. Mar Mamm Sci. 1999, 15: 619-648. 10.1111/j.1748-7692.1999.tb00833.x.View Article
  22. May-Collado L, Agnarsson I: Cytochrome b and Bayesian inference of whale phylogeny. Mol Phyl Evol. 2006, 38: 344-354. 10.1016/j.ympev.2005.09.019.View Article
  23. Nishida S, Goto M, Pastene LA, Kanda N, Koike H: Phylogenetic relationships among cetaceans revealed by Y-chromosome sequences. Zool Sci. 2007, 24: 723-732. 10.2108/zsj.24.723.View ArticlePubMed
  24. Caballero S, Jackson J, Mignucci-Giannoni AA, Barrios-Garrido H, Beltrán-Pedreros S, Montiel-Villalobos MG, Robertson KM, Baker CS: Molecular systematics of South American dolphins Sotalia: sister taxa determination and phylogenetic relationships, with insights into a multilocus phylogeny of the Delphinidae. Mol Phyl Evol. 2008, 46: 252-268. 10.1016/j.ympev.2007.10.015.View Article
  25. Kingston SE, Adams LD, Rosel PE: Testing mitochondrial sequences and anonymous nuclear markers for phylogeny reconstruction in a rapidly radiating group: molecular systematics of the Delphinidae (Cetacea: Odontoceti: Delphinidae). BMC Evol Biol. 2009, 9: 245-10.1186/1471-2148-9-245.View ArticlePubMedPubMed Central
  26. McGowen MR, Spaulding M, Gatesy J: Divergence date estimation and a comprehensive molecular tree of extant cetaceans. Mol Phyl Evol. 2009, 53: 891-906. 10.1016/j.ympev.2009.08.018.View Article
  27. Steeman ME, Hebsgaard MB, Fordyce RE, et al: Radiation of extant cetaceans driven by restructuring of the oceans. Syst Biol. 2009, 58: 573-585. 10.1093/sysbio/syp060.View ArticlePubMedPubMed Central
  28. Xiong YE, Brandley MC, Xu S, Zhou K, Yang G: Seven new dolphin mitochondrial genomes and a time-calibrated phylogeny of whales. BMC Evol Biol. 2009, 9: 20-10.1186/1471-2148-9-20.View ArticlePubMedPubMed Central
  29. Perrin WF: Dolphins, porpoises, and whales. An action plan for the conservation of biological diversity: 1988-1992, IUCN, Gland, Switzerland. 1989
  30. Lartillot N, Philippe H: Computing Bayes factor using thermodynamic integration. Syst Biol. 2005, 55: 195-207. 10.1080/10635150500433722.View Article
  31. Xie W, Lewis PO, Fan Y, Kuo L, Chen M-H: Improving marginal likelihood estimation for Bayesian phylogenetic model selection. Syst Biol. 2011.
  32. Haq BH, Hardenbol J, Vail PR: Chronology of fluctuating sea levels since the Triassic. Science. 1987, 235: 1156-1162. 10.1126/science.235.4793.1156.View ArticlePubMed
  33. Beasley I, Robertson KM, Arnold P: Description of a new dolphin, the Australian snubfin dolphin Orcaella heinsohni sp. N. (Cetacea, Delphinidae). Mar Mam Sci. 2005, 21: 365-400. 10.1111/j.1748-7692.2005.tb01239.x.View Article
  34. Dwyer GS, Cronin TM, Baker PA, Raymo ME, Buzas JS, Corrège T: North Atlantic deepwater temperature change during late Pliocene and late Quaternary climactic cycles. Science. 1995, 270: 1347-1351. 10.1126/science.270.5240.1347.View Article
  35. Ravelo AC, Andreasen DH, Lyle M, Olivarez-Lyle A, Wara MW: Regional climate shifts caused by gradual global cooling in the Pliocene epoch. Nature. 2004, 429: 263-267. 10.1038/nature02567.View ArticlePubMed
  36. Wara MW, Ravelo A-C, Delaney ML: Permanent El Niño-like conditions during the Pliocene warm period. Science. 2005, 309: 758-761. 10.1126/science.1112596.View ArticlePubMed
  37. Lawrence KT, Liu Z, Herbert TD: Evolution of the Eastern Tropical Pacific through Pio-Pleistocene glaciation. Science. 2006, 312: 79-83. 10.1126/science.1120395.View ArticlePubMed
  38. Weisrock DW, Shaffer HB, Storz BL, Storz SR, Voss AR: Multiple nuclear gene sequences identify species boundaries in the rapidly radiating clade of Mexican ambystomatid salamanders. Mol Ecol. 2006, 15: 2489-2503. 10.1111/j.1365-294X.2006.02961.x.View ArticlePubMed
  39. Springer MS, DeBry RW, Douady C, Amrine HM, Madsen O, de Jong WW, Stanhope MJ: Mitochondrial Versus Nuclear Gene Sequences in Deep-Level Mammalian Phylogeny Reconstruction. Mol Biol Evol. 2001, 18 (2): 132-143.View ArticlePubMed
  40. McCracken KG, Sorenson MD: Is homoplasy or lineage sorting the source of incongruent mtDNA and nuclear gene trees in the stiff-tailed ducks (Nomonyx-Oxyura). Syst Biol. 2005, 54: 35-55. 10.1080/10635150590910249.View ArticlePubMed
  41. Milinkovitch , LeDuc , Tiedemann , Dizon : Applications of Molecular Data in Cetacean Taxonomy and Population Genetics with Special Emphasis on Defining Species Boundaries. 'Marine Mammals: Biology and Conservation'. Edited by: Evans PGH, J A. 2002, Kluwer Academic Press, 325-359.View Article
  42. Oremus M, Gales R, Dalebout ML, Funahashi N, Endo T, Kage T, Steel D, Baker SC: Worldwide mitochondrial DNA diversity and phylogeography of pilot whales (Globicephala spp). Biol J Linn Soc. 2009, 98: 729-744. 10.1111/j.1095-8312.2009.01325.x.View Article
  43. Pollock DD, Zwickl DJ, McGuire JA, Hillis DM: Increased taxon sampling is advantageous for phylogenetic inference. Syst Biol. 2002, 51 (4): 664-671. 10.1080/10635150290102357.View ArticlePubMedPubMed Central
  44. Zwickl DJ, Hillis DM: Increased taxon sampling greatly reduces phylogenetic error. Syst Biol. 2002, 51 (4): 588-598. 10.1080/10635150290102339.View ArticlePubMed
  45. Harlin-Cognato AD, Honeycutt RL: Multi-locus phylogeny of dolphins in the subfamily Lissodelphininae: character synergy improves phylogenetic resolution. BMC Evol Biol. 2006, 6: 87-10.1186/1471-2148-6-87.View ArticlePubMedPubMed Central
  46. Palsbøll PJ, Larsen F, Hansen ES: Sampling of skin biopsies from free-ranging large cetaceans in West Greenland: development of new biopsy tips and bolt designs. Rep Int Whaling Commn. 1991, 71-79. Special 13
  47. Amos W, Hoelzel AR: Long-term preservation of whale skin for DNA analysis. Rep Int Whaling Commn. 1991, 99-103. Special 13
  48. Arnason U, Gullberg A, Janke A: Mitogenomic analyses provide new insights into cetacean origin and evolution. Gene. 2004, 333: 27-34. 10.1016/j.gene.2004.02.010.View ArticlePubMed
  49. Yan J, Zhou K, Yang G: Molecular phylogenetics of river dolphin's and the baiji mitochondrial genome. Mol Phyl Evol. 2005, 37: 743-750. 10.1016/j.ympev.2005.06.009.View Article
  50. Keane TM, Creevey CJ, Pentony MM, Naughton TJ, McInerney JO: Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol Biol. 2006, 6: 29-10.1186/1471-2148-6-29.View ArticlePubMedPubMed Central
  51. Luo A, Qiao H, Zhang Y, Shi W, Ho SYW, Xu W, Zhang A, Zhu C: Performance of criteria for selecting evolutionary models in phylogenetics: a comprehensive study based on simulated datasets. BMC Evol Biol. 2010, 10: 242-10.1186/1471-2148-10-242.View ArticlePubMedPubMed Central
  52. Yang Z: Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. Mol Biol Evol. 1993, 10: 1396-1401.PubMed
  53. Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.View ArticlePubMedPubMed Central
  54. Drummond AJ, Ho SYW, Phillips MJ, Rambaut A: Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006, 4: e88-10.1371/journal.pbio.0040088.View ArticlePubMedPubMed Central
  55. Aguirre-Fernandez G, Barnes LG, Aranda-Manteca FJ, Fernandez-Rivera JR: Protoglobicephala mexicana, a new genus and species of Pliocene fossil dolphin (Cetacea; Odontoceti; Delphinidae) from the gulf of California, Mexico. Boletin de la Sociodad Geologica Mexicana. 2009, 61 (2): 245-265.
  56. Ho SYW, Lanfear R: Characterising rate variation in cetacean mitogenomes using codon-partitioned relaxed clocks. Mitochondrial DNA. 2010, 21 (3-4): 138-146. 10.3109/19401736.2010.494727.View ArticlePubMed
  57. Rambaut A, Drummond A: Tracer, version 1.4. 2007, [http://tree.bio.ed.ac.uk/software/tracer]
  58. Suchard MA, Weiss RE, Sinsheimer JS: Bayesian selection of continuous-time Markov chain evolutionary models. Mol Biol Evol. 2001, 18 (6): 1001-1013.View ArticlePubMed
  59. Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics Applications Note. 2001, 17 (8): 754-755.View Article
  60. Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52 (5): 696-704. 10.1080/10635150390235520.View ArticlePubMed
  61. Posada D, Crandall KA: Modeltest: testing the model of DNA substitution. Bioinformatics. 1998, 14 (9): 817-818. 10.1093/bioinformatics/14.9.817.View ArticlePubMed
  62. Shimodaira H, Hasegawa M: CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics. 2001, 17 (12): 1246-1247. 10.1093/bioinformatics/17.12.1246.View ArticlePubMed
  63. Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, Rupp R: Dendroscope: An interactive viewer for large phyogenetic trees. BMC Bioinformatics. 2007, 8: 460-10.1186/1471-2105-8-460.View ArticlePubMedPubMed Central

Copyright

© Vilstrup et al; licensee BioMed Central Ltd. 2011

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.