Vanishing native American dog lineages
- Santiago Castroviejo-Fisher†1, 2, 3,
- Pontus Skoglund†1,
- Raúl Valadez4,
- Carles Vilà5 and
- Jennifer A Leonard1, 5Email author
© Castroviejo-Fisher et al; licensee BioMed Central Ltd. 2011
Received: 3 November 2010
Accepted: 21 March 2011
Published: 21 March 2011
Dogs were an important element in many native American cultures at the time Europeans arrived. Although previous ancient DNA studies revealed the existence of unique native American mitochondrial sequences, these have not been found in modern dogs, mainly purebred, studied so far.
We identified many previously undescribed mitochondrial control region sequences in 400 dogs from rural and isolated areas as well as street dogs from across the Americas. However, sequences of native American origin proved to be exceedingly rare, and we estimate that the native population contributed only a minor fraction of the gene pool that constitutes the modern population.
The high number of previously unidentified haplotypes in our sample suggests that a lot of unsampled genetic variation exists in non-breed dogs. Our results also suggest that the arrival of European colonists to the Americas may have led to an extensive replacement of the native American dog population by the dogs of the invaders.
Dogs colonized the Americas with early human groups from Asia  and were widespread by the time Europeans arrived late in the 15th century . Most of the dogs around the world today have mitochondrial DNA (mtDNA) control region sequences that form a well-defined phylogenetic clade (Clade I) . Genetic characterization of ancient American dogs (samples obtained from human settlements that had not been in contact with Europeans, hereafter referred to as native American dogs) revealed a unique set of mtDNA sequences that clustered within this clade, but have not been observed in extant dogs [e.g. [3–6]]. Most notably, a subclade (Ia) has so far only been identified in ancient American dogs and had a frequency of 62% in ancient Latin American animals . However, most genetic studies are based on purebred dogs, and since most internationally recognized breeds today are primarily European or Asian in origin, it is possible that American dog lineages have been excluded.
Results and discussion
In the total sample of 400 modern American dogs (Genbank numbers in Table S1), we identified 40 unique mtDNA haplotypes, of which 23 (57.5%) had not been identified in previous studies that included samples from around the world. Haplotypes were widespread across sampled localities and we did not detect any geographic pattern. This shows that significant undescribed diversity is likely to be present in non-purebred dogs . In addition, the level of mtDNA nucleotide diversity in the present Latin American population (θπ = 0.017, = 40,000) indicates an effective population size similar to that in ancient America (θπ = 0.015, = 35,000).
We investigated the probability of this result under different demographic models and found that the observation of a private haplogroup in the ancient sample with a frequency as high as Ia allows rejection of complete continuity between ancient and present populations for an extensive range of assumptions. For the constant size population model (see Methods), we found that all models with an assumed N e > 3,000 could be rejected at the 5% significance level, and the population expansion scenario was similarly rejected for initial N e > 2,100 (P < 0.05).
Arrival by Columbus to the Americas in 1492 was quickly followed by the arrival of conquistadors, missionaries and colonists from Europe with their livestock, pets, commensals and pathogens, all of which had an important impact on native American populations and culture [7, 8]. Our results show that native American dog populations were also impacted. The extent of this impact is unexpected because of the large historical population size of dogs in the Americas and the existence of potential refugia (e.g. isolated human groups) where native lineages could have survived. Several factors might have contributed to this replacement, including direct persecution [7, 9], preference for the often larger newly arrived dogs, or susceptibility to introduced infectious diseases. Future studies including more ancient and modern dogs and more genetic markers, such as neutral autosomal markers, genes of known function and Y-chromosome markers, will contribute to a deeper understanding of the causes of and extent to which the native American dog population has changed since the arrival of Columbus.
Using molecular data and statistical modeling, we demonstrate that an important amount of mitochondrial haplotype diversity exists in undersampled non-breed dog populations, and that the breadth of the impact of post-Columbian colonization on the Americas has been underestimated. The extensive replacement of the native American dog population inferred from our data set illustrates that even cultural and biological elements that are not specific targets of invaders can be profoundly affected at a continental scale and in a short period of time.
We used published primers (ThrL and DLHc) and PCR conditions  to amplify and sequence in both directions a fragment of 425 bp from the mitochondrial control region (Table S1). Sequences were compared to available sequences in GenBank with megablast to check for nuclear insertions and new haplotypes. Additional published sequences from dogs [1, 3, 4, 10] and wolves [10–14] were included to place our study in a historical and geographical framework (e.g. to root phylogenetic trees) as in [1–4].
We used the E-INS-i option in MAFFT  to align all sequences included in the study. We performed phylogenetic analysis of the complete dataset with and without removing redundant haplotypes using MacClade 4.06 . We ran 1000 searches and 2000 bootstrap replicates in Garli0.96  to search for a maximum likelihood topology under the model TrN+I+G selected by Modeltest 3.7  using the Akaike information criterion. Sequences that clustered in Clade I , were further analyzed by constructing haplotype networks. We focused on Clade I because all but one of the ancient American dog sequences cluster within this clade. We built median-joining  haplotype networks in NETWORK 4.5 http://www.fluxus-engineering.com/ and a statistical parsimony network in TCS 1.21  with gaps as an informative fifth state. Both methods yielded similar results and only the result of the former analysis is shown.
We estimated mitochondrial effective population size ( ) for the ancient sequences described by reference  and a corresponding subset of our modern sequences where all samples collected north of Mexico were excluded in order to better match the geographical range of the ancient dogs from reference  that died before Columbus first arrived to the Americas. We used nucleotide diversity π as a direct estimate of the population-scaled mutation rate θ π and the expression θ π = 2N e μ where μ is the mutation rate per nucleotide and generation (note that N e will be affected by inclusion of multiple clades). We computed π using DNAsp v.5.00.07 , and used a conservative estimate of μ in the dog mitochondrial control region that assumed a divergence time between gray wolves and coyotes 2 million years ago (2.13 × 10-7 per generation) . We note that assuming a 1 million year divergence time (e.g. ) would result in higher estimates of and thus decrease the probability of observing large differences between samples from different time points even further. We also note that mtDNA data is not optimal for accurately estimating in natural populations, but its use here is motivated by our interest for the possible magnitude of genetic drift acting on American dog mitochondria, particularly in the last ~1000 years (see below), and by the fact that no autosomal sequence data is currently available.
We investigated the probability of observing a private haplogroup in the ancient sample with a frequency as high as Clade Ia in a hypothesis-testing framework for three different population genetic models for the demographic history of American dogs using Serial Sim-Coal [23, 24], the only coalescent simulator currently available that allows modeling of both population structure and samples from different time points. By simulating 10,000 independent genealogical histories and investigating the resulting topologies with custom scripts, we approximated the probability that at least 8 of 13 lineages sampled 1000 years ago (corresponding to the ancient Latin American samples)  would be monophyletic with respect to 299 lineages sampled at the present (corresponding to modern Latin American samples included in the study; Figure 3A), given different assumptions about N e . The general approach of estimating the possible contribution of Native American dogs to the extant gene pool in a simulation-based hypothesis-testing framework was chosen because few analytical tools (i.e. mathematical models) have been developed to deal with probabilities under the structured coalescent with temporal samples. Simulations thus provide a flexible alternative.
First, we tested a demographic model with a single continuous population that either maintained a constant size during all of its history or started growing exponentially 200 generations ago to an N e of 15,000. To investigate the possible contribution of the native American dog population to the current population, we constructed an isolation-admixture model more in agreement with historical evidence (Figure 3B). Starting from a single population at the present, the lineages in the population were divided into two isolated populations 400 years ago (representing the post-Columbian colonization of the Americas by Europeans) with probability c and 1-c, respectively. At 1000 years ago, 13 new samples were taken from one of the subpopulations (New World, representing the data available on ancient Latin American dogs ). At 13,000 years further in the past the American population underwent a bottleneck, implying a tenfold reduction on its effective size, and was joined together again with the Old World population at 14,000 years ago, representing the isolation of American dogs from the Eurasian populations from which they originated . The exact time American and Eurasian populations were isolated is the subject of much controversy, so we use this conservative number (14,000 years before present). However, we found that the timing of this event and the severity of the bottleneck had only marginal effects on the probability of a private ancient haplogroup compared to the admixture proportions. We investigated the effect of assuming a generation time of both 2 and 3 years.
We thank all that have provided samples (Table S1), L. F. Groenelveld for help constructing the network and Mattias Jakobsson for comments on a previous draft of the manuscript. This research was supported by the Swedish Research Council, Carl Tryggers Foundation, CSIC, Universidad de los Andes, Fulbright/Ministry of Education (Spain) and Programa de Captación del Conocimiento para Andalucía (Spain).
- Leonard JA, Wayne RK, Wheeler J, Valadez R, Guillén S, Vilà C: Ancient DNA Evidence for Old World Origin of New World Dogs. Science. 2002, 298: 1613-10.1126/science.1076980.View ArticlePubMed
- Snyder LM, Leonard JA: Dog. Handbook of North American Indians. Enviroments, origins and populations. Edited by: Sturtevant WC. 2006, Washington: Smithsonian Institution, 3: 452-462.
- Vilà C, Savolainen P, Maldonado JE, Amorim IR, Rice JE, Honeycutt RL, Crandall KA, Lundeberg J, Wayne RK: Multiple and ancient origins of the domestic dog. Science. 1997, 276: 1687-1689.View ArticlePubMed
- Savolainen P, Zhang J, Luo J, Lundeberg J, Leitner T: Genetic evidence for an East Asian origin of domestic dogs. Science. 2002, 298: 1610-1613. 10.1126/science.1073906.View ArticlePubMed
- Boyko AR, Boyko RH, Boyko CM, Parker HG, Castelano M, Corey L, Degenhardt J, Auton A, Hedimbl L, Kityo R, Ostrander EA, Shoenebeck J, Todhunter RJ, Jones P, Bustamante CD: Complex population structure in African village dogs and its implication for inferring dog domestication history. Proc Natl Acad Sci USA. 2009, 106: 13903-13908. 10.1073/pnas.0902129106.View ArticlePubMedPubMed Central
- Pang J-F, Kluetsch C, Zou X-J, Zhang A, Luo L-Y, Angleby H, Ardalan A, Ekström C, Sköllermo A, Lundeberg J, Matsumura S, Leitner T, Zhang Y-P, Savolainen P: mtDNA Data Indicate a Single Origin for Dogs South of Yangtze River, Less Than 16,300 Years Ago, from Numerous Wolves. Mol Biol Evol. 2009, 26: 2849-2864. 10.1093/molbev/msp195.View ArticlePubMedPubMed Central
- Crosby AW: The Columbian exchange: biological and cultural consequences of 1492. 1972, Westport: Greenwood press
- Diamond J: Guns, germs, and steel: the fates of human societies. 1997, New York: Norton
- Valadez R, Mendoza V: El perro como legado cultural. Nuevos Aportes, Revista de Arqueología Boliviana. 2005, 2: 14-32.
- Muñoz-Fuentes V, Darimont CT, Paquet P, Leonard JA: The genetic legacy of extirpation and re-colonization in Vancouver Island wolves. Cons Gen. 2010, 11: 547-556. 10.1007/s10592-009-9974-1.View Article
- Vilà C, Amorim IR, Leonard JA, Posada D, Castroviejo J, Petrucci-Fonseca F, Crandall KA, Ellegren H, Wayne RK: Mitochondrial DNA phylogeography and population history of the grey wolf Canis lupus. Mol Ecol. 1999, 8: 2089-2103.View ArticlePubMed
- Sharma DK, Maldonado JE, Jhala YV, Fleischer RC: Ancient wolf lineages in India. Proc R Soc Lond B. 2004, 271 (Suppl 3): S1-S4. 10.1098/rsbl.2003.0071.View Article
- Musiani M, Leonard JA, Cluff HD, Gates CC, Mariani S, Paquet PC, Vilá C, Wayne RK: Differentiation of tundra/taiga and boreal coniferous forest wolves, coat color and association with migratory caribou. Mol Ecol. 2007, 16: 4149-10.1111/j.1365-294X.2007.03458.x.View ArticlePubMed
- Muñoz-Fuentes V, Darimont CT, Wayne RK, Paquet P, Leonard JA: Ecological factors drive differentiation in wolves from British Columbia. J Biogeogr. 2009, 36: 1516-1531. 10.1111/j.1365-2699.2008.02067.x.View Article
- Katoh K, Toh H: Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinform. 2008, 9: 286-10.1093/bib/bbn013.View ArticlePubMed
- Maddison DR, Maddison WP: MacClade: analysis of phylogeny and character evolution. 2000, Vers. 4.0 Sinauer: Sunderland
- Zwickl DJ: Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion. Ph.D. Thesis. 2006, The University of Texas at Austin
- Posada D, Crandall KA: Modeltest: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-10.1093/bioinformatics/14.9.817.View ArticlePubMed
- Bandelt HJ, Forster P, Rohl A: Median-joining phylogenies for inferring intraspecific phylogenies. Mol Biol Evol. 1999, 16: 37-View ArticlePubMed
- Clement M, Posada D, Crandall K: TCS: a computer program to estimate gene genealogies. Mol Ecol. 2000, 9: 1657-10.1046/j.1365-294x.2000.01020.x.View ArticlePubMed
- Librado P, Rozas J: DNAsp v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25: 1451-1452. 10.1093/bioinformatics/btp187.View ArticlePubMed
- Leonard JA, Wayne RK: Native Great Lakes wolves were not restored. Biol Lett. 2008, 4: 95-98. 10.1098/rsbl.2007.0354.View ArticlePubMedPubMed Central
- Excoffier L, Novembre J, Schneider S: SIMCOAL: a general coalescent program for simulation of molecular data in interconnected populations with arbitrary demography. J Hered. 2000, 91: 506-509. 10.1093/jhered/91.6.506.View ArticlePubMed
- Anderson CNK, Ramakrishnan U, Chan YL, Hadly EA: Serial SimCoal: A population genetic model for data from multiple populations and points in time. Bioinformatics. 2005, 21: 1733-1734. 10.1093/bioinformatics/bti154.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.