Large-scale mitochondrial DNA analysis in Southeast Asia reveals evolutionary effects of cultural isolation in the multi-ethnic population of Myanmar
© Summerer et al.; licensee BioMed Central Ltd. 2014
Received: 1 October 2013
Accepted: 26 January 2014
Published: 28 January 2014
Myanmar is the largest country in mainland Southeast Asia with a population of 55 million people subdivided into more than 100 ethnic groups. Ruled by changing kingdoms and dynasties and lying on the trade route between India and China, Myanmar was influenced by numerous cultures. Since its independence from British occupation, tensions between the ruling Bamar and ethnic minorities increased.
Our aim was to search for genetic footprints of Myanmar’s geographic, historic and sociocultural characteristics and to contribute to the picture of human colonization by describing and dating of new mitochondrial DNA (mtDNA) haplogroups. Therefore, we sequenced the mtDNA control region of 327 unrelated donors and the complete mitochondrial genome of 44 selected individuals according to highest quality standards.
Phylogenetic analyses of the entire mtDNA genomes uncovered eight new haplogroups and three unclassified basal M-lineages. The multi-ethnic population and the complex history of Myanmar were reflected in its mtDNA heterogeneity. Population genetic analyses of Burmese control region sequences combined with population data from neighboring countries revealed that the Myanmar haplogroup distribution showed a typical Southeast Asian pattern, but also Northeast Asian and Indian influences. The population structure of the extraordinarily diverse Bamar differed from that of the Karen people who displayed signs of genetic isolation. Migration analyses indicated a considerable genetic exchange with an overall positive migration balance from Myanmar to neighboring countries. Age estimates of the newly described haplogroups point to the existence of evolutionary windows where climatic and cultural changes gave rise to mitochondrial haplogroup diversification in Asia.
Myanmar (Burma), the largest country in Mainland Southeast Asia (SEA), covers an area of 676,578 km2 and is inhabited by ~55 million people. The fast evolutionary rate  and the non-recombining uniparental inheritance  of the mitochondrial DNA (mtDNA) generally qualifies mtDNA as highly potent marker for population and phylogenetic studies and mtDNA analyses have a long tradition in the exploration of human evolution . Thanks to increasing knowledge on its mutation rate [4–7] mtDNA is also a valid tool for age estimates. Although Myanmar plays a crucial role for the population history of Southeast Asia , due to the long-lasting isolation of the country by its political regime, only very few mitochondrial DNA (mtDNA) data are available so far . In order to close this gap on the genetic map of Southeast Asia, we collected DNA samples from 327 unrelated donors originating from 13 of the 14 political regions representing the most important ethnic groups of Myanmar and genotyped the entire mitochondrial control region (16024–16569; 1–576) of all samples and the entire mitochondrial genome of a subset of 44 selected samples.
This dataset from Myanmar is of great historic interest, because SEA is a key region of human population history with a first entry of anatomically modern humans of African descent about 60,000 years ago [10, 11], who continued their way through the coastal route to Island SEA and Australia . Following the glacial retreat in that area, also a north- and eastward migration towards the Yangtse and Yellow River basins of the ancestors of Sino-Tibetan tribes began . So, also the initial colonization of China and the rest of East Asia had its origin in SEA [12, 13]. Much later, probably driven by a Neolithic agricultural revolution, the Tibeto-Burman (Burmese-Lolo and Karen) branches of Sino-Tibetans moved back southwards through Yunnan to Myanmar and the SEA peninsula [11, 14, 15]. Ruled by changing kingdoms and dynasties , occupied by the British Empire (1824–1948) and lying on the trade route between India and China , Myanmar was influenced by a variety of cultures.
Analyzing mtDNA data from Myanmar is of great genetic interest, because in spite of accumulating knowledge in recent years [8, 18–22] the resolution of the mitochondrial haplogroup phylogeny in SEA, especially in macrohaplogroup M, is still very low  compared to West-Eurasian haplogroups. Moreover, in population size analyses on mitochondrial DNA data, Atkinson et al. (2008) discovered that on the Indian subcontinent plus mainland SEA the first pronounced population expansion outside Africa took place around 52,000 years ago, and between 45,000 and 20,000 years before present the majority of the global population of Homo sapiens lived in that area .
Finally this dataset is also of sociocultural interest, because Myanmar is subdivided into more than 100 ethnic groups amongst them the Bamar represent 68% of the population. Other important minorities are Shan (10%), Karen (7%), Arakanese (4%), Chinese (3%) and the ethno-linguistically related Mon and Khmer (2% each). Since Myanmar’s independence from the British occupation, a lot of tensions emerged between the ruling Bamar and the remaining ethnic minorities, who suffered from government’s repression . Amongst others especially the Karen people struggle against the domination of the Bamar culture .
With this study we expected to address the following historic questions: Is the changeful history of Myanmar reflected in its mitochondrial DNA diversity? How does its population structure fit into the overall picture of SEA? Was there a contribution of Indo-European language speakers to the genepool of Myanmar through the North Indian corridor or from the time as British colony? In addition, we aimed at contributing to the picture of human colonization of the Asian continent with the description and dating of new mitochondrial haplogroups. Finally, we were searching for mitochondrial genetic footprints of the ethnic heterogeneity of Myanmar and for signs of genetic isolation of individual ethnic groups.
Haplogroup composition and new lineages
The study included 327 DNA samples from Myanmar citizens, who lived in Northern Thailand at the time of sample collection. The Myanmar sample, consisting of 327 mitochondrial control region sequences and 44 complete mitochondrial genomes, which constitute a subset of the 327 control region samples, exhibited pronounced mtDNA diversity displaying 113 distinct CR lineages, including eight in this study newly defined haplogroups and three different not classified basal M branches (Additional file 1: Table S1). F1a1a with 15.9% of all sequences was by far the most frequent haplogroup in this study, followed by C4b1 (7.0%), B6 (6.4%) and A4 (5.2%). R9b1a1a, D4 and G2b1a reached 4.6% each. The 78 individuals actually belonging to M split into 50 different haplogroups, 29 of them with only a single representative. The most common haplogroup in M was M21a (1.8%) (Additional file 1: Table S1).
Population structure of Myanmar
Comparison of the Bamar and the Karen people, two main ethnic groups in Myanmar
Migration analyses of Myanmar and four Southeast Asian regions displayed a vivid exchange of genetic material between the countries and demonstrated a strong outwards migration of Myanmar to all analyzed neighboring regions (for details see Additional file 4: Table S4). The migration rate from Myanmar to the neighboring country Thailand was 1.6-times higher than the migration rate from Thailand to Myanmar. Nearly twice as many people seemed to emigrate from Myanmar to Laos than Laotians immigrated to Myanmar. While the migration rates between Myanmar and Vietnam were nearly balanced, we observed a 1.3-time higher emigration rate from Myanmar to Hong Kong than from Hong Kong to Myanmar. Although the calculated migrations values and directions were not entirely consistent across the five analyses, in summary the effective migration rates characterized Myanmar genetically as an emigrating population.
Age estimates of new haplogroups
The age estimates of the eight newly described haplogroups fell into four different periods. The oldest haplogroup M90 with an estimated age of 34,400 (95%-CI: 31,900 – 37,000) years dated in the Pleistocene, long before the last glacial maximum (LGM). The haplogroups M91 with an estimated age of 25,600 (95%-CI: 24,400 – 26,800) years and M49e (25,100; 95%-CI: 23,300 – 26,800 years) emerged at the beginning of the LGM, whereas the age of M49e1 (20,200; 95%-CI: 18,400 – 22,100 years) denoted the end of the LGM. The four remaining new haplogroups turned out to be much younger: B6a1 (10,200; 95%-CI: 9,800 – 10,600 years) and M20a (9,800; 95%-CI: 8,500 – 11,100 years) dated at the beginning of the Holocene, whereas the youngest new haplogroups M90a (4,800; 95%-CI: 4,300 – 5,200 years) and G2b1a1 (4,700; 95%-CI: 3,200 – 6,200 years) fell into the metal ages.
Tracing the reasons for Myanmar’s extraordinarily diverse population structure
Myanmar turned out to be a hotspot for mitochondrial DNA diversity in Southeast Asia (SEA): The analyzed 327 mitochondrial control region sequences could be assigned to 113 different haplogroups (Additional file 1: Table S1) and from 44 complete mitochondrial genome sequences 11 new lineages emerged (Figure 1). This diversity can partly be explained by SEA’s long population history of over 60,000 years [8, 12], and by the hypothesis that due to a Pleistocene population expansion, ~60% of the global human population lived in that area about 38,000 years ago . Its geographic position on the trade route between the vast empires of India and China  and along the way of important migration events at several different time periods [8, 13, 18, 32] could be another explanation for Myanmar’s haplogroup diversity. Furthermore, Burma’s population structure with more than 100 ethnicities scattered over a geographically differentiated country could have also contributed to its genetic variability. Such great variety of M-lineages as we found in Myanmar has only been described before in India . Therefore, the prior hypothesis that the diversification of macrohaplogroup M originated in India  and that basal M-lineages spread to Myanmar could be extended by the theory that the radiation of M-lineages took place in a geographically wider area including Southeast Asia. Regarding the most frequent haplogroups, an interesting picture arose: In addition to the expected SEA groups [20, 34] F1a1a (15.9%), R9b1a1a (4.6%), and M21a (1.8%) (Additional file 1: Table S1), the five other frequent mitochondrial haplogroups from our Burma sample (C4b1, B6, A4, D4 and G2b1a, together 27.8%) mainly represented Northeast and East Asian lineages [35–38]. B6 (6.4%) and D4 (4.6%), however, were also found in Island SEA [18, 34, 36]. A closer look on the newly described lineages (Figure 1) further supports an astonishing intermediate biogeographic position of Myanmar: M20a descends from a recently described haplogroup M20, found in Mainland SEA  and southern China . Also the new lineage M91, directly descending from the M-node, has a previously detected member from southern China (NCBI Accession Nr. HM030537) . The new haplogroup M49e and its subgroup M49e1, in contrast, derive from a lineage (M49) only described from tribal populations in Northeast India  and G2b1a1 as well as B6a1 arise from East Asian haplogroups [36, 38]. The remaining new haplogroup M90 with its subgroup M90a as well as the three new lineages with only one representative support the assumption of a native emergence of basal M-lineages in this region. Despite the before mentioned specific features of Myanmar, the overall main haplogroup composition (Figure 2b) and the pairwise Fst-values (visualized as multidimensional scaling plot in Figure 3) characterize the country as a typical SEA population. The characteristic Mainland SEA haplogroups  B5a, F1a1a and M7b* were abundant in Myanmar (in sum 21.4%), only the M7-group was with 3.4% clearly under-represented compared to the four other SEA populations [23, 29–31] included in this study (M7* between 12.7 and 18.2%). The further exceptions from SEA populations, namely the high percentages of the mainly East Asian haplogroups A and C (together 14.7% of the Myanmar sample) and the extraordinary diversity of superhaplogroup M*, could be attributed to the intrinsic characteristic of two ethnic groups: the over-representation of the haplogroups A4 (8.28%) and C4b1 (12.41%) in the Karen population compared to other Southeast Asian populations (p < 0.001) was the reason for the observed high percentage of East Asian haplogroups, and the large variety of haplogroup M in the Bamar population (37.07%) was responsible for the unexpected diversity of the M-cluster in Myanmar. Taken all together, Burma’s mitochondrial haplogroup composition, reflecting the country’s geographic position as well as its complex population structure, generally fits into the overall Southeast Asian picture, but adds important findings to it.
Bamar and Karen share the same country but are genetically contrasting populations
The haplogroup composition of the two main ethnic groups in our study, the numerically (68% of the population) and politically dominating group in Myanmar, the Bamar, and the most repressed and also most separatist ethnicity, the Karen (7% of the population), differed enormously: The Bamar sample with its 80 different mitochondrial lineages was extraordinarily diverse, whereas almost three quarters (73.1%) of the Karen population could be assigned to only five haplogroups (Figure 4 upper panel).The multi-dimensional scaling plot of Fst-values (Figure 4 lower panel) underlined those differences. Pairwise mismatch distribution plots showed signs of a recent demographic expansion for the Bamar, but demographic equilibrium and many sequences with minimal differences in the Karen population pointed to effects of genetic isolation (Figure 4 upper panel). The conclusion that Bamar and Karen differ genetically is not new: a study on CYP2C19*3 allele frequencies, an important gene for the response to barbiturates and malaria drugs  and a survey on heme-oxygenase 1 promoter polymorphisms  found significant differences between participants of Bamar or Karen origin. These findings could be interpreted both historically and by recent sociocultural aspects of present-time Myanmar: Although Bamar and Karen both are of Tibeto-Burman origin and derive from ancient tribes of northwestern China , their history differs: the Bamar migrated from Yunnan/China into the Irrawaddy valley in the 7th century AD, thereby replacing and absorbing the original tribes of Pyu and Mon . In the course of that absorption they presumably “incorporated” older local haplogroups . The ethnic origin of the Karen and the migration routes before their arrival in Myanmar around the sixth century AD is largely unknown . Their own legends tell about an ancestry from a “sandy region in the North”, sometimes interpreted as the Gobi Desert, but a more plausible explanation includes an origin in the Yellow River region, and a migration over Yunnan along the Salween or Mekong into today’s Shan state and then into the mountain regions farther south (now Kayin State) [42, 43]. The Karen claim to be amongst the first settlers of Myanmar and despite their diverse linguistic and cultural subgroups, their different religions and different geographic locations, they define themselves as unity with a clear dissociation from the Bamar . Suppression and displacement of Karen and other peoples were already documented in the 18th century , and many Karen fled and still flee to Thailand . Even until very recently, there was an armed conflict between the Bamar government and the Karen National Union in Myanmar . Another reason for the observed dissimilarity between those two ethnic groups could be the traditionally matrilocal and matrilineal culture of the Karen [26, 42], which cannot be found in the Bamar. While the haplogroup diversity for mitochondrial DNA is supposed to be generally lower within matrilocal groups than within patrilocal groups and vice versa for the Y-chromosome , an analogous pattern in genetic diversity was not confirmed with Y-chromosome data, where Burmese-Lolo (i.e. Bamar) and Karen were shown to exhibit similar haplotype patterns . In summary, the distinct genetic dissimilarity between Bamar and Karen seems to result, besides their different population sizes, not so much from geographic separation of the two ethnic groups, but from divergent population histories and even more from their different cultural traditions and social constraints.
Can the genetic influence of Myanmar on its neighboring countries be explained by their shared ancestry alone or is it partially caused by politically forced migration events?
The recent history of Myanmar, with its isolation politics lasting for decades, resulted in a very low published immigration estimate of 0.23 persons per 1000 inhabitants and a net emigration estimate of 0.3/1000 migrants per year (CIA World fact book 2013, https://www.cia.gov/library/publications/the-world-factbook). Therefore, the genetically manifested vivid exchange of Myanmar with neighboring regions and an emigration exceeding immigration by over 30% (Additional file 1: Table S4), is an interesting finding. The magnitude of the observed migration rates could reflect the widely shared population history of Myanmar, Thailand, Laos, Vietnam and South China, all descending from Sino-Tibetan tribes [11, 14, 15]. However, the majority of the present genetic exchange, particularly with the Thai population, seems to originate from the Karen minority: The Thai samples included in this study come from the Chiang Mai region , where traditionally many Karen live, nowadays a considerable proportion of them as refugees . Migration analyses, especially the dominating outwards migration, could therefore also indicate an active marginalization and displacement of ethnic minorities, especially of the Karen people with ~140,000 Karen refugees alone in Thailand.
Did climate change and cultural inventions influence human population history in Mainland Southeast Asia?
The timing of the most recent common ancestor of the youngest newly described haplogroups M90a (mean time estimate 4,800 years) and G2b1a1 (4,700 years) falls into a time of cultural change. An agricultural revolution in southern China with subsequent expansion to Mainland and Island SEA took place around that time . Haplogroup G2b1a1, originating from East Asian lineages [36, 38] would perfectly trace this migration wave. The emerging of haplogroups M20a and B6a1 in the early Holocene, together with F1a1a also appearing in Indochina at the same time , fits perfectly into a time of a supposed expansion of people from Mainland SEA and southern China, possibly driven by global warming and sea level rises, to Island SEA [18, 32, 34, 47, 48]. The potential Northeast Indian haplogroup M49e (25,100 years) and its subgroup M49e1 (20,200 years) appear at the beginning, respectively versus the end of the LGM, which would be a typical coalescent time of East Asian lineages in Northeast Asia . Also the new Southeast Asian lineage M91 (25,600) dates at the beginning of the LGM together with R9b, another lineage from Indochina . It is not clear what could have driven a diversification of mitochondrial haplogroups in SEA during the LGM; perhaps an extension of settlement area to the current islands of Sumatra, Java and Borneo, being parts of the Asian mainland at that time, could serve as an explanation. About the emerging of haplogroup M90 (34,400 years) in the Pleistocene and the additional three undated new basal M-lineages we can only speculate that the proposed population expansion in South and Southeast Asia between 45,000 and 20,000 years before present  resulted in new haplogroups. The age estimates of our eight newly described haplogroups, together with previously published coalescence times of other lineages, point to the existence of distinct “evolutionary time slots” where climatic as well as cultural changes gave rise to mitochondrial haplogroup diversification in Southeast Asia.
Strengths and limitations
This survey provides detailed insight into human mitochondrial DNA evolution in Myanmar for the first time, and thereby sheds more light onto a crucial stage of Southeast Asian population history. Our study collection was selected with the aim of covering a maximum of Myanmar’s diversity; indeed, it included samples from every political region except Mandalay division, and from 11 different ethnicities including the numerically most important. At the same time it provides sufficient sample depth to address ethnicity specific questions in the case of Bamar and Karen, and therefore optimizes the information content within the limits of a manageable sample size. Another important strength of this study represent the personal interviews with each study participant following a sophisticated questionnaire and therefore making assignment problems or misinterpretations by third persons very unlikely. Thanks to this questionnaire (Additional file 5: Table S5), we were able to obtain reliable information on age, sex, specific ethnicity and geographic origin of the participants, and in addition also the places of birth of maternal ancestors of the preceding two generations were compiled.
However, a major limitation of our study is that samples were collected from Myanmar citizens living in Northern Thailand at the time of sample collection, and were not collected in Myanmar per se. In addition, our dataset still suffers, compared to the actual demographic situation of Myanmar, from an uneven sample distribution with an under-representation of the populous Irrawaddy basin. For important ethnic groups like Shan, Arakanese, Mon and Chin, sample sizes were too small to address sub-population specific questions. A second caveat of this study in regard of demographic estimates is that mtDNA, even the complete mtDNA genome, only represents a single locus. We negotiated those shortcomings by specifically addressing the Bamar as the dominant group in Myanmar, and the Karen, representing the most persecuted and most separatist minority in Myanmar, for population genetic analyses. We restricted the interpretation of the demographic analysis results to migration rates, where an estimate based on one locus is valid .
Conclusions and outlook
The multi-ethnic population and the complex history of Myanmar were well reflected in its distinct mitochondrial DNA heterogeneity. The mitochondrial haplogroup distribution in Myanmar showed a typical Southeast Asian pattern, confirming earlier findings but also adding new information: the population sample of Myanmar displayed quite a few parallels to North and Northeast Asian and also to South Asian populations. No traces of European or African influence to the maternal gene pool of Myanmar were detected. The population structure of the extraordinarily diverse Bamar differed substantially from that of the Karen people who displayed signs of genetic isolation. These results, together with the genetically manifested net outwards migration to neighboring regions, raise questions about the sociocultural circumstances in Myanmar. The description and dating of eight new mitochondrial haplogroups and the detection of three further basal M lineages shed more light on the population history of Southeast Asia.
This study resulted in important findings, but, as always with gaining knowledge, further questions arise: Does the genetic pattern of other ethnic groups in Myanmar, respectively in Southeast Asia, display such great differences as Bamar and Karen? What does this imply in terms of cultural and social issues? Is the exceptional haplogroup diversity found in Bamar so very special in this region and if this is the case, what lies behind that? How would the overall picture of a detailed study from a country in the same region, but with a more homogeneous ethnic structure differ from that of Myanmar? To address these and many more questions, it seems worth spending time on developing sound study designs with meaningful questionnaires going into detail in terms of biogeographic and cultural population characteristics like the presence and distribution of ethnic groups. Furthermore, the phylogeny of the human mitochondrial genome, especially in macrohaplogroup M, is far from being exhaustively investigated and a plethora of new lineages still await to be discovered in future studies. The use of next generation sequencing technologies will help to obtain more complete mitochondrial genome data in a shorter time span, but one always has to keep an eye on highest sequence quality, a prerequisite for indisputable findings.
Phylogenetic analyses of 44 entire mtDNA genomes uncovered eight new haplogroups and three unclassified basal M-lineages. The multi-ethnic population and the complex history of Myanmar were reflected in its mtDNA heterogeneity. Population genetic analyses of Burmese control region sequences combined with population data from neighboring countries revealed that the Myanmar haplogroup distribution showed a typical Southeast Asian pattern, but also Northeast Asian and Indian influences. The population structure of the extraordinarily diverse Bamar differed from that of the Karen people, who displayed signs of genetic isolation. Migration analyses indicated a considerable genetic exchange with an overall positive migration balance from Myanmar to neighboring countries. Age estimates of the newly described haplogroups point to the existence of evolutionary windows where climatic and cultural changes gave rise to mitochondrial haplogroup diversification in Asia.
DNA samples and sequencing
Blood samples were collected from 327 unrelated Burmese individuals, who came from 13 of the 14 regions of Myanmar, but lived in Thailand at the time of the sample collection. The study was approved by the Thai Ministry of Public Health in accordance with international ethical standards. According to the Declaration of Helsinki, participation in the study was on voluntary basis and informed consent was obtained from all donors (Chiang Mai University, Thailand, January 9, 2008). Detailed information on the geographical origin and ethnic background of the samples including the maternal region of was collected from all samples can be found in Figure 2 and Additional file 1: Table S1. An example of such a questionnaire on maternal origin is given in Additional file 5: Table S5. Genomic DNA was extracted on a BioRobot EZ1 advanced Workstation (QIAGEN, Hilden, Germany) and quantified on an Infinite® 200 NanoQuant (Tecan Group Ltd., Männedorf, Switzerland). Mitochondrial DNA control region (CR; nucleotide positions 16024–16569; 1–576) sequences were obtained following the protocol described in Brandstätter et al. . To refine haplogroup affiliations we additionally sequenced informative coding region segments in 24 samples (listed in Additional file 1: Table S1). In addition, we performed complete mitochondrial genome sequencing as described in Kloss-Brandstätter et al.  in order to refine the phylogeny of 44 samples that could not be assigned to a haplogroup more specific than paragroups M* or N*.
Sequence evaluation, quality assurance and haplogroup assignment
Sequence electropherograms were aligned to the revised Cambridge Reference Sequence (rCRS; NC_012920)  and evaluated independently by two different mtDNA technicians with the sequence analysis software Sequencher (v5.0, GeneCodes, Ann Arbor, MI). Despite a recent suggestion to replace the rCRS by a Reconstructed Sapiens Reference Sequence (RSRS; ), we prefer to maintain the rCRS as proposed by many mtDNA scientists [54, 55]. Validation was performed by a senior mtDNA scientist using the mtDNA management software eCOMPAGT . MtDNA haplotypes were assigned to haplogroups based on PhyloTree v.15 [27, 28] with HaploGrep .
Population genetic analysis
In order to shed light on the genetic structure of Burma and its phylogeographic position within Southeast Asia, three different datasets were compiled: the first dataset comprised 1294 Southeast Asian samples consisting of samples from this study (n = 327), Hong Kong (n = 376) , Thailand (n = 190) , Vietnam (n = 187)  and Laos (n = 214) . In the second dataset, the geographic range was expanded with additional data from Central Asia (n = 1550; including samples from Uzbekistan, Kazakhstan, Kyrgyzstan, Russia, Afghanistan, Turkmenistan, and Tajikistan)  and East Asia (n = 592) . In the third dataset, we split the Myanmar sample and analyzed the ethnic groups of Bamar (n = 116) and Karen (n = 145) in comparison with other Southeast Asian populations. ARLEQUIN version 188.8.131.52  was used for the calculation of mismatch distributions, molecular diversity indices, and analyses of molecular variance (AMOVA) with entire control region sequences (16024–576) excluding C-insertions on positions 16193, 309, 315 and 573.
For visualizing the AMOVA results, multi-dimensional scaling plots based on pairwise Fst-values were generated with PASW Statistics 18 (SPSS Inc.). Differences in haplogroup-distributions between different populations were evaluated with a Chi-Square Test.
Phylogenetic analyses and age estimates
For phylogenetic tree reconstruction of the 44 complete mitochondrial genomes we included the rCRS . The best fitting model of evolution selected by Modeltest as implemented in the computer program MEGA 5  was Tamura-Nei 1993  with Gamma distribution and invariant sites (TN + G + I). The evolutionary history was inferred with the Maximum Likelihood method using the TN + G + I model in MEGA 5. Initial trees for the heuristic search were obtained by applying the Neighbor-Joining method  to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach. All positions containing gaps and missing data were excluded. Phylogenetic trees were also generated using Markov-Chain-Monte-Carlo (MCMC) sampling with the computer program BEASTMC3 v1.7.4 . Analyses were performed with two different models of evolution, TN + G  and HKY , and the “tree prior” was set to coalescent, Bayesian skyline with 10 groups and a constant skyline model. One cold and two hot chains were run with the temperatures of 0.5 and 0.333 and a chain swapping at every 100 generations. 20,000,000 generations were sampled every 2000 steps and the first 10% of generations were regarded as burn-in. Each log file was reviewed in TRACER 1.5  for the stability of MCMC chains  and the tree files were combined with TreeAnnotator v.1.7.5 included in the BEAST package . For all MEGA and BEAST analyses, a constant clock with a mutation rate of 1.665 × 10-8 base substitution per nucleotide per year  for the complete mitochondrial genome was applied. Mean age estimates and for each new haplogroup were calculated as arithmetic means from the singular age estimates obtained from the five different phylogenetic analyses (two repetitions of MEGA, three repetitions of BEAST). The 95%-confidence intervals were obtained from the arithmetic means and corresponding standard deviations.
Estimation of migration rates
Migration rates, defined as M = m/μ, where M is the effective migration rate, m the immigration rate per generation, and μ is the mutation rate per generation and site, between Myanmar and four other Southeast Asian countries (Hong Kong, Thailand, Vietnam and Laos) were inferred with MIGRATE version 3.4.2 [68, 69] using coalescence theory. In order to get an unbiased estimate of the mutation rate, a subsample of 180 randomly chosen sequences per population was used. MIGRATE uses both Markov Chain Monte Carlo-based (MCMC) Bayesian and Maximum Likelihood (ML) procedures to calculate population genetic parameters . In this study we applied the ML approach assuming a constant mutation rate; 10 short chains and 1 long chain were run, each with a sampling increment of 100; the short chains sampled 50,000 genealogies and the long chain sampled 500,000 genealogies, all with a “burn-in” of 10,000. “Heating” (Metropolis-Coupled MCMC; “MCMCMC”) with four chains (temperatures 1.0, 1.5, 3, and 1,000,000) and a static heating scheme was applied. Migrate analyses were repeated four times with independent random sampling of sequences. In addition, Lamarc-2.1.8 (Maximum Likelihood Parameter Estimation using Hastings-Metropolis Markov Chain Monte Carlo)  was used to estimate migration rates, using the information of 50 randomly picked mitochondrial control region sequences per population. Maximum likelihood analyses (F84 base substitution model, base frequencies estimated, TI/TV ratio = 2) were performed with 10 initial and 2 final Markov chains sampling 500 respectively 10,000 trees, with a sampling increment of 20, a burn-in of 1000 and chain temperatures of 1, 1.1 and 1.2. For the final estimate of the migration rates between the selected populations, the results of each and every analysis were averaged.
All sequences from Myanmar were submitted to NCBI GenBank (http://www.ncbi.nlm.nih.gov/genbank); 327 mitochondrial control region sequences are available under the Accession Numbers JX288765-JX289091 and 44 complete mitochondrial genomes are available under the Accession Numbers JX289092-JX289135. The assignment of all samples to their sequence Accession Numbers is listed in Additional file 1: Table S1.
The project was supported by the Austrian Cancer Society/Tirol and by the Tyrolean Science Fund (Tiroler Wissenschaftsfonds). We are grateful to all study participants for their cooperation and donation of DNA, and we thank Dr. Shafinaz Hussein (Columbia University, NY, USA) for communicating with donors in Burmese language. Finally, we thank Claudia Lamina (Innsbruck Medical University, Austria) for statistical assistance.
- Brown WM, George M, Wilson AC: Rapid evolution of animal mitochondrial DNA. Proc Natl Acad Sci U S A. 1979, 76: 1967-1971. 10.1073/pnas.76.4.1967.PubMedPubMed CentralView ArticleGoogle Scholar
- Avise JC, Arnold J, Ball RM, Bermingham E, Lamb T, Neigel JE, et al: Intraspecific phylogeography: the mitochondrial DNA bridge between population genetics and systematics. Annu Rev Ecol Syst. 1987, 18: 489-522.View ArticleGoogle Scholar
- Ingman M, Kaessmann H, Pääbo S, Gyllensten U: Mitochondrial genome variation and the origin of modern humans. Nature. 2000, 408: 708-713. 10.1038/35047064.PubMedView ArticleGoogle Scholar
- Soares P, Ermini L, Thomson N, Mormina M, Rito T, Röhl A, et al: Correcting for purifying selection: an improved human mitochondrial molecular clock. Am J Hum Genet. 2009, 84: 740-759. 10.1016/j.ajhg.2009.05.001.PubMedPubMed CentralView ArticleGoogle Scholar
- Endicott P, Ho SY, Metspalu M, Stringer C: Evaluating the mitochondrial timescale of human evolution. Trends Ecol Evol. 2009, 24: 515-521. 10.1016/j.tree.2009.04.006.PubMedView ArticleGoogle Scholar
- Kivisild T, Shen P, Wall DP, Do B, Sung R, Davis K, et al: The role of selection in the evolution of human mitochondrial genomes. Genetics. 2006, 172: 373-387.PubMedPubMed CentralView ArticleGoogle Scholar
- Mishmar D, Ruiz-Pesini E, Golik P, Macaulay V, Clark AG, Hosseini S, et al: Natural selection shaped regional mtDNA variation in humans. Proc Natl Acad Sci U S A. 2003, 100: 171-176. 10.1073/pnas.0136972100.PubMedPubMed CentralView ArticleGoogle Scholar
- Kong QP, Sun C, Wang HW, Zhao MA, Wang WZ, Zhong L, et al: Large-scale mtDNA screening reveals a surprising matrilineal complexity in east asia and its implications to the peopling of the region. Mol Biol Evol. 2011, 28: 513-522. 10.1093/molbev/msq219.PubMedView ArticleGoogle Scholar
- Wang HW, Mitra B, Chaudhuri TK, Palanichamy MG, Kong QP, Zhang YP: Mitochondrial DNA evidence supports northeast Indian origin of the aboriginal Andamanese in the Late Paleolithic. J Genet Genomics. 2011, 38: 117-122. 10.1016/j.jgg.2011.02.005.PubMedView ArticleGoogle Scholar
- Su B, Xiao JH, Underhill P, Deka R, Zhang WL, Akey J, et al: Y-chromosome evidence for a northward migration of modern humans into eastern Asia during the last Ice Age. Am J Hum Genet. 1999, 65: 1718-1724. 10.1086/302680.PubMedPubMed CentralView ArticleGoogle Scholar
- Su B, Xiao CJ, Deka R, Seielstad MT, Kangwanpong D, Xiao JH, et al: Y chromosome haplotypes reveal prehistorical migrations to the Himalayas. Hum Genet. 2000, 107: 582-590. 10.1007/s004390000406.PubMedView ArticleGoogle Scholar
- Yao Y-G, Kong Q-P, Bandelt H-J, Kivisild T, Zhang Y-P: Phylogeographic differentiation of mitochondrial DNA in Han Chinese. Am J Hum Genet. 2002, 70: 635-651. 10.1086/338999.PubMedPubMed CentralView ArticleGoogle Scholar
- Cai X, Qin Z, Wen B, Xu S, Wang Y, Lu Y, et al: Human migration through bottlenecks from Southeast Asia into East Asia during Last Glacial Maximum revealed by Y chromosomes. Plos One. 2011, 6: e24282-10.1371/journal.pone.0024282.PubMedPubMed CentralView ArticleGoogle Scholar
- Wen B, Xie X, Gao S, Li H, Shi H, Song X, et al: Analyses of genetic structure of Tibeto-Burman populations reveals sex-biased admixture in southern Tibeto-Burmans. Am J Hum Genet. 2004, 74: 856-865. 10.1086/386292.PubMedPubMed CentralView ArticleGoogle Scholar
- Diamond J, Bellwood P: Farmers and their languages: the first expansions. Science. 2003, 300: 597-603. 10.1126/science.1078208.PubMedView ArticleGoogle Scholar
- Lieberman V: Strange Parallels: Southeast Asia in Global Context, c. 800–1830. Vol. 1: Integration on the Mainland. 2003, Cambridge, UK: Cambridge University PressGoogle Scholar
- Christian JL: Trans-Burma trade routes to China. Pac Aff. 1940, 13: 173-191. 10.2307/2751052.View ArticleGoogle Scholar
- Hill C, Soares P, Mormina M, Macaulay V, Clarke D, Blumbach PB, et al: A mitochondrial stratigraphy for island southeast Asia. Am J Hum Genet. 2007, 80: 29-43. 10.1086/510412.PubMedPubMed CentralView ArticleGoogle Scholar
- Endicott P, Ho SY: A Bayesian evaluation of human mitochondrial substitution rates. Am J Hum Genet. 2008, 82: 895-902. 10.1016/j.ajhg.2008.01.019.PubMedPubMed CentralView ArticleGoogle Scholar
- Peng M-S, Quang HH, Dang K-P, Trieu AV, Wang H-W, Yao Y-G, et al: Tracing the Austronesian footprint in Mainland Southeast Asia: a perspective from mitochondrial DNA. Mol Biol Evol. 2010, 27: 2417-2430. 10.1093/molbev/msq131.PubMedView ArticleGoogle Scholar
- Soares P, Rito T, Trejaut J, Mormina M, Hill C, Tinkler-Hundal E, et al: Ancient voyaging and polynesian origins. Am J Hum Genet. 2011, 88: 239-247. 10.1016/j.ajhg.2011.01.009.PubMedPubMed CentralView ArticleGoogle Scholar
- Gunnarsdóttir ED, Li MK, Bauchet M, Finstermeier K, Stoneking M: High-throughput sequencing of complete human mtDNA genomes from the Philippines. Genome Res. 2011, 21: 1-11. 10.1101/gr.107615.110.PubMedPubMed CentralView ArticleGoogle Scholar
- Bodner M, Zimmermann B, Röck A, Kloss-Brandstätter A, Horst D, Horst B, et al: Southeast Asian diversity: first insights into the complex mtDNA structure of Laos. BMC Evol Biol. 2011, 11: 49-10.1186/1471-2148-11-49.PubMedPubMed CentralView ArticleGoogle Scholar
- Atkinson QD, Gray RD, Drummond AJ: mtDNA variation predicts population size in humans and reveals a major Southern Asian chapter in human prehistory. Mol Biol Evol. 2008, 25: 468-474. 10.1093/molbev/msm277.PubMedView ArticleGoogle Scholar
- Smith M: Ethnic groups in Burma: Development, Democracy and Human Rights. 1994, Anti-Slavery International: London, UK, 8Google Scholar
- Malseed K: Where there is no movement: Local resistance and the potential for solidarity. J Agrar Change. 2008, 8: 489-514. 10.1111/j.1471-0366.2008.00178.x.View ArticleGoogle Scholar
- van Oven M: Revision of the mtDNA tree and corresponding haplogroup nomenclature. Proc Natl Acad Sci U S A. 2010, 107: E38-E39. 10.1073/pnas.0915120107.PubMedPubMed CentralView ArticleGoogle Scholar
- van Oven M, Kayser M: Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat. 2009, 30: E386-E394. 10.1002/humu.20921.PubMedView ArticleGoogle Scholar
- Zimmermann B, Bodner M, Amory S, Fendt L, Röck AW, Horst D, et al: Forensic and phylogeographic characterization of mtDNA lineages from northern Thailand (Chiang Mai). Int J Legal Med. 2009, 123: 495-501. 10.1007/s00414-009-0373-4.PubMedView ArticleGoogle Scholar
- Irwin JA, Saunier JL, Strouss KM, Diegoli TM, Sturk KA, O’Callaghan JE, et al: Mitochondrial control region sequences from a Vietnamese population sample. Int J Legal Med. 2008, 122: 257-259. 10.1007/s00414-007-0205-3.PubMedView ArticleGoogle Scholar
- Irwin JA, Saunier JL, Beh P, Strouss KM, Paintner CD, Parsons TJ: Mitochondrial DNA control region variation in a population sample from Hong Kong, China. Forensic Sci Int Genet. 2009, 3: e119-e125. 10.1016/j.fsigen.2008.10.008.PubMedView ArticleGoogle Scholar
- Jinam TA, Hong LC, Phipps ME, Stoneking M, Ameen M, Edo J, et al: Evolutionary history of continental southeast asians: “early train” hypothesis based on genetic analysis of mitochondrial and autosomal DNA data. Mol Biol Evol. 2012, 29: 3513-3527. 10.1093/molbev/mss169.PubMedView ArticleGoogle Scholar
- Chandrasekar A, Kumar S, Sreenath J, Sarkar BN, Urade BP, Mallick S, et al: Updating phylogeny of mitochondrial DNA macrohaplogroup M in India: dispersal of modern human in South Asian corridor. PLoS ONE. 2009, 4: e7447-10.1371/journal.pone.0007447.PubMedPubMed CentralView ArticleGoogle Scholar
- Hill C, Soares P, Mormina M, Macaulay V, Meehan W, Blackburn J, et al: Phylogeography and ethnogenesis of aboriginal Southeast Asians. Mol Biol Evol. 2006, 23: 2480-2491. 10.1093/molbev/msl124.PubMedView ArticleGoogle Scholar
- Derenko M, Malyarchuk B, Grzybowski T, Denisova G, Rogalla U, Perkova M, et al: Origin and post-glacial dispersal of mitochondrial DNA haplogroups C and D in northern Asia. PLoS ONE. 2010, 5: e15214-10.1371/journal.pone.0015214.PubMedPubMed CentralView ArticleGoogle Scholar
- Derenko M, Malyarchuk B, Denisova G, Perkova M, Rogalla U, Grzybowski T, et al: Complete mitochondrial DNA analysis of eastern eurasian haplogroups rarely found in populations of northern asia and eastern europe. Plos One. 2012, 7 (2): e32179-10.1371/journal.pone.0032179.PubMedPubMed CentralView ArticleGoogle Scholar
- Tanaka M, Cabrera VM, González AM, Larruga JM, Takeyasu T, Fuku N, et al: Mitochondrial genome variation in eastern Asia and the peopling of Japan. Genome Res. 2004, 14: 1832-1850. 10.1101/gr.2286304.PubMedPubMed CentralView ArticleGoogle Scholar
- Kong Q-P, Bandelt H-J, Sun C, Yao Y-G, Salas A, Achilli A, et al: Updating the east asian mtDNA phylogeny: a prerequisite for the identification of pathogenic mutations. Hum Mol Genet. 2006, 15: 2076-2086. 10.1093/hmg/ddl130.PubMedView ArticleGoogle Scholar
- Tassaneeyakul W, Mahatthanatrakul W, Niwatananun K, Na-Bangchang K, Tawalee A, Krikreangsak N, et al: CYP2C19 genetic polymorphism in Thai, Burmese and Karen populations. Drug Metab Pharmacokinet. 2006, 21: 286-290. 10.2133/dmpk.21.286.PubMedView ArticleGoogle Scholar
- Kuesap J, Hirayama K, Kikuchi M, Ruangweerayut R, Na-Bangchang K: Study on association between genetic polymorphisms of haem oxygenase-1, tumour necrosis factor, cadmium exposure and malaria pathogenicity and severity. Malaria J. 2010, 9: 260-10.1186/1475-2875-9-260.View ArticleGoogle Scholar
- Moore E: Bronze and Iron Age sites in Upper Myanmar: Chindwin, Samon and Pyu. 2003, Spring: SOAS Bulletin of Burma Research, Vol.1 Nr.1, ISSN 1479-8484Google Scholar
- Marshall HI: The karen people of burma: a study in anthropology and ethnology. Ohio State University Bulletin. 1922, 26: 1-329.Google Scholar
- Kuroiwa Y, Verkuyten M: Narratives and the constitution of a common identity: the karen in burma. Identities-Glob Stud. 2008, 15: 391-412.View ArticleGoogle Scholar
- Gravers M: Waiting for a righteous ruler: the karen royal imaginary in thailand and burma. JSoutheast Asian studies. 2012, 43: 340-363. 10.1017/S0022463412000094.View ArticleGoogle Scholar
- Oota H, Settheetham-Ishida W, Tiwawech D, Ishida T, Stoneking M: Human mtDNA and Y-chromosome variation is correlated with matrilocal versus patrilocal residence. Nat Genet. 2001, 29: 20-21. 10.1038/ng711.PubMedView ArticleGoogle Scholar
- Kongmaroeng C, Romphruk A, Ruangwerayut R, Paupairoj C, Leelayuwat C, Inoko H, et al: HLA-B*15 subtypes in Burmese population by sequence-based typing. Tissue Antigens. 2009, 74: 164-167. 10.1111/j.1399-0039.2009.01281.x.PubMedView ArticleGoogle Scholar
- Soares P, Trejaut JA, Loo JH, Hill C, Mormina M, Lee CL, et al: Climate change and postglacial human dispersals in southeast Asia. Mol Biol Evol. 2008, 25: 1209-1218. 10.1093/molbev/msn068.PubMedView ArticleGoogle Scholar
- Karafet TM, Hallmark B, Cox MP, Sudoyo H, Downey S, Lansing JS, et al: Major east–west division underlies Y chromosome stratification across Indonesia. Mol Biol Evol. 2010, 27: 1833-1844. 10.1093/molbev/msq063.PubMedView ArticleGoogle Scholar
- Kuhner MK: LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters. Bioinformatics. 2006, 22: 768-770. 10.1093/bioinformatics/btk051.PubMedView ArticleGoogle Scholar
- Brandstätter A, Niederstätter H, Pavlic M, Grubwieser P, Parson W: Generating population data for the EMPOP database - an overview of the mtDNA sequencing and data evaluation processes considering 273 Austrian control region sequences as example. Forensic Sci Int. 2007, 166: 164-175. 10.1016/j.forsciint.2006.05.006.PubMedView ArticleGoogle Scholar
- Kloss-Brandstätter A, Schäfer G, Erhart G, Hüttenhofer A, Coassin S, Seifarth C, et al: Somatic mutations throughout the entire mitochondrial genome Are associated with elevated PSA levels in prostate cancer patients. Am J Hum Genet. 2010, 87: 802-812. 10.1016/j.ajhg.2010.11.001.PubMedPubMed CentralView ArticleGoogle Scholar
- Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, Howell N: Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet. 1999, 23: 147-10.1038/13779.PubMedView ArticleGoogle Scholar
- Behar DM, Van OM, Rosset S, Metspalu M, Loogvali EL, Silva NM, et al: A “Copernican” reassessment of the human mitochondrial DNA tree from its root. Am J Hum Genet. 2012, 90: 675-684. 10.1016/j.ajhg.2012.03.002.PubMedPubMed CentralView ArticleGoogle Scholar
- Salas A, Coble M, Desmyter S, Grzybowski T, Gusmao L, Hohoff C, et al: A cautionary note on switching mitochondrial DNA reference sequences in forensic genetics. Forensic Sci Int Genet. 2012, 6: e182-e184. 10.1016/j.fsigen.2012.06.015.PubMedView ArticleGoogle Scholar
- Bandelt H-J, Kloss-Brandstätter A, Richards MB, Yao YG, Logan I: The case for the continuing use of the revised Cambridge Reference Sequence (rCRS) and the standardization of notation in human mitochondrial DNA studies. J Hum Genet. 2013, doi:10.1038/jhg.2013.120Google Scholar
- Weissensteiner H, Schönherr S, Specht G, Kronenberg F, Brandstätter A: eCOMPAGT integrates mtDNA: import, validation and export of mitochondrial DNA profiles for population genetics, tumour dynamics and genotype-phenotype association studies. BMC Bioinforma. 2010, 11: 122-10.1186/1471-2105-11-122.View ArticleGoogle Scholar
- Kloss-Brandstätter A, Pacher D, Schönherr S, Weißensteiner H, Binna R, Specht G, et al: HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum Mutat. 2011, 32: 25-32. 10.1002/humu.21382.PubMedView ArticleGoogle Scholar
- Irwin JA, Ikramov A, Saunier J, Bodner M, Amory S, Röck AW, et al: The mtDNA composition of Uzbekistan: a microcosm of Central Asian patterns. Int J Legal Med. 2010, 124: 195-204. 10.1007/s00414-009-0406-z.PubMedView ArticleGoogle Scholar
- Lee HY, Yoo J-E, Park MJ, Chung U, Shin K-J: Mitochondrial DNA control region sequences in Koreans: identification of useful variable sites and phylogenetic analysis for mtDNA data quality control. Int J Legal Med. 2006, 120: 5-14. 10.1007/s00414-005-0005-6.PubMedView ArticleGoogle Scholar
- Excoffier L, Lischer HE: Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010, 10: 564-567. 10.1111/j.1755-0998.2010.02847.x.PubMedView ArticleGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.PubMedPubMed CentralView ArticleGoogle Scholar
- Tamura K, Nei M: Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993, 10: 512-526.PubMedGoogle Scholar
- Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4: 406-425.PubMedGoogle Scholar
- Drummond AJ, Suchard MA, Xie D, Rambaut A: Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012, 29: 1969-1973. 10.1093/molbev/mss075.PubMedPubMed CentralView ArticleGoogle Scholar
- Hasegawa M, Kishino H, Yano T: Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J Mol Evol. 1985, 22: 160-174. 10.1007/BF02101694.PubMedView ArticleGoogle Scholar
- Rambaut A, Drummond AJ: Tracer v1.4. http://beast.bio.ed.ac.uk/Tracer. 2007. Ref Type: Online Source
- Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.PubMedPubMed CentralView ArticleGoogle Scholar
- Beerli P, Felsenstein J: Maximum-likelihood estimation of migration rates and effective population numbers in two populations using a coalescent approach. Genetics. 1999, 152: 763-773.PubMedPubMed CentralGoogle Scholar
- Beerli P, Felsenstein J: Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach. Proc Natl Acad Sci U S A. 2001, 98: 4563-4568. 10.1073/pnas.081068098.PubMedPubMed CentralView ArticleGoogle Scholar
- Beerli P: Comparison of Bayesian and maximum-likelihood inference of population genetic parameters. Bioinformatics. 2006, 22: 341-345. 10.1093/bioinformatics/bti803.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.