Evidence of recombination among early-vaccination era measles virus strains
© Schierup et al; licensee BioMed Central Ltd. 2005
Received: 11 April 2005
Accepted: 06 October 2005
Published: 06 October 2005
The advent of live-attenuated vaccines against measles virus during the 1960'ies changed the circulation dynamics of the virus. Earlier the virus was indigenous to countries worldwide, but now it is mediated by a limited number of evolutionary lineages causing sporadic outbreaks/epidemics of measles or circulating in geographically restricted endemic areas of Africa, Asia and Europe. We expect that the evolutionary dynamics of measles virus has changed from a situation where a variety of genomic variants co-circulates in an epidemic with relatively high probabilities of co-infection of the individual to a situation where a co-infection with strains from evolutionary different lineages is unlikely.
We performed an analysis of the partial sequences of the hemagglutinin gene of 18 measles virus strains collected in Denmark between 1965 and 1983 where vaccination was first initiated in 1987. The results were compared with those obtained with strains collected from other parts of the world after the initiation of vaccination in the given place. Intergenomic recombination among pre-/early-vaccination strains is suggested by 1) estimations of linkage disequilibrium between informative sites, 2) the decay of linkage disequilibrium with distance between informative sites and 3) a comparison of the expected number of homoplasies to the number of apparent homoplasies in the most parsimonious tree. No significant evidence of recombination could be demonstrated among strains circulating at present.
We provide evidence that recombination can occur in measles virus and that it has had a detectable impact on sequence evolution of pre-vaccination samples. We were not able to detect recombination from present-day sequence surveys. We believe that the decreased rate of visible recombination may be explained by changed dynamics, since divergent strains do not meet very often in current epidemics that are often spawned by a single sequence type. Signs of pre-vaccination recombination events in the present-day sequences are not strong enough to be detectable.
Measles virus (MV) has a genome with negative polarity, consisting of non-segmented single-stranded RNA of approximately 15.9 kb. MV belongs to the Paramyxoviridae family in the order of Mononegavirales and only a single serotype is known. It is among the most infectious viruses known for humans, and no other host species has been identified. Only human populations of a considerable size are able to sustain circulation . Global vaccination programs have resulted in a dramatic decline in measles cases and the documented discontinuation of indigenous circulation in a number of countries [2–4] has encouraged authorities to accomplish global control of measles. However, measles still cause a large number of deaths every year, mainly in developing countries, where endemic circulation of MV is still ongoing [5, 6] as a result of poor vaccination coverage.
Intergenomic recombination has been documented in a vast number of virus species within most families of RNA and DNA viruses. Recombination can allow variants of a population to escape from the current fitness peak (escape from Muller's ratchet) and re-appear with a new phenotypic make-up and/or even re-establish in a new host-relationship [7, 8]. However, recombination has never convincingly been documented in species of the Mononegavirales, and it is speculated whether this reflects an inability of these viruses to recombine. Yet, the order of Mononegavirales comprises groups of viruses with an apparently large evolutionary potential with frequent shifts of host species. Over the past decades, a considerable number of Mononegavirales members causing diseases in host species in which they had not previously been recognized have been identified. Examples are phocid distemper virus , Hendra virus , Menangele virus , Nipah virus  – not to forget the re-emerging divergent members of the Filoviridae (Marburg -and Ebola-like viruses) [see [13, 14]]. The origins of these emerging viruses have not been identified and the mechanism(s) of their ability to explore new niches remains enigmatic.
Co-infection of host cells with phylogenetically distinct virus strains is required for recombination events to be detectable in sequence surveys. As a result of the global vaccination against measles a situation of endemic co-circulation of multiple strains [5, 6, 15] shifted to a situation with a limited number of strains being re-introduced to susceptible subpopulations in major geographical regions [2–4, 16]. By sequencing part of the hemagglutinin gene , we recently characterized 18 MV strains collected during the pre-/early-vaccination era in Denmark. In the present study, the partial sequence of the hemagglutinin coding region of those older strains is subjected to further analysis and compared with strains sampled after vaccination (generally more recently identified) using various approaches to test for the presence of intergenomic recombination.
Results and discussion
The term pre-/early-vaccination era isolates used for isolates collected in Denmark during the period of 1965–83 is meant to reflect that these isolates are from a period when vaccination against measles was not practiced in Denmark but was gradually becoming common practice in many other countries in the World. Thus, it cannot be excluded that vaccination in other countries influenced measles virus strains circulating in Denmark at the time, but it is anticipated that these isolates still bear valuable information on the nature of strains circulating before an influence of vaccination was imposed.
Basic population genetics summary of the data sets of the pre-vaccination and post-vaccination eras.
Danish early-vaccination era data set
Global post-vaccination era data set
Length of sequences
Number of sequences
Average number of differences
# synonymous substitutions
# non-synonymous substitutions
Average Pi (nucleotide diversity)
1. all sites
2. synonymous sites
3. non-synonymous sites
Codon usage bias (ENC)
Figure 1b shows a tree with the same sequences as in Figure 1a, but adding the 18 pre-/early-vaccination era isolates from Denmark. As shown previously , the Danish isolates cluster with genome types A, C2 and E. It is also clear that the inclusion of pre-/early-vaccination era samples makes the distinction of these three genome types less obvious since the similarity distances between post-vaccination era representatives of the genome types A, C2 and E are broken down to the sum of minor differences between the pre-/early-vaccination era isolates. This might reflect that the pre-/early-vaccination era sequences are generally older than the rest and thus at the basis of division of genome types as are also the strains used for vaccine development (e.g. Edmonston). Alternatively, frequent recombination among pre-/early-vaccination era genome types at that time would lead to a poorly resolved tree. Less recombination among different surviving strains after vaccination would then lead to the more differentiated genome types seen today. The subsequent recombination analysis addresses this possibility.
Analysis of recombination
Summary of numerical analysis of recombination in the samples of the pre-vaccination and post-vaccination eras A negative correlation implies that LD decays with distance, the P-values are obtained by a permutation test (see Methods).
-0.26 (P < 0.01)
-0.03 (P < 0.05)
D' correlation (P-value)
-0.32 (P < 0.001)
0.01 (P > 0.05)
LDhat estimate of ρ (P-value)
15.8 (P < 0.002)
7.2 (P > 0.05)
An analysis of expected number of homoplasies caused by parallel mutations and the number of apparent homoplasies in the most parsimonious tree was performed on the pre-/early-vaccination data set [see [32, 35]]. The basic idea is to investigate whether the number of apparent homoplasies in the most parsimonious tree is likely to be due to parallel (recurrent) mutations. If this is not the case, it is a strong indication of recombination since recombination easily creates "incompatible pairs of sites", i.e. pairs of sites that do not fit into the same phylogenetic tree, and thus will appear as homoplasies if assuming no recombination and a single phylogenetic tree. There are 800 sites. Of these, 267 are third position sites, and among these we observe 20 transition mutations. If we assume equal mutation rates at all third position sites, then the number of transition mutations we expect have happened while observing 20 different ones is 20.8+/-0.9 (calculated as the sum of geometric distributions). In the most parsimonious tree found using PAUP*, there are 26 transition mutations. In other words, given the observed number of mutation events of the transition type, we expect 0.8+/-0.9 parallel mutations/homoplasies under the assumption of no recombination (and no codon usage bias). We observe, however, 6 parallel mutations in the tree. This suggests that some of the apparent homoplasies resulted from recombination rather than from recurrent mutations. We also calculated the effective sites number  from the observed codon usage bias. However, since the codon usage bias is low, the effect is minor, and only 1.0+/-1.0 homoplasies are expected under this model, again significantly smaller than the six apparent homoplasies. A large amount of mutation rate heterogeneity at the synonymous sites offers an alternative explanation of the excess homoplasies. While this explanation cannot be ruled out, analysis of the post-vaccination data set does not support large rate heterogeneity at silent sites in the evolution of present-day measles virus, since the number of homoplasies in this data set can be explained by recurrent mutations (results not shown).
Given these different lines of evidence for recombination, it is of interest to try to estimate the rate of recombination needed to explain the data. The most appropriate method is the finite site, composite likelihood approach implemented in LDhat . The result (Table 2) is that a significantly positive recombination rate is found in the pre-/early-vaccination sequences, much reflecting the results of the similar R2 test. An estimated rate of ρ = 15.8 corresponds to that an expected 45 recombination events have happened in the ancestral history of the 18 pre-/early-vaccination sequences. The estimated recombination rate appears smaller than rates reported for HIV and other viruses . The estimated recombination rate in the post-vaccination era data sets also shows a positive rate of ρ, but it not significantly different from zero by the permutation test (Table 2). The lack of evidence of recombination in present day sequences of MV strains is consistent with what was also observed by .
In conclusion, the analysis of pre-/early-vaccination era MV sequences shows evidence of recombination at rates important to the evolution of MV. The five different tests of recombination should not be considered independent tests and some of the results might be explained by alternative mechanisms such as convergent evolution of functionally important sites and rate heterogeneity of synonymous variation. However, all tests agree and provide evidence of recombination both through an excess of apparent homoplasies compared to the expected frequency of parallel mutations, and through a decrease in LD with distance, which is difficult to explain by any other hypothesis than recombination. Furthermore, it is difficult to imagine a mechanism other than recombination by which apparent homoplasies could occur pairwise or in triplets in distant parts of the sequence considering also that such patterns are not seen in the post-vaccination data set.
The evidence of recombination among pre-/early-vaccination era MV strains and the lack of detection of recombination among post-vaccination era MV strains are consistent with the shift in epidemiology from a situation of co-circulation of strains in populations to a bottleneck situation with incidental introduction of a single strain to a susceptible sub-population of a geographical region. Given that intergenomic recombination and co-infection of individuals are common phenomena of MV it might be assumed that the lineages surviving till today did emerge from a pool of recombining pre-vaccination era strains. A lower level or absence of recombination due to changed epidemiology since then has erased our ability to detect recombination in a global sample of present day MV despite its high level of variability. It is possible that the Danish pre-/early-vaccination era strains are representatives of the very pool in which recombination took place while present-day MV strains are representatives of temporally and/or geographically separated lineages. Analysing informative sites in other parts of the MV genome of present-day lineages render it unlikely that these lineages could have derived from a clonal population structure of a global pool of MV strains (L. S. Christensen, unpublished data).
The template for replication in members of the Paramyxovirinae is a nucleocapsid complex in which each nucleocapsid monomer (N) is predicted to be associated by hydrophobic bonds with 6 nucleotides in such a way as to resist non-ionic detergent and high salt and to protect the RNA from RNase digestion . This tight association of RNA and protein has been dubbed "the rule of six" and excludes the intracellular presence of naked viral RNA molecules. It raises the question of the mechanism of RNA polymerase template recognition and has provided an explanation of the hypothesis that recombination possibly cannot occur in this group of viruses. Our data suggests that a mechanism of partial unwinding of the nucleocapsid structure exists to allow homologous intergenomic recombination or RNA polymerase template shift during replication.
Measles virus appear to possess the ability to recombine but the present-day epidemiology of the virus where different sequence types rarely or never meet make the impact of recombination on the distribution of sequence diversity negligible. However, in the prevaccination area, endemic MV allowed more divergent strains to meet and recombine. The present-day strains are thus descendants of recombined sequences but the signal of the early recombination is lost in present-day sequences.
The Danish pre-/early-vaccination sequences consist of an 800 base-pair region (nt. 659 to 1458) of the hemagglutinin coding region of 18 strains collected in Denmark, Greenland and the Faeroe Islands between 1965 and 1983 (GenBank accession numbers AJ417850-AJ417867) . Post-vaccination sequences of 40 strains, representative of the 22 phylogenetic clusters identified [17–19], were trimmed to match the region of the pre-/early-vaccination era sequences. GenBank accession numbers of 33 of these sequences can be found in . The remaining 7 HA sequences that complete the list of globally circulating genome types, identified at present, are those of strains Mvi/Gambia/93 (Type B3, Acc. No AF484955) , MVi/Alberta.CAN/20.00/1 (Type D7, Acc. No. AF410986) (G.A. Tipples et al., submitted 14-Aug-2001), MVi/Montreal.CAN/19.98 (Type D8, Acc. No. AF410985) (G.A. Tipples et al., sumitted 14-Aug-2001), MVi/Vic.AU/12.99 (Type D9, Acc No. AY127853) , (Type G2, Acc. No. AF243851) , MVi/Gresik.INO/18.02 (Type G3, Acc. No. AY184218) (P.A. Rota and S.L. Liffick, submitted 19-Nov-2002), and China94-1 (Type H2, Acc. No. AF045203) .
The alignment of the 18 pre-/early-vaccination and 40 post-vaccination sequences does not contain any gaps. The computer program DnaSP 3.99  was used for estimation of standard parameters of population genetics. Segregating sites were classified as synonymous or non-synonymous, except for complex codons where a site may be classified as both. Phylogenetic trees from the post-vaccination data set and the two data sets combined were built using MEGA 2.1  and the minimum evolution criterion. The HKY substitution model  was assumed. Bootstrap values were estimated from 2000 re-samples.
Recombination was examined using five different, but complementary, approaches. (i) A graphical method by which sets of sites in strong linkage disequilibrium (LD) were visually identified and marked with different colourings (Fig. 2). It was then investigated whether different sets of blocks were incompatible by the four-gamete test . If blocks are incompatible, one will infer either recombination or recurrent mutation of the sites in a block at about the same time. (ii) A plot of significant pairwise linkage disequilibria was constructed using DnaSP. (iii) Decay of linkage disequilibrium with distance (measured as either the squared correlation coefficient R2 or the standardized measure D') was investigated following  and estimated using the R2-program . Analysis was restricted to informative sites. (iv) Estimation of the scaled population recombination rate ρ by a composite maximum likelihood approach , using the LDhat program of . This program also allows a test of the null hypothesis of no recombination (ρ = 0) by a permutation test. This analysis was also restricted to informative sites. (v) Comparison of the expected number of homoplasies to the number of apparent homoplasies in the most parsimonious phylogenetic tree, closely following the approach of , using PAUP* .
We thank Oliver Pybus, Roald Forsberg, Freddy B. Christensen and an anonymous reviewer for very helpful comments to the manuscript,. Rikke Jonson, Lis Nielsen and Gunilla Trolle at the Department of Clinical Microbiology, Rigshospitalet, and Ellen Christensen at Department of Virology, Statens Serum Institut are acknowledged for expert technical assistance, Enette B. Knudsen for editing. The study was supported by Sygekassernes Helsefond (grant No. 11/282-94), the Danish Medical Research Council (grant No. 12-1667), the Novo Nordisk Foundation and the Danish Natural Sciences Research Council (grant no. 1262). Frank Jorgensen and Paul Sharp are acknowledged for valuable discussions.
- Griffin DE: Measles virus. Virology. Edited by: Knipe DM, Howley PM. 2001, Philadelphia: Lippincott Williams & Wilkins, 1401-1441. 4Google Scholar
- Rota JS, Heath JL, Rota PA, King GE, Celma ML, Carabaña J, Fernandez-Muñoz R, Brown D, Jin L, Bellini WJ: Molecular epidemiology of measles virus: identification of pathways of transmission and implications for measles elimination. JID. 1996, 173: 32-37.View ArticlePubMedGoogle Scholar
- Rima BK, Earle JAP, Yeo RP, Herlihy L, Baczko K, ter Meulen V, Carabana J, Caballero M, Celma ML, Fernandez-Munoz R: Temporal and geographical distribution of measles virus genotypes. J Gen Virol. 1995, 76: 73-1180.View ArticleGoogle Scholar
- Dahl L, Christensen LS, Schöller S, Westh H, Plesner A-M: Sequence analysis of the hemagglutinin gene of measles virus isolates in Denmark 1997–1998: no evidence of persistent circulation of measles virus in Denmark. APMIS. 2000, 108: 267-72. 10.1034/j.1600-0463.2000.d01-54.x.View ArticlePubMedGoogle Scholar
- Hanses F, Truong AT, Ammerlaan W, Ikusika O, Adu F, Oyefolu AO, Omilabu SA, Muller CP: Molecular epedemioology of Nigerian and Ghanian measles virus isolates reveals a genotype circulating widely in western and central Africa. J Gen Virol. 1999, 80: 871-7.View ArticlePubMedGoogle Scholar
- Truong AT, Kreis S, Ammrelaan W, Hartter HK, Adu F, Omilabu SA, Berbers GAM, Muller CP: Genotyping and antigenic characteristics of hemagglutinin proteins of African measles virus isolates. Virus Res. 1999, 62: 89-95. 10.1016/S0168-1702(99)00072-6.View ArticlePubMedGoogle Scholar
- Christensen LS: The population biology of suid herpesvirus 1. Acta Pathologica Microbiologica et Immunologica Scandinavica. 1995, 103 (Supplementum 48): 1-48.Google Scholar
- Burke DS: Recombination in HIV: an important viral evolutionary strategy. Emerging Infectious Diseases. 1997, 3: 253-259.PubMed CentralView ArticlePubMedGoogle Scholar
- Osterhaus AD, Vedder EJ: Identification of viruses causing recent seal deaths. Nature. 1988, 335 (6185): 20-10.1038/335020a0.View ArticlePubMedGoogle Scholar
- O'Sullivan JB, Allworth AM, Paterson DL, Snow TM, Boots R, Gleeson LJ: Fatal encephalitis due to novel paramyxovirus transmitted from horses. Lancet. 1997, 349: 93-95. 10.1016/S0140-6736(96)06162-4.View ArticlePubMedGoogle Scholar
- Philbey AW, Kirkland PD, Ross AD, Davis RJ, Gleeson AB, Love RJ, Daniels PW, Gould AR, Hyatt AD: An apparently new virus (Family Paramyxoviridae) infectious for pigs, humans and fruit bats. Emerging Infectious Diseases. 1998, 4: 269-275.PubMed CentralView ArticlePubMedGoogle Scholar
- Chua KB, Bellini WJ, Rota PA, Harcourt BH, Tamin A, Lam SK, Ksiazek TG, Rollin PE, Zaki SR, Shieh W, Goldsmith CS, Gubler DJ, Roehrig JT, Eaton B, Gould AR, Olson J, Field H, Daniels P, Ling AE, Peters CJ, Anderson LJ, Mahy BW: Nipah virus: a recently emergent deadly paramyxovirus. Science. 2000, 288: 1432-1435. 10.1126/science.288.5470.1432.View ArticlePubMedGoogle Scholar
- Monath TP: Ecology of Marburg and Ebola viruses: speculations and directions for future research. J Infect Dis. 1999, 179 (Suppl 1): S127-138.View ArticlePubMedGoogle Scholar
- Balter M: On the trail of Ebola and Marburg viruses. Science. 2000, 290: 924-925.Google Scholar
- Christensen LS, Schöller S, Schierup MH, Vestergaard BF, Mordhorst CH: Sequence analysis of pre- and early-vaccination era strains of measles virus in Denmark 1965–83 reveal diversity of ancient strains circulating globally. APMIS. 2002, 110: 113-22. 10.1034/j.1600-0463.2002.100201.x.View ArticlePubMedGoogle Scholar
- Mulders MN, Truong AT, Muller CP: Monitoring of measles elimination using molecular epidemiology. Vaccine. 2001, 19: 2245-2249. 10.1016/S0264-410X(00)00453-9.View ArticlePubMedGoogle Scholar
- World Health Organisation: Expanded programme on immunization-standardization of the nomenclature for describing the genetic characteristics of wild-type measles virus. Weekly Epidemiological Record. 1998, 73: 265-272.Google Scholar
- World Health Organisation: Nomenclature for describing the genetic characteristics of wild-type measles viruses (update). Weekly Epidemiological Record. 2001, 76: 249-251.Google Scholar
- Rota PA, Bellini WJ: Update on the global distribution of genotypes of wild type measles viruses. J Infect Dis. 2003, 187 (Suppl 1): S270-6. 10.1086/368042.View ArticlePubMedGoogle Scholar
- Waku DK, Nerrienet E, Mfoupouendoun J, Tene G, Whittle H, Wild TF: Measles virus strains circulating in Central and West Africa: Geographical distribution of two B3 genotypes. J Med Virol. 2002, 68 (3): 433-440. 10.1002/jmv.10222.View ArticleGoogle Scholar
- Chibo D, Riddell M, Catton M, Lyon M, Lum G, Birch C: Studies of measles viruses circulating in Australia between 1999 and 2001 reveal a new genotype. Virus Res. 2003, 91 (2): 213-221. 10.1016/S0168-1702(02)00273-3.View ArticlePubMedGoogle Scholar
- Rota PA, Liffick S, Rosenthal S, Heriyanto B, Chua KB: Measles genotype G2 in Indonesia and Malaysia. Lancet. 2000, 355 (9214): 1557-1558. 10.1016/S0140-6736(05)74612-2.View ArticlePubMedGoogle Scholar
- Xu W, Tamin A, Rota JS, Zhang L, Bellini WJ, Rota PA: New genetic group of measles virus isolated in the People's Republic of China. Virus Res. 1998, 54 (2): 147-156. 10.1016/S0168-1702(98)00020-3.View ArticlePubMedGoogle Scholar
- Rozas J, Sánchez-DelBarrio JC, Messeguer X, Rozas R: DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics. 2003,Google Scholar
- Kumar S, Tamura K, Jakobsen IB, Nei M: MEGA2: molecular evolutionary genetics analysis software. Bioinformatics. 2001, 17: 1244-5. 10.1093/bioinformatics/17.12.1244.View ArticlePubMedGoogle Scholar
- Hasegawa M, Kishino H, Yano T: Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. Journal of Molecular Evolution. 1985, 22: 160-174. 10.1007/BF02101694.View ArticlePubMedGoogle Scholar
- Hudson RR, Kaplan NL: Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics. 1985, 111: 147-64.PubMed CentralPubMedGoogle Scholar
- Awadalla P, Eyre-Walker A, Smith JM: Linkage disequilibrium and recombination in hominid mitochondrial DNA. Science. 1999, 286: 2524-2525. 10.1126/science.286.5449.2524.View ArticlePubMedGoogle Scholar
- Recombination analysis software. [http://www.brics.dk/~compbio/r2]
- Fearnhead P, Donnelly P: Estimating recombination rates from population genetic data. Genetics. 2001, 159: 1299-1318.PubMed CentralPubMedGoogle Scholar
- McVean GAT, Awadalla P, Fearnhead P: A coalescent-based method for detecting recombination from gene sequences. Genetics. 2002, 160: 1231-1241.PubMed CentralPubMedGoogle Scholar
- Eyre-Walker A, Smith NH, Smith JM: How clonal are human mitochondria?. Proceedings of the Royal Society of London Series B-Biological Sciences. 1999, 266: 477-483. 10.1098/rspb.1999.0662.View ArticleGoogle Scholar
- Swofford DL: PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. 2001, Sinauer Associates, Sunderland, MassachusettsGoogle Scholar
- Krings M, Stone A, Schmitz RW, Krainitzki H, Stoneking M, Paabo S: Neandertal DNA sequences and the origin of modern humans. Cell. 1997, 90: 19-30. 10.1016/S0092-8674(00)80310-4.View ArticlePubMedGoogle Scholar
- Smith JM, Smith NH: Detecting recombination from gene trees. Mol Biol Evol. 1998, 15: 590-599.View ArticlePubMedGoogle Scholar
- Egelman EH, Wu SS, Amrein M, Protner A, Murti G: The Sendai virus nucleocapsid exists in at least four different helical states. J Virol. 1989, 63: 2233-2243.PubMed CentralPubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.