- Research article
- Open Access
Evolutionary analysis of foot-and-mouth disease virus serotype SAT 1 isolates from east africa suggests two independent introductions from southern africa
BMC Evolutionary Biologyvolume 10, Article number: 371 (2010)
In East Africa, foot-and-mouth disease virus serotype SAT 1 is responsible for occasional severe outbreaks in livestock and is known to be maintained within the buffalo populations. Little is known about the evolutionary forces underlying its epidemiology in the region. To enhance our appreciation of the epidemiological status of serotype SAT 1 virus in the region, we inferred its evolutionary and phylogeographic history by means of genealogy-based coalescent methods using 53 VP1 coding sequences covering a sampling period from 1948-2007.
The VP1 coding sequence of 11 serotype SAT 1 FMD viruses from East Africa has been determined and compared with known sequences derived from other SAT 1 viruses from sub-Saharan Africa. Purifying (negative) selection and low substitution rates characterized the SAT 1 virus isolates in East Africa. Two virus groups with probable independent introductions from southern Africa were identified from a maximum clade credibility tree. One group was exclusive to Uganda while the other was present within Kenya and Tanzania.
Our results provide a baseline characterization of the inter-regional spread of SAT 1 in sub-Saharan Africa and highlight the importance of a regional approach to trans-boundary animal disease control in order to monitor circulating strains and apply appropriate vaccines.
Foot-and-mouth disease (FMD) is an acute, highly communicable and economically important disease of livestock and it also affects wild ruminants . The causative agent, foot-and-mouth disease virus (FMDV) belongs to the Aphthovirus genus in the family Picornaviridae. Its positive-sense, single-stranded RNA genome of 8.5 kb is translated into a polyprotein which is post-translationally cleaved to 4 structural (VP1, VP2, VP3, VP4) and 8 nonstructural proteins . The structural proteins form the capsid of the virion and, with the exception of VP4, are surface exposed. The VP1 is involved in the interaction with the host cells via the RGD-dependent integrins . The coding sequence for VP1 has been widely used in studies of evolutionary dynamics of FMDV needed for the understanding of the epidemiological patterns of these viruses and for determining possible sources of outbreaks [4–6]. The genetic diversity of FMDV is a consequence of the high mutation rate due to the error-prone RNA polymerase lacking proofreading activity .
There are seven immunologically distinct serotypes (O, A, C, SAT 1, SAT 2, SAT 3 and Asia 1) of FMDV, each with a wide spectrum of antigenic and epidemiological subtypes distributed around the world . The Southern Africa Territories (SAT) serotypes are restricted in their distribution mainly to sub-Saharan Africa and they co-exist with the Euro-Asiatic (O, A, C) serotypes in the East African region although serotype C has not been reported since 2004. In southern Africa, the epidemiology of the SAT serotypes is mainly associated with African buffalos (Syncerus caffer) which act as reservoirs and sources of outbreaks [8, 9]. In eastern Africa, FMD is prevalent in wildlife and within the African buffalo in particular although their role in the epidemiology of the disease has not been as widely studied as in southern Africa. Most outbreaks of FMD in the region are reported among livestock populations. The African buffalo has been reported to be a carrier of the SAT serotypes but not the Euro-Asiatic serotypes in East Africa [10–12]. This is similar to the situation in southern Africa. Widespread animal movements in the eastern Africa region are possibly responsible for long-term circulation and reintroductions of FMDV strains, including SAT 1 . However, little quantitative information exists about the extent of such livestock and wildlife mediated dispersal of FMDV as well as the origin and evolutionary history of the SAT 1 viruses circulating in eastern Africa [13, 14]. Furthermore, the connectivity between the individual countries and the main routes of dispersal remain unknown, although such information would be of great value in containing the spread of the disease and avoiding introduction of novel strains against which existing vaccine programs may offer little protection.
We have investigated the emergence of FMDV SAT 1 diversity in the region by inferring the phylogeographic history by means of genealogy-based coalescent methods. Furthermore, we have tested for evidence of recombination in the data set which is known to bias phylogenetic inferences as described previously [15–17].
Phylogenetic relationships, substitution rates and divergence times
The VP1 coding sequences of 11 additional serotype SAT 1 FMD viruses from East Africa have been determined. Using this information, the complete VP1 coding sequences of 8 southern Africa, 14 western Africa, 3 Sudanese, 1 Ethiopian and 27 East Africa FMD serotype SAT 1 viruses from the period 1948 to 2007 were analysed to determine phylogenetic relationships, phylogeography, divergence times and substitution rates. Dating of the common root of the samples showed considerable uncertainty in determination with a mean estimate for the most recent common ancestor (TMRCA) at 538 years before present (ybp) (95% highest posterior density (HPD): 228-897 ybp). The inferred maximum clade credibility (MCC) tree is shown in Figure 1 with the posterior probabilities for the branches shown. The East African SAT 1 viruses formed two main clades (lineages) labelled A and B supported by high posterior probabilities. The Ugandan viruses differed from those of Tanzania and Kenya and are of mainly one lineage (A) while one isolate (UGA/13/74) grouped with viruses of the Sudan and western Africa. Kenyan and Tanzanian viruses grouped together in lineage B and were related to a Zimbabwean isolate. Only little geographic structure was observed within lineage B isolates from Kenya and Tanzania, suggesting high migration rates between these countries. The mean nt substitution rate was 1.30 × 10-3 substitutions/site/year (s/s/yr) (95% HPD: 5.43 × 10-4-2.18 × 10-3) with distinct variation in rates among the clades. We analysed the East African viruses (comprising 27 samples) separately in BEAST and found relatively lower rates at 2.75 × 10-4 s/s/yr (4.69 × 10-5-7.39 × 10-4), while the western African viruses (comprising 14 samples) had higher rates at 6.91 × 10-3s/s/yr (3.32 × 10-3-1.04 × 10-2).
While the location of the root of the SAT1 tree could not be identified with particular confidence (Bayes factor, BF = 1.5 comparing the posterior probability of the root being in southern Africa against Sudan), there was relatively strong support for the location of several of the remaining nodes in the MCC tree (Figure 1). From the location-annotated MCC tree, two separate introductions from southern Africa to East Africa were supported by the data, namely one leading to lineage A and one leading to lineage B. In addition, there was strong support for two separate introductions of SAT 1 from Sudan to western Africa. Bayes factor tests revealed that the most significant routes of inter-regional dispersal were the Sudan-western Africa (BF = 21.4) and southern Africa-Kenya/Tanzania (BF = 4.5). No link between the Ugandan and the Kenyan/Tanzanian samples (between lineage A and B) could be identified, and this was in fact found to be the link with the second-lowest posterior support. The western Africa-Kenya/Tanzania link had the lowest support (results not shown).
Predominant purifying selection in the VP1 coding region of FMDV SAT 1
The majority of codons in the VP1 coding region of FMDV SAT 1 appeared to be under purifying (negative) selection. Of the 221 codons analysed, 153 were found to be under negative selection using three methods (single-likelihood ancestor counting, SLAC, at P = 0.1, fixed effects likelihood, FEL, at P = 0.1 and random effects likelihood, REL, with BF > 50) as summarized in Table 1. Five sites (codons 47, 61, 99, 143 and 147) were identified to be under positive selection by at least one method but no site was identified by all the three methods together at values of P = 0.1 (SLAC and FEL) or BF > 50 (REL). However, at P = 0.2, codon 147 (H/N/E/T, 2 codon positions before the receptor binding motif RGD) was identified by all the methods and was mostly likely to be under true positive selection.
The genetic algorithm (GA) branch analysis showed that 5 rate classes were supported with a large number of models (over 1500 in the 95% confidence set). No branches had significant support for dN > dS although differences existed in the branch selection pattern indicating that some branches may have been under weak positive selection.
The genetic algorithm for recombination detection (GARD) detected a putative recombination breakpoint at nucleotide position 168 with a change in Akaike's Information Criterion (AICc) of more than100 which suggested support for the recombination model while the Kishino-Hasegewa test showed support for significant topological incongruence at P = 0.01. Indeed, the exploratory analysis using the recombination detection programme (RDP2) had at least one method detect some recombinant sequences (TAN/60/99 and K66/80). However, further analysis did not support the view that these sequences were recombinant and the exclusion of these sequences from the analysis did not affect the phylogenetic results, indicating that they are not likely to be true recombinants (results not shown).
FMDV serotype SAT 1 virus strains from East Africa analysed in this study grouped into 2 distinct clades (lineages with > 20% nucleotide divergence) designated here as lineages A and B. While one of these lineages (A) was found exclusively in Uganda, the other had virus strains from Tanzania and Kenya. Over the whole sampling period, Kenyan and Tanzanian isolates were interspersed in one clade of the phylogenetic tree, suggesting that these countries form a single ecosystem for SAT 1. The separate introduction of lineages A and B to eastern Africa from southern Africa was supported by the high posterior probabilities of the location states in the phylogeographic analysis. A close association of Kenyan/Tanzanian and southern African lineages has been observed earlier , but the link between Ugandan and southern Africa lineages reported here reveals a previously undiscovered aspect of the ancestry of the East African SAT 1 lineages. This new finding was not due to a different data set, but rather to our Bayesian phylogeographic analysis framework. When we constructed a neighbour-joining tree using similar methods to  using our data set, we were not able to infer a southern African origin for the Ugandan lineage.
Several interesting aspects about the history of sub-Saharan SAT 1 viruses emerge from our continental phylogeographic approach. First, we found that the most likely root location of SAT 1 is in southern Africa. Because of the relatively deep root of the tree (~538 ybp), we could not achieve unequivocal posterior support for this root location (also see ). We found a strong link between western African and Sudanese SAT 1 sequences (in agreement with ) and our results suggest that the route of entry of SAT 1 into western Africa has been along the Sahel rather than through the rain forest belt surrounding equatorial Africa. A Ugandan isolate from 1974 was found to belong to a lineage otherwise consisting of Sudanese, Ethiopian and western African strains, and the phylogeographic analysis suggested this was an incursion from Sudan. Hence, Ugandan SAT 1 strains appear to be derived from two different sources, southern Africa and Sudan, respectively.
The sampling scheme used in this study may to some extent have affected the outcome of the phylogeographic analysis. For example, we cannot exclude that the inclusion of more samples from Uganda would alter the posterior state probability of some nodes in the tree to reflect an earlier introduction of SAT 1 into Uganda. Given that Uganda is represented in both of the two major clades, it may have played a more prominent role connecting southern African SAT 1 viruses with those of Sudan, Ethiopia and western Africa. Such a scenario seems plausible given the central location of Uganda according to our definition of location states. Furthermore, we cannot exclude that additional samples from Uganda will show phylogenetic affinity with the surrounding countries. This could be tested by acquiring more Ugandan samples. In fact, more recent SAT 1 virus isolates from Uganda have grouped within the Ugandan lineage A , in agreement with the phylogeographic conclusions reported here. In general, however, we stress that our findings should be viewed as a null hypothesis about continental SAT 1 dispersal against which studies based on more comprehensive sampling can be tested. Denser sampling (both temporally and spatially) can be expected to reveal novel dispersal patterns not observed here and further address the fine-scale historical movement of the serotype.
The substitution rate inferred in our study differs considerably from . This leads to a significantly deeper tree, and hence it is difficult for us to put our results into a historical context that includes all FMDV serotypes. Our mean estimate of 538 ybp for the TMRCA of SAT 1 actually predates that of the whole FMDV found in , although it is within the 95% HPDs reported in that study (218-1250 ybp). We caution that the time line of our phylogeographic tree should not be regarded as conclusive and that further studies are needed to establish the rate of evolution in FMDV. Our inferred rate is, however, closer to the reported mean rate of evolution across all serotypes (2.48 × 10-3) for the VP1 coding sequence . In , the SAT 1 virus sequence was found to have a roughly 3-fold faster rate than the species average. We speculate that this exceptionally fast rate could be derived from the sampling scheme in , where many of the included SAT 1 isolates are from the same epidemic outbreaks. This tends to yield faster rates of evolution, since what is recovered is actually the mutation rate rather than the long-term substitution rate subject to selection and other forces , leading to a bias towards higher rates and a shallower tree. In accordance with this, we did find much faster rates of evolution in the western African samples; all collected during two epidemic outbreaks each spanning just two years. However, regionally variable evolutionary rates may in fact reflect real differences in the epidemiological dynamics and host-interaction of FMDV. For example, buffalos and other wildlife may play a more prominent role in the epidemiology of SAT 1 in eastern than in western Africa, and this may give rise to changed patterns of evolution of virus lineages in the two regions. Considerable localized differentiation in evolutionary rates has not previously been observed in FMDV, and although potentially informative concerning epidemiology and evolution, it also complicates evolutionary estimates based on global or widespread sample collections. Given these two (not necessarily mutually exclusive) causes of the observed rate heterogeneity, it is vital that future studies address the caveats in using the VP1 coding sequence to infer evolutionary rates and history.
Purifying (negative) selection was the most predominant evolutionary force at play among the SAT 1 viruses. At least 153 codon positions including the RGD motif (amino acid residue positions 149-151) of VP1, required for receptor interaction, were estimated to be under purifying selection signifying amino acid conservation as reflected in the low evolutionary rates. There was less evidence for positive selection although a few sites may have been under adaptive selection. Amino acid sites that are distinct between the regional virus groups as well as conservation of the RGD motif were observed when inferred using MEGA version 4  and is in agreement with previous reports . These evolutionary patterns may reflect the observed apparent long term circulation of some virus strains in the region previously reported in . It has also been observed that genetic heterogeneity may be limited by evolutionary constraints . There was no evidence for the presence of recombination within the VP1 coding sequences (in agreement with observations that recombination is largely restricted to non-structural coding regions with very few phylogenetic incongruities in the capsid proteins [24–26]) adding confidence to our results.
We have inferred the most likely phylogeographic history of SAT 1 in sub-Saharan Africa. We found evidence that the SAT 1 viruses circulating in Uganda and Kenya/Tanzania represent independent phylogeographic lineages. Kenya and Tanzania appear to experience a much greater exchange of viruses at their respective southern and northern borders through the trans-boundary livestock and wildlife movements (a common feature in this area) than with Uganda. This highlights the importance of a regional approach to trans-boundary animal disease control. It is apparent from the SAT 1 analysis presented here that monitoring of the emerging strains in the region is required for the success of vaccination strategies.
Eleven (10 Kenyan and 1 Tanzanian) SAT 1 virus isolates for this study (collected between 1977 and 2006) were obtained from the Embakasi FMD laboratory in Nairobi which is a repository of all FMD sample materials collected in Kenya. Virus was isolated from clinical material according to standard procedures on baby hamster kidney (BHK) cells. The details of the isolates are shown in Table 2.
Viral RNA extraction, cDNA synthesis and amplification
Total RNA was extracted and cDNA synthesized as previously described . The complete VP1 coding region was amplified using the primer pair, FMD AKS (5'-ATGGGACACAGGTCTGAACTCGA-3') and FMD-2B58  applying PCR reagent volumes and conditions as previously described . PCR products were visualized, purified and cycle-sequenced using the same primers as for PCR above.
In addition to the eleven sequences generated in the study, 42 (17 from East Africa and 25 the rest of Africa) other complete VP1 coding sequences available in the GenBank covering a sampling period from 1948-2007 were included to put the results from East Africa into a continental SAT 1 context.
The sequences were assembled and aligned using the software program Geneious version 4.6 . The best fitting nucleotide substitution model was tested by means of a hierarchical likelihood ratio test (LRT) and the Akaike information criteria (AIC) as implemented in MrModeltest version 2.2 software  and executed in PAUP* version 4b10 software . The selected model was general time-reversible (GTR)  with gamma-distributed rates among sites and a proportion of invariable sites.
Phylogenetic relationships, evolutionary rates and population size changes were co-estimated for the whole data set and geographic subsets using a Bayesian Markov Chain Monte Carlo (MCMC) method implemented in the BEAST (Bayesian evolutionary analysis sampling trees) software version 1.6.0 package http://beast.bio.ed.ac.uk using the selected model of nucleotide substitution. The method utilizes the sampling time of the sequences to infer rates of evolution along lineages, time of TMRCA and demographic history. A recent extension of the software allows tracking of the geographic location state along the phylogenetic tree, yielding posterior estimates of the location of each branch/node in the tree given the phylogenetic uncertainties . Given our limited data set with low representation of many countries, we defined the geographical states as six coherent regions roughly corresponding to areas separated by known topotype boundaries. The regions include: western Africa, Ethiopia, Sudan, Uganda, Kenya/Tanzania and southern Africa (Table 2). We used a Bayesian stochastic search variable selection (BSSVS) without distance informed priors on diffusion rates, as this has been shown not to improve confidence in the phylogeographical state assignment when dispersal patterns are complex such as with many viruses (e.g. ). Rate indicator log files were inspected in Tracer software version 1.4 http://tree.bio.ed.ac.uk/software/tracer/, and Bayes factor tests were carried out to test the most significant routes of dispersal using the Rate Indicator BF tool of the BEAST package.
In a preliminary analysis, we tested four different demographic models/coalescent priors as suggested in . In addition, we tested the appropriateness of a strict clock versus various versions of relaxed clocks available in BEAST . This process of model selection suggested the constant population size model with an uncorrelated exponential clock to be the best fit to the data. We used a HKY+G+I substitution model  with four rate categories, as in a recent influenza study (, Additional file 1). The MCMC chains were run long enough (100 million steps) to allow high effective sample sizes (ESSs) (above 250 for most parameters, minimum 100 for all parameters) with a 10% burn-in as viewed in Tracer. Statistical uncertainties of the substitution rates and the MRCA were summarized as the lower 95%, mean, and upper 95% values of the HPD interval. Mean evolutionary rates (averaged over branches weighted by their lengths) were measured as the number of nucleotide substitution per site per year (s/s/y). Maximum clade credibility trees were obtained using Tree Annotator program in BEAST and visualized with FigTree version 1.1.2 software http://tree.bio.ed.ac.uk/software/figtree/.
Selection and recombination detection
Tests for selection were performed using four methods which estimate selection in a phylogenetic context available in the Datamonkey web interface . The best-fitting nucleotide substitution model was selected using the automated link. To identify codon sites under positive (adaptive) or negative (purifying) selection, we used the single-likelihood ancestor counting, the fixed effects likelihood and the random effects likelihood methods. The SLAC and FEL methods estimate selection on a site-by-site basis with the former method comparing observed to expected synonymous and non-synonymous rates while the latter uses two models which assume independent and equal rates and a likelihood ratio test to determine significance. The REL method determines independent general discrete distributions for the global synonymous and non-synonymous rates using a codon based model which are then used as priors for Empirical Bayes analysis of site selection . The integrative selection analysis option in Datamonkey was then used to increase confidence on the estimation of selection at a site if all three methods support it. To test the hypothesis that different selective environments were acting on the branches of the phylogeny, we used the GA branch method to estimate dN/dS.
To add confidence to our coalescent inferences, the presence of recombination in the data was tested using the GARD method  on the Datamonkey server with topological incongruence significance estimated by the Kishino-Hasegawa test  and also by the exploratory methods implemented in RDP version 2 beta 0.8 software  which included; RDP,  GENECONV,  Bootscan,  MaxChi,  and Chimaera .
Grubman MJ, Baxt B: Foot-and-mouth disease. Clinical Microbiology Reviews. 2004, 17 (2): 465-493. 10.1128/CMR.17.2.465-493.2004.
Belsham GJ: Distinctive features of foot-and-mouth disease virus, a member of the picornavirus family; aspects of virus protein synthesis, protein processing and structure. Progress in Biophysics and Molecular Biology. 1993, 60: 241-260. 10.1016/0079-6107(93)90016-D.
Jackson T, King AMQ, Stuart DI, Fry E: Structure and receptor binding. Virus Research. 2003, 91: 33-46. 10.1016/S0168-1702(02)00258-7.
Haydon DT, Samuel AR, Knowles NJ: The generation and persistence of genetic variation in foot-and-mouth disease virus. Preventive Veterinary Medicine. 2001, 51: 111-124. 10.1016/S0167-5877(01)00210-0.
Knowles NJ, Samuel AR: Molecular epidemiology of foot-and-mouth disease virus. Virus Research. 2003, 91: 65-80. 10.1016/S0168-1702(02)00260-5.
Tully DC, Fares MA: Unravelling selection shifts among foot-and-mouth disease virus (FMDV) serotypes. Evolutionary Bioinformatics. 2006, 2: 211-225.
Domingo E, Escarmí's C, Baranowski E, Ruiz-Jarabo CM, Carrillo E, Núñez JI, Sobrino F: Evolution of foot-and-mouth disease virus. Virus Research. 2003, 91: 47-63. 10.1016/S0168-1702(02)00259-9.
Condy JB, Hedger RS, Hamblin C, Barnett ITR: The duration of foot-and-mouth disease virus carrier state in African buffalo (i) in the individual animal and (ii) in a free-living herd. Comparative Immunology, Microbiology and Infectious Diseases. 1985, 8 (3/4): 259-265. 10.1016/0147-9571(85)90004-9.
Thomson GR, Vosloo W, Bastos ADS: Foot and mouth disease in wildlife. Virus Research. 2003, 91: 145-161. 10.1016/S0168-1702(02)00263-0.
Anderson EC, Doughty WJ, Anderson J, Paling R: The pathogenesis of foot-and-mouth disease in the African buffalo (Syncerus caffer) and the role of this species in the epidemiology of the disease in Kenya. Journal of Comparative Pathology. 1979, 89: 511-519. 10.1016/0021-9975(79)90045-8.
Ayebazibwe C, Mwiine FN, Balinda SN, Tjørnehøj K, Masembe C, Muwanika VB, Okurut ARA, Siegismund HR, Alexandersen S: Antibodies against foot-and-mouth disease (FMD) virus in African buffalos (Syncerus caffer) in selected national parks in Uganda (2001-2003). Transboundary and Emerging Diseases. 2010, 57: 286-292.
Bronsvoort BMDC, Parida S, Handel I, McFarland S, Fleming L, Hamblin P, Kock R: Serological survey for foot-and-mouth disease virus in wildlife in eastern Africa and estimation of test parameters of a nonstructural protein enzyme-linked immunosorbent assay for buffalo. Clinical and Vaccine Immunology. 2008, 15 (6): 1003-1011. 10.1128/CVI.00409-07.
Sahle M, Dwarka RM, Venter EH, Vosloo W: Comparison of SAT-1 foot-and-mouth disease virus isolates obtained from East Africa between 1971 and 2000 with viruses from the rest of sub-Saharan Africa. Archives of Virology. 2007, 152: 797-804. 10.1007/s00705-006-0893-x.
Bastos ADS, Haydon DT, Forsberg R, Knowles NJ, Anderson EC, Bengis RG, Nel LH, Thomson GR: Genetic heterogeneity of SAT-1 type foot-and-mouth disease viruses in southern Africa. Archives of Virology. 2001, 146: 1537-1551. 10.1007/s007050170077.
Schierup MH, Hein J: Consequences of recombination on traditional phylogenetic analysis. Genetics. 2000, 156: 879-891.
Moya A, Holmes EC, González-Candelas F: The population genetics and evolutionary epidemiology of RNA viruses. Nature Reviews | Microbiology. 2004, 2: 279-288.
Heath L, van der Walt E, Varsani A, Martin DP: Recombination patterns in aphthoviruses mirror those found in other picornaviruses. Journal of Virology. 2006, 80 (23): 11827-11832. 10.1128/JVI.01100-06.
Lemey P, Rambaut A, Drummond AJ, Suchard MA: Bayesian phylogeny finds its roots. PLoS Computational Biology. 2009, 5 (9): 1-16. 10.1371/journal.pcbi.1000520.
Ayebazibwe C, Mwine FN, Tjørnehøj K, Balinda SN, Muwanika VB, Okurut AAR, Belsham GJ, Normann P, Siegismund HR, Alexandersen S: The role of African buffalos (Syncerus caffer) in the maintenance of foot-and-mouth disease in Uganda. BioMed Central, Veterinary Research.
Tully DC, Fares MA: The tale of a modern animal plague: tracing the evolutionary history and determining the time-scale for foot and mouth disease virus. Virology. 2008, 382: 250-256. 10.1016/j.virol.2008.09.011.
Ho SYW, Phillips MJ, Cooper A, Drummond AJ: Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Molecular Biology and Evolution. 2005, 22 (7): 1561-1568-10.1093/molbev/msi145.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Molecular Biology and Evolution. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.
Haydon DT, Bastos AD, Knowles NJ, Samuel AR: Evidence for positive selection in foot-and-mouth disease virus capsid genes from field isolates. Genetics. 2001, 157: 7-15.
Carrillo C, Tulman ER, Delhon G, Lu Z, Carreno A, Vagnozzi A, Kutish GF, Rock DL: Comparative genomics of foot-and-mouth disease virus. Journal of Virology. 2005, 79 (10): 6487-6504. 10.1128/JVI.79.10.6487-6504.2005.
Tully DC, Fares MA: Shifts in the selection-drift balance drive the evolution and epidemiology of foot-and-mouth disease virus. Journal of Virology. 2009, 83 (2): 781-790. 10.1128/JVI.01500-08.
Jackson AL, O'Neill H, Maree F, Blignaut B, Carrillo C, Rodriguez L, Haydon DT: Mosaic structure of foot-and-mouth disease virus genomes. Journal of General Virology. 2007, 88: 487-492. 10.1099/vir.0.82555-0.
Sangula AK, Siegismund HR, Belsham GJ, Balinda SN, Masembe C, Muwanika VB: Low diversity of foot-and-mouth disease serotype C virus in Kenya: evidence for probable vaccine strain re-introductions in the field. Epidemiology and Infection. 2010, Available on Cambridge Journals Online, 25 March 2010
Knowles NJ, Samuel AR: Polymerase chain reaction amplification and cycle sequencing of the 1 D gene of foot-and-mouth disease viruses. Session of the research group of the standing technical committee of the European commission for the control of foot-and-mouth disease. 19-22 September 1994. 1995, Vienna, Austria: FAO, Rome
Drummond AJ, Ashton B, Cheung M, Heled J, Kearse M, Moir R, Stones-Havas S, Thierer T, Wilson A: Geneious v4.6. 2009, [http://www.geneious.com/]
Nylander JAA: MrModeltest v2. Program distributed by the author. 2004, Evolutionary Biology Centre, Uppsala University
Swofford DL: PAUP*. Phylogenetic analysis using parsimony (*and other methods). Version4. 2003, Sunderland, Massachusetts: Sinauer Associates, 4
Rodriguez F, Oliver JL, Marfn A, Medina JR: The general stochastic model of nucleotide substitution. Journal of Theoretical Biology. 1990, 142: 485-501. 10.1016/S0022-5193(05)80104-3.
Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evolutionary Biology. 2007, 7: 10.1186/1471-2148-7-214.
Suchard MA, Weiss RE, Sinsheimer JS: Bayesian selection of continuous-time markov chain evolutionary models. Molecular Biology and Evolution. 2001, 18 (16): 1001-1013.
Hasegawa M, Kishino H, Yano T: Dating of human-ape splitting by a molecular clock of mitochondrial DNA. Journal of Molecular Evolution. 22: 160-174. 10.1007/BF02101694.
Pond SLK, Frost SDW: A genetic algorithm approach to detecting lineage-specific variation in selection pressure. Molecular Biology and Evolution. 2005, 22 (3): 478-485. 10.1093/molbev/msi031.
Pond SLK, Posada D, Gravenor MB, Woelk CH, Frost SDW: Automated phylogenetic detection of recombination using a genetic algorithm. Molecular Biology and Evolution. 2006, 23 (10): 1891-1901. 10.1093/molbev/msl051.
Kishino H, Hasegawa M: Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in Hominodea. Journal of Molecular Evolution. 1989, 29: 170-179. 10.1007/BF02100115.
Martin DP, Williamson C, Posada D: RDP2: recombination detection and analysis from sequence alignments. Bioinformatics. 2005, 21 (2): 260-262. 10.1093/bioinformatics/bth490.
Martin D, Rybicki E: RDP: detection of recombination amongst aligned sequences. Bioinformatics. 2000, 16 (6): 562-563. 10.1093/bioinformatics/16.6.562.
Padidam M, Sawyer S, Fauquet CM: Possible emergence of new geminiviruses by frequent recombination. Virology. 1999, 265: 218-225. 10.1006/viro.1999.0056.
Salminen MO, Carr JK, Burke DS, McCutchan FE: Identification of breakpoints in intergenotypic recombinants of HIV type 1 by bootscanning. AIDS Research and Human Retroviruses. 1995, 11: 1423-1425. 10.1089/aid.1995.11.1423.
Maynard Smith J: Analyzing the mosaic structure of genes. Journal of Molecular Evolution. 1992, 34: 126-129.
Posada D, Crandall KA: Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proceedings of the National Academy of Sciences, USA. 2001, 98: 13757-13762. 10.1073/pnas.241370698.
We sincerely thank the Director of Veterinary Services, Kenya, for providing the virus isolates used in the study and Dr Sabenzia Wekesa of the FMD laboratory, Embakasi in particular for the information on the isolates and comments. Teresa Kenduiywo, William Birgen and Eugene Arinaitwe are particularly appreciated for excellent technical assistance. We thank anonymous reviewers for helpful comments on an earlier version of the manuscript. Part of this work was carried out by using the resources of the Computational Biology Service Unit from Cornell University which is partially funded by Microsoft Corporation. This work was supported by the Danish International Development Agency (DANIDA) under the Livestock-Wildlife Diseases in East Africa Project.
AKS, GJB, VM and HRS designed and conceived the study. AKS, SNB and CM generated, collected and aligned the sequences. AKS, RH, SNB and HRS carried out the analysis of the data. AKS, GJB, RH, VM and HRS wrote the paper. All authors read and approved the final manuscript.