Geminiviruses: a tale of a plasmid becoming a virus
© Krupovic et al; licensee BioMed Central Ltd. 2009
Received: 02 January 2009
Accepted: 21 May 2009
Published: 21 May 2009
Geminiviruses (family Geminiviridae) are small single-stranded (ss) DNA viruses infecting plants. Their virion morphology is unique in the known viral world – two incomplete T = 1 icosahedra are joined together to form twinned particles. Geminiviruses utilize a rolling-circle mode to replicate their genomes. A limited sequence similarity between the three conserved motifs of the rolling-circle replication initiation proteins (RCR Reps) of geminiviruses and plasmids of Gram-positive bacteria allowed Koonin and Ilyina to propose that geminiviruses descend from bacterial replicons.
Phylogenetic and clustering analyses of various RCR Reps suggest that Rep proteins of geminiviruses share a most recent common ancestor with Reps encoded on plasmids of phytoplasmas, parasitic wall-less bacteria replicating both in plant and insect cells and therefore occupying a common ecological niche with geminiviruses. Capsid protein of Satellite tobacco necrosis virus was found to be the best template for homology-based structural modeling of the geminiviral capsid protein. Good stereochemical quality of the generated models indicates that the geminiviral capsid protein shares the same structural fold, the viral jelly-roll, with the vast majority of icosahedral plant-infecting ssRNA viruses.
We propose a plasmid-to-virus transition scenario, where a phytoplasmal plasmid acquired a capsid-coding gene from a plant RNA virus to give rise to the ancestor of geminiviruses.
The origin(s) of viruses is a longstanding but yet unresolved question in biology. Several hypotheses were put forward in efforts to understand this enigma (reviewed in ). According to the "Virus-first" hypothesis, viruses emerged in the prebiotic world, just before or in parallel with cellular organisms [2, 3]. The "Reduction" hypothesis states that viruses evolved by reduction from free-living ancient cellular lineages , while the alternative "Escape" hypothesis suggests that viruses originated from cellular genomic fragments that became free of their cellular environment . Irrespective of which of the viral origin hypotheses is considered, these converge in the appreciation of the extreme antiquity of viruses, with origin(s) possibly predating the emergence of the last universal common ancestor (LUCA) of cellular organisms. The ancient origin of viruses is inferred not only from bioinformatic investigations  but, perhaps more convincingly, from the recent flow of structural information on a number of individual viral proteins as well as entire virions. Structural comparison of viruses infecting hosts from all three domains of life (Bacteria, Archaea, and Eukarya) revealed that certain viruses utilize very similar assembly principles and can be grouped accordingly into structure-based viral lineages [6, 7]. The viral lineage hypothesis predicts that viruses existed at the time of (or even before) LUCA and their diversification into bacterial, archaeal and eukaryotic viruses was associated with the emergence of the three cellular domains. But do all virus families come from the dawn of life or can we still witness the more recent emergence of new viral families?
Plasmids comprise another group of parasitic genetic elements that inhabit cells in all three domains of life. Resemblance of plasmids to DNA viruses is apparent, especially when DNA replication strategies are considered . Nevertheless, evolutionary relationships between these two groups are far from being understood. Obviously, the main (and in some cases the only) difference is the presence of the capsid protein-coding gene in the viral genome. For example, there are a number of cryptic plasmids that encode a single protein responsible for DNA replication, while some small viruses of the Circoviridae family bear only two genes [8, 9], one for genome replication and the other one for capsid formation. Members of another virus family, Nanoviridae, contain multipartite genomes where each genomic segment contains a single gene and is packed into a separate isometric capsid . For example, Faba bean necrotic yellows virus contains up to eleven chromosomes . Of special interest are plant-infecting satellite RNA viruses, such as Satellite tobacco necrosis virus (STNV), that encode a single capsid protein and depend on helper viruses for genome replication. It is thus reasonable to assume that acquisition of a capsid gene by a plasmid or, vice versa, loss of a capsid gene by a virus will result in the transition from a plasmid to a genuine virus or from a virus to a plasmid, respectively. This hypothesis should be testable by scrupulous analysis of replication and capsid protein sequences and/or structures.
Results and Discussion
Geminiviruses are plant pathogens and due to their agricultural importance, a great number of sequences from geminiviral isolates has been determined and deposited into databases. We generated a specific sequence pattern to select from the non-redundant BLAST database (including environmental protein sequences) all 1072 protein sequences sharing conserved motifs with Rep proteins of geminiviruses. Many of these sequences are almost identical; therefore, in order to avoid redundancy, the initial dataset was filtered to leave only sequences that are less than 70% identical to each other. After subsequent manual examination, the final dataset contained 40 sequences (see Methods for data collection details). Nineteen of these belonged to geminiviruses, while the rest were from a marine metagenome project (6 sequences), circoviruses (6 sequences), phytoplasmal plasmids (5 sequences), plasmid of Porphyra pulchra (1 sequence), nanovirus (1 sequence), Bifidobacterium catenulatum DSM 16992 (1 sequence), and Nicotiana tabacum (1 sequence). Interestingly, the latter sequence was previously concluded to originate from integration of geminiviral DNA into the plant chromosome . Nanoviruses and circoviruses are small icosahedral viruses with ssDNA genomes. While nanoviruses infect plants, circoviruses replicate in mammalian or avian cells. Bifidobacteria are gram-positive bacteria residing in the gastrointestinal tract of humans and other warm-blooded animals. Interestingly, Rep from B. catenulatum DSM 16992 is homologous to a Rep of the Bifidobacterium pseudocatenulatum plasmid p4M [GenBank:AAM00235], which has been previously observed to be similar to Reps of circoviruses . Phytoplasmas are parasitic bacteria infecting the phloem tissue of plants. Phytoplasmas belong to the class of Mollicutes, which encompasses small pleiomorphic wall-less bacteria, also including mycoplasmas, ureaplasmas, spiroplasmas and acholeplasmas . Phytoplasmas are transmitted by insects that feed on the phloem of infected plants [21, 22]. It should be noted that geminivirus-related bacterial RCR Reps, other than those from phytoplasmal plasmids and B. catenulatum DSM 16992, could not be identified neither by BLAST searches, nor by geminivirus-specific pattern searches (see Methods). Since reasonable sequence conservation is a prerequisite for robust phylogenetic analysis, we did not incorporate RCR Rep sequences from other origins into our dataset.
When Rep proteins of phytoplasmal plasmids were searched for homologues using PSI-BLAST  against bacterial and viral databases at NCBI, only Rep protein sequences of other phytoplasmal plasmids or geminiviruses were identified with significant scores. This suggests that other bacterial RCR Rep proteins share much less similarity with phytoplasmal Reps than those of geminiviruses. Indeed, sequences of bacterial plasmid Reps identified using pattern searches by Koonin and Ilyina (1992) share only three of the five motifs characteristic to geminiviral Reps [15, 17]. Also, there is no significant sequence similarity, other than the three shared motifs, between RCR Reps of bacterial plasmids (other than phytoplasmal plasmids) and geminiviruses. For example, BLAST searches against geminiviral protein sequences at NCBI using as seeds Rep sequences of plasmids pMV158 [GenBank:YP_001586272] and pUB110 [GenBank:CAA27141], the two plasmids whose Reps were found to be the closest to geminiviral Reps , returned no positive hits. Our analysis identifies Reps of phytoplasmal plasmids as the most similar sequences to geminiviral Reps from currently available public protein sequence databases. This observation suggests that geminiviral Reps share a more recent common ancestor with phytoplasmal plasmids than they do with other viral or plasmid RCR Reps.
Interestingly, phytoplasmas and geminiviruses are both obligate parasites occupying a common ecological niche – phloem tissue of plants, which consists of parenchyma cells, sieve-tube cells, and companion cells. Phytoplasmas have been observed in companion cells and phloem parenchyma cells as well as in sieve elements . The same types of cells were shown to contain geminiviral DNA when Nicotiana benthamiana and Lycopersicon esculentum were infected with Tomato yellow leaf curl Sardinia virus and/or Tomato yellow leaf curl virus . It should be noted, however, that not all geminiviruses are phloem-limited . Furthermore, both geminiviruses and phytoplasmas share at least one common insect vector (leafhoppers) that is essential for transmission between plants [21, 27]. It is conceivable that extrachromosomal replicons of phytoplasmas evolved by acquisition of the capsid-coding gene to give rise to geminiviruses.
Next, we superimposed the structural models of the STNV and geminiviral CPs and extracted the structure-based sequence alignment (Fig. 4C). Of the 184 STNV CP amino acid residues for which structural information is available [PDB:2buk], 69.1% had corresponding amino acids in at least one of the four geminiviral CP sequences (75 identical and 52 similar residues) (Fig. 4C). Given the fact that all geminiviral CPs are true homologues, our observation indicates that STNV and geminiviral CPs share not only tertiary but also significantly similar primary structures which further justifies the suggested relationship between these viral CPs. It is obvious from Fig. 4 that secondary structure elements are well conserved and that insertions in the loop regions between β-sheets account for the larger size of geminiviral CPs. The most prominent insertions are observed in the CP of mastrevirus (between βB and βC, and between βF and βG) and begomovirus (between βC and βD, and between βD and βE). The βD/βE loop was identified as essential for controlling whitefly transmission of begomoviruses , whereas the βF/βG loop was proposed to be required for leafhopper transmission .
It is notable that the eight stranded β-barrel fold is characteristic to all icosahedral ssRNA plant and animal viruses  as well as to ssDNA viruses of the Microviridae and Parvoviridae families . Previously, twinned particles of two geminiviruses, Maize streak virus (MSV; Mastrevirus) and African cassava mosaic virus (ACMV; Begomovirus), were resolved using electron cryo-microscopy (cryo-EM) and image reconstruction techniques to 25 Å  and 16–19 Å  resolution, respectively. In both studies the CP of STNV was also found to be the best template for structural modeling of the geminiviral CPs. Successful fitting of the pseudo-atomic model of MSV CP into the cryo-EM density map  strongly corroborates the prediction that CPs of STNV and geminiviruses share the same fold.
All these observations suggest a possible scenario for the origin of geminiviruses. Phylogenetic and clustering analyses of the geminiviral Rep proteins (Figs. 2, 3) indicate that they share a more recent common ancestor with Reps of plasmids from phytoplasmas rather than from other bacteria or viruses. There are two possible ways to explain this relationship. One is that a phytoplasmal cell, while being inside the plant cell, internalized the genome of a geminivirus-like agent, replication and partitioning of which was subsequently stabilized along with the loss of a CP-coding gene. The other possibility is that phytoplasmal plasmids released upon lysis of the bacterial cell in the cytoplasm of the host plant cell were able to obtain a capsid-coding gene from an unknown plant virus. The former possibility seems unlikely since some geminiviruses not only maintained features of prokaryotic replicons, such as typical bacterial promoter sequences , but what is more surprising, are in some instances still able to replicate their DNA in bacterial cells [37, 38]. We were unable to identify any other proteins in addition to RCR Reps common to both, phytoplasmal plasmids and geminiviruses. However, this is not surprising, since protein content required for successful persistence inside bacterial (for plasmids) and plant (for geminiviruses) cells is likely to be different. Furthermore, the capsid volume is a limiting factor dictating the amount of genetic information that can be packaged. So, there is a strong pressure on the genome content of viruses with small capsids leading to the loss of genetic information unnecessary for virus propagation.
What virus might be a donor of a capsid-coding gene to the escaping phytoplasmal plasmid? The vast majority of plant viruses have RNA genomes. Modeling of the geminiviral CP suggests that it folds into the eight-stranded β-barrel (Fig. 4A), a fold common to all isometric ssRNA plant viruses. Notably, STNV encodes a single protein, a capsid protein, which was found to be the closest non-geminiviral relative of the geminiviral CP out of the 231 icosahedral virus capsid proteins whose X-ray structures are currently available at the PDB . STNV possesses the simplest capsid formed from 60 subunits of the CP arranged into T = 1 icosahedral lattice . Pentamers of the CP are the building blocks of the STNV particles . The same is true for geminiviruses . Geminivirus virions are composed of two incomplete icosahedra (110 copies of CP in MSV) that are joined together  (Fig. 1A). Such virion architecture is unique to geminiviruses and is not observed in any other currently known viruses. While the interior volume of the isometric particles is sufficient to pack 1,239 bp of the STNV genome, it is unable to accommodate the larger (2.5 – 3.0 kb ) genome of geminiviruses. Interestingly, it was found that the CP of geminiviruses produces not only twinned wild-type capsids but also isometric and even capsids formed of three incomplete icosahedra (Fig. 1) [40–42]. The valency of the capsid apparently correlates with the length of the packed nucleic acid. It has been shown that noninfectious isometric T = 1 MSV particles contain subgenomic MSV DNA fragments from about 0.2 kb to nearly half of the wild-type genome . Such heterogeneity in particle size and production of noninfectious particles per se might be seen as an indication of ongoing optimization and adaptation of the CP, which was originally utilized to form smaller (isometric) particles, to build larger capsids. Taking into account the high nucleotide substitution rate in geminiviruses, which is similar to that of RNA viruses , the sequence conservation between STNV and geminiviral CPs as well as between phytoplasmal plasmid and geminiviral Reps is striking. It is possible that the emergence of the ancestor geminivirus from a phytoplasmal plasmid and an RNA virus occurred relatively recently on the evolutionary timescale. Although less likely, the possibility of the convergent evolution cannot be ruled out either.
An alternative hypothesis for the origin of geminiviruses is that they are descendants of as yet undiscovered ssDNA viruses with geminiviral-like Reps that have acquired their CP-coding genes either from an RNA or DNA virus by horizontal gene transfer. Indeed, recent metagenomic analysis of samples from a rice paddy soil unveiled the presence of putatively viral replicons with geminivirus/phytoplasma-like Reps but not other geminiviral genes . Unfortunately, metagenomic studies do not provide any information on the origin of the amplified replicons, making it impossible to know with certainty that the amplified DNA does not belong to geminiviruses or plasmids. Therefore, there is currently no evidence to support the hypothesis predicting the existence of a virus that would be a missing link between geminiviruses and other ssDNA viruses.
If geminiviruses originated from phytoplasmal plasmids, is it possible that similar transitions happened several times to give rise to different viral families? As mentioned above, RCR Rep of the Bifidobacterium pseudocatenulatum plasmid p4M [GenBank:AAM00235] was previously shown to be more similar to Reps of various circoviruses than it is to Reps from other bacterial plasmids and viruses . It is therefore tempting to speculate that circoviruses might also be direct descendants of bacterial plasmids.
Phylogenetic as well as complete linkage clustering analysis of RCR Rep proteins from geminiviruses suggests their evolutionary relationship with Rep proteins of phytoplasmal plasmids, while structural modeling of the geminiviral CP points to a connection between geminiviruses and icosahedral ssRNA viruses. We suggest a scenario for the origin of geminiviruses in which acquisition of the capsid protein-coding gene from an ssRNA plant virus by phytoplasmal plasmid gave rise to the ancestor of geminiviruses. This scenario involves two assumptions. First, there was a coinfection of the same plant cell by a phytoplasma and an ssRNA virus. Indeed, such a coinfection has been previously observed. Sugarcane phloem was found to frequently contain both phytoplasmas and Sugarcane yellow leaf viruses (an icosahedral ssRNA virus) [45, 46]. The second assumption is that recombination occurred between the RNA genome of a virus and the DNA molecule of a plasmid. Although recombination between RNA and DNA viruses is not common, there is evidence pointing to the possibility of such gene exchange in the viral world [47, 48]. The scenario proposed here implies that geminiviruses emerged in plant cells through introduction of a structural element (capsid-coding gene) of a plant virus into a plasmid liberated from a plant infecting bacterium. Although this plasmid-to-virus transition does not satisfy the requirements of de novo virogenesis, since a preexisting viral building block was utilized for virion formation, it nevertheless accounts for the emergence of a novel virus family, the Geminiviridae. Consequently, the borderline between the two selfish genetic elements – viruses and plasmids – becomes transparent.
Data collection and phylogenetic analysis
Koonin and Ilyina (1992) found that geminiviral rolling-circle replication (RCR) initiation proteins (Rep) are related to certain bacterial Reps . In order to obtain a dataset for phylogenetic analysis of geminiviral Reps we set out to get all bacterial RCR Reps from the nonredundant protein database at NCBI using PSI-BLAST searches (BLOSUM62 matrix, 0.05 as an E-value cutoff) . Surprisingly, only RCR Reps from phytoplasmal plasmids were identified using this approach. To extend the dataset, we carried out an alternative approach, pattern matching. Rolling circle replication proteins of geminiviruses contain five conserved motifs that are essential for the activity [13–16]. Based on this knowledge, an exact geminivirus-specific sequence pattern, encompassing all the five conserved motifs, was generated: F(T [LI]/[LM]T) [YN]X(1,100)HX [HQ]X(1,100)YXXKX(50,200)GXXXXGK [ST]X(1,100)DD. The residues shown in square brackets are alternatives; X – any amino acid; numbers in parentheses denote the allowed distance between corresponding motifs; slash sign indicates alternation of the dipeptides in the second and third positions in the pattern. The non-redundant protein sequences and environmental protein sequences from BLAST database were downloaded (07.02.2009) from NCBI FTP site and searched for sequences exactly matching the derived pattern without paying attention to the sequences surrounding the conserved motifs (as long as their length falls in the range specified in the pattern). Using this approach sequences missed by BLAST searches are expected to be found. 1072 protein sequences were initially extracted. In order to avoid redundancy, the original dataset was subsequently filtered to leave only sequences with less than 70% identity. As a result, a dataset containing 43 protein sequences was obtained. Of these two sequences were false-positive – a 799 amino acid-long hypothetical protein [GenBank:XP_001614627] from Plasmodium vivax SaI-1 and a 440 amino acid-long hypothetical TrmE domain protein GOS_1133298 [GenBank:EDE42344] from marine metagenome project, which were not included in the further analysis. The resultant dataset (41 sequences) was used to create a multiple sequence alignment using CLUSTALW . One geminiviral sequence [GenBank:ABD67440] was found to be considerably longer (469 aa) than the rest of the sequences. The protein was found to be a fusion of RCR Rep and geminiviral transcriptional activator AC2 and was therefore removed from the alignment. The 40 sequences were realigned and following manual examination and editing the subsequent alignment [see Additional file 1] was utilized for phylogenetic analysis. Maximum likelihood analysis was carried out by using PhyML v2.4.4 , with a WAG  model of amino acid substitution, including a gamma law with 4 categories to take into account differences in evolutionary rates at sites, and an estimated proportion of invariable sites. The robustness of the tree was assessed by bootstrap analysis (1,000 replicates). Bayesian phylogenetic tree was constructed using MrBayes  with a mixed model of amino-acid substitution and a Gamma-law (eight discrete classes). MrBayes was run with four chains for 2.1 × 106 generations and trees were sampled every 100 generations. To construct the consensus tree, the first 25% of the trees were discarded as "burnin".
Complete linkage clustering analysis
Multiple sequence alignment [see Additional file 1] was used to calculate the pairwise distance matrix with MEGA4 . Analyses were conducted using the Poisson correction method. All positions containing gaps and missing data were eliminated from the dataset (Complete deletion option). There were a total of 178 positions in the final dataset. The calculated pairwise distances were used to perform complete linkage clustering analysis, where the distance between two clusters is defined as the distance between the two farthest objects in the two clusters. At each round the clusters are examined and split to two clusters according to the longest distance. The members of the clusters were then grouped within the new cluster that has a shorter distance. The clustering was run until all sequences formed their own clusters.
BioInfoBank MetaServer  was used for prediction of the tertiary structures. The structure of STNV capsid protein (CP)  was determined to be the best template for structural modeling with significance scores ranging from 57.67 – 82.50; scores above 50 are assumed to be significant and correspond to a prediction accuracy of above 90% . The sequences of the geminiviral CPs were individually aligned with the corresponding protein sequence of STNV using version 9.2 of the MODELLER program . Align2d algorithm of the MODELLER program is different from standard sequence-sequence alignment methods because it takes into account structural information from the template when constructing an alignment. This task is achieved through a variable gap penalty function that tends to place gaps in solvent exposed and curved regions, outside secondary structure segments, and between two positions that are close in space. The resulting alignments were utilized to build the three-dimensional models of the four geminiviral CPs using the MODELLER. Ten variants of each CP were generated and one of them was chosen on the basis of having the best stereochemical quality, which was validated using MolProbity . The structural superpositioning of the models with the X-ray structure of the STNV CP was performed using the STAMP algorithm , and the results were visualized with the VMD program .
This work was supported by the Finnish Center of Excellence Program (2006–2011) of the Academy of Finland (Grant 1213467 and Grant 1210253 to DHB). MK is supported by the Viikki Graduate School in Biosciences.
- Forterre P: The origin of viruses and their possible roles in major evolutionary transitions. Virus Res. 2006, 117 (1): 5-16. 10.1016/j.virusres.2006.01.010.View ArticlePubMed
- Koonin EV, Senkevich TG, Dolja VV: The ancient Virus World and evolution of cells. Biol Direct. 2006, 1: 29-10.1186/1745-6150-1-29.PubMed CentralView ArticlePubMed
- Zillig W, Arnold HP, Holz I, Prangishvili D, Schweier A, Stedman K, She Q, Phan H, Garrett R, Kristjansson JK: Genetic elements in the extremely thermophilic archaeon Sulfolobus. Extremophiles. 1998, 2 (3): 131-140. 10.1007/s007920050052.View ArticlePubMed
- Forterre P: The two ages of the RNA world, and the transition to the DNA world: a story of viruses and cells. Biochimie. 2005, 87 (9–10): 793-803. 10.1016/j.biochi.2005.03.015.View ArticlePubMed
- Hendrix RW, Lawrence JG, Hatfull GF, Casjens S: The origins and ongoing evolution of viruses. Trends Microbiol. 2000, 8 (11): 504-508. 10.1016/S0966-842X(00)01863-1.View ArticlePubMed
- Bamford DH, Grimes JM, Stuart DI: What does structure tell us about virus evolution?. Curr Opin Struct Biol. 2005, 15 (6): 655-663. 10.1016/j.sbi.2005.10.012.View ArticlePubMed
- Krupovic M, Bamford DH: Virus evolution: how far does the double beta-barrel viral lineage extend?. Nat Rev Microbiol. 2008, 6 (12): 941-948. 10.1038/nrmicro2033.View ArticlePubMed
- Biagini P, Gallian P, Attoui H, Touinssi M, Cantaloube J, de Micco P, de Lamballerie X: Genetic analysis of full-length genomes and subgenomic sequences of TT virus-like mini virus human isolates. J Gen Virol. 2001, 82 (Pt 2): 379-383.View ArticlePubMed
- Biagini P: Human circoviruses. Vet Microbiol. 2004, 98 (2): 95-101. 10.1016/j.vetmic.2003.10.004.View ArticlePubMed
- Gronenborn B: Nanoviruses: genome organisation and protein function. Vet Microbiol. 2004, 98 (2): 103-109. 10.1016/j.vetmic.2003.10.015.View ArticlePubMed
- Timchenko T, de Kouchkovsky F, Katul L, David C, Vetten HJ, Gronenborn B: A single Rep protein initiates replication of multiple genome components of Faba bean necrotic yellows virus, a single-stranded DNA virus of plants. J Virol. 1999, 73 (12): 10173-10182.PubMed CentralPubMed
- Stanley J, Bisaro DM, Briddon RW, Brown JK, Fauquet CM, Harrison BD, Rybicki EP, Stenger DC: Virus Taxonomy: VIIIth Report of the International Committee on Taxonomy of Viruses. Edited by: Fauquet CM, Mayo MA, Maniloff J, Desselberger U, Ball LA. 2005, London: Elsevier/Academic Press
- Ilyina TV, Koonin EV: Conserved sequence motifs in the initiator proteins for rolling circle DNA replication encoded by diverse replicons from eubacteria, eucaryotes and archaebacteria. Nucleic Acids Res. 1992, 20 (13): 3279-3285. 10.1093/nar/20.13.3279.PubMed CentralView ArticlePubMed
- Gorbalenya AE, Koonin EV, Wolf YI: A new superfamily of putative NTP-binding domains encoded by genomes of small DNA and RNA viruses. FEBS Lett. 1990, 262 (1): 145-148. 10.1016/0014-5793(90)80175-I.View ArticlePubMed
- Vadivukarasi T, Girish KR, Usha R: Sequence and recombination analyses of the geminivirus replication initiator protein. J Biosci. 2007, 32 (1): 17-29. 10.1007/s12038-007-0003-6.View ArticlePubMed
- Desbiez C, David C, Mettouchi A, Laufs J, Gronenborn B: Rep protein of Tomato yellow leaf curl geminivirus has an ATPase activity required for viral DNA replication. Proc Natl Acad Sci USA. 1995, 92 (12): 5640-5644. 10.1073/pnas.92.12.5640.PubMed CentralView ArticlePubMed
- Koonin EV, Ilyina TV: Geminivirus replication proteins are related to prokaryotic plasmid rolling circle DNA replication initiator proteins. J Gen Virol. 1992, 73 (Pt 10): 2763-2766. 10.1099/0022-1317-73-10-2763.View ArticlePubMed
- Bejarano ER, Khashoggi A, Witty M, Lichtenstein C: Integration of multiple repeats of geminiviral DNA into the nuclear genome of tobacco during evolution. Proc Natl Acad Sci USA. 1996, 93 (2): 759-764. 10.1073/pnas.93.2.759.PubMed CentralView ArticlePubMed
- Gibbs MJ, Smeianov VV, Steele JL, Upcroft P, Efimov BA: Two families of rep-like genes that probably originated by interspecies recombination are represented in viral, plasmid, bacterial, and parasitic protozoan genomes. Mol Biol Evol. 2006, 23 (6): 1097-1100. 10.1093/molbev/msj122.View ArticlePubMed
- Razin S, Yogev D, Naot Y: Molecular biology and pathogenicity of mycoplasmas. Microbiol Mol Biol Rev. 1998, 62 (4): 1094-1156.PubMed CentralPubMed
- Christensen NM, Axelsen KB, Nicolaisen M, Schulz A: Phytoplasmas and their interactions with hosts. Trends Plant Sci. 2005, 10 (11): 526-535. 10.1016/j.tplants.2005.09.008.View ArticlePubMed
- Hogenhout SA, Oshima K, Ammar el D, Kakizawa S, Kingdom HN, Namba S: Phytoplasmas: bacteria that manipulate plants and insects. Mol Plant Pathol. 2008, 9 (4): 403-423. 10.1111/j.1364-3703.2008.00472.x.View ArticlePubMed
- Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52 (5): 696-704. 10.1080/10635150390235520.View ArticlePubMed
- Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574. 10.1093/bioinformatics/btg180.View ArticlePubMed
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMed
- Morilla G, Krenz B, Jeske H, Bejarano ER, Wege C: Tête à tête of Tomato yellow leaf curl virus and Tomato yellow leaf curl Sardinia virus in single nuclei. J Virol. 2004, 78 (19): 10715-10723. 10.1128/JVI.78.19.10715-10723.2004.PubMed CentralView ArticlePubMed
- Rojas MR, Hagen C, Lucas WJ, Gilbertson RL: Exploiting chinks in the plant's armor: evolution and emergence of geminiviruses. Annu Rev Phytopathol. 2005, 43: 361-394. 10.1146/annurev.phyto.43.040204.135939.View ArticlePubMed
- Rossmann MG, Johnson JE: Icosahedral RNA virus structure. Annu Rev Biochem. 1989, 58: 533-573. 10.1146/annurev.bi.58.070189.002533.View ArticlePubMed
- Ginalski K, Elofsson A, Fischer D, Rychlewski L: 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics. 2003, 19 (8): 1015-1018. 10.1093/bioinformatics/btg124.View ArticlePubMed
- Carrillo-Tripp M, Shepherd CM, Borelli IA, Venkataraman S, Lander G, Natarajan P, Johnson JE, Brooks CL, Reddy VS: VIPERdb2: an enhanced and web API enabled relational database for structural virology. Nucleic Acids Res. 2009, D436-442. 10.1093/nar/gkn840. 37 Database
- Jones TA, Liljas L: Structure of Satellite tobacco necrosis virus after crystallographic refinement at 2.5 A resolution. J Mol Biol. 1984, 177 (4): 735-767. 10.1016/0022-2836(84)90047-0.View ArticlePubMed
- Noris E, Vaira AM, Caciagli P, Masenga V, Gronenborn B, Accotto GP: Amino acids in the capsid protein of Tomato yellow leaf curl virus that are crucial for systemic infection, particle formation, and insect transmission. J Virol. 1998, 72 (12): 10050-10057.PubMed CentralPubMed
- Böttcher B, Unseld S, Ceulemans H, Russell RB, Jeske H: Geminate structures of African cassava mosaic virus. J Virol. 2004, 78 (13): 6758-6765. 10.1128/JVI.78.13.6758-6765.2004.PubMed CentralView ArticlePubMed
- Bennett A, McKenna R, Agbandje-McKenna M: A comparative analysis of the structural architecture of ssDNA viruses. Computational and Mathematical Methods in Medicine. 2008, 9 (3–4): 183-196. 10.1080/17486700802168247.View Article
- Zhang W, Olson NH, Baker TS, Faulkner L, Agbandje-McKenna M, Boulton MI, Davies JW, McKenna R: Structure of the Maize streak virus geminate particle. Virology. 2001, 279 (2): 471-477. 10.1006/viro.2000.0739.View ArticlePubMed
- Frischmuth T, Zimmat G, Jeske H: The nucleotide sequence of Abutilon mosaic virus reveals prokaryotic as well as eukaryotic features. Virology. 1990, 178 (2): 461-468. 10.1016/0042-6822(90)90343-P.View ArticlePubMed
- Rigden JE, Dry IB, Krake LR, Rezaian MA: Plant virus DNA replication processes in Agrobacterium: insight into the origins of geminiviruses?. Proc Natl Acad Sci USA. 1996, 93 (19): 10280-10284. 10.1073/pnas.93.19.10280.PubMed CentralView ArticlePubMed
- Selth LA, Randles JW, Rezaian MA: Agrobacterium tumefaciens supports DNA replication of diverse geminivirus types. FEBS Lett. 2002, 516 (1–3): 179-182. 10.1016/S0014-5793(02)02539-5.View ArticlePubMed
- Ban N, Larson SB, McPherson A: Structural comparison of the plant satellite viruses. Virology. 1995, 214 (2): 571-583. 10.1006/viro.1995.0068.View ArticlePubMed
- Casado CG, Javier Ortiz G, Padron E, Bean SJ, McKenna R, Agbandje-McKenna M, Boulton MI: Isolation and characterization of subgenomic DNAs encapsidated in "single" T = 1 isometric particles of Maize streak virus. Virology. 2004, 323 (1): 164-171. 10.1016/j.virol.2004.02.014.View ArticlePubMed
- Frischmuth T, Ringel M, Kocher C: The size of encapsidated single-stranded DNA determines the multiplicity of African cassava mosaic virus particles. J Gen Virol. 2001, 82 (Pt 3): 673-676.View ArticlePubMed
- Jovel J, Preiss W, Jeske H: Characterization of DNA intermediates of an arising geminivirus. Virus Res. 2007, 130 (1–2): 63-70. 10.1016/j.virusres.2007.05.018.View ArticlePubMed
- Duffy S, Holmes EC: Phylogenetic evidence for rapid rates of molecular evolution in the single-stranded DNA begomovirus Tomato yellow leaf curl virus. J Virol. 2008, 82 (2): 957-965. 10.1128/JVI.01929-07.PubMed CentralView ArticlePubMed
- Kim KH, Chang HW, Nam YD, Roh SW, Kim MS, Sung Y, Jeon CO, Oh HM, Bae JW: Amplification of uncultured single-stranded DNA viruses from rice paddy soil. Appl Environ Microbiol. 2008, 74 (19): 5975-5985. 10.1128/AEM.01275-08.PubMed CentralView ArticlePubMed
- Parmessur Y, Aljanabi S, Saumtally S, Dookun-Saumtally A: Sugarcane yellow leaf virus and sugarcane yellows phytoplasma: elimination by tissue culture. Plant Pathology. 2002, 51: 561-566. 10.1046/j.1365-3059.2002.00747.x.View Article
- Scagliusi SM, Lockhart BE: Transmission, characterization, and serology of a luteovirus associated with yellow leaf syndrome of sugarcane. Phytopathology. 2000, 90 (2): 120-124. 10.1094/PHYTO.2000.90.2.120.View ArticlePubMed
- Chappell JD, Prota AE, Dermody TS, Stehle T: Crystal structure of reovirus attachment protein sigma1 reveals evolutionary relationship to adenovirus fiber. EMBO J. 2002, 21 (1–2): 1-11. 10.1093/emboj/21.1.1.PubMed CentralView ArticlePubMed
- Morse MA, Marriott AC, Nuttall PA: The glycoprotein of Thogoto virus (a tick-borne orthomyxo-like virus) is related to the baculovirus glycoprotein GP64. Virology. 1992, 186 (2): 640-646. 10.1016/0042-6822(92)90030-S.View ArticlePubMed
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-4680. 10.1093/nar/22.22.4673.PubMed CentralView ArticlePubMed
- Whelan S, Goldman N: A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001, 18 (5): 691-699.View ArticlePubMed
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.View ArticlePubMed
- Marti-Renom MA, Stuart AC, Fiser A, Sanchez R, Melo F, Sali A: Comparative protein structure modeling of genes and genomes. Annu Rev Biophys Biomol Struct. 2000, 29: 291-325. 10.1146/annurev.biophys.29.1.291.View ArticlePubMed
- Lovell SC, Davis IW, Arendall WB, de Bakker PI, Word JM, Prisant MG, Richardson JS, Richardson DC: Structure validation by Calpha geometry: phi, psi and Cbeta deviation. Proteins. 2003, 50 (3): 437-450. 10.1002/prot.10286.View ArticlePubMed
- Russell RB, Barton GJ: Multiple protein sequence alignment from tertiary structure comparison: assignment of global and residue confidence levels. Proteins. 1992, 14 (2): 309-323. 10.1002/prot.340140216.View ArticlePubMed
- Humphrey W, Dalke A, Schulten K: VMD: visual molecular dynamics. J Mol Graph. 1996, 14 (1): 33-38. 10.1016/0263-7855(96)00018-5.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.