- Research article
- Open Access
Evolutionary genomics of mycovirus-related dsRNA viruses reveals cross-family horizontal gene transfer and evolution of diverse viral lineages
BMC Evolutionary Biologyvolume 12, Article number: 91 (2012)
Double-stranded (ds) RNA fungal viruses are typically isometric single-shelled particles that are classified into three families, Totiviridae, Partitiviridae and Chrysoviridae, the members of which possess monopartite, bipartite and quadripartite genomes, respectively. Recent findings revealed that mycovirus-related dsRNA viruses are more diverse than previously recognized. Although an increasing number of viral complete genomic sequences have become available, the evolution of these diverse dsRNA viruses remains to be clarified. This is particularly so since there is little evidence for horizontal gene transfer (HGT) among dsRNA viruses.
In this study, we report the molecular properties of two novel dsRNA mycoviruses that were isolated from a field strain of Sclerotinia sclerotiorum, Sunf-M: one is a large monopartite virus representing a distinct evolutionary lineage of dsRNA viruses; the other is a new member of the family Partitiviridae. Comprehensive phylogenetic analysis and genome comparison revealed that there are at least ten monopartite, three bipartite, one tripartite and three quadripartite lineages in the known dsRNA mycoviruses and that the multipartite lineages have possibly evolved from different monopartite dsRNA viruses. Moreover, we found that homologs of the S7 Domain, characteristic of members of the genus phytoreovirus in family Reoviridae are widely distributed in diverse dsRNA viral lineages, including chrysoviruses, endornaviruses and some unclassified dsRNA mycoviruses. We further provided evidence that multiple HGT events may have occurred among these dsRNA viruses from different families.
Our study provides an insight into the phylogeny and evolution of mycovirus-related dsRNA viruses and reveals that the occurrence of HGT between different virus species and the development of multipartite genomes during evolution are important macroevolutionary mechanisms in dsRNA viruses.
Mycoviruses (fungal viruses) are widespread in all major fungal groups and most of these cause little or no obvious symptoms in their fungal hosts. Current studies on mycoviruses are mainly focused on those of economically important fungi that are typically phytopathogenic fungi. These studies aim to develop mycoviruses as biocontrol agents to combat fungal diseases and exploit them as tools to explore the physiology of their fungal hosts [1, 2]. The recent discoveries of two novel mycoviruses from the white mold fungus Sclerotinia sclerotiorum, one RNA virus related to rubi-like viruses of positive-strand RNA viruses  and another circular single-stranded (ss) DNA virus related to plant geminiviruses , have potentially significant implications for understanding the origin and evolution of related viral lineages. Given that mycoviruses represent essential evolutionary lineage of viruses from one of three kingdoms (plants, fungi and animals), the discovery of novel mycoviruses may not only expand our knowledge of viral diversity and global ecology, but also helps shed light on the evolutionary relationships of viruses.
The majority of known mycoviruses has rigid particles and double-stranded (ds) RNA genomes and are now classified, based on the number of genome segments, into four families: Chrysoviridae, Partitiviridae, Reoviridae, and Totiviridae . Members of family Totiviridae have monopartite dsRNA genomes coding for a capsid protein (CP) and an RNA-dependent RNA polymerase (RdRp). The genomes of members in the families, Partitiviridae, Chrysoviridae and Reoviridae comprise two, four and eleven/twelve segments, respectively. In addition, viruses in the family Endornaviridae, which lack true virions, are currently classified as dsRNA viruses by ICTV, although there is some evidence to suggest that they have evolved from alpha-like viruses of positive strand RNA viruses. Beside fungi, viruses in these families also infect other organisms.
More recently, several monopartite dsRNA viruses with evolutionary links between totiviruses and partitiviruses were isolated from plants [6–8]. Furthermore, a novel bipartite dsRNA mycovirus phylogenetically distantly related to totiviruses and chrysoviruses was reported from the white root rot fungus Rosellinia necatrix . These viruses represent new evolutionary lineages of dsRNA viruses, implying that dsRNA viruses are more diverse than previously recognized. Although an increasing number of complete genomic sequences of viruses have been reported, the evolution of these diverse dsRNA viruses remains to be clarified.
Horizontal gene transfer (HGT) between different viruses is important in virus evolution. The occurrence of HGT is dependent on recombination or reassortment and is capable of generating impressive genetic diversity. While HGT is commonly found in retroviruses, DNA viruses and positive-sense RNA viruses [10–16], it has only rarely been shown to occur in negative-sense RNA viruses and dsRNA viruses [17, 18]. In dsRNA viruses, there are only some sporadic reports of intra-species HGT events that occur in rotaviruses [19–21] and birnaviruses . To date no evidence has been reported of inter-species HGT in dsRNA viruses.
Here, we carried out molecular cloning and complete genomic sequencing of two novel dsRNA viruses from S. sclerotiorum strain Sunf-M, one is monopartite and the other is bipartite. We then performed genome sequence comparisons and phylogenetic analyses involving these two viruses and other related known dsRNA viruses in order to elucidate the phylogenetic relationships and evolution of relevant dsRNA viruses. Moreover, we provided evidence based on sequence comparison and phylogenetic analysis that cross-family HGT events may have occurred between dsRNA viruses from different families.
Results and discussion
Discovery and complete genomic sequencing of dsRNA in S. Sclerotiorumsunf-M
S. sclerotium strain Sunf-M, which was originally isolated from a sclerotium on a diseased sunflower plant (Helianthus annuus), was a normal wild-type strain in colony morphology and virulence. Agarose gel electrophoresis of dsRNA isolated from mycelial extracts of strain Sunf-M revealed the presence of three dsRNA bands, termed L-, M- and S-dsRNA respectively (Figure 1A). The largest L-dsRNA was generally more abundant and migrated slightly slower than the M-dsRNA.
The L-dsRNA was purified from agarose gel with a gel extraction kit, and then subjected to cDNA synthesis, PCR amplification, cloning and sequencing as described before . Computer-assisted sequence assembly showed that the full-length of L-dsRNA cDNA is 9124 bp in length that lacked a poly (A) tail at its 3′-end. Northern blot hybridization confirmed that the sequence was derived from the L-dsRNA (Figure 1B).
Molecular cloning and sequencing of the smallest S-dsRNA was also carried out and the complete nucleotide sequence was determined. Sequencing and Northern hybridization analysis revealed that the S-dsRNA band was actually a doublet consisting of two co-migrating dsRNA segments and thereby the resulting S-dsRNA band was designated as S-1 and S-2 dsRNA, which are 1856 bp and 1783 bp in length, excluding the poly (A) tail, respectively (Figure 1B).
Characterization of the L-dsRNA
The genomic organization of L-dsRNA revealed that it contains two large open reading frames (ORFs): ORF1 (nt 1089–5006) and ORF2 (nt 5054–9071) in different frames on the genomic plus strand. The L-dsRNA has a long 5′-UTR of 1088 bp, but relatively short 3-UTR of 48 bp (Figure 1C). ORF1 potentially encodes a 1305 amino acid (aa) protein with a predicted molecular mass of 144 kDa. A sequence search with BLASTp showed that it shares significant sequence similarity (E value of ≤ 3e-20) with only the hypothetical proteins of three unclassified dsRNA viruses: Grapevine associated totivirus-2 (GrAV-2) , Fusarium graminearum dsRNA mycovirus-3 (FgV-3)  and Phlebiopsis gigantea mycovirus dsRNA 2 (PgRV-2)  in the databases. Although a search of conserved domain database (CDD) revealed a significant match (E value of 8.09e-03) with a partial consensus sequence of the phosphoheptose isomerase (SIS_GmhA; cd05006) (Figure 1C), the function of ORF1 protein is unclear.
ORF2 potentially encodes a 1338-aa protein with a predicted molecular mass of 146.2 kDa. BLASTp searches showed that this protein was most closely related (E value of ≤ 9e-92) to the putative RdRps of the same three viruses mentioned above as well as another unclassified dsRNA virus Diplodia scrobiculata RNA virus 1 (DsRV-1; accession no. NC_013699). In addition, this protein also shares significant sequence similarity with RdRps of members of the families Totiviridae and Chrysoviridae. CDD searches and multiple protein alignment confirmed that ORF2-encoded protein contained a conserved viral RdRp domain (RdRp_4; pfam02123) with eight conserved motifs characteristic of the RdRps in dsRNA viruses (Figure 2).
Intriguingly, a search of CDD revealed that a region of L-dsRNA ORF2 downstream of the RdRp domain shared significant sequence similarity (E value of 5.96e-03) with a consensus sequence of the Phytoreo_S7 domain (pfam07236, aa 266–349) (Figure 1C). The Phytoreo_S7 domain is characteristic of a family consisting of several phytoreovirus S7 proteins known to be viral core proteins with nucleic acid binding activities. The phytoreovirus S7 domain has also been found in FgV-3 , chrysovirus Penicillium chrysogenum virus (PcV)  as well as other viruses (see below).
The ORF2 is likely expressed via a −1 frameshift mechanism as a fusion protein with ORF1-encoded protein. A heptanucleotide sequence (GAAAAAC4997-5003), located in a stretch in-frame with ORF1 stop codon region (nt 4987–5054) and upstream of ORF2 start codon, was identified as a putative shifty heptamer motif that could facilitate ribosomal frameshifting (Figure 1C). In addition, a pseudoknot structure that facilitates pausing of the translating ribosome and increasing the frequency of frameshifting [26, 27] was also found immediately downstream of the shifty heptamer (nt 5027–5114) (Figure 1D). Similar genomic organization and expression strategy are characteristic of some members of the family Totiviridae. Therefore, the results presented in this section suggest that L-dsRNA probably represented a novel dsRNA mycovirus infecting S. sclerotiorum strain Sunf-M. This virus was hence named as Sclerotinia sclerotiorum nonsegmented virus L (SsNsV-L).
Characterization of the S-dsRNA
Both S-1 (1856 bp) and S-2 dsRNA (1783 bp) contain a single ORF on their plus-stranded RNA and share conserved sequences in their 5′-UTRs (Figure 1E and Additional file 1: Figure S1). The S-1 and S-2 dsRNAs potentially encode proteins of 579 and 507 aa, respectively. BLASTp searches showed that the S-1 protein had highest sequence similarity (identity of 47%) to the putative RdRp of Flammulina velutipes isometric virus (BAH08700.1) and the S-2 protein shared highest sequence similarity (identity of 25%) with the putative CP of Rosellinia necatrix partitivirus 2 (BAK53192.1). Both of these two viruses are members of family Partitiviridae. Furthermore, CDD database searches and multiple protein alignment revealed that the S-1 protein contained the consensus motifs of partitivirus RdRps (RNA_dep_RNAP, cd01699) (Figure 2). These results suggest that S-dsRNA is a new member of family Partitiviridae infecting S. sclerotiorum Sunf-M and was named as Sclerotinia sclerotiorum partitivirus S (SsPV-S).
The diverse monopartite lineages of mycovirus-related dsRNA viruses
To elucidate the evolution of mycovirus-related dsRNA viruses including SsNsV-L and SsPV-S, we compared the genome structures and performed phylogenetic analyses for representative members of the families Chrysoviridae, Partitiviridae and Totiviridae as well as other unclassified dsRNA viruses (Figures 3 and 4, Additional file 2: Table S1, and Additional file 1: Figure S2). The analyses revealed that there are diverse evolutionary lineages in the known mycovirus-related dsRNA viruses including at least ten monopartite, three bipartite, one tripartite and three quadripartite lineages.
The monopartite dsRNA family Totiviridae is not a monophyletic group since it comprises five distant evolutionary lineages: GLV-like, IMNV-like, victorivirus-like, ScV L-like, and UmV-H1 (Figure 3 and Additional file 1: Figure S2). In addition, there were variations in genome structures for several members compared with typical members (e.g. ScV-L-A) of the family Totiviridae (Figure 4). For example, the genome of Piscine myocarditis virus AL V-708 (PMV-AL V-708) contains an extra ORF at its 3′-terminal other than the CP and RdRp ORFs  (Figure 4); this is not known to occur in other totiviruses. In addition, contrary to typical totiviruses, the RdRp ORF of Glomus sp. RF1 medium virus (GMRV-RF1) is located at the 5′ terminal half of the genome upstream of CP ORF (Figure 4).
In addition to the five totiviral lineages, the unclassified monopartite dsRNA viruses consist of five other evolutionary lineages: Amalgamaviridae, CiTV-1-like, PgRV-1-like, SsNsV-L-like, and NrV-L1, which are genetically distantly related to each other. The Southern tomato virus (STV) and three other related viruses isolated from plants [6–8, 29] clustered together and were distantly related to totiviruses. Their genomes are shorter than those of totiviruses and no typical virions are associated with them. A new family, Amalgamaviridae, has been proposed to accommodate these four viruses . Our phylogenetic analysis revealed that a yeast virus Zygosaccharomyces bailii virus Z (ZbV-Z) was also likely to be the member of this proposed family. The SsNsV-L-like lineage contained SsNsV-L and four other mycoviruses (Figure 3 and Additional file 1: Figure S2). They have similar genome structures with typical members of the family Totiviridae but their genomes (>9 kb) are much larger than those of known totiviruses (Figure 4). It has been reported that the PgRV-2 and FgV-3 possibly do not form true virions [23, 24]. It remains to be determined if SsNSV-L has a capsid. The CiTV-1-like lineage included two insect viruses and an unclassified plant virus. Despite the similarity in genome organization with members of the family Totiviridae, their genomes (~8 kb) were slightly larger than those of typical totiviruses (4.6–7.0 kb) (Figure 4) and might not assemble in conventional virions . The P. gigantean mycovirus dsRNA 1 (PgRV-1)-like lineage consists of three large monopartite dsRNA viruses and no viral particles have been reported in association with infections by PgRV-1 . A mycovirus Nectria radicicola virus L1 (NrV-L1) is most closely related to partitiviruses and has been assigned as an unclassified member of the family Partitiviridae in database. However, its genome is 6 kb in length , clearly suggesting that it is not likely to be a partitivirus. Hence, it may represent a novel group of monopartite dsRNA viruses.
The diverse bipartite lineages of mycovirus-related dsRNA viruses
The family Partitiviridae generally consists of viruses with bipartite genomes comprising two dsRNA segments. All members of this family clustered together in the phylogenetic tree and could be further separated into four clades: clade I-IV (Figure 3 and Additional file 1: Figure S3). The recently isolated mycovirus Rosellinia necatrix megabirnavirus 1 (RnMBV1), a member of the proposed family Megabirnaviridae was most closely related to the PgRV-1-like monopartite lineage (Figure 3), but it has a bipartite genome encapsidated in isometric virions. Therefore, it may represent a novel bipartite evolutionary lineage of dsRNA viruses. More recently, a novel bipartite dsRNA mycovirus Botrytis porri RNA virus 1 (BpRV1) was isolated from Botrytis porri that belongs to a separate clade distinct from those of other known RNA mycoviruses . Interestingly, the BpRV1-related bipartite dsRNA virus was also isolated from S. sclerotiorum in our lab (Liu LJ et al., unpublished data). These results indicated that there are diverse evolutionary lineages of bipartite dsRNA viruses in nature.
The tripartite lineage of mycovirus-related dsRNA viruses
Although the typical patitiviruses have two genome segments, some viruses in clade IV harbor three segments, two of which encoded CPs. These CPs formed two sister groups in the CP tree and the two CPs in each virus clustered within different groups (see Additional file 1: Figure S3B), suggesting that a possible duplication event occurred before the divergence of these viruses. Because the genomes of many viruses in the clade IV were not completely sequenced, it is not yet known whether all members of this clade possess two CP genes.
The diverse quadripartite lineages of mycovirus-related dsRNA viruses
Currently, ICTV recognizes six viruses as members or probable members in the family Chrysoviridae (Figure 3). Phylogenetic analysis revealed that other related viruses might also be members of this family (Figure 3 and Additional file 1: Figure S2). The extended family is a monophyletic group and can be further divided into two subgroups: clade I and clade II. Members of clade II have only 4 genome segments whereas members of clade I generally have more than four segments. At present, none of clade I viruses is recognized as member or probable member of the genus Chrysovirus. It is worth noting that Aspergillus mycovirus 1816 (AMV1816) has been incorrectly assigned by the GenBank as an unclassified member of the family Totiviridae in database. Four dsRNA segments have been shown to be associated with this virus . Our phylogenetic analysis also suggests that it is probably a member of Chrysoviridae.
Like members of the family Chrysoviridae Alternaria alternate dsRNA mycovirus (AaRV)  and Aspergillus mycovirus 341 (AMV-341) have four genomic dsRNA segments that are encapsidated in isometric virions. However, each genomic segment has a 3′-poly (A) tail, which is not found in chrysoviruses. In addition, the dsRNA 4 segments of these two viruses (~1.5 kb) are much shorter than those of chrysoviruses (~2.4 kb) (Figure 4). Furthermore, phylogenetic analysis revealed that these two viruses are distantly related to chrysoviruses (Figure 3). These results suggest that the two viruses are members of a novel lineage of quadripartite dsRNA viruses.
Rosellinia necatrix quadrivirus 1 (RnQV1), a novel dsRNA virus, has recently been reported from Rosellinia necatrix. Its genome encompasses four segments but all of which are much larger than those of chrysoviruses (Figure 4). Interestingly, the genome of Amasya cherry disease-associated mycovirus (ACDAV) also encompasses four segments but two of which putatively encode RdRps that are related to each other  and which cluster with RnQV1 in the phylogenetic tree (Figure 3). It has been suggested that the two RdRps of ACDAV might derive from two related viruses . Although RnQV1 and ACDAV clustered within the totiviral clade (Figure 3), the feature of multipartite genome indicated that they represented a new lineage of quadripartite dsRNA viruses. Based on the properties of RnQV1, a new family Quadriviridae has been proposed .
Possible origin of multipartite dsRNA viral lineages
From both phylogenetic trees and network, we can delineate different lineages of multipartite dsRNA viruses: AaRV-like, RnQV1-like, RnMBV1 and chrysoviruses. These are generally most closely related to certain monopartite dsRNA viruses and the monopartite viruses were branching deeply at the base of the multipartite viral lineages. These results suggest that multipartite dsRNA viruses likely evolved from different monopartite dsRNA viruses. The extra segments in monopartite dsRNA viruses could be generated through gene acquisition (e.g. RnMBV1 and chrysoviruses) and/or genome separation (e.g. RnQV1 and AaRV). In addition, phylogenetic analysis places the NrV-L1 at the base of partitiviral clade III (see Additional file 1: Figure S3B), raising the possibility that the ancestor of clade III partitiviruses possibly originated from NrV-L1-like monopartite viruses. This finding raised our awareness of the possibility that the partitiviruses in the three other clades probably also evolved from different monopartite dsRNA viruses that have yet to be discovered. Therefore, generation of multipartite genomes may be an important macroevolutionary mechanism in dsRNA viruses.
Taxonomy of mycovirus-related dsRNA viruses
In this study, genome comparisons and phylogenetic analyses revealed that at least 10 monopartite, 3 bipartite, 1 tripartite and 3 quadripartite lineages of mycovirus-related dsRNA viruses are known. Among these, some lineages have been considered as members or tentative members of the families, Totiviridae, Partitiviridae and Chrysoviridae. However, the taxonomy of viruses belonging to the other evolutionary lineages has yet to be considered. Because of differences in genome organization, particle morphology and phylogeny from members of these three families, the establishment of new virus families or new genera is warranted to accommodate the unclassified dsRNA viral lineages.
Current taxonomy of genera in the family Partitiviridae is based on viral hosts (plants or fungi). Our phylogenetic analysis, however, shown that partitiviral clade I and II consisted of a mosaic of plant partitiviruses (of genus Alphacryptovirus) and fungal partitiviruses (of genus Partitivirus). Therefore, the host taxon is not a distinguishing character and it does not reflect the true evolutionary relationships of viruses. Because classification of viruses based on phylogeny not only helps to understand the evolution of viruses but also facilitates the prediction of new virus emergence. Therefore, we propose the establishment of four new genera to reflect the four clades in family Partitiviridae.
It has been considered that dsRNA viruses are polyphyletic and have originated from different lineages of positive-strand RNA viruses [37–39]. The families Partitiviridae and Totiviridae have been suggested to belong to the picorna-like superfamily . Our study revealed that members of the family Chrysoviridae and other diverse evolutionary lineages of mycovirus-related dsRNA viruses are closely related to totiviruses and partitiviruses. Therefore, these viral lineages may also be included in this expanded superfamily, remarkably expanding the known diversity of the picorna-like viruses.
Identification of ‘phytoreo S7 domain’
The S7 protein domain is thought to be characteristic of members of the genus phytoreovirus in family Reoviridae. The finding that S7 domain homologs occur in SsNsV-L, FgV-3 and PcV raised our interest in exploring whether other viruses also have S7 domain homologs. For this purpose, we performed PSI-BLAST searches using the S7 domain sequences of these three viruses and plant phytoreoviruses as seed sequences against NCBI nr database. The results showed that homologs of S7 domain are widely distributed in diverse dsRNA viral lineages (Table 1).
The S7 domain was found in all known chrysoviruses. Interestingly, It occurs in both RdRp and p3 proteins of Clade II chrysoviruses (p3 homolog in several members is p4), but only found in the RdRps of Clade I chrysoviruses. In fact, the 5′-terminal regions of RdRps in Clade II chrysoviruses are homologous with those of their p3/p4 proteins  (Figure 5). The S7 domain is located in these 5′-terminal homologous regions. The homology between RdRp and other proteins was not found in Clade I chrysoviruses.
In addition to four members in genus phytoreovirus, we also found that the S2 protein (guanylyltransferase) of Scylla serrata reovirus SZ-2007 (SsReV-SZ) , a member of a proposed new genus in the family Reoviridae, contains an S7 domain homolog at its 3′-terminal. This domain was also identified in SsNsV-L-related mycoviruses FgV-3, DsRV-1 and PgRV-2 as well as in another unclassified monopartite dsRNA virus, PgRV-1. Moreover, S7 domain homologs were also found in the CP protein of the totivirus GMRV-RF1 and the polyproteins of three endornaviruses.
Compared with those of phytoreoviruses, the S7 domain in other viruses contains only partial sequence of the S7 domain. Multiple alignment of the S7 domain revealed that the sequences were likely to be conserved in diverse dsRNA viral lineages (Figure 6). The function of this domain in these non-phytoreoviruses is not known. It will be of interest to determine whether it plays a role in viral RNA binding and packaging.
Evolution of ‘phytoreo S7 domain’ in diverse viral lineages
Comparison of the domain arrangement in relevant viral proteins shows that the S7 domain is located downstream of the RdRp domain in SsNsV-L, FgV-3, DsRV-1 and PgRV-2. However, the S7 domain resides upstream of the RdRp domain in endornaviruses, chrysoviruses and PgRV-1 (Figure 5). In addition, this domain is also found in other proteins, such as the CP of GMRV-RF1 and the guanylyltransferase of SsReV-SZ. These results suggest that multiple recombination events may have occurred among different viral domains or proteins.
To elucidate the possible evolution of the S7 domain homologs, we constructed phylogenetic tree and network for those from different viral lineages (Figures 7 and 8). The S7 domains from five members of phytoreovirus clustered together and formed a distinct clade in both tree and neighbor-net. All of these from chrysoviruses also clustered together. The phylogeny relationships of the p3/p4 of clade II chrysoviruses were consistent with those of their RdRps, suggesting that the ancestor of clade II chrysoviruses also possessed the S7 domain in its RdRp and p3/p4. Considering that the branches of p3/p4 proteins were located at the base of chrysoviruses clade and that the 5′-terminal regions were homologous between RdRps and p3/p4s (Figure 5), it is most likely that the S7 domains were firstly obtained by the p3/p4 proteins of the ancestral clade II chrysoviruses and then transferred to the RdRp proteins by recombination during evolution. Given the close relationships in the neighbor-net (Figure 8), the S7 domains of clade I chrysoviruses may have been acquired by their progenitor from those in RdRps of the ancestor of clade I chrysoviruses via inter-species HGTs.
Some viruses from unrelated families constituted the third clade. The positions of their S7 domains were different (Figure 5). This suggests that HGTs may have occurred among these distant viruses. The closely related four unclassified mycoviruses SsNsV-L, FgV-3, DsRV-1 and PgRV-2 cluster together and the arrangement of their S7 and RdRp domains were also similar, suggesting that the ancestor of these four viruses may have contained the S7 domain. Interestingly, PgRV-1 and PgRV-2 that coinfect one fungal strain clustered together but the positions of their S7 domains were different, suggesting that HGT between these two viruses may have occurred. Similarly, the endornavirus Helicobasidium mompa endornavirus 1 (HmEV-1)  possibly acquired its S7 domain from the DsRV-1-like virus via HGT.
dsRNA viruses represent an extremely divergent group. Although RdRps are the most highly conserved proteins among RNA viruses, there is little primary-sequence similarity among those encoded by dsRNA viruses from different genera, even those belonging to the same family . For example, the RdRps from several distinct genera in the family Reoviridae show only 10–20% amino acid identity . So far, the phylogenetic relationship of members of Reoviridae with viruses in other families or genera is not clear. Therefore, the obvious conservation of S7 domain in diverse dsRNA viral lineages even from different families implies that it is not likely to be the results of vertical inheritance. Furthermore, the S7 domain only occurred in certain viruses of a given viral group; their phylogenies were not consistent with those of RdRps. For example, only three members of the family Endornaviridae have the S7 domains, and these did not cluster together in S7 domain tree (Figure 6). In addition, the S7 domains in the non-phytoreoviruses are truncated and have different domain arrangement with RdRps (Figure 5). Given these facts, the S7 domain sequences in non-phytoreoviruses were most likely acquired from ancestral phytoreoviruses and then horizontally transferred repeatedly among these diverse dsRNA viral lineages. This suggests that HGT between different virus species is an important macroevolutionary mechanism in dsRNA viruses.
In summary, we report the molecular properties of two novel mycoviruses from S. sclerotiorum strain Sunf-M. One is a monopartite virus representing a distinct evolutionary lineage of dsRNA viruses and the other is a bipartite virus, a new member of the family Partitiviridae. Comprehensive phylogenetic analyses and genome comparisons revealed that there are at least ten monopartite, three bipartite, one tripartite and three quadripartite lineages in the known mycovirus-related dsRNA viruses and those multipartite lineages possibly evolved from different monopartite dsRNA viruses. Moreover, we found that homologs of Phytoreo_S7 Domain are widely distributed in diverse non-phytoreovirus lineages, including, chrysoviruses, endornaviruses as well as some unclassified dsRNA mycoviruses. We further provided convincing evidence that multiple HGT events may have occurred among dsRNA viruses from different families. Our study provides an insight into the phylogeny and evolution of dsRNA viruses and reveals that HGT between different viruses and generation of multipartite genomes are important macroevolutionary mechanisms in dsRNA viruses.
Fungal strain and culture conditions
Wild type S. sclerotiorum isolate Sunf-M was obtained from sunflower (Helianthus annuus) in Hohhot, Inner Mongolia, China. Isolate was routinely cultured on PDA at 20°C and stored at 4°C in PDA tube slants. Small mycelial agar plugs were grown out on cellophane membranes on top of PDA at 20°C for 2 day, and then the mycelium was harvested from the cellophane membranes for dsRNA extraction.
DsRNA extraction and purification
A previously described procedure for dsRNA extraction  was used with minor modifications. Briefly, the mycelium was ground in liquid nitrogen with mortar and pestle. dsRNA was extracted from the mycelium and purified to remove traces of DNA and ssRNA by digestion with S1 nuclease and DNase I (TaKaRa). The dsRNA preparations were fractionated on 1.0% agarose gel and stained with ethidium bromide. The isolated dsRNA was agarose gel purified with an AxyPrep™ DNA Gel Extraction Kit (Axygen).
cDNA synthesis, molecular cloning and sequencing
cDNA synthesis and molecular cloning of the L-dsRNA was performed as previously described . A single-primer amplification method  with minor modifications was used to obtain the full-length clones of S-dsRNA. In brief, a 3′ amino-blocked ligase adaptor (5′-pGCATTGCATCATGATCGATCGAATTCTTTAGTGAGGGTTAATTGCC-NH2-3′) was ligated to the 3′-end of the purified S-dsRNA segment using T4 RNA ligase (Fermentas) and the adaptor-ligated dsRNA was then reverse transcribed with a complementary primer 1 (5′-GGCAATTAACCCTCACTAAAG-3′). Following treatment with RNase H (TAKARA), the cDNA strands were annealed for 10 min at 80°C, for 16 h at 65°C and for 3 h at 30°C and the resulting hybrid was filled in with Klenow Fragment (TAKARA). The cDNA was amplified with another complementary primer 2 (5′-TCACTAAAGAATTCGATCGATC-3′). The resulting products were recovered and purified with a gel extraction kit (Axygen), cloned into the pMD18-T vector (TaKaRa). DNA was sequenced by Sanger methods at the Beijing Genomics Institute (BGI). The full genomic sequences of L- and S-dsRNA were completed and confirmed by overlapping of cDNA clones and each base was determined by sequencing at least two independent clones from both orientations. New sequences generated in this study were deposited in GenBank under accession numbers: JQ513382, GQ280377.1 and GQ280378.1.
Northern blot hybridization
Northern hybridization analysis was performed as previously described . To verify the authenticity of the cDNA clones generated with the purified dsRNA, the cDNA clones from L-dsRNA, S-1 and S-2 dsRNA segments were labeled with 32P] dCTP using a radio-labeling kit (TaKaRa) and used to probe the different RNA blots, respectively.
Data collection and sequence analysis
The genome and protein sequences of the dsRNA viruses used in this study were downloaded from viral genome databases at the NCBI website (http://www.ncbi.nlm.nih.gov/genomes/GenomesHome.cgi?taxid10239). The software package DNAMAN 7 (Lynnon Biosoft, USA) was used for sequence annotations, including nucleotide statistics and ORF searching. Similarity searches of NCBI GenBank database were conducted using the online BLAST program (http://blast.ncbi.nlm.nih.gov/Blast.cgi). Searches for protein domains were performed using NCBI conserved domain database (CDD) (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). The pseudoknot structure of L-dsRNA was predicted by the KnotSeeker program  and was visualized using program VARNAv3-7 .
Sequence alignment and phylogenetic analysis
Multiple alignments of protein sequences were constructed using the M-Coffee web server (http://tcoffee.vital-it.ch/apps/tcoffee/do:mcoffee), which combines the output of several popular aligners into one single multiple sequence alignments. Phylogenetic trees were estimated using two independent methods: Maximum-Likelihood (ML) and Bayesian inference (BI) on the aligned amino acid sequences. BI trees were constructed with MrBayes-3.1.2 ,using WAG models of amino acid substitution with invgamma (+I + Γ), performing two runs each of four Monte Carlo Markov Chains (MCMCs), sampling every 1000th iteration over 1.1 × 106 generations after a burn-in of 101 samples. ML trees were inferred with PhyML 3.0 , using the best-fit model selected by ProtTest2.4  for each dataset, with SPRs algorithms and 8 categories of γ-distributed substitution rates. The reliability of internal branches was evaluated based on SH-aLRT supports. The resulting BI and ML trees were then used to construct the consensus tree for each alignment using TREE-PUZZLE5.2 .
Considering that conflicting phylogenetic signals can lead to tree reconstruction artifacts, we also constructed phylogenetic networks with SplitsTree4 program (http://www.splitstree.org/), using Neighbor-Net method and WAG model of substitution. The unresolved branches or conflicting phylogenetic signals in alignments could be indicated by the box-like structures in neighbor-nets graphs.
Approximate likelihood ratio test
Basic Local Alignment Search Tool
Conserved Domain Database
Horizontal gene transfer
Monte Carlo Markov Chains
National Center for Biotechnology Information
Open reading frames
RNA-dependent RNA polymerase
Ghabrial SA, Suzuki N: Viruses of plant pathogenic fungi. Annu Rev Phytopathol. 2009, 47: 353-384. 10.1146/annurev-phyto-080508-081932.
Pearson MN, Beever RE, Boine B, Arthur K: Mycoviruses of filamentous fungi and their relevance to plant pathology. Mol Plant Pathol. 2009, 10 (1): 115-128. 10.1111/j.1364-3703.2008.00503.x.
Liu HQ, Fu YP, Jiang DH, Li GQ, Xie J, Peng YL, Yi XH, Ghabrial SA: A novel mycovirus that is related to the human pathogen hepatitis E virus and rubi-like viruses. J Virol. 2009, 83 (4): 1981-1991. 10.1128/JVI.01897-08.
Yu X, Li B, Fu YP, Jiang DH, Ghabrial SA, Li GQ, Peng YL, Xie JT, Cheng JS, Huang JB, et al: A geminivirus-related DNA mycovirus that confers hypovirulence to a plant pathogenic fungus. Proc Natl Acad Sci U S A. 2010, 107 (18): 8387-8392. 10.1073/pnas.0913535107.
King AMQ, Adams MJ, Carstens EB, Lefkowitz EJ: (Eds): Virus taxonomy: Ninth Report of the International Committee on taxonomy of viruses. 2011, New York: Elsevier
Martin RR, Zhou J, Tzanetakis IE: Blueberry latent virus: an amalgam of the Partitiviridae and Totiviridae. Virus Res. 2011, 155 (1): 175-180. 10.1016/j.virusres.2010.09.020.
Sabanadzovic S, Abou Ghanem-Sabanadzovic N, Valverde RA: A novel monopartite dsRNA virus from rhododendron. Arch Virol. 2010, 155 (11): 1859-1863. 10.1007/s00705-010-0770-5.
Sabanadzovic S, Valverde RA, Brown JK, Martin RR, Tzanetakis IE: Southern tomato virus: the link between the families Totiviridae and Partitiviridae. Virus Res. 2009, 140 (1–2): 130-137.
Chiba S, Salaipeth L, Lin YH, Sasaki A, Kanematsu S, Suzuki N: A novel bipartite double-stranded RNA mycovirus from the white root rot fungus Rosellinia necatrix: molecular and biological characterization, taxonomic considerations, and potential for biological control. J Virol. 2009, 83 (24): 12801-12812. 10.1128/JVI.01830-09.
van der Walt E, Rybicki EP, Varsani A, Polston JE, Billharz R, Donaldson L, Monjane AL, Martin DP: Rapid host adaptation by extensive recombination. J Gen Virol. 2009, 90 (Pt 3): 734-746.
Froissart R, Roze D, Uzest M, Galibert L, Blanc S, Michalakis Y: Recombination every day: abundant recombination in a virus during a single multi-cellular host infection. PLoS Biol. 2005, 3 (3): e89-10.1371/journal.pbio.0030089.
Katz RA, Skalka AM: Generation of diversity in retroviruses. Annu Rev Genet. 1990, 24: 409-445. 10.1146/annurev.ge.24.120190.002205.
Chare ER, Holmes EC: A phylogenetic survey of recombination frequency in plant RNA viruses. Arch Virol. 2006, 151 (5): 933-946. 10.1007/s00705-005-0675-x.
Worobey M, Holmes EC: Evolutionary aspects of recombination in RNA viruses. J Gen Virol. 1999, 80 (Pt 10): 2535-2543.
Bujarski JJ: Recombination. Encyclopedia of virology, Volume 4. Edited by: Mahy BWJ, Regenmortel MHV. 2008, Oxford: Elsevier, 374-382. 3
Hellen CU, de Breyne S: A distinct group of hepacivirus/pestivirus-like internal ribosomal entry sites in members of diverse picornavirus genera: evidence for modular exchange of functional noncoding RNA elements by recombination. J Virol. 2007, 81 (11): 5850-5863. 10.1128/JVI.02403-06.
Chare ER, Gould EA, Holmes EC: Phylogenetic analysis reveals a low rate of homologous recombination in negative-sense RNA viruses. J Gen Virol. 2003, 84 (Pt 10): 2691-2703.
Hon CC, Lam TTY, Yip CW, Wong RTY, Shi M, Jiang JW, Zeng FY, Leung FCC: Phylogenetic evidence for homologous recombination within the family Birnaviridae. J Gen Virol. 2008, 89: 3156-3164. 10.1099/vir.0.2008/004101-0.
Phan TG, Okitsu S, Maneekarn N, Ushijima H: Evidence of intragenic recombination in G1 rotavirus VP7 genes. J Virol. 2007, 81 (18): 10188-10194. 10.1128/JVI.00337-07.
Parra GI, Bok K, Martinez M, Gomez JA: Evidence of rotavirus intragenic recombination between two sublineages of the same genotype. J Gen Virol. 2004, 85: 1713-1716. 10.1099/vir.0.79851-0.
Suzuki Y, Gojobori T, Nakagomi O: Intragenic recombinations in rotaviruses. FEBS Lett. 1998, 427 (2): 183-187. 10.1016/S0014-5793(98)00415-3.
Al Rwahnih M, Daubert S, Urbez-Torres JR, Cordero F, Rowhani A: Deep sequencing evidence from single grapevine plants reveals a virome dominated by mycoviruses. Arch Virol. 2011, 156 (3): 397-403. 10.1007/s00705-010-0869-8.
Yu J, Kwon SJ, Lee KM, Son M, Kim KH: Complete nucleotide sequence of double-stranded RNA viruses from Fusarium graminearum strain DK3. Arch Virol. 2009, 154 (11): 1855-1858. 10.1007/s00705-009-0507-5.
Kozlakidis Z, Hacker CV, Bradley D, Jamal A, Phoon X, Webber J, Brasier CM, Buck KW, Coutts RH: Molecular characterisation of two novel double-stranded RNA elements from Phlebiopsis gigantea. Virus Genes. 2009, 39 (1): 132-136. 10.1007/s11262-009-0364-z.
Jiang D, Ghabrial SA: Molecular characterization of Penicillium chrysogenum virus: reconsideration of the taxonomy of the genus Chrysovirus. J Gen Virol. 2004, 85 (Pt 7): 2111-2121.
Alam SL, Atkins JF, Gesteland RF: Programmed ribosomal frameshifting: much ado about knotting!. Proc Natl Acad Sci U S A. 1999, 96 (25): 14177-14179. 10.1073/pnas.96.25.14177.
Dinman JD: Programmed ribosomal frameshifting goes beyond viruses: organisms from all three kingdoms use frameshifting to regulate gene expression, perhaps signaling a paradigm shift. Microbe Wash DC. 2006, 1 (11): 521-527.
Haugland O, Mikalsen AB, Nilsen P, Lindmo K, Thu BJ, Eliassen TM, Roos N, Rode M, Evensen O: Cardiomyopathy syndrome of Atlantic Salmon (Salmo salar L.) is caused by a double-stranded RNA virus of the Totiviridae family. J Virol. 2011, 85 (11): 5275-5286. 10.1128/JVI.02154-10.
Li L, Tian Q, Du Z, Duns GJ, Chen J: A novel double-stranded RNA virus detected in Primula malacoides is a plant-isolated partitivirus closely related to partitivirus infecting fungal species. Arch Virol. 2009, 154 (4): 565-572. 10.1007/s00705-009-0342-8.
Spear A, Sisterson MS, Yokomi R, Stenger DC: Plant-feeding insects harbor double-stranded RNA viruses encoding a novel proline-alanine rich protein and a polymerase distantly related to that of fungal viruses. Virology. 2010, 404 (2): 304-311. 10.1016/j.virol.2010.05.015.
Ahn IP, Lee YH: A viral double-stranded RNA up regulates the fungal virulence of Nectria radicicola. Mol Plant Microbe Interact. 2001, 14 (4): 496-507. 10.1094/MPMI.2001.14.4.496.
Wu M, Jin F, Zhang J, Yang L, Jiang D, Li G: Characterization of a novel bipartite double-stranded RNA mycovirus conferring hypovirulence in the phytopathogenic fungus Botrytis porri. J Virol. 2012, 86 (12): 6605-6619. 10.1128/JVI.00292-12.
van Diepeningen AD, Debets AJ, Hoekstra RF: Intra- and interspecies virus transfer in Aspergilli via protoplast fusion. Fungal Genet Biol. 1998, 25 (3): 171-180. 10.1006/fgbi.1998.1096.
Aoki N, Moriyama H, Kodama M, Arie T, Teraoka T, Fukuhara T: A novel mycovirus associated with four double-stranded RNAs affects host fungal growth in Alternaria alternata. Virus Res. 2009, 140 (1–2): 179-187.
Lin YH, Chiba S, Tani A, Kondo H, Sasaki A, Kanematsu S, Suzuki N: A novel quadripartite dsRNA virus isolated from a phytopathogenic filamentous fungus, Rosellinia necatrix. Virology. 2012, 426 (1): 42-50. 10.1016/j.virol.2012.01.013.
Kozlakidis Z, Covelli L, Di Serio F, Citir A, Acikgoz S, Hernandez C, Ragozzino A, Flores R, Coutts RH: Molecular characterization of the largest mycoviral-like double-stranded RNAs associated with Amasya cherry disease, a disease of presumed fungal aetiology. J Gen Virol. 2006, 87 (Pt 10): 3113-3117.
Ahlquist P: Parallels among positive-strand RNA viruses, reverse-transcribing viruses and double-stranded RNA viruses. Nat Rev Microbiol. 2006, 4 (5): 371-382. 10.1038/nrmicro1389.
Koonin EV, Dolja VV: Evolution and taxonomy of positive-strand RNA viruses: implications of comparative analysis of amino acid sequences. Crit Rev Biochem Mol Biol. 1993, 28 (5): 375-430. 10.3109/10409239309078440.
Koonin EV: Evolution of double-stranded RNA viruses: a case for polyphyletic origin from different groups of positive-stranded RNA viruses. Semin Virol. 1992, 3: 327-339.
Koonin EV, Wolf YI, Nagasaki K, Dolja VV: The Big Bang of picorna-like virus evolution antedates the radiation of eukaryotic supergroups. Nat Rev Microbiol. 2008, 6 (12): 925-939. 10.1038/nrmicro2030.
Chen J, Xiong J, Yang J, Mao Z, Chen X: Nucleotide sequences of four RNA segments of a reovirus isolated from the mud crab Scylla serrata provide evidence that this virus belongs to a new genus in the family Reoviridae. Arch Virol. 2011, 156 (3): 523-528. 10.1007/s00705-010-0852-4.
Mertens P: The dsRNA viruses. Virus Res. 2004, 101 (1): 3-13. 10.1016/j.virusres.2003.12.002.
DePaulo J, Powell C: Extraction of double-stranded RNA from plant tissues without the use of organic solvents. Plant Dis. 1995, 79 (3): 246-248. 10.1094/PD-79-0246.
Covelli L, Coutts RH, Di Serio F, Citir A, Acikgoz S, Hernandez C, Ragozzino A, Flores R: Cherry chlorotic rusty spot and Amasya cherry diseases are associated with a complex pattern of mycoviral-like double-stranded RNAs. I. Characterization of a new species in the genus Chrysovirus. J Gen Virol. 2004, 85 (Pt 11): 3389-3397.
Sperschneider J, Datta A: KnotSeeker: heuristic pseudoknot detection in long RNA sequences. RNA. 2008, 14 (4): 630-640. 10.1261/rna.968808.
Darty K, Denise A, Ponty Y: VARNA: interactive drawing and editing of the RNA secondary structure. Bioinformatics. 2009, 25 (15): 1974-1975. 10.1093/bioinformatics/btp250.
Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574. 10.1093/bioinformatics/btg180.
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O: New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010, 59 (3): 307-321. 10.1093/sysbio/syq010.
Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21 (9): 2104-2105. 10.1093/bioinformatics/bti263.
Schmidt HA, Strimmer K, Vingron M, von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002, 18 (3): 502-504. 10.1093/bioinformatics/18.3.502.
This research was supported by China National Funds for Distinguished Young Scientists (31125023), the Special Fund for Agro-scientific Research in the Public Interest (201103016) and China Agriculture Research System (nycytx-00514).
The authors declare that they have no competing interests.
HL, YF and DJ conceived and designed the study; HL performed the computational analyses and lab experiments; HL, YF, DJ, SAG, GL, JX, JC, YP, and XY analyzed data; and HL, YF, DJ and SAG wrote the paper. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Figures S1–S3. This file includes 3 supplementary figures. Figure S1 shows the alignments of 5′-untranslated regions (UTRs) of SsPV-S S-1 and S-2 segments. Figure S2 shows Neighbor-Net analysis of mycovirus-related dsRNA viruses. Figure S3 shows the ML phylogenetic trees of the RdRps and CPs of viruses in the family Partitiviridae.(PDF 679 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.