Multiple invasions of Gypsy and Micropia retroelements in genus Zaprionus and melanogaster subgroup of the genus Drosophila
© de Setta et al; licensee BioMed Central Ltd. 2009
Received: 11 June 2009
Accepted: 2 December 2009
Published: 2 December 2009
The Zaprionus genus shares evolutionary features with the melanogaster subgroup, such as space and time of origin. Although little information about the transposable element content in the Zaprionus genus had been accumulated, some of their elements appear to be more closely related with those of the melanogaster subgroup, indicating that these two groups of species were involved in horizontal transfer events during their evolution. Among these elements, the Gypsy and the Micropia retroelements were chosen for screening in seven species of the two Zaprionus subgenera, Anaprionus and Zaprionus.
Screening allowed the identification of diverse Gypsy and Micropia retroelements only in species of the Zaprionus subgenus, showing that they are transcriptionally active in the sampled species. The sequences of each retroelement were closely related to those of the melanogaster species subgroup, and the most parsimonious hypothesis would be that 15 horizontal transfer events shaped their evolution. The Gypsy retroelement of the melanogaster subgroup probably invaded the Zaprionus genomes about 11 MYA. In contrast, the Micropia retroelement may have been introduced into the Zaprionus subgenus and the melanogaster subgroup from an unknown donor more recently (~3 MYA).
Gypsy and Micropia of Zaprionus and melanogaster species share similar evolutionary patterns. The sharing of evolutionary, ecological and ethological features probably allowed these species to pass through a permissive period of transposable element invasion, explaining the proposed waves of horizontal transfers.
While the melanogaster species subgroup - mainly the two sibling species D. melanogaster and D. simulans - presents the most well known dipteran mobilomes, knowledge about the transposable elements (TEs) content of the Zaprionus species is scant. Only seven TE families have been identified in all Zaprionus genus. The elements 412 of Z. tuberculatus, 731 of Z. ornatus, Bari-1 of Z. tuberculatus and, Mariner of 12 Zaprionus species could only be detected by Southern blot hybridization using probes derived from D. melanogaster [12–17]. Furthermore, partial sequences were produced for Gypsy of Z. indianus, Copia of Z. tuberculatus, Mariner of Z. tuberculatus and Z. verruca, and Rover of Z. indianus [17–20]. Interestingly, evolutionary analyses demonstrated that four of these seven TEs of Zaprionus are highly related to those the melanogaster subgroup, sharing their common ancestor more recently than the species ones. Gypsy, Copia, Mariner and Rover were therefore proposed to have been horizontally transferred between species of the Zaprionus subgenus and the melanogaster subgroup of the genus Drosophila [15–18, 20, 21]. Horizontal transfer (HT) of TEs has been inferred when three criteria are met: (a) high sequence conservation between TE sequences of distantly related species; (b) incongruence between TE and host phylogenies and, (c) discontinuous distribution of a TE across a species group. Additionally, requirements like geographic, temporal and ecological overlapping and, exclusion of alternative hypothesis as ancestral polymorphism promoting differential subfamily fixation and high selective constraints over the sequences need to be considered .
In a previous study, we have performed a search of D. melanogaster TEs in the American continent invasive species Z. indianus by Dot blot methodology, aiming to broaden our understanding of TE occurrence in species of the Zaprionus genus . All the 10 TEs analyzed were present in the Z. indianus genome, but only five had strong hybridization signals with the probes that were used: Roo/B104, Doc, MDG-1, Micropia and Gypsy. Among them, we selected Gypsy and Micropia retroelements to expand our evolutionary analyses in the Zaprionus genus. This choice was due to the evidence of the close relationship of these Z. indianus elements with others already described in the literature, inferred by the strong hybridization signals using heterologous probes, the easy amplification with the primers available in the literature and the proposition of a Gypsy HT of between Z. indianus and D. simulans .
Gypsy and Micropia are classified as Ty3/Gypsy elements because of their gene arrangement in the pol coding region - Reverse transcriptase/RNAseH/Integrase . Ty3/Gypsy retroelements, which are similar to vertebrate retroviruses in both sequence and genomic structure, are broadly distributed among eukaryotes . They differ from vertebrate retroviruses mainly because they do not have an env gene necessary to complete the infectious cycle . Gypsy is an exception to the typical Ty3/Gypsy structure because an active env is present, potentially capable of maintaining infectivity . Hence, Gypsy was classified as an Errantivirus by the International Committee on Taxonomy of Viruses (ICTV) .
The Micropia retroelement had already been studied in D. melanogaster, and several species of the repleta and the cardini groups of the subgenus Drosophila [28–30]. Evolutionary analyses showed the existence of at least two Micropia families that have ~30% divergence in DNA sequence . The first family, present in the repleta group, comprises two subfamilies than were differentiated by nucleotide divergence and antisense RNA expression. The repleta species group seems to have transmitted one of its elements to the D. cardini group through HT . The second family is represented by the D. melanogaster element that also transcribes antisense RNAs responsible for control of its expression . The data suggest that the Micropia retroelement is an ancient component of Drosophila genomes .
In contrast to Micropia, the Gypsy retroelement has been analyzed in the 12 Drosophila genomes , and individual sequences from several Drosophilidae species are available, such as the virilis and repleta groups of subgenus Drosophila; the willistoni and melanogaster groups of the subgenus Sophophora; and the Scaptodrosophila latifasciaeformis and the Z. indianus species [18, 32–34]. Two Gypsy families have been identified, one exclusive to D. willistoni and the other widely distributed among Drosophilidae species . The latter is divided into 10 subfamilies, seven of which have been implicated in HT events, demonstrating that inheritance of the Gypsy retroelement is complex [18, 32]. HT events involving Gypsy could have been facilitated by the presence of env, assumed to confer autonomous infectivity on the retroelement, thereby allowing it to cross species barriers [18, 32, 35]. Specifically in the melanogaster species subgroup, two Gypsy subfamilies have been identified, and seven HTs have been proposed among melanogaster species subgroups, or between one species of that subgroup and S. latifasciaeformis and Z. indianus [18, 32]. These inter-genera HTs are assumed to have occurred because synonymous sites of Gypsy env gene are three to five times more highly conserved than those of host genes in species that diverged longer than 40 MYA .
We have now surveyed the Gypsy and Micropia retroelements in seven species of the Zaprionus genus, comprising one species of the Anaprionus subgenus and six of the Zaprionus subgenus. Our objective was to investigate the distribution and evolutionary history of these two elements in this scarcely studied genus by comparing their sequences with those of the 12 Drosophila genomes and other sequences available in the nucleotide database. The results show that the Zaprionus Gypsy and Micropia retroelements are present and transcriptionally active only in the subgenus Zaprionus, to which they were introduced by more than one horizontal transfer event from different donor species.
Distribution of Gypsy and Micropia in the genus Zaprionus
Zaprionus species used in this study, taxonomic classification, divergence time, geographic origin of the strains and GenBank sequence accession numbers.
Divergence time 
Ithala (South Africa)
São Tomé (São Tomé e Príncipe)
Evolutionary analyses of the Gypsyretroelement
To infer the evolutionary history of Gypsy retroelement in the Zaprionus subgenus, we carried out genomic searches for Zaprionus homologous sequences in the 12 Drosophila genomes and the GenBank database. The in silico search retrieved genomic sequences only from the melanogaster subgroup, indicating that the other seven Drosophila genomes (D. ananassae, D. pseudoobscura, D. persimilis, D. willistoni, D. mojavensis, D. virilis and D. grimshawi) do not harbor Gypsy sequences closely related to those of the subgenus Zaprionus. Eight sequences were identified in D. melanogaster, nine in D. simulans, 19 in D. sechellia, six in D. yakuba, and 13 in D. erecta [Additional file 3]. A BLAST-based analysis of the structure sequence indicated that only D. melanogaster genome presents putatively full-length insertions [Additional file 4].
Inferring phylogenetic relationships and divergence
Genetic divergence between the Gypsy and Micropia retroelements
clade 1 - S1
Zaprionus vs melanogaster S1
clade 2 - S2V1
Zaprionus vs melanogaster V1
clade 3 - S2V2
Zaprionus vs melanogaster V2
clade 1 (S1) vs clade 2 (V1)
clade 1 (S1) vs clade 3 (V2)
clade 2 (V1) vs clade 3 (V2)
clade 1 vs clade 2
Zaprionus × melanogaster group
Testing the HT hypothesis
Comparative analyses of synonymous divergence (dS), non-synonymous divergence (dN), dN:dS ratios and codon bias index (CBI) of the Gypsy and Micropia sequences and Gpdh.
CBI mean (min-max)
dN mean (SE)
dS mean (SE)
dN:dS mean (SE)
Zaprionus × melanogaster spp.
(9.70 × 10-8)
(5.95 × 10-4)
(3.88 × 10-5)
(6.49 × 10-4)
Subfamily 2 - V1
Zaprionus × melanogaster spp.
(3.55 × 10-5)
(4.42 × 10-5)
(4.85 × 10-5)
Subfamily 2 - V2
Zaprionus × melanogaster spp.
(2.65 × 10-5)
(2.62 × 10-4)
(2.50 × 10-5)
(2.55 × 10-4)
(1.10 × 10-6)
(1.90 × 10-4)
Zaprionus × melanogaster spp.
(1.44 × 10-5)
(1.94 × 10-4)
(3.80 × 10-5)
(2.45 × 10-4)
(6.45 × 10-5)
(4.87 × 10-4)
Zaprionus × melanogaster spp
(3.07 × 10-6)
(4.90 × 10-6)
(2.16 × 10-5)
(5.43 × 10-4)
Phylogenetic relationships of the Micropiaretroelement
In order to evaluate the evolutionary history of Micropia, the same approaches of phylogeny and divergence estimations used for Gypsy were applied. Sequences homologous to the Zaprionus species were also identified only in species of the melanogaster subgroup, except for D. erecta, the unique species of the melanogaster subgroup for which no significant matches were obtained. D. sechellia had the most hits (22), followed by D. simulans (six), D. yakuba (four) and D. melanogaster (three) [Additional file 3]. Structure analysis indicates that the D. melanogaster and D. simulans genomes harbor putatively full-length insertions [Additional file 8].
Inferring phylogenetic relationships and divergence
Testing the HT hypothesis
Evaluation of the selective constraints showed that the mean dN:dS values of the Micropia sequences were 14-58 times higher than those of Gpdh (Table 3 and [Additional file 10]). Between the Zaprionus and melanogaster sequences, for example, the values were 0.524 and 0.009 for Micropia and Gpdh, respectively. In addition, the Z-test failed to reject the neutrality hypothesis in ~80% of the pairwise comparisons [Additional file 11]. These results refute the hypothesis that the high similarity between the Micropia sequences of Zaprionus and melanogaster is due to high selective constraints conserving their nucleotide sequences.
Network trees corroborate the inference of HT
Network reconstructions were used as an additional approach to test the hypothesis that HT has shaped the evolution of Gypsy and Micropia in Zaprionus and melanogaster species. Since TE sequences can be considered as populations of sequences that share a common ancestor, this type of phylogeny lets one propose possible donors and receptors species, and to reconstruct the TE dispersion routes by the proposition of the ancestral nodes.
Network reconstruction for Micropia clustered the sequences of D. yakuba, D. melanogaster, Z. camerounensis, Z. davidi, Z. indianus and Z. africanus with central median vectors, suggesting that high sequence conservation was not due to HTs involving the species analyzed, but probably one or more donor species not sampled (Figure 6c). The D. melanogasterA3 sequence connects the median vectors with a large cluster that harbors D. sechellia, D. simulans and D. melanogaster sequences, which in turn links a median branch to the Z. tuberculatus sequences. This arrangement indicates at least two more recent transfers, one from D. melanogaster to D. simulans and/or D. sechellia or even to the ancestor of those sister species (corroborated by the low dS values), and a second from any of these species to Z. tuberculatus. In addition, the branch of the D. sechelliaA20 sequence with a median vector that links the Z. camerounensis, Z. davidi, Z. indianus and Z. africanus sequences suggest an additional HT involving the ancestor of D. sechellia. However, the absence of geographic sharing between D. sechellia and Zaprionus species means that this proposal should be viewed cautiously. An alternative explanation could be a high rate of evolution of the D. sechelliaA20 sequence, followed by random convergent evolution.
HT inference and estimation of the divergence time between TE sequences
Time of divergence and horizontal transfer events identified for the Gypsy and Micropia retroelements
D. melanogaster/D. simulans*
3.8 - 8.8
0 - 3.1
Subfamily 2 - V1
5.2 - 11.2
Z. davidi/Z. camerounensis*
1.2 - 6.0
2.5 - 3.8
2.5 - 3.9
Subfamily 2 - V2
D. melanogaster/D. simulans/D. sechellia*
indianus complex/Z. davidi*
0.6 - 10.3
2.3 - 3.2
0 - 1.6
D. simulans/D. sechellia
D. melanogaster or D. simulans/D. sechellia*
0.8 - 1.5#
The Gypsy S2V1 sequences seem to have been involved at least in four HT events. We suggest that D. erecta was the donor of S2V1 in a more ancient transfer to Z. tuberculatus (occurred about 5.2-11.2 MYA), followed by introduction into the Z. camerounensis/Z. davidi ancestor (1.2-6 MYA), and lately into D. yakuba (2.5-3.8 MYA) and D. melanogaster (2.5-3.9 MYA). The transfer to Z. camerounensis/Z. davidi probably occurred in their ancestor, because these species diverged around 2.2 MYA , but the hypothesis of two independent HTs cannot be disregarded. The donor status of D. erecta was based on their basal positioning in the network, the repeated clustering to all the other species of this variant in the phylogeny, the divergence times between D. erecta and D. melanogaster/D. yakuba (8-15 MYA ).
The S2V2 variant is probably involved in one HT event that happened around 0.6-10.3 MYA. It could have occurred between the ancestor of the modern sequences of D. melanogaster/D. simulans/D. sechellia and the indianus/davidi complexes. Alternatively, the HT could have happened in each complex separately, since the davidi complex diverged about 0.8 MYA . The HT between the D. melanogaster/D. simulans/D. sechellia ancestor and the indianus complex ancestor is a broader scenario than the HT proposed between Z. indianus and D. simulans , exemplifying the importance of analyzing as many species as possible before inferring horizontal transfers . Finally, the time of divergence also corroborates the HT from Z. davidi to Z. africanus S2V2 sequences, as seen in the network tree, which probably occurred ~1.9 MYA, subsequent to the divergence of these species (2.6 to 4.1 MYA ).
Comparing the divergence times between the Micropia retroelement of the Zaprionus and melanogaster sequences with those of Gypsy, the HTs of Micropia might have started more recently than those of Gypsy (Micropia HTs: 3.2 MYA, Gypsy HTs: 11.2 MYA). The invariance of the synonymous sites between several pairwise comparisons [Additional file 8] makes it difficult to date these events, but the highest dS levels indicate that, of the species we have analyzed, D. yakuba and the indianus complex species were the first to be invaded by Micropia. So, the evolutionary analyses lead to the inference that Micropia diverged after the splitting of their species hosts, suggesting that at least an unknown species might have transferred Micropia sequences independently in five HTs to D. melanogaster, D. yakuba (2.3-3.2 MYA), Z. camerounensis, Z. davidi and the indianus complex (up to 1.6 MYA). Species of neither the repleta nor the cardini species group are good candidates, since they harbor Micropia retrotransposons of a different family [29, 30]. Hence, the probable donor(s) might have been a non-studied species. Based mainly on the network tree, two other HT events can be proposed. D. melanogaster may have been the donor of Micropia to the D. simulans and D. sechellia, or even to their ancestor. However, introgression cannot be excluded in this case. In turn, one of these three species might have been involved in the transmission of Micropia to Z. tuberculatus, about 0.8-1.5 MYA.
This survey of the Gypsy and Micropia in seven species of Zaprionus indicates that they are widely distributed and transcriptionally active only in species of subgenus Zaprionus. Additionally, the evolutionary analyses demonstrate that these two retroelements are closely related to those of the melanogaster species subgroup, and that their histories might have been repeatedly marked by events of inter- and intra-subgenus HTs. Several pieces of evidence have shown that the phenomenon of HT is frequent in eukaryotes [22, 37–39]. It is important to note that the HTs proposed here were assumed only when the species involved in the event shared geographic, temporal and ecological environments, and when three separate pieces of evidence suggested their occurrence: (i) lower dS values between the TE sequences than the Gpdh sequences, (ii) incongruence between host and TE phylogenies, and (iii) last common ancestor of TE more recent than that of the species.
The studies of Micropia and Gypsy in subgenus Zaprionus species highlight the HT events as an alternative evolutionary mechanism of the retroelements evolution. Among the HTs inferred for Gypsy, four events seemed to involve the Zaprionus and melanogaster species, three involved only the melanogaster group species, and only one the Zaprionus species. For the Micropia retrotransposon, seven HTs were suggested: five between unknown donor(s) and the Zaprionus or the melanogaster species, one among the melanogaster species, and one between the Zaprionus and the melanogaster species. The 15 HTs we have proposed probably occurred during two different waves of invasions. First, the Gypsy retroelement of the melanogaster subgroup had been introduced in the Zaprionus subgenus ~11 MYA. After that (~3 MYA), a second HT wave involving the Micropia retroelement may have introduced it into both the Zaprionus subgenus and melanogaster subgroup from an unknown donor. After the introduction, the TEs could have been prone to transposition and re-introduction in other related species. For example, Z. davidi has donated their Gypsy sequence to Z. africanus after had received it from an ancestor of D. melanogaster, D. simulans and D. sechellia.
According to the TE evolutionary cycle, the TE history could start from HT events, followed by the initial transpositional burst in the new host, and then the accumulation of defective copies along with host-directed epigenetic silencing . The last step could lead to the loss of mobility and, finally, to molecular erosion by random mutations. The high frequency of HTs, combined with the transcriptional activity and low insertion numbers of Gypsy and Micropia retroelements in the Zaprionus species subgenus indicates that these elements remain at the first or second steps of the evolutionary life history of the TEs, i.e. the invasion or the genomic spread period. A question that arises is whether these elements have been transferred in similar ways. HTs of Gypsy are easier to explain, since the active Gypsy env can enable its dispersion and expansion of the genomic territory, but this mechanism does not to apply to Micropia, which lacks env. Nevertheless, retroelements could have been transferred concomitantly with retrovirus infections, even Gypsy; or parasitic infestations, as for example by intracellular Wolbachia bacteria or mites . Moreover, our data allowed us to propose there were HT of Gypsy and Micropia among sister species such as D. melanogaster, D. simulans and D. sechellia, although introgression could not be ruled out.
Another concern is the evolutionary process of fixation of the TEs after the horizontal introduction. These elements could share similar mechanisms of regulation, which would permit them escaping from selective forces and allow fixation after invasion. Control of Drosophila retroelement mobilization depends on the transcription rate, which is directly related to the presence of specific regulatory proteins [41–45] and the rate of RNA degradation, mediated by RNAi systems . However, genetic information about the species of the genus Zaprionus is scarce. Further, studies of the Zaprionus genes involved in controlling mobilization could lead to hypotheses about the mechanisms that allow highly related sequences to be shared in the Zaprionus and melanogaster species genomes.
Our survey suggests that Zaprionus species have experienced waves of retroelement invasions, particularly during the last 7 million years. Species of the melanogaster subgroup might have donated their Gypsy sequences to species of the Zaprionus subgenus, and these two species groups could have received the Micropia retroelement from one or more unknown donor species. After the initial introduction, these species could share their elements. Since the Zaprionus genus and the melanogaster subgroup seem to share the same age of origin and diversification in tropical Africa, as well as ecological features, our data suggest that they passed through a permissive period of transposable element invasion during the diversification period.
Fly stocks and DNA extraction
All the strains of the Zaprionus species that were used are listed in Table 1, being derived from a single female selected randomly from a mass culture kindly provided by Drs Jean David and Amir Yassin from LEGS, CNRS, France. The D. melanogaster Canton-S strain was used as positive control for the molecular analyses. Genomic DNA was extracted from 10 individuals of each strain using the phenol-chloroform method .
PCR reactions, cloning and sequencing
Primers 2813 (5' TTA ACT CCT AGA GTT CAT CGC TGG 3') and 2814 (5' CAT GTA CCT GGT TAA CTA CTG ACC 3') were used to amplify the equivalent 386 bp fragment of the D. hydei Micropia retroelement located in a highly conserved region of the RNAseH domain, between nucleotides 2813 and 3198 . The Gypsy retroelement fragment was obtained using primers GYP3S2F (5' AAA GGC GAY TTG GTT GAC ACT CC 3') and GYP3S2R (5' CAR GTG GCT RGG TTG RGT GTG 3') and corresponds to a 485 bp sequence located in the 6491-6511 region of env (ORF 3) in the D. melanogaster Gypsy element . Both PCR reactions were performed in a final volume of 25 μl, using 200 ng of genomic DNA, 0.4 μM of each primer, 160 μM of each dNTP, 2 mM MgCl2 and 1 U Taq Platinum polymerase (Invitrogen) in 1× PCR buffer. The Gypsy cycling parameters used for amplification have been described previously . The Micropia PCR cycling parameters were: 94°C for 3 min for initial denaturation, 40 cycles of 94°C for 30 s, 58°C for 1 min, and 72°C for 1 min, followed by a final extension step at 72°C for 10 min. The fragments obtained were purified directly from the PCR product, using the GFX PCR DNA and Gel Band Purification Kit (GE Healthcare), and cloned with the TOPO TA Cloning Kit (Invitrogen). Three randomly chosen clones were automatically sequenced in an ABI PRISM 3100 Genetic Analyzer (Applied Biosystems/Hitashi) using the primer pair T7 and M13R.
RNA extraction and RT-PCR reactions
Heads and gonads from 10 individuals of each sex were dissected in Testis Buffer (183 mM KCl, 47 mM NaCl and 10 mM Tris-HCl pH 6.8). Total RNA was isolated from the dissected tissues using the TRIZOL reagent (Invitrogen) method and genomic DNA was eliminated from the samples by RQ1 RNase-Free DNase (Promega) according to the manufacturer's instructions. The cDNA pool was generated from the total RNAs using random primers and a High Capacity cDNA Archive Kit (Applied Biosystems) under low stringency conditions (37°C). The PCR reaction conditions and cycling parameters were the same as those used for genomic DNA amplification of both retroelements. To test genomic DNA contamination in the total RNA and cDNA quality Gpdh constitutive gene amplifications were performed using the total RNA extract treated with DNAse and the cDNA pool as templates, respectively. The control PCR reactions were carried out using 200 ng of total cDNA, 0.1 mM of each dNTP, 0.4 μM of the primers ZapGPDHF (5' GTT CGG CAA TTG AAC CAA TG 3') and ZapGPDHR (5' AGA GAG TCC GTG TGC ATG TG 3'), 2 mM MgCl2 and 1 U Taq Platinum polymerase (Invitrogen) in 1× PCR buffer. The cycling parameters were: 94°C for 2 min for initial denaturation, 35 cycles of 94°C for 1 min, 60°C for 1 min and 72°C for 1 min, and an additional extension step at 72°C for 10 min. The Gpdh primers were designed based on the Z. tuberculatus Gpdh (Glycerol 3 phosphate dehydrogenase) gene ORF1 (L37039) and amplify a 337 bp sequence in this sequence.
Zaprionus subgenus sequences were manipulated into BioEdit  and aligned with Clustal W 1.81 . The most divergent clones (>25%) were selected as queries for a search in the 12 Drosophila genomes using the flybase BLASTn tool http://flybase.bio.indiana.edu/blast/ and the genome database released on October 17 th , 2008. The queries selected were Z. tuberculatus1, Z. tuberculatus2 and Z. gabonicus1 for Gypsy, and Z. tuberculatus2 and Z. africanus1 for Micropia. The structure of the genomic sequences were predicted using the BLAST 2 sequences and ORF Finder programs. In order to obtain only highly related and non-disrupted sequences the searches were performed with stringent parameters (e-values > e-50 and 80% coverage). Redundant genomic sequences (100% identical) were not included in the phylogenetic analyses in order to minimize polytomies.
Multiple alignments of the Zaprionus and the genomic sequences were used to infer the phylogenetic relationships, using the maximum likelihood (ML), neighbor-joining (NJ) and maximum parsimony (MP) methods, as implemented in PhyML 3.0, MEGA 4.1 and PAUP v.4.0b10 [50–52], respectively. Branch support was calculated by bootstrap analysis consisting of 1000 replicates . In the NJ and ML analyses, maximum composite likelihood (MCL) and HKY85 distances were used to estimate the divergence matrices and reconstructed trees, respectively [54, 55]. A heuristic search algorithm was used for MP reconstruction. The sequences obtained were registered in the GenBank database (Table 1). Gypsy sequences of S. latifaciaeformis (AF548144, AF548153 and AF548152) and D. willistoni (AF548159, AF548176 and AF548143), and Micropia sequences of D. buzzatii (AY522351), D. hydei (AY519123), D. paranaensis (AY519124), D. mercatorum (AY519125), D. seriema (AY522346), D. gouveai (AY522353), D. antonietae (AY522345), D. serido (AY522344), D. spenceri (AY522347) and D. koepferae (AY522352) from GenBank, were also used in the phylogenetic reconstruction.
Selection tests for Gypsy, Micropiaand host gene
The number of synonymous substitutions per synonymous site (dS), non-synonymous substitutions per non-synonymous site (dN), the Codon Based Z-test and dN:dS ratios were estimated for the Zaprionus and melanogaster sequences using the Nei-Gojobori distance with the Jukes and Cantor correction, as implemented in MEGA 4.1 . The Gypsy sequences D. sechelliaA6/B14/B15, D. yakubaA6 and D. erectaA2/A6/B9, and the Micropia sequences D. simulansA3/A6, D. sechelliaA9/A18/A20 and D. yakubaA3, were excluded from those alignments because of the presence of indels (>2 bp) disrupting the open reading frames. Point (1 bp) alignment gaps were deleted prior to estimation of dN and dS. The codon bias index (CBI, ) was estimated for each sequence using the DnaSP 4.50 program , where zero indicates no bias and 1 maximum bias. Sequences of exon 4 of Gpdh in Zaprionus and melanogaster subgroup species (FJ705445 to FJ705450, L37039, NM_057218, XM_002078253, XM_002089126, XM_001968825 and D. sechellia genomic sequence: scaffold_5/4016995-4017372) were used to compare the dS values of the Gypsy and Micropia sequences.
Estimations of divergence time for Gypsy and Micropia
The time of divergence of Gypsy and Micropia retrotransposons was estimated according to the molecular clock equation r = k/2T, where r is the evolutionary rate (0.011 per site per MY, according to synonymous sites rate for genes with low codon bias in Drosophila ), k is the dS pairwise divergence, and T is the divergence time between species.
The median-joining network trees were obtained using all sequences studied in the conventional phylogenies. The reconstructions were performed using DNA Aligment 220.127.116.11 and NETWORK 18.104.22.168 software , available at the Fluxus Technology Ltd. website. 'MP calculation' was applied for post-processing the networks, the characters were equally weighted, and the other median-joining parameters followed the software default.
List of abbreviations
long terminal repeat
- env :
maximum composite likelihood distance
Gypsy subfamily 1
Gypsy subfamily 2 variant 1
Gypsy subfamily 2 variant 2
codon bias index
- Gpdh :
glycerol 3 phosphate dehydrogenase
number of synonymous substitutions per synonymous site
non-synonymous substitutions per non-synonymous site
We gratefully acknowledge funding from the CAPES-COFECUB International Cooperation Program (to NS, CMAC and PC), FAPESP (MAVS - 04/02851-9), CNPq (CMAC and MAVS). NS was receipt of a CNPq fellowship. We thank J. David and A. Yassin for providing the Zaprionus strains, F. Lemeunier for the technical help, and the anonymous reviewers for their critical comments.
- Yassin A, Araripe LO, Capy P, Da Lage JL, Klaczko LB, Maisonhaute C, Ogereau D, David JR: Grafting the molecular phylogenetic tree with morphological branches to reconstruct the evolutionary history of the genus Zaprionus (Diptera: Drosophilidae). Mol Phylogenet Evol. 2008, 47: 903-15. 10.1016/j.ympev.2008.01.036.View ArticlePubMedGoogle Scholar
- Lachaise D, Silvain JF: How two Afrotropical endemics made two cosmopolitan human commensals: the Drosophila melanogaster - D. simulans palaeogeographic riddle. Genetica. 2004, 120: 17-39. 10.1023/B:GENE.0000017627.27537.ef.View ArticlePubMedGoogle Scholar
- Pélandakis M, Solignac M: Molecular phylogeny of Drosophila based on ribosomal RNA sequences. J Mol Evol. 1993, 37: 525-543. 10.1007/BF00160433.View ArticlePubMedGoogle Scholar
- Kwiatowski J, Skarecky D, Bailey K, Ayala FJ: Phylogeny of Drosophila and related genera inferred from the nucleotide sequence of the Cu, Zn Sod gene. J Mol Evol. 1994, 38: 443-454. 10.1007/BF00178844.View ArticlePubMedGoogle Scholar
- Russo CAM, Takezaki N, Nei M: Molecular phylogeny and divergence times of drosophilid species. Mol Biol Evol. 1995, 12: 391-404.PubMedGoogle Scholar
- Remsen J, De Salle R: Character congruence of multiple data partitions and the origin of the Hawaiian Drosophilidae. Mol Phylogenet Evol. 1998, 9: 225-235. 10.1006/mpev.1997.0484.View ArticlePubMedGoogle Scholar
- Kwiatowski J, Ayala FJ: Phylogeny of Drosophila and related genera: conflict between molecular and anatomical analyses. Mol Phylogenet Evol. 1999, 13: 319-328. 10.1006/mpev.1999.0657.View ArticlePubMedGoogle Scholar
- Tatarenkov A, Kwiatowski J, Skarecky D, Barrio E, Ayala FJ: On the evolution of Dopa decarboxylase (Ddc) and Drosophila systematics. J Mol Evol. 1999, 48: 445-462. 10.1007/PL00006489.View ArticlePubMedGoogle Scholar
- Da Lage JL, Kergoat GJ, Maczkowiak F, Silvain JF, Cariou ML, Lachaise D: A phylogeny of Drosophilidae using the amyrel gene: questioning the Drosophila melanogaster species group boundaries. J Zoolog Syst Evol Res. 2007, 45: 47-63. 10.1111/j.1439-0469.2006.00389.x.View ArticleGoogle Scholar
- Dalton R: A fly by any other name. Nature. 2009, 457: 368-10.1038/457368a.PubMedGoogle Scholar
- Tamura K, Subramanian S, Kumar S: Temporal patterns of fruit fly (Drosophila) evolution revealed by mutation clocks. Mol Biol Evol. 2004, 21: 36-44. 10.1093/molbev/msg236.View ArticlePubMedGoogle Scholar
- Cizeron G, Lemeunier F, Loevenbruck C, Brehm A, Biémont C: Distribution of the retrotransposable element 412 in Drosophila species. Mol Biol Evol. 1998, 15: 1589-1599.View ArticlePubMedGoogle Scholar
- Montchamp-Moreau C, Ronsseray M, Jacques M, Lehmann M, Anxolabéhère D: Distribution and conservation of sequenceshomologous to the 1731 retrotransposon in Drosophila. Mol Biol Evol. 1993, 10: 791-803.PubMedGoogle Scholar
- Moschetti R, Caggese C, Barsanti P, Caizzi R: Intra- and interspecies variation among Bari-1 elements of the melanogaster species group. Genetics. 1998, 150: 239-250.PubMed CentralPubMedGoogle Scholar
- Maruyama K, Hartl DL: Evolution of the transposable element mariner in Drosophila species. Genetics. 1991, 128: 319-329.PubMed CentralPubMedGoogle Scholar
- Brunet F, Godin F, David JA, Capy P: The mariner transposable element in the Drosophilidae family. Heredity. 1994, 73: 377-385. 10.1038/hdy.1994.185.View ArticlePubMedGoogle Scholar
- Brunet F, Godin F, Bazin C, Capy P: Phylogenetic analysis of Mos1 -like transposable elements in the Drosophilidae. J Mol Evol. 1999, 49: 760-768. 10.1007/PL00006598.View ArticlePubMedGoogle Scholar
- Herédia F, Loreto EL, Valente VL: Complex evolution of gypsy in drosophilid species. Mol Biol Evol. 2004, 21: 1831-1842. 10.1093/molbev/msh183.View ArticlePubMedGoogle Scholar
- McDonald JF, Matyunina LV, Wilson S, Jordan IK, Bowen NJ, Miller WJ: LTR retrotransposons and the evolution of eukaryotic enhancers. Genetica. 1997, 100: 3-13. 10.1023/A:1018392117410.View ArticlePubMedGoogle Scholar
- Vidal NM, Ludwig A, Loreto ELS: Evolution of Tom, 297, 17.6 and rover retrotransposons in Drosophilidae species. Mol Genet Genomics. 2008, 282: 353-362.Google Scholar
- Almeida LM, Carareto CM: Sequence heterogeneity and phylogenetic relationships between the copia retrotransposonin Drosophila species of the repleta and melanogaster groups. Genet Sel Evol. 2006, 38: 535-550. 10.1186/1297-9686-38-5-535.PubMed CentralView ArticlePubMedGoogle Scholar
- Loreto EL, Carareto CM, Capy P: Revisiting horizontal transfer of transposable elements in Drosophila. Heredity. 2008, 100: 545-554. 10.1038/sj.hdy.6801094.View ArticlePubMedGoogle Scholar
- De Setta N, Carareto CMA: Screening for transposable elements in South America invasive species Zaprionus indianus and Drosophila malerkotliana. Drosoph Inf Serv. 2007, 90: 96-99.Google Scholar
- Lloréns C, Futami R, Bezemer D, Moya A: The Gypsy Database (GyDB) of mobile genetic elements. Nucleic Acids Res. 2008, 36: D38-D46. 10.1093/nar/gkm697.PubMed CentralView ArticlePubMedGoogle Scholar
- Eickbush TH: Origin and evolutionary relationships of LTR retroelements. The evolutionary biology of viruses. Edited by: Morse SS. 1994, New York, NY: Raven, 121-157.Google Scholar
- Llorens JV, Clark JB, Martínez-Garay I, Soriano S, de Frutos R, Martínez-Sebastián MJ: Gypsy endogenous retrovirus maintains potential infectivity in several species of Drosophilids. BMC Evol Biol. 2008, 8: 302-10.1186/1471-2148-8-302.PubMed CentralView ArticlePubMedGoogle Scholar
- Boeke JD, Eickbush TH, Sandmeyer SB, Voytas DF: Metaviridae. Virus Taxonomy. Edited by: Murphy FA. 1999, New York, NY: Springer-Verlag, 123-135.Google Scholar
- Huijser P, Kirchhoff C, Lankenau DH, Hennig W: Retrotransposon-like sequences are expressed in Y chromosomal lampbrush loops of Drosophila hydei. J Mol Biol. 1988, 203: 689-697. 10.1016/0022-2836(88)90202-1.View ArticlePubMedGoogle Scholar
- Almeida LM, Carareto CMA: Identification of two subfamilies of micropia transposable element in species of the repleta group of Drosophila. Genetica. 2004, 121: 155-164. 10.1023/B:GENE.0000040386.70086.71.View ArticlePubMedGoogle Scholar
- Cordeiro J, Robe LJ, Loreto EL, Valente VL: The LTR retrotransposon micropia in the cardini group of Drosophila (Diptera: Drosophilidae): a possible case of horizontal transfer. Genetica. 2008, 134: 335-344. 10.1007/s10709-008-9241-2.View ArticlePubMedGoogle Scholar
- Lankenau S, Corces GV, Lankenau DH: The Drosophila micropia retrotransposon encodes a testis-specific antisense RNA complementary to reverse transcriptase. Mol Cell Biol. 1994, 14: 1764-1775.PubMed CentralView ArticlePubMedGoogle Scholar
- Ludwig A, Valente VL, Loreto EL: Multiple invasions of Errantivirus in the genus Drosophila. Insect Mol Biol. 2008, 17: 113-112. 10.1111/j.1365-2583.2007.00787.x.View ArticlePubMedGoogle Scholar
- Alberola TM, De Frutos R: Molecular structure of a gypsy element of Drosophila subobscura (gypsyDs) constituting a degenerate form of insect retroviruses. Nucleic Acids Res. 1996, 24: 914-923. 10.1093/nar/24.5.914.PubMed CentralView ArticlePubMedGoogle Scholar
- Mizrokhi LJ, Mazo AM: Cloning and analysis of the mobile element gypsy from D. virilis. Nucleic Acids Res. 1991, 19: 913-916. 10.1093/nar/19.4.913.PubMed CentralView ArticlePubMedGoogle Scholar
- Mejlumian L, Pélisson A, Bucheton A, Terzian C: Comparative and functional studies of Drosophila species invasion by the gypsy endogenous retrovirus. Genetics. 2002, 160: 201-209.PubMed CentralPubMedGoogle Scholar
- Jordan IK, McDonald JF: Interelement selection in the regulatory region of the copia retrotransposon. J Mol Evol. 1998, 47: 670-676. 10.1007/PL00006425.View ArticlePubMedGoogle Scholar
- Pace JK II, Gilbert C, Clark M, Feschotte C: Repeated horizontal transfer of a DNA transposon in mammals and other tetrapods. Proc Natl Acad Sci USA. 2008, 105: 17023-17028. 10.1073/pnas.0806548105.PubMed CentralView ArticlePubMedGoogle Scholar
- Roulin A, Piegu B, Fortune PM, Sabor F, D'Hont A, Manicacci D, Panaud O: Evidence of multiple horizontal transfers of the long terminal repeat retrotransposon RIRE1 within the genus Oryza. Plant J. 2008, 53: 950-959. 10.1111/j.1365-313X.2007.03388.x.View ArticlePubMedGoogle Scholar
- Cheng X, Zhang D, Cheng Z, Keller B, Ling HQ: A new family of Ty-copia -like retrotransposons originated in the tomato genome by a recent horizontal transfer event. Genetics. 2009, 181: 1183-1193. 10.1534/genetics.108.099150.PubMed CentralView ArticlePubMedGoogle Scholar
- Pinsker W, Harina E, Hagemann S, Miller WJ: The evolutionary life history of P transposons: from invaders to domesticated neogenes. Chromosoma. 2001, 110: 148-158. 10.1007/s004120100144.View ArticlePubMedGoogle Scholar
- Cavarec L, Heidmann T: The Drosophila copia retrotransposon contains binding sites for transcriptional regulation by homeoproteins. Nucleic Acids Res. 1993, 21: 5041-5049. 10.1093/nar/21.22.5041.PubMed CentralView ArticlePubMedGoogle Scholar
- Bhadra U, Pal-Bhadra M, Birchler JA: Role of the male specific lethal (msl) genes in modifying the effects of sex chromosomal dosage in Drosophila. Genetics. 1999, 152: 249-268.PubMed CentralPubMedGoogle Scholar
- Pal-Bhadra M, Bhadra U, Birchler JA: Role of multiple trans-acting regulators in modifying the effect of the retrotransposon copia on host gene expression in Drosophila. Mol Gen Genet. 1998, 259: 198-206. 10.1007/s004380050805.View ArticlePubMedGoogle Scholar
- Cavarec L, Jensen S, Heidmann T: Identification of a strong transcriptional activator for the copia retrotransposon responsible for its differential expression in Drosophila hydei and melanogaster cell lines. Biochem Biophys Res Commun. 1994, 203: 392-399. 10.1006/bbrc.1994.2195.View ArticlePubMedGoogle Scholar
- Tulin A, Stewart D, Spradling AC: The Drosophila heterochromatic gene encoding poly(ADP-ribose) polymerase (PARP) is required to modulate chromatin structure during development. Genes Dev. 2002, 16: 2108-2119. 10.1101/gad.1003902.PubMed CentralView ArticlePubMedGoogle Scholar
- Siomi MC, Saito K, Siomi H: How selfish retrotransposons are silenced in Drosophila germline and somatic cells. FEBS Lett. 2008, 582: 2473-2478. 10.1016/j.febslet.2008.06.018.View ArticlePubMedGoogle Scholar
- Jowett T: Preparation of nucleic acids. Drosophila: A Practical Approach. Edited by: Roberts DB. 1986, Oxford: IRL Press, 275-277.Google Scholar
- Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41: 95-98.Google Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.PubMed CentralView ArticlePubMedGoogle Scholar
- Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704. 10.1080/10635150390235520.View ArticlePubMedGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.View ArticlePubMedGoogle Scholar
- Swofford D: PAUP: phylogenetic analysis using parsimony: Version 4.0b10. 1997, Washington, DC: Smithsonian InstitutionGoogle Scholar
- Felsenstein J: Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985, 39: 783-791. 10.2307/2408678.View ArticleGoogle Scholar
- Tamura K, Nei M, Kumar S: Prospects for inferring very large phylogenies by using the neighbor-joining method. Proc Natl Acad Sci USA. 2004, 101: 11030-11035. 10.1073/pnas.0404206101.PubMed CentralView ArticlePubMedGoogle Scholar
- Hasegawa M, Kishino H, Yano T: Dating of the human-ape splitting by molecular clock of mitochondrial DNA. J Mol Evol. 1985, 21: 160-174. 10.1007/BF02101694.View ArticleGoogle Scholar
- Morton BR: Chloroplast DNA codon use: evidence forselection at the psb locus based on tRNA availability. J Mol Evol. 1993, 37: 273-280. 10.1007/BF00175504.View ArticlePubMedGoogle Scholar
- Rozas J, Sánchez-DelBarrio JC, Messeguer X, Rozas R: DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics. 2003, 19: 2496-2497. 10.1093/bioinformatics/btg359.View ArticlePubMedGoogle Scholar
- Bandelt HJ: Median-joining networks for inferring intraespecific phylogenies. Mol Biol Evol. 1999, 16: 37-48.View ArticlePubMedGoogle Scholar