Evolution of a horizontally acquired legume gene, albumin 1, in the parasitic plant Phelipanche aegyptiaca and related species
© Zhang et al; licensee BioMed Central Ltd. 2013
Received: 12 October 2012
Accepted: 4 February 2013
Published: 20 February 2013
Parasitic plants, represented by several thousand species of angiosperms, use modified structures known as haustoria to tap into photosynthetic host plants and extract nutrients and water. As a result of their direct plant-plant connections with their host plant, parasitic plants have special opportunities for horizontal gene transfer, the nonsexual transmission of genetic material across species boundaries. There is increasing evidence that parasitic plants have served as recipients and donors of horizontal gene transfer (HGT), but the long-term impacts of eukaryotic HGT in parasitic plants are largely unknown.
Here we show that a gene encoding albumin 1 KNOTTIN-like protein, closely related to the albumin 1 genes only known from papilionoid legumes, where they serve dual roles as food storage and insect toxin, was found in Phelipanche aegyptiaca and related parasitic species of family Orobanchaceae, and was likely acquired by a Phelipanche ancestor via HGT from a legume host based on phylogenetic analyses. The KNOTTINs are well known for their unique “disulfide through disulfide knot” structure and have been extensively studied in various contexts, including drug design. Genomic sequences from nine related parasite species were obtained, and 3D protein structure simulation tests and evolutionary constraint analyses were performed. The parasite gene we identified here retains the intron structure, six highly conserved cysteine residues necessary to form a KNOTTIN protein, and displays levels of purifying selection like those seen in legumes. The albumin 1 xenogene has evolved through >150 speciation events over ca. 16 million years, forming a small family of differentially expressed genes that may confer novel functions in the parasites. Moreover, further data show that a distantly related parasitic plant, Cuscuta, obtained two copies of albumin 1 KNOTTIN-like genes from legumes through a separate HGT event, suggesting that legume KNOTTIN structures have been repeatedly co-opted by parasitic plants.
The HGT-derived albumins in Phelipanche represent a novel example of how plants can acquire genes from other plants via HGT that then go on to duplicate, evolve, and retain the specialized features required to perform a unique host-derived function.
Horizontal gene transfer (HGT) is the nonsexual transmission of genetic material across species boundaries [1, 2]. HGT is well known in bacteria, where HGT often results in adaptive gains of novel genes and traits [3–5]. There are fewer well-documented cases of HGT among eukaryotes  and the large majority of these cases appear to result in short-lived, nonfunctional sequences [6–8]. Consequently, the long-term evolutionary impact of HGT in multicellular eukaryotes remains largely unknown. Several cases of HGT are known or suspected in plants [9–23], most involving mitochondrial sequences, and/or parasitic plants [13–15, 17–20, 23–25]. Parasitic plants form direct haustorial connections with their host plants and are capable of obtaining a wide range of macromolecules from their hosts, including viruses , gene silencing signals , and messenger RNAs . Consequently, parasites may have many opportunities for HGT events and an increased likelihood that some of these result in functional, and potentially adaptive, gene transfers. Two recent reports by Yoshida et al  and Xi et al  were the first indications that nuclear protein coding sequences, likely obtained from their respective host species, could be integrated into the genomes of parasitic plants by HGT. These were important advances, but they provided few clues as to the long term impact of HGT, how the transgenes evolve, and how they may function. We hypothesized that systematic analysis of genome-scale datasets from parasitic plants could lead to evidence for acquisition and long-term maintenance of functional gene sequences in plants that had been acquired via HGT.
Albumin 1 genes are known only from a subset of species in the legume family (Leguminosae) of angiosperms where they encode seed storage proteins and insect toxins [29, 30]. The albumin 1 proteins in legumes are 112 to 154 amino acids in length and rich in cysteine residues. They form a unique protein structure known as a KNOTTIN, which has three disulfide bonds and is characterized by a “disulfide through disulfide knot” . The KNOTTINs are famous for their intruguing “disulfide through disulfide knot” structure and have been extensively studied in various fields, most of which are related with potentials in drug design [32–37]. Albumin 1 genes may have originated early in the diversification of papilionoid legumes [29, 30], but multiple homologous gene copies have been found only in species that are members of the more derived “Millettioid s.l.” and “Hologalegina” clades .
Orobanche s. l., often known by the common name “broomrape,” includes 150-170 obligate parasitic plant species in the family Orobanchaceae. Growing evidence supports the segregation of broomrapes into four genera : Aphyllon (syn. Orobanche sect. Gymnocaulis), Myzorrhiza (syn. Orobanche sect. M.), Phelipanche (syn. Orobanche sect. Trionychon), Orobanche s. str. (syn. Orobanche sect. O.). Most broomrape species have a narrow host spectrum and grow exclusively on perennial eudicot host plants , with members of the Leguminosae, Solanaceae, and Asteraceae among the more common hosts . As a member of order Lamiales, Orobanchaceae is phylogenetically well-separated from host members in these lineages, particularly legume hosts in the rosid order Fabales (Additional file 1: Figure S1; ). A few broomrape species (e.g., P. aegyptiaca, P. ramosa, O. cernua, O. crenata, and O. minor) have become devastating pests of important crop plants, affecting their growth and resource allocation and imparting significant losses in yield . P. aegyptiaca, the focal species in this study, has a broad host range that includes members of the eudicot families Apiaceae, Asteraceae, Brassicaceae, Cucurbitaceae, Leguminosae, and Solanaceae.
Here we show that a gene encoding albumin 1 KNOTTIN-like protein, closely related to the albumin 1 genes, only known from papilionoid legumes, serving dual roles in food storage and as insect toxins, was found in Phelipanche aegyptiaca and related parasitic species of family Orobanchaceae, and was likely acquired by a Phelipanche ancestor via HGT from a legume host based on phylogenetic analyses. According to genomic sequences from nine related parasite species, 3D protein structure simulation tests, and evolutionary constraint analyses, the broomrape xenogene we identified here retains the intron structure, six highly conserved cysteine residues necessary to form a KNOTTIN protein, and displays levels of purifying selection like those seen in legumes. The albumin 1 xenogene has evolved through >150 speciation events over ca. 16 million years, forming a small family of differentially expressed genes that may confer novel functions in the parasites.
The albumin 1 transcript was first identified as a HGT candidate in the transcriptome of P. aegyptiaca (cultured and grown on Arabidopsis and tobacco) using a BLAST-based  bioinformatic screen (details in Material and Methods). Albumin 1 transcripts were then searched further, using BLASTX, against the NCBI nr database and the PlantGDB database . Top hits were seen (Additional file 2: Figure S2) to Medicago truncatula albumin 1 sequences, with expected values of 5e-51 and 1e-48. Additional BLAST, including Hidden Markov Model (HMM)-based psi-BLAST searches with the sequence from P. aegyptiaca were performed to attempt to detect homologs in three other members of Orobanchaceae with large transcriptome datasets (two parasites, Striga hermonthica and Triphysaria versicolor, and the nonparasitic Lindenbergia philippensis) (Parasitic Plant Genome Project, PPGP ). Several large public databases, including Phytozome , PlantGDB, and SOL Genomics Network , were also searched. After searching 34 sequenced genomes and transcriptomes of 274 additional plant species, albumin 1 homologs were detected only in legumes and the transcriptome libraries of P. aegyptiaca.
Biogeographic overlap and common feeding interactions between diverse broomrapes and temperate papilionoid legumes increase the likelihood that the HGT event occurred in a common ancestor of the parasites that was in direct contact with legume host plants. An alternative (and less parsimonious) explanation is that another organism or virus that co-occurred in the same habitats as the ancestral lineages served as a “stepping stone” for a two- or more step transfer. However, this is not supported in our searches of the sequence databases. Based on fossil-calibrated age estimates of legume lineages [50, 51], we estimate that this horizontal acquisition occurred in an ancestral broomrape that lived in the Miocene epoch, about 16 Mya. Both parasite and their legume host groups have northern temperate distributions, and their lineages likely overlapped in the past as they do now, providing a minimal requirement for a horizontal gene transfer to occur. Another possibility, however unlikely, is that albumin1 was a more recent acquisition that underwent strong convergence at the protein level with this legume lineage. However the branch lengths we observed in the phylogeny (Figure 2) were not unusually long in our DNA-based phylogeny, and given the large collection of related sequences we obtained from other broomrape species, we have reduced any tendency the Orobanche/Phelipanche lineage may have had to connect by chance to a deep branch. Thus, the convergence hypothesis is not supported. Because the breadth of Phelipanche and Orobanche species we have sampled spans the deepest branches of broomrape diversity , the albumin gene can be inferred to have survived through an extended evolutionary radiation of at least 150 species [55–57] or more, if the number of now-extinct broomrape species could be estimated.
Because the introns of Phelipanche albumin 1 xenogenes maintain critical splicing sites and share the same starting positions and first nine base pairs with the known M. truncatula albumin 1 intron, it is likely that the HGT event in broomrapes involved transfer of a genomic sequence rather than a separate cDNA. Following the transfer, albumin 1 genes in broomrape species have evolved under purifying selection consistent with what is observed in related legume albumin 1 genes. This observation, as well as the stage-specific transcription patterns, conserved cysteine residues and predicted 3D KNOTTIN protein structures, strongly suggest that albumin 1 genes encode functional proteins in broomrape species, and could potentially serve a function similar to its role in legumes, providing a large pool of sulfur storage and exhibiting toxicity to insect herbivores in certain legumes [29, 30]. A recent report involves panicoid grass species with C3 or C4 photosynthetic pathways. Evidence was presented that nuclear genes were horizontally transferred between panicoid species and were subsequently adapted into the existing pathways with the effect of advancing the extent of C4 photosynthesis in some lineages . These results indicate that HGT may promote the sharing of adaptive traits among related species. In comparison, the albumin example described here shows how a completely novel and highly specialized trait has been acquired at an ancestral stage from a distantly related donor species and maintained by the recipient lineage throughout an extended period of evolutionary history.
The albumin 1 genes in P. aegyptiaca are highly transcribed in most of the developmental stages we examined. Transcripts are more abundant in reproductive tissue, and lowest in the young haustorium (stage 3), which represents the earliest point in our tissue sampling where the parasite is in direct contact with the host plant. This suggests that the novel gene in P. aegyptiaca is probably not encoding a protein that is playing a direct role in the process of haustorial formation, and that albumin 1 expression is down-regulated as the parasite devotes energy to the essential process of establishing host vascular connections. It is also possible that the low expression in the haustorial stage could help the parasite avoid detection or minimize a negative impact on the health of the host plant during early stages of parasite contact and feeding.
Several other parasitic lineages, including members of Cuscuta (Convolvulaceae), Cassytha (Lauraceae), Apodanthaceae, Hydnoraceae, and the order Santalales, regularly feed upon legumes  and therefore might also have had opportunities to acquire albumin 1 sequences through HGT. Large transcriptome datasets are currently available for only two of these, the generalist parasite Cuscuta pentagona (Convolvulaceae) and for the legume specialist feeder Pilostyles thurberi (Apodanthaceae). Both of these parasites, and other species in these genera, feed widely on legumes. No homolog of albumin 1 was detected in BLAST searches of the Pilostyles transcriptome in the 1KP dataset . However, albumin 1 sequences were detected in the same dataset and in two additional transcriptome libraries from Cuscuta pentagona (J. Westwood, unpublished data but publicly available through 1KP Blast database). Phylogenetic analysis nests the Cuscuta sequences well within Leguminosae, but on an independent branch from the broomrape sequences (Additional file 8: Figure S6), suggesting that these transcripts in Cuscuta represent a different HGT event into Cuscuta from a lineage of papilionoid legumes that was different from the source of the broomrape albumin 1 xenogene. The putative Cuscuta albumin 1 similarly encodes a protein predicted to have KNOTTIN structure (Knoter1d score: 33 to 35). No other albumin 1 sequences were identified elsewhere in searches of REFSEQ or publicly available plant transcriptome datasets.
Because of their extensive, intimate contacts with host plant tissues, and the wide range of materials that are commonly transmitted across haustorial connections [27, 28, 46, 60–63], parasitic plants play an important role as recipients and donors for HGT in plants [14, 15, 17, 19, 24]. As parasitic plants increasingly become the targets for genome-scale analyses, it should become possible to estimate the frequency and likely mechanisms of HGT events between parasites and hosts involving albumin 1 and other genes, the likelihood of more complex stepping-stone models, and how often HGT leads to long-term maintenance of new genes and novel traits.
Screening for HGT candidates
The assembled transcriptome of the parasite P. aegyptiaca was systematically screened for potential HGT candidate sequences. Immediately following an HGT event, a host-derived sequence in a parasitic organism may be identical to the sequence from the host. Over evolutionary time, the host-derived sequence will diverge from the ancestral transgene and, if it survives long enough, the xenologous sequence may pass through both speciation events (forming “xenorthologs”) and/or duplication events (forming “xenparalogs”). Initially, the xenologous sequence will be more closely related to the host sequence than to any other sequence in the parasite or its relatives’ genomes. Such sequences can provide valuable indicators of the rate and types of host-derived sequence incorporation in parasite-host interactions, but they can be difficult to distinguish from host-plant contamination or host-derived mobile transcripts in the parasite. However, as genetic divergence, speciation, and gene duplication events occur, the xenologs can be detectable as a clade of sequences that is closely related to sequences from the host lineage.
The parasitic plants that are the focus of this study are in the family Orobanchaceae (eudicots, asterid order Lamiales). The analysis begins with high throughput BLAST (tBLASTx) of all the contigs from the P. aegyptiaca transcriptome assembly against a database with sequences from two closely related nonparasitic species (Lindenbergia philippensis, a member of Orobanchaceae, representing the nonparasitic sister group of the parasitic members, and Mimulus guttatus, another closely related nonparasitic species of Lamiales/Asteridae, ) and thirteen other plant species with sequenced genomes or large transcriptome assemblies, including eudicots (two Solanaceae [asterids related to Lamiales]: Solanum lycopersicum and Nicotiana tabacum; and six much more distantly related rosid taxa including the range of major host families for most broomrapes: Arabidopsis thaliana [Brassicaceae], Carica papaya [Caricaceae], Populus trichocarpa [Salicaceae], Medicago truncatula [Fabaceae, papilionoid], Cucumis sativus [Cucurbitaceae], Vitis vinifera [Vitaceae]) monocots (Sorghum bicolor, Oryza sativa) and distantly related non-vascular plant species (Selaginella moellendorffii, Physcomitrella patens, Chlamydomonas reinhardtii). Details about the database are in Additional file 9: Table S3. The analysis details are described below.
Contigs were downloaded from the Parasitic Plant Genome Project website (Assembly version OrAeBC4). The HGT candidate screening includes the following steps. First, contigs were BLASTed onto the queried database (tBLASTx, expected value: 1e-10, -b 1, -v 1) described in the above paragraph and the top hit of the BLAST result was retrieved. Second, contigs with rosid species as the top hit were maintained for downstream filtering processes to identify sequences that could be useful for high-resolution evolutionary analysis. Candidate sequences were retained only if the contig length was longer than five hundred base pairs, the aligned identity score was in the range of sixty to ninety five percent, and aligned length was at least fifty percent of the contig length. The last requirement was included to avoid long contigs that only have a small portion that is nearly identical to a distantly related sequence. Third, the filtered contigs were BLASTed against the same database and the top ten hits (expected value: 1e-10, -b 10, -v 10) were retrieved. Contigs that had either of the closely related Mimulus guttatus or Lindenbergia philippensis present in the top ten hits were excluded from further consideration to avoid sequences that were not decisively better matches to distantly related species. Fourth, the same BLAST was performed for the contigs that have passed the previous screenings and all the BLAST hits (expected value: 1e-10, -b 100000, -v 100000) available were considered. If a contig had no Mimulus guttatus and Lindenbergia philippensis in the BLAST hits, which would be expected if the sequence were vertically transmitted from a nonparasitic ancestor, such a contig would be considered as a HGT candidate. However, if a contig had Mimulus guttatus or Lindenbergia philippensis among the BLAST hits, but there was much higher expect value or a much smaller bit score to a host plant lineage, such a contig was also retained as a HGT candidate. We initially began with 157806 Phelipanche aegyptiaca contigs. 333 contigs passed the initial BLAST screening, while 168 contigs and 36 contigs passed the second and third BLAST screenings, respectively. These 36 HGT candidates were passed on to phylogenetic testing. Once HGT candidates were found, we also checked for related sequences in the other parasitic Orobanchaceae species Striga hermonthica and Triphysaria versicolor by using BLAST search, including psi-BLAST.
Phylogenetic analysis and dating
Phylogenetic analysis was performed on all albumin1 homologs detected in the broomrape species (Phelipanche, Orobanche) as well as all other previously known albumin1 sequences and sequences obtained from additional legume species via PCR and cloning (see below). Albumin1 is reported to be restricted to papilionoid legume species (including Medicago). Low stringency BLAST searches (using E-value cutoff of e-5; tBLASTx, BLASTp, and psiBLAST) of diverse angiosperm databases including NCBI nr database, PlantGDB, Phytozome database and SOL genome network (Versions of all databases are before May 2012), failed to detect any additional homologs outside legumes. MUSCLE  was used to produce a multiple sequence alignment of the translated amino acid sequences; a custom java program was used to force nucleotide sequences onto the corresponding amino acid alignment sequences to yield a DNA sequence alignment consistent with the translated sequences. ML phylogeny was obtained using RAxML, version 7.0.4  with the following parameters: raxmlHPC –f a –x 12345 –p 12345 -# 100 –m GTRGAMMA –s alignmentsFile –n OutputFile. Multiple sequence alignments and phylogeny files were deposited in TreeBASE with submission ID: 138787 (http://purl.org/phylo/treebase/phylows/study/TB2:S13878). Genomic sequence data could be downloaded from the following link, http://www.atcgu.com/albumin1_HGT_BMC_data.zip. Bayesian analysis was performed with BEAST version 1.6.1 , using the following parameters: substitution model : GTR, base frequencies : estimated, site heterogeneity model : gamma, clock model : relaxed clock (uncorrelated exp), tree prior : speciation (yule process), MCMC : length of chain 10000000, Log parameters every 1000 chain. Tracer version 1.4  was used to determine the performance of the BEAST output. Tracer burn-in state is 1000000. All ESS are larger than 196.
The potential HGT acquisition time was estimated by BEAST v1.6.1 using the same alignment. We assigned one calibration point: the most recent common ancestor (MRCA) of Pisum/Medicago/Astragalus/Onobrychis, of which the prior was treated as fitting a normal distribution with mean set to 39 mya and stdev of 2.4 mya . We also created taxon groups of Onobrychis/Orobanche/Phelipanche, Orobanche/Phelipanche, and a taxon group just containing Phelipanche genes. The other settings are the same as described above in Phylogenetic analysis section. Tracer was used to analyze the output of BEAST to report the estimated mean and 95% HPD range of divergence time of the previously defined taxon groups (16 Mya: 95% HPD is 11-21 mya. 11 Mya: 95% HPD is 6-16 Mya. 5 Mya: 95% HPD is 3-7 my.). Similar patterns were observed within the BEAST confidence ranges when dates were estimated with r8s  (results not shown).
KNOTTIN structure validation and 3D structure simulation
HGT candidates were confirmed to be KNOTTIN proteins using the prediction program provided by the KNOTTIN database [31, 69]. Amino acid sequences were first confirmed as KNOTTIN structures using Knoter1D program offered by the KNOTTIN database. Knoter1D scores larger than 20 are determined to be KNOTTIN protein structures. Confirmed amino acid sequences (all the albumin1 sequences in Phelipanche) were input in Knoter1D3D program and pdb files were generated by this program.
dN, dS and dN/dS calculation
HyPhy version 2.0 was used to calculate dN, dS and dN/dS ratios . Treefiles and multiple sequence alignments of albumin 1 coding sequences were imported into HyPhy with the ML phylogeny based on the above analysis. Analyses were focused on broomrape species plus three most closely related legume species. Calculations were performed using the following parameters: partition type: codon; substitution model: MG94xHKY85_3x4; parameters: local; equilibrium freqs: estimate. HyPhy was also used in functional constraint analyses among sites using the empirical Bayes technique, detailed results are in Additional file 10: Table S4.
Expression level comparisons of HGT candidates
Assembled contigs and raw Illumina reads were downloaded from PPGP website. For each library, raw reads were mapped onto the HGT candidates in P. aegyptiaca using bwa , samtools  and bedtools . Normalized measures of expression intensity, Reads Per Kilobase per Million mapped reads (RPKM), were calculated from the read counts, the length of each contig, and the total number of mapped reads in each library or developmental stage .
Obtaining genomic sequences by PCR approach
Broomrape species DNA extraction, and gene amplification
Two different sources of tissue were used for broomrape species, dry seeds (obtained from the GermPlasm Bank of the IAS-CSIC, Cordoba, Spain) for Orobanche ballotae, Orobanche hederae, Phelipanche nana and Phelipanche schultzii, and vegetative shoots for Phelipanche aegyptiaca, Orobanche cernua, Orobanche minor, Phelipanche mutelii, and Phelipanche ramosa. Total genomic DNA was isolated from fresh, liquid nitrogen frozen tissue using a DNeasy Plant Mini Kit (Qiagen).
EST unigene contigs OrAeGnB1_75797 and OrAe41G2B1_12653 were downloaded from the Parasitic Plant Genome Project database. A different set of P. aegyptiaca specific primers was designed for each contig (Additional file 11: Table S5). The P. aegyptiaca primers were also used to amplify related sequences from other Orobanche species P. mutelii, P. nana, P. ramosa and P. schultzii. Each PCR reaction contained 10 ng of genomic DNA, 0.5 μM of each forward and reverse primers, 12.5 of 2x iProof Master Mix (BIO-RAD) and conditions as described in the manufacturer’s protocol. PCR products were separated by electrophoresis through a 1% agarose gel, yielding a single band that was excised from the gel, purified using the QIAquick Gel extraction kit (Qiagen), and sequenced using ABI3730xl genetic analyzer and Big Dye Terminator v3.1 sequencing kit for sequencing (both from Applied Biosystems).
Legume DNA extraction, and gene amplification
Total DNA was isolated from herbarium material of Onobrychis argentea Boiss. ssp. africana, A. Dubois 13246 (M), using a DNeasy Plant Mini Kit (Qiagen). Because the Onobrychis sequence obtained from NCBI was incomplete, one forward primer (AlbuminFw3: 5´TTAAGCTCACTCCTTTGGTCCTCTTC3´) and one degenerate reverse primer (AlbuminRv3: 5´CAGGCATCTTCARGAAKCYTTTYKC3´) were designed in order to amplify the full length Albumin 1 gene in O. argentea. Forward 3 was designed on the Q6A1C9 sequence, targeting the more conserved region before the start codon between sequences Q6A1C9 and Q6A1D7 obtained from Onobrychis viciifolia and Astragalus monspessulanus. Reverse 3 was designed from the downstream end of the complete albumin genes Medtr7g041000.1 and OrAeGnB1_75797. The PCR reaction was composed by 10 ng of genomic DNA of O. argentea, using forward primer (Fw3, 1 μM), reverse primer (Rv3, 1 μM), and 12.5 μl of 2x iProof Master Mix (BIO-RAD) in a final volume of 25 μl., following the manufacturer’s protocol. PCR product was separated by electrophoresis through a 1% agarose gel. This product was excised from the gel, purified using the QIAquick Gel extraction kit (Qiagen), sequenced and identified as Albumin 1.
We thank Yongdae Bao (University of Virginia), Loren A. Honaas, Paula E. Ralph, Lena Landherr, Lynn P. Tomsho and Stephan C. Schuster (Penn State University), Pradeepa Gunathilake and Bio Wu (University of California-Davis) and Marta Matvienko (UC Davis Genome Center) for generation of the PPGP transcriptome data, and Gunjun Kim and Megan LeBlanc (Virginia Tech), and the 1KP transcriptome project (Gane Ka-Shu Wong, University of Alberta) for generation of the Cuscuta and Pilostyles transcriptome data from samples provided by J.H.W. and C.W.D, respectively. We also thank Arthur Lesk, Joshua P. Der, Paula E. Ralph, and Zhenzhen Yang for discussion and suggestions, and the KNOTTIN database for access to their 3D modeling software. Thoughtful comments by two anonymous reviewers also helped to improve the paper. This work was supported by NSF Plant Genome award DBI-0701748 (“The Parasitic Plant Genome Project”) to J.H.W., C.W.D., M.P.T., and J.Y. Graduate fellowship support for Y. Zhang was provided by the Intercollege Graduate Program in Genetics and the Department of Biology (Penn State University), and M. Fernández-Aparicio was supported by an International Outgoing European Marie Curie postdoctoral fellowship (PIOF-GA-2009-252538). Additional support was provided from the U.S. Department of Agriculture (Hatch project no. 135798) and NSF IOS-0843372 to J.H.W. and by NSF award DEB-0542958 to M.F.W. Data reported in this paper are archived at Parasitic Plant Genome Project and in the short read archive of N.C.B.I. GenBank (SRP001053) with additional materials and methods and results tabulated in the Supporting Online Material.
- Richardson AO, Palmer JD: Horizontal gene transfer in plants. J Exp Bot. 2007, 58 (1): 1-9.PubMedView ArticleGoogle Scholar
- Acuna R, Padilla BE, Florez-Ramos CP, Rubio JD, Herrera JC, Benavides P, Lee SJ, Yeats TH, Egan AN, Doyle JJ: Adaptive horizontal transfer of a bacterial gene to an invasive insect pest of coffee. Proc Natl Acad Sci USA. 2012, 109 (11): 4197-4202.PubMed CentralPubMedView ArticleGoogle Scholar
- Davies J, Davies D: Origins and evolution of antibiotic resistance. Microbiol Mol Biol Rev. 2010, 74 (3): 417-433. 10.1128/MMBR.00016-10.PubMed CentralPubMedView ArticleGoogle Scholar
- Ochman H, Lawrence JG, Groisman EA: Lateral gene transfer and the nature of bacterial innovation. Nature. 2000, 405 (6784): 299-304. 10.1038/35012500.PubMedView ArticleGoogle Scholar
- Dobrindt U, Hochhut B, Hentschel U, Hacker J: Genomic islands in pathogenic and environmental microorganisms. Nat Rev Microbiol. 2004, 2 (5): 414-424. 10.1038/nrmicro884.PubMedView ArticleGoogle Scholar
- Keeling PJ, Palmer JD: Horizontal gene transfer in eukaryotic evolution. Nat Rev Genet. 2008, 9 (8): 605-618. 10.1038/nrg2386.PubMedView ArticleGoogle Scholar
- Feschotte C, Pritham EJ: DNA transposons and the evolution of eukaryotic genomes. Annu Rev Genet. 2007, 41: 331-368. 10.1146/annurev.genet.40.110405.090448.PubMed CentralPubMedView ArticleGoogle Scholar
- Schaack S, Gilbert C, Feschotte C: Promiscuous DNA: horizontal transfer of transposable elements and why it matters for eukaryotic evolution. Trends Ecol Evol. 2010, 25 (9): 537-546. 10.1016/j.tree.2010.06.001.PubMed CentralPubMedView ArticleGoogle Scholar
- Cho Y, Qiu YL, Kuhlman P, Palmer JD: Explosive invasion of plant mitochondria by a group I intron. Proc Natl Acad Sci USA. 1998, 95 (24): 14244-14249. 10.1073/pnas.95.24.14244.PubMed CentralPubMedView ArticleGoogle Scholar
- Bergthorsson U, Adams KL, Thomason B, Palmer JD: Widespread horizontal transfer of mitochondrial genes in flowering plants. Nature. 2003, 424 (6945): 197-201. 10.1038/nature01743.PubMedView ArticleGoogle Scholar
- Won H, Renner SS: Horizontal gene transfer from flowering plants to Gnetum. Proc Natl Acad Sci USA. 2003, 100 (19): 10824-10829. 10.1073/pnas.1833775100.PubMed CentralPubMedView ArticleGoogle Scholar
- Bergthorsson U, Richardson AO, Young GJ, Goertzen LR, Palmer JD: Massive horizontal transfer of mitochondrial genes from diverse land plant donors to the basal angiosperm Amborella. Proc Natl Acad Sci USA. 2004, 101 (51): 17747-17752. 10.1073/pnas.0408336102.PubMed CentralPubMedView ArticleGoogle Scholar
- Davis CC, Wurdack KJ: Host-to-parasite gene transfer in flowering plants: phylogenetic evidence from Malpighiales. Science. 2004, 305 (5684): 676-678. 10.1126/science.1100671.PubMedView ArticleGoogle Scholar
- Mower JP, Stefanovic S, Young GJ, Palmer JD: Plant genetics: gene transfer from parasitic to host plants. Nature. 2004, 432 (7014): 165-166.PubMedView ArticleGoogle Scholar
- Davis CC, Anderson WR, Wurdack KJ: Gene transfer from a parasitic flowering plant to a fern. Proc Biol Sci. 2005, 272 (1578): 2237-2242. 10.1098/rspb.2005.3226.PubMed CentralPubMedView ArticleGoogle Scholar
- Diao X, Freeling M, Lisch D: Horizontal transfer of a plant transposon. PLoS Biol. 2006, 4 (1): e5-10.1371/journal.pbio.0040005.PubMed CentralPubMedView ArticleGoogle Scholar
- Barkman TJ, McNeal JR, Lim SH, Coat G, Croom HB, Young ND, Depamphilis CW: Mitochondrial DNA suggests at least 11 origins of parasitism in angiosperms and reveals genomic chimerism in parasitic plants. BMC Evol Biol. 2007, 7: 248-10.1186/1471-2148-7-248.PubMed CentralPubMedView ArticleGoogle Scholar
- Goremykin VV, Salamini F, Velasco R, Viola R: Mitochondrial DNA of Vitis vinifera and the issue of rampant horizontal gene transfer. Mol Biol Evol. 2009, 26 (1): 99-110.PubMedView ArticleGoogle Scholar
- Yoshida S, Maruyama S, Nozaki H, Shirasu K: Horizontal gene transfer by the parasitic plant Striga hermonthica. Science. 2010, 328 (5982): 1128-10.1126/science.1187145.PubMedView ArticleGoogle Scholar
- Sanchez-Puerta MV, Cho Y, Mower JP, Alverson AJ, Palmer JD: Frequent, phylogenetically local horizontal transfer of the cox1 group I Intron in flowering plant mitochondria. Mol Biol Evol. 2008, 25 (8): 1762-1777. 10.1093/molbev/msn129.PubMed CentralPubMedView ArticleGoogle Scholar
- Christin PA, Edwards EJ, Besnard G, Boxall SF, Gregory R, Kellogg EA, Hartwell J, Osborne CP: Adaptive evolution of C(4) photosynthesis through recurrent lateral gene transfer. Curr Biol. 2012, 22 (5): 445-449. 10.1016/j.cub.2012.01.054.PubMedView ArticleGoogle Scholar
- Vallenback P, Jaarola M, Ghatnekar L, Bengtsson BO: Origin and timing of the horizontal transfer of a PgiC gene from Poa to Festuca ovina. Mol Phylogenet Evol. 2008, 46 (3): 890-896. 10.1016/j.ympev.2007.11.031.PubMedView ArticleGoogle Scholar
- Hepburn NJ, Schmidt DW, Mower JP: Loss of Two Introns from the Magnolia tripetala Mitochondrial cox2 Gene Implicates Horizontal Gene Transfer and Gene Conversion as a Novel Mechanism of Intron Loss. Mol Biol Evol. 2012, 29 (10): 3111-3120. 10.1093/molbev/mss130.PubMedView ArticleGoogle Scholar
- Park JM, Manen JF, Schneeweiss GM: Horizontal gene transfer of a plastid gene in the non-photosynthetic flowering plants Orobanche and Phelipanche (Orobanchaceae). Mol Phylogenet Evol. 2007, 43 (3): 974-985. 10.1016/j.ympev.2006.10.011.PubMedView ArticleGoogle Scholar
- Xi Z, Bradley RK, Wurdack KJ, Wong KM, Sugumaran M, Bomblies K, Rest JS, Davis CC: Horizontal transfer of expressed genes in a parasitic flowering plant. BMC Genomics. 2012, 13 (1): 227-10.1186/1471-2164-13-227.PubMed CentralPubMedView ArticleGoogle Scholar
- Birschwilks M, Haupt S, Hofius D, Neumann S: Transfer of phloem-mobile substances from the host plants to the holoparasite Cuscuta sp. J Exp Bot. 2006, 57 (4): 911-921. 10.1093/jxb/erj076.PubMedView ArticleGoogle Scholar
- Tomilov AA, Tomilova NB, Wroblewski T, Michelmore R, Yoder JI: Trans-specific gene silencing between host and parasitic plants. Plant J. 2008, 56 (3): 389-397. 10.1111/j.1365-313X.2008.03613.x.PubMedView ArticleGoogle Scholar
- Westwood JH, Roney JK, Khatibi PA, Stromberg VK: RNA translocation between parasitic plants and their hosts. Pest Manag Sci. 2009, 65 (5): 533-539. 10.1002/ps.1727.PubMedView ArticleGoogle Scholar
- Louis S, Delobel B, Gressent F, Rahioui I, Quillien L, Vallier A, Rahbe Y: Molecular and biological screening for insect-toxic seed albumins from four legume species. Plant Sci. 2004, 167 (4): 705-714. 10.1016/j.plantsci.2004.04.018.View ArticleGoogle Scholar
- Louis S, Delobel B, Gressent F, Duport G, Diol O, Rahioui I, Charles H, Rahbe Y: Broad screening of the legume family for variability in seed insecticidal activities and for the occurrence of the A1b-like knottin peptide entomotoxins. Phytochemistry. 2007, 68 (4): 521-535. 10.1016/j.phytochem.2006.11.032.PubMedView ArticleGoogle Scholar
- Gelly JC, Gracy J, Kaas Q, Le-Nguyen D, Heitz A, Chiche L: The KNOTTIN website and database: a new information system dedicated to the knottin scaffold. Nucleic Acids Res. 2004, 32 (Database issue): D156-D159.PubMed CentralPubMedView ArticleGoogle Scholar
- Clark RJ, Jensen J, Nevin ST, Callaghan BP, Adams DJ, Craik DJ: The engineering of an orally active conotoxin for the treatment of neuropathic pain. Angew Chem Int Ed Engl. 2010, 49 (37): 6545-6548. 10.1002/anie.201000620.PubMedView ArticleGoogle Scholar
- Wang X, Connor M, Smith R, Maciejewski MW, Howden ME, Nicholson GM, Christie MJ, King GF: Discovery and characterization of a family of insecticidal neurotoxins with a rare vicinal disulfide bridge. Nat Struct Biol. 2000, 7 (6): 505-513. 10.1038/75921.PubMedView ArticleGoogle Scholar
- Jackson PJ, McNulty JC, Yang YK, Thompson DA, Chai B, Gantz I, Barsh GS, Millhauser GL: Design, pharmacology, and NMR structure of a minimized cystine knot with agouti-related protein activity. Biochemistry. 2002, 41 (24): 7565-7572. 10.1021/bi012000x.PubMedView ArticleGoogle Scholar
- Clark RJ, Daly NL, Craik DJ: Structural plasticity of the cyclic-cystine-knot framework: implications for biological activity and drug design. Biochem J. 2006, 394 (Pt 1): 85-93.PubMed CentralPubMedView ArticleGoogle Scholar
- Combelles C, Gracy J, Heitz A, Craik DJ, Chiche L: Structure and folding of disulfide-rich miniproteins: insights from molecular dynamics simulations and MM-PBSA free energy calculations. Proteins. 2008, 73 (1): 87-103. 10.1002/prot.22054.PubMedView ArticleGoogle Scholar
- Silverman AP, Levin AM, Lahti JL, Cochran JR: Engineered cystine-knot peptides that bind alpha(v)beta(3) integrin with antibody-like affinities. J Mol Biol. 2009, 385 (4): 1064-1075. 10.1016/j.jmb.2008.11.004.PubMed CentralPubMedView ArticleGoogle Scholar
- Lewis GP: Legumes of the World. 2005, Kew: Royal Botanic GardensGoogle Scholar
- Joel DM: The new nomenclature of Orobanche and Phelipanche. Weed Res. 2009, 49: 6-7.View ArticleGoogle Scholar
- Schneeweiss GM: Correlated evolution of life history and host range in the nonphotosynthetic parasitic flowering plants Orobanche and Phelipanche (Orobanchaceae). J Evol Biol. 2007, 20 (2): 471-478. 10.1111/j.1420-9101.2006.01273.x.PubMedView ArticleGoogle Scholar
- Index of Orobanchaceae. http://www.farmalierganes.com/otrospdf/publica/orobanchaceae%20index.htm,
- Soltis DE, Smith SA, Cellinese N, Wurdack KJ, Tank DC, Brockington SF, Refulio-Rodriguez NF, Walker JB, Moore MJ, Carlsward BS: Angiosperm phylogeny: 17 genes, 640 taxa. Am J Bot. 2011, 98 (4): 704-730. 10.3732/ajb.1000404.PubMedView ArticleGoogle Scholar
- Parker C: Observations on the current status of Orobanche and Striga problems worldwide. Pest Manag Sci. 2009, 65 (5): 453-459. 10.1002/ps.1713.PubMedView ArticleGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralPubMedView ArticleGoogle Scholar
- PlantGDB. http://www.plantgdb.org/,
- Westwood JH, Yoder JI, Timko MP, dePamphilis CW: The evolution of parasitism in plants. Trends Plant Sci. 2010, 15 (4): 227-235. 10.1016/j.tplants.2010.01.004.PubMedView ArticleGoogle Scholar
- Parasitic Plant Genome Project. http://ppgp.huck.psu.edu/,
- Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, Dirks W, Hellsten U, Putnam N: Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40 (Database issue): D1178-D1186.PubMed CentralPubMedView ArticleGoogle Scholar
- SOL Genomics Network. http://solgenomics.net/,
- Wojciechowski MF, Lavin M, Sanderson MJ: A phylogeny of legumes (Leguminosae) based on analysis of the plastid matK gene resolves many well-supported subclades within the family. Am J Bot. 2004, 91 (11): 1846-1862. 10.3732/ajb.91.11.1846.PubMedView ArticleGoogle Scholar
- Lavin M, Herendeen PS, Wojciechowski MF: Evolutionary rates analysis of Leguminosae implicates a rapid diversification of lineages during the tertiary. Syst Biol. 2005, 54 (4): 575-594. 10.1080/10635150590947131.PubMedView ArticleGoogle Scholar
- Medicago truncatula HapMap Project. http://www.medicagohapmap.org/index.php,
- Gracy J, Le-Nguyen D, Gelly JC, Kaas Q, Heitz A, Chiche L: KNOTTIN: the knottin or inhibitor cystine knot scaffold in 2007. Nucleic Acids Res. 2008, 36 (Database issue): D314-D319.PubMed CentralPubMedGoogle Scholar
- Westwood JH: The Parasitic Plant Genome Project: New Tools for Understanding the Biology of Orobanche and Striga. Weed Sci. 2012, 60 (2): 295-306. 10.1614/WS-D-11-00113.1.View ArticleGoogle Scholar
- Schneeweiss GM, Colwell A, Park JM, Jang CG, Stuessy TF: Phylogeny of holoparasitic Orobanche (Orobanchaceae) inferred from nuclear ITS sequences. Mol Phylogenet Evol. 2004, 30 (2): 465-478. 10.1016/S1055-7903(03)00210-0.PubMedView ArticleGoogle Scholar
- Schneeweiss GM, Palomeque T, Colwell AE, Weiss-Schneeweiss H: Chromosome numbers and karyotype evolution in holoparasitic Orobanche (Orobanchaceae) and related genera. Am J Bot. 2004, 91 (3): 439-448. 10.3732/ajb.91.3.439.PubMedView ArticleGoogle Scholar
- Manen JF, Habashi C, Jeanmonod D, Park JM, Schneeweiss GM: Phylogeny and intraspecific variability of holoparasitic Orobanche (Orobanchaceae) inferred from plastid rbcL sequences. Mol Phylogenet Evol. 2004, 33 (2): 482-500. 10.1016/j.ympev.2004.06.010.PubMedView ArticleGoogle Scholar
- Nickrent D: The Parasitic Plant Connection. http://www.parasiticplants.siu.edu/,
- The 1KP Project. http://www.onekp.com/,
- Johnson F: Transmission of plant viruses by dodder. Phytopathology. 1941, 31 (7): 649-656.Google Scholar
- Bennett CW: Studies of dodder transmission of plant viruses. Phytopathology. 1944, 34 (10): 905-932.Google Scholar
- Roney JK, Khatibi PA, Westwood JH: Cross-species translocation of mRNA from host plants into the parasitic plant dodder. Plant Physiol. 2007, 143 (2): 1037-1043.PubMed CentralPubMedView ArticleGoogle Scholar
- David-Schwartz R, Runo S, Townsley B, Machuka J, Sinha N: Long-distance transport of mRNA via parenchyma cells and phloem across the host-parasite junction in Cuscuta. New Phytol. 2008, 179 (4): 1133-1141. 10.1111/j.1469-8137.2008.02540.x.PubMedView ArticleGoogle Scholar
- Olmstead RG, dePamphilis CW, Wolfe AD, Young ND, Elisons WJ, Reeves PA: Disintegration of the Scrophulariaceae. Am J Bot. 2001, 88 (2): 348-361. 10.2307/2657024.PubMedView ArticleGoogle Scholar
- Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004, 5: 113-10.1186/1471-2105-5-113.PubMed CentralPubMedView ArticleGoogle Scholar
- Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006, 22 (21): 2688-2690. 10.1093/bioinformatics/btl446.PubMedView ArticleGoogle Scholar
- Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.PubMed CentralPubMedView ArticleGoogle Scholar
- Sanderson MJ: r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics. 2003, 19 (2): 301-302. 10.1093/bioinformatics/19.2.301.PubMedView ArticleGoogle Scholar
- Gracy J, Chiche L: Optimizing structural modeling for a specific protein scaffold: knottins or inhibitor cystine knots. BMC Bioinformatics. 2010, 11: 535-10.1186/1471-2105-11-535.PubMed CentralPubMedView ArticleGoogle Scholar
- Pond SL, Frost SD, Muse SV: HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005, 21 (5): 676-679. 10.1093/bioinformatics/bti079.PubMedView ArticleGoogle Scholar
- Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009, 25 (14): 1754-1760. 10.1093/bioinformatics/btp324.PubMed CentralPubMedView ArticleGoogle Scholar
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25 (16): 2078-2079. 10.1093/bioinformatics/btp352.PubMed CentralPubMedView ArticleGoogle Scholar
- Quinlan AR, Hall IM: BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010, 26 (6): 841-842. 10.1093/bioinformatics/btq033.PubMed CentralPubMedView ArticleGoogle Scholar
- EMBL-EBI. http://www.ebi.ac.uk/Databases/,
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.