ANRIL/CDKN2B-AS shows two-stage clade-specific evolution and becomes conserved after transposon insertions in simians
© He et al.; licensee BioMed Central Ltd. 2013
Received: 6 August 2013
Accepted: 8 November 2013
Published: 13 November 2013
Many long non-coding RNA (lncRNA) genes identified in mammals have multiple exons and functional domains, allowing them to bind to polycomb proteins, DNA methyltransferases, and specific DNA sequences to regulate genome methylation. Little is known about the origin and evolution of lncRNAs. ANRIL/CDKN2B-AS consists of 19 exons on human chromosome 9p21 and regulates the expression of three cyclin-dependent kinase inhibitors (CDKN2A/ARF/CDKN2B).
ANRIL/CDKN2B-AS originated in placental mammals, obtained additional exons during mammalian evolution but gradually lost them during rodent evolution, and reached 19 exons only in simians. ANRIL lacks splicing signals in mammals. In simians, multiple transposons were inserted and transformed into exons of the ANRIL gene, after which ANRIL became highly conserved. A further survey reveals that multiple transposons exist in many lncRNAs.
ANRIL shows a two-stage, clade-specific evolutionary process and is fully developed only in simians. The domestication of multiple transposons indicates an impressive pattern of “evolutionary tinkering” and is likely to be important for ANRIL’s structure and function. The evolution of lncRNAs and that of transposons may be highly co-opted in primates. Many lncRNAs may be functional only in simians.
X-chromosome inactivation in female placental mammals, which causes the products of genes on the X chromosome to have equal dosages in males and females, is controlled by a set of long non-coding RNA (lncRNA) genes, including Xist, Tsix, Jpx, and Tsx (reviewed in [1, 2]). The imprinted expression of some mammalian genes is also controlled by lncRNAs (reviewed in [3, 4]). In addition to X-inactivation and imprinted gene expression, tissue-specific DNA and chromatin methylation occurs widely in animal somatic cells because in differentiated cells, most genes are silenced by DNA and chromatin methylation. Because mammalian genomes contain only a few genes encoding DNA methyltransferases and polycomb repressive complexes for DNA and chromatin methylation and because these proteins lack sequence-specific DNA-binding subunits, how these proteins are guided to different genomic sites has long been a puzzle. Recently, more than 10,000 lncRNAs have been identified in humans [5–7], and many can bind to both polycomb proteins/DNA methyltransferases and specific DNA sequences [8, 9]. As scaffolds bridging protein-DNA interactions, lncRNAs are key players in dynamic and tissue-specific genome modification (recently reviewed in [11, 12]). Because dysregulated genome modification can cause diverse diseases, especially cancers, lncRNAs have triggered immense research interest in multiple fields. While lncRNAs are thought to be associated with X-chromosome inactivation and imprinted gene expression in mammals and most lncRNAs were first identified in humans, lncRNAs have also been identified recently in multiple non-mammalian organisms [14–16]. Thus, like microRNAs, lncRNAs may be clade- or species-specific, although the details of these patterns remain unknown.
Based upon their locations, lncRNAs can be classified into two groups. Many lncRNAs are located antisense to the genes they regulate; typical examples include AIRN, H19, and Kcnq1ot1, which control the imprinted expression of Igf2r, Igf2, and Kcnq1, respectively [17, 18]. Many other lncRNAs are located far from their target genes; for example, HOTAIR lies between HOXC11 and HOXC12 and regulates the expression of HOXD genes in humans. It is relatively easy to experimentally identify antisense lncRNAs; nevertheless, few in-depth analyses have been performed.
The recently identified ANRIL/CDKN2B-AS on human chromosome 9p21 is antisense to and regulates the expression of three cyclin-dependent kinase (Cdk) inhibitors: CDKN2A (INK4a/p16), ARF (p14), and CDKN2B (INK4b/p15). The expression of ANRIL can induce CDKN2B silencing in cis and in trans through heterochromatin formation. The silencing of CDKN2B occurs via PRC2 recruitment by ANRIL, which makes the CDKN2B locus H3K27-trimethylated. Depletion of ANRIL increases the expression of CDKN2B. The vital roles of these Cdk inhibitors in cell-cycle control make ANRIL an important molecule in multiple cancers. It is estimated that the genomic region containing ANRIL and CDKN2A/ARF/CDKN2B is altered in 30–40% of human tumours. Understanding the origin and evolution of ANRIL will help to clarify its functions and decipher the evolution of many other lncRNAs.
Studies of lncRNA evolution face three challenges. First, most protein-coding genes in mammals were generated by genome and chromosome duplications that immediately produced all of the gene’s exons. Generated after two rounds of whole-genome duplication, lncRNAs are relatively young, and little is known about how and when they obtained multiple exons. Second, like other ncRNAs, lncRNAs exhibit conserved structures but divergent sequences due to compensatory mutations. Thus, specific genome-search techniques that are more powerful than BLAST/BLAT are needed to search for homologous lncRNA genes in different organisms. Third, lncRNAs may be clade- or even species-specific. When a particular lncRNA is absent in a given organism, it may be difficult to determine whether the gene was never present or underwent a birth-and-death process.
The ANRIL gene is not only important but also quite unusual in that it contains 19 exons, making its origin and evolution particularly intriguing. To decipher its evolutionary history, we searched the genomes of 27 organisms, including non-mammalian vertebrates (hereafter called vertebrates), non-placental mammals, non-primate placental mammals (hereafter called mammals), and primates, to obtain sequences homologous to the exons of the human ANRIL gene. In-depth analyses of these sequences yielded several interesting conclusions. ANRIL originated in the eutherian ancestor and initially contained only a few exons and splicing signals. Later, it underwent clade-specific evolution, obtaining additional exons in some mammals but gradually losing exons in rodents. Notably, its genomic sequence expanded significantly in simians (here represented by the marmoset) through the insertion of multiple transposons. Some transposons were inserted into selective sites within the exons, and some transposons were transformed into the exons. These transposons not only modified the sequence and structure of ANRIL but also caused the gene to become highly conserved. A large-scale survey of lncRNAs in the database http://www.lncRNAdb.org reveals that many lncRNAs contain transposons. These findings indicate for the first time that transposons have contributed significantly to lncRNA evolution in simians. This phenomenon is a remarkable aspect of lncRNA evolution in primates.
Infernal searches identify putative ANRIL exons only in placental mammals
Our genome search results were carefully verified to ensure their plausibility and reliability. First, we used BLAT at the UCSC Genome Browser to search the human ANRIL exons against the 27 organismal genomes. In mammals and prosimians (and even in marmoset, a simian), the hits were much shorter than the human ANRIL exons, and successive series of putative exons were not found. Moreover, many hits appeared to be false positives (Additional file 1). In comparison, Infernal produced fewer and longer hits, with two important features: (1) the highest-scoring hits matched the full CMs in most simians and some mammals, had extremely low E-values, and were successively distributed on the DNA strand antisense to CDKN2A/CDKN2B; (2) other hits that did not match the CMs well, with medium or low scores, had very large E-values (Additional file 1). This comparison indicates that, as reported previously, Infernal significantly outperformed BLAT in reliably identifying sequences orthologous to ANRIL exons. Second, to confirm that the failure to detect exons in non-placental mammals and vertebrates was not due to the evolutionary distance between human/macaque and these organisms, we built CMs based on the identified exons in rabbit and horse and re-searched these CMs against the genomes of opossum and chicken. Again, no exons were detected. Third, we examined the MultiZ-aligned regions of the 19 exons in the 27 organisms in the UCSC Genome Browser and found that no or only extremely poorly aligned sequences were present in the vertebrates and non-placental mammals (Additional file 2: Figure S1). Finally, to confirm that the failure to detect exons in mouse and rat was not influenced by the human/macaque-based CMs, we built CMs based on the identified exons in rabbit and re-searched these CMs against the mouse and rat genomes. Again, neither medium-scoring hits nor hits of successive CMs were obtained.
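The first criterion above (low-E-value hits for successive CMs lying in order on one strand) can be sketched in code. The following is an illustrative sketch only, not the procedure actually used in this study: the `Hit` record, the function names, and the E-value cutoff passed in by the caller are all hypothetical, and the hit fields are assumed to have been parsed from the search output beforehand.

```python
from typing import Dict, Iterable, NamedTuple


class Hit(NamedTuple):
    """One search hit (hypothetical record; fields assumed pre-parsed)."""
    cm_index: int   # exon model that produced the hit (1..19)
    start: int      # smaller genomic coordinate of the hit
    end: int        # larger genomic coordinate of the hit
    strand: str     # '+' or '-'
    score: float    # bit score
    evalue: float   # E-value


def best_hit_per_model(hits: Iterable[Hit], max_evalue: float) -> Dict[int, Hit]:
    """Keep, for each model, the highest-scoring hit with E-value <= max_evalue."""
    best: Dict[int, Hit] = {}
    for h in hits:
        if h.evalue > max_evalue:
            continue
        if h.cm_index not in best or h.score > best[h.cm_index].score:
            best[h.cm_index] = h
    return best


def is_successive(best: Dict[int, Hit], strand: str) -> bool:
    """True if the best hits, taken in model order, all lie on `strand`
    and their start coordinates are strictly ordered in the direction
    of transcription (ascending on '+', descending on '-')."""
    ordered = [best[i] for i in sorted(best)]
    if len(ordered) < 2:
        return False
    if any(h.strand != strand for h in ordered):
        return False
    starts = [h.start for h in ordered]
    pairs = list(zip(starts, starts[1:]))
    if strand == '+':
        return all(a < b for a, b in pairs)
    return all(a > b for a, b in pairs)
```

A swapped exon order, such as the reversed positions of exon 7 and exon 12 reported for mouse lemur, would make `is_successive` return `False` under this definition.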
Putative ANRIL exons in mammals lack splicing signals
Multiple transposons are inserted into ANRIL exons in simians
In tree shrew (a small mammal of the order Scandentia, closely related to primates) and in two prosimians (mouse lemur and tarsier), the ANRIL exons showed peculiar features: (1) fewer exons were present in these organisms than in Laurasiatheria; (2) in mouse lemur, the positions of exon 12 and exon 7 (both TE-transformed) were reversed; and (3) in tarsier, CM7 and CM12 produced many hits with scores higher than those of exon 7 and exon 12. Moreover, of the 94335 hits for CM7 on chromosome 1 in the marmoset genome, 15000 had higher scores than marmoset exon 7 itself, indicating that these regions may be closer to human exon 7. These findings suggest that the ANRIL gene evolved substantially from mammals to prosimians and from prosimians to simians and that this process was strongly influenced by transposon activity.
Transposons are inserted into exons within structural contexts
As mobile elements, active transposons insert into many sites in a genome. What drives their insertion and whether transposons selectively insert into specific sites are interesting questions with limited answers. We examined the hits for CM3 and CM8, including TE3, TE8a, and TE8b, across the set of genomes. In rabbit, exon 3 matched bp 1–74 and 307–328 of CM3 but left bp 75–306 of CM3 unmatched. Meanwhile, there is a hit elsewhere that exactly matched bp 75–306 of CM3 (and more hits matched bp 80–265 of CM3). If this hit were inserted into the gap in exon 3, a sequence nearly identical to human exon 3 would be obtained. In the genomes of other non-simian organisms, all exon 3 sequences contained a gap equal to or smaller than that found in rabbit exon 3, and hits matching bp 80–265 of CM3 were found widely elsewhere (especially in zebrafish, lizard, and the three non-placental mammals). Note that compared to these hits (designated TE3), the MER1A transposon in human ANRIL is located closer to the 3′ end of exon 3.
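The rabbit exon 3 observation amounts to simple interval arithmetic in CM coordinates. The sketch below is illustrative and uses hypothetical function names; it also assumes, for the example only, that CM3 is 328 positions long (the text gives 328 as the last matched position but does not state the model length).

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 1-based, inclusive model coordinates


def uncovered(model_len: int, segments: List[Interval]) -> List[Interval]:
    """Return the parts of a model (1..model_len) not covered by `segments`."""
    gaps: List[Interval] = []
    pos = 1
    for start, end in sorted(segments):
        if start > pos:
            gaps.append((pos, start - 1))
        pos = max(pos, end + 1)
    if pos <= model_len:
        gaps.append((pos, model_len))
    return gaps


def fills(gap: Interval, hit: Interval) -> bool:
    """True if a hit elsewhere in the genome spans the whole gap."""
    return hit[0] <= gap[0] and hit[1] >= gap[1]


# Rabbit exon 3 matched bp 1-74 and 307-328 of CM3 (model length assumed 328).
rabbit_gaps = uncovered(328, [(1, 74), (307, 328)])
```

Here `rabbit_gaps` contains the single interval (75, 306). A hit matching bp 75–306 fills it completely, whereas a hit matching only bp 80–265 does not.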
We further analysed intron 3 and intron 8. In human, the simian-specific MER1A transposon penetrates into intron 3. Additionally, the THE1A transposon (TE8b) in human exon 8 was followed by another simian-specific transposon at the 5′ end of intron 8. Thus, simian-specific transposons also provide the GU splicing signals for the mature exon 3 and exon 8 in simians. Compared to the 3′ ends of exon 3 and exon 8 in mammals, the 3′ ends of exon 3 and exon 8 in simians appear more structured.
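Checking a putative exon for canonical splicing signals can be illustrated as follows. This is a minimal sketch with a hypothetical function name, not the analysis performed here; it assumes the sequence is given as DNA on the transcribed strand, so that the GU donor signal appears as GT, and it tests only the canonical AG acceptor and GT donor dinucleotides.

```python
from typing import Tuple


def has_canonical_splice_sites(genomic: str, exon_start: int, exon_end: int) -> Tuple[bool, bool]:
    """Check the dinucleotides flanking an exon.

    `genomic` is the DNA sequence of the transcribed strand; `exon_start`
    and `exon_end` are 0-based, half-open exon coordinates within it.
    Returns (acceptor_ok, donor_ok): whether the two bases immediately
    upstream are AG and the two bases immediately downstream are GT.
    """
    acceptor = genomic[exon_start - 2:exon_start].upper() if exon_start >= 2 else ""
    donor = genomic[exon_end:exon_end + 2].upper()
    return acceptor == "AG", donor == "GT"
```

For example, in the toy sequence `ccccagATGGCCgtaagt` with the exon at positions 6 to 12, both signals are present; replacing the downstream `gt` with `ca` removes the donor signal.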
TEs have modified the sequences and structures of ANRIL exons
The insertion of multiple TEs into ANRIL should affect the structure and function of this gene. Because an lncRNA could fold into many different structures, making structural prediction difficult, we examined how the insertion of TEs modified the sequences and structures of the ANRIL exons. TE3 exhibits a palindromic structure that is conserved from lizard to human, with two terminal inverted repeats flanking a short internal region (Additional file 2: Figure S2A). This specific structure causes the formation of a stem structure (and a pair of high-scoring hits for CM3 at the same position on the sense and antisense strands in simians) (Additional file 2: Figure S2A). To determine whether the stem structure contains any microRNAs, we searched E3TE3 against the microRNA database (http://www.mirbase.org) and found that it matched the stem-loop structure of two miRNA families with high scores (hsa-mir-645/ptr-mir-645/ppy-mir-645 with score = 140 and E-value = 5e-05; oan-miR-138/aca-miR-138-1 with score = 75 and E-value = 0.62) (Additional file 2: Figure S3).
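The palindromic arrangement described for TE3 (terminal inverted repeats flanking an internal region) can be detected with a short routine. The sketch below is an illustration under simplifying assumptions, with a hypothetical function name: it counts only exact Watson-Crick complementarity between the two ends and ignores G-U wobble pairs and mismatches, which a real analysis would probably need to tolerate.

```python
_COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def terminal_inverted_repeat_length(seq: str) -> int:
    """Length of the longest 5' prefix whose reverse complement is
    exactly the 3' suffix of `seq` (RNA input is treated as DNA)."""
    s = seq.upper().replace("U", "T")
    length = 0
    for i in range(len(s) // 2):
        # Compare base i from the 5' end with the complement of
        # base i from the 3' end; stop at the first mismatch.
        if _COMPLEMENT.get(s[-1 - i]) != s[i]:
            break
        length += 1
    return length
```

A sequence whose ends can pair in this way, such as `GGCATCCCCATGCC`, is the kind of element expected to fold into a stem with the internal region as a loop.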
Next, we examined exon 7, exon 12, and exon 13, which were transformed by TE7, TE12, and TE13. Exon 12 and the highest-scoring TE12 had highly similar sequences, but exon 12 contained a 20-bp sequence in mammals and prosimians that was absent in simians (Additional file 2: Figure S2B). To determine the impact of this 20-bp sequence on the structure of exon 12, we used RNAfold to predict the structures of exon 12 and the highest-scoring TE12. We found that without this 20-bp sequence, exon 12 formed a more stable hairpin (Additional file 2: Figure S2C). In contrast, exon 13 contained a 40-bp sequence that was absent in the highest-scoring TE13 (Additional file 2: Figure S2D). RNAfold revealed that the 40-bp addition caused more nucleotides to pair in exon 13 than in the highest-scoring TE13. Only single-nucleotide differences were found between exon 7 and the highest-scoring TE7, providing no clear evidence of structural effects. These results indicate that multiple TEs may have considerably modified the sequences and structures of the ANRIL exons, but their impact on the global structure of ANRIL remains unclear.
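Statements such as "more nucleotides pair" can be quantified directly from dot-bracket structure strings of the kind RNAfold reports. The following sketch uses a hypothetical function name and toy structures, not the predicted structures of exon 12 or exon 13.

```python
def paired_fraction(structure: str) -> float:
    """Fraction of positions that are base-paired in a dot-bracket string.

    Raises ValueError if the string is empty or the brackets are unbalanced.
    """
    if not structure:
        raise ValueError("empty structure")
    depth = 0
    paired = 0
    for symbol in structure:
        if symbol == "(":
            depth += 1
            paired += 1
        elif symbol == ")":
            depth -= 1
            paired += 1
            if depth < 0:
                raise ValueError("unbalanced structure")
    if depth != 0:
        raise ValueError("unbalanced structure")
    return paired / len(structure)
```

Comparing this fraction between two predicted structures gives one simple, if coarse, measure of how an insertion or deletion changes pairing; it says nothing about thermodynamic stability, for which the predicted free energy would be the more appropriate quantity.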
ANRIL exons became conserved after TE insertions
Multiple transposons may have contributed to lncRNA evolution
LncRNAs containing multiple SINEs and LINEs (table; recoverable entries: NTT_Human U54776.1; PR antisense transcripts_Human)
Most protein-coding genes in mammals were generated by genome and chromosome duplication, but when and how lncRNAs obtained multiple exons remain poorly understood. The answers to these questions will help to decipher when lncRNAs obtained their functions. The lncRNA Xist obtained its exons by the pseudogenisation of protein-coding genes, but this phenomenon is unlikely to have occurred widely. With 19 exons, ANRIL provides a valuable case study for lncRNA evolution. Consistent with the hypothesis that most lncRNAs occur in placental mammals, we found that ANRIL originated in the eutherian ancestor and gradually obtained more exons during its evolution. The evolution of ANRIL shows several notable features. First, mammalian ANRIL genes lack splicing signals and thus may not be properly transcribed. Second, simian ANRIL genes contain splicing signals and multiple transposons. Third, pairwise sequence distances, phylogenetic trees, and relative-rate tests all indicate that ANRIL became highly conserved in simians after transposon insertion. Fourth, our analysis of intron 3 and intron 8 indicates that simian-specific transposons may also provide splicing signals for ANRIL in simians. Finally, ANRIL apparently gradually lost exons during rodent evolution, containing numerous exons in guinea pig but none in mouse and rat. Although human ANRIL is known to contain transposons and transposons are known to have contributed significantly to human and vertebrate lncRNAs [33, 34], this report provides the first in-depth analysis showing that the insertion of multiple transposons into ANRIL in simians may have been essential for the evolution and function of ANRIL. We are now examining whether significant transposon insertion also occurred in other simian lncRNAs. Because lncRNA exons lack a codon structure, it is unclear whether the transformation of transposons into ANRIL exons involved a typical exonisation process.
Importantly, ANRIL is the first typical case of clade-specific evolution of lncRNAs, and further studies are needed to elucidate the birth-and-death process in rodents. We postulate that the distinct pattern of two-stage, clade-specific evolution may be a feature of many other lncRNAs.
The reported level of homology between human and mouse H19 (an lncRNA controlling the imprinted expression of Igf2) is approximately 66%, while that between human and mouse Xist is approximately 49% (but much higher in exon regions). Because Infernal shows high power to detect orthologous sequences of ANRIL exons in mammals, the absence of high-scoring hits or medium-scoring hits for successive CMs reliably indicates the absence of ANRIL in mouse and rat. Although no ANRIL exons were found in mouse and rat, a large region of the mouse (and human) chromosome 9p21 contains single-nucleotide mutations that are associated with many diseases, including coronary artery disease [37, 38]. A recent review has summarised these mutations, showing that nearly all of them are located in ANRIL introns. Aligned sequences in the UCSC Genome Browser show that ANRIL introns contain multiple conserved sites or sites with strong epigenomic marks, indicating the functional importance of ANRIL intron sequences.
Some transposons have specifically contributed to mammalian evolution through the rewiring of pregnancy-related gene-regulatory networks. During early primate evolution, many thousands of DNA elements became integrated and fixed, and numerous primate-specific SINEs (short interspersed nuclear elements) triggered the evolution of primate-specific functions. These transposon bursts in primates likely strongly influenced the structural evolution of primate genomes, but their functional significance remains inadequately understood. Our analysis of ANRIL reveals that transposons not only inserted or transformed into ANRIL exons but also caused ANRIL to become highly conserved in simians. Thus, these transposons obtained functional significance by contributing to lncRNA evolution, and the evolution of significant lncRNAs and transposons may be deeply co-opted. This hypothesis can reasonably explain the findings that many lncRNAs contain multiple transposons and that transposons contribute significantly to human and vertebrate lncRNAs [33, 34]. Given that approximately one-third of lncRNAs appear to have arisen within the primate lineage and that many transposons are primate-specific, such co-option has likely had a profound impact on primate evolution and physiology. Specifically, the absence of ANRIL in mouse and rat and the truncated and non-functional HOTAIR in mouse may make the epigenomic regulation of many important genes differ between humans and mice/rats [44, 45]. Transposons also influence genome methylation at many sites in somatic cells. These observations raise a critical question: to what extent do lncRNAs and transposons affect the comparability of human and mouse/rat cancers?
François Jacob postulated that the emergence of novel forms and functions over time can occur via a ‘tinkering’ process through random combinations of pre-existing elements. More recently, it has been suggested that the pool of transposable elements can be domesticated to serve as a ‘warehouse’ for natural selection, potentially acting as a source of lineage-specific elements [47, 48]. Cordaux et al. explored an example of tinkering along the human evolutionary lineage and identified a primate-specific chimeric gene consisting of a host gene merged with a transposable element. Subsequent reports have further examined such phenomena [50, 51]. ANRIL provides a remarkable example because it involves the domestication of both new and ancient transposons with impressive site selectivity. Our analyses of ANRIL also raise the intriguing question of the fate of the transposons in tree shrew and prosimians. Our results should further promote the recently renewed interest in transposon function and evolution [52–54].
Transposons contribute significantly to the evolution of ANRIL and of many other lncRNAs. ANRIL obtained splicing signals and became conserved after transposon insertions in simians but lost all exons in some rodents, showing a two-stage and clade-specific evolutionary process.
Identifying sequences homologous to exons in the ANRIL gene
First, RNAfold (http://rna.tbi.univie.ac.at/) was used with the default parameters to predict the structures of the 19 exons in human ANRIL (NR_003529). Based on the predicted structures, Infernal v1.0.1 was used with the default parameters to build 19 covariance models (CMs) and to search the 19 CMs against the macaque genome. Second, LocARNA + RNAalifold (http://rna.tbi.univie.ac.at/) were used with the default parameters to align the 19 exons of human and macaque ANRIL and to predict the structures of the 19 aligned exons. Based on these predicted structures, Infernal was used to re-build 19 new CMs. Third, these CMs were used to search the genomes of 27 organisms. To confirm that the failure to detect exons in mouse and rat was not influenced by the use of CMs based on two primates, we built CMs based on the identified exons in rabbit and used these CMs to re-search the mouse and rat genomes. To confirm that the failure to detect exons in non-placental mammals and vertebrates was not influenced by the use of CMs based on the primates, we built CMs based on the identified exons in rabbit and horse and used these CMs to re-search the opossum and chicken genomes. The following unmasked genomic sequences were downloaded from http://www.ensembl.org for the genome search: human (GRCh37.57), chimpanzee (CHIMP2.1.57), gorilla (gorGor3.1.64), rhesus macaque (MMUL_1.57), orangutan (PPYG2.64), marmoset (C_jacchus188.8.131.52), tarsier (tarSyr1.53), tree shrew (TREESHREW.50), lemur (micMur1.48), cow (Btau_4.0.57), horse (EquCab2.57), dog (BROADD2.57), kangaroo rat (dipOrd1.53), mouse (NCBIM37.57), rat (RGSC3.4.57), guinea pig (cavPor3), rabbit (oryCun2.64), armadillo (dasNov2.54), sloth (choHof1.63), elephant (loxAfr3), tenrec (TENREC.50), wallaby (Meug_1.0.55), opossum (BROADO5.50), platypus (ornAna1), zebrafish (Zv9.60), chicken (WASHUC2.54), and lizard (AnoCar2.0.64). Sequences of the ANRIL gene in simians are given in Additional file 3.
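Covariance-model building takes a structure-annotated alignment as input, commonly in Stockholm format with a consensus structure line. How the predicted structures were converted for Infernal is not described above, so the following is only a sketch of one plausible way to write a single sequence and its dot-bracket structure as a minimal Stockholm record; the function name and the example sequence are hypothetical, and it assumes that plain dot-bracket symbols are acceptable in the `SS_cons` line.

```python
def to_stockholm(name: str, sequence: str, structure: str) -> str:
    """Format one sequence and its dot-bracket structure as a minimal
    single-sequence Stockholm 1.0 record with an SS_cons annotation."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    if not name or any(ch.isspace() for ch in name):
        raise ValueError("sequence name must be non-empty without whitespace")
    label = "#=GC SS_cons"
    width = max(len(name), len(label))
    lines = [
        "# STOCKHOLM 1.0",
        "",
        f"{name.ljust(width)} {sequence}",
        f"{label.ljust(width)} {structure}",
        "//",
    ]
    return "\n".join(lines) + "\n"


# Toy example; not an ANRIL exon.
record = to_stockholm("exon_example", "GGGAAACCC", "(((...)))")
```

Padding the name and the annotation label to the same width keeps the structure line aligned column by column with the sequence, which the format requires.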
Identifying transposons in human ANRIL and in lncRNAs in the lncRNA database
RepeatMasker (http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker) and the database Repbase were used with the default parameters to identify transposons in the human ANRIL gene, in the Infernal search hits, and in lncRNAs in the http://www.lncRNAdb.org database.
Identifying motifs in ANRIL exons
MEME, a program that identifies potential motifs in multiple sequences, was used to identify motifs within the exons of the ANRIL gene. The default parameters were used, except that ‘motif number’ was set to 10.
Sequence alignment and phylogenetic analysis
MAFFT and LocARNA were used with the default parameters to align the transposons and exons. MEGA v5.1 and PHYLIP v3.69 were used with the default parameters to calculate the pairwise distances of transposons and exons between organisms based on the maximum composite likelihood and F84 models [57, 58]. The divergence times between organisms were acquired from http://www.timetree.org. MEGA v5.1 was used to find the most appropriate substitution models and to build maximum likelihood (ML) trees for the transposons and exons. The default parameters were used, except that 1000 bootstrap replications were performed. MrBayes was used with the default parameters to build Bayesian trees for the transposons and exons. Relative-rate tests were performed using RRTree.
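To make the notion of a pairwise distance concrete, the sketch below computes the uncorrected proportion of differing sites (p-distance) between two aligned sequences and applies the Jukes-Cantor correction. This is deliberately a simpler stand-in: the distances in this study were calculated with the maximum composite likelihood and F84 models in MEGA and PHYLIP, not with Jukes-Cantor, and the function names are hypothetical.

```python
import math


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned DNA sequences.

    Columns where either sequence has a gap or a non-ACGT symbol are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "ACGT" and y in "ACGT":
            compared += 1
            if x != y:
                differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor (JC69) corrected distance for a p-distance below 0.75."""
    if not 0.0 <= p < 0.75:
        raise ValueError("p must be in [0, 0.75)")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)
```

The correction matters more as sequences diverge: small p-distances are changed only slightly, while values approaching 0.75 are inflated strongly to account for multiple substitutions at the same site.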
This work was supported by the Guangdong Province Foundation for Returned Scholars and the Guangzhou Supercomputing Center (2012-Y2-00047).
- Ng K, Pullirsch D, Leeb M, Wutz A: Xist and the order of silencing. EMBO Rep. 2007, 8: 34-39. 10.1038/sj.embor.7400871.
- Lee JT: Lessons from X-chromosome inactivation: long ncRNA as guides. Genes Dev. 2009, 23: 1831-1842. 10.1101/gad.1811209.
- Wan L-B, Bartolomei MS: Regulation of imprinting in clusters: noncoding RNAs versus insulators. Adv Genet. 2008, 61: 207-223.
- Ferguson-Smith AC: Genomic imprinting: the emergence of an epigenetic paradigm. Nat Rev Genet. 2011, 12: 565-575.
- Ponjavic J, Ponting CP, Lunter G: Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res. 2007, 17: 556-565. 10.1101/gr.6036807.
- Guttman M, et al: Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature. 2009, 458: 223-227. 10.1038/nature07672.
- Derrien T, et al: The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 2012, 22: 1775-1789. 10.1101/gr.132159.111.
- Zhao J, Sun BK, Erwin JA, Song J-J, Lee JT: Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome. Science. 2008, 322: 750-756. 10.1126/science.1163045.
- Tsai M-C, Manor O, Wan Y, Mosammaparast N, Wang JK, Lan F, Shi Y, Segal E, Chang HY: Long noncoding RNA as modular scaffold of histone modification complexes. Science. 2010, 329: 689-693. 10.1126/science.1192002.
- Chu C, Qu K, Zhong FL, Artandi SE, Chang HY: Genomic maps of long noncoding RNA occupancy reveal principles of RNA-chromatin interactions. Mol Cell. 2011, 44: 1-12. 10.1016/j.molcel.2011.09.009.
- Lee JT: Epigenetic regulation by long noncoding RNAs. Science. 2012, 338: 1435-1439. 10.1126/science.1231776.
- Ulitsky I, Bartel DP: LincRNAs: genomics, evolution, and mechanisms. Cell. 2013, 154: 26-46. 10.1016/j.cell.2013.06.020.
- Reik W, Lewis A: Co-evolution of X-chromosome inactivation and imprinting in mammals. Nat Rev Genet. 2005, 6: 403-410.
- Ulitsky I, Shkumatava A, Jan CH, Sive H, Bartel DP: Conserved function of lincRNAs in vertebrate embryonic development despite rapid sequence evolution. Cell. 2011, 147: 1537-1550. 10.1016/j.cell.2011.11.055.
- Roeszler KN, Itman C, Sinclair AH, Smith CA: The long non-coding RNA, MHM, plays a role in chicken embryonic development, including gonadogenesis. Dev Biol. 2012, 366: 317-326. 10.1016/j.ydbio.2012.03.025.
- Ilik IA, et al: Tandem stem-loops in roX RNAs act together to mediate X chromosome dosage compensation in Drosophila. Mol Cell. 2013, 51: 156-173. 10.1016/j.molcel.2013.07.001.
- Bell AC, Felsenfeld G: Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene. Nature. 2000, 405: 482-485. 10.1038/35013100.
- Sleutels F, Zwart R, Barlow DP: The non-coding Air RNA is required for silencing autosomal imprinted genes. Nature. 2002, 415: 810-813. 10.1038/415810a.
- Rinn JL, et al: Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell. 2007, 129: 1311-1323. 10.1016/j.cell.2007.05.022.
- Pasmant E, Laurendeau I, Heron D, Vidaud M, Vidaud D, Bieche I: Characterization of a germ-line deletion, including the entire INK4/ARF locus, in a melanoma-neural system tumor family: identification of ANRIL, an antisense noncoding RNA whose expression coclusters with ARF. Cancer Res. 2007, 67: 3963-3969. 10.1158/0008-5472.CAN-06-2004.
- Yu W, Gius D, Onyango P, Muldoon-Jacobs K, Karp J, Feinberg AP, Cui H: Epigenetic silencing of tumour suppressor gene p15 by its antisense RNA. Nature. 2008, 451: 202-206. 10.1038/nature06468.
- Kotake Y, Nakagawa T, Kitagawa K, Suzuki S, Liu N, Kitagawa M, Xiong Y: Long non-coding RNA ANRIL is required for the PRC2 recruitment to and silencing of p15INK4B tumor suppressor gene. Oncogene. 2011, 30: 1956-1962. 10.1038/onc.2010.568.
- Ohno S: Evolution by gene duplication. 1970, New York: Springer-Verlag.
- Nawrocki EP, Kolbe DL, Eddy S: Infernal 1.0: inference of RNA alignments. Bioinformatics. 2009, 25: 1335-1337. 10.1093/bioinformatics/btp157.
- Gardner PP: The use of covariance models to annotate RNAs in whole genomes. Brief Funct Genomic Proteomic. 2009, 8: 444-450. 10.1093/bfgp/elp042.
- Kaplan N, Darden T, Langley CH: Evolution and extinction of transposable elements in Mendelian populations. Genetics. 1985, 109: 459-480.
- Pace JK, Feschotte C: The evolutionary history of human DNA transposons: evidence for intense activity in the primate lineage. Genome Res. 2007, 17: 422-432. 10.1101/gr.5826307.
- Stocsits RR, Letsch H, Hertel J, Misof B, Stadler PF: Accurate and efficient reconstruction of deep phylogenies from structured RNAs. Nucleic Acids Res. 2009, 37: 6184-6193. 10.1093/nar/gkp600.
- Robinson-Rechavi M, Huchon D: RRTree: relative-rate tests between groups of sequences on a phylogenetic tree. Bioinformatics. 2000, 16: 296-297. 10.1093/bioinformatics/16.3.296.
- Amaral PP, Clark MB, Gascoigne DK, Dinger ME, Mattick JS: lncRNAdb: a reference database for long noncoding RNAs. Nucleic Acids Res. 2011, 39: D146-D151. 10.1093/nar/gkq1138.
- Duret L, Chureau C, Samain S, Weissenbach J, Avner P: The Xist RNA gene evolved in eutherians by pseudogenization of a protein-coding gene. Science. 2006, 312: 1653-1655. 10.1126/science.1126316.
- Jarinova O, et al: Functional analysis of the chromosome 9p21.3 coronary artery disease risk locus. Arterioscler Thromb Vasc Biol. 2009, 29: 1671-1677. 10.1161/ATVBAHA.109.189522.
- Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay LA, Bourque G, Yandell M, Feschotte C: Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet. 2013, 9: e1003470. 10.1371/journal.pgen.1003470.
- Kelley DR, Rinn J: Transposable elements reveal a stem cell specific class of long noncoding RNAs. Genome Biol. 2012, 13: R107. 10.1186/gb-2012-13-11-r107.
- Möller-Krull M, Zemann A, Roos C, Brosius J, Schmitz J: Beyond DNA: RNA editing and steps toward Alu exonization in primates. J Mol Biol. 2008, 382: 601-609. 10.1016/j.jmb.2008.07.014.
- Nesterova TB, et al: Characterization of the genomic Xist locus in rodents reveals conservation of overall gene structure and tandem repeats but rapid evolution of unique sequence. Genome Res. 2001, 11: 833-849. 10.1101/gr.174901.
- Broadbent HM, et al: Susceptibility to coronary artery disease and diabetes is encoded by distinct, tightly linked SNPs in the ANRIL locus on chromosome 9p. Hum Mol Genet. 2008, 17: 806-814.
- Visel A, Zhu Y, May D, Afzal V, Gong E, Attanasio C, Blow MJ, Cohen JC, Rubin EM, Pennacchio LA: Targeted deletion of the 9p21 non-coding coronary artery disease risk interval in mice. Nature. 2010, 464: 409-412. 10.1038/nature08801.
- Pasmant E, Sabbagh A, Vidaud M, Bieche I: ANRIL, a long, noncoding RNA, is an unexpected major hotspot in GWAS. FASEB J. 2010, 25: 444-448.
- Lynch VJ, Leclerc RD, May G, Wagner GP: Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals. Nat Genet. 2011, 43: 1154-1159. 10.1038/ng.917.
- Kamal M, Xie X, Lander ES: A large family of ancient repeat elements in the human genome is under strong selection. Proc Natl Acad Sci USA. 2006, 103: 2740-2745. 10.1073/pnas.0511238103.PubMed CentralPubMedView ArticleGoogle Scholar
- Pandey R, Mukerji M: From ‘JUNK’ to just unexplored noncoding knowledge: the case of transcribed Alus. Brief Funct Genomics. 2011, 10: 294-311. 10.1093/bfgp/elr029.PubMedView ArticleGoogle Scholar
- Gong C, Maquat LE: LncRNAs transactivate STAU1-mediated mRNA decay by duplexing with 3′ UTRs via Alu elements. Nature. 2011, 470: 284-288. 10.1038/nature09701.
- He S, Liu S, Zhu H: The sequence, structure and evolutionary features of HOTAIR in mammals. BMC Evol Biol. 2011, 11: 102. 10.1186/1471-2148-11-102.
- Schorderet P, Duboule D: Structural and functional differences in the long non-coding RNA Hotair in mouse and human. PLoS Genet. 2011, 7: e1002071. 10.1371/journal.pgen.1002071.
- Jacob F: Evolution and tinkering. Science. 1977, 196: 1161-1166. 10.1126/science.860134.
- Miller WJ, McDonald JF, Pinsker W: Molecular domestication of mobile elements. Genetica. 1997, 100: 261-270. 10.1023/A:1018306317836.
- Bowen NJ, Jordan IK: Transposable elements and the evolution of eukaryotic complexity. Curr Issues Mol Biol. 2002, 4: 65-76.
- Cordaux R, Udit S, Batzer MA, Feschotte C: Birth of a chimeric primate gene by capture of the transposase gene from a mobile element. Proc Natl Acad Sci USA. 2006, 103: 8101-8106. 10.1073/pnas.0601161103.
- Hikosaka A, Kobayashi T, Saito Y, Kawahara A: Evolution of the Xenopus piggyBac transposon family TxpB: domesticated and untamed strategies of transposon subfamilies. Mol Biol Evol. 2007, 24: 2648-2656. 10.1093/molbev/msm191.
- Casola C, Hucks D, Feschotte C: Convergent domestication of pogo-like transposases into centromere-binding proteins in fission yeast and mammals. Mol Biol Evol. 2008, 25: 29-41.
- Feschotte C: Transposable elements and the evolution of regulatory networks. Nat Rev Genet. 2008, 9: 397-405. 10.1038/nrg2337.
- Fedoroff NV: Transposable elements, epigenetics, and genome evolution. Science. 2012, 338: 758-767. 10.1126/science.338.6108.758.
- Werren JH: Selfish genetic elements, genetic conflict, and evolutionary innovation. Proc Natl Acad Sci USA. 2011, 108: 10863-10870. 10.1073/pnas.1102343108.
- Bailey TL, Bodén M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS: MEME SUITE: tools for motif discovery and searching. Nuc Acids Res. 2009, 37: W202-W208. 10.1093/nar/gkp335.
- Katoh K, Kuma K, Toh H, Miyata T: MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nuc Acids Res. 2005, 33: 511-518. 10.1093/nar/gki198.
- Felsenstein J: Phylip – phylogeny inference package. Cladistics. 1989, 5: 164-166.
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
- Kumar S, Hedges SB: TimeTree2: species divergence times on the iPhone. Bioinformatics. 2011, 27: 2023-2024. 10.1093/bioinformatics/btr315.
- Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Hohna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP: MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012, 61: 539-542. 10.1093/sysbio/sys029.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.