Skip to main content


Analysis of the human Alu Ye lineage

  • 5797 Accesses

  • 17 Citations



Alu elements are short (~300 bp) interspersed elements that amplify in primate genomes through a process termed retroposition. The expansion of these elements has had a significant impact on the structure and function of primate genomes. Approximately 10 % of the mass of the human genome is comprised of Alu elements, making them the most abundant short interspersed element (SINE) in our genome. The majority of Alu amplification occurred early in primate evolution, and the current rate of Alu retroposition is at least 100 fold slower than the peak of amplification that occurred 30–50 million years ago. Alu elements are therefore a rich source of inter- and intra-species primate genomic variation.


A total of 153 Alu elements from the Ye subfamily were extracted from the draft sequence of the human genome. Analysis of these elements resulted in the discovery of two new Alu subfamilies, Ye4 and Ye6, complementing the previously described Ye5 subfamily. DNA sequence analysis of each of the Alu Ye subfamilies yielded average age estimates of ~14, ~13 and ~9.5 million years old for the Alu Ye4, Ye5 and Ye6 subfamilies, respectively. In addition, 120 Alu Ye4, Ye5 and Ye6 loci were screened using polymerase chain reaction (PCR) assays to determine their phylogenetic origin and levels of human genomic diversity.


The Alu Ye lineage appears to have started amplifying relatively early in primate evolution and continued propagating at a low level as many of its members are found in a variety of hominoid (humans, greater and lesser ape) genomes. Detailed sequence analysis of several Alu pre-integration sites indicated that multiple types of events had occurred, including gene conversions, near-parallel independent insertions of different Alu elements and Alu-mediated genomic deletions. A potential hotspot for Alu insertion in the Fer1L3 gene on chromosome 10 was also identified.


The proliferation of Alu elements has had a significant impact on the architecture of primate genomes [1]. They comprise over 10% of the human genome by mass and are the most abundant short interspersed element (SINE) in primate genomes [2]. Alu elements have achieved this copy number by duplicating via an RNA intermediate in a process termed retroposition [3]. During retroposition the RNA copy is reverse transcribed by target primed reverse transcription (TPRT) and subsequently integrated into the genome [46]. While unable to retropose autonomously, Alu elements are thought to borrow the factors that are required for their amplification from the LINE (long interspersed element) elements [69], which encode a protein with endonuclease and reverse transcriptase activity [10, 11]. Because of their high copy number, Alu repeats have been a significant source of new mutations as a result of insertion and post-integration recombination between elements [12, 13].

The majority of Alu amplification occurred early in primate evolution, and the current rate of Alu retroposition is at least 100 fold slower than the peak of amplification that appears to have occurred 30–50 million years ago [2, 1416]. Even though there are over one million Alu elements within the human genome, only a small number of these elements are capable of movement [17]. As a result of the limited amplification capacity of Alu elements, a series of discrete subfamilies of Alu elements that share common diagnostic mutations have been identified in the human genome [1821]. A small subset of "young" Alu repeats are so recent in origin that they are present in the human genome and absent from the genomes of non-human primates, with some of the elements being polymorphic with respect to insertion presence/absence in diverse human genomes [16, 2225]. Individual SINE elements have proven to be essentially homoplasy-free characters which are therefore quite useful for resolving phylogenetic and population genetic questions [2, 2634]. For example, young Alu subfamilies which arose around the radiation of Subtribe Hominina (gorillas, chimpanzees, and humans) four to six million years ago [35] were used as homoplasy free phylogenetic markers to resolve the branching order in hominids [36]. Relationships among other primates have also been resolved using relatively large numbers of Alu elements as phylogenetic markers [28, 3740]

We have previously characterized a large number of recently integrated Alu elements found in the human genome that fall in six distinct lineages, termed Ya, Yb and Yc, Yd, Yg and Yi based upon their diagnostic mutations [4152]. Here, we describe the distribution in the human genome of three Alu subfamilies that are members of the Alu Ye lineage [53] and are characterized by four (Ye4), five (Ye5) and six (Ye6) diagnostic mutations, respectively.


Subfamily size and age

Alu Ye elements were identified in the draft sequence of the human genome using BLAST [54] queries of the draft sequence to identify exact complements to an Alu Ye specific oligonucleotide (Fig. 1). See the Materials and Methods section for details on the search. Using this approach we identified 25 Ye4 subfamily members that shared four diagnostic base positions and thus comprised the Alu Ye4 subfamily. We also identified 103 elements that shared five diagnostic base positions and comprise the Alu Ye5 subfamily and 25 Ye6 subfamily members that shared six diagnostic base positions and comprised the Alu Ye6 subfamily. Each of the subfamilies was named in accordance with standard nomenclature for new Alu subfamilies [55].

Figure 1

Sequence alignment of Alu Ye subfamilies. The consensus sequence for the Alu Y subfamily is shown at the top. The sequences of Alu Ye4, Ye5 and Ye6 subfamilies are shown below. The dots below represent the same nucleotides as the consensus sequence. Deletions are shown as dashes and mutations are shown as the correct base for each of the subfamilies.

To estimate the copy number of the Ye4, Ye5 and Ye6 Alu subfamilies, we preformed BLAST searches of the draft sequence of the human genome using an Alu Ye lineage-specific oligonucleotide to query the database (as outlined in the methods). Seventeen of the 25 Alu Ye4 elements were unique (non-paralogous). There were also 76 unique Ye5 Alu elements and 23 unique Ye6 Alu subfamily members. Multiple alignments of the Alu elements from each subfamily were constructed and the number of mutations from the consensus sequence for each Alu subfamily was determined. In each case the mutations were divided into those that occur at CpG dinucleotides and those that occur at non-CpG positions without including small insertions or deletions as described previously [4749]. The mutations are divided into these two different classes to estimate the average age of each subfamily because the CpG base positions in repeated sequences mutate at a rate that is about six times higher than non-CpG positions [56] as a result of the spontaneous deamination of 5-methylcytosine residues [57].

Mutation densities were calculated for each Alu Ye subfamily. For 17 elements from the Alu Ye4 subfamily, the non-CpG and CpG mutation densities were 2.1% (83/3944) and 12.5 % (106/850). Using a neutral rate of evolution of 0.15% per million years for non-CpG positions [58] and 0.9% per million years for the CpG base positions [56] along with the average mutation density yields age estimates of 14.03 and 13.86 million years old for the Ye4 subfamily. For the Alu Ye5 subfamily 76 elements were analyzed that contained a total of 17632 non-CpG nucleotides and 3800 CpG nucleotides that contained 351 non-CpG and 431 CpG mutations. The mutation densities of the Ye5 subfamily were 1.99% and 11.34% for the non-CpG and CpG nucleotides yielding age estimates based on the average mutation density of 13.27 and 12.60 million years old. For the Alu Ye6 subfamily 23 elements were analyzed that contained a total of 5336 non-CpG nucleotides and 1150 CpG nucleotides that contained 86 non-CpG and 92 CpG mutations. The mutation densities of the Ye6 subfamily were 1.61% and 8% for the non-CpG and CpG nucleotides yielding age estimates based on the average mutation density of 10.75 and 8.89 million years old.

Evolutionary analysis

In order to determine the approximate time of insertion for each Alu Ye4, Ye5 and Ye6 subfamily member, we performed a series of PCR reactions using human and non-human primate DNA samples as templates. Unfortunately, not all of the loci identified in the draft sequence were amenable to PCR analysis, as some of them had inserted into other repetitive regions of the genome making the design of flanking unique sequence PCR primers difficult.

For the Ye subfamilies, 120 of the 153 elements identified in the draft human genomic sequence were amplified by PCR. Examination of the orthologous regions of the various species genomes displayed a series of different PCR patterns indicative of the time of retroposition of each of the elements into the primate genomes. Results from a series of these experiments showed a gradient of Ye Alu repeats beginning with some elements that are recent in origin and unique to the human genome (e.g. Ye5AH110) and ending with elements that are found within all ape genomes (e.g. Ye5AH148). The distribution of all the Ye elements in various primate genomes is summarized in Additional File 2.

Gene conversion

Gene conversion between Alu elements and in other regions of the human genome exerts a significant influence on the accumulation of single nucleotide diversity within the human genome [2, 50]. To estimate the frequency of gene conversion in the Alu Ye subfamily members, we compared the sequences of the elements found in the human genome to the consensus sequences of other Alu subfamilies. Using this approach, we identified two Alu Ye5 subfamily members that appeared to have been subjected to partial gene conversion at their 3' ends. Alu Ye5AH70 contains three mutations that are diagnostic for the Yb8/9 subfamily. Similarly, Alu Ye5AH173 contains three Alu Sc mutations. Each of the sequence exchanges occurred in a short contiguous sequence suggesting that they were products of gene conversion rather than homoplasic point mutations.

We identified one Alu-containing locus that was involved in full gene conversion/ replacement event, (Ye5AH181). In this case, the orthologous Alu elements have similar flanking sequences and direct repeats, although they are not precisely identical due to the random mutations that accumulated over time. DNA sequence analysis of this locus showed that the Alu element of selected new world monkey genomes (spider monkey, woolly monkey and tamarin) belonged to the Alu Sg subfamily. This suggests that a gene conversion of an older, pre-existing Alu Sg may have introduced the Ye5 sequence in the common ancestor of humans, chimpanzees, gorillas and orangutans. Amplification of this locus was unsuccessful in the old world monkey taxa tested.

Alu-mediated genomic deletions

Two deletions of part of the human genome appeared to be associated with newly inserted Alu Ye elements. These deletions were identified at loci Ye5AH24 and Ye5AH27. In the case of Ye5AH24, the deletion was associated with a gene conversion of an Alu Y in both orangutan and siamang to AluYe5 in human, bonobo, common chimpanzee and gorilla and involved the removal of about 500 bp from the 3' flanking region. For Alu Ye5AH27, the deletion was associated with a gene conversion of an Alu Sx element (orangutan and siamang) to AluYe5 (human, bonobo, common chimpanzee and gorilla) and involved the removal of 142 bp from the 3' flanking region. Based on this data, we estimate the frequency of Alu retroposition mediated deletions of approximately 1.67% (2/120).

The pre-integration sites for three elements (Ye5AH11, Ye5AH40 and Ye5AH173) did not amplify in any non-human primate species. Previously, the insertion of L1 elements has been shown to be associated with large genomic deletions [59]. Thus, one possible explanation for the absence of pre-integration PCR products would be that a large deletion (>1 kb) occurred at each of these loci during Alu integration. If a deletion occurred during the integration of an Alu element in the human genome, then the pre-integration product size calculated computationally would be an underestimate of the true size of the locus. To investigate this possibility, we utilized long template PCR reactions of these loci that would facilitate the amplification of larger (up to 25 kb) products. Unfortunately, PCR amplicons were not generated by any of these loci, suggesting that the retrotransposition of these Alu elements in humans may have generated deletions greater than 25 kb in size. Alternately, the orthologous loci in non-human primate genomes may have undergone additional mutations at the oligonucleotide primer sites, preventing PCR amplification.

Independent Alu insertions

We have also identified one locus (Ye5AH161) that contained multiple paralogous Alu insertions in human, chimpanzee, gorilla lineage, old world monkey and new world monkey lineages (Fig. 2). In the human, chimpanzee and gorilla lineage (subtribe Hominina) there was an independent insertion of an Alu Ye5 in the 5' flank of an Alu Sx that is common to all taxa. In all the old world monkey genomes tested (Green monkey, Macaque and Rhesus monkey), an Alu Sp has inserted in the 5' flank of the shared Sx element about 58 bp away of the Alu Ye5 present in Hominina. Also, in the woolly and spider monkeys (new world monkeys), there was an independent insertion of an Alu Sx in the 5' flank of the shared Alu Sx. In gibbon, siamang and orangutan, there were no independent Alu insertions at this locus, only the common Alu Sx is present. In orangutan, however, there was an extra 145 bp of genomic sequences inserted inside the old Alu Sx. The pattern discussed suggests that these three independent parallel insertion events occurred sometime after the divergence of these primates from one another. This locus on chromosome 10q23.33 lies in intron 39–40 of the Human Fer1L3 gene, about 50 bp from exon 39. This locus may be considered a hot spot for Alu insertion. An alignment of locus Ye5AH161 is available as Additional file 1 and at

Figure 2

Parallel insertions at the Ye5AH161 locus. A) The figure shows an agarose gel chromatograph of the PCR products resulting from amplification at the Ye5AH161 locus in 13 primate species. The ~795 bp PCR product is found in the human, common chimpanzee, pygmy chimpanzee, gorilla, green monkey, Rhesus monkey, macaque, woolly monkey and spider monkey genomes. Smaller bands were found in orangutan, gibbon and siamang. Sequence analysis of the PCR products shows three independent insertions; a Ye5 in subtribe Hominina (human, chimpanzee and gorilla), a second insertion of an Alu Sp in old world monkeys, and an Alu Sx insertion in new world monkeys. Suspected non-homologous recombination has inserted 145 bp in the orangutan genome at this locus. B) A schematic representation of the multiple Alu independent insertions and the distance between the shared Alu Sx and the independently inserted Alu elements. The sequence of Fer1L3-Exon 39 is shown. Silent mutations are highlighted and the distance from the inserted Alus are indicated. Abbreviations used in the figure are: Human (H), Chimpanzee (C), Gorilla (G), Orangutan (O), Gibbon (Gn), Siamang (S), Green monkey (Gm), Rhesus monkey (R), Macaque (M), Woolly monkey (W) and Spider monkey (Sm).

We also identified another near-parallel independent Alu insertion event at human Ye5AH16 locus in all the old world monkey genomes tested (Green monkey, Macaque and Rhesus), within the same locus where an Alu Ye5 element was located in the human, chimpanzee, gorilla and orangutan genomes. Thus, the near-parallel insertion most likely occurred after the divergence of humans and apes from old world monkeys, but before the radiation of the old world monkeys. The element present in the old world monkey genomes is an Alu Y and is 80 bp from the human insertion site.

Human genomic diversity

To determine the human genomic diversity associated with each of the Alu Ye4, Ye5 and Ye6 subfamily members, we performed a series of PCR reactions on a collection of 80 geographically-diverse human genomes. Using this approach, we identified one new Alu insertion polymorphism (Ye5AH167) from the loci analyzed in this report. The allele frequencies, genotypes and heterozygosities for the Alu insertion polymorphism are shown in Table 1.

Table 1 Human genetic diversity of Ye5AD167.


Our detailed analysis of the Alu Ye5 subfamily resulted in the recovery of two new Alu subfamilies, Ye4 and Ye6. Each of these Alu subfamilies has a relatively small copy number in the human genome. The proportion of polymorphic elements within each of the subfamilies is quite low with only 0.83% of the Alu Ye elements being polymorphic, only one member of Ye subfamilies (Ye5AD167) is polymorphic with respect to insertion presence/absence in the human genome. In contrast, many other young Alu subfamilies have levels of insertion polymorphism in excess of 20% [2]. Therefore, the amplification of these Alu subfamilies within the human genome has occurred at a very low rate, and may have recently ceased entirely. The estimated average ages of ~14, ~13 and ~9.5 million years old for the Alu Ye4, Ye5 and Ye6 subfamilies, respectively are consistent with their relatively recent origin in primate genomes. It is also consistent with the master gene model of SINE retroposition which suggests that as a master element accumulates mutations over time, the resulting elements will share those mutations [60].

Members of the Alu Ye lineages are dispersed throughout the genomes of all hominoids (humans, greater and lesser apes) suggesting that this subfamily of Alu elements began to amplify about 15–20 million years ago. Therefore, the Ye subfamily appears to have been retroposition competent during hominoid evolution, but must have been relatively inefficient at producing copies. Although the rate of Ye amplification has not been dramatic within the human lineage, it may be quite interesting to recover Alu Ye subfamily members from other ape genomes and to determine the rate of Ye subfamily amplification in these genomes to see if there has been any differential amplification of these elements in non-human primate genomes. The differential amplification of ID SINEs within various members of the rodent lineage has been reported previously suggesting that the amplification of SINEs within various genomes is subject to changes [61, 62].

Gene conversion between Alu repeats has been reported previously [26, 63, 64]. The gene conversion events involve in three Alu Ye subfamily members were quite interesting. In one case (Ye5AH181), the Alu-containing locus was involved in full gene conversion event where Alu Sg in new world monkeys is replaced by an Alu Ye5 in Humans, chimpanzees, gorillas and orangutan. In the other two cases (Ye5AH70 and Ye5AH173), only a small portion of the 3' end of the Ye elements were involved in the gene conversion. This is in good agreement with the molecular nature of gene conversion events recently reported for the Ya5 and Yb8/9 Alu subfamilies [47, 48, 64, 65]. The detection of three gene conversion events from about 153 Alu Ye elements suggests that gene conversion of these events has been relatively rare, with a rate of 1.96%. However, this rate is comparable to that reported previously for the Alu Ya5 and Yb8 subfamilies within the human genome, as well as that for the Ta subfamily of human LINE elements [6466].

In all cases, the Ye Alu family members that were involved in the gene conversion were monomorphic for insertion presence within the human genome. In the partial gene conversion events, the Ye Alu repeats were gene converted by Yb8/9 and Sx Alu elements. The Yb8/9 Alu subfamily was one of the first groups of Alu repeats that was ever reported to be involved in gene conversion, and may be more prone to these types of events as a result of a retroposition rate that is slightly higher than other recently integrated Alu subfamilies in the human genome [48, 64, 65]. The gene conversion between Alu elements may in part be a function of the length of time that the individual Alu elements have resided in the human genome [26, 50]. Based on an examination of low copy number transgenes in the mouse, it has been suggested that the germline recombination machinery in mammals has been evolved to prevent high levels of ectopic recombination between repetitive sequences [67]. It is quite possible that the high copy number of Alu elements allows for pairing between regions of sequence identity of different Alu elements initiating the start of gene conversion before cellular control systems can terminate the process resulting in the production of small gene conversion tracts.

The identification of multiple paralogous Alu insertions involving an Alu Ye element (Ye5AH161) in humans, bonobo, common chimpanzee and gorilla lineage, Alu Sp in old world monkeys lineage and Alu Sx in new world monkeys lineage is also interesting. The paralogous insertion of an Alu repeat into the orthologous regions of human and non-human primate genomes is an independent evolutionary event [26]. To date there are no known cases of the independent insertion of paralogous Alu elements into identical sites within different genomes. The detection of parallel insertions is a function of the rate of retroposition of Alu elements within various primate lineages and the time since the most recent common ancestor [26]. However, this locus (Ye5AH161) supports the idea of hotspots for the integration of Alu repeats within primate genomes. Future studies on the integration of different SINE elements in syntenic regions of human and rodent genomes may yield new insight into the molecular nature of hotspots for SINE element integration.

Genomic deletions created upon LINE-1 retrotransposition using cell culture assays have been recently identified [59]. The rate of LINE element deletion was estimated indirectly in the human genome to be about 3% [68] or 8–13% through sequencing variable sizes of the preintegration sites of L1HS in primates [69]. The precise molecular mechanism of the LINE mediated genomic deletions is still unclear. Recently, an Alu-mediated deletion that resulted in the inactivation of the human CMP-N-acetylneuraminic acid hydroxylase gene [70] and Alu mediated deletions of noncoding genomic sequences have been identified [71]. Here we report two new examples of Alu retroposition-mediated deletions that may have happened by a mechanism similar to that of the LINE element mediated genomic deletions since Alu and L1 elements utilize a common mobilization pathway [6, 8, 72]. In both cases, Alu Ye5AH24 and Alu Ye5AH27, the deletion appears to have occurred, after the separation of human, chimpanzee and gorillas from orangutan and Siamang, during the process of gene conversion similar to the lineage specific Alu deletion reported previously [70, 71].

Here, we have estimated the frequency of Alu retroposition associated genomic deletions as approximately 1.67%. The size of the deleted sequences was over 300 bp on average. New Alu integrations have been estimated to occur in vivo at a frequency of one new event in every 10 to 200 births [12]. If sizable deletions accompany one in every 100 new Alu retroposition events in vivo, the genomic impact of these events could be substantial. This is not a trivial number of deletions when extrapolated to the copy number of Alu elements in the human genome which is over one million [2]. Approximately about 16,700 Alu elements may have been involved in retroposition mediated deletion events within primate genomes. If each of these deletion events removes an average of 300 bp of genomic sequence, this would mean that Alu retroposition mediates the deletion of about 5 Mb of the primate genomic sequences. However, if the Alu associated deletions have involved larger sequences similar to those recently reported for LINE elements [59], then the impact of these events may be 50–500 Mb of lineage specific deletions. In either case, these types of events represent a novel mechanism of lineage-specific deletion within the primate order. Detailed studies of the orthologous regions of primate genomes deleted in this manner may prove instructive for understanding the genetic basis of the difference between humans and non-human primates.


The Alu Ye lineage has had an extended history of expansion in the human lineage. Its expansion appears to have begun soon after the divergence of the hominoids from the remainder of the catarrhine primates and proceeded at a relatively low level since then. Extended periods of relatively low levels of retrotransposition may allow some mobile elements to retain duplication capability for long periods of time. Despite a relatively low level of retrotransposition, the Alu Ye lineage has contributed to the architecture of the human genome through insertion mutations, retrotransposition associated genomic deletions, and gene conversion.


Computational analysis

To identify Alu Ye elements in the draft sequence of the human genome (August 6, 2001, UCSC GoldenPath assembly), we used Basic Local Alignment Search Tool (BLAST) [54] queries of the draft sequence to identify exact complements to the oligonucleotide 5'- GAACCCCGGGGGGCGGAGCCTGCAG-3' that is diagnostic for the Ye lineage as shown in Fig. 1. All of the exact complements to the oligonucleotide queries along with 1000 bp of adjacent flanking unique DNA sequence were excised and stored as unique files and subjected to additional analysis as outlined previously [4749]. A complete list of all the Alu elements identified in the searches is located in Additional file 2 and is available at

DNA samples and PCR amplification

Oligonucleotide primers and PCR amplification reactions for each of the Alu Ye lineage loci analyzed were performed as previously described [4749] using the primers and annealing temperatures shown in Additional file 2 for Alu Ye lineage members. Diverse human DNA samples were available from previous studies [4749]. The cell lines used to isolate DNA samples were as follows: chimpanzee (Pan troglodytes), WES (ATCC CRL1609); gorilla (Gorilla gorilla) lowland gorilla Coriell AG05251B, Ggo-1 (primary gorilla fibroblasts) provided by Dr. Stephen J. O'Brien, National Cancer Institute, Frederick, MD, USA; bonobo (Pan paniscus) Coriell AG05253A; orangutan (Pongo pygmaeus) ATCC CRL6301; green monkey (Chlorocebus aethiops) ATCC CCL70 (old world monkey); and owl monkey (Aotus trivirgatus) OMK (OMKidney) ATCC CRL 1556 (new world monkey). Cell lines were maintained as directed by the source and DNA isolations were performed using Wizard genomic DNA purification (Promega). DNA samples from peripheral lymphocytes or tissue were prepared from the gibbon (Hylobates lar) and siamang (Hylobates syndactylus). Additional non-human primate DNA samples (Pan troglodytes, Pan paniscus, Gorilla gorilla, Pongo pygmaeus, Macaca mulatta (old world monkey), Macaca nemestrina (old world monkey), Saquinus labiatus (new world monkey), Lagothrix lagotricha (new world monkey), Ateles geoffroyi (new world monkey) and Lemur catta (prosimian) available as a primate phylogenetic panel (PRP00001) were purchased from the Coriell Institute for Medical Research.

Sequence analysis

DNA sequencing was performed on a gel purified PCR products that had been cloned using the TOPO TA cloning vector (Invitrogen) using chain termination sequencing [73] on an Applied Biosystems 3100 automated DNA sequencer. The sequence of the orthologous loci (that contained a paralogous Alu element) has been assigned accession numbers AY849282-AY849301. Sequence alignments of the Ye lineage subfamily members were performed using MegAlign software (DNAStar version 3.1.7 for Windows 3.2). The ages for each of the Alu Ye subfamilies were calculated using mutation densities as previously described [43, 4749, 65] with rates suggested by Xing et al. [56].


  1. 1.

    Deininger PL, Batzer MA: Evolution of retroposons. Evolutionary Biology. 1993, 27: 157-196.

  2. 2.

    Batzer MA, Deininger PL: Alu repeats and human genomic diversity. Nat Rev Genet. 2002, 3: 370-379. 10.1038/nrg798.

  3. 3.

    Weiner AM, Deininger PL, Efstratiadis A: Nonviral retroposons: genes, pseudogenes, and transposable elements generated by the reverse flow of genetic information. Annu Rev Biochem. 1986, 55: 631-661. 10.1146/

  4. 4.

    Luan DD, Korman MH, Jakubczak JL, Eickbush TH: Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition. Cell. 1993, 72: 595-605. 10.1016/0092-8674(93)90078-5.

  5. 5.

    Kazazian HH, Moran JV: The impact of L1 retrotransposons on the human genome. Nat Genet. 1998, 19: 19-24.

  6. 6.

    Kajikawa M, Okada N: LINEs mobilize SINEs in the eel through a shared 3' sequence. Cell. 2002, 111: 433-444. 10.1016/S0092-8674(02)01041-3.

  7. 7.

    Sinnett D, Richer C, Deragon JM, Labuda D: Alu RNA transcripts in human embryonal carcinoma cells. Model of post-transcriptional selection of master sequences. J Mol Biol. 1992, 226: 689-706. 10.1016/0022-2836(92)90626-U.

  8. 8.

    Boeke JD: LINEs and Alus – the polyA connection. Nat Genet. 1997, 16: 6-7. 10.1038/ng0597-6.

  9. 9.

    Dewannieux M, Esnault C, Heidmann T: LINE-mediated retrotransposition of marked Alu sequences. Nat Genet. 2003, 35: 41-48. 10.1038/ng1223.

  10. 10.

    Feng Q, Moran JV, Kazazian HH, Boeke JD: Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition. Cell. 1996, 87: 905-916. 10.1016/S0092-8674(00)81997-2.

  11. 11.

    Jurka J: Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons. Proc Natl Acad Sci U S A. 1997, 94: 1872-1877. 10.1073/pnas.94.5.1872.

  12. 12.

    Deininger PL, Batzer MA: Alu repeats and human disease. Mol Genet Metab. 1999, 67: 183-193. 10.1006/mgme.1999.2864.

  13. 13.

    Batzer MA, Deininger PL: Alu repeats and human genomic diversity. Nature Reviews Genetics. 2002, 3: 370-379. 10.1038/nrg798.

  14. 14.

    Kapitonov V, Jurka J: The age of Alu subfamilies. J Mol Evol. 1996, 42: 59-65. 10.1007/BF00163212.

  15. 15.

    Labuda D, Striker G: Sequence conservation in Alu evolution. Nucleic Acids Res. 1989, 17: 2477-2491.

  16. 16.

    Shen MR, Batzer MA, Deininger PL: Evolution of the master Alu gene(s). J Mol Evol. 1991, 33: 311-320.

  17. 17.

    Deininger PL, Batzer MA, Hutchison CA, Edgell MH: Master genes in mammalian repetitive DNA amplification. Trends Genet. 1992, 8: 307-311.

  18. 18.

    Britten RJ, Baron WF, Stout DB, Davidson EH: Sources and evolution of human Alu repeated sequences. Proc Natl Acad Sci U S A. 1988, 85: 4770-4774.

  19. 19.

    Jurka J, Smith T: A fundamental division in the Alu family of repeated sequences. Proc Natl Acad Sci U S A. 1988, 85: 4775-4778.

  20. 20.

    Slagel V, Flemington E, Traina-Dorge V, Bradshaw H, Deininger P: Clustering and subfamily relationships of the Alu family in the human genome. Mol Biol Evol. 1987, 4: 19-29.

  21. 21.

    Willard C, Nguyen HT, Schmid CW: Existence of at least three distinct Alu subfamilies. J Mol Evol. 1987, 26: 180-186.

  22. 22.

    Arcot SS, Fontius JJ, Deininger PL, Batzer MA: Identification and analysis of a 'young' polymorphic Alu element. Biochim Biophys Acta. 1995, 1263: 99-102.

  23. 23.

    Batzer MA, Rubin CM, Hellmann-Blumberg U, Alegria-Hartman M, Leeflang EP, Stern JD, Bazan HA, Shaikh TH, Deininger PL, Schmid CW: Dispersion and insertion polymorphism in two small subfamilies of recently amplified human Alu repeats. J Mol Biol. 1995, 247: 418-427. 10.1006/jmbi.1994.0150.

  24. 24.

    Carter AB, Salem AH, Hedges DJNKC, Kimball B, Walker JA, Watkins WS, Jorde LB, Batzer MA: Genome wide analysis of the human Alu Yb lineage. Human Genomics. 2004, 1: 167-178.

  25. 25.

    Otieno AC, Carter AB, Hedges DJ, Walker JA, Ray DA, Garber RK, Anders BA, Stoilova N, Laborde ME, Fowlkes JD, Huang CH, Perodeau B, Batzer M: Analysis of the human Alu Ya-lineage. J Mol Biol. 2004, 342: 109-118. 10.1016/j.jmb.2004.07.016.

  26. 26.

    Roy-Engel AM, Carroll ML, El-Sawy M, Salem AH, Garber RK, Nguyen SV, Deininger PL, Batzer MA: Non-traditional Alu evolution and primate genomic diversity. J Mol Biol. 2002, 316: 1033-1040. 10.1006/jmbi.2001.5380.

  27. 27.

    Shedlock AM, Okada N: SINE insertions: powerful tools for molecular systematics. Bioessays. 2000, 22: 148-160. 10.1002/(SICI)1521-1878(200002)22:2<148::AID-BIES6>3.0.CO;2-Z.

  28. 28.

    Schmitz J, Roos C, Zischler H: Primate phylogeny: molecular evidence from retroposons. Cytogenet Genome Res. 2005, 108: 26-37. 10.1159/000080799.

  29. 29.

    Salem A-H, Ray DA, Batzer MA: Identity by descent and DNA sequence variation of human SINE and LINE elements. Cytogenet Gen Res. 2005, 108: 63-72. 10.1159/000080803.

  30. 30.

    Shedlock AM, Takahashi K, Okada N: SINEs of speciation: tracking lineages with retroposons. Trends Ecol Evol. 2004, 19: 545-553. 10.1016/j.tree.2004.08.002.

  31. 31.

    Leeflang EP, Chesnokov IN, Schmid CW: Mobility of short interspersed repeats within the chimpanzee lineage. J Mol Evol. 1993, 37: 566-572.

  32. 32.

    Hamdi HK, Nishio H, Tavis J, Zielinski R, Dugaiczyk A: Alu-mediated phylogenetic novelties in gene regulation and development. J Mol Biol. 2000, 299: 931-939. 10.1006/jmbi.2000.3795.

  33. 33.

    Hamdi H, Nishio H, Zielinski R, Dugaiczyk A: Origin and phylogenetic distribution of Alu DNA repeats: irreversible events in the evolution of primates. J Mol Biol. 1999, 289: 861-871. 10.1006/jmbi.1999.2797.

  34. 34.

    Martinez J, Dugaiczyk LJ, Zielinski R, Dugaiczyk A: Human genetic disorders, a phylogenetic perspective. J Mol Biol. 2001, 308: 587-596. 10.1006/jmbi.2001.4755.

  35. 35.

    Goodman M, Porter CA, Czelusniak J, Page SL, Schneider H, Shoshani J, Gunnell G, Groves CP: Toward a phylogenetic classification of Primates based on DNA evidence complemented by fossil evidence. Mol Phylogenet Evol. 1998, 9: 585-598. 10.1006/mpev.1998.0495.

  36. 36.

    Salem AH, Ray DA, Xing J, Callinan PA, Myers JS, Hedges DJ, Garber RK, Witherspoon DJ, Jorde LB, Batzer MA: Alu elements and hominid phylogenetics. Proc Natl Acad Sci U S A. 2003, 100: 12787-12791. 10.1073/pnas.2133766100.

  37. 37.

    Ray DA, Hedges DJ, Hall MA, Laborde ME, Anders BA, White BR, Stoilova N, Fowlkes JD, Landry KE, Chemnick LG, Ryder O, Batzer M: Alu Insertion Polymorphisms and Platyrrhine Primate Phylogenetic Relationships. Mol Phylogenet Evol.

  38. 38.

    Roos C, Schmitz J, Zischler H: Primate jumping genes elucidate strepsirrhine phylogeny. Proc Natl Acad Sci U S A. 2004, 101: 10650-10654. 10.1073/pnas.0403852101.

  39. 39.

    Schmitz J, Ohme M, Zischler H: SINE insertions in cladistic analyses and the phylogenetic affiliations of Tarsius bancanus to other primates. Genetics. 2001, 157: 777-784.

  40. 40.

    Singer SS, Schmitz J, Schwiegk C, Zischler H: Molecular cladistic markers in New World monkey phylogeny (Platyrrhini, Primates). Mol Phylogenet Evol. 2003, 26: 490-501. 10.1016/S1055-7903(02)00312-3.

  41. 41.

    Batzer MA, Deininger PL: A human-specific subfamily of Alu sequences. Genomics. 1991, 9: 481-487. 10.1016/0888-7543(91)90414-A.

  42. 42.

    Batzer MA, Gudi VA, Mena JC, Foltz DW, Herrera RJ, Deininger PL: Amplification dynamics of human-specific (HS) Alu family members. Nucleic Acids Res. 1991, 19: 3619-3623.

  43. 43.

    Batzer MA, Kilroy GE, Richard PE, Shaikh TH, Desselle TD, Hoppens CL, Deininger PL: Structure and variability of recently inserted Alu family members. Nucleic Acids Res. 1990, 18: 6793-6798.

  44. 44.

    Arcot SS, DeAngelis MM, Sherry ST, Adamson AW, Lamerdin JE, Deininger PL, Carrano AV, Batzer MA: Identification and characterization of two polymorphic Ya5 Alu repeats. Mutat Res. 1997, 382: 5-11.

  45. 45.

    Arcot SS, Adamson AW, Lamerdin JE, Kanagy B, Deininger PL, Carrano AV, Batzer MA: Alu fossil relics – distribution and insertion polymorphism. Genome Res. 1996, 6: 1084-1092.

  46. 46.

    Arcot SS, Shaikh TH, Kim J, Bennett L, Alegria-Hartman M, Nelson DO, Deininger PL, Batzer MA: Sequence diversity and chromosomal distribution of "young" Alu repeats. Gene. 1995, 163: 273-278. 10.1016/0378-1119(95)00317-Y.

  47. 47.

    Carroll ML, Roy-Engel AM, Nguyen SV, Salem AH, Vogel E, Vincent B, Myers J, Ahmad Z, Nguyen L, Sammarco M, Watkins WS, Henke J, Makalowski W, Jorde LB, Deininger PL, Batzer MA: Large-scale analysis of the Alu Ya5 and Yb8 subfamilies and their contribution to human genomic diversity. J Mol Biol. 2001, 311: 17-40. 10.1006/jmbi.2001.4847.

  48. 48.

    Roy-Engel AM, Carroll ML, Vogel E, Garber RK, Nguyen SV, Salem AH, Batzer MA, Deininger PL: Alu insertion polymorphisms for the study of human genomic diversity. Genetics. 2001, 159: 279-290.

  49. 49.

    Roy AM, Carroll ML, Kass DH, Nguyen SV, Salem AH, Batzer MA, Deininger PL: Recently integrated human Alu repeats: finding needles in the haystack. Genetica. 1999, 107: 149-161. 10.1023/A:1003941704138.

  50. 50.

    Roy AM, Carroll ML, Nguyen SV, Salem AH, Oldridge M, Wilkie AO, Batzer MA, Deininger PL: Potential gene conversion and source genes for recently integrated Alu elements. Genome Res. 2000, 10: 1485-1495. 10.1101/gr.152300.

  51. 51.

    Donaldson CJ, Crapanzano JP, Watson JC, Levine EA, Batzer MA: PROGINS Alu insertion and human genomic diversity. Mutat Res. 2002, 501: 137-141.

  52. 52.

    Arcot SS, Adamson AW, Risch GW, LaFleur J, Robichaux MB, Lamerdin JE, Carrano AV, Batzer MA: High-resolution cartography of recently integrated human chromosome 19-specific Alu fossils. J Mol Biol. 1998, 281: 843-856. 10.1006/jmbi.1998.1984.

  53. 53.

    Jurka J, Krnjajic M, Kapitonov VV, Stenger JE, Kokhanyy O: Active Alu elements are passed primarily through paternal germlines. Theor Popul Biol. 2002, 61: 519-530. 10.1006/tpbi.2002.1602.

  54. 54.

    Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410. 10.1006/jmbi.1990.9999.

  55. 55.

    Batzer MA, Deininger PL, Hellmann-Blumberg U, Jurka J, Labuda D, Rubin CM, Schmid CW, Zietkiewicz E, Zuckerkandl E: Standardized nomenclature for Alu repeats. J Mol Evol. 1996, 42: 3-6. 10.1007/BF00163204.

  56. 56.

    Xing J, Hedges DJ, Han K, Wang H, Cordaux R, Batzer MA: Alu element mutation spectra: molecular clocks and the effect of DNA methylation. J Mol Biol. 2004, 344: 675-682. 10.1016/j.jmb.2004.09.058.

  57. 57.

    Bird AP: DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res. 1980, 8: 1499-1504.

  58. 58.

    Miyamoto MM, Slightom JL, Goodman M: Phylogenetic relations of humans and African apes from DNA sequences in the psi eta-globin region. Science. 1987, 238: 369-373.

  59. 59.

    Gilbert N, Lutz-Prigge S, Moran JV: Genomic deletions created upon LINE-1 retrotransposition. Cell. 2002, 110: 315-325. 10.1016/S0092-8674(02)00828-0.

  60. 60.

    Deininger PL, Batzer MA, Hutchison CA, Edgell MH: Master genes in mammalian repetitive DNA amplification. Trends Genet. 1992, 8: 307-311.

  61. 61.

    Kim J, Deininger PL: Recent amplification of rat ID sequences. J Mol Biol. 1996, 261: 322-327. 10.1006/jmbi.1996.0464.

  62. 62.

    Kim J, Martignetti JA, Shen MR, Brosius J, Deininger P: Rodent BC1 RNA gene as a master gene for ID element amplification. Proc Natl Acad Sci U S A. 1994, 91: 3607-3611.

  63. 63.

    Maeda N, Wu CI, Bliska J, Reneke J: Molecular evolution of intergenic DNA in higher primates: pattern of DNA changes, molecular clock, and evolution of repetitive sequences. Mol Biol Evol. 1988, 5: 1-20.

  64. 64.

    Kass DH, Batzer MA, Deininger PL: Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution. Mol Cell Biol. 1995, 15: 19-25.

  65. 65.

    Batzer MA, Rubin CM, Hellmann-Blumberg U, Alegria-Hartman M, Leeflang EP, Stern JD, Bazan HA, Shaikh TH, Deininger PL, Schmid CW: Dispersion and insertion polymorphism in two small subfamilies of recently amplified human Alu repeats. J Mol Biol. 1995, 247: 418-427. 10.1006/jmbi.1994.0150.

  66. 66.

    Myers JS, Vincent BJ, Udall H, Watkins WS, Morrish TA, Kilroy GE, Swergold GD, Henke J, Henke L, Moran JV, Jorde LB, Batzer MA: A comprehensive analysis of recently integrated human Ta L1 elements. Am J Hum Genet. 2002, 71: 312-326. 10.1086/341718.

  67. 67.

    Cooper DM, Schimenti KJ, Schimenti JC: Factors affecting ectopic gene conversion in mice. Mamm Genome. 1998, 9: 355-360. 10.1007/s003359900769.

  68. 68.

    Kazazian HH, Goodier JL: LINE drive. retrotransposition and genome instability. Cell. 2002, 110: 277-280. 10.1016/S0092-8674(02)00868-1.

  69. 69.

    Vincent BJ, Myers JS, Ho HJ, Kilroy GE, Walker JA, Watkins WS, Jorde LB, Batzer MA: Following the LINEs: an analysis of primate genomic variation at human-specific LINE-1 insertion sites. Molecular Biology and Evolution. 2003, 20: 1338-1348. 10.1093/molbev/msg146.

  70. 70.

    Hayakawa T, Satta Y, Gagneux P, Varki A, Takahata N: Alu-mediated inactivation of the human CMP- N-acetylneuraminic acid hydroxylase gene. Proc Natl Acad Sci U S A. 2001, 98: 11399-11404. 10.1073/pnas.191268198.

  71. 71.

    Salem AH, Kilroy GE, Watkins WS, Jorde LB, Batzer MA: Recently integrated Alu elements and human genomic diversity. Molecular Biology and Evolution. 2003, 20: 1349-1361. 10.1093/molbev/msg150.

  72. 72.

    Battilana J, Bonatto SL, Freitas LB, Hutz MH, Weimer TA, Callegari-Jacques SM, Batzer MA, Hill K, Hurtado AM, Tsuneto LT, Petzl-Erler ML, Salzano FM: Alu insertions versus blood group plus protein genetic variability in four Amerindian populations. Ann Hum Biol. 2002, 29: 334-347. 10.1080/03014460110086835.

  73. 73.

    Sanger F, Nicklen S, Coulson AR: DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977, 74: 5463-5467.

Download references


This research was supported by Louisiana Board of Regents Millennium Trust Health Excellence Fund HEF (2000-05)-05, (2000-05)-01, and (2001-06)-02 (MAB), National Science Foundation BCS-0218338 (MAB) and EPS-0346411 (MAB) and the State of Louisiana Board of Regents Support Fund (MAB).

Author information

Correspondence to Mark A Batzer.

Additional information

Authors' contributions

AS performed all experimental work for the project, shared in the analysis and interpretation of the results and wrote the first draft of the manuscript. DAR provided assistance with analysis and interpretation of the data and in preparing the manuscript for submission. DJH wrote the software used to extract Ye elements and the associated flanking sequences from the human genome draft sequence. JJ provided assistance with the analysis and interpretation of the data and input on late drafts of the manuscript. MAB provided the initial input for the project as well as valuable input on each draft of the manuscript.

Electronic supplementary material

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Salem, A., Ray, D.A., Hedges, D.J. et al. Analysis of the human Alu Ye lineage. BMC Evol Biol 5, 18 (2005) doi:10.1186/1471-2148-5-18

Download citation


  • Gene Conversion
  • World Monkey
  • Gene Conversion Event
  • Primate Genome
  • Mutation Density