Evolutionary history of histone demethylase families: distinct evolutionary patterns suggest functional divergence
© Zhou and Ma; licensee BioMed Central Ltd. 2008
Received: 19 July 2008
Accepted: 24 October 2008
Published: 24 October 2008
Histone methylation can dramatically affect chromatin structure and gene expression and was considered irreversible until recent discoveries of two families of histone demethylases, the KDM1 (previously LSD1) and JmjC domain-containing proteins. These two types of proteins have different functional domains and distinct substrate specificities. Although more and more KDM1 and JmjC proteins have been shown to have histone demethylase activity, our knowledge about their evolution history is limited.
We performed systematic phylogenetic analysis of these histone demethylase families and uncovered different evolutionary patterns. The KDM1 genes have been maintained with a stable low copy number in most organisms except for a few duplication events in flowering plants. In contrast, multiple genes for JmjC proteins with distinct domain architectures were present before the split of major eukaryotic groups, and experienced subsequent birth-and-death evolution. In addition, distinct evolutionary patterns can also be observed between animal and plant histone demethylases in both families. Furthermore, our results showed that some JmjC subfamilies contain only animal genes with specific demethylase activities, but do not have plant members.
Our study improves the understanding about the evolutionary history of KDM1 and JmjC genes and provides valuable insights into their functions. Based on the phylogenetic relationship, we discussed possible histone demethylase activities for several plant JmjC proteins. Finally, we proposed that the observed differences in evolutionary pattern imply functional divergence between animal and plant histone demethylases.
One important mechanism for eukaryotic gene regulation is the epigenetic regulation of chromatin structure. The basic unit of chromatin is the nucleosome, which consists of 146 bp of DNA wrapped around an octamer of four histone proteins, H2A, H2B, H3, and H4. Histone proteins can be modified on the N-terminal tail and the modifications can disrupt the interaction between nucleosomes to prevent the packaging of chromatin into higher order structures; also the modified tails can serve as binding sites for chromatin modifiers, facilitating their functions . Histone modifications, such as methylation and acetylation, have been well studied and many of the sites for the modifications are known . For example, methylation can take place on several lysine residues on histone H3 and H4 (H3K4, H3K9, H3K27, H3K36, etc.) and each lysine residue can be mono-, di- or trimethylated. Histone arginine residues like H3R2 and H4R3 can also be mono- or dimethylated. According to the histone code hypothesis, different histone modifications are linked to distinct functional outcomes: H3K4 and H4K36 methylations are mainly associated with active genes while methylated H3K9 and H3K27 are markers for the repressed chromatin in general [1, 2].
As important mechanisms of gene regulation, histone modifications themselves are under precise control . It is known that many histone modifications are dynamically regulated by enzymes which add or remove the chromatin modifications, with defects in either of these two functions resulting in incorrect activation or repression . However, histone methylation was considered irreversible for a long time. Although histone methylation was first reported in 1964 and the first histone methyltransferase was discovered in 2000 [3, 4], it was not until 2004 that KDM1 [histone lysine (K) demethylase 1; previously known as LSD1 (Lysine specific demethylase)] was identified as the first histone demethylase . KDM1 contains a C-terminal amine oxidase (AOD) domain, which is responsible for the demethylase activity through a flavin adenine dinucleotide (FAD)-dependent mechanism, and an N-terminal SWIRM domain also found in other chromatin regulators . Several studies showed that the SWIRM domain is important for the stability and chromatin targeting of KDM1 [6–8]. Since the chemical mechanism of KDM1 mediated demethylation requires a protonated nitrogen for the reaction to proceed, the substrate specificity of KDM1 is limited to mono- or dimethylated lysine residues . Types of histone methylation shown by biochemical studies to be demethylated by KDM1 include H3K4me1/2, and in the presence of androgen receptor (AR), H3K9me1/2, representing a small subset of all the possible states of histone methylation .
Soon after the identification of KDM1, the Jumonji C (JmjC) domain-containing proteins were discovered to be another family of histone demethylases . The JmjC domain is the catalytic domain and these proteins belong to the Cupin superfamily of Fe(II) and α-ketoglutarate dependent dioxygenases . Unlike KDM1, the JmjC domain-containing proteins that have been tested do not require a protonated nitrogen and are able to reverse all three states of lysine methylation . Members in this family have been shown to be able to remove the methyl groups on H3K4, H3K9, H3K27 and H3K36 . Furthermore, a protein in this family, the JMJD6, functions as a histone arginine demethylase through a similar chemical mechanism . JmjC proteins usually contain additional domains, which are involved in the recognition of methylation (e.g. PHD and Tudor), protein-protein interaction (e.g. F-box) and DNA binding (e.g. C2H2 zinc finger), suggesting a wide range of possible functional interactions.
The number of studies of histone demethylases is increasing rapidly in recent years, with members in both families shown to have important biological functions. KDM1 is an essential gene in mouse  and important for viability and fertility in Drosophila . The Arabidopsis homologs of KDM1, including Flowering Locus D (FLD), regulate the transition to reproductive development [17–20]. Moreover, the JmjC domain-containing proteins are involved in a broad range of processes. For example, the newly identified H3K27 demethylases, UTX and JMJD3, play important roles in regulating Hox gene expression and the animal body development [21, 22]. In addition, JMJD3 was suggested to function in the neural stem cell differentiation . Other JmjC domain-containing proteins are involved in processes such as the X-linked neural development (JARID1C) [24, 25] and embryonic stem cell self-renewal (JHDM2A and JHDM3C) .
While these studies greatly advanced our understanding about the molecular and biological functions of histone demethylases, they only covered a limited fraction of the proteins in the two histone demethylase families. A large number of KDM1 and JmjC-containing proteins remain to be functionally characterized, especially in plants. There are only very few studies on plant histone demethylases. In addition to FLD and its two relatives, only three JmjC domain-containing proteins in Arabidopsis have reported functional studies [27, 28], although it is reasonable to expect that the plant histone demethylases have important functions.
Phylogenetic analyses can provide useful information about evolutionary relationship among related genes from different organisms and clues about possible functions of genes closely related to those with known functions. Furthermore, the differences in evolutionary pattern between gene families or species also suggest different evolutionary pressures and diverged functions. Homologs of both types of histone demethylase have been detected in major groups of eukaryotes [5, 12]. However, to our knowledge, there is no detailed phylogenetic analysis on the KDM1 proteins and only one report exists for JmjC domain-containing proteins from fungi and animals . To gain a better understanding of the evolutionary history of these two histone demethylase families, we performed systematic phylogenetic analyses in this study including sequences from eukaryotes and bacteria. We also discussed the functional implications of the evolutionary patterns we observed in the two families.
Results and discussion
Distribution of AOD domain-containing proteins in major lineages
Number of AOD domain-containing genes and JmjC domain-containing genes included in this study:
AOD domain-containing gene
JmjC domain-containing gene
In fungi, KDM1 was found in the fission yeast Schizosaccharomyces pombe but not the budding yeast Saccharomyces cerevisiae . To investigate the distribution of KDM1 in fungi, we searched for KDM1 genes in completely sequenced fungal genomes in the NCBI database. Our phylogenetic analysis of the fungal KDM1 sequences (see Additional file 2) indicates that one KDM1 gene was present in the ancestor of Ascomycota and it was lost in the common ancestor of the budding yeast and Candida albicans after its divergence from Y. lipolytica. In fission yeast, the two KDM1 genes were shown to have important functions in regulating heterochromatin [31, 32], which is marked by H3K9 methylation. In contrast, the budding yeast does not possess H3K9 methylation and employs a different set of proteins to fulfill the function of fission yeast KDM1 genes in heterochromatin regulation . Furthermore, in the absence of KDM1 homolog, The H3K4 demethylation in the budding yeast is performed by a JmjC domain protein which will be discussed later.
Phylogenetic analyses of AODgenes
Previous studies showed that the human KDM1A protein has an insertion in the AOD domain . The insertion forms a coiled-coil protruding from the AOD domain and is required for the binding between human KDM1 and CoREST [6, 8, 34]. The alignment of AOD amino acid sequences showed that this insertion is conserved among animal KDM1A. The fungal KDM1 proteins also have an insertion at the same position, but the sequences are not similar to the animal insertions. Insertions of much shorter length can also be detected in several plant KDM1 proteins. By contrast, no insertion was found in other AOD proteins.
We used the COILS program to test whether the insertions in different KDM1 proteins are able to form a coiled-coil structure. Consistent with the crystal structure, the insertions in the human KDM1A protein is predicted to form a coiled-coil structure with high support. The same results were obtained for other animal KDM1A proteins, suggesting that the interaction between human KDM1A and CoREST might be conserved in all animals. The lack of insertion in animal KDM1B suggests a functional divergence between these two proteins. Interestingly, although the fungal KDM1 proteins possess an insertion, no coiled-coil is predicted. Several studies showed that the two S. pombe KDM1 proteins form a complex with two PHD domain-containing proteins . Hence, unlike their counterparts in animal, the insertions in fungal KDM1s might be involved in the interaction with these PHD proteins or have other functions. The absence of the insertion in other KDM1 proteins suggests that this insertion is not essential for the histone demethylase activity. Alternatively, the KDM1 proteins without the insertion might have different activities.
Plant and animal KDM1genes have different evolutionary patterns
Several genome-wide studies in fungi and Drosophila suggested that evolutionary patterns of gene families are correlated to their functions [36, 37]. The genes with low volatility in copy number during evolution are usually associated with essential functions. In fact, the KDM1A genes have been shown to be essential in mouse and S. pombe and are involved in important biological processes like meiotic progression and spermatogenesis [31, 38]. Although the function of animal KDM1B genes is not known, the similarity of their evolutionary pattern to that of KDM1A also implies functional conservation and importance. Consistent with this idea, the residues critical for cofactor binding and catalytic activity are conserved in animal KDM1B proteins, suggesting that they have histone demethylase activity. Moreover, the expression of animal KDM1B gene is supported by considerable amount of EST data, although less abundant than KDM1A.
However, several potential substrate-binding residues are substituted in animal KDM1B, suggesting possible changes in substrate specificity of these proteins. Other lines of evidence also support the functional divergence between animal KDM1A and KDM1B genes. Besides the SWIRM and the AOD domain, the animal KDM1B proteins also contain a CW-type zinc finger near the N-terminus. The function of this zinc finger is not well characterized, but it is usually found in proteins which also have other domains involved in DNA binding or protein-protein interaction . Interestingly, this domain is also found in a class of SET domain histone methyltransferases (HMTs), which have H3K36 methyltransferase activity . Therefore, the zinc finger possibly facilitates the recognition of substrates other than methylated H3K4 and H3K9 by KDM1B. Furthermore, the tree in Fig. 2 also shows that the animal KDM1B genes have branches longer than those of KDM1A, indicating that the KDM1B genes have evolved at higher rates. To test this idea, we also conducted Ka/Ks analyses for several pairs of animal genes. The results indicate that: (1) both KDM1A and KDM1B genes were under purifying selection with Ka/Ks ratio lower than 0.1; (2) Ka/Ks values for KDM1B genes were significantly higher than those for KDM1A genes, indicating that the KDM1B genes have evolved under less stringent selective pressure (see Additional file 3). As all these results point to a functional divergence between animal KDM1A and KDM1B, it will be worth investigating the functions of KDM1B proteins in the future.
In plants, our results showed that the copy number of group I KDM1 gene increased from one in the common ancestor of green plants to three in the common ancestor of flowering plants. The functional studies of AtKDM1A, AtKDM1B and AtKDM1C revealed that all three genes regulate the transition to reproductive development [17, 18]. It is possible that the expansion of plant group I might have contributed to the evolutionary success of flowering plants. The initiation of reproductive development is one of the most important developmental events in plants and is regulated by a complex regulatory network . According to the duplication-degeneration-complementation (DDC) model , the duplicate group I KDM1 genes would have undergone sub-functionalization or neo-functionalization, which might help to optimize the regulatory network controlling flowering. In fact, functional studies showed that these three genes have partially redundant functions in the repression of the expression of FLC, a major inhibitor of flowering [17, 18]. In addition, AtKDM1B and AtKDM1C can also affect the expression of FWA, a function independent of that of AtKDM1A.
In contrast, such duplication events were not observed for group II genes, suggesting a difference in the function between group I and group II KDM1 genes. Since there is no reported study on AtKDM1D, it is unclear whether this gene also participates in the regulation of flowering. The expression data from the GENEVESTIGATOR database and our previous microarray results [43, 44] showed that AtKDM1D is expressed at very low levels across all developmental stages. On the other hand, the sequence of AtKDM1D gene is well conserved. Hence it is possible that AtKDM1D has evolved a function in a specific group of cells or for a specific environmental situation.
The origin of SWIRM-AOD architecture
As the KDM1 proteins also contain a SWIRM domain in addition to the AOD domain, how the SWIRM-AOD domain architecture originated is still a question. According to previously proposed mechanisms for the evolution of new gene structures , there might be two possible origins for the first KDM1 gene: (1) an exon shuffling/retrotransposition event that brought these two domain together; (2) de novo evolution of SWIRM domain coding region at the 5' of a preexisting AOD gene. Previous studies have shown that, in spite of its short length, the SWIRM domain is an evolutionarily conserved domain that occurred in proteins with different domain compositions . Therefore the second possibility is unlikely.
In comparison to the few introns observed in plant KDM1 genes, the animal KDM1 genes exhibit completely different patterns of intron positions. Most of the animal KDM1 genes have one or two introns in the SWIRM domain and around 10 introns in the AOD domain, with the exception of insect KDM1 genes, which have only 2 introns in the AOD domain only. Furthermore, although the positions of introns are conserved among animal KDM1A and KDM1B genes respectively, they are different from each other or that of the plant KDM1 genes.
The most parsimonious explanation for the observed intron patterns is that the AOD domain of the ancestral KDM1 gene in the most recent common ancestor of animals and plants was intronless, which supports the origin of KDM1 gene through retrotransposition. After that, the plant KDM1 genes have experienced limited or no intron gains, whereas the animal KDM1 genes accumulated many introns during their evolution. It is still not clear what evolutionary pressure suppressed intron gain in plant KDM1 genes. Nevertheless, these results again clearly support our conclusion that the animal and plant KDM1 genes experienced very different evolutionary history.
Evidence for horizontal gene transfer (HGT) during the evolution of AODgenes
AOD7 and AOD8 both are key enzymes important for the biosynthesis of carotenoids in all photosynthetic organisms, including plants, algae and cyanobacteria . In the carotenoid synthetic pathway, these two enzymes catalyze the two dehydrogenation reactions that convert phytonen to cis-lycopene, which is then converted to all-trans-lycopene by an isomerase . In many nonphotosynthetic organisms like fungi and eubacteria except cyanobacteria, these three steps are replaced by a single reaction that is catalyzed by a distinct enzyme . It is possible that these two AOD genes were recruited to the carotenoid pathway in the common ancestor of cyanobacteria after its divergence from the other eubacteria, and then the photosynthetic eukaryotes acquired these two genes from cyanobacteria through HGT.
The plant-specific AOD6 group, which is closely related to AOD7 and AOD8 groups, also shows a similar pattern. A homolog of plant AOD6 genes can be found in the brown alga T. pseudonana, but not in animals and fungi. The AOD6 proteins are predicted to have a chloroplast-targeting signal, but the actual localization and function remains unknown. In addition, the region of the AOD6 genes encoding the AOD domain is intronless. Our phylogenetic analysis (Fig. 5) showed that eukaryotic AOD6 genes are most closely related to AOD genes from proteobacteria and bacteroidetes, suggesting that the eukaryotic AOD6 genes also originated through HGT from a eubacterium. The results on AOD6, AOD7, and AOD8 phylogeny together reveal an important role of HGT events in the evolution of AOD genes.
JmjC domain-containing proteins
Klose et al. have studied the evolutionary relationship between animal and fungal JmjC domain-containing proteins and they identified seven subfamilies based on both phylogenetic analysis of JmjC domain and domain architecture information . JmjC proteins in six of the seven subfamilies have multiple domains and each family has a distinct domain structure. However, the evolutionary history of plant JmjC proteins is not clear. It has already been reported that two Arabidopsis JmjC proteins have an unusual domain architecture, which is not found in animals and fungi. Hence it will be of interest to elucidate the phylogeny of plant JmjC proteins, and compare between the evolutionary patterns of plant and animal JmjC proteins. To investigate the evolutionary history of plant JmjC domain histone demethylase genes, we retrieved sequences for JmjC domain-containing proteins from various plants and selected animals (Table 1, Additional file 1). We also used the sequences of eukaryotic JmjC domains as queries to search for JmjC domain-containing proteins in prokaryotes. While proteins with limited similarity were found in Eubacteria, no homolog was detected in Archaea. Thus, our results indicate that neither AOD nor JmjC protein is present in Archaea. It is known that some archaea already possess the pseudonucleosomal tetrameric structures . However, the absence of histone demethylase in Archaea is not surprising since archaeal histone proteins do not have N-terminal tails . It is possible that, upon the acquisition of histone tails in early eukaryotes, AOD and JmjC proteins were recruited to serve as chromatin modifying enzymes.
The JmjC domains in most eubacterial JmjC proteins have low support from the SMART analysis and they are annotated as Cupin domain by Pfam with high e-value. Similarly, the JmjC domains in human MINA53, NO66, Drosophila CG2982 proteins retrieved in this study are also annotated as Cupin with strong support. The domain architecture analysis of the collected proteins shows that all the eubacterial proteins have only the JmjC domain. In contrast, most eukaryotic proteins contain other domain(s) besides the JmjC domain. Some domain architectures were observed only in plant members and others only in animals and/or fungi members. From the amino acid sequence alignment, the proteins with the same domain architecture have more similar JmjC domains and regions flanking the JmjC domains.
The birth-and-death evolution of genes encoding JmjC domain proteins
Potential histone demethylase activities of plant JmjC proteins
Among the twelve subfamilies identified in this study, six of them have at least one member with known histone demethylase activities . However, as all functional studies so far are performed in animals and fungi, no plant JmjC protein in these six subfamilies has been reported to have histone demethylase activity. In the absence of biochemical studies, our phylogenetic results can be valuable clues about possible functions of plant JmjC proteins. Here, we propose potential histone demethylase activities for the plant JmjC proteins based on the evolutionary relationships from this study, the conservation of enzymatic active sites and domain architectures.
As described above, three subfamilies have both members with known biochemical activities and members from plants. Two human proteins in KDM3 subfamily, human KDM3A and KDM3B, have been shown to have H3K9me1/2 demethylase activity . The plant KDM3 proteins have the same domain architecture as the animal members, with a zinc finger domain in addition to the JmjC domain. The predicted cofactor binding sites are also conserved in most plant KDM3 proteins, suggesting possible H3K9 demethylase function. Consistent with this idea, a recent study revealed an increased level of H3K9 methylation at the BNS locus in the Arabidopsis kdm3c mutant . However, proteins in the two clades including Arabidopsis KDM3A and KDM3B have variant residues at the co-factor binding sites. Hence it is possible that they have evolved novel functions or become pseudogenes. To investigate these two possibilities, we examined the expression data of Arabidopsis KDM3 genes from our previous microarray analysis  and the GENEVESTIGATOR database . AtKDM3A has the highest expression level among these genes at all the developmental stages, and AtKDM3B is also expressed, suggesting they are functional.
Similar phenomenon can be observed in the PKDM7 subfamily. All proteins in this subfamily are from plant and they contain a JmjN domain, a C5H2-zinc finger domain and C-terminal FYRN and FYRC domains. The cofactor binding sites are conserved in all members but the Arabidopsis and poplar PKDM7A proteins, which have evolved much faster than the other members. Nevertheless, the expression data shows that AtPKDM7A is expressed at a level comparable to AtPKDM7B and AtPKDM7D [43, 44]. Although the AtPKDM7C protein has intact cofactor binding sites, it has no detectable expression. The phylogeny in Fig. 7B shows that the PKDM7 subfamily forms a clade with 99/97 bootstrap supports and is most closely related to the KDM5 subfamily. Several animal KDM5 proteins have been shown to have H3K4me2/3 demethylase activities [24, 55–57]. Therefore, although the plant KDM5 and PKDM7 proteins have distinct domain architecture, they might have H3K4 demethylase activities.
Recently, the human JMJD6 protein was shown to have histone arginine demethylase activity . Although most of the cofactor binding sites are conserved in plant JMJD6 proteins, the first KG binding site has been substituted by Ser and Ala in plant JMJD6A and JMJD6B proteins, respectively. It is unclear whether these substitutions will compromise the histone arginine demethylase activity of plant JMJD6 proteins. We noticed that AtJMJD6A and AtJMJD6B are expressed at a high level at specific developmental stages [43, 44], suggesting that these proteins are functional.
In summary, we have used our phylogenetic results to propose histone demethylase activities for plant JmjC proteins in four subfamilies. The other plant JmjC proteins are either in the plant specific subfamilies PKDM8 and PKDM9 or in the subfamilies PKDM11 and PKDM12 which do not have an animal member with known biochemical activities. Nevertheless, some of these plant JmjC proteins have already been implicated in chromatin modification. For example, an elevated histone H4 acetylation level is observed at the FLC locus in the Arabidopsis pkdm9a mutant, which is phenotypically similar to the kdm1a mutant . Moreover, it is still not clear which proteins are responsible for the H3K9me3, H3K27 and H3K36 demethylation in plants, since the KDM2, KDM4 and KDM6 subfamilies do not have a plant member. One possibility is that these demethylase activities in plants are carried out by some of the other JmjC proteins without a known function.
Functional implications of differences in evolutionary patterns
Our phylogenetic analyses of these two histone demethylase families revealed a significant difference in evolutionary pattern between animal and plant proteins in both families. In the AOD family, the plant group I KDM1 genes were duplicated several times before the diversification of flowering plants and further in specific lineages, whereas the animal KDM1 genes have been maintained with a constant copy number in most species. The animal and plant JmjC domain-containing proteins show similar patterns of evolution in some subfamilies but not in the others. Furthermore, certain types of histone demethylation might be conducted by plant JmjC proteins in subfamilies different from the animal JmjC proteins. These results indicate a divergence in the regulation of histone methylation between animals and plants, consistent with the proposed divergent roles of histone methylation in different organisms . In animals, both H3K9me2 and H3K9me3 are enriched in heterochromatin. However, in Arabidopsis, while the H3K9me2 is considered as a hallmark of heterochromatin, H3K9me3 is mainly found in euchromatin . In animals, H3K9me3 demethylation is catalyzed by members of the KDM4 subfamily [59–62], which lacks plant members, suggesting that H3K9me3 demethylation in plants is catalyzed by proteins from another subfamily. Furthermore, previous phylogenetic analysis also revealed a similar evolutionary pattern in the HDAC families; one of the three major classes of SIR2 family of HDACs has members from animal but not plant, whereas the HD2 family is plant specific . Thus, the functional and regulatory diversification might be a common feature of chromatin modification genes.
In addition, our study also showed distinct evolutionary patterns between the AOD and the JmjC families. Whereas the KDM1 genes only experienced limited duplication events and maintained relatively constant domain architecture in their history, the JmjC gene have evolved several types of domain architectures before the divergence of major eukaryotic groups and underwent further duplication subsequently. As suggested by genome-wide studies in Drosophila and fungi, such divergence in evolutionary patterns may indicate differences in functional essentiality [36, 37]. The KDM1 histone demethylases are reported to have a variety of functions. In animal, KDM1 is required for the ligand-dependent transcriptional activation by nuclear hormone receptors [10, 64]. It also plays important roles in cell differentiation, cell cycle control and spermatogenesis . In addition, most of these functions are also shared by JmjC proteins. For example, members of the KDM3 and KDM4 subfamilies, which possess H3K9 demethylase activities, are also required for the steroid hormone induced gene expression [26, 54, 64] and KDM3A is crucial for spermatogenesis . In addition, the KDM5A, an H3K4me2/me3 demethylase, has overlapping roles with KDM1 in the regulation of cell differentiation . It is also possible that KDM1 has some distinct function from those of JmjC proteins. In fact, the work by Lan et al. showed that KDM1 is retained on the unmethylated H3K4 after its action and suggested a role of KDM1 in the prevention of H3K4 methylation .
Another explanation for the observed evolutionary patterns is that they reflect the difference in evolutionary potential of these two families of histone demethylase. Consistent with this idea, several lines of evidence support a greater functional potential of JmjC proteins than KDM1. First, the JmjC proteins have broader substrate specificity than KDM1 proteins. The requirement of a protonated nitrogen in KDM1-mediated demethylation limits the substrate specificity of KDM1 to mono- and dimethylated lysine residues. By contrast, JmjC proteins are able to demethylate all the three states of lysine methylation. In addition, KDM1 proteins are only known to catalyze the demethylation on H3K4 and H3K9, whereas the substrates for JmjC proteins include H3K4, H3K9, H3K27, H3K36 and even H3R2. Studies on protein structures suggest that the interactions between KDM1 and the substrate are intricate and specific, leading to the exquisite substrate specificity of KDM1. Second, the JmjC domain is much smaller than the AOD domain in KDM1. The JmjC domain in most JmjC proteins are less than 200 amino acids, but the length of the AOD domain is usually more than 400 amino acids. Smaller domain might be combined with the other domains more easily, providing JmjC proteins greater evolutionary adaptability. This is supported by a recent study, which identified the protein domains with relatively high tendency to combine with different domains in eukaryotes . In their list of highly versatile domains, most have 250 or fewer amino acids residues. Hence the short length of JmjC domain may allow JmjC proteins to evolve new functions quickly by combining with new domains, which can promote protein-protein interaction, DNA binding or recognition of chromatin modification.
Apparently convergent evolution of histone demethylases
The fact that the KDM1 and JmjC genes belong to two phylogenetically distinct gene families indicates that, during evolution, these two gene families were recruited to perform the histone demethylation activity independently, providing an example of convergent evolution. In fact, this phenomenon is prevalent among histone modifying enzymes. For instance, the enzymes that catalyze histone methylation belong to two different families, the widespread SET-domain family and the DOT1-related protein family . Similarly, there are three distinct families of HDACs and four different families of HATs . In addition, it is also common that families responsible for the same type of histone modification show distinct evolutionary patterns. While some families are widespread in eukaryotes (e.g. SET family HKMTs and SIR2 family HDACs), others are only present in specific lineages of eukaryotes (e.g. DOT1-related HKMTs in animals and fungi and HD2 family HDACs in plants) [2, 63]. The recruitment of more than one gene families to fulfill the same type of biochemical activities might have allowed these families to evolve specific roles under different circumstances (e.g. cell type, developmental stage, environmental cues) or toward different substrates. The multiple origins of histone modification enzymes have likely contributed to the complexity of epigenetic regulation.
In this paper, we present detailed phylogenetic analyses of the KDM1 and JmjC families, whose members include the recently identified histone demethylases. Our results revealed a possible single origin of all KDM1 histone demethylase genes through the acquisition of the region encoding the SWIRM domain by an AOD gene before the split of major eukaryotic lineages. The KDM1 genes are conserved in both copy number and domain structure during evolution, although a few duplication events were observed in plants. We also identified the contribution of HGT events to the evolution of AOD genes. On the other hand, our analyses JmjC genes showed this family clearly experienced birth-and-death evolution and the subfamilies displayed lineage-specific duplication patterns. According to the evolutionary relationship revealed by our study, we proposed histone demethylase activities for several plant JmjC domain-containing proteins. Furthermore, we found distinct evolutionary patterns of histone demethylases in different lineages and between the KDM1 and JmjC families. These results may imply functional divergence of certain types of histone methylation in different organisms and different classes of function associated with KDM1 and JmjC domain-containing histone demethylases. In summary, our study improves the understanding about the evolution and functions of histone demethylases and provides valuable information for future studies.
The amino acid sequences of the AOD domain in reported KDM1 histone demethylases were retrieved from National Center for Biotechnology Information (NCBI). They were used as queries to search against NCBI, TAIR, TIGR and JGI databases for all possible AOD domain-containing proteins in selected eukaryotic organisms by using TBLASTN with e-value less than e-5 as cut-off. All the new results were used as queries to carry out a second round of BLAST search, until no new sequence was found. The collected protein sequences were then analyzed by SMART and Pfam for domain architecture. The proteins which lack the AOD domain or have an AOD domain with e-value greater than e-10 based on both SMART  and Pfam  results were excluded from the further analyses. The prokaryotic sequences were retrieved from NCBI database through BLASTP by using eukaryotic AOD domain-containing proteins as queries and e-5 as cut-off. The same procedure was followed for the retrieval of JmjC domain-containing proteins. Common names for the following species are shown in the figures: Arabidopsis, Arabidopsis thaliana; Poplar, Populus trichocarpa; Rice, Oryza sativa; Moss, Physcomitrella patens; Human, Homo sapiens; Cow, Bos taurus; Mouse, Mus musculus; Zebrafish, Danio rerio; Fruitfly, Drosophila melanogaster; Mosquito, Anopheles gambiae; Honey bee, Apis mellifera; Beetle, Tribolium castaneum; Sea squirt, Ciona intestinalis; Sea urchin, Strongylocentrotus purpuratus; and Sea anemone, Nematostella vectensis.
A preliminary multiple sequences alignment (MSA) was generated using MUSCLE 3.6  with the default settings and a Neighbor-Joining (NJ) tree was constructed using MEGA 4.0  based on the MSA. According to the tree topology, the sequences were divided into several subgroups. Each subgroup of sequences was aligned by MUSCLE 3.6 separately followed by manual adjustment using GeneDoc 18.104.22.168 . These alignments were then combined using the profile alignment function of ClustalX 1.83 . The codeml program from the PAML 4.1 package is used for the Ka/Ks analyses .
Both NJ and Maximum likelihood (ML) methods were used to perform the phylogenetic analyses. NJ trees were constructed using MEGA 4.0 with "pairwise deletion" option and "Poisson correction" model. Bootstrap test of 1000 replicates was carried out to evaluate the reliability of internal branches. ML trees were generated using PHYML 2.4.4  with 100 nonparametric bootstrap replicates. ProtTest 1.4  was used to select the model and parameters for the ML analysis. In this study, WAG amino acid substitution model was used and both proportion of invariable sites and gamma distribution parameter were estimated from the data. In this study, we presented only the NJ trees with bootstrap values from both NJ and ML analyses.
This work was supported by funds from the Department of Biology, the Eberly School of Sciences, and the Huck Institutes of the Life Sciences, the Pennsylvania State University. H.M. was also partially supported by the School of Life Sciences, Fudan University.
- Kouzarides T: Chromatin modifications and their function. Cell. 2007, 128: 693-705. 10.1016/j.cell.2007.02.005.View ArticlePubMedGoogle Scholar
- Martin C, Zhang Y: The diverse functions of histone lysine methylation. Nat Rev Mol Cell Biol. 2005, 6: 838-849. 10.1038/nrm1761.View ArticlePubMedGoogle Scholar
- Murray K: The occurrence of Epsilon-N-methyl lysine in histones. Biochemistry. 1964, 3: 10-15. 10.1021/bi00889a003.View ArticlePubMedGoogle Scholar
- Rea S, Eisenhaber F, O'Carroll D, Strahl BD, Sun ZW, Schmid M, Opravil S, Mechtler K, Ponting CP, Allis CD, et al: Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature. 2000, 406: 593-599. 10.1038/35020506.View ArticlePubMedGoogle Scholar
- Shi Y, Lan F, Matson C, Mulligan P, Whetstine JR, Cole PA, Casero RA: Histone demethylation mediated by the nuclear amine oxidase homolog LSD1. Cell. 2004, 119: 941-953. 10.1016/j.cell.2004.12.012.View ArticlePubMedGoogle Scholar
- Stavropoulos P, Blobel G, Hoelz A: Crystal structure and mechanism of human lysine-specific demethylase-1. Nat Struct Mol Biol. 2006, 13: 626-632. 10.1038/nsmb1113.View ArticlePubMedGoogle Scholar
- Tochio N, Umehara T, Koshiba S, Inoue M, Yabuki T, Aoki M, Seki E, Watanabe S, Tomo Y, Hanada M, et al: Solution structure of the SWIRM domain of human histone demethylase LSD1. Structure. 2006, 14: 457-468. 10.1016/j.str.2005.12.004.View ArticlePubMedGoogle Scholar
- Chen Y, Yang Y, Wang F, Wan K, Yamane K, Zhang Y, Lei M: Crystal structure of human histone lysine-specific demethylase 1 (LSD1). Proc Natl Acad Sci USA. 2006, 103: 13956-13961. 10.1073/pnas.0606381103.PubMed CentralView ArticlePubMedGoogle Scholar
- Shi Y, Whetstine JR: Dynamic regulation of histone lysine methylation by demethylases. Mol Cell. 2007, 25: 1-14. 10.1016/j.molcel.2006.12.010.View ArticlePubMedGoogle Scholar
- Metzger E, Wissmann M, Yin N, Muller JM, Schneider R, Peters AH, Gunther T, Buettner R, Schule R: LSD1 demethylates repressive histone marks to promote androgen-receptor-dependent transcription. Nature. 2005, 437: 436-439.PubMedGoogle Scholar
- Tsukada Y, Fang J, Erdjument-Bromage H, Warren ME, Borchers CH, Tempst P, Zhang Y: Histone demethylation by a family of JmjC domain-containing proteins. Nature. 2006, 439: 811-816. 10.1038/nature04433.View ArticlePubMedGoogle Scholar
- Clissold PM, Ponting CP: JmjC: cupin metalloenzyme-like domains in jumonji, hairless and phospholipase A2beta. Trends Biochem Sci. 2001, 26: 7-9. 10.1016/S0968-0004(00)01700-X.View ArticlePubMedGoogle Scholar
- Agger K, Christensen J, Cloos PA, Helin K: The emerging functions of histone demethylases. Curr Opin Genet Dev. 2008, 18: 159-168. 10.1016/j.gde.2007.12.003.View ArticlePubMedGoogle Scholar
- Chang B, Chen Y, Zhao Y, Bruick RK: JMJD6 is a histone arginine demethylase. Science. 2007, 318: 444-447. 10.1126/science.1145801.View ArticlePubMedGoogle Scholar
- Wang J, Scully K, Zhu X, Cai L, Zhang J, Prefontaine GG, Krones A, Ohgi KA, Zhu P, Garcia-Bassets I, et al: Opposing LSD1 complexes function in developmental gene activation and repression programmes. Nature. 2007, 446: 882-887. 10.1038/nature05671.View ArticlePubMedGoogle Scholar
- Di Stefano L, Ji JY, Moon NS, Herr A, Dyson N: Mutation of Drosophila Lsd1 disrupts H3-K4 methylation, resulting in tissue-specific defects during development. Curr Biol. 2007, 17: 808-812. 10.1016/j.cub.2007.03.068.PubMed CentralView ArticlePubMedGoogle Scholar
- Jiang D, Yang W, He Y, Amasino RM: Arabidopsis relatives of the human Lysine-Specific Demethylase1 repress the expression of FWA and FLOWERING LOCUS C and thus promote the floral transition. Plant Cell. 2007, 19: 2975-2987. 10.1105/tpc.107.052373.PubMed CentralView ArticlePubMedGoogle Scholar
- Liu F, Quesada V, Crevillen P, Baurle I, Swiezewski S, Dean C: The Arabidopsis RNA-binding protein FCA requires a Lysine-Specific Demethylase 1 homolog to downregulate FLC. Mol Cell. 2007, 28: 398-407. 10.1016/j.molcel.2007.10.018.View ArticlePubMedGoogle Scholar
- Krichevsky A, Gutgarts H, Kozlovsky SV, Tzfira T, Sutton A, Sternglanz R, Mandel G, Citovsky V: C2H2 zinc finger-SET histone methyltransferase is a plant-specific chromatin modifier. Dev Biol. 2007, 303: 259-269. 10.1016/j.ydbio.2006.11.012.PubMed CentralView ArticlePubMedGoogle Scholar
- He Y, Michaels SD, Amasino RM: Regulation of flowering time by histone acetylation in Arabidopsis. Science. 2003, 302: 1751-1754. 10.1126/science.1091109.View ArticlePubMedGoogle Scholar
- Agger K, Cloos PA, Christensen J, Pasini D, Rose S, Rappsilber J, Issaeva I, Canaani E, Salcini AE, Helin K: UTX and JMJD3 are histone H3K27 demethylases involved in HOX gene regulation and development. Nature. 2007, 449: 731-734. 10.1038/nature06145.View ArticlePubMedGoogle Scholar
- Lan F, Bayliss PE, Rinn JL, Whetstine JR, Wang JK, Chen S, Iwase S, Alpatov R, Issaeva I, Canaani E, et al: A histone H3 lysine 27 demethylase regulates animal posterior development. Nature. 2007, 449: 689-694. 10.1038/nature06192.View ArticlePubMedGoogle Scholar
- Jepsen K, Solum D, Zhou T, McEvilly RJ, Kim HJ, Glass CK, Hermanson O, Rosenfeld MG: SMRT-mediated repression of an H3K27 demethylase in progression from neural stem cell to neuron. Nature. 2007, 450: 415-419. 10.1038/nature06270.View ArticlePubMedGoogle Scholar
- Iwase S, Lan F, Bayliss P, de la Torre-Ubieta L, Huarte M, Qi HH, Whetstine JR, Bonni A, Roberts TM, Shi Y: The X-linked mental retardation gene SMCX/JARID1C defines a family of histone H3 lysine 4 demethylases. Cell. 2007, 128: 1077-1088. 10.1016/j.cell.2007.02.017.View ArticlePubMedGoogle Scholar
- Tahiliani M, Mei P, Fang R, Leonor T, Rutenberg M, Shimizu F, Li J, Rao A, Shi Y: The histone H3K4 demethylase SMCX links REST target genes to X-linked mental retardation. Nature. 2007, 447: 601-605. 10.1038/nature05823.View ArticlePubMedGoogle Scholar
- Loh YH, Zhang W, Chen X, George J, Ng HH: Jmjd1a and Jmjd2c histone H3 Lys 9 demethylases regulate self-renewal in embryonic stem cells. Genes Dev. 2007, 21: 2545-2557. 10.1101/gad.1588207.PubMed CentralView ArticlePubMedGoogle Scholar
- Noh B, Lee SH, Kim HJ, Yi G, Shin EA, Lee M, Jung KJ, Doyle MR, Amasino RM, Noh YS: Divergent roles of a pair of homologous jumonji/zinc-finger-class transcription factor proteins in the regulation of Arabidopsis flowering time. Plant Cell. 2004, 16: 2601-2613. 10.1105/tpc.104.025353.PubMed CentralView ArticlePubMedGoogle Scholar
- Saze H, Shiraishi A, Miura A, Kakutani T: Control of genic DNA methylation by a jmjC domain-containing protein in Arabidopsis thaliana. Science. 2008, 319: 462-465. 10.1126/science.1150987.View ArticlePubMedGoogle Scholar
- Klose RJ, Kallin EM, Zhang Y: JmjC-domain-containing proteins and histone demethylation. Nat Rev Genet. 2006, 7: 715-727. 10.1038/nrg1945.View ArticlePubMedGoogle Scholar
- Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, et al: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313: 1596-1604. 10.1126/science.1128691.View ArticlePubMedGoogle Scholar
- Lan F, Zaratiegui M, Villen J, Vaughn MW, Verdel A, Huarte M, Shi Y, Gygi SP, Moazed D, Martienssen RA: S. pombe LSD1 homologs regulate heterochromatin propagation and euchromatic gene transcription. Mol Cell. 2007, 26: 89-101. 10.1016/j.molcel.2007.02.023.View ArticlePubMedGoogle Scholar
- Gordon M, Holt DG, Panigrahi A, Wilhelm BT, Erdjument-Bromage H, Tempst P, Bahler J, Cairns BR: Genome-wide dynamics of SAPHIRE, an essential complex for gene activation and chromatin boundaries. Mol Cell Biol. 2007, 27: 4058-4069. 10.1128/MCB.02044-06.PubMed CentralView ArticlePubMedGoogle Scholar
- Rusche LN, Kirchmaier AL, Rine J: The establishment, inheritance, and function of silenced chromatin in Saccharomyces cerevisiae. Annu Rev Biochem. 2003, 72: 481-516. 10.1146/annurev.biochem.72.121801.161547.View ArticlePubMedGoogle Scholar
- Yang M, Gocke CB, Luo X, Borek D, Tomchick DR, Machius M, Otwinowski Z, Yu H: Structural basis for CoREST-dependent demethylation of nucleosomes by the human LSD1 histone demethylase. Mol Cell. 2006, 23: 377-387. 10.1016/j.molcel.2006.07.012.View ArticlePubMedGoogle Scholar
- Nicolas E, Lee MG, Hakimi MA, Cam HP, Grewal SI, Shiekhattar R: Fission yeast homologs of human histone H3 lysine 4 demethylase regulate a common set of genes with diverse functions. J Biol Chem. 2006, 281: 35983-35988. 10.1074/jbc.M606349200.View ArticlePubMedGoogle Scholar
- Hahn MW, Han MV, Han SG: Gene family evolution across 12 Drosophila genomes. PLoS Genet. 2007, 3: e197-10.1371/journal.pgen.0030197.PubMed CentralView ArticlePubMedGoogle Scholar
- Wapinski I, Pfeffer A, Friedman N, Regev A: Natural history and evolutionary principles of gene duplication in fungi. Nature. 2007, 449: 54-61. 10.1038/nature06107.View ArticlePubMedGoogle Scholar
- Godmann M, Auger V, Ferraroni-Aguiar V, Di Sauro A, Sette C, Behr R, Kimmins S: Dynamic regulation of histone H3 methylation at lysine 4 in mammalian spermatogenesis. Biol Reprod. 2007, 77: 754-764. 10.1095/biolreprod.107.062265.View ArticlePubMedGoogle Scholar
- Perry J, Zhao Y: The CW domain, a structural module shared amongst vertebrates, vertebrate-infecting parasites and higher plants. Trends Biochem Sci. 2003, 28: 576-580. 10.1016/j.tibs.2003.09.007.View ArticlePubMedGoogle Scholar
- Springer NM, Napoli CA, Selinger DA, Pandey R, Cone KC, Chandler VL, Kaeppler HF, Kaeppler SM: Comparative analysis of SET domain proteins in maize and Arabidopsis reveals multiple duplications preceding the divergence of monocots and dicots. Plant Physiol. 2003, 132: 907-925. 10.1104/pp.102.013722.PubMed CentralView ArticlePubMedGoogle Scholar
- Mouradov A, Cremer F, Coupland G: Control of flowering time: interacting pathways as a basis for diversity. Plant Cell. 2002, 14 (Suppl): S111-130.PubMed CentralPubMedGoogle Scholar
- Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999, 151: 1531-1545.PubMed CentralPubMedGoogle Scholar
- Zimmermann P, Hirsch-Hoffmann M, Hennig L, Gruissem W: GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox. Plant Physiol. 2004, 136: 2621-2632. 10.1104/pp.104.046367.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang X, Feng B, Zhang Q, Zhang D, Altman N, Ma H: Genome-wide expression profiling and identification of gene activities during early flower development in Arabidopsis. Plant Mol Biol. 2005, 58: 401-419. 10.1007/s11103-005-5434-6.View ArticlePubMedGoogle Scholar
- Long M, Betran E, Thornton K, Wang W: The origin of new genes: glimpses from the young and old. Nat Rev Genet. 2003, 4: 865-875. 10.1038/nrg1204.View ArticlePubMedGoogle Scholar
- Aravind L, Iyer LM: The SWIRM domain: a conserved module found in chromosomal proteins points to novel chromatin-modifying activities. Genome Biol. 2002, 3: RESEARCH0039-10.1186/gb-2002-3-8-research0039.PubMed CentralView ArticlePubMedGoogle Scholar
- Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, et al: The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007, 449: 463-467. 10.1038/nature06148.View ArticlePubMedGoogle Scholar
- Hirschberg J: Carotenoid biosynthesis in flowering plants. Curr Opin Plant Biol. 2001, 4: 210-218. 10.1016/S1369-5266(00)00163-1.View ArticlePubMedGoogle Scholar
- Timmis JN, Ayliffe MA, Huang CY, Martin W: Endosymbiotic gene transfer: organelle genomes forge eukaryotic chromosomes. Nat Rev Genet. 2004, 5: 123-135. 10.1038/nrg1271.View ArticlePubMedGoogle Scholar
- Armbrust EV, Berges JA, Bowler C, Green BR, Martinez D, Putnam NH, Zhou S, Allen AE, Apt KE, Bechner M, et al: The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism. Science. 2004, 306: 79-86. 10.1126/science.1101156.View ArticlePubMedGoogle Scholar
- Sandmann G: Molecular evolution of carotenoid biosynthesis from bacteria to plants. Physiologia Plantarum. 2002, 116: 431-440. 10.1034/j.1399-3054.2002.1160401.x.View ArticleGoogle Scholar
- Malik HS, Henikoff S: Phylogenomics of the nucleosome. Nat Struct Biol. 2003, 10: 882-891. 10.1038/nsb996.View ArticlePubMedGoogle Scholar
- Allis CD, Berger SL, Cote J, Dent S, Jenuwien T, Kouzarides T, Pillus L, Reinberg D, Shi Y, Shiekhattar R, et al: New nomenclature for chromatin-modifying enzymes. Cell. 2007, 131: 633-636. 10.1016/j.cell.2007.10.039.View ArticlePubMedGoogle Scholar
- Yamane K, Toumazou C, Tsukada Y, Erdjument-Bromage H, Tempst P, Wong J, Zhang Y: JHDM2A, a JmjC-containing H3K9 demethylase, facilitates transcription activation by androgen receptor. Cell. 2006, 125: 483-495. 10.1016/j.cell.2006.03.027.View ArticlePubMedGoogle Scholar
- Christensen J, Agger K, Cloos PA, Pasini D, Rose S, Sennels L, Rappsilber J, Hansen KH, Salcini AE, Helin K: RBP2 belongs to a family of demethylases, specific for tri-and dimethylated lysine 4 on histone 3. Cell. 2007, 128: 1063-1076. 10.1016/j.cell.2007.02.003.View ArticlePubMedGoogle Scholar
- Yamane K, Tateishi K, Klose RJ, Fang J, Fabrizio LA, Erdjument-Bromage H, Taylor-Papadimitriou J, Tempst P, Zhang Y: PLU-1 is an H3K4 demethylase involved in transcriptional repression and breast cancer cell proliferation. Mol Cell. 2007, 25: 801-812. 10.1016/j.molcel.2007.03.001.View ArticlePubMedGoogle Scholar
- Lee MG, Norman J, Shilatifard A, Shiekhattar R: Physical and functional association of a trimethyl H3K4 demethylase and Ring6a/MBLR, a polycomb-like protein. Cell. 2007, 128: 877-887. 10.1016/j.cell.2007.02.004.View ArticlePubMedGoogle Scholar
- Fuchs J, Demidov D, Houben A, Schubert I: Chromosomal histone modification patterns – from conservation to diversity. Trends Plant Sci. 2006, 11: 199-208. 10.1016/j.tplants.2006.02.008.View ArticlePubMedGoogle Scholar
- Cloos PA, Christensen J, Agger K, Maiolica A, Rappsilber J, Antal T, Hansen KH, Helin K: The putative oncogene GASC1 demethylates tri- and dimethylated lysine 9 on histone H3. Nature. 2006, 442: 307-311. 10.1038/nature04837.View ArticlePubMedGoogle Scholar
- Fodor BD, Kubicek S, Yonezawa M, O'Sullivan RJ, Sengupta R, Perez-Burgos L, Opravil S, Mechtler K, Schotta G, Jenuwein T: Jmjd2b antagonizes H3K9 trimethylation at pericentric heterochromatin in mammalian cells. Genes Dev. 2006, 20: 1557-1562. 10.1101/gad.388206.PubMed CentralView ArticlePubMedGoogle Scholar
- Klose RJ, Yamane K, Bae Y, Zhang D, Erdjument-Bromage H, Tempst P, Wong J, Zhang Y: The transcriptional repressor JHDM3A demethylates trimethyl histone H3 lysine 9 and lysine 36. Nature. 2006, 442: 312-316. 10.1038/nature04853.View ArticlePubMedGoogle Scholar
- Whetstine JR, Nottke A, Lan F, Huarte M, Smolikov S, Chen Z, Spooner E, Li E, Zhang G, Colaiacovo M, et al: Reversal of histone lysine trimethylation by the JMJD2 family of histone demethylases. Cell. 2006, 125: 467-481. 10.1016/j.cell.2006.03.028.View ArticlePubMedGoogle Scholar
- Pandey R, Muller A, Napoli CA, Selinger DA, Pikaard CS, Richards EJ, Bender J, Mount DW, Jorgensen RA: Analysis of histone acetyltransferase and histone deacetylase families of Arabidopsis thaliana suggests functional diversification of chromatin modification among multicellular eukaryotes. Nucleic Acids Res. 2002, 30: 5036-5055. 10.1093/nar/gkf660.PubMed CentralView ArticlePubMedGoogle Scholar
- Garcia-Bassets I, Kwon YS, Telese F, Prefontaine GG, Hutt KR, Cheng CS, Ju BG, Ohgi KA, Wang J, Escoubet-Lozach L, et al: Histone methylation-dependent mechanisms impose ligand dependency for gene activation by nuclear receptors. Cell. 2007, 128: 505-518. 10.1016/j.cell.2006.12.038.PubMed CentralView ArticlePubMedGoogle Scholar
- Okada Y, Scott G, Ray MK, Mishina Y, Zhang Y: Histone demethylase JHDM2A is critical for Tnp1 and Prm1 transcription and spermatogenesis. Nature. 2007, 450: 119-123. 10.1038/nature06236.View ArticlePubMedGoogle Scholar
- Lan F, Collins RE, De Cegli R, Alpatov R, Horton JR, Shi X, Gozani O, Cheng X, Shi Y: Recognition of unmethylated histone H3 lysine 4 links BHC80 to LSD1-mediated gene repression. Nature. 2007, 448: 718-722. 10.1038/nature06034.PubMed CentralView ArticlePubMedGoogle Scholar
- Basu MK, Carmel L, Rogozin IB, Koonin EV: Evolution of protein domain promiscuity in eukaryotes. Genome Res. 2008, 18: 449-461. 10.1101/gr.6943508.PubMed CentralView ArticlePubMedGoogle Scholar
- Schultz J, Milpetz F, Bork P, Ponting CP: SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci USA. 1998, 95: 5857-5864. 10.1073/pnas.95.11.5857.PubMed CentralView ArticlePubMedGoogle Scholar
- Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, Durbin R, et al: Pfam: clans, web tools and services. Nucleic Acids Res. 2006, 34: D247-251. 10.1093/nar/gkj149.PubMed CentralView ArticlePubMedGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.PubMed CentralView ArticlePubMedGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.View ArticlePubMedGoogle Scholar
- Nicholas KB, Nicholas HB, Deerfield DW: GeneDoc: Analysis and Visualization of Genetic Variation. EMBNEWNEWS. 1997, 4: 14-Google Scholar
- Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25: 4876-4882. 10.1093/nar/25.24.4876.PubMed CentralView ArticlePubMedGoogle Scholar
- Yang Z: PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24: 1586-1591. 10.1093/molbev/msm088.View ArticlePubMedGoogle Scholar
- Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704. 10.1080/10635150390235520.View ArticlePubMedGoogle Scholar
- Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21: 2104-2105. 10.1093/bioinformatics/bti263.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.