The evolutionary diversification of LSF and Grainyhead transcription factors preceded the radiation of basal animal lineages
© Traylor-Knowles et al; licensee BioMed Central Ltd. 2010
Received: 23 May 2009
Accepted: 18 April 2010
Published: 18 April 2010
The transcription factors of the LSF/Grainyhead (GRH) family are characterized by the possession of a distinctive DNA-binding domain that bears no clear relationship to other known DNA-binding domains, with the possible exception of the p53 core domain. In triploblastic animals, the LSF and GRH subfamilies have diverged extensively with respect to their biological roles, general expression patterns, and mechanism of DNA binding. For example, Grainyhead (GRH) homologs are expressed primarily in the epidermis, and they appear to play an ancient role in maintaining the epidermal barrier. By contrast, LSF homologs are more widely expressed, and they regulate general cellular functions such as cell cycle progression and survival in addition to cell-lineage specific gene expression.
To illuminate the early evolution of this family and reconstruct the functional divergence of LSF and GRH, we compared homologs from 18 phylogenetically diverse taxa, including four basal animals (Nematostella vectensis, Vallicula multiformis, Trichoplax adhaerens, and Amphimedon queenslandica), a choanoflagellate (Monosiga brevicollis) and several fungi. Phylogenetic and bioinformatic analyses of these sequences indicate that (1) the LSF/GRH gene family originated prior to the animal-fungal divergence, and (2) the functional diversification of the LSF and GRH subfamilies occurred prior to the divergence between sponges and eumetazoans. Aspects of the domain architecture of LSF/GRH proteins are well conserved between fungi, choanoflagellates, and metazoans, though within the Metazoa, the LSF and GRH families are clearly distinct. We failed to identify a convincing LSF/GRH homolog in the sequenced genomes of the algae Volvox carteri and Chlamydomonas reinhardtii or the amoebozoan Dictyostelium purpureum. Interestingly, the ancestral GRH locus has become split into two separate loci in the sea anemone Nematostella, with one locus encoding a DNA binding domain and the other locus encoding the dimerization domain.
In metazoans, LSF and GRH proteins play a number of roles that are essential to achieving and maintaining multicellularity. It is now clear that this protein family already existed in the unicellular ancestor of animals, choanoflagellates, and fungi. However, the diversification of distinct LSF and GRH subfamilies appears to be a metazoan invention. Given the conserved role of GRH in maintaining epithelial integrity in vertebrates, insects, and nematodes, it is noteworthy that the evolutionary origin of Grh appears roughly coincident with the evolutionary origin of the epithelium.
In triploblastic animals, LSF/Grainyhead (GRH) transcription factors perform a number of functions essential to both development and homeostasis. They are involved in regulation of the cell cycle, cell division, and cellular differentiation in a range of developmental and non-developmental contexts [1–14].
The LSF/Grainyhead family is split into the LSF/CP2 subfamily and the Grainyhead (GRH) subfamily, which can be distinguished by their distinctive oligomerization domains and differences in their oligomerization behavior [10, 15, 16]. GRH binds to DNA as a dimer, whereas LSF binds as a tetramer [17, 18]. The DNA binding regions in both protein subfamilies show a large amount of conservation [17, 18], but each has distinct transcriptional targets. GRH binds to the DNA sequence: (A/T)C(A/C/T)(G/T)GTT(C/G/T), whereas LSF binds to a direct repeat with the consensus sequence of N(C/G/T)N(C/G/T)(C/G)N(C/T)N(C/G/T)NN(C/G/T)(C/G/T)N(A/C/G)N [15, 16, 18, 19]. LSF proteins can also be distinguished from GRH by the possession of a sterile alpha motif (SAM) . Members of both LSF and GRH subfamilies were previously identified in vertebrates, arthropods, and nematodes, so the origin of the family and the diversification into subfamilies is known to predate the evolutionary split between protostomes and deuterostomes . Recently, a common origin for the LSF/GRH family and the p53 family has been proposed based on similarities in the folding of their DNA-binding domains .
The differences in the molecular functions of LSF and GRH are accompanied by important differences in their biological roles. In both vertebrates and protostome invertebrates, GRH proteins are involved in the development and maintenance of epithelial integrity . For example, in mice, grh is required during embryogenesis where it is expressed exclusively in the developing ectodermal epithelium . Furthermore, embryonic mice lacking grhl-3 exhibit insufficient wound repair and abnormal skin barrier formation leading to excessive postnatal water loss. The water loss is associated with reduced expression of the gene encoding TGase1, an enzyme that promotes cross-linking of parts of the stratum corneum, thus preventing the movement of water and solutes . Likewise, in Xenopus, a Grh-like gene (Xgrh1) has been implicated in the development of the epidermis . One of its primary targets is epidermal keratin. In morpholino studies, knockdown of Xgrhl led to loss of surface structures and pigmentation as well as neck and eye defects associated with epidermal instability . In Drosophila, GRH plays a critical role in epithelial integrity that is analogous to and perhaps homologous with the role played in vertebrates--GRH maintains the tension of the Drosophila cuticle, and it induces cuticle development and cuticle repair following injury [23, 24]. Similarly, the CeGrh1 protein of C. elegans appears to be required for proper cuticle formation during development, as its knockdown leads to soft, malformed cuticles and embryonic lethality .
In addition to its widely conserved role in maintaining epidermal integrity, Grh is also involved in the specification and development of the CNS in both Drosophila and mice [9, 25]. Additionally, in mice, Grh mutants exhibit defects of the salivary and kidney ducts and eyelid closure [26–28], and in humans, a single nucleotide polymorphism found in GRHL2 is associated with age-related hearing impairment .
The biological roles of LSF are diverse and they have clearly diverged from those of GRH, at least in mammals, where the function of LSF has been well characterized. LSF is ubiquitously expressed . It appears to play a role in liver function, eye development, erythropoesis, neural and immune function, regulation of the cell cycle progression, and cell survival [8, 16, 31–42].
When the ancestors of the LSF and GRH subfamilies first originated via a gene duplication event from their common ancestor, they would presumably have had identical or largely overlapping functions. However, at least in extant mammals, LSF and GRH have diverged extensively with respect to their biological roles. The basis for this functional diversification is not clear. The common ancestral functional repertoire of LSF and GRH may have become "subfunctionalized" in the two descendants . Alternatively or in concert, LSF and GRH may have independently acquired novel functions since their split from a common ancestral gene ("neofunctionalization") [43, 44].
If we wish to reconstruct the initial functional diversification of LSF and GRH, it is necessary to identify the ancestor in which the original gene duplication occurred. This may permit us to infer the functional repertoire of the LSF/GRH ancestor, and to compare this ancestral condition with the function of LSF and GRH in a phylogenetic progression of extant taxa. By comparing vertebrates, arthropods, and nematodes, Venkatesan and co-workers previously showed that the origin of distinct LSF and GRH subfamilies predated the diversification of triploblasts into distinct protostome and deuterostome lineages . With the recent availability of sequenced genomes from several basal metazoans, a choanoflagellate, and more distantly related fungal outgroups, we can track the evolution of the LSF/GRH family into the much more distant past. In this study, we report the identification of LSF/GRH family members in 24 previously unreported species. Through a combination of genome prospecting and phylogenetic analysis, we show that the original gene duplication that produced the LSF and GRH subfamilies occurred prior to the evolutionary radiation of basal animal lineages (e.g., Bilateria, Cnidaria, Ctenophora, Porifera, and Placozoa). Interestingly, the GRH protein of the sea anemone Nematostella vectensis, a representative cnidarian, appears to have split into two distinct loci. We also identify six protein motifs that are widely shared between the LSF and GRH subfamilies of metazoans, all of which can be traced to the common ancestor of metazoans and fungi. In addition, there is a single motif that appears unique to the LSF subfamily.
Identification of putative LSF/GRH homologs in animals, choanoflagellates, and fungi
LSF/GRH sequences identified by BLAST searches.
1,2 *Monosiga brevicollis
jgi| Monbr1| 29664| fgenesh2_pg.scaffold_37000034
1,2 *Aspergillus niger
jgi| Aspni1| 174592| e_gw1.2.1262.1
1,2 *Aspergillus niger
jgi| Aspni1| 206941| estExt_GeneWisePlus.C_21219
2 *Mycosphaerella fijiensis
jgi| Mycfi1| 87221| estExt_fgenesh1_pg.C_60133
2 *Mycosphaerella graminicola
jgi| Mycgr3| 76636| estExt_Genewise1Plus.C_chr_110062
2 *Trichoderma virens
jgi| Trive1| 37466| e_gw220.127.116.11
jgi| Phchr1| 428| fgenesh1_pg.C_scaffold_1000428
jgi| Sporo1| 4513| gw1.5.455.1
1,2 *Phycomyces blakesleeanus
jgi| Phybl1| 75515| estExt_fgeneshPB_pg.C_10328
1,2 *Phycomyces blakesleeanus
jgi| Phybl1| 63864| fgeneshPB_pg.8__95
jgi| Capca1| 198092| fgenesh1_pg.C_scaffold_3000030
>jgi| Capca1| 222821| estExt_fgenesh1_pg.C_600043
1,2 *Daphnia pulex
jgi| Dappu1| 64332| e_gw18.104.22.168
1,2 *Daphnia pulex
jgi| Dappu1| 192185| estExt_Genewise1Plus.C_50175
1,2 Drosophilia melanogaster
1,2 *Drosophilia melanogaster
1,2 *Branchiostoma floridae
jgi| Brafl1| 106909| fgenesh2_pg.scaffold_554000005
1,2 *Branchiostoma floridae
jgi| Brafl1| 129980| estExt_fgenesh2_pg.C_4020027
1,2 *Ciona intestinalis
jgi| Cioin2| 262638| gw1.01q.820.1
1,2 *Ciona intestinalis
jgi| Cioin2| 262956| gw1.03q.546.1
jgi| Takru4| 570042| e_gw22.214.171.124
jgi| Takru4| 710919| fgh5_pm.C_scaffold_60000044
1,2 Homo sapiens
1,2 Homo sapiens
gb| AAF32276.1| AF198489_1
1,2 *Nematostella vectensis
jgi| Nemve1| 95157| e_gw.38.94.1
1,2 *Nematostella vectensis
jgi| Nemve1| 189242| estExt_GenewiseH_1.C_1380090
1,2* Vallicula multiformis
1,2 *Lottia gigantea
jgi| Lotgi1| 157385| fgenesh2_pg.C_sca_14000008
1,2 *Lottia gigantea
jgi| Lotgi1| 166840| fgenesh2_pg.C_sca_61000088
2 *Trichoplax adhaerens
jgi| Triad1| 25702| e_gw1.5.1214.1
1,2 *Amphimedon queenslandica
1,2 *Amphimedon queenslandica
jgi| Selmo1| 411196| fgenesh2_pg.C_scaffold_14000249
jgi| Chlre3| 194996| estExt_fgenesh2_pg.C_840034
The cnidarian Nematostella is unusual in that its GRH homolog appears to be split between two loci. Nev-GRH1, which had been reported previously , emerged as a strong match to the entire human GRHL2 protein. However, as Nev-GRH1 appears to be truncated relative to the human protein, we conducted a separate BLAST search using only the carboxy terminal region of the human protein as a query sequence. Nev-GRH2, which was identified in this second BLAST search, is a strong match to the carboxy terminal portion of the human GRHL2 protein.
Among choanoflagellates and fungi, we were also able to identify members of the LSF/GRH family, but clear evidence for distinct LSF and GRH family members was less compelling. The sequenced genome of the choanoflagellate Monosiga brevicollis appears to encode only a single LSF/GRH related gene. Likewise, we could identify only a single LSF/GRH homolog in the genomes of four fungi (Mycosphaerella fijiensis, Mycosphaerella graminicola, Phanerochaete chrysosporium and Trichoderma virens). We did identify two LSF/GRH-related sequences in Aspergillus niger (phylum Ascomycota) and Phycomyces brevicollis (phylum Zygomycota), but in both cases, the two sequences appeared most similar to each other, suggesting that they might have resulted from lineage-specific gene duplications.
In members of the kingdom Plantae, evidence of LSF/GRH family members was far more tenuous. Using a less stringent E-value cut off (e-1), we identified two proteins with limited resemblance to LSF/GRH in Selaginella moellendorffii, a lycophyte. In addition, we also identified a protein with similarity to LSF in the green algae Chlamydomonas reinhardtii. Using this Chlamydomonas sequence to query the genome of Volvox carteri, we identified the corresponding gene in this alga.
Protein motif identification
The motif analysis reveals strong similarities between the pairs of sequences identified in each of the two fungal species. The two proteins from the ascomycote fungus, Aspergillus, are nearly identical to each with respect to motif architecture, and they can be distinguished from other sequences by the possession of motifs 14, 16, and 17. Likewise, the two sequences from the zygomycote fungus, Phycomyces, are most similar to each other with respect to the arrangement of conserved motifs.
The motif analysis also supports the conclusion that the GRH locus of the cnidarian Nematostella has experienced a split. Nev-GRH1 encompasses six conserved motifs (4, 5, 6, 9, 10, 11), and these motifs occupy the same relative positions as in the GRH proteins of fruit fly and sponge. Nev-GRH2 encompasses conserved motifs 18 and 19, which occupy the same relative position in most other metazoan GRH sequences.
The maximum-likelihood analysis (Additional file 2) supports most of the major divisions that appear on the neighbor-joining tree. The animal sequences and fungal sequences comprise discrete subtrees. The LSF sequences form a putative clade, and within this clade, the LSF sequences of triploblasts cluster together to the exclusion of LSF sequences from diploblasts. Likewise, the GRH sequences of triploblasts also group together. However, the putative GRH sequences of diploblastic animals do not form a monophyletic group with the GRH sequences of triploblasts as they do on the neighbor-joining tree. Instead, the sponge and ctenophore sequences appear more closely related to the LSF clade, while the precise position of the anemone and placozoan GRH sequences is not resolved.
On both the neighbor-joining tree and the maximum-likelihood tree, bootstrap support for individual nodes is generally low because the analyses are based on a small number of highly conserved residues. However, both phylogenies are consistent with divisions between animal and fungal sequences and between LSF and GRH sequences, the same divisions that are implied by the motif analysis.
Nev-GRH1 and Nev-GRH2
Potential homologs in plants?
Given that the origin of the LSF/GRH family predates the divergence of animals and fungi, we searched for LSF and GRH homologs in amoebozoans and plants to see if this gene family might predate the origin of opisthokonts. Plant genomes and amoebozoan genomes do not appear to encode any proteins with extensive similarity to the LSF/GRH proteins of animals and fungi. In tblastn searches of assembled genomes at the JGI Genome Portal  using a permissive E value cut-off (e-1), the lycophyte, Selaginella moellendorffii, yielded a hit for GRH (E value 0.07), the alga Chlamydomonas yielded a hit for LSF (E value 0.07), and the amoebozoan Dictyostelium purpureum yielded a hit for LSF (E value 0.04; Additional file 3). When the top hit from Selaginella and Dictyostelium were BLASTed back against the human genome, the search yielded no significant hits.
Evolutionary origins of the LSF/GRH family and subfamilies
Prior to the present study, members of the LSF/GRH family had been reported from a number of triploblastic animals but not from diploblastic animals, choanoflagellates, or fungi. We recovered clear LSF and GRH orthologs from two diploblastic animals (sea anemone and sponge) revealing that the evolutionary divergence between these two subfamilies must have predated the diploblast-triploblast split. Furthermore, fungi possess clear LSF/GRH homologs, although the fungal sequences cannot be assigned to either the LSF or GRH subfamilies. Therefore, while the family clearly originated prior to the metazoan-fungal divergence, the diversification of subfamilies occurred more recently, perhaps in an ancient animal lineage.
Nev-GRH1 and Nev-GRH2
The sea anemone, Nematostella vectensis, is the only species where the sequences encoding the ancestral GRH protein are known to be split between two loci. As sponges, ctenophores, and triploblasts exhibit full-length GRH proteins, this condition must be derived in the sea anemone. The splitting of the ancestral Grh locus in Nematostella must have profound consequences for the regulation and function of GRH. In other animals, GRH binds DNA targets as a dimer. However, in Nematostella, the DNA-binding domain and the oligomerization reside on different proteins. Perhaps Nev-GRH1 is able to interact with the DNA singly, or perhaps a partnership with Nev-GRH2 allows it to form the equivalent of a GRH-dimer on DNA, reminiscent of other GRH proteins. This latter possibility implies that Nev-GRH1 and Nev-GRH2 will be co-expressed in the same cells. This will need to be confirmed experimentally. Interestingly, a comparable split seems to have occurred in the NF-κB gene of this species, with distinct loci encoding different functional domains of the ancestral protein .
Identification of LSF/GRH Homologs in Fungi
Convincing matches to human LSF and/or GRH query sequences were found in the genomes of representative ascomycote, basiomycote and zygomycote fungi (Table 1). The phylum Basiomycota is the sister group to the phylum Ascomycota, with the Zygomycota being more distantly related , and the phylogenetic analysis we performed grouped the LSF-like proteins of the ascomycotes Aspergillus, Mycosphaerella, and Trichoderma to the exclusion of the LSF-like proteins from the zygomycote Phycomyces. In the MEME analysis, the two LSF/GRH proteins identified in the zygomycote Phycomyces were found to possess all of the conserved motifs that were identified within the DNA-binding domain of animals (motifs 4, 5, 6, 9, 10, and 11). The two LSF/GRH proteins of the ascomycote, Aspergillus, also possess motifs 4, 5, 9, 10, and 11, but in place of motif 6, these proteins share motifs 7 and 8, which are unique to this fungus. All four fungal sequences subjected to the MEME analysis were found to contain motif 19, which corresponds to the dimerization domain. Given the strong conservation of motifs between fungi and animals in the DNA-binding and dimerization domains, we hypothesize that the molecular function of these fungal proteins will be very similar to their animal homologs, i.e., they are transcription factors that will bind DNA targets, most likely as dimers (like GRH). However, if the novel fungal-specific motifs functionally replace the SAM domain, which is likely to represent the second protein-protein interaction domain in LSF subfamily members, they might instead bind DNA as tetramers (like LSF).
Insights into the hypothesized ancestral role of GRH from basal animals
Because GRH plays a comparable role in the maintenance and repair of the surface epithelium in mouse , clawed frog , fruit fly , and soil nematode , it has been hypothesized that this role is homologous among triploblastic bilaterians [21, 24]. Given that the shared possession of an epithelium is thought to be homologous across the Metazoa, it is possible that the functional evolution of GRH is connected to the origin and early evolution of the epithelium. The presence of an epithelial boundary is a plesiomorphic character of triploblastic animals, and therefore, we cannot explore the early evolution of animal epithelia using only triploblastic model systems. The identification of clear GRH homologs in cnidarians, ctenophores, and sponges, and the apparent absence of a true Grh gene in the choanoflagellate Monosiga suggests the origin of Grh may be coincident with the origin of the metazoan epithelium. Historically, sponges have been said to lack an epithelium, but more recently, the identification of a genuine basement membrane in homoscleromorph sponges removes this distinction between poriferans and other metazoans . If the role of Grh in maintaining epithelial integrity dates to the origin of the epithelium, then Grh should be expressed in the epidermal epithelium of cnidarians, ctenophores and sponges. Furthermore, Grh should regulate proteins involved in epithelial differentiation and maintenance, although the exact targets of Grh transcriptional regulation may vary among basal animals as they vary among triploblasts. Additionally, we may expect that Grh will be upregulated in response to injury, while knockdown of Grh expression may undermine epithelial integrity and inhibit wound healing. All of these questions are amenable to testing in one or more basal model systems.
The LSF/GRH family had already originated by the time of the opisthokont ancestor, and the overall domain architecture of LSF/GRH proteins has been largely conserved in extant fungi, animals, and choanoflagellates. The LSF subfamily had diverged from the GRH family prior to the divergence of sponges, cnidarians, and triploblastic animals. Consistent differences in domain architecture distinguish the LSF and GRH proteins of both diploblastic and triploblastic animals, suggesting that the functional divergence between these proteins had been established prior to the evolutionary divergence between diploblasts and triploblasts. The sea anemone Nematostella appears unique in that the DNA-binding domain and the dimerization domain of the ancestral GRH protein are now encoded on two separate loci.
Identification of LSF/Grainyhead family members in outgroup taxa
The human proteins LSF [NP_005644.2] and GRHL2 [AAH69633.1] were used to query online genomic databases (Joint Genome Institute Eukaryotic Genomes and NCBI) for LSF-like and GRH-like proteins respectively using BlastP. The following search settings were employed: gap opening penalty = 11; gap extension penalty = 1. Potential homologs that matched one of the query sequences with an expectation score < e-1 were used to query the human genome (using BLASTp) to determine if their top human match was to the original human query sequence (LSF or GRHL2). Sequences were kept for phylogenetic and protein motif identification only if they met this criterion.
Protein motif identification
To identify conserved protein motifs, LSF/GRH proteins were evaluated using MEME (Multiple Expectation Maximization for Motif Elicitation; http://meme.nbcr.net; (; Additional file 1). LSF/GRH family members were chosen to represent ten metazoan phyla, the choanoflagellate Monosiga brevicollis, an ascomycote fungus (Aspergillus) and a zygomycote fungus (Phycomyces; Table 1). The following settings were used in the motif search: maximum number of motifs = 20; occurrences of a single motif = any number; minimum length of a motif = 3 amino acids; maximum length of a motif = 300.
Twenty-eight of the twenty-nine LSF/GRH protein sequences included in the MEME analysis were aligned in preparation for phylogenetic analysis (Additional file 4). The GRH2 protein of Nematostella vectensis was excluded from the alignment because it is substantially truncated relative to the full-length LSF and GRH proteins of other animals. Since motif 4 was identified near the amino terminal of all but three of the proteins, and motif 19 was identified near the carboxy terminal of all but one of the proteins (Figure 1), these motifs were used to bracket the alignment. To ensure that the motifs identified by MEME were maintained in register, the motifs themselves were manually aligned. Then, the regions between conserved motifs were multiply aligned using the Clustal alignment tool found in the application MEGA . The following settings were specified: protein weight matrix = Gonnet, gap opening penalty = 10; gap extension penalty = 0.2. The resulting alignment spans 2045 characters. All positions in the alignment containing gaps were deleted to produce a gap-free alignment comprising 44 characters (Additional file 4).
Phylogenetic relationships among taxa were inferred from both the gap-free alignment and the full alignment using neighbor-joining  and maximum-likelihood . All 44 residues in the gap-free alignment derive from motifs 9-11, which are part of the DNA-binding domain (Additional file 2). First, eighty alternate models of the amino acid substitution process were compared using the program ProtTest 1.3 . The substitution process was optimized along with the tree topology and branch lengths. For both the full alignment and the gap-free alignment, the empirically determined JTT substitution matrix  outperformed other substitution matrices, and incorporating rate variation among sites significantly improved the model (the shape coefficient of the Gamma distribution, α = .837; the coefficient of rate variation among sites = 1/α1/2 = 1.093). The JTT matrix with gamma-distributed rate variation among sites was specified in subsequent phylogenetic analyses.
For the neighbor joining analysis, pairwise distances between proteins were calculated using the Prodist program, and the tree topology was determined using the Neighbor program, both in the Phylip package (v. 3.6; ). Maximum-likelihood analysis was performed using RAxML (v 7.0.3; ) as implemented on the CIPRES Portal (v. 2.0; ). In both the neighbor-joining analysis and the maximum-likelihood analysis, support for specific clades was assessed using the bootstrap : 1,000 replicates of the bootstrap were performed for the neighbor-joining analysis, and 100 replicates were performed for the maximum-likelihood analysis.
The authors thank Dr. Bernie Degnan and Ben Woodcroft for access to the Amphimedon genome browser. We are extremely grateful for the insightful suggestions of two anonymous reviewers. This research was supported by a grant from the Conservation International Marine Management Area Science Program to JRF and NSF grant IOS-0818831 to JRF.
- Auden A, Caddy J, Wilanowski T, Ting SB, Cunningham JM, Jane SM: Spatial and temporal expression of the Grainyhead-like transcription factor family during murine development. Gene Expr Patterns. 2006, 6 (8): 964-970. 10.1016/j.modgep.2006.03.011.View ArticlePubMed
- Bray SJ, Kafatos FC: Developmental function of Elf-1: an essential transcription factor during embryogenesis in Drosophila. Genes Dev. 1991, 5 (9): 1672-1683. 10.1101/gad.5.9.1672.View ArticlePubMed
- Kudryavtseva EI, Sugihara TM, Wang N, Lasso RJ, Gudnason JF, Lipkin SM, Andersen B: Identification and characterization of Grainyhead-like epithelial transactivator (GET-1), a novel mammalian Grainyhead-like factor. Dev Dyn. 2003, 226 (4): 604-617. 10.1002/dvdy.10255.View ArticlePubMed
- Hayashi Y, Yamagishi M, Nishimoto Y, Taguchi O, Matsukage A, Yamaguchi M: A binding site for the transcription factor Grainyhead/Nuclear transcription factor-1 contributes to regulation of the Drosophila proliferating cell nuclear antigen gene promoter. J Biol Chem. 1999, 274 (49): 35080-35088. 10.1074/jbc.274.49.35080.View ArticlePubMed
- Lim LC, Swendeman SL, Sheffery M: Molecular cloning of the alpha-globin transcription factor CP2. Mol Cell Biol. 1992, 12 (2): 828-835.PubMed CentralView ArticlePubMed
- Ramamurthy L, Barbour V, Tuckfield A, Clouston DR, Topham D, Cunningham JM, Jane SM: Targeted disruption of the CP2 gene, a member of the NTF family of transcription factors. J Biol Chem. 2001, 276 (11): 7836-7842. 10.1074/jbc.M004351200.View ArticlePubMed
- Rodda S, Sharma S, Scherer M, Chapman G, Rathjen P: CRTR-1, a developmentally regulated transcriptional repressor related to the CP2 family of transcription factors. J Biol Chem. 2001, 276 (5): 3324-3332. 10.1074/jbc.M008167200.View ArticlePubMed
- Sueyoshi T, Kobayashi R, Nishio K, Aida K, Moore R, Wada T, Handa H, Negishi M: A nuclear factor (NF2d9) that binds to the male-specific P450 (Cyp 2d-9) gene in mouse liver. Mol Cell Biol. 1995, 15 (8): 4158-4166.PubMed CentralView ArticlePubMed
- Uv AE, Harrison EJ, Bray SJ: Tissue-specific splicing and functions of the Drosophila transcription factor Grainyhead. Mol Cell Biol. 1997, 17 (11): 6727-6735.PubMed CentralView ArticlePubMed
- Wilanowski T, Tuckfield A, Cerruti L, O'Connell S, Saint R, Parekh V, Tao J, Cunningham JM, Jane SM: A highly conserved novel family of mammalian developmental transcription factors related to Drosophila grainyhead. Mech Dev. 2002, 114 (1-2): 37-50. 10.1016/S0925-4773(02)00046-1.View ArticlePubMed
- Wilanowski T, Caddy J, Ting SB, Hislop NR, Cerruti L, Auden A, Zhao LL, Asquith S, Ellis S, Sinclair R, et al: Perturbed desmosomal cadherin expression in grainy head-like 1-null mice. Embo J. 2008, 27 (6): 886-897. 10.1038/emboj.2008.24.PubMed CentralView ArticlePubMed
- Ting SB, Wilanowski T, Auden A, Hall M, Voss AK, Thomas T, Parekh V, Cunningham JM, Jane SM: Inositol- and folate-resistant neural tube defects in mice lacking the epithelial-specific factor Grhl-3. Nat Med. 2003, 9 (12): 1513-1519. 10.1038/nm961.View ArticlePubMed
- Tao J, Kuliyev E, Wang X, Li X, Wilanowski T, Jane SM, Mead PE, Cunningham JM: BMP4-dependent expression of Xenopus Grainyhead-like 1 is essential for epidermal differentiation. Development. 2005, 132 (5): 1021-1034. 10.1242/dev.01641.View ArticlePubMed
- Parekh V, McEwen A, Barbour V, Takahashi Y, Rehg JE, Jane SM, Cunningham JM: Defective extraembryonic angiogenesis in mice lacking LBP-1a, a member of the grainyhead family of transcription factors. Mol Cell Biol. 2004, 24 (16): 7113-7129. 10.1128/MCB.24.16.7113-7129.2004.PubMed CentralView ArticlePubMed
- Venkatesan K, McManus HR, Mello CC, Smith TF, Hansen U: Functional conservation between members of an ancient duplicated transcription factor family, LSF/Grainyhead. Nucleic Acids Res. 2003, 31 (15): 4304-4316. 10.1093/nar/gkg644.PubMed CentralView ArticlePubMed
- Veljkovic J, Hansen U: Lineage-specific and ubiquitous biological roles of the mammalian transcription factor LSF. Gene. 2004, 343 (1): 23-40. 10.1016/j.gene.2004.08.010.PubMed CentralView ArticlePubMed
- Attardi LD, Tjian R: Drosophila tissue-specific transcription factor NTF-1 contains a novel isoleucine-rich activation motif. Genes Dev. 1993, 7 (7B): 1341-1353. 10.1101/gad.7.7b.1341.View ArticlePubMed
- Shirra MK, Hansen U: LSF and NTF-1 share a conserved DNA recognition motif yet require different oligomerization states to form a stable protein-DNA complex. J Biol Chem. 1998, 273 (30): 19260-19268. 10.1074/jbc.273.30.19260.View ArticlePubMed
- Frith MC, Hansen U, Weng Z: Detection of cis-element clusters in higher eukaryotic DNA. Bioinformatics. 2001, 17 (10): 878-889. 10.1093/bioinformatics/17.10.878.View ArticlePubMed
- Kokoszynska K, Ostrowski J, Rychlewski L, Wyrwicz LS: The fold recognition of CP2 transcription factors gives new insights into the function and evolution of tumor suppressor protein p53. Cell Cycle. 2008, 7 (18): 2907-2915.View ArticlePubMed
- Harden N: Cell biology. Of grainy heads and broken skins. Science. 2005, 308 (5720): 364-365. 10.1126/science.1112050.View ArticlePubMed
- Ting SB, Caddy J, Hislop N, Wilanowski T, Auden A, Zhao LL, Ellis S, Kaur P, Uchida Y, Holleran WM, et al: A homolog of Drosophila grainy head is essential for epidermal integrity in mice. Science. 2005, 308 (5720): 411-413. 10.1126/science.1107511.View ArticlePubMed
- Mace KA, Pearson JC, McGinnis W: An epidermal barrier wound repair pathway in Drosophila is mediated by grainy head. Science. 2005, 308 (5720): 381-385. 10.1126/science.1107573.View ArticlePubMed
- Moussian B, Uv AE: An ancient control of epithelial barrier formation and wound healing. Bioessays. 2005, 27 (10): 987-990. 10.1002/bies.20308.View ArticlePubMed
- Cenci C, Gould AP: Drosophila Grainyhead specifies late programmes of neural proliferation by regulating the mitotic activity and Hox-dependent apoptosis of neuroblasts. Development. 2005, 132 (17): 3835-3845. 10.1242/dev.01932.View ArticlePubMed
- Yu Z, Bhandari A, Mannik J, Pham T, Xu X, Andersen B: Grainyhead-like factor Get1/Grhl3 regulates formation of the epidermal leading edge during eyelid closure. Dev Biol. 2008, 319 (1): 56-67. 10.1016/j.ydbio.2008.04.001.PubMed CentralView ArticlePubMed
- Yamaguchi Y, Yonemura S, Takada S: Grainyhead-related transcription factor is required for duct maturation in the salivary gland and the kidney of the mouse. Development. 2006, 133 (23): 4737-4748. 10.1242/dev.02658.View ArticlePubMed
- Gustavsson P, Greene ND, Lad D, Pauws E, de Castro SC, Stanier P, Copp AJ: Increased expression of Grainyhead-like-3 rescues spina bifida in a folate-resistant mouse model. Hum Mol Genet. 2007, 16 (21): 2640-2646. 10.1093/hmg/ddm221.View ArticlePubMed
- Van Laer L, Van Eyken E, Fransen E, Huyghe JR, Topsakal V, Hendrickx JJ, Hannula S, Maki-Torkko E, Jensen M, Demeester K, et al: The grainyhead like 2 gene (GRHL2), alias TFCP2L3, is associated with age-related hearing impairment. Hum Mol Genet. 2008, 17 (2): 159-169. 10.1093/hmg/ddm292.View ArticlePubMed
- Swendeman SL, Spielholz C, Jenkins NA, Gilbert DJ, Copeland NG, Sheffery M: Characterization of the genomic structure, chromosomal location, promoter, and development expression of the alpha-globin transcription factor CP2. J Biol Chem. 1994, 269 (15): 11663-11671.PubMed
- Bing Z, Huang JH, Liao WS: NFkappa B interacts with serum amyloid A3 enhancer factor to synergistically activate mouse serum amyloid A3 gene transcription. J Biol Chem. 2000, 275 (41): 31616-31623. 10.1074/jbc.M005378200.View ArticlePubMed
- Zhou W, Clouston DR, Wang X, Cerruti L, Cunningham JM, Jane SM: Induction of human fetal globin gene expression by a novel erythroid factor, NF-E4. Mol Cell Biol. 2000, 20 (20): 7662-7672. 10.1128/MCB.20.20.7662-7672.2000.PubMed CentralView ArticlePubMed
- Volker JL, Rameh LE, Zhu Q, DeCaprio J, Hansen U: Mitogenic stimulation of resting T cells causes rapid phosphorylation of the transcription factor LSF and increased DNA-binding activity. Genes Dev. 1997, 11 (11): 1435-1446. 10.1101/gad.11.11.1435.View ArticlePubMed
- Kashour T, Burton T, Dibrov A, Amara FM: Late Simian virus 40 transcription factor is a target of the phosphoinositide 3-kinase/Akt pathway in anti-apoptotic Alzheimer's amyloid precursor protein signalling. Biochem J. 2003, 370 (Pt 3): 1063-1075. 10.1042/BJ20021197.PubMed CentralView ArticlePubMed
- Jane SM, Nienhuis AW, Cunningham JM: Hemoglobin switching in man and chicken is mediated by a heteromeric complex between the ubiquitous transcription factor CP2 and a developmentally specific protein. Embo J. 1995, 14 (1): 97-105.PubMed CentralPubMed
- Lim LC, Fang L, Swendeman SL, Sheffery M: Characterization of the molecularly cloned murine alpha-globin transcription factor CP2. J Biol Chem. 1993, 268 (24): 18008-18017.PubMed
- Drouin EE, Schrader CE, Stavnezer J, Hansen U: The ubiquitously expressed DNA-binding protein late SV40 factor binds Ig switch regions and represses class switching to IgA. J Immunol. 2002, 168 (6): 2847-2856.View ArticlePubMed
- Chae JH, Kim CG: CP2 binding to the promoter is essential for the enhanced transcription of globin genes in erythroid cells. Mol Cells. 2003, 15 (1): 40-47.PubMed
- Chae JH, Lee YH, Kim CG: Transcription factor CP2 is crucial in hemoglobin synthesis during erythroid terminal differentiation in vitro. Biochem Biophys Res Commun. 1999, 263 (2): 580-583. 10.1006/bbrc.1999.1408.View ArticlePubMed
- Casolaro V, Keane-Myers AM, Swendeman SL, Steindler C, Zhong F, Sheffery M, Georas SN, Ono SJ: Identification and characterization of a critical CP2-binding element in the human interleukin-4 promoter. J Biol Chem. 2000, 275 (47): 36605-36611. 10.1074/jbc.M007086200.View ArticlePubMed
- Bruni P, Minopoli G, Brancaccio T, Napolitano M, Faraonio R, Zambrano N, Hansen U, Russo T: Fe65, a ligand of the Alzheimer's beta-amyloid precursor protein, blocks cell cycle progression by down-regulating thymidylate synthase expression. J Biol Chem. 2002, 277 (38): 35481-35488. 10.1074/jbc.M205227200.View ArticlePubMed
- Powell CM, Rudge TL, Zhu Q, Johnson LF, Hansen U: Inhibition of the mammalian transcription factor LSF induces S-phase-dependent apoptosis by downregulating thymidylate synthase expression. Embo J. 2000, 19 (17): 4665-4675. 10.1093/emboj/19.17.4665.PubMed CentralView ArticlePubMed
- Lynch M, Force A: The probability of duplicate gene preservation by subfunctionalization. Genetics. 2000, 154 (1): 459-473.PubMed CentralPubMed
- Rastogi S, Liberles DA: Subfunctionalization of duplicated genes as a transition state to neofunctionalization. BMC Evol Biol. 2005, 5 (1): 28-10.1186/1471-2148-5-28.PubMed CentralView ArticlePubMed
- Reitzel AM, Sullivan JC, Traylor-Knowles N, Finnerty JR: Genomic survey of candidate stress-response genes in the estuarine anemone Nematostella vectensis. Biol Bull. 2008, 214 (3): 233-254. 10.2307/25470666.View ArticlePubMed
- Ting SB, Wilanowski T, Cerruti L, Zhao LL, Cunningham JM, Jane SM: The identification and characterization of human Sister-of-Mammalian Grainyhead (SOM) expands the grainyhead-like family of developmental transcription factors. Biochem J. 2003, 370 (Pt 3): 953-962. 10.1042/BJ20021476.PubMed CentralView ArticlePubMed
- Uv AE, Thompson CR, Bray SJ: The Drosophila tissue-specific factor Grainyhead contains novel DNA-binding and dimerization domains which are conserved in the human protein CP2. Mol Cell Biol. 1994, 14 (6): 4020-4031.PubMed CentralView ArticlePubMed
- JGI Genome Portal. [http://genome.jgi-psf.org/]
- Sullivan JC, Kalaitzidis D, Gilmore TD, Finnerty JR: Rel homology domain-containing transcription factors in the cnidarian Nematostella vectensis. Dev Genes Evol. 2007, 217 (1): 63-72. 10.1007/s00427-006-0111-6.View ArticlePubMed
- James TY, Kauff F, Schoch CL, Matheny PB, Hofstetter V, Cox CJ, Celio G, Gueidan C, Fraker E, Miadlikowska J, et al: Reconstructing the early evolution of Fungi using a six-gene phylogeny. Nature. 2006, 443 (7113): 818-822. 10.1038/nature05110.View ArticlePubMed
- Aouacheria A, Geourjon C, Aghajari N, Navratil V, Deleage G, Lethias C, Exposito JY: Insights into early extracellular matrix evolution: spongin short chain collagen-related proteins are homologous to basement membrane type IV collagens and form a novel family widely distributed in invertebrates. Mol Biol Evol. 2006, 23 (12): 2288-2302. 10.1093/molbev/msl100.View ArticlePubMed
- Bailey TL: Discovering novel sequence motifs with MEME. Curr Protoc Bioinformatics. 2002, Chapter 2 (Unit 2): 4-PubMed
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24 (8): 1596-1599. 10.1093/molbev/msm092.View ArticlePubMed
- Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.PubMed
- Felsenstein J: Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol. 1981, 17 (6): 368-376. 10.1007/BF01734359.View ArticlePubMed
- Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21 (9): 2104-2105. 10.1093/bioinformatics/bti263.View ArticlePubMed
- Jones D, Taylor W, Thornton J: The rapid generation of mutation data matrices from protein sequences. Computer Applications in the Biosciencesx. 1992, 8: 275-282.
- Felsenstein J: PHYLIP (Phylogeny Inference Package). Seattle: Distributed by the author. 2005, Department of Genome Sciences, University of Washington, 3.6
- Stamatakis A, Ludwig T, Meier H: RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics. 2005, 21 (4): 456-463. 10.1093/bioinformatics/bti191.View ArticlePubMed
- CIPRES. Cyberinfrastructure for Phylogenetic Research. [http://www.phylo.org/sub_sections/portal/]
- Felsenstein J: Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985, 39 (4): 783-791. 10.2307/2408678.View Article
- Halanych KM, Bacheller JD, Aguinaldo AM, Liva SM, Hillis DM, Lake JA: Evidence from 18S ribosomal DNA that the lophophorates are protostome animals. Science. 1995, 267 (5204): 1641-1643. 10.1126/science.7886451.View ArticlePubMed
- Bourlat SJ, Nielsen C, Economou AD, Telford MJ: Testing the new animal phylogeny: a phylum level molecular analysis of the animal kingdom. Mol Phylogenet Evol. 2008, 49 (1): 23-31. 10.1016/j.ympev.2008.07.008.View ArticlePubMed
- Telford MJ, Bourlat SJ, Economou A, Papillon D, Rota-Stabelli O: The evolution of the Ecdysozoa. Philos Trans R Soc Lond B Biol Sci. 2008, 363 (1496): 1529-1537. 10.1098/rstb.2007.2243.PubMed CentralView ArticlePubMed
- Rogozin IB, Wolf YI, Carmel L, Koonin EV: Analysis of rare amino acid replacements supports the Coelomata clade. Mol Biol Evol. 2007, 24 (12): 2594-2597. 10.1093/molbev/msm218.View ArticlePubMed
- Zheng J, Rogozin IB, Koonin EV, Przytycka TM: Support for the Coelomata clade of animals from a rigorous analysis of the pattern of intron conservation. Mol Biol Evol. 2007, 24 (11): 2583-2592. 10.1093/molbev/msm207.View ArticlePubMed
- Philippe H, Lartillot N, Brinkmann H: Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia. Mol Biol Evol. 2005, 22 (5): 1246-1253. 10.1093/molbev/msi111.View ArticlePubMed
- Aguinaldo AM, Turbeville JM, Linford LS, Rivera MC, Garey JR, Raff RA, Lake JA: Evidence for a clade of nematodes, arthropods and other moulting animals. Nature. 1997, 387 (6632): 489-493. 10.1038/387489a0.View ArticlePubMed
- Dunn CW, Hejnol A, Matus DQ, Pang K, Browne WE, Smith SA, Seaver E, Rouse GW, Obst M, Edgecombe GD, et al: Broad phylogenomic sampling improves resolution of the animal tree of life. Nature. 2008, 452 (7188): 745-749. 10.1038/nature06614.View ArticlePubMed
- Irimia M, Maeso I, Penny D, Garcia-Fernandez J, Roy SW: Rare coding sequence changes are consistent with Ecdysozoa, not Coelomata. Mol Biol Evol. 2007, 24 (8): 1604-1607. 10.1093/molbev/msm105.View ArticlePubMed
- Roy SW, Irimia M: Rare genomic characters do not support Coelomata: RGC_CAMs. J Mol Evol. 2008, 66 (3): 308-315. 10.1007/s00239-008-9077-5.View ArticlePubMed
- Roy SW, Irimia M: Rare genomic characters do not support Coelomata: intron loss/gain. Mol Biol Evol. 2008, 25 (4): 620-623. 10.1093/molbev/msn035.View ArticlePubMed
- Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006, 22 (21): 2688-2690. 10.1093/bioinformatics/btl446.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.