- Research article
- Open Access
The diversification of the basic leucine zipper family in eukaryotes correlates with the evolution of multicellularity
© Jindrich and Degnan. 2016
- Received: 29 October 2015
- Accepted: 19 January 2016
- Published: 1 February 2016
Multicellularity evolved multiple times in eukaryotes. In all cases, this required an elaboration of the regulatory mechanisms controlling gene expression. Amongst the conserved eukaryotic transcription factor families, the basic leucine zipper (bZIP) superfamily is one of the most ancient and best characterised. This gene family plays a diversity of roles in the specification, differentiation and maintenance of cell types in plants and animals. bZIPs are also involved in stress responses and the regulation of cell proliferation in fungi, amoebozoans and heterokonts.
Using 49 sequenced genomes from across the Eukaryota, we demonstrate that the bZIP superfamily has evolved from a single ancestral eukaryotic gene and undergone multiple independent expansions. bZIP family diversification is largely restricted to multicellular lineages, consistent with bZIPs contributing to the complex regulatory networks underlying differential and cell type-specific gene expression in these lineages. Analyses focused on the Metazoa suggest an elaborate bZIP network was in place in the most recent shared ancestor of all extant animals that was comprised of 11 of the 12 previously recognized families present in modern taxa. In addition this analysis identifies three bZIP families that appear to have been lost in mammals. Thus the ancestral metazoan and eumetazoan bZIP repertoire consists of 12 and 16 bZIPs, respectively. These diversified from 7 founder genes present in the holozoan ancestor.
Our results reveal the ancestral opisthokont, holozoan and metazoan bZIP repertoire and provide insights into the progressive expansion and divergence of bZIPs in the five main eukaryotic kingdoms, suggesting that the early diversification of bZIPs in multiple eukaryotic lineages was a prerequisite for the evolution of complex multicellular organisms.
- bZIP transcription factor
- Gene regulatory networks
Increasing evidence suggests that the evolution of complex multicellular organisms arose from the expansion and diversification of gene regulatory networks (reviewed in ). In eukaryotes, the precise control of gene expression, often in response to physiological and environmental stimuli, largely depends on the binding of specific transcription factor proteins to specific DNA sequences . This ancient mode of gene regulation has been co-opted into and is instrumental in the ontogeny of multicellular eukaryotes, sitting at nodes in developmental gene regulatory networks (GRNs) that underlie spatiotemporal and cell type-specific gene transcription . Analysis of GRNs, largely in bilaterian animals, reveals they are populated by transcription factors of differing evolutionary age, with most being either unique to metazoans (e.g. nuclear receptors) or of an older evolutionary origin (e.g. basic helix-loop-helix transcription factors) .
The basic leucine zipper (bZIP) superfamily of transcription factors appears to have originated early in eukaryotic evolution . bZIPs sit at the heart of key pathways regulating cellular decisions across this domain of life . They have been consistently implicated in a wide range of core eukaryotic cellular processes, including cell proliferation and differentiation, stress response and homeostasis . However, the ancestral role of bZIPs in eukaryotes has been difficult to infer because a single conserved function has not been identified amongst living eukaryotes.
bZIP transcription factors take their name from a highly conserved 60–80 amino acid bZIP domain, which has a bipartite organisation consisting of an N-terminal basic region, responsible for DNA binding, and a leucine zipper, which mediates homo- and hetero-dimerization between bZIPs. As dimers, they regulate transcription by binding short DNA target sites, often in the form of 8 base pair palindromes. In recent years, the bZIP network of several eukaryotes has been described, and their evolution has been, to some extent, investigated in animals [7, 8], fungi  and plants . The recent sequencing of disparate eukaryotic genomes now allows the search for the primordial bZIP and the reconstruction of the evolutionary trajectories this family has taken in different higher eukaryotic lineages, including phyla, kingdoms and superkingdoms.
Here, we analysed the bZIP gene repertoires from a wide range of eukaryote genomes, focusing on the lineage with the widest coverage of draft genomes, the holozoans. We traced back metazoan bZIP families to their origin in the holozoan last common ancestor, opisthokont last common ancestor and beyond. By comparing bZIPs diversification in the main eukaryotic clades, we demonstrate that bZIPs originated from a single protein and then evolved independently in each major eukaryotic lineage. bZIP family complexity appears to increase incrementally over long evolutionary periods, prior to evolutionary transitions into a complex multicellular condition. Early in eukaryotic evolution, a first expansion phase occurred independently in each of the four main eukaryotic lineages - Opistokonta, Amoebozoa, Planta and Heterokonta- yielding to 3 or 4 bZIP families. These families constitute the core of the bZIP complement in extant fungi, heterokonts and amoebozoans. A second expansion phase occurred in holozoans and early plants, prior to the emergence of complex multicellularity in either lineage. In holozoans, it occurred in two main steps: one before the divergence of unicellular holozoans, which display numerous examples of colonial forms with life cycles comprised of more than one cell type (e.g. in Salpingoeca rosetta  and Capsaspora owczarzaki ), and one during the early evolution of metazoans, prior to the emergence of the crown group.
Accrual of the animal bZIP repertoire over the course of holozoan evolution
Phylogenetic analyses suggest that PAR-L and OASIS-L are related to PAR and OASIS (Additional file 2 and 3). PAR-L bZIPs possess the two Asn residues at positions 13 and 14, characteristic of the PAR and CEPB families (Additional file 2). OASIS-L bZIPs include the OASIS-specific Tyr at position 27 but exhibit a unique combination of amino acids at positions 13 to 21: NAIxAxxNR (x represents variable residues), instead of the typical NKxSAxxSR OASIS combination (Additional file 2). REPTOR however does not appear to be related to any other bZIP family and is characterised by an astonishingly conserved basic domain, with 33 consecutive residues (positions 7–35, 37–39 and 41) identical in nearly all species, and a very short coiled-coil region (Fig. 1 and Additional file 2). We named this family after its Drosophila orthologue, which has been recently identified as a downstream factor of the TORC1 signalling pathway . The origin of the PAR-L and OASIS-L sub-families, and REPTOR family can be traced back to being present in the last common metazoan and eumetazoan ancestor, respectively.
Notably, this approach identified a putative member of the JUN family in the holozoan Capsaspora, a bZIP gene family previously regarded as metazoan-specific . This finding is consistent with our observation that members of the fungal GCN4 family and the metazoan JUN family appear to be distant orthologues (Fig. 4 and Additional file 1: Table S2).
Emergence of novel domain combinations in holozoan bZIP families
bZIPs appear to have increased their connectivity throughout metazoan evolution through the linking to other protein domains . The kinase inducible activation domain (KID) is associated with CREB in several metazoans and mediates the interaction of CREB and p300/CBP. Although the Capsaspora genome encodes p300 , its CREB gene does not encode a KID. In contrast both sponge and ctenophore CREBs include a KID domain, consistent with a domain-shuffling event early in metazoan evolution to bring together these domains.
A detailed analysis of domains associated with bZIP domains in other families identified a number of conserved domain combinations, including an ATF2-specific and a JUN-specific transactivation domain, previously described in human ATF2  and JUN , and an ATF6-specific domain, which is present in unicellular holozoan orthologues (Additional file 5). We also recovered highly conserved regions located N-terminally of the PAR, MAF and OASIS basic domain, the latter being present in unicellular holozoan OASISes (Additional file 5). Interestingly, these regions were not conserved in the PAR-L and OASIS-L bZIPs. None of these domains were detected in fungal orthologues or related bZIPs.
Conservation of eukaryotic bZIP DNA-binding motif
Based on our observation that many bilaterian and indeed metazoan bZIPs originated early in holozoan evolution, well prior to the diversification of the Metazoa, we attempted to retrace bZIP evolution beyond the opisthokont last common ancestor. A total of 896 bZIPs were identified from 49 eukaryotic genomes, including opisthokont (metazoans, choanoflagellates, and fungi), amoebozoan, plant (Viridiplantae - land plants and green algae, and red algae), heterokont (oomycetes, diatoms and brown algae) and excavate representatives. bZIP genes were recovered in all species surveyed (Additional file 1: Table S1), except two amoebozoans (Hartmannella vermiformis and Physarum polycephalum) and a microsporidial fungus (Antonospora locustae), consistent with bZIPs being present in the most recent shared ancestor of extant eukaryotes.
Amongst the eukaryotic bZIPs, the leucine residues in the leucine zipper region (residues 31, 38, 45, 52 and 59 in Fig. 3) and a nine amino acid region in the N-terminal of the basic DNA-binding domain (residues 13–21 in Fig. 3) are the most conserved. Five very highly conserved amino acids within the basic region - Asn13, Ala16, Ala17, Ser/Cys20 and Arg21 - have been shown to be instrumental in determining sequence-specific DNA binding in bilaterians and fungi [17, 18]. The first and last of these amino acids are the most conserved amongst eukaryotes, with only three differences at these positions found across all eukaryotes surveyed: the excavate Giardia has a solitary bZIP factor with a Gly at position 13; a few plants have Lys at position 21; and in oomycete heterokonts have a Cys at position 13 in several proteins that appear to be functional . The level of conservation of the other three residues varies between eukaryotic lineages (Fig. 3). For instance, nonpolar Ala is the most common residue in position 16 in opisthokonts and ameobozoans, while in plants, heterokonts and excavates this site is most often populated by Ser, which has an uncharged polar side chain (Fig. 3). Amongst the most conserved residues, position 20 is the most variable in terms of residue diversity and disparity, with Cys only observed in this position in opisthokonts.
Independent expansion of bZIPs in different eukaryotic lineages
Using 49 sequenced genomes representing disparate eukaryotic lineages, including 18 representative metazoan genomes, we have reconstructed the evolution of one of the most ancient transcription factor families employed in animal development and disease, the bZIPs. We demonstrate that the evolution of multicellularity is correlated with the expansion and diversification of bZIPs in different families. However, an apparent increase in morphological and behavioural complexity (e.g. along the bilaterian or eumetazoan stem) is not always accompanied with an increase in gene family number. Indeed, bZIP family complexity appears to increase incrementally over long evolutionary periods, probably being one of a number of prerequisites for the evolution of networks underlying complex gene regulation.
The metazoan bZIP network is comprised of members of differing age
The OASIS family of bZIPs form three orthology groups (Additional file 3) that emerged either prior to metazoan cladogenesis or before the divergence of sponge and eumetazoan lineages. The OASIS-L group is distinct from the other OASIS family members and lacks the OASIS-specific extended basic domain. PAR, CEBP and FOS duplicated and diverged along the bilaterian stem to give rise to three new pairs of families, PAR-NFIL3, CEBP-CEBPg and FOS-ATF3. Cnidarians also possess pairs of CEBP and FOS genes that clade with one of the bilaterian orthologues, suggesting that these duplicated prior to the divergence of cnidarians and bilaterians; one of the duplicates subsequently diverged to form a new subfamily in bilaterians. The sponge-eumetazoan common ancestor possessed two PAR orthologues: PAR and PAR-L; PAR further duplicated and diversified in stem bilaterians to give rise to PAR and NFIL3 orthology groups. Independent expansion of this family in bilaterians, cnidarians and poriferans has yielded PAR-members that are unique to these taxa (Additional file 3). Similarly, although all eumetazoans possess at least two MAF-bZIPs, the emergence of large MAF and small MAF orthology groups did not occur until after the divergence of cnidarians and bilaterians (Additional file 3). This study also identified a new bZIP family, called REPTOR after its Drosophila member , which emerged from an unidentified bZIP ancestor along the poriferan-eumetazoan stem. All REPTORs display an almost identical basic region and a very short leucine zipper, which may nonetheless be functional for dimerization [13, 21].
ATF2-Sko1, ATF6-HAC1 and JUN-CGN4 families are common between metazoans and fungi, and most likely reflect the bZIP network that was in place in the common ancestor of extant opisthokonts. Members of those families preferentially recognise a palindromic sequence that is conserved in extant animals and fungi . We found that the transactivation of ATF2, CREB and JUN are conserved throughout metazoans, and that of ATF6 throughout holozoans (Additional file 5). The conservation of both co-factors and binding sites potentially reflects the primordial role of these factors in the opisthokont ancestor and points towards a conserved function across animal and fungal orthologues. Consistent with this idea, metazoan ATF6 and yeast HAC1 hold a similar role in the activation of the seemingly conserved unfolded protein response . Similarly, ATF2 plays a key role in the control of homeostasis in animals (reviewed in ) while SKO1 is central in the yeast response to different stress stimuli . This reasoning may explain the general housekeeping role of CREB/ATF factors in animals, in contrast to the specific roles of metazoan-specific families (e.g. PAR and the control of circadian rhythm ).
The diversification of bZIPs in eukaryotes support a role in the evolution of multicellularity
Multicellularity evolved several times in eukaryotes  and arose from a diversification of gene regulatory networks . Given the bZIPs are part of the ancestral eukaryotic transcription factor repertoire, a relationship between bZIP expansion, and diversification and evolution of complex multicellularity appears plausible. Our analysis supports the inference that the eukaryotic LCA possessed a solitary bZIP, which underwent an early expansion and diversification in the lineages leading to crown animals and fungi (stem opisthokont lineage), and Viridiplantae (land plants and green algae) (Fig. 5). This is consistent with an early increase in bZIP membership being a prerequisite for the evolution of the complex multicellularity displayed in animals, plants and fungi. The bZIP superfamily underwent a further duplication and divergence in each of these lineages, as detailed above for animals, with some of the expansions occurring prior to the emergence of the stem metazoan lineage (Figs. 5 and 6).
The differential expansion of the bZIP superfamily in opisthokont and plant lineages is consistent with the hypothesis that the independent diversification of bZIP families contributed to evolution of complex multicellularity on more than one occasion. Specifically a marked difference in ancestral metazoan and fungal, and land plant and algal bZIP repertoires, with morphologically more complex metazoans and land plants having more bZIP families (12 and 13, respectively) than simpler fungi and unicellular algae (4 and 5, respectively; Fig. 5).
The bZIP superfamily has also diversified in other eukaryotic lineages that display colonial or simple multicellularity. For instance, with a more limited dataset we identified 3 and 4 families in amoebozoans and heterokonts, respectively. In these cases again it appears that the expansion and diversification of bZIPs into different families occurs in lineages that include organisms with complex life cycles and more than one cell type (Fig. 5). The establishment of the core of the bZIP network early in evolution is consistent with bZIPs central role in basic cellular processes .
Reconstruction of the ancestral bZIP
The giardial bZIP has been proposed as a model for the precursor of all bZIPs . However, it is consistently recovered in a clade that lacks most eukaryotic lineages; only metazoan and fungal bZIPs clade with this bZIP. Although in this study we could not confidently identify the ancestral bZIP of each kingdom, the similarity between sequences from ancient families of plant (proto-B), metazoan (ATF6-OASIS), fungal (HAC1), amoebozoan and heterokont bZIPs suggests they share features of the ancestral eukaryotic bZIP (Fig. 3), including the deeply conserved NxxSAxxSR (residues 13–21) signature motif.
This five-residue motif is involved in sequence-specific DNA binding and has not varied greatly over the entire course of the bZIP superfamily evolution. Indeed this may explain the restricted number bZIP families and the similarity of bZIP DNA-binding sequences throughout Eukaryota. Although each monomer possesses its own transactivation activity (reviewed in ), bZIPs can nonetheless regulate a wide range of cellular processes because they bind DNA as dimers, where each basic domain contributes individually to the recognition of one half binding site . Pairing of bZIPs generates an extensive array of dimeric regulators, which in combination determines the transcriptional activity. Thus the independent diversification of bZIPs in multiple eukaryotic lineages allows for a marked and lineage-specific expansion in potential combinations. As dimerization is key to the functional diversification of bZIP transcription factors, offering the potential for flexible and complex transcriptional programs, the expansion of this gene family potentially contributed to the foundations underlying the evolution of complex multicellular organisms.
We compiled a dataset of 896 bZIPs from 49 sequenced genomes from across the Eukaryota. The depth of this dataset permits an assessment of the evolution of this family of transcription factors in relation to the timing of the major evolutionary transitions in eukaryotes, including the evolution of multicellularity. We demonstrated that bZIPs underwent an early expansion and diversification, independently in each of the five main eukaryotic lineages, and was likely a contributing prerequisite for the evolution of organisms with complex life cycles and multiple cell types. Focusing on metazoans, we reconstructed the duplication events that shaped bZIP sub-families and identified three previously uncharacterised bZIP sub-families that appear to have been lost in mammals. Our analysis identified the ancestral metazoan and holozan bZIP repertoire, which comprise 7 and 12 founder genes, respectively.
Taxonomic sampling and retrieval of bZIPs
The sequences of the full complement of bZIP genes were retrieved from the fully sequenced genomes of 31 non-metazoan eukaryotes and 18 metazoan representatives. Metazoan opisthokonts include: Homo sapiens, Branchiostoma floridae and Ciona intestinalis (Chordata); Strongylocentrotus purpuratus (Echinodermata); Drosophila melanogaster, Tribolium castaneum and Daphnia pulex (Arthropoda): Lottia gigantea (Mollusca), Capitella telata (Polychaeta); Nematostella vectensis, Acropora digitifera and Hydra magnipapillata (Cnidaria); Trichoplax adherans (Placozoa); Mnemiopsis leydi and Pleurobrachia bachei (Ctenophora); and Amphimedon queenslandica, Oscarella carmela and Sycon ciliatum (Porifera). Non-metazoan opisthokonts include: Monosiga brevicollis (Choanoflagellata); Capsaspora owczarzaki (Filasterea); and Saccharomyces cerevisiae, Schizosaccharomyces pombe, Candida albicans, Aspergillus nidulans, Magnaporthe grisea, Ustilago maydis, Cryptococcus neoformans, Mucor circinelloides, Rhizophagus irregularis, Antonospora locustae and Encephalitozoon cuniculi (Fungi). Plants include: Arabidopsis thaliana (Planta); Chlamydomonas reinhardtii (green alga); and Galdieria sulphuraria and Chondrus Crispus (red alga). Amoebozoans include: Entamoeba histolica, Dictyostelium discoideum, Acanthamoeba castellanii, Hartmannella vermiformis and Physarum polycephalum. Heterokonts include: Thalassiosira pseudonana, Phytophthora sojae, Pythium ultimum, Phytophthora infectans, Ectocarpus sliculosus (brown alga), Hyaloperonospora arabidopsidis, and Phaeodactylum tricornutum; Excavates include: Giardia lamblia and Naeglaria gruberi. As there are few published analyses of heterokont bZIPs, we sampled several species in this lineage. Subsequent phylogenetic analyses in this study were restricted to Phytophthora infectans (oocmycete), Thalassiosira pseudonana (diatom) and Ectocarpus siliculosus (brown alga). Species and data sources information can be found in Additional file 1: Table S1. All databases were publically available and no animal work requiring ethics approval was conducted.
The complete bZIP set for each species was obtained through an iterative process including (i) building an initial set of bZIP proteins using PFAM, (ii) interrogating the proteome by BlastP in several databases (NCBI, JGI, Ensemble and species specific genome browsers [28–42]) and then (iii) re-interrogating each of the proteomes with a HMMER genome-wide scanning using custom Hidden Markov Models generated for each phylogenetic clade. When searching early branching holozoans, additional models were generated for each phylum. A general cutoff value of 10-4 was used but proteins with higher e-value were also manually selected against the identification criteria listed below. When possible, we interrogated our dataset with previously published data to assess completeness (see Additional file 1: Table S1 for details, [7, 9, 10, 18, 20, 21, 43–45]). Putative bZIPs were then manually inspected for the following features: (1) a basic domain BR, as defined by , and (2) a leucine zipper, within the two heptads located C-terminally of the BR and presenting a coil-coiled structure of two heptads minimum.
We defined the N-terminal domain boundary of the bZIP domain as the N-terminal end of the crystal structure of GCN4 in complex with DNA  and to avoid artefacts in the tree building algorithm, we set the length of the basic domain (basic region and leucine zipper) at 60 amino acids. Protein sequences were trimmed to their bZIP domain and aligned using the MAFFT v7 algorithm  in Geneious and then manually inspected. Maximum likelihood (ML) analyses were carried out by RaxML . The LG + G substitution model was scored as the best-fit model for each alignment, using ProtTest . Branch support was estimated by performing 1000 bootstrap replicates. Bayesian analyses were carried out with MrBayes 3.2 , using the LGG substitution matrix, with 2 parallel runs, four chains and a resampling frequency of 100. Different temperatures were used when convergence was not achieved. We considered that we had reached convergence when the average standard deviation of split frequencies fell below 0.05. The analysis was terminated if convergence was not reached after 12,000,000 generations.
To determine orthologue assignments, we looked at each species independently.
For holozoan bZIPs, we constructed three bZIP reference sets including the sequences from Homo, Branchiostoma and poriferans (bZIPs from a representative demosponge, calcisponge and homoscleromorph), respectively. We made multiple alignments comprising the putative bZIPs of each species and one of the reference set; the bZIPs identified in each phylum; and the bZIPs identified in each clade. Phylogenetic trees were inferred by maximum likelihood and Bayesian analyses.
We considered two sequences as orthologues if their grouping was supported by bootstrap values and posterior marginal probabilities superior to 80 % and 90 %, respectively. We then manually inspected each protein for conserved amino acids peculiar to each bZIP family (described in  (CEBP);  (MAF);  (JUN and FOS);  (PAR);  (CREB) and  (all) and Additional file 4). Protein sequences, names and family assignment can be found in Additional file 1: Table S1.
Using a similar method, we sought to identify groups of orthologues within and between six eukaryotic kingdoms: animals, fungi, plants, amoebozans, heterokonts and excavates. Our classification was compared with previous studies that have focused on a specific clade (in fungi  and plants ), to confirm our orthologue assignments and limited to taxa for which there is an available draft genome. Thus the eukaryotic lineages with wider and deeper coverage are likely to be more accurate. To reduce the effect of potentially unstable sequences, we ran multiple permutations of the same analysis, with only one or two species representative of each kingdom or lineage, changing the lineage representatives each time.
Search for other motifs and domains in bZIP-containing proteins
The complete set of full-length bZIP sequences from the following representative holozoan species was searched for the presence of conserved motifs using the MEME suite : Homo sapiens; Drosophila melanogaster; Hydra magnipapillata; Amphimedon queenslandica; Mnemiopsis leydii; Monosiga brevicollis; and Capsaspora owczarzaki. The minimal and maximal width for a motif was set to 6 and 50 residues, respectively. The motifs found to be conserved between orthologues were investigated further by building HMM for these motifs/domains and searching the entire derived bZIP proteome of all holozoan species.
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files. Protein sequences can be found in Additional file 1: Table S1. Additional alignments and trees are available upon request.
This work was supported by an Australian Research Council grant to BMD.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Levine M, Tjian R. Transcription regulation and animal diversity. Nature. 2003;424(6945):147–51.View ArticlePubMedGoogle Scholar
- Hughes TR. Introduction to “a handbook of transcription factors”. Subcell Biochem. 2011;52:1–6.View ArticlePubMedGoogle Scholar
- Davidson EH, Erwin DH. Gene regulatory networks and the evolution of animal body plans. Science. 2006;311(5762):796–800.View ArticlePubMedGoogle Scholar
- Degnan BM, Vervoort M, Larroux C, Richards GS. Early evolution of metazoan transcription factors. Curr Opin Genet Dev. 2009;19(6):591–9.View ArticlePubMedGoogle Scholar
- Deppmann CD, Alvania RS, Taparowsky EJ. Cross-species annotation of basic leucine zipper factor interactions: Insight into the evolution of closed interaction networks. Mol Biol Evol. 2006;23(8):1480–92.View ArticlePubMedGoogle Scholar
- Miller M. The importance of being flexible: the case of basic region leucine zipper transcriptional regulators. Curr Protein Pept Sci. 2009;10(3):244–69.PubMed CentralView ArticlePubMedGoogle Scholar
- Amoutzias GD, Veron AS, Weiner 3rd J, Robinson-Rechavi M, Bornberg-Bauer E, Oliver SG, et al. One billion years of bZIP transcription factor evolution: conservation and change in dimerization and DNA-binding site specificity. Mol Biol Evol. 2007;24(3):827–35.View ArticlePubMedGoogle Scholar
- Sebe-Pedros A, de Mendoza A, Lang BF, Degnan BM, Ruiz-Trillo I. Unexpected repertoire of metazoan transcription factors in the unicellular holozoan Capsaspora owczarzaki. Mol Biol Evol. 2011;28(3):1241–54.PubMed CentralView ArticlePubMedGoogle Scholar
- Tian C, Li J, Glass NL. Exploring the bZIP transcription factor regulatory network in Neurospora crassa. Microbiology. 2011;157(Pt 3):747–59.PubMed CentralView ArticlePubMedGoogle Scholar
- Correa LG, Riano-Pachon DM, Schrago CG, dos Santos RV, Mueller-Roeber B, Vincentz M. The role of bZIP transcription factors in green plant evolution: adaptive features emerging from four founder genes. PLoS One. 2008;3(8):e2944.PubMed CentralView ArticlePubMedGoogle Scholar
- Fairclough SR, Chen Z, Kramer E, Zeng Q, Young S, Robertson HM, et al. Premetazoan genome evolution and the regulation of cell differentiation in the choanoflagellate Salpingoeca rosetta. Genome Biol. 2013;14(2):R15.PubMed CentralView ArticlePubMedGoogle Scholar
- Suga H, Chen Z, de Mendoza A, Sebe-Pedros A, Brown MW, Kramer E, et al. The Capsaspora genome reveals a complex unicellular prehistory of animals. Nat Commun. 2013;4:2325.PubMed CentralView ArticlePubMedGoogle Scholar
- Tiebe M, Lutz M, De La Garza A, Buechling T, Boutros M, Teleman AA. REPTOR and REPTOR-BP Regulate Organismal Metabolism and Transcription Downstream of TORC1. Dev Cell. 2015;33(3):272–84.View ArticlePubMedGoogle Scholar
- Whelan NV, Kocot KM, Moroz LL, Halanych KM. Error, signal, and the placement of Ctenophora sister to all other animals. Proc Natl Acad Sci U S A. 2015;112(18):5773–8.PubMed CentralView ArticlePubMedGoogle Scholar
- Nagadoi A, Nakazawa K, Uda H, Okuno K, Maekawa T, Ishii S, et al. Solution structure of the transactivation domain of ATF-2 comprising a zinc finger-like subdomain and a flexible subdomain. J Mol Biol. 1999;287(3):593–607.View ArticlePubMedGoogle Scholar
- Sutherland Ja Fau - Cook A, Cook A Fau - Bannister AJ, Bannister Aj Fau - Kouzarides T, Kouzarides T: Conserved motifs in Fos and Jun define a new class of activation domain. 1992(0890-9369 (Print)).Google Scholar
- Fujii Y, Shimizu T, Toda T, Yanagida M, Hakoshima T. Structural basis for the diversity of DNA recognition by bZIP transcription factors. Nat Struct Biol. 2000;7(10):889–93.View ArticlePubMedGoogle Scholar
- Gamboa-Melendez H, Huerta AI, Judelson HS. bZIP transcription factors in the oomycete phytophthora infestans with novel DNA-binding domains are involved in defense against oxidative stress. Eukaryot Cell. 2013;12(10):1403–12.PubMed CentralView ArticlePubMedGoogle Scholar
- Derelle R, Lopez P, Le Guyader H, Manuel M. Homeodomain proteins belong to the ancestral molecular toolkit of eukaryotes. Evol Dev. 2007;9(3):212–9.View ArticlePubMedGoogle Scholar
- Vinson C, Myakishev M, Acharya A, Mir AA, Moll JR, Bonovich M. Classification of human B-ZIP proteins based on dimerization properties. Mol Cell Biol. 2002;22(18):6321–35.PubMed CentralView ArticlePubMedGoogle Scholar
- Reinke AW, Baek J, Ashenberg O, Keating AE. Networks of bZIP protein-protein interactions diversified over a billion years of evolution. Science. 2013;340(6133):730–4.PubMed CentralView ArticlePubMedGoogle Scholar
- Yoshida H, Matsui T, Yamamoto A, Okada T, Mori K. XBP1 mRNA is induced by ATF6 and spliced by IRE1 in response to ER stress to produce a highly active transcription factor. Cell. 2001;107(7):881–91.View ArticlePubMedGoogle Scholar
- Hai T, Hartman MG. The molecular biology and nomenclature of the activating transcription factor/cAMP responsive element binding family of transcription factors: activating transcription factor proteins and homeostasis. Gene. 2001;273(1):1–11.View ArticlePubMedGoogle Scholar
- Rep M, Proft M, Remize F, Tamas M, Serrano R, Thevelein JM, et al. The Saccharomyces cerevisiae Sko1p transcription factor mediates HOG pathway-dependent osmotic regulation of a set of genes encoding enzymes implicated in protection from oxidative damage. Mol Microbiol. 2001;40(5):1067–83.View ArticlePubMedGoogle Scholar
- Reitzel AM, Tarrant AM, Levy O. Circadian clocks in the cnidaria: environmental entrainment, molecular regulation, and organismal outputs. Integr Comp Biol. 2013;53(1):118–30.PubMed CentralView ArticlePubMedGoogle Scholar
- Parfrey LW, Lahr DJG. Multicellularity arose several times in the evolution of eukaryotes (Response to doi:10.1002/bies.201100187). Bioessays. 2013;35(4):339–47.View ArticlePubMedGoogle Scholar
- Llorca CM, Potschin M, Zentgraf U. bZIPs and WRKYs: two large transcription factor families executing two different functional strategies. Front Plant Sci. 2014;5:169.PubMed CentralView ArticlePubMedGoogle Scholar
- The Mnemiopsis Genome Project Portal [http://research.nhgri.nih.gov/mnemiopsis/].
- The Neurobase website [http://neurobase.rc.ufl.edu/Pleurobrachia].
- The Dictybase website [http://dictybase.org].
- The GiardiaDB website [http://giardiadb.org/giardiadb/].
- The National Center for Biotechnology Information Genbank [ftp://ftp.ncbi.nih.gov/genomes/].
- The DOE Joint genome institute [http://genome.jgi.doe.gov].
- Compagen [http://compagen.zoologie.uni-kiel.de/index.html].
- Origins of Multicellularity Sequencing Project, Broad Institute of Harvard and MIT [http://www.broadinstitute.org/].
- Aurrecoechea C, Brestelli J, Brunk BP, Carlton JM, Dommer J, Fischer S, et al. GiardiaDB and TrichDB: integrated genomic resources for the eukaryotic protist pathogens Giardia lamblia and Trichomonas vaginalis. Nucleic Acids Res. 2009;37(Database issue):D526–30.PubMed CentralView ArticlePubMedGoogle Scholar
- Cunningham F, Amode MR, Barrell D, Beal K, Billis K, Brent S, et al. Ensembl 2015. Nucleic Acids Res. 2015;43(Database issue):D662–9.PubMed CentralView ArticlePubMedGoogle Scholar
- Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42(D1):D222–30.PubMed CentralView ArticlePubMedGoogle Scholar
- Gaudet P, Fey P, Basu S, Bushmanova YA, Dodson R, Sheppard KA, et al. dictyBase update 2011: web 2.0 functionality and the initial steps towards a genome portal for the Amoebozoa. Nucleic Acids Res. 2011;39(Database issue):D620–4.PubMed CentralView ArticlePubMedGoogle Scholar
- Hemmrich G, Bosch TC. Compagen, a comparative genomics platform for early branching metazoan animals, reveals early origins of genes regulating stem-cell differentiation. Bioessays. 2008;30(10):1010–8.View ArticlePubMedGoogle Scholar
- Shinzato C, Shoguchi E, Kawashima T, Hamada M, Hisata K, Tanaka M, et al. Using the Acropora digitifera genome to understand coral responses to environmental change. Nature. 2011;476(7360):320–U382.View ArticlePubMedGoogle Scholar
- Yilmaz A, Mejia-Guerra MK, Kurz K, Liang X, Welch L, Grotewold E. AGRIS: the Arabidopsis Gene Regulatory Information Server, an update. Nucleic Acids Res. 2011;39(Database issue):D1118–22.PubMed CentralView ArticlePubMedGoogle Scholar
- Cock JM, Sterck L, Rouze P, Scornet D, Allen AE, Amoutzias G, et al. The Ectocarpus genome and the independent evolution of multicellularity in brown algae. Nature. 2010;465(7298):617–21.View ArticlePubMedGoogle Scholar
- Nunez-Corcuera B, Birch JL, Yamada Y, Williams JG. Transcriptional repression by a bZIP protein regulates Dictyostelium prespore differentiation. PLoS One. 2012;7(1):e29895.PubMed CentralView ArticlePubMedGoogle Scholar
- Ye W, Wang Y, Dong S, Tyler BM, Wang Y. Phylogenetic and transcriptional analysis of an expanded bZIP transcription factor family in Phytophthora sojae. BMC Genomics. 2013;14(1):839.PubMed CentralView ArticlePubMedGoogle Scholar
- Ellenberger TE, Brandl CJ, Struhl K, Harrison SC. The Gcn4 Basic Region Leucine Zipper Binds DNA as a Dimer of Uninterrupted Alpha-Helices - Crystal-Structure of the Protein-DNA Complex. Cell. 1992;71(7):1223–37.View ArticlePubMedGoogle Scholar
- Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30(14):3059–66.PubMed CentralView ArticlePubMedGoogle Scholar
- Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22(21):2688–90.View ArticlePubMedGoogle Scholar
- Abascal F, Zardoya R, Posada D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005;21(9):2104–5.View ArticlePubMedGoogle Scholar
- Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Hohna S, et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61(3):539–42.PubMed CentralView ArticlePubMedGoogle Scholar
- Miller M, Shuman JD, Sebastian T, Dauter Z, Johnson PF. Structural basis for DNA recognition by the basic region leucine zipper transcription factor CCAAT/enhancer-binding protein alpha. J Biol Chem. 2003;278(17):15178–84.View ArticlePubMedGoogle Scholar
- Kurokawa H, Motohashi H, Sueno S, Kimura M, Takagawa H, Kanno Y, et al. Structural basis of alternative DNA recognition by Maf transcription factors. Mol Cell Biol. 2009;29(23):6232–44.PubMed CentralView ArticlePubMedGoogle Scholar
- Haas NB, Cantwell CA, Johnson PF, Burch JB. DNA-binding specificity of the PAR basic leucine zipper protein VBP partially overlaps those of the C/EBP and CREB/ATF families and is influenced by domains that flank the core basic region. Mol Cell Biol. 1995;15(4):1923–32.PubMed CentralView ArticlePubMedGoogle Scholar
- Luo Q, Viste K, Urday-Zaa JC, Senthil Kumar G, Tsai WW, Talai A, et al. Mechanism of CREB recognition and coactivation by the CREB-regulated transcriptional coactivator CRTC2. Proc Natl Acad Sci U S A. 2012;109(51):20865–70.PubMed CentralView ArticlePubMedGoogle Scholar
- Donald JE, Shakhnovich EI. Predicting specificity-determining residues in two large eukaryotic transcription factor families. Nucleic Acids Res. 2005;33(14):4455–65.PubMed CentralView ArticlePubMedGoogle Scholar
- Bailey TL, Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol. 1994;2:28–36.PubMedGoogle Scholar
- Srivastava M, Simakov O, Chapman J, Fahey B, Gauthier ME, Mitros T, et al. The Amphimedon queenslandica genome and the evolution of animal complexity. Nature. 2010;466(7307):720–6.PubMed CentralView ArticlePubMedGoogle Scholar
- Schuster-Bockler B, Schultz J, Rahmann S. HMM Logos for visualization of protein families. BMC Bioinformatics. 2004;5:7.PubMed CentralView ArticlePubMedGoogle Scholar
- King N. The unicellular ancestry of animal development. Dev Cell. 2004;7(3):313–25.View ArticlePubMedGoogle Scholar