- Research article
- Open Access
Two more Posterior Hox genes and Hox cluster dispersal in echinoderms
© The Author(s). 2018
- Received: 2 August 2018
- Accepted: 23 November 2018
- Published: 27 December 2018
Hox genes are key elements in patterning animal development. They are renowned for their, often, clustered organisation in the genome, with supposed mechanistic links between the organisation of the genes and their expression. The widespread distribution and comparable functions of Hox genes across the animals has led to them being a major study system for comparing the molecular bases for construction and divergence of animal morphologies. Echinoderms (including sea urchins, sea stars, sea cucumbers, feather stars and brittle stars) possess one of the most unusual body plans in the animal kingdom with pronounced pentameral symmetry in the adults. Consequently, much interest has focused on their development, evolution and the role of the Hox genes in these processes. In this context, the organisation of echinoderm Hox gene clusters is distinctive. Within the classificatory system of Duboule, echinoderms constitute one of the clearest examples of Disorganized (D) clusters (i.e. intact clusters but with a gene order or orientation rearranged relative to the ancestral state).
Here we describe two Hox genes (Hox11/13d and e) that have been overlooked in most previous work and have not been considered in reconstructions of echinoderm Hox complements and cluster organisation. The two genes are related to Posterior Hox genes and are present in all classes of echinoderm. Importantly, they do not reside in the Hox cluster of any species for which genomic linkage data is available.
Incorporating the two neglected Posterior Hox genes into assessments of echinoderm Hox gene complements and organisation shows that these animals in fact have Split (S) Hox clusters rather than simply Disorganized (D) clusters within the Duboule classification scheme. This then has implications for how these genes are likely regulated, with them no longer covered by any potential long-range Hox cluster-wide, or multigenic sub-cluster, regulatory mechanisms.
- Posterior Hox genes
- Hox gene evolution
Hox genes encode a family of homeodomain-containing transcription factors that are renowned for conserved roles in the patterning of the anterior-posterior axis of bilaterian animals, and they may well have more ancient roles in animal axial development [1, 2]. Hox genes often occur in ordered clusters and exhibit spatial and/or temporal collinearity, wherein their order in the cluster matches their order of expression in the embryo [3, 4].
This wealth of new data has a potential to shed new light on the controversial evolutionary history of the Posterior Hox genes in deuterostomes. Posterior Hox genes, related to Hox9 and above in vertebrates, have undergone a dramatic expansion in the deuterostome clade. Deuterostome species investigated so far have been found to possess at least four of these genes, in contrast to the one or two common among their protostome cousins. Xenoturbella and acoelomorphs, which possess small Hox complements with only a single Posterior in each species examined [13–17], have been classified as deuterostomes in some phylogenomic studies ([18, 19], but see ), but even if this placement is correct, the simple body plans and minimal Hox complements of these animals, confirmed by a recent in-depth analysis  are clearly atypical of Deuterostomia.
The cause(s) of deuterostome Posterior Hox expansion are not clear, although some have hypothesised a connection between the proliferation of Posterior Hox genes in chordates and the appearance of morphological novelties at the posterior end of the anterior-posterior (AP) axis, such as tails . Equally unclear is the duplication history that led to the large Posterior Hox complements found in deuterostomes. Phylogenetic studies do not paint a clear picture, with generally poor resolution and variable topologies amongst the chordate Hox9–15 genes and the ambulacrarian Posterior Hox complements of Hox9/10 and Hox11/13a-c [5, 6, 8, 9, 22–27]. As early as 2000, this lack of resolution led Ferrier et al.  to suggest that the relatively high evolutionary rates of deuterostome Posterior Hox genes have largely obscured their true relationships, a hypothesis they dubbed Deuterostome Posterior Flexibility (DPF).
To begin to resolve such a thorny question, good taxon sampling and careful screening to identify the full Hox complements in each taxon are imperative. With the availability of expanding public genomic resources such as Echinobase , those requirements can be fulfilled with greater resolution and robustness than previously possible. Here, we report the presence of two previously overlooked Posterior Hox genes specific to echinoderms, which we name Hox11/13d and e based on their similarity to the well-known Hox11/13b-c genes of echinoderms and hemichordates. Both genes are shared by all extant echinoderm classes and absent from the currently published hemichordate genomes, and good evidence for developmental expression exists for one of them, which has previously been mis-classified. Intriguingly, neither gene is found within the Hox cluster in any of the echinoderm genomes in which assembly quality permits the investigation of linkage. In all species we studied, these novel genes reside on genomic scaffolds separate from those containing the canonical Hox cluster and are accompanied by at least one non-Hox gene. Motif and phylogenetic analyses confirm the status of Hox11/13d and e as distinct, echinoderm-specific Posterior Hox genes and the lack of close linkage with the Hox cluster requires that echinoderms be reclassified as having Split (S) clusters rather than simply Disorganized (D) clusters within the classification system of Duboule .
Hox11/13d and Hox11/13e are novel Posterior Hox genes
Hox11/13d was first detected during an attempt to find orthologues of known echinoderm Hox genes in the ophiuroid Ophiothrix spiculata. After BLAST searches using homeodomain queries from S. purpuratus against the O. spiculata genome and the ophiuroid transcriptomes of Delroisse et al.  revealed the presence of three homeodomains highly similar to Hox11/13b and c, we conducted reciprocal BLAST searches against S. purpuratus as well as searches against the NCBI nr database. Thus, we found a previously undescribed gene model from S. purpuratus (NCBI accession XP_011680299.1) that could be aligned to one of the putative ophiuroid Hox11/13b-c type sequences over its full length. This sequence matched neither Stpu-Hox11/13b nor c, and mapped to scaffold 1168 in v4.2 of the S. purpuratus genome assembly, not scaffold 628, where the Hox cluster described by Cameron et al.  is located. Additionally, the novel gene showed high similarity to the sequence Tsuchimoto and Yamaguchi  identified as Hox11/13c in the sand dollar Peronella japonica.
Subsequent searches confirmed the existence of a Hox11/13d gene distinct from Hox11/13b and c in all other echinoderm genomes available at the time of study: the sea urchin Lytechinus variegatus, the sea cucumbers Parastichopus parvimensis and Apostichopus japonicus, the sea stars Patiria miniata and Acanthaster planci, and the crinoid Anneissia japonica (formerly Oxycomanthus japonicus).
We attempted to find orthologues for Hox11/13d and e in hemichordates; however, neither the genome of Saccoglossus kowalevskii nor that of Ptychodera flava yielded clear examples. In P. flava, no Posterior Hox genes were detected beyond the canonical four identified in previous studies [6, 8, 9]. In S. kowalevskii, there is a fifth, divergent Posterior Hox gene dubbed AbdB-like by Simakov et al. . However, this sequence (accession: ALR88649.1) appears to be unique, and neither phylogenetic analyses nor sequence motifs clearly link it to either Hox11/13d or e (data not shown).
Hox11/13d and e are detached from the Hox cluster
Genome databases, assembly versions and Hox scaffolds used in this study
Hox cluster scaffold(s)
Echinodermata / Crinoidea
Echinodermata / Echinoidea
2296, 638, 378, 5296, 1336, 14,399, 33,366, 12,176, 234,558, 41,337, 1680
Echinodermata / Echinoidea
Echinodermata / Asteroidea
Echinodermata / Asteroidea
2763, 4440, 5928, 12,174, 1848, 3114
Echinodermata / Holothuroidea
1.0 from 
Echinodermata / Holothuroidea
391, 3188, 498, 1495, 4722, 3261, 5822, 317
Echinodermata / Ophiuroidea
2313, 14,601, 543, 145, 887, 4941, 67,114, 5877, 10,227, 4461, 7983
Hemichordata / Enteropneusta
Hemichordata / Enteropneusta
16,417, 1507, 26,913
A handful of the non-Hox genes accompanying Hox11/13d and e were found to be shared between two or more examined species. Specifically, Hox11/13d is flanked by the same two genes in both sea urchins (Fig. 5a.). One of these neighbouring genes, encoding a large peptide containing multiple C-type lectin, fibronectin type 3 and CUB domains, is also on the Hox11/13d scaffold in A. planci, although there it is separated from the Hox gene by ~ 1.6 Mb and multiple other genes. The Hox11/13d scaffold of P. miniata has a gene content strongly conserved with the Hox neighbourhood of the corresponding A. planci scaffold (Fig. 5a).
Among the neighbours of Hox11/13e, S. purpuratus shares a putative orthologous histamine N-methyltransferase gene with L. variegatus. An additional hnmt gene is found in tandem with the first one in S. purpuratus but does not have a clear best match among the numerous similar sequences in the L. variegatus genome. A third S. purpuratus gene encoding an astacin-like metalloproteinase is conserved between the Hox11/13e scaffolds of that species and A. planci; considering that this gene is several hundred kilobases from Hox11/13e in S. pupuratus, the apparent lack of conservation with L. variegatus may be an artefact of the poorer quality of the latter’s genome assembly.
P. parvimensis and the two sea urchins do not share any non-Hox genes on their Hox11/13d and e scaffolds; however, the single non-Hox gene detected on each P. parvimensis scaffold is also present on the corresponding scaffold in A. planci. On the longer Hox11/13e scaffold of the A. japonicus assembly, a further gene encoding a copine-8-like sequence is a good match to a similar sequence in A. planci; however, despite its much larger size, the Hox11/13d scaffold of this species seems to lack the neurobeachin-like sequence shared between P. parvimensis and A. planci. None of the non-Hox genes detected next to Hox11/13d and e in O. spiculata are Hox neighbours in the other species.
We ran the motif-finding program MEME on the non-homeodomain portions of representative deuterostome Posterior Hox sequences to see if we could discover diagnostic features for members of the Hox11/13b+ clade. We also hoped that such motifs could provide additional information to elucidate the tangled evolutionary history of this clade. Overall, the distribution of motifs in 11/13b + protein sequences is patchy (Additional file 3), with different motifs being shared by different groups of genes. Nonetheless, a few motifs may be useful in distinguishing members of this clade within echinoderms. One of these is the C-peptide, which is especially distinctive in Hox11/13e, where a long, strongly conserved region rich in charged residues follows the homeodomain. Hox11/13d has a similar, albeit more variable, charged C-peptide (Additional file 4 Fig. 2), which is largely absent from previously known members of the b + clade.
Another potentially informative motif is the hexapeptide and linker region, which is easily distinguishable between Hox11/13b, c and d, although any trace of a hexapeptide-like sequence (or indeed, any N-peptide conservation) appears to be missing from Hox11/13e (Fig. 2). Hox11/13b in non-crinoid echinoderms is characterised by a highly conserved N-peptide despite the loss of the tryptophan residue key to canonical hexapeptide function . Likewise, this region is very similar in all echinoderm Hox11/13c sequences examined. The N-peptides of Hox11/13d sequences show less conservation, but they share a Ser/Thr-rich linker region. In hemichordates, the N-peptides of both Hox11/13b and c are most similar to that of echinoderm Hox11/13b except for the retention of the conserved tryptophan in the hemichordate sequences (Additional file 5).
Evidence of expression
There is strong evidence for expression of Hox11/13d in multiple echinoderm classes. Most importantly, Tsuchimoto and Yamaguchi  provided good quality in situ data for this gene, although they worked under the assumption that it was Hox11/13c. Their experiments in Peronella japonica demonstrated expression in the vegetal plate during the blastula and early gastrula stages. There are no published in situs from other species, but transcripts of Hox11/13d are present in developmental transcriptomes of S. purpuratus ([32, 33], transcript ID WHL22.13708.0) and L. variegatus (transcript accession JI441003.1) available through Echinobase, the adult arm transcriptome of the ophiuroid Ophiopsila aranea , and the testicular transcriptome of A. planci (, SRX493873). We did not find Hox11/13e in any larval or adult transcriptomes except for a handful of reads in the A. planci testicular transcriptome.
In this study, we describe two novel echinoderm Hox genes with possible implications for Hox cluster evolution and the origin of the unusual body plan of Echinodermata. We named these genes Hox11/13d and e to indicate their relationship to the Hox11/13 genes previously described from both echinoderms and hemichordates. Our results increase the typical complement of echinoderm Posterior Hox genes to six, closer in number to the seven Posterior genes of the “archetypal” chordate amphioxus . Additional, divergent sequences such as AbdB-like in S. kowalevskii and an as yet unnamed, unique Posterior Hox-like sequence we found in O. spiculata (data not shown) raise the possibility of even more unexplored, lineage-specific diversity in this iconic gene family.
The problem of Posterior Hox gene phylogeny in deuterostomes has endured ever since Ferrier et al.  first articulated it. Although that study largely focused on the problem as it pertains to chordates, the relationships of Hox11/13b+ genes in ambulacrarians are equally difficult to resolve. Our work confirms that this problem cannot be easily solved with the addition of more taxa and sequences: despite including complete homeodomains and flanking regions of all types of 11/13 protein from all echinoderm classes and three hemichordates, our phylogenetic analyses still yield poorly resolved trees with conflicting topologies. Rather than illuminating its evolutionary history, the mosaic distribution of conserved sequence motifs outside the homeodomain (Additional files 3and 4 Fig. 6) indicates a high level of evolutionary flexibility in this clade.
Hox genes are best known for their conserved roles in patterning the bilaterian AP axis. In echinoderms, the ancestral AP axis is obscured by the pentameral symmetry of the adult body, but a spatially ordered “Hox vector” can still be discerned in the larval somatocoels of both echinoids [30, 35] and crinoids . This vector incorporates Hox7-Hox11/13b in echinoids and Hox5-Hox9/10 in crinoids, although Hara et al.  were unable to clone Hox11/13b from M. rotundus. Separate from this linear expression pattern, some Hox genes are also expressed in radial patterns in the adult rudiment; such radial expression has been reported for Hox3 in S. purpuratus  and Hox3, 5 and 11/13b in P. japonica .
The conspicuous absence of Anterior and some Central Hox genes from the somatocoelar Hox vector, together with the rearrangement of the sea urchin Hox cluster with Hox1–3 at the “posterior” end of the cluster , prompted Mooi and David  to hypothesise a link between cluster rearrangement and what they termed the axial, radially symmetrical region of a developing echinoderm adult. Building on Duboule’s  discussion of Hox cluster organisation and ordering as a possible evolutionary constraint, Mooi and David  and David and Mooi  suggested that the translocation of Anterior Hox genes in echinoderms permitted a delay in their expression and a dissociation from the AP axis, allowing their novel deployment as part of the developmental toolkit for radial adult structures. The above hypothesis predicted that the 5′ translocation of Anterior Hox genes would be ancestral to living echinoderms. However, the recent publication of Hox cluster data from sea stars  and sea cucumbers  suggests that it is, in fact, a peculiarity of echinoids and therefore not associated with the origin of pentameral symmetry .
All of the echinoderm Hox clusters described above fell into the “Disorganized (D)” category of Duboule’s  classification, meaning they were intact but relatively loosely organised, with losses, inversions and/or rearrangements within the cluster. Our findings reveal two novel genes that appear completely detached from the main Hox cluster even in the echinoderm species with the least disorganized cluster described to date . Given that Hox11/13d and e both occur in every extant echinoderm class, the ancestral echinoderm Hox cluster may have been “Split (S)” sensu Duboule  instead of merely disorganized. Linkage data from crinoids, which form the sister group to all other living echinoderms, will be essential for testing this idea. A mostly intact Hox cluster which a subset of Posterior Hox genes have nonetheless escaped from is also seen in the annelid Platynereis dumerilii ; how closely this loss of cluster integrity parallels the situation in echinoderms remains to be seen.
Hox11/13d is expressed in embryonic stages of several echinoderms, likely an unusual trait for a Hox gene in this clade [30, 36]. Interestingly, the limited spatial expression data that exist for Hox11/13d  hint that despite its departure from the cluster, this gene may exhibit spatially coordinated expression with Hox11/13b, appearing in a domain more vegetal than the latter. Spatial collinearity of Hox gene expression is known to be at least partially independent of clustering. Residual spatial collinearity may persist even after complete Hox cluster disintegration [41, 42]. Conversely, Hox genes in canonically ordered clusters may evolve expression domains that break collinearity, as seen with Hox6 and Hox14 in the “archetypal” chordate amphioxus . Thus, the regulatory, developmental and evolutionary significance of Hox11/13d and e being outside the Hox cluster is difficult to predict without more information on their expression and function.
Nothing is currently known about the expression and developmental roles (if any) of Hox11/13e. Unlike Hox11/13d, it is not present in any of the developmental transcriptomes we searched, suggesting that any developmental expression would happen at late stages that may be crucial for the development of the pentameral adult, or restricted domains that limit its detectability in transcriptome surveys. While Hox11/13d has a very similar homeodomain to Hox11/13b and c (Fig. 2) and shares several conserved motifs with the hemichordate members of the 11/13b+ group (Additional files 3 and 4), Hox11/13e has a highly distinctive homeodomain, so divergent that its original discoverers were not even certain it was a Hox gene . In light of this, the high level of conservation seen in both its homeodomain and C-peptide across different echinoderm clades (Fig. 2) is intriguing, and so is the similarity of the C-peptide to Hox11/13d. As an echinoderm-specific Hox gene that is both highly unique and very conserved within echinoderms, Hox11/13e may prove especially interesting with regard to the evolution of the unusual body plan of this phylum.
Our discovery of two previously unrecognised (except for a brief mention of one of them in ref. ) Hox genes in the well-studied genome of the “model” echinoderm S. purpuratus highlights the continued need for in-depth studies focused on individual gene families in the age of big data. Such deep surveys may be particularly vital for Posterior Hox genes, whose higher levels of sequence divergence compared to most Anterior and Central Hox genes can make them difficult to catch in general homeodomain searches .
The improved taxon sampling resulting from the proliferation of “non-model” genome sequencing projects creates an unprecedented opportunity to chart the distribution of unusual members of key gene families such as Hox11/13e, an essential first step in understanding their role in body plan evolution. In combination with expression and functional studies, such surveys may shed new light on the origin of lineage-specific innovations.
The two echinoderm Posterior Hox genes described here, Hox11/13d and e, have been largely neglected in previous work. These genes must have arisen early in echinoderm evolution since they are found in all extant echinoderm classes. Their genomic locations outside the Hox gene cluster, in all species for which data is available, requires the classification of the Hox clusters in these animals be revised. Echinoderms can no longer be considered as possessing Disorganized Hox clusters, but instead have undergone some dispersal of the Hox genes, thus making echinoderms another example of animals with Split Hox clusters. The impact of this Hox gene dispersal on the evolution of echinoderm body plans remains to be determined, but it is clear that Hox11/13d and e can no longer be controlled by any long-range gene regulatory mechanisms that may be operating within the remainder of the Hox cluster of sea urchins, sea stars, sea cucumbers, brittle stars and feather stars.
Hox gene surveys
During a general search for Hox gene sequences in the genome of the ophiuroid echinoderm Ophiothrix spiculata and transcriptomes of other ophiuroids , with sea urchin Hox cluster sequences as queries, an unexpected and thus far unannotated extra gene was found, here called Hox11/13d. Once it became clear that one of the transcriptomes and the O. spiculata genome both contained a previously undescribed Hox gene distinct from Hox11/13b and c, specific BLAST searches of all available echinoderm genomes were conducted using this novel sequence (chiefly its S. purpuratus orthologue, which has a probably complete RefSeq gene model under accession XP_011680299.1) as a query. Hox11/13e (then unnamed) was briefly mentioned in ref.  as a “Hox11/13c-like” gene in Strongylocentrotus purpuratus. We used the Echinobase gene ID given in that study (SPU_011798, currently annotated as “Homeo2”, an indeterminate ANTP-class gene) to search other echinoderm genomes for possible orthologues (see Additional file 6 for accessions, genomic locations and notes). To test whether Hox11/13d and e were echinoderm-specific or shared across the Ambulacraria, we also searched the genomes  of the hemichordates Ptychodera flava and Saccoglossus kowalevskii for additional Hox11/13b/c-like genes. Genome versions and databases used, along with the locations of the novel Hox genes, can be found in Table 1.
Alignment and phylogenetic analysis
To ascertain that the novel genes we identified were indeed Posterior Hox genes, we created a reference alignment of ANTP-class homeodomains (Additional file 7) from the cephalochordate Branchiostoma floridae, the beetle Tribolium castaneum and the sea urchin Strongylocentrotus purpuratus. B. floridae and T. castaneum homeodomains were downloaded from HomeoDB  and checked by eye, while S. purpuratus homeodomains were extracted from the genome using the published homeobox survey of Howard-Ashby et al.  as a starting point and adding further sequences via BLAST searches of the v4.2 assembly. Homeodomains were manually aligned in Jalview 2.9 , and analysed with the neighbour-joining, maximum likelihood and Bayesian methods (see below for details).
In addition, more focused alignments of deuterostome Posterior Hox protein sequences were created to refine the placement of the novel genes within the Ambulacraria and Deuterostomia. We used Posterior Hox genes from selected echinoderm species with genomes available at the time of the analysis, as well as the “Hox11/13c” sequence from Peronella japonica published in , which we suspected might be a misidentified Hox11/13d. Hemichordates were represented by the acorn worms Saccoglossus kowalevskii, Balanoglossus simodensis  and Ptychodera flava, and chordates by the amphioxus Branchiostoma floridae, the coelacanth Latimeria menadoensis and the elephant shark Callorhinchus milii. In addition to homeodomains, approximately 12 amino acids of flanking sequence on either side of the homeodomain were also used in the full alignment. Where possible, N-terminal flanking sequences (herein, N-peptides) included the “vestigial” hexapeptide found in certain types of Posterior Hox proteins , with the conserved tryptophan as an anchor for alignment; attempts were also made to align C-peptides, although this could only be done within groups of orthologous proteins in most cases. Because we aligned flanking sequences to the extent this was possible, and because some Hox proteins (notably ambulacrarian Hox9/10) have C-peptides shorter than 12 residues, the exact lengths of the N- and C-peptides included vary. The full flanked alignment is available in Additional file 5. The flanked homeodomain alignment was created with MAFFT v7  accessed through Jalview, and edited by eye.
Neighbour-joining trees were constructed in MEGA 7 , with pairwise deletion of missing data and 1000 bootstrap replicates. For maximum likelihood analyses, the PhyML 3.0 web service  was used with subtree pruning and regrafting as the search algorithm and 500 bootstrap replicates to assess tree robustness. Bayesian trees were generated with MrBayes v3.2.6  using two parallel runs of four chains each with the default heating parameters. Bayesian analyses were run in increments of 1 million generations until the standard deviation of split frequencies between runs decreased below the recommended 0.01. This required 7 million generations for the ANTP and full Hox datasets, and 5 million for the homeodomain-only Hox alignment. The first 25% of each run was discarded as burn-in. ML and Bayesian analyses used the best model selected by modelgenerator v0.85  for each dataset (JTT + I + Γ for the full Hox alignment and LG + Γ for the other two). In all cases, model selection was unanimous by all three information criteria employed by modelgenerator. NJ analyses were conducted with JTT + Γ, the highest-scoring model available in MEGA. Since MEGA does not estimate it automatically, the shape parameter α was manually set to the value given by modelgenerator (α = 0.36 for the ANTP dataset, 0.68 for the full Hox dataset and 0.39 for the homeodomain-only alignment).
MEME v4.12.0 [53, 54] was employed to search for potential diagnostic motifs outside the homeodomains of Posterior Hox proteins (Hox11/13b-c clade members in particular). In this analysis, each echinoderm class except crinoids was represented by a single species to avoid the problem of high overall similarity between closely related taxa obscuring potentially interesting motifs. The species chosen were S. purpuratus for Echinoidea, Patiria miniata for Asteroidea, Parastichopus parvimensis for Holothuroidea, and O. spiculata for Ophiuroidea. The crinoid data consisted of the complete protein sequences of Metacrinus rotundus Hox9/10 and Hox11/13c as determined by Hara et al. , supplemented by ORFs containing the homeodomains of Hox11/13a, b, d and e that were manually assembled from Aneissia japonica (formerly Oxycomanthus japonicus) raw genomic reads (NCBI Sequence Read Archive accession SRX447395). The non-homeodomain exons of these genes could not be obtained by BLAST searches of the read data. S. kowalevskii, a harrimaniid, and B. simodensis, a ptychoderid represented hemichordates, while the chordate data consisted of the Posterior sequences from B. floridae and a representative member of each paralogy group from C. milii, selected by visual inspections of PG alignments between C. milii and L. menadoensis. Homeodomains were excluded from the analysis, and the regions preceding and following the homeodomain were tested separately to avoid spurious motif detection across the site of the homeodomain. We searched for motifs between 6 and 20 amino acids long that occurred in at least two sequences with at most one occurrence per sequence, and used MEME’s default E-value threshold of 0.05 as a cutoff. The input sequences and Posterior Hox sequences from additional ambulacrarian taxa were then visually inspected for further instances of each motif, and MEME’s original alignments were manually curated before being converted into logos for publication. Weblogo v2.8.2  with no small sample correction was used to generate the logos. Full motif alignments including partial instances omitted from the logos can be found in Additional file 4, and a summary of motif distributions in our dataset is given in Additional file 3.
Non-Hox neighbours of Hox11/13d and e
In all examined genomes, we found that neither Hox11/13d nor Hox11/13e shared a scaffold with any other Hox gene. Therefore, we scanned the scaffolds containing Hox11/13d and e in S. purpuratus, Lytechinus variegatus, Acanthaster planci, P. miniata and P. parvimensis for non-Hox genes to further assess their separation from the main Hox cluster. The genome of A. japonica did not have long enough scaffolds to permit neighbour analysis and were therefore omitted. In the first instance, existing annotations from each genome database and (where available) transcripts mappable to the Hox11/13d or e scaffold were used to derive a preliminary list of neighbours for each Hox gene. These lists were curated to remove redundancy, extend gene models based on interspecific conservation and/or transcriptomic evidence where applicable, and check the locations of models on the scaffold. In all genomes except those of Apostichopus japonicus and A. planci (which has very large scaffolds and the best annotation quality of the species considered here), open reading frames longer than 100 amino acids were then scanned for further, unannotated genes using homology (species-specific Echinobase BLAST searches or general searches against the nr database) and the presence of conserved domains from the NCBI conserved domain database . Finally, putative Hox neighbours from each echinoderm species were searched against the genomes of the other species to detect any conserved synteny. (For more information about Hox neighbours, see Additional file 6.)
The authors thank the Ferrier group for discussion and the University of St Andrews, School of Biology for making RS a Visiting Scholar. Research in the Ferrier group is funded by the Leverhulme Trust, EU Horizon2020, BBSRC and School of Biology.
Availability of data and materials
All data generated or analysed during this study are included in this published article and the additional files.
RS carried out all data gathering and analyses. RS and DEKF developed the study design and wrote the manuscript. Both authors read and approved the final manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests. DEKF is a Section Editor for BMC Evolutionary Biology.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- DuBuc TQ, Stephenson TB, Rock AQ, Martindale MQ. Hox and Wnt pattern the primary body axis of an anthozoan cnidarian before gastrulation. Nat Commun. 2018;9:2007.View ArticleGoogle Scholar
- Ferrier DEK. The origin of the Hox/ParaHox genes, the ghost locus hypothesis and the complexity of the first animal. Brief Funct Genomics. 2016;15:333–41.View ArticleGoogle Scholar
- Monteiro AS, Ferrier DEK. Hox genes are not always colinear. Int J Biol Sci. 2006;2:95–103.View ArticleGoogle Scholar
- Duboule D. The rise and fall of Hox gene clusters. Development. 2007;134:2549–60.View ArticleGoogle Scholar
- Cameron RA, Rowen L, Nesbitt R, Bloom S, Rast JP, Berney K, Arenas-Mena C, Martinez P, Lucas S, Richardson PM, et al. Unusual gene order and organization of the sea urchin hox cluster. J Exp Zool B: Mol Dev Evol. 2006;306B:45–58.View ArticleGoogle Scholar
- Peterson KJ. Isolation of Hox and Parahox genes in the hemichordate Ptychodera flava and the evolution of deuterostome Hox genes. Mol Phyl Evol. 2004;31:1208–15.View ArticleGoogle Scholar
- Hara Y, Yamaguchi M, Akasaka K, Nakano H, Nonaka M, Amemiya S. Expression patterns of Hox genes in larvae of the sea lily Metacrinus rotundus. Dev Genes Evol. 2006;216:797–809.View ArticleGoogle Scholar
- Ikuta T, Miyamoto N, Saito Y, Wada H, Satoh N, Saiga H. Ambulacrarian prototypical Hox and ParaHox gene complements of the indirect-developing hemichordate Balanoglossus simodensis. Dev Genes Evol. 2009;219:383–9.View ArticleGoogle Scholar
- Freeman R, Ikuta T, Wu M, Koyanagi R, Kawashima T, Tagawa K, Humphreys T, Fang GC, Fujiyama A, Saiga H, et al. Identical genomic Organization of two Hemichordate Hox Clusters. Curr Biol. 2012;22:2053–8.View ArticleGoogle Scholar
- Baughman KW, McDougall C, Cummins SF, Hall M, Degnan BM, Satoh N, Shoguchi E. Genomic organization of Hox and ParaHox clusters in the echinoderm, Acanthaster planci. Genesis. 2014;52:952–8.View ArticleGoogle Scholar
- Jo J, Oh J, Lee HG, Hong HH, Lee SG, Cheon S, Kern EMA, Jin S, Cho SJ, Park JK, et al. Draft genome of the sea cucumber Apostichopus japonicus and genetic polymorphism among color variants. Gigascience. 2017;6:1–6.PubMedPubMed CentralGoogle Scholar
- Zhang X, Sun L, Yuan J, Sun Y, Gao Y, Zhang L, Li S, Dai H, Hamel JF, Liu C, et al. The sea cucumber genome provides insights into morphological evolution and visceral regeneration. PLoS Biol. 2017;15:e2003790.View ArticleGoogle Scholar
- Cook CE, Jiménez E, Akam M, Saló E. The Hox gene complement of acoel flatworms, a basal bilaterian clade. Evol Dev. 2004;6:154–63.View ArticleGoogle Scholar
- Jiménez-Guri E, Paps J, Garcia-Fernàndez J, Saló E. Hox and ParaHox genes in Nemertodermatida, a basal bilaterian clade. Int J Dev Biol. 2006;50:675–9.View ArticleGoogle Scholar
- Fritzsch G, Böhme MU, Thorndyke M, Nakano H, Israelsson O, Stach T, Schlegel M, Hankeln T, Stadler PF. PCR survey of Xenoturbella bocki Hox genes. J Exp Zool B: Mol Dev Evol. 2008;310B:278–84.View ArticleGoogle Scholar
- Hejnol A, Martindale MQ. Coordinated spatial and temporal expression of Hox genes during embryogenesis in the acoel Convolutriloba longifissura. BMC Biol. 2009;7:65.View ArticleGoogle Scholar
- Brauchle M, Bilican A, Eyer C, Bailly X, Martínez P, Ladurner P, Bruggmann R, Sprecher SG. Xenacoelomorpha survey reveals that all 11 animal Homeobox gene classes were present in the first Bilaterians. Genome Biol Evol. 2018;10:2205–17.View ArticleGoogle Scholar
- Philippe H, Brinkmann H, Martinez P, Riutort M, Baguñà J. Acoel flatworms are not Platyhelminthes: evidence from Phylogenomics. PLoS One. 2007;2:e717.View ArticleGoogle Scholar
- Philippe H, Brinkmann H, Copley RR, Moroz LL, Nakano H, Poustka AJ, Wallberg A, Peterson KJ, Telford MJ. Acoelomorph flatworms are deuterostomes related to Xenoturbella. Nature. 2011;470:255–8.View ArticleGoogle Scholar
- Cannon JT, Vellutini BC, Smith J 3rd, Ronquist F, Jondelius U, Hejnol A. Xenacoelomorpha is the sister group to Nephrozoa. Nature. 2016;530:59–93.View ArticleGoogle Scholar
- Lanfear R. Are the deuterostome posterior Hox genes a fast-evolving class? In: Deutsch J, editor. Hox genes: studies from the 20th to the 21st century. New York: Springer; 2010. p. 111–22.View ArticleGoogle Scholar
- Ferrier DEK, Minguillón C, Holland PWH, Garcia-Fernàndez J. The amphioxus Hox cluster: deuterostome posterior flexibility and Hox14. Evol Dev 2000;2:284–93.Google Scholar
- Aronowicz J, Lowe CJ. Hox gene expression in the hemichordate Saccoglossus kowalevskii and the evolution of deuterostome nervous systems. Integr Comp Biol. 2006;46:890–901.View ArticleGoogle Scholar
- Kuraku S, Takio Y, Tamura K, Aono H, Meyer A, Kuratani S. Noncanonical role of Hox14 revealed by its expression patterns in lamprey and shark. Proc Natl Acad Sci U S A. 2008;105:6679–83.View ArticleGoogle Scholar
- Powers TP, Amemiya CT. Evidence for a Hox14 paralog group in vertebrates. Curr Biol. 2004;14:R183–4.View ArticleGoogle Scholar
- Holland LZ, Albalat R, Azumi K, Benito-Gutiérrez È, Blow MJ, Bronner-Fraser M, Brunet F, Butts T, Candiani S, Dishaw LJ, et al. The amphioxus genome illuminates vertebrate origins and cephalochordate biology. Genome Res. 2008;18:1100–11.View ArticleGoogle Scholar
- Thomas-Chollier M, Ledent V, Leyns L, Vervoort M. A non-tree-based comprehensive study of metazoan Hox and ParaHox genes prompts new insights into their origin and evolution. BMC Evol Biol. 2010;10:73.View ArticleGoogle Scholar
- Kudtarkar P, Cameron RA. (2017). Echinobase: an expanding resource for echinoderm genomic information. Database. 2017;bax074. http://www.echinobase.org/Echinobase/. Accessed 21 Jun 2018.Google Scholar
- Delroisse J, Mallefet J, Flammang P. De novo adult transcriptomes of two European brittle stars: spotlight on opsin-based photoreception. PLoS One. 2016;11:e0152988.View ArticleGoogle Scholar
- Tsuchimoto J, Yamaguchi M. Hox expression in the direct-type developing sand dollar Peronella japonica. Dev Dyn. 2014;243:1020–9.View ArticleGoogle Scholar
- Simakov O, Kawashima T, Marlétaz F, Jenkins J, Koyanagi R, Mitros T, Hisata K, Bredeson J, Shoguchi E, Gyoja F, et al. Hemichordate genomes and deuterostome origins. Nature. 2015;527:459–65.View ArticleGoogle Scholar
- Tu Q, Cameron RA, Worley KC, Gibbs RA, Davidson EH. Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis. Genome Res. 2012;22:2079–87.View ArticleGoogle Scholar
- Tu Q, Cameron RA, Davidson EH. Quantitative developmental transcriptomes of the sea urchin Strongylocentrotus purpuratus. Dev Biol. 2014;385:160–7.View ArticleGoogle Scholar
- Stewart MJ, Stewart P, Rivera-Posada J. De novo assembly of the transcriptome of Acanthaster planci testes. Mol Ecol Resour. 2015;15:953–66.View ArticleGoogle Scholar
- Arenas-Mena C, Cameron AR, Davidson EH. Spatial expression of Hox cluster genes in the ontogeny of a sea urchin. Development. 2000;127:4631–43.PubMedGoogle Scholar
- Arenas-Mena C, Martinez P, Cameron RA, Davidson EH. Expression of the Hox gene complex in the indirect development of a sea urchin. Proc Natl Acad Sci U S A. 1998;95:13062–7.View ArticleGoogle Scholar
- Mooi R, David B. Radial symmetry, the anterior/posterior Axis, and echinoderm Hox genes. Annu Rev Ecol Evol Syst. 2008;39:43–62.View ArticleGoogle Scholar
- David B, Mooi R. How Hox genes can shed light on the place of echinoderms among the deuterostomes. EvoDevo. 2014;5:22.View ArticleGoogle Scholar
- Byrne M, Martinez P, Morris V. Evolution of a pentameral body plan was not linked to translocation of anterior Hox genes: the echinoderm HOX cluster revisited. Evol Dev. 2016;18:137–43.View ArticleGoogle Scholar
- Hui JHL, McDougall C, Monteiro AS, Holland PWH, Arendt D, Balavoine G, Ferrier DEK. Extensive chordate and annelid Macrosynteny reveals ancestral Homeobox Gene Organization. Mol Biol Evol. 2012;29:157–65.View ArticleGoogle Scholar
- Seo HC, Edvardsen RB, Maeland AD, Bjordal M, Jensen MF, Hansen A, Flaat M, Weissenbach J, Lehrach H, Wincker P, et al. Hox cluster disintegration with persistent anteroposterior order of expression in Oikopleura dioica. Nature. 2004;431:67–71.View ArticleGoogle Scholar
- Moreno E, Nadal M, Baguñà J. Tracking the origins of the bilaterian Hox patterning system: insights from the acoel flatworm Symsagittifera roscoffensis. Evol Dev. 2009;11:574–81.View ArticleGoogle Scholar
- Pascual-Anaya J, Adachi N, Álvarez S, Kuratani S, D’Aniello S, Garcia-Fernàndez J. Broken colinearity of the amphioxus Hox cluster. EvoDevo. 2012;3:28.View ArticleGoogle Scholar
- Zhong Y, Holland PWH. HomeoDB2: functional expansion of a comparative homeobox gene database for evolutionary developmental biology. Evol Dev. 2011;13:567–8 http://homeodb.zoo.ox.ac.uk/. Accessed 1 Jun 2016.View ArticleGoogle Scholar
- Howard-Ashby M, Materna SC, Brown CT, Chen L, Cameron RA, Davidson EH. Identification and characterization of homeobox transcription factor genes in Strongylocentrotus purpuratus, and their expression in embryonic development. Dev Biol. 2006;300:74–89.View ArticleGoogle Scholar
- Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–91.View ArticleGoogle Scholar
- Shen WF, Rozenfeld S, Lawrence HJ, Largman C. The Abd-B-like Hox homeodomain proteins can be subdivided by the ability to form complexes with Pbx1a on a novel DNA target. J Biol Chem. 1997;272:8198–206.View ArticleGoogle Scholar
- Katoh K, Toh H. Parallelization of the MAFFT multiple sequence alignment program. Bioinformatics. 2010;26:1899–900.View ArticleGoogle Scholar
- Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–4.View ArticleGoogle Scholar
- Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–21 http://www.atgc-montpellier.fr/phyml/. Accessed 23 Jun 2017.View ArticleGoogle Scholar
- Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61:539–42.View ArticleGoogle Scholar
- Keane TM, Creevey CJ, Pentony MM, Naughton TJ, Mclnerney JO. Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol Biol. 2006;6:29.View ArticleGoogle Scholar
- Bailey TL, Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol. 1994;2:28–36.PubMedGoogle Scholar
- Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. MEME Suite: tools for motif discovery and searching. Nucleic Acids Res. 2009;37:W202–8 http://meme-suite.org/. Accessed 13 Dec 2017.View ArticleGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE. WebLogo: a sequence logo generator. Genome Res. 2004;14:1188–90 https://weblogo.berkeley.edu/. Accessed 16 Dec 2016.View ArticleGoogle Scholar
- Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu S, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017;45:D200–3 https://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi. Accessed 22 Jun 2018.View ArticleGoogle Scholar
- NCBI Genome https://www.ncbi.nlm.nih.gov/genome. Accessed 19 Jun 2018.
- NCBI Sequence Read Archive. https://www.ncbi.nlm.nih.gov/sra. Accessed 23 Jun 2017.
- The Apostichopus japonicus genome Database http://www.genedatabase.cn/aja_genome_20161129.html. Accessed 31 Oct 2018.
- OIST Mar Genomics Portal. http://marinegenomics.oist.jp/gallery/. Accessed 21 Jun 2018.
- Cannon JT, Kocot KM, Waits DS, Weese DA, Swalla BJ, Santos SR, Halanych KM. Phylogenomic resolution of the hemichordate and echinoderm clade. Curr Biol. 2014;24:2827–32.View ArticleGoogle Scholar