The evolutionary position of nematodes
© Blair et al 2002
Received: 18 February 2002
Accepted: 08 April 2002
Published: 08 April 2002
Skip to main content
© Blair et al 2002
Received: 18 February 2002
Accepted: 08 April 2002
Published: 08 April 2002
The complete genomes of three animals have been sequenced by global research efforts: a nematode worm (Caenorhabditis elegans), an insect (Drosophila melanogaster), and a vertebrate (Homo sapiens). Remarkably, their relationships have yet to be clarified. The confusion concerns the enigmatic position of nematodes. Traditionally, nematodes have occupied a basal position, in part because they lack a true body cavity. However, the leading hypothesis now joins nematodes with arthropods in a molting clade, Ecdysozoa, based on data from several genes.
We tested the Ecdysozoa hypothesis with analyses of more than 100 nuclear protein alignments, under conditions that would expose biases, and found that it was not supported. Instead, we found significant support for the traditional hypothesis, Coelomata. Our result is robust to different rates of sequence change among genes and lineages, different numbers of taxa, and different species of nematodes.
We conclude that insects (arthropods) are genetically and evolutionarily closer to humans than to nematode worms.
Traditionally, the animal body cavity (coelom) has played a major role in interpretations of metazoan evolution, from groups (e.g., flatworms) lacking a coelom to those (e.g., nematodes) with a false coelom and finally to the bulk of animal phyla having a true coelom (Coelomata) [1, 2]. There has never been complete agreement on animal phylogeny and classification, but most researchers have divided living coelomate animals into deuterostomes (echinoderms, hemichordates, urochordates, cephalochordates, and vertebrates) and protostomes (arthropods, annelids, mollusks, and other phyla) based on differences in early embryonic development. An analysis of small subunit ribosomal RNA (18S rRNA) sequences challenged this arrangement by placing acoelomate and pseudocoelomate phyla in more derived positions among the protostomes, and in further defining a clade (Ecdysozoa) of molting animals that includes arthropods and nematodes . This "Ecdysozoa" hypothesis has influenced diverse fields  and interpretations of developmental evolution in animals [5–7]. Since its publication, evidence has appeared both for and against this hypothesis [8–15]. Knowing the branching order of the major animal lineages, especially those three with fully sequenced genomes, is of importance to diverse fields such as medical genetics, physiology, neurobiology, paleontology, and astrobiology. With a genealogy of animals, it will be easier to determine the origins and inheritance of mutations, genes, gene functions, and structures.
To test the stability of Coelomata to taxon sampling, we included new sequences of the planarian Dugesia japonica (Phylum Platyhelminthes) in 100 five-taxon protein alignments (Supplemental Table 1, five-taxon analysis). The results, upon concatenation with this additional taxon, were unchanged (Fig. 2B): Coelomata continued to be significantly supported (≥ 98% bootstrap support, posterior probability = 1.0) and the alternative hypotheses were both rejected using the SH test (P < 0.001), although the relationships among the basal phyla (Nematoda and Platyhelminthes) could not be resolved.
Another potential bias is the specific taxa included in an analysis. For example, the original support for Ecdysozoa was obtained only with a particular genus of nematode, Trichinella, that had a short branch in the 18S rRNA tree . To test this, phylogenies were constructed using different species of nematodes. The Coelomata hypothesis was significantly supported (≥ 97% bootstrap support, posterior probabilities = 1.0; Ecdysozoa rejected by SH test, P < 0.006; Hypothesis III rejected by SH test, P < 0.027) using either a genus in a different order, Brugia (18 proteins), or Trichinella (six proteins) (Fig. 2C, 2D). To further address the possibility that these results could be biased by taxon sampling, we included representatives from all available phyla for each protein. The results indicate that an increase in the number of taxa does not decrease single-protein support for Coelomata; in fact, the trend is the reverse (Fig. 2E). Simulation studies have shown that incomplete taxon sampling does not increase topological errors, and that most error is caused by limited sequence data .
In the initial study defining Ecdysozoa , rate variation was considered to be the major bias affecting the phylogenetic position of nematodes. In the 18S rRNA gene, nematodes typically have long branches indicating an increased rate of sequence change. Other nuclear genes also show this pattern, but to a lesser degree [8, 9]. Phylogenetic methods can accommodate moderate amounts of rate variation among lineages without producing an incorrect phylogeny . However, if the rate of change is sufficiently large, longer branches in a phylogeny will sometimes attract one another . If that happens, an ingroup species with a long branch may move to a more basal position in the tree. In analyses of the 18S rRNA gene, nematodes typically appear basal to arthropods + vertebrates. Because the use of a short branch nematode (Trichinella) resulted in a tree whereby nematodes clustered with arthropods, the basal position of nematodes in typical 18S analyses has been interpreted as long-branch attraction .
The importance of knowing the branching order of these species is illustrated by the immediate and wide acceptance of the Ecdysozoa hypothesis and its use in tracing patterns of developmental evolution [5–7, 10]. However, in the initial analysis of 18S rRNA sequences , Ecdysozoa was statistically significant only when a paralinear distance method was used; three other methods did not yield significant bootstrap support. In that study, Ecdysozoa also was not significant, using any method, when the flatworm sequence was included . Subsequent analyses of the 18S rRNA gene have been interpreted differently [24, 25], but none has yielded statistically significant results supporting Ecdysozoa. Moreover, the molting cuticles of arthropods (chitin) and nematodes (collagen) are not homologous . The significance of other morphological characteristics bearing on the position of nematodes continues to be debated .
Besides the 18S rRNA evidence, other genetic evidence for the grouping of nematodes and arthropods has come from qualitative interpretations of Hox gene  and β-thymosin  evolution. In the case of Hox genes, support comes from a single posterior gene sequence (Y75B8A.1) of the nematode Caenorhabditis elegans argued to have greater amino acid similarity with a posterior Hox genes of Drosophila and Priapulus . Unfortunately, the Hox homeodomain is a short (60 amino acid) region with many sequence differences between these taxa. Definition of "sequence signatures" is qualitative and has not been tested statistically. In a subsequent study of nematode posterior Hox genes, other researchers were unable to determine if the simple nematode Hox cluster of six genes is an ancestral or a derived condition .
In the case of β-thymosin, a sequence signature also has been argued to support a grouping of Drosophila and Caenorhabditis . However, it is a gene family known to have paralogs within animals, the position of introns differs between sequences from the two species, and only four other metazoan taxa were surveyed. In addition, knowing the presence or absence of a gene can be problematic without the complete genome sequence of an organism (in this case, genomes were known only in Drosophila and Caenorhabditis). Thus, although suggestive, it is too soon to judge the significance of this sequence signature. One difficulty with interpreting such qualitative evidence, including Hox gene orthology, is that almost any pattern can be found in nature if one looks. In other words, sequence signatures have not yet been surveyed systematically and objectively. In contrast, sequence evidence from randomly selected genes, analyzed phylogenetically, provides a more unbiased database amenable to statistical analysis.
Although it is possible that a basal position of nematodes is the result of some unknown and widespread bias not yet identified, a simpler explanation is that the grouping of nematodes with arthropods is an artifact that arose from the analysis of a single gene, 18S rRNA. The results presented here suggest caution in revising animal phylogeny from analyses of one or a few genes or sequence signatures. Although many other aspects of animal phylogeny remain unresolved, our results indicate that insects (arthropods) are genetically and evolutionarily closer to humans (vertebrates) than to nematodes.
DNA sequences from Dugesia japonica  were used to search the public protein database (Entrez) for orthologous counterparts in Drosophila melanogaster, Caenorhabditis elegans, Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae, and Schizosaccharomyces pombe. When available, sequences from other major animal phyla (e.g., Mollusca, Echinodermata, Annelida) also were obtained. In addition, the database was searched for all proteins from two other nematodes, Brugia and Trichinella, with orthologs in Drosophila, Homo, and Arabidopsis, Saccharomyces, or Schizosaccharomyces. Arabidopsis was used as the primary outgroup (95 out of 100 proteins in the four-taxon analysis, 94 out of 100 proteins for the five-taxon analysis, all proteins in Brugia and Trichinella analyses); yeast was used as the outgroup when Arabidopsis sequences were unavailable or paralogous. This was because many more genes were available for rooting with the plant than with the fungus. All three kingdoms are about equidistant from each other in terms of branch lengths  and therefore a plant serves about equally well for rooting an animal phylogeny as does a fungus. Orthology was assessed using reciprocal BLAST searches of the public protein database; those sequences receiving high scores in each search were also analyzed phylogenetically to ensure orthology. Short (<100 amino acids) sequences were omitted.
Sequences were aligned using Clustal X  and each alignment was visually inspected. Primary analyses of aligned protein data sets were conducted in MEGA2 . Phylogenies were reconstructed using neighbor-joining  under a Poisson correction and a gamma distance (α = 2, or estimated from the data for combined analyses), with bootstrapping (2000 replications) for all analyses. Gamma parameters were estimated from the combined data using maximum likelihood under a Poisson correction  (4-taxon, α = 1.62; 5-taxon, α = 0.94, Brugia, α = 0.87; Trichinella, α = 0.66). In addition, phylogenetic analyses were conducted with maximum likelihood (JTT-F option)  and maximum parsimony (Max-Mini Branch & Bound option)  on combined data sets; in all cases they resulted in similar results (topology and significance) to the neighbor-joining analyses. Posterior probabilities of concatenated files were computed using Bayesian inference  (Jones model with gamma estimated from data; 10,000 generations; 4 chains with temp = 0.2). Shimodaira-Hasegawa tests  were performed in PAML  (JTT-F option, fixed gamma); p-values for each topology were recorded.
Rate constancy was assessed using a chi-square test  under increasing stringency (5, 10, 40% significance levels); p-values were recorded for each protein. A Z-test  was also used under increasing stringency; z-values were recorded for each protein. Proteins determined to be rate constant at different significance levels were concatenated and analyzed in MEGA2 . Nematode position and evolutionary distance were determined for each concatenation. New sequences, accession numbers of sequences, and sequence alignments may be found at the Evogenomics website http://www.evogenomics.org/publications/data/nematode/.
This work was supported by grants from the NASA Astrobiology Institute and National Science Foundation (to S.B.H.).
This article is published under license to BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.