- Research article
- Open Access
Identification of the Otopetrin Domain, a conserved domain in vertebrate otopetrins and invertebrate otopetrin-like family members
BMC Evolutionary Biologyvolume 8, Article number: 41 (2008)
Otopetrin 1 (Otop1) encodes a multi-transmembrane domain protein with no homology to known transporters, channels, exchangers, or receptors. Otop1 is necessary for the formation of otoconia and otoliths, calcium carbonate biominerals within the inner ear of mammals and teleost fish that are required for the detection of linear acceleration and gravity. Vertebrate Otop1 and its paralogues Otop2 and Otop3 define a new gene family with homology to the invertebrate Domain of Unknown Function 270 genes (DUF270; pfam03189).
Multi-species comparison of the predicted primary sequences and predicted secondary structures of 62 vertebrate otopetrin, and arthropod and nematode DUF270 proteins, has established that the genes encoding these proteins constitute a single family that we renamed the Otopetrin Domain Protein (ODP) gene family. Signature features of ODP proteins are three "Otopetrin Domains" that are highly conserved between vertebrates, arthropods and nematodes, and a highly constrained predicted loop structure.
Our studies suggest a refined topologic model for ODP insertion into the lipid bilayer of 12 transmembrane domains, and highlight conserved amino-acid residues that will aid in the biochemical examination of ODP family function. The high degree of sequence and structural similarity of the ODP proteins may suggest a conserved role in the intracellular trafficking of calcium and the formation of biominerals.
Otopetrin1 (Otop1) is the first described member of the otopetrin family, a novel gene family that encodes multi-transmembrane domain proteins. The family was named for the conserved role of Otop1 in the formation of otoconia and otoliths – "oto" (ear) and "petros" (stone). Otoconia are complex calcium carbonate biominerals in the utricle and saccule of the vertebrate inner ear that are required for the normal sensation of linear acceleration and gravity. Degeneration or displacement of otoconia can lead to vertigo and loss of balance [1–5]. Three mutant mice and one zebrafish model with mutations in Otop1 have been described: tilted (tlt) ; mergulhador (mlh) ; inner ear defect (ied) ; and backstroke (bks) , respectively. All of these mutants lack otoconia or otoliths, but have normal inner ear development. In zebrafish, the morpholino knockdown of Otop1 phenocopies the tlt mutation, showing otolith agenesis with no disruption of the patterning of the developing inner ear [9, 10].
The otopetrin family in most vertebrates studied consists of three genes clustered in two chromosomal locations: Otop1 (i.e., human Chr 4p16, mouse Ch5B2) and the paralogous tandem genes Otop2 and Otop3 (i.e., human Ch17q24-25, mouse Ch11E2). Vertebrate otopetrins share a conserved gene and protein structure, with no homology to other transporters, channels, exchangers, or receptors. A preliminary secondary structure prediction based on the human, mouse, rat, zebrafish, and fugu protein sequences suggested a topology of ten transmembrane domains (TM) with cytosolic amino and carboxy termini. Additionally, tBlastn searches in the EST and genomic databases identified regions of homology with the DUF270 domain in a number of arthropod and nematode proteins. DUF270 (pfam03189) is a 404 amino-acid consensus sequence of unknown function that defines the DUF270 family, with members in C. elegans and D. melanogaster. The two regions of maximum homology with DUF270 found in vertebrate otopetrins correspond to putative TM domains 3–5 and 9–10, respectively, and were initially designated DUF270-I and DUF270-II .
Here, we report a comparison of evolutionary constraint and hydropathy profile analysis of 62 vertebrate otopetrins and arthropod and nematode DUF270 proteins, demonstrating that the genes that encode these proteins constitute a single family that we have renamed the Otopetrin Domain Protein (ODP) gene family. The refined topologic model of the ODP proteins includes 12 putative TM domains clustered into three "Otopetrin Domains" (OD-I, -II, and -III, respectively), with a strong degree of sequence conservation across widely divergent groups of metazoa. These regions of highest homology and evolutionary constraint, including the FYR box in the cytoplasmic tail, may represent important functional sub-domains. Biochemical studies in transfected cells show that Otop1 modulates the manner in which cells handle intracellular calcium in response to purinergic stimuli . The lack of known functional domains, such as ATP-binding domains, selectivity pores, or G-protein-binding consensus sequences, suggests that either the ODP family has a novel function that significantly differs from the activities of known channels, transporters, or receptors, or that the ODP genes encode novel functional motifs. We hypothesize that these motifs would likely occur within the evolutionarily constrained regions, as has been shown for other well-conserved gene families . The challenge remains to define the functional domains of the ODP family, with sequence and analyses reported here providing a step in that direction.
Results and Discussion
Comparative sequence data set
The annotation of the Otop1, Otop2, and Otop3 genes in the human, mouse, rat, zebrafish, and fugu genomes is described elsewhere . Orthologous otopetrin sequences were generated using a targeted sequencing approach (from dog, cow, armadillo and western clawed frog) (see methods in [13, 14]) or identified through tBlastn searches of available whole-genome sequences. The phylogenetic relationships of vertebrate otopetrin and arthropod and nematode DUF270 genes were deduced from a total of 62 complete or nearly complete open reading frames in 25 species (see Table 1 for a listing of the specific species and accession numbers). Fragmentary, but clearly otopetrin-related, sequences were also identified in urochordates (ciona), echinoderms (urchin), and cnidarians (nematostella), however were not complete enough to include in this analysis.
Phylogenetic relationships and revised nomenclature of vertebrate otopetrins and arthropod and nematode DUF270 genes
A maximum-likelihood phylogenetic tree was created from the multi-sequence alignment of each encoded protein (Figure 1). The vertebrate, arthropod, and nematode sequences form distinct monophyletic groups, each containing three or more paralogous groups. This arrangement suggests that the ancestral metazoan genome may have contained a single otopetrin-like gene, with subsequent duplications giving rise to the paralogs in the different phyla after the three lineages diverged. Based on the positions in the tree of the named mouse and human sequences, the three vertebrate paralogous groups correspond to Otop1, Otop2, and Otop3. Otop2 and Otop3 are more closely related to each other than either is to Otop1, a clustering that parallels the genomic organization of the Otop genes in the vertebrate genomes. The arthropod and nematode DUF270 sequences, in which encoded proteins cluster independently in the tree from the vertebrate otopetrin sequences, have been renamed as otopetrin-like proteins (OTOPL), and the paralogous groups have been assigned arbitrary letters. This is in agreement with the HUGO gene nomenclature committee guidelines for gene families and grouping . Like vertebrates, arthropods also have three paralogous groups of OTOPLs. The grouping in nematodes is more complex: there appears to be three major groups of OTOPLs, as in vertebrates and arthropods, but each group itself contains two or more paralogous groups as a result of species-specific gene duplications. In summary, vertebrate otopetrins and arthropod and nematode OTOPL genes have been grouped as a single family that we named collectively the Otopetrin Domain Protein (ODP, see below) gene family.
Refined topological model for ODP insertion into the lipid bi-layer
Conserved primary sequence is indicative of an underlying conserved tertiary structure, and the evolutionary information contained in an alignment of related sequences can be leveraged to improve predictions of shared structures . We took advantage of the deep multi-sequence alignment and phylogenetic tree of the ODP family to reexamine the predicted topology of the ODPs (Figure 2A). A hydropathy profile was generated that employs phylogenetic averaging  on hydropathy scale values for amino acids  to improve the detection of conserved hydrophobic regions, which might correspond to TM domains. The hydropathy profile revealed 12 strong hydrophobic regions, ten of which overlap with the originally predicted TM domains . Likewise, the MEMSAT3  and TMAP  algorithms, which take into account leveraged evolutionary information, also predicted 12 TM helices for ODP family members that overlap well with the constrained regions and hydrophobic regions in our profile (Figure 2A).
The refined topological model for the ODP family thus consists of 12 TM domains, with both the N- and C-termini in the cytosol, and in which the two newly identified TM domains are TM4 and TM10, respectively. As shown in Figure 2B, there are three discrete regions with maximum evolutionary constraint among vertebrates, arthropods and nematodes, which we have designated Otopetrin Domain (OD) -I, -II, and -III, respectively. Among the TM domains, TM2 and TM8 show the poorest conservation and evolutionary constraint across species. On the other hand, the loops connecting the TM domains show little sequence conservation or evolutionary constraint, strongly suggesting that the TM domains are the primary functional regions of the ODP family (Figure 2A and Additional file 1). Despite the poor loop sequence conservation, the number of amino acids in 8 of the 11 loops separating TM domains is highly conserved (Table 2), suggesting that the spacing of most of the TM domains relative to one another may be important for the tertiary structure and function of ODP family members. Of note, the length of loop 5, within OD-I, is highly variable across all phyla, but conserved in vertebrates (48 ± 4 amino acid residues), as are all other loops except for loop 10.
Homology between Otop and OTOPL sequences extends beyond the canonical DUF270 domain
DUF270 (pfam03189) is a 404 amino-acid consensus sequence of unknown function. Early tBlastn-based database searches identified regions of homology with the DUF270 domain in both vertebrate Otop and arthropod and nematode OTOPL proteins , now grouped together as the ODP family. Inspection of the multi-species ODP sequence alignment suggests that the homology among ODP proteins extends beyond the canonical DUF270 domain (see Additional file 1). Specifically, the N-terminal end of the DUF270 consensus sequence can be extended to include three amino acids (HAG, amino acids 125–127 in mouse Otop1) that are conserved in most vertebrate (HAG) and nematode (GAG) ODPs examined. At the C-terminal end, the amino-acid conservation continues well beyond the DUF270 motif to include the entire C-terminal tail of vertebrate Otop (amino acids 584–600 in mouse Otop1). A 14-amino-acid consensus sequence for this highly conserved C-terminal tail, which we named the FYR box, is shown in Figure 3. The FYR box is a signature unique to the ODP family, and is present in all ODP proteins but not in any non-ODP sequences in the databases of ESTs and non-redundant sequences.
Comparative analyses of vertebrate otopetrins and arthropod and nematode OTOPL proteins revealed that they all share a TM domain structure and significant conservation of amino-acid sequence, suggesting that they constitute a single protein family, here renamed the ODP family. We have expanded the domains of homology to more accurately reflect the extent of sequence conservation between vertebrates, arthropods and nematodes, and have identified three evolutionarily constrained TM domain-rich areas that we have designated as Otopetrin Domains.
OD-I and OD-III are the most highly conserved regions of the ODP family. Tlt mice carry a missense mutation (Ala151→Glu), which alters the hydrophobicity of the predicted TM3 domain within OD-I, and leads to a presumed alteration in the membrane insertion or activity of Otop1 and otoconial agenesis . The OD-II evolutionarily constrained region was not identified in the initial modeling, but mutations in Otop1 within this conserved segment of the protein have been shown to cause otolith/otoconial agenesis in bks mutant fish (Glu429→Val)  and in mlh mutant mice (Leu408→Gln)  (Figure 2B), suggesting that this region is functionally important.
Initial modeling of the OTOP proteins suggested a 10 TM domain model with cytosolic N- and C-termini . This model had several problems, including that sites consistent with the consensus sequence for N-glycosylation were predicted to be cytosolic. The 12 TM domain model predicted by hydrophobicity and evolutionary constraint analysis places the proposed glycosylation sites in the extracellular space (Figure 2B), and suggests that it may reflect a more accurate version of OTOP insertion into the lipid bilayer. Interestingly, the missense mutations in the tlt, mlh, and bks animal models, which lead to functional loss of OTOP1 activity, each occur within highly conserved transmembrane domains; such mutations often alter the hydrophobicity of the conserved TM domain, which may lead to alterations in the ability of the protein to insert and orient in membranes.
Otop1 is required for the formation of vertebrate otoconia, a process that involves calcium carbonate biomineralization and requires the regulation of intracellular calcium. Biochemical studies in transfected cells show that OTOP1 modulates the manner in which cells handle intracellular calcium in response to purinergic stimuli . The mechanisms of calcium carbonate biomineralization are highly conserved in the development of otoconia and otoliths in the vertebrate inner ear, the formation of the avian eggshell, the mineralization of the arthropod exoskeleton, and the development of other mineralized structures such as the mollusk shell [21–23]. There is evidence that some ODP family members are expressed in tissues associated with calcium secretion and calcium carbonate-based mineralization. In particular, ESTs from Callinectes sapidus (Blue crab) reveal strong expression of the D. melanogaster OTOPLb ortholog in hypodermal tissues that are required for calcium mobilization during the mineralization of the chitinous exoskeleton . ODP mRNAs are also expressed in the hemocytes of various invertebrate species, which have been associated with the development of mineralized structures in mollusks . In mammals, Otop1 is expressed in the lactating mammary gland , perhaps functioning in the secretion of calcium into milk. Taken together, the sequence homology, structural constraint, and expression pattern suggest a conserved role for members of the ODP family in the formation of mineralized structures. Further examination of ODPs and continued characterization of natural and induced mutations in these proteins through both physiologic and topologic studies may assist in better understanding the mechanisms of establishing and maintaining mineralized structures throughout the animal kingdom.
Orthologous Otopetrin sequences were generated by a targeted sequencing approach, or identified through tBlastn searches of available whole-genome sequences. For the targeted sequencing, BAC clones were isolated from the following libraries maintained by the BACPAC Resources Center [14, 26, 27]: dog (Canis familiaris; RPCI-81), cow (Bos Taurus; CHORI-240), armadillo (Dasypus novemcinctus; VMRC-5), and western clawed frog (Xenopus tropicalis; CHORI-216). Specifically, each library was screened using pooled sets of oligonucleotide-based probes designed from the established sequence of the mouse Otop1 or Otop2/Otop3 subloci (on mouse Ch5B2 and Ch11E2, respectively). After isolation and mapping, a total of four BACs (accession numbers AC148430, AC149469, AC147459, and AC166187) were shotgun sequenced and subjected to sequence finishing, as described . The complete gene structures were determined based on alignments to mouse RefSeq mRNAs or species-specific mRNA, when available. For the tBlastn searches, we used mouse Otop1, -2, and -3 to query vertebrate genome sequences, and Drosophila OTPLa, -b, and -c and C. elegans OTOPLd1, -e, -f, -g, -h, and -i to query arthropod and nematode genome sequences (see Table 1 for sequence accession numbers).
Alignment, phylogenic tree generation, and evolutionary constraint versus hydropathy analysis
The initial protein sequence alignment was performed with ProbCons , and a preliminary phylogenetic tree was built with SEMPHY  using only the most confidently aligned regions of the multi-sequence alignment. The sequences were then divided into smaller groups based upon their relatedness according to the tree. Each group was re-aligned with Probcons, and each of these sub-alignments was manually adjusted. ClustalW [31, 32] was then used to profile-align these sub-alignments, producing the final, full alignment. The final phylogenetic tree was constructed using SEMPHY, constraining the topology to conform to SEMPHY trees built from the sub-alignments. 1000 bootstrap replicates were generated for each subtree as well as the final tree. The bootstrap values shown in Figure 1 are from the lowest-level tree in which the given branch occurs.
Evolutionarily constrained regions were detected essentially as described previously . The final alignment and tree were used to calculate single-site evolutionary rates with the empirical Bayesian version of the program Rate4Site . These single-site rate values were smoothed using sliding-windows of weighted averaging. In each 17-position-wide window, the value at the center position of the window was given the highest relative weight, and the relative weight decreased linearly for the values on either side to the edge of the window. The resulting weighted average was assigned to the position in the protein corresponding to the center of the window. To produce the evolutionary constraint profile, the rates were then converted to relative constraint by normalizing to a range between 0 and 1, inverted by subtracting from 1 (because a region of low evolutionary rate is under high evolutionary constraint), and plotted against the position in the protein.
To produce the hydropathy profile, the hydropathy-scale value  for each amino acid in a column of the multi-sequence alignment (corresponding to a single position on the profile) was multiplied by a weighting factor that reflects the fractional contribution of the corresponding sequence to the total sequence diversity represented . The hydropathy score at each position is the sum of these values. These single-position values were smoothed using the same sliding-windows weighted averaging scheme applied to the rate values above, normalized to vary between 0 and 1, and plotted against the position in the protein.
Gizzi M, Ayyagari S, Khattar V: The familial incidence of benign paroxysmal positional vertigo. Acta Otolaryngol. 1998, 118 (6): 774-777. 10.1080/00016489850182422.
Oghalai JS, Manolidis S, Barth JL, Stewart MG, Jenkins HA: Unrecognized benign paroxysmal positional vertigo in elderly patients. Otolaryngol Head Neck Surg. 2000, 122 (5): 630-634. 10.1016/S0194-5998(00)70187-2.
Oas JG: Benign paroxysmal positional vertigo: a clinician's perspective. Ann N Y Acad Sci. 2001, 942: 201-209.
Tusa RJ: Benign paroxysmal positional vertigo. Curr Neurol Neurosci Rep. 2001, 1 (5): 478-485. 10.1007/s11910-001-0110-y.
Bronstein AM: Benign paroxysmal positional vertigo: some recent advances. Curr Opin Neurol. 2003, 16 (1): 1-3. 10.1097/00019052-200302000-00001.
Lane P: Tilted (tlt). Mouse News Lett. 1986, 75: 28-
Hurle B, Ignatova E, Massironi SM, Mashimo T, Rios X, Thalmann I, Thalmann R, Ornitz DM: Non-syndromic vestibular disorder with otoconial agenesis in tilted/mergulhador mice caused by mutations in otopetrin 1. Hum Mol Genet. 2003, 12 (7): 777-789. 10.1093/hmg/ddg087.
Besson V, Nalesso V, Herpin A, Bizot JC, Messaddeq N, Romand R, Puech A, Blanquet V, Herault Y: Training and aging modulate the loss-of-balance phenotype observed in a new ENU-induced allele of Otopetrin1. Biol Cell. 2005, 97 (10): 787-798. 10.1042/BC20040525.
Sollner C, Schwarz H, Geisler R, Nicolson T: Mutated otopetrin 1 affects the genesis of otoliths and the localization of Starmaker in zebrafish. Dev Genes Evol. 2004, 214 (12): 582-590. 10.1007/s00427-004-0440-2.
Hughes I, Blasiole B, Huss D, Warchol ME, Rath NP, Hurle B, Ignatova E, Dickman JD, Thalmann R, Levenson R: Otopetrin 1 is required for otolith formation in the zebrafish Danio rerio. Dev Biol. 2004, 276 (2): 391-402. 10.1016/j.ydbio.2004.09.001.
Hughes I, Saito M, Schlesinger PH, Ornitz DM: Otopetrin1 activation by purinergic nucleotides regulates intracellular calcium. Proc Natl Acad Sci USA. 2007, 104 (29): 12023-12028. 10.1073/pnas.0705182104.
Simon AL, Stone EA, Sidow A: Inference of functional regions in proteins by quantification of evolutionary constraints. Proc Natl Acad Sci USA. 2002, 99 (5): 2912-2917. 10.1073/pnas.042692299.
Thomas JW, Green ED: Comparative sequence analysis of a single-gene conserved segment in mouse and human. Mamm Genome. 2003, 14 (10): 673-678. 10.1007/s00335-003-2300-1.
Thomas JW, Prasad AB, Summers TJ, Lee-Lin SQ, Maduro VV, Idol JR, Ryan JF, Thomas PJ, McDowell JC, Green ED: Parallel construction of orthologous sequence-ready clone contig maps in multiple species. Genome Res. 2002, 12 (8): 1277-1285. 10.1101/gr.283202.
HUGO Gene Nomenclature Committee. [http://www.genenames.org/]
Przybylski D, Rost B: Alignments grow, secondary structure prediction improves. Proteins. 2002, 46 (2): 197-205. 10.1002/prot.10029.
Stone EA, Sidow A: Constructing a meaningful evolutionary average at the phylogenetic center of mass. BMC Bioinformatics. 2007, 8: 222-10.1186/1471-2105-8-222.
Kyte J, Doolittle RF: A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982, 157 (1): 105-132. 10.1016/0022-2836(82)90515-0.
Jones DT: Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics. 2007, 23 (5): 538-544. 10.1093/bioinformatics/btl677.
Persson B, Argos P: Prediction of transmembrane segments in proteins utilising multiple sequence alignments. J Mol Biol. 1994, 237 (2): 182-192. 10.1006/jmbi.1994.1220.
Wilt FH: Developmental biology meets materials science: Morphogenesis of biomineralized structures. Dev Biol. 2005, 280 (1): 15-25. 10.1016/j.ydbio.2005.01.019.
Fekete DM: Developmental biology. Rocks that roll zebrafish. Science. 2003, 302 (5643): 241-242. 10.1126/science.1091171.
Hughes I, Thalmann I, Thalmann R, Ornitz DM: Mixing model systems: Using zebrafish and mouse inner ear mutants and other organ systems to unravel the mystery of otoconial development. Brain Res. 2006, 1091 (1): 58-74. 10.1016/j.brainres.2006.01.074.
Wheatly MG: Calcium homeostasis in crustacea: the evolving role of branchial, renal, digestive and hypodermal epithelia. J Exp Zool. 1999, 283 (7): 620-640. 10.1002/(SICI)1097-010X(19990601)283:7<620::AID-JEZ2>3.0.CO;2-3.
Mount AS, Wheeler AP, Paradkar RP, Snider D: Hemocyte-mediated shell mineralization in the eastern oyster. Science. 2004, 304 (5668): 297-300. 10.1126/science.1090506.
BACPAC Resources Center. [http://bacpac.chori.org]
Thomas JW, Touchman JW, Blakesley RW, Bouffard GG, Beckstrom-Sternberg SM, Margulies EH, Blanchette M, Siepel AC, Thomas PJ, McDowell JC: Comparative analyses of multi-species sequences from targeted genomic regions. Nature. 2003, 424 (6950): 788-793. 10.1038/nature01858.
Blakesley RW, Hansen NF, Mullikin JC, Thomas PJ, McDowell JC, Maskeri B, Young AC, Benjamin B, Brooks SY, Coleman BI: An intermediate grade of finished genomic sequence suitable for comparative analyses. Genome Res. 2004, 14 (11): 2235-2244. 10.1101/gr.2648404.
Do CB, Mahabhashyam MS, Brudno M, Batzoglou S: ProbCons: Probabilistic consistency-based multiple sequence alignment. Genome Res. 2005, 15 (2): 330-340. 10.1101/gr.2821705.
Friedman N, Ninio M, Pe'er I, Pupko T: A structural EM algorithm for phylogenetic inference. J Comput Biol. 2002, 9 (2): 331-353. 10.1089/10665270252935494.
Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD: Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. 2003, 31 (13): 3497-3500. 10.1093/nar/gkg500.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-4680. 10.1093/nar/22.22.4673.
Mayrose I, Graur D, Ben-Tal N, Pupko T: Comparison of site-specific rate-inference methods for protein sequences: empirical Bayesian methods are superior. Mol Biol Evol. 2004, 21 (9): 1781-1791. 10.1093/molbev/msh194.
This research was supported by National Institute on Deafness and Other Communication Disorders Grants DC02236 (DMO), DC06974 (IH), and in part by the Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health. We thank Linda Lobos for assembling loop length data. We thank numerous people associated with the NISC Comparative Sequencing Program, in particular Robert Blakesley, Gerry Bouffard, Jennifer McDowell, Baishali Maskeri, Nancy Hansen, Morgan Park, Pamela Thomas, Alice Young and the many dedicated mapping, sequencing and finishing technicians.
IH carried out the analysis and drafted the manuscript. JB carried out the analysis and drafted the manuscript. BH carried out the analysis and drafted the manuscript. EDG edited the manuscript. NISC Comparative Sequencing Program provided sequence data. AS edited the manuscript. DMO carried out the analysis and drafted the manuscript. All authors read and approved the final manuscript.
Inna Hughes, Jonathan Binkley, Belen Hurle contributed equally to this work.