- Research article
- Open Access
Phylogenetic analysis of ferlin genes reveals ancient eukaryotic origins
© Lek et al; licensee BioMed Central Ltd. 2010
- Received: 12 April 2010
- Accepted: 29 July 2010
- Published: 29 July 2010
The ferlin gene family possesses a rare and identifying feature consisting of multiple tandem C2 domains and a C-terminal transmembrane domain. Much currently remains unknown about the fundamental function of this gene family, however, mutations in its two most well-characterised members, dysferlin and otoferlin, have been implicated in human disease. The availability of genome sequences from a wide range of species makes it possible to explore the evolution of the ferlin family, providing contextual insight into characteristic features that define the ferlin gene family in its present form in humans.
Ferlin genes were detected from all species of representative phyla, with two ferlin subgroups partitioned within the ferlin phylogenetic tree based on the presence or absence of a DysF domain. Invertebrates generally possessed two ferlin genes (one with DysF and one without), with six ferlin genes in most vertebrates (three DysF, three non-DysF). Expansion of the ferlin gene family is evident between the divergence of lamprey (jawless vertebrates) and shark (cartilaginous fish). Common to almost all ferlins is an N-terminal C2-FerI-C2 sandwich, a FerB motif, and two C-terminal C2 domains (C2E and C2F) adjacent to the transmembrane domain. Preservation of these structural elements throughout eukaryotic evolution suggests a fundamental role of these motifs for ferlin function. In contrast, DysF, C2DE, and FerA are optional, giving rise to subtle differences in domain topologies of ferlin genes. Despite conservation of multiple C2 domains in all ferlins, the C-terminal C2 domains (C2E and C2F) displayed higher sequence conservation and greater conservation of putative calcium binding residues across paralogs and orthologs. Interestingly, the two most studied non-mammalian ferlins (Fer-1 and Misfire) in model organisms C. elegans and D. melanogaster, present as outgroups in the phylogenetic analysis, with results suggesting reproduction-related divergence and specialization of species-specific functions within their genus.
Our phylogenetic studies provide evolutionary insight into the ferlin gene family. We highlight the existence of ferlin-like proteins throughout eukaryotic evolution, from unicellular phytoplankton and apicomplexan parasites, through to humans. We characterise the preservation of ferlin structural motifs, not only of C2 domains, but also the more poorly characterised ferlin-specific motifs representing the DysF, FerA and FerB domains. Our data suggest an ancient role of ferlin proteins, with lessons from vertebrate biology and human disease suggesting a role relating to vesicle fusion and plasma membrane specialization.
- Cartilaginous Fish
- Apicomplexan Parasite
- High Sequence Conservation
- Domain Topology
- Limb Girdle Muscular Dystrophy
The ferlin family of genes in humans and most mammals is composed of six members, and possesses a distinct topology of tandem C2 domains (ranging from four to seven), with a single C-terminal transmembrane domain . The ferlins were identified and named based on homology to a Caenorhabditis elegans spermatogenesis factor Fer-1 . C. elegans Fer-1 mutants are infertile due to defective fusion of membranous organelles with the spermatozoan plasma membrane . There are six mammalian ferlins (Fer1L1-6), with mutations in FER1L1 (dysferlin) and FER1L2 (otoferlin) linked to inherited diseases in humans [4, 5]. Mutations in dysferlin underlie an autosomal recessive form of muscular dystrophy (limb girdle muscular dystrophy type 2B, LGMD2B and the allelic disorder Miyoshi myopathy) . Mutations in oterferlin cause an autosomal recessive form of non-syndromic deafness (DFNB9) . Dysferlin deficient mouse muscle fibers fail to perform calcium-dependent membrane resealing , a process thought to involve fusion and exocytosis [7, 8] or endocytosis  of repair vesicles at sites of injury. Otoferlin null mice show a primary defect in calcium-mediated synaptic vesicle fusion and exocytosis at the synapse of cochlear inner hair cells . FER1L3 (myoferlin) has not yet been linked to human disease, but studies of the myoferlin null mouse demonstrate impaired myoblast fusion and myofiber formation during development and regeneration . Thus, emerging evidence points towards a common vesicular trafficking and fusion role for ferlin proteins, with unknown roles and tissue specificity for mammalian ferlins FER1L4, FER1L5 and FER1L6.
The occurrence of multiple tandem C2 domains that typify the ferlin family are rare. Only two other vertebrate gene families contain three or more C2 domains; the MCTP proteins (multiple C2 domain and transmembrane region proteins)  and E-Syt (extended synaptotagmins) . As an independent folding unit, C2 domains represent a functionally diverse and widely distributed calcium-binding motif. They are classically observed to function as calcium-dependent lipid binding modules , but some C2 domains have lost their calcium sensing ability and instead specialize in protein-protein interactions to regulate membrane trafficking and signal transduction . Solved structures of C2 domains reveal a beta-sandwich fold consisting of eight antiparallel beta-strands connected by highly variable surface loops . A key feature of calcium binding C2 domains is the conservation of calcium binding loops clustered on one end of the sandwich which are composed of negatively charged (usually aspartate) groups responsible for the coordination of multiple calcium ions . In addition to influencing electrostatic potential to enhance phospholipid binding , the residue composition of this loop region also influences phospholipid selectivity, which allows for specificity of their target membranes .
Despite much effort dedicated to structurally and functionally characterize proteins with singly occurring C2 domains, not much is known regarding the purpose and function of multiple (greater than two) C2 domains. Should they be viewed as multiple instances of similarly functioning calcium binding units, maintained for efficiency or redundancy purposes? Or perhaps cross-talk and synergy between adjacent C2 domains gives rise to a more complex additive function beyond the classic role of C2 domains? Interestingly, sequence analysis and comparison of C2 domains within a single ferlin member reveal great diversity; where each domain is more similar to its corresponding counterpart in paralogs , indicating duplication from a common ancestral gene and evolution of specialized functions.
Another domain of interest subject to sequence analysis in this study is the DysF domain, which is present in only certain members of the ferlin family, and also in yeast peroxisomal proteins where its established function is to regulate peroxisome size and number . This domain is of particular interest in understanding ferlin involvement in disease, as numerous disease-causing mutations in dysferlin have been mapped to both the inner and outer portions of the DysF domain. The DysF domain exists as an unusual nested repeat in ferlin proteins, where its function currently remains unestablished. The structure of the inner portion of the myoferlin DysF domain was recently solved, and shown to consist of two long antiparallel beta-strands. It has been suggested that preservation of function of both portions is likely, given that the insertion of the inner DysF domain occurs in an intervening loop region connecting the two beta-strands of the outer DysF domain, allowing both repeats to adopt the same fold without disruption to secondary structures . In this study we present sequence analysis of the DysF domain of the ferlins, important for categorising and obtaining functional clues to this family of proteins.
The increasing availability of whole genome sequences and the annotation of genes from a wide range of phyla enables phylogenetic analysis of gene families to provide important contextual insight into their present day form and association with human disease. In this study, we report the phylogenetic analysis of the ferlin gene family, where we have retrieved ferlin genes from single-celled protists through to a range of metazoan species. We explored the diversity of ferlin domain topologies within this gene family, and examined the level of conservation of both C2 and non-C2 domain elements. The identification of both a DysF and non-Dysf ancestral ferlin in early metazoan species, and their expansion during vertebrate evolution, suggests a fundamental role associated with this ancient gene family that has specialized to include tissue-specific and isoform-specific functions. Sequence analysis within the ferlin protein family has thus far been limited to studies in higher vertebrates [19, 22] with the exception of C. elegans . In this manuscript we extend ferlin sequence analysis to include ferlin family members from eight metazoan phyla and three single-celled protists, some of which were derived from draft genomes.
Identification and annotation of unannotated ferlin genes
The ferlin genes for species within the Drosophila and Caenorhabditis genus and the unidentified mouse Fer1L5 was obtained by TBLASTN  searches against their respective databases. In most cases the ferlin gene was fully contained within a supercontig (or scaffold) and segment pairs were in close proximity to each other. The exons and exon boundaries were identified from the TBLASTN output as high scoring segment pairs or gaps within these segment pairs. In addition, exons were checked for correct ordering and strand. All hits were then manually analyzed for splice acceptor and donor sites to ensure the correct exon-intron boundaries. Finally, multiple sequence alignment using MAFFT was performed to ensure there were no gaps amongst orthologs from the Drosophila and Caenorhabditis genus.
The locations of C2, DysFN, DysFC and transmembrane domains within ferlin genes were detected using SMART . The location of FerI, FerA and FerB domains were detected using Pfam . The program MAFFT was used to do multiple sequence alignments of DysFN, DysFC, FerA and FerB from various orthologs and paralogs . CHROMA was used to highlight patterns in the resulting multiple sequence alignments . Secondary structure prediction was performed using domains from human sequences as input into the Jnet secondary structure prediction server. The resulting output was overlayed on the multiple sequence alignment generated for each domain.
The pair-wise sequence identity of C2 domains was determined using needle, a global alignment tool within EMBOSS . Conservation of calcium-binding residues (aspartates and glutamates) were determined by alignment with C2A of Synaptotagmin 1 [Uniprot: P21579] for which key aspartates have been experimentally determined .
Maximum likelihood trees
The program ClustalW2  with default settings was used to perform multiple sequence alignment to use as input for PHYML. The phylogenetic trees based on protein sequences were generated using the maximum likelihood method employed by PHYML 3.0  using a Le and Gascuel (LG) amino acid based model with estimated proportion of invariable sites and bootstrapping (100 replicates).
Evolutionary relationship of ferlins in metazoans
Ferlin protein sequences used in this study.
EDV22852, EDV22851, EDV23443
Annelida (Segmented worms)
Capitella sp. I
Platyhelminthes (Flat worms)
Nematoda (Round worms)
Pediculus humanus corporis
NP_001137919 (Misfire isoform F)
Culex pipiens quinquefasciatus
XP_001197550, XP_001194793, XP_001199804
C3Y3S6, C3XZR5, C3YBJ8
AAVX01058092, AAVX01535036, AAVX01036594, AAVX01262173, AAVX01071336
XP_689416, Q5SPC5, XP_687465, XP_001922278, XP_001923096, XP_001919573
Q9ESD7, Q9ESF1, Q69ZN7, A3KGK3.3, *, XP_982409
O75923, Q9HC10, Q9NZM1, A9Z1Z3, A0AVI2, Q2WGJ9
Invertebrates generally have two ferlin genes (Nematoda and Arthropoda being exceptions, see below), one belonging in the non-DysF subgroup, and the other to the DysF subgroup, forming branching patterns similar to their evolutionary distance (i.e Tadh, Lgig, Ccap, Sman, Nvec; see Table 1). Two Dysf-containing ferlin sequences were evident in the invertebrate placozoan Trichoplax adhaerens (Tadh Fer1L1 and Fer1L3), which most likely result from a tandem duplication event (in scaffold 8), resulting in their close branching within the DysF subgroup.
Arthropoda possess only non-Dysf ferlins. Moreover, the Drosophila ferlin gene, Misfire (DmeI), is an outlier within the non-DysF subgroup, and does not cluster with ferlin sequences from other arthropods (Phum, Tcas, Apis, Cqui, see Table 1), despite having close evolutionary distance.
Vertebrates encode six ferlin proteins , with three in each Dysf subgroup. Vertebrate DysF-containing ferlins (Figure 1, blue box) then generally partition into Fer1L1-like (dysferlin), Fer1L3-like (myoferlin) and Fer1L5-like orthologs. Similarly, non-DysF ferlins generally cluster into orthologs of Fer1L2 (otoferlin), Fer1L4 and Fer1L6.
Expansion of the ferlin gene family
C2 domain similarity compared to corresponding C2 domains from human dysferlin (Type 1) and otoferlin (Type 2).
Ferlin domain topologies
Figure 2 highlights the different domain topologies noted amongst the ferlin genes in this study. Genes which cluster within the DysF subgroup of Figure 1 are denoted as Type 1, and genes which cluster within the non-DysF subgroup are denoted as Type 2. Additionally, within each type there are subtle differences in domain topology giving rise to subtypes A-F (see Figure 2 legend). The major differences between the subtypes are the presence or absence of C2A (first N-terminal C2 domain), the FerA domain, or the C2DE domain (the C2 domain between C2 D and C2E).
All ferlin genes analysed possess multiple tandem C2 domains, with two highly conserved features: 1) An N-terminal FerI sequence sandwiched between two C2 domains (C2B-FerI-C2C motif); 2) Two C2 domains adjacent to a transmembrane sequence at the extreme C-terminus. All metazoan ferlins possessed these two features, with the exception of the putative truncated Fer1L of C. elegans, and reported truncated splice isoforms of Misfire . These two features were also observed in putative ferlins of single-celled Apicomplexa such as the malaria parasite Plasmodium falciparum (Pfal).
Two outliers in terms of ferlin topology were truncated C. elegans Fer1L, and the Drosophila ferlin gene, Misfire. The unusual truncated topology of C. elegans Fer1L (see Figure 2) was also detected for C. brenneri, but not in three other analysed genomes within the Caenorhabditis genus (see Additional file 1). C. elegans Fer1L also has only a partial DysF domain, and it remains unclear whether Fer1L arose as a result of partial gene duplication, or whether a series of deletions followed a complete gene duplication of Fer-1. Misfire is distinct from all other arthropod ferlins, lacking the FerB domain and possessing a divergent C2F (end C-terminal C2 domain) that is below the C2 domain detection threshold of SMART.
C2 domain conservation
Given that C2 domains represent the most abundant and distinctive feature of the ferlin gene family, we sought to establish the conservation of individual C2 domains throughout metazoan evolution (Table 2). A comparison of invertebrate ferlin C2 domains with human dysferlin and otoferlin revealed high sequence conservation in general, but particularly in the C-terminal C2 domains, C2E and C2F. In contrast, the N-terminal C2 domains have either been lost or have lower similarity. Human otoferlin and sea anemone ortholog Nvec FER1L2, are the only non-Dysf (Type 2) topology ferlins to maintain C2A, which is also absent in many invertebrate Dysf-containing (Type 1) topology ferlins. Interestingly, arthropod (Phum) and mollusc (Lgig) type 2 ferlins have very high sequence conservation of C2 domains when compared to otoferlin, suggesting ancestral functions/interactions may be preserved throughout metazoan evolution.
Conservation of C2 domain calcium-binding residues
Conservation of five putative calcium coordinating residues in C2 domains.
Conservation of non-C2 domains
So far, no function or interaction has been attributed to the DysF, FerI, FerA or FerB domains in the ferlins. The highly conserved region preceding C2 D that is present in dysferlin, myoferlin and Fer1L5 has been termed DysF according to the SMART database, but is not annotated as a conserved/identifiable domain by the Pfam database. In contrast, the FerA, FerB and FerI regions are classified as domains by Pfam but not by SMART. Our analysis of metazoan phyla suggests all should be considered as domain units with potential function due to high conservation across orthologs and paralogs.
FerA and FerB
FerI and DysF
The FerI and DysF domains are of particular interest as both have been identified in ferlin-like proteins of unicellular eukaryotes. The FerI domain, but not the DysF domain, was discovered in ferlin-like proteins of protozoan Apicomplexan parasites Plasmodium (malaria), Theileria [Genbank:XP_765088] (East Coast fever), Babesia [Genbank:XP_001610088] (tick fever) and Toxoplasma [Genbank:XP_002364209] (toxoplasmosis) (data not shown), that account for significant worldwide mortality and morbidity amongst humans and livestock. Similar to the metazoan ferlins, the FerI domains in these Apicomplexan parasites are sandwiched closely between two C2 domains. This could suggest that the C2-FerI-C2 motif functions as a single entity and may have a fundamental function shared between metazoan and protozoan putative ferlins.
Over recent years, advances in sequencing technology have led to the increasing pursuit of genome-wide sequencing of many species. The public availability of data via online databases has in turn enabled individuals to pursue phylogenetic research of their gene of interest to complement their laboratory studies. In this study we present a phylogenetic study, selecting representative vertebrate and invertebrate eukaryotic genomes to shed evolutionary insight into the characteristic features that define the ferlin gene family.
There are six ferlin genes identified in humans to date; three consisting of a DysF domain (Type 1), and three without (Type 2). However, our genomic analysis identified only two ferlins in invertebrates (one Type 1 and one Type 2), suggesting that the six mammalian ferlins originated from two ancestral ferlins of distinct subtypes. The need for metazoans to maintain ferlins of two different types suggests that DysF imparts a specific function, conserved throughout evolution. It is not clear whether the DysF domain was gained in an ancestral ferlin then maintained throughout evolution due to a selective advantage, or, whether the DysF domain was lost following a gene duplication event, also imparting a selective advantage. Arthropods and nematodes were observed as exceptions; a DysF-containing ferlin is not maintained in arthropods, while a non-DysF ferlin is not maintained in nematodes. Using highly conserved C2 domain sequences, we were able to identify and extract two ferlin paralogs in the lamprey (Cmil) and five in the shark (Pmar), narrowing down the likely expansion of the ferlin gene family between the divergence of the jawless vertebrates and the cartilaginous fish. Ferlin-like genes consisting of at least five C2 domains, a C-terminal transmembrane region, and a C2-FerI-C2 motif were also identified in five species of Apicomplexa parasites (Plasmodium, Cryptospiridium, Theileria, Babesia and Toxoplasma), and in unicellular phytoplankton (Ostreococcus), further supporting an ancient role of ferlin-like proteins in eukaryotic biology. Plasmodium and other apicomplexan parasites are characterised by a specialized apicoplast membrane, possess specialised secretory organelles (rhoptries) thought to be involved in events leading to host cell invasion, and form membrane vesicular structures termed 'parasitophorous vacuolar membrane' (PVM) in which the organism resides . Given the large nature of the PVM (30-33um in surface area), the biological process which underlies its ability to form de novo in 10-20 seconds remains a curious area of research for many in the field . With emerging roles for vertebrate ferlins in plasma membrane vesicle fusion [3, 6, 10], and the particular association of ferlins with cells possessing specialised plasma membrane networks such as skeletal and cardiac muscle , placenta , and sperm acrosome , a potential role for ferlins in specialist membrane networks of apicomplexan parasites provides an intriguing avenue for investigation.
Following the identification of the dysferlin gene in 1998, and the shared homology with Fer-1 of C. elegans, Fer-1 has since been thought of as the ancestral ferlin from which the human ferlins were derived. Our phylogenetic analysis of multiple invertebrate ferlins suggests that Fer-1 is not a typical ferlin gene. Despite some regions of homology, Fer-1 (and Drosophila Misfire) form outgroups in the ferlin phylogenetic tree (Figure 1). Fer-1 shows loss of conserved residues that define the DysF domain, while Misfire has lost the FerB domain present in all other metazoan ferlins. Maximum likelihood tree and intra-genus sequence comparison shows sequence divergence of Dmel and Cele from other species within their genus (Additional file 1 and 2), which is typical of rapidly evolving reproduction-related genes . Misfire and Fer-1 may therefore present as exceptions, having lost and/or gained functions not shared across the ferlin family. Thus, we recommend caution when translating findings from studies of ferlin function in Drosophila and C. elegans, to that of ferlin-related diseases in humans, such as muscular dystrophy and non-syndromic deafness. Our results also highlight that species such as zebra fish (Drer) may represent a useful model organism, possessing the full complement of both DysF and non-DysF subgroups, and also technical utility for studies of muscle form and function.
In our sequence analysis of ferlin genes, we identified several conserved features amongst ferlin proteins; multiple tandem C2 domains (5-7), a single C-terminal transmembrane domain, a FerB domain, and a highly conserved N-terminal motif consisting of a FerI tightly sandwiched between two C2 domains. Slight variations in topologies include the incorporation of the optional C2A domain, the C2DE domain, and the FerA domain that appears to be concurrently present with the DysF domain.
Our analysis of the DysF domain shows it has maintained high sequence conservation. The presence of two DysF domains in a nested fashion is a unique and ancient feature of the ferlin family preserved from early diverging metazoans (Tadh, Nvec) through to humans. Despite resolution of the inner DysF domain structure of myoferlin, its function and purpose remains unknown . Our analysis shows that nesting of the DysF has not caused sequence divergence in either the inner or outer DysF regions. Importantly, NMR structural studies have shown that the inner DysF inserts between important secondary structures of the outer DysF domain, thus allowing for both to fold into a similar structure. In addition, reported pathogenic mutations occurring within and between the inner and outer DysF domains of dysferlin suggests that the act of embedding has not disabled the functional capacity of this domain . Interestingly, the DysF domain, although not in nested form, has also been reported to exist in yeast peroxisome proteins . The function of the DysF domain in these proteins is to regulate peroxisome size and number. Therefore, given that dysferlin is localised to vesicles , the function of the DysF domain could analogously be hypothesized to regulate vesicle size and number.
In the species we have analyzed, the C-terminal C2 domains of the ferlins are shown to be more conserved than the N-terminal C2 domains. This suggests that the ferlin C-terminus is perhaps responsible for functions more fundamental, than specialized. In contrast, we observed more divergence within the N-terminus, whereby some species lack C2A, or possess a variation that is highly divergent. These data suggest the ferlin N-terminus could be responsible for functions that are more species specific, and/or ortholog specific, rather than unified across the ferlin family. Despite the lack of conservation of C2A and its absence in certain species, there is evidence to suggest that it remains functional in human ferlins. The solved structure of myoferlin C2A indicates that it is capable of folding into the characteristic C2 domain beta-sandwich, although consisting of more than the typical eight strands. In addition, an alternate splice isoform of C2A of unknown function, in which an alternate exon 1 is used in C2A, is reported to be expressed at significant levels in skeletal muscle and blood cells . Furthermore, several missense mutations (Tryp52Arg, Val67Asp) lying within C2A are listed as probable muscular dystrophy causing changes. Dysferlin interaction with AHNAK, a protein implicated in membrane repair and maintenance has also been localised to C2A .
As seen in the ferlins, several C2 domains belonging to the synaptotagmin family also show degeneration of calcium-binding residues. In synaptotagmins, where structural data in addition to sequence data is available, these changes have been shown to confer ablation of calcium sensitivity . Despite the caveat of lacking known structural data relating to the ferlins, sequence alignment with synaptotagmin C2 domains reveals high sequence and secondary structure conservation, with identifiable conservation of calcium-binding residues within many ferlin C2 domains. In our studies, four out of seven C2 domains in dysferlin show conservation of calcium-binding residues. A previous sequence analysis reports a similar result, although this study did not allow for the highly conservative D→E substitution , and therefore did not identify C2 D and C2E as potential calcium binding domains. Contrary to expectation, a study into the lipid binding specificities of dysferlin C2 domains reports calcium-independent phospholipid binding associated with the C2 domains presented here as most likely to possess calcium sensitivity . It is however still plausible that these C2 domains possess a calcium-sensitive role, but one that is coupled to protein-protein interaction(s), as is the case with calcium-dependent t-SNARE binding in Synaptotagmin I . In otoferlin, calcium-dependent binding to two t-SNARE proteins of the inner hair synaptic complex, syntaxin1A and SNAP-25 has been mapped to C2F , which in our study is shown to possess the full complement of calcium-binding residues. Another study also reports otoferlin C2A as capable of binding syntaxin1A, although in a manner that is calcium-independent, presumably due to the lack of calcium coordinating residues in the C2A domain .
The function and need for so many C2 domains in the ferlins remains a mystery. To draw further lessons from the synaptotagmin family, possible reasons could be attributed to preferential lipid recognition, differential calcium sensitivities, or an expanded repertoire of protein-protein interactions. When the crystal structure of the cytosolic portion of Synaptotagmin I was solved, it was discovered that C2A and C2B were facing in opposite directions . This peculiar orientation was later attributed to the fact that both interact with two opposing membranes of different lipid composition, where C2A was shown to bind synaptic vesicle membranes, while C2B showed preference towards the PI(4,5)P2 rich plasma membrane. This idea of lipid selectivity and preferential binding has been corroborated to an extent, in studies which showed that dysferlin C2A could be distinguished from other C2 domains with its unique phosphoinositide binding ability not commonly seen in others .
The idea of synergy and co-operativity between tandem C2 domains is also another novel concept explored in synaptotagmins. Interestingly, studies of C2A and C2B in isolation show they display different properties than when the two are studied together in tandem. In isolation, C2A of Synaptotagmin I does not bind SNAREs, whereas C2B does not penetrate lipid membranes. However, when tethered to an adjacent C2 domain, C2A is able to bind SNAREs , whereas tethering C2B to C2A, even a non-functional version which fails to bind lipid or calcium, enables C2B binding and penetration of membranes. Experimental evidence reports the requirement for simultaneous neutralisation of calcium binding residues in both C2 domains of synaptotagmin to completely disrupt calcium triggered membrane and SNARE interaction, thus indicative of functional redundancy between adjacent C2 domains . It remains to be established whether there is inter-domain co-operativity and/or functional redundancy shared between ferlin C2 domains.
In this study we have surveyed a range of domain topologies present amongst ferlin genes, highlighting highly preserved domain regions, as well as species-specific domain combinations. Together, this information provides valuable insight into the minimal components required to construct a basic ferlin-like gene. Research into mini-gene (truncated) therapeutic constructs for gene restoration is currently underway for other genes associated with muscular dystrophy such as dystrophin, whose size exceeds the 5kb AAV vector packaging limit . We hypothesise that functional 'mini-ferlins' will require evolutionarily preserved C2-FerI-C2 (~274aa) and C2E-C2F-TM (~489aa) motifs, with (dysferlin) or without (otoferlin) the nested DysF repeat (~228aa embedded form).
In summary, we define ferlins as an ancient family of C2 domain-containing proteins that are likely to possess primordial functions in eukaryotic biology. Our studies demonstrate expansion of the ferlin family during vertebrate evolution; often a basis for functional specialization and tissue-specific expression. We provide useful contextual insight into evolutionary preservation of not only the C2 domains, but also the less studied DysF, FerI, FerA and FerB protein domains. Defining the roles of each of these domains is essential to delineate the biology of ferlins, with clinical relevance to inherited human disease (dysferlin and otoferlin), and perhaps more widely significant in future studies of apicomplexan parasitology.
In this study we show that ferlins are an ancient family of genes common across protists and metazoans, but not in plants or fungi. Metazoans were shown to maintain distinct Type I (with a Dysf domain) and Type II (without a Dysf domain) ferlin lineages; with invertebrates generally possessing two ferlins, and vertebrates possessing six ferlins. Unexpectedly, ferlins from model organisms C.elegans (Fer-1) and Drosophila (misfire) presented as phylogenetic outgroups, suggesting reproduction-related divergence. All ferlins possess an N-terminal C2-FerI-C2 sandwich, a FerB motif, and two C-terminal C2 domains adjacent to an extreme C-terminal transmembrane domain. Preservation of these structural elements throughout eukaryotic evolution suggests a fundamental role of these motifs for ferlin function.
We thank the funding bodies supporting our work; the Australian National Health and Medical Research Council (Cooper & North), the Jain Foundation Inc (Cooper). We thank Frances Evesson and Frances Lemckert for proof-reading and editing. We also thank the Joint Genome Institute (US Department of Energy), the Genome Centre (Washington University School of Medicine, St. Louis), the Wellcome Trust Sanger Institute, the Broad Institute, the Human Genome Sequencing Center (Baylor College of Medicine) and the J. Craig Venter Institute for facilitating access to unpublished sequence data.
- Therrien C, Dodig D, Karpati G, Sinnreich M: Mutation impact on dysferlin inferred from database analysis and computer-based structural predictions. J Neurol Sci. 2006, 250: 71-78. 10.1016/j.jns.2006.07.004.View ArticlePubMedGoogle Scholar
- Bashir R, Britton S, Strachan T, Keers S, Vafiadaki E, Lako M, Richard I, Marchand S, Bourg N, Argov Z, et al: A gene related to Caenorhabditis elegans spermatogenesis factor fer-1 is mutated in limb-girdle muscular dystrophy type 2B. Nat Genet. 1998, 20: 37-42. 10.1038/1689.View ArticlePubMedGoogle Scholar
- Washington NL, Ward S: FER-1 regulates Ca2+ -mediated membrane fusion during C. elegans spermatogenesis. J Cell Sci. 2006, 119: 2552-2562. 10.1242/jcs.02980.View ArticlePubMedGoogle Scholar
- Liu J, Aoki M, Illa I, Wu C, Fardeau M, Angelini C, Serrano C, Urtizberea JA, Hentati F, Hamida MB, et al: Dysferlin, a novel skeletal muscle gene, is mutated in Miyoshi myopathy and limb girdle muscular dystrophy. Nat Genet. 1998, 20: 31-36. 10.1038/1682.View ArticlePubMedGoogle Scholar
- Yasunaga S, Grati M, Cohen-Salmon M, El-Amraoui A, Mustapha M, Salem N, El-Zir E, Loiselet J, Petit C: A mutation in OTOF, encoding otoferlin, a FER-1-like protein, causes DFNB9, a nonsyndromic form of deafness. Nat Genet. 1999, 21: 363-369. 10.1038/7693.View ArticlePubMedGoogle Scholar
- Bansal D, Miyake K, Vogel SS, Groh S, Chen CC, Williamson R, McNeil PL, Campbell KP: Defective membrane repair in dysferlin-deficient muscular dystrophy. Nature. 2003, 423: 168-172. 10.1038/nature01573.View ArticlePubMedGoogle Scholar
- Steinhardt RA, Bi G, Alderton JM: Cell membrane resealing by a vesicular mechanism similar to neurotransmitter release. Science. 1994, 263: 390-393. 10.1126/science.7904084.View ArticlePubMedGoogle Scholar
- Terasaki M, Miyake K, McNeil PL: Large plasma membrane disruptions are rapidly resealed by Ca2+-dependent vesicle-vesicle fusion events. J Cell Biol. 1997, 139: 63-74. 10.1083/jcb.139.1.63.PubMed CentralView ArticlePubMedGoogle Scholar
- Idone V, Tam C, Goss JW, Toomre D, Pypaert M, Andrews NW: Repair of injured plasma membrane by rapid Ca2+-dependent endocytosis. J Cell Biol. 2008, 180: 905-914. 10.1083/jcb.200708010.PubMed CentralView ArticlePubMedGoogle Scholar
- Roux I, Safieddine S, Nouvian R, Grati M, Simmler MC, Bahloul A, Perfettini I, Le Gall M, Rostaing P, Hamard G, et al: Otoferlin, defective in a human deafness form, is essential for exocytosis at the auditory ribbon synapse. Cell. 2006, 127: 277-289. 10.1016/j.cell.2006.08.040.View ArticlePubMedGoogle Scholar
- Doherty KR, Cave A, Davis DB, Delmonte AJ, Posey A, Earley JU, Hadhazy M, McNally EM: Normal myoblast fusion requires myoferlin. Development. 2005, 132: 5565-5575. 10.1242/dev.02155.PubMed CentralView ArticlePubMedGoogle Scholar
- Shin OH, Han W, Wang Y, Sudhof TC: Evolutionarily conserved multiple C2 domain proteins with two transmembrane regions (MCTPs) and unusual Ca2+ binding properties. J Biol Chem. 2005, 280: 1641-1651. 10.1074/jbc.M407305200.View ArticlePubMedGoogle Scholar
- Min SW, Chang WP, Sudhof TC: E-Syts, a family of membranous Ca2+-sensor proteins with multiple C2 domains. Proc Natl Acad Sci USA. 2007, 104: 3823-3828. 10.1073/pnas.0611725104.PubMed CentralView ArticlePubMedGoogle Scholar
- Davletov BA, Sudhof TC: A single C2 domain from synaptotagmin I is sufficient for high affinity Ca2+/phospholipid binding. J Biol Chem. 1993, 268: 26386-26390.PubMedGoogle Scholar
- Nalefski EA, Falke JJ: The C2 domain calcium-binding motif: structural and functional diversity. Protein Sci. 1996, 5: 2375-2390. 10.1002/pro.5560051201.PubMed CentralView ArticlePubMedGoogle Scholar
- Rizo J, Sudhof TC: C2-domains, structure and function of a universal Ca2+-binding domain. J Biol Chem. 1998, 273: 15879-15882. 10.1074/jbc.273.26.15879.View ArticlePubMedGoogle Scholar
- Shao X, Li C, Fernandez I, Zhang X, Sudhof TC, Rizo J: Synaptotagmin-syntaxin interaction: the C2 domain as a Ca2+-dependent electrostatic switch. Neuron. 1997, 18: 133-142. 10.1016/S0896-6273(01)80052-0.View ArticlePubMedGoogle Scholar
- Cho W, Stahelin RV: Membrane binding and subcellular targeting of C2 domains. Biochim Biophys Acta. 2006, 1761: 838-849.View ArticlePubMedGoogle Scholar
- Han R, Campbell KP: Dysferlin and muscle membrane repair. Curr Opin Cell Biol. 2007, 19: 409-416. 10.1016/j.ceb.2007.07.001.PubMed CentralView ArticlePubMedGoogle Scholar
- Yan M, Rachubinski DA, Joshi S, Rachubinski RA, Subramani S: Dysferlin domain-containing proteins, Pex30p and Pex31p, localized to two compartments, control the number and size of oleate-induced peroxisomes in Pichia pastoris. Mol Biol Cell. 2008, 19: 885-898. 10.1091/mbc.E07-10-1042.PubMed CentralView ArticlePubMedGoogle Scholar
- Patel P, Harris R, Geddes SM, Strehle EM, Watson JD, Bashir R, Bushby K, Driscoll PC, Keep NH: Solution structure of the inner DysF domain of myoferlin and implications for limb girdle muscular dystrophy type 2b. J Mol Biol. 2008, 379: 981-990. 10.1016/j.jmb.2008.04.046.View ArticlePubMedGoogle Scholar
- Jimenez JL, Bashir R: In silico functional and structural characterisation of ferlin proteins by mapping disease-causing mutations and evolutionary information onto three-dimensional models of their C2 domains. J Neurol Sci. 2007, 260: 114-123. 10.1016/j.jns.2007.04.016.View ArticlePubMedGoogle Scholar
- Altschul SF, Lipman DJ: Protein database searches for multiple alignments. Proc Natl Acad Sci USA. 1990, 87: 5509-5513. 10.1073/pnas.87.14.5509.PubMed CentralView ArticlePubMedGoogle Scholar
- Schultz J, Milpetz F, Bork P, Ponting CP: SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci USA. 1998, 95: 5857-5864. 10.1073/pnas.95.11.5857.PubMed CentralView ArticlePubMedGoogle Scholar
- Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, et al: The Pfam protein families database. Nucleic Acids Res. 2010, 38: D211-222. 10.1093/nar/gkp985.PubMed CentralView ArticlePubMedGoogle Scholar
- Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30: 3059-3066. 10.1093/nar/gkf436.PubMed CentralView ArticlePubMedGoogle Scholar
- Goodstadt L, Ponting CP: CHROMA: consensus-based colouring of multiple alignments for publication. Bioinformatics. 2001, 17: 845-846. 10.1093/bioinformatics/17.9.845.View ArticlePubMedGoogle Scholar
- Cole C, Barber JD, Barton GJ: The Jpred 3 secondary structure prediction server. Nucleic Acids Res. 2008, 36: W197-201. 10.1093/nar/gkn238.PubMed CentralView ArticlePubMedGoogle Scholar
- Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16: 276-277. 10.1016/S0168-9525(00)02024-2.View ArticlePubMedGoogle Scholar
- Sutton RB, Davletov BA, Berghuis AM, Sudhof TC, Sprang SR: Structure of the first C2 domain of synaptotagmin I: a novel Ca2+/phospholipid-binding fold. Cell. 1995, 80: 929-938. 10.1016/0092-8674(95)90296-1.View ArticlePubMedGoogle Scholar
- Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al: Clustal W and Clustal X version 2.0. Bioinformatics. 2007, 23: 2947-2948. 10.1093/bioinformatics/btm404.View ArticlePubMedGoogle Scholar
- Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52: 696-704. 10.1080/10635150390235520.View ArticlePubMedGoogle Scholar
- Venkatesh B, Kirkness EF, Loh YH, Halpern AL, Lee AP, Johnson J, Dandona N, Viswanathan LD, Tay A, Venter JC, et al: Survey sequencing and comparative analysis of the elephant shark (Callorhinchus milii) genome. PLoS Biol. 2007, 5: e101-10.1371/journal.pbio.0050101.PubMed CentralView ArticlePubMedGoogle Scholar
- Sidow A: Gen(om)e duplications in the evolution of early vertebrates. Curr Opin Genet Dev. 1996, 6: 715-722. 10.1016/S0959-437X(96)80026-8.View ArticlePubMedGoogle Scholar
- Smith MK, Wakimoto BT: Complex regulation and multiple developmental functions of misfire, the Drosophila melanogaster ferlin gene. BMC Dev Biol. 2007, 7: 21-10.1186/1471-213X-7-21.PubMed CentralView ArticlePubMedGoogle Scholar
- von Poser C, Ichtchenko K, Shao X, Rizo J, Sudhof TC: The evolutionary pressure to inactivate. A subclass of synaptotagmins with an amino acid substitution that abolishes Ca2+ binding. J Biol Chem. 1997, 272: 14314-14319. 10.1074/jbc.272.22.14314.View ArticlePubMedGoogle Scholar
- Frenal K, Soldati-Favre D: Role of the parasite and host cytoskeleton in apicomplexa parasitism. Cell Host Microbe. 2009, 5: 602-611. 10.1016/j.chom.2009.05.013.View ArticlePubMedGoogle Scholar
- Lingelbach K, Joiner KA: The parasitophorous vacuole membrane surrounding Plasmodium and Toxoplasma: an unusual compartment in infected cells. J Cell Sci. 1998, 111 (Pt 11): 1467-1475.PubMedGoogle Scholar
- Vandre DD, Ackerman WEt, Kniss DA, Tewari AK, Mori M, Takizawa T, Robinson JM: Dysferlin is expressed in human placenta but does not associate with caveolin. Biol Reprod. 2007, 77: 533-542. 10.1095/biolreprod.107.062190.View ArticlePubMedGoogle Scholar
- Wilson KL, Fitch KR, Bafus BT, Wakimoto BT: Sperm plasma membrane breakdown during Drosophila fertilization requires sneaky, an acrosomal membrane protein. Development. 2006, 133: 4871-4879. 10.1242/dev.02671.View ArticlePubMedGoogle Scholar
- Haerty W, Jagadeeshan S, Kulathinal RJ, Wong A, Ravi Ram K, Sirot LK, Levesque L, Artieri CG, Wolfner MF, Civetta A, Singh RS: Evolution in the fast lane: rapidly evolving sex-related genes in Drosophila. Genetics. 2007, 177: 1321-1335. 10.1534/genetics.107.078865.PubMed CentralView ArticlePubMedGoogle Scholar
- Pramono ZA, Lai PS, Tan CL, Takeda S, Yee WC: Identification and characterization of a novel human dysferlin transcript: dysferlin_v1. Hum Genet. 2006, 120: 410-419. 10.1007/s00439-006-0230-1.View ArticlePubMedGoogle Scholar
- Huang Y, Laval SH, van Remoortere A, Baudier J, Benaud C, Anderson LV, Straub V, Deelder A, Frants RR, den Dunnen JT, et al: AHNAK, a novel component of the dysferlin protein complex, redistributes to the cytoplasm with dysferlin during skeletal muscle regeneration. FASEB J. 2007, 21: 732-742. 10.1096/fj.06-6628com.View ArticlePubMedGoogle Scholar
- Therrien C, Di Fulvio S, Pickles S, Sinnreich M: Characterization of Lipid Binding Specificities of Dysferlin C2 Domains Reveals Novel Interactions with Phosphoinositides (dagger). Biochemistry. 2009, 48: 2377-2384. 10.1021/bi802242r.View ArticlePubMedGoogle Scholar
- Rickman C, Davletov B: Mechanism of calcium-independent synaptotagmin binding to target SNAREs. J Biol Chem. 2003, 278: 5501-5504. 10.1074/jbc.C200692200.View ArticlePubMedGoogle Scholar
- Ramakrishnan NA, Drescher MJ, Drescher DG: Direct interaction of otoferlin with syntaxin 1A, SNAP-25, and the L-type voltage-gated calcium channel Cav1.3. J Biol Chem. 2009, 284: 1364-1372. 10.1074/jbc.M803605200.PubMed CentralView ArticlePubMedGoogle Scholar
- Herrick DZ, Kuo W, Huang H, Schwieters CD, Ellena JF, Cafiso DS: Solution and membrane-bound conformations of the tandem C2A and C2B domains of synaptotagmin 1: Evidence for bilayer bridging. J Mol Biol. 2009, 390: 913-923. 10.1016/j.jmb.2009.06.007.PubMed CentralView ArticlePubMedGoogle Scholar
- Earles CA, Bai J, Wang P, Chapman ER: The tandem C2 domains of synaptotagmin contain redundant Ca2+ binding sites that cooperate to engage t-SNAREs and trigger exocytosis. J Cell Biol. 2001, 154: 1117-1123. 10.1083/jcb.200105020.PubMed CentralView ArticlePubMedGoogle Scholar
- Watchko J, O'Day T, Wang B, Zhou L, Tang Y, Li J, Xiao X: Adeno-associated virus vector-mediated minidystrophin gene therapy improves dystrophic muscle contractile function in mdx mice. Hum Gene Ther. 2002, 13: 1451-1460. 10.1089/10430340260185085.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.