- Research article
- Open Access
A holistic phylogeny of the coronin gene family reveals an ancient origin of the tandem-coronin, defines a new subfamily, and predicts protein function
© Eckert et al; licensee BioMed Central Ltd. 2011
- Received: 28 June 2011
- Accepted: 25 September 2011
- Published: 25 September 2011
Coronins belong to the superfamily of the eukaryotic-specific WD40-repeat proteins and play a role in several actin-dependent processes like cytokinesis, cell motility, phagocytosis, and vesicular trafficking. Two major types of coronins are known: First, the short coronins consisting of an N-terminal coronin domain, a unique region and a short coiled-coil region, and secondly the tandem coronins comprising two coronin domains.
723 coronin proteins from 358 species have been identified by analyzing the whole-genome assemblies of all available sequenced eukaryotes (March 2011). The organisms analyzed represent most eukaryotic kingdoms but also cover every taxon several times to provide a better statistical sampling. The phylogenetic tree of the coronin domains based on the Bayesian method is in accordance with the most recent grouping of the major kingdoms of the eukaryotes and also with the grouping of more recently separated branches. Based on this "holistic" approach the coronins group into four classes: class-1 (Type I) and class-2 (Type II) are metazoan/choanoflagellate specific classes, class-3 contains the tandem-coronins (Type III), and the new class-4 represents the coronins fused to villin (Type IV). Short coronins from non-metazoans are equally related to class-1 and class-2 coronins and thus remain unclassified.
The coronin class distribution suggests that the last common eukaryotic ancestor possessed a single and a tandem-coronin, and most probably a class-4 coronin of which homologs have been identified in Excavata and Opisthokonts although most of these species subsequently lost the class-4 homolog. The most ancient short coronin already contained the trimerization motif in the coiled-coil domain.
- Splice Site
- Dictyostelium Discoideum
- Alternative Splice Form
- Trimerization Motif
The coronin proteins, which were originally isolated as a major co-purifying protein from an actin-myosin-complex of the slime mold Dictyostelium discoideum , have since been identified in other protists [2, 3], fungi , and animals , but are absent in plants. Coronins are a conserved family of actin binding proteins [6–8] and the first family member had been named coronin based on its strong immunolocalization to the actin rich crown like structures of the cell cortex in Dictyostelium discoideum . Coronins belong to the superfamily of the eukaryotic-specific WD40-repeat proteins [9, 10] and play a role in several actin-dependent processes like cytokinesis , cell motility [11, 12], phagocytosis [13, 14], and vesicular trafficking .
WD-repeat motifs are minimally conserved regions of approximately 40-60 amino acids typically starting with Gly-His (GH) dipeptides 11-24 residues away from the N-terminus and ending with a Trp-Asp (WD) dipeptide at the C-terminus. WD40-repeat proteins, which are characterized by the presence of at least four consecutive WD repeats in the middle of the molecule, fold into beta propeller structures and serve as stable platforms for protein-protein interactions .
The coronin proteins have five canonical WD-repeat motifs located centrally. Since the region encoding the WD repeats is similar to the sequence of the beta-subunit of trimeric G-proteins the formation of a five-bladed beta-propeller was assumed for coronins . However, the determination of the structure of murine coronin-1 (MmCoro1A ) demonstrated that the protein, analogous to the trimeric G-proteins, forms a seven-bladed beta-propeller carrying two potential F-actin binding sites. Apart from the central WD-repeats, almost all coronin proteins have a C-terminal coiled-coil sequence that mediates homo-oligomerization [18–20], and a short N-terminal motif that contains an important regulatory phosphorylation site in coronin-1B . In addition, each coronin protein has a unique region of variable length and composition following the conserved extension to the C-terminus of the beta-propeller.
Based on their domain composition coronins have originally been divided into two subfamilies, namely short and long coronins . Short coronins consist of 450 - 650 amino acids containing one seven-bladed beta-propeller and a C-terminal coiled-coil region. Furthermore, the N-terminal region of most known short coronins contains 12 basic amino acids. Since this motif is only present in coronin molecules, it has been suggested as a novel coronin signature . The longer types of coronin, also called POD or Coronin 7, possess two complete core domains in tandem but lack a coiled-coil motif. In the longer coronins, the sequence of the basic N-terminal motif is reduced to 5 amino acids. Based on phylogenetic relationships among the coronins, the Human Genome Organization nomenclature committee (HGNC) proposed a system in 2001 that grouped the short coronins into two classes resulting in a total of three subtypes . Very recently, a new nomenclature has been suggested dividing the coronins into twelve subclasses based on the analysis of about 250 coronins from most taxa . In contrast to previous systems, every mammalian coronin (and corresponding vertebrate homologs) was designated an own class resulting in seven vertebrate classes. Invertebrates were grouped into two classes, the fungi got an own class, coronins from alveolates were grouped with those from Parabasalids (class 10), and the remaining coronins from Amoeba, Heterolobosea, and Euglenozoa were combined into the twelfth class. This study constituted the first major phylogenetic analysis of the coronin family. However, this classification was not consistent with the latest phylogeny of the eukaryotes and homologs of some major branches like the stramenopiles were missing.
Here, we present the analysis of the complete coronin repertoires of all eukaryotic organisms sequenced and assembled so far. The distribution of all coronin homologs is in accordance with the latest taxonomy of the eukaryotes and reveals the origin of the tandem-coronin and another newly defined class in the last common ancestor of the eukaryotes.
Identification and annotation of the coronin proteins
The coronin protein genes were identified by TBLASTN searches against the corresponding genome data of the different species. The list of sequenced eukaryotic species as well as access information to the corresponding genome data has been obtained from diArk . Species that missed certain orthologs in the first instance were later searched again with supposed-to-be orthologs of other closely related species. In this iterative process all coronin family proteins have been identified or their loss in certain species or taxa was confirmed. Because verified cDNA sequences and protein predictions, which often contain mispredicted exons and introns even in the "annotated" genomes, are not available for most of the sequenced species, the protein sequences were assembled and assigned by manual inspection of the genomic DNA sequences. Exons have been confirmed by the identification of flanking consensus intron-exon splice junction donor and acceptor sequences . In addition, the gene structures of all coronin genes were reconstructed using WebScipio [25, 26]. Through comparison of the intron positions and splice-site phases in relation to the protein multiple-sequence alignment, several suspicious exon border predictions could be resolved and the protein sequences subsequently be corrected. The genomic sequences of many species contain several gaps due to the low coverage of the sequencing or problems in the assembly process. Only some of the gaps could be closed at the amino-acid level by analyzing EST data.
Pseudogenes without sequence
WGS- and EST-projects
Multiple sequence alignment, phylogenetic analysis, and classification
Short coronins (class-1, class-2, and unclassified coronins)
Surprisingly, the coronins of the Tremellomycetes (e.g. Filobasidiella/Cryptococcus species) that belong to the Basidiomycota encode a C-terminal dUTPase domain (deoxyuridine triphosphatase domain) instead of the coiled-coil region (Figure 3). These coronin sequences are supported by many EST/cDNA clones for several of the Filobasidiella species extending from the coronin domain to the stop-codon. In addition to this dUTPase domain, the Filobasidiella species contain a further dUTPase in the genome that is conserved in the other Basidiomycotes, and also the other fungi. The dUTPase domains of the Tremellomycetes coronins contain all characteristic dUTPase domain motifs  and are therefore supposed to constitute enzymatically active domains. dUTPases typically form homotrimer active site architectures with all monomers contributing conserved residues to each of the three active sites . Except for the prediction of trimerization of these coronins, which could be mediated by the dUTPase domains instead of the coiled-coil domains in the other coronins, it needs experimental data to link the function of actin filament structure remodelling by coronins to dUTP nucleotide hydrolysis in DNA repair by dUTPases.
Class-3 coronins (Type III coronins) comprise homologs that encode two coronin domains arranged in tandem . These two coronin domains are separated by unique regions, and class-3 coronins do not encode coiled-coil domains. As recently reported  the class-3 coronins also encode a CA domain similar to the CA domain of the WASP family proteins at their C-termini (Figure 3). Based on the multiple sequence alignment of 112 class-3 coronins from all major branches of the eukaryotes the position of the C-region has slightly been adjusted in comparison with a previous analysis (Figure 4; ). Although the C-region of the class-3 coronins is not as conserved as similar regions in the yeast short coronins or in WASP family proteins, the characteristic pattern of hydrophobic residues concluded by a basic residue is visible in the homologs of all species (Figure 4). In contrast to the short coronins, the unique region between the C-terminal coronin-domain and the conserved CA-domain is short (20-30 amino acids).
Like for the short coronins the Filobasidiella species have surprising and species-specific tandem-coronins. The Filobasidiella class-3 coronins have a D-glycerate 3-kinase domain between the two coronin-domains (Figure 3). Only the termini of the Filobasidiella class-3 coronins are supported by EST/cDNA data, but long exons bridge the N-terminal coronin-domain and the glycerate 3-kinase domain, as well as the glycerate 3-kinase domain and the C-terminal coronin-domain. As found for the dUTPase domain of the short coronins, the Filobasidiella species contain an additional D-glycerate 3-kinase that has homologs in the other fungi and also in plants. Why it is advantageous to connect an actin-filament binding function to a glycerate 3-kinase needs experimental evaluation. The glycerate 3-kinase domain is not found in the class-3 coronins of the other Basidiomycotes. Except for the Filobasidiella species only the insects have long insertions between the two coronin-domains of their class-3 coronins. These insertions are highly conserved, about 300 residues long, and do not show any homology to known domains, sequence motifs, and other proteins.
In contrast to the related species Rhizopus arrhizus and Phycomyces blakesleeanus the coronin-3 of Mucor circinelloides consists of only the second coronin-domain of the tandem. We can exclude the possibility of this being an artefact of the genome assembly for three reasons. First, the genome sequence is continuous around MucCoro3. Secondly, there is no homology to any part of the N-terminal coronin-domain of RhaCoro3 or PhbCoro3 in the genome although the sequence identity of the C-terminal coronin-domains is about 65%. And finally, there is a TATA-box shortly upstream of the MucCoro3 gene. Because a coronin-3 has already been present in the most ancient eukaryote the loss of the N-terminal coronin-domain must be specific to Mucor circinelloides.
Based on the phylogenetic tree (Figure 1) and the domain composition of the protein homologs, another coronin class can be defined for which the Dictyostelium discoideum homolog, also called villidin , would be a representative (Figure 3). We suggest naming members of this class class-4 coronins. Most class-4 coronins consist of an N-terminal coronin-domain followed by three to four PH domains, four to five gelsolin domains, and a C-terminal villin headpeace domain (VHP). Class-4 coronins were identified in two of the major kingdoms of the eukaryotes, in excavates and opisthokonts. Furthermore, they are found in several of the sub-branches of the opisthokonts, in amoebae, fungi, and the fungi/metazoa incertae sedis branch. Because class-4 coronins from different species often contain different numbers of PH and gelsolin domains, domain gain and loss events must have happened in the respective branches or single species. However, there are not enough coronin-4 homologs identified yet to reconstruct the evolution of these regions. In addition to these multi-domain class-4 coronins there is a group of class-4 coronins that just consists of the conserved coronin domain and is restricted to some Amoebae species yet.
Alternatively spliced coronins
Alternative splicing of human coronin-1C results in two additional transcripts derived from alternative transcription start sites encoded by an additional upstream exon, compared to the normal start site as found and conserved in all other coronin proteins . These alternative splice forms seem to be restricted to modern primates (human, chimpanzee, gorilla, orangutan, and gibbon) and have been discussed in detail elsewhere .
We have identified alternative splice variants for coronin-1D (Figure 5), a coronin subfamily restricted to vertebrates. A cluster of two mutually exclusively spliced exons, exon5a and exon5b, was identified in all tetrapods. The amino-acid sequences corresponding to exon5 of the fish genes are more similar to exon5b than to exon5a. Thus, exon5a is the result of an exon duplication event that either occurred after the separation of tetrapods from fishes or at the onset of the vertebrates, where exon5a has been lost in the ancestor of the fishes. Exon5 represents the sequence of almost the entire fourth WD repeat (fifth blade in the β-propeller) starting in the middle of the fourth β-strand of blade four. By exchanging the fourth WD repeat the vertebrates could fine-tune the function of the coronin-1D beta-barrel domain. Vertebrate coronin-1D (CORO6) has not been analyzed experimentally yet and its specific function is unknown.
Further alternative transcripts are derived from mammalian coronin-1D genes by alternative 5'-splicing of the last exon, exon10. This alternative splicing results in one additional glutamine residue and is conserved in all 22 analyzed mammalian coronin-1D's except for Ailuropoda melanoleuca (giant panda), Loxodonta africana (elephant), Myotis lucifugus (little brown bat), and Bos taurus (cow).
Most of the short coronins have predicted coiled-coil domains at the C-terminus that are the bases for their supposed oligomerization. Initially, coronins have been proposed to form dimers , the most common form of coiled-coil multimerization. In the last decade, a few coronin homologs were biochemically purified and analyzed. Accordingly, the Xenopus laevis coronin-1C (XcoroninA) has been shown to form a dimer  while an oligomeric state has been found for human coronin-1C (coronin 3; , and the Saccharomyces cerevisiae coronin (CRN1) trimerizes . Parallel trimer formation has also been shown in a crystal structure of the coiled-coil domain of mouse coronin-1A  revealing a conserved motif determining the trimeric structure: R1-[ILVM]2-X3-X4-[ILV]5-E6. In this motif arginine forms a salt-bridge with glutamate at the surface of the coiled-coil structure and the aliphatic side chain moieties of arginine and glutamate pack against the hydrophobic residues at positions 2 and 5 of the motif shielding them from solvent. Mutation of the arginine to lysine leads to a concentration-dependent equilibrium between trimers and tetramers with tetramers forming at high concentration, while mutation to alanine or norleucine leads to tetramers . Mutation of the invariant arginine to glutamine in the trimerization motif of human matrilin-1 leads to tetramers . Unfortunately, the switching of arginine and glutamate in the respective positions has not been analyzed yet. We would expect that such a switch should be as stable as the original motif. Thus, to predict the oligomerization state we have analyzed all coronin coiled-coil regions for the presence of the trimerization motif. Accordingly, all 233 class-1 coronins have the classical motif, except for DpCoro1B and DrpCoro1B (Drosophila pseudoobscura and persimilis; Lys at position 1), and NvCoro1 (Nematostella; Cys at position 2), and are thus predicted to form trimers. This would include the Xenopus Coro1C that has, however, been shown to exist as a dimer . The situation is more diverse for the class-2 coronins. The invertebrate coronins contain the trimerization motif, except for AmqCoro2 (Amphimedon; Ser at P1), HerCoro2A (Helobdella; Lys at P1), HerCoro2B (Phe at P1), MydCoro2 (Mayetiola; Gln at P6), and the nematode class-2 coronins (Cys at P2). Almost all fish class-2 coronins contain the trimerization motif, but the other vertebrate class-2 coronins have conserved mutations. The tetrapod class-2A coronins encode a glutamine instead of the invariant arginine, which would turn them to tetramers in analogy to matrilin-1 . The tetrapod class-2B coronins contain glutamine instead of the glutamate at position 6 of the motif, a substitution whose effect has not been analyzed yet.
About half of the analyzed fungal coronins have the classical trimerization motif. The most common substitutions that are found in all Schizosaccharomyces and most Basidiomyota coronins are lysines or glutamines instead of the arginine at position 1. While the coiled-coil region is conserved in general, substitutions happened in specific species but not in whole branches (except for the Schizosaccharomyces). Therefore, we would expect all fungal coronins to form trimers. All Amoeba coronins, the Stramenopiles coronins (exceptions: FrcCoro a His at P1, BhCoro_B a Asn at P6, AuaCoro_B a Lys at P1), the Trichomonas and Naegleria coronins contain the classical trimerization sequence motif in the coiled-coil region. Interestingly, the kinetoplastid coronins have the salt-bridge switched in the motif and should thus also be able to form trimers. From the Alveolata, only the Ciliophora (e.g. Tetrahymena) and Coccidia (e.g. Toxoplasma gondii) coronins contain coiled-coil domains, and only the Coccidia contain the trimerization motif.
These are, however, predictions based on the existence of the proposed trimerization motif. The motif has been identified in 86% of all short, autonomous, and parallel three-stranded coiled-coils while it is also observed in 9% of the antiparallel trimers and in 5% of the parallel and antiparallel dimers . Thus, although most short coronins are predicted to form trimers some might nevertheless function in other oligomeric states in the cell. The oligomerization state can ultimately only be shown in experiments, which have, however, been done for just a few of the coronins yet.
Here, we have analyzed 723 coronins from 358 species. For 323 species whole genome sequence data was available allowing a "holistic" analysis of the coronin protein family. In addition, the whole genome assemblies of 69 species have been analyzed that in the end did not contain any coronin homolog. These species include Rhodophyta (Cyanidioschyzon, Galdieria), Viridiplantae, Microsporidia, Formicata (Giardia), and Haptophyceae (Emiliania). A sequence alignment of the coronin proteins was created and extensively improved manually. The phylogenetic analysis of the conserved coronin domain, which is also included in the crystal structure , using the Bayesian method showed that the grouping of the coronins is completely in accordance with the latest phylogeny of the eukaryotic species (Figure 1, [27–29]). Subsequently, we analyzed the coronin tree with respect to established and proposed classifications defining subfamilies. Two major schemes are currently in use, the old one established by the HGNC  and a more recent one expanding the number of classes from three to twelve . Essentially, the later classification re-defines subclasses of the HGNC scheme as separate classes, e.g. 1A and 1B become class-4 and class-1, respectively, and groups some branches to new classes. However, some coronins still remained unclassified and several classes have been proposed, like the invertebrate metazoan classes 8 and 9, although the contributing members did not form monophyletic branches in the underlying protein family tree. The proposed classes 10 and 12 contain members of unrelated taxonomic branches, probably because these coronins were adjacent in the tree figure. In addition, the Entamoeba tandem-coronin did not group to the other tandem-coronins. Thus, this classification is not consistent with the taxonomy of the eukaryotes. In addition, homologs of major branches were missing in the analysis like those from stramenopiles. We do not intend to add confusion to the classification of the coronin family but want to suggest a reliable and, considering future genome sequencing projects, expandable scheme. Two major reasons support the future use of the HGNC scheme although it needs some minor adjustments. The classification by Morgan and Fernandez  of coronins outside the metazoans is not consistent with the latest taxonomy of the eukaryotes and therefore not adaptable to our more comprehensive coronin tree. In addition, it is well known that two whole-genome-duplications are the reason for the expansion of gene homologs at the origin of the vertebrates , while another whole-genome-duplication happened at the origin of the Actinopterygii [43, 44]. Thus retaining the orthology between non-vertebrate and vertebrate coronins in class-numbers would be desirable but has also been abandoned by Morgan and Fernandez . Here, we adapted the HGNC classification except for renaming CORO6 and CORO7 (HGNC) to coronin-1D and coronin-3, respectively, numbering additional fish coronins as coronin-1E, coronin-2C, and coronin-2D, and defining the new coronin class-4. The term "class" is equivalent to the term "Type" used by the Bear group in recent reviews [8, 45]. However, we prefer the term "class" to be consistent with the terminology used for other protein families (e.g. the myosin family [46, 47]) and therefore to facilitate the work with databases and search engines in the future.
Class-4 coronins represent a new type of coronins that are present in Excavata (Naegleria gruberi), Amoebae, fungi (Spizellomyces punctatus), and the Fungi/Metazoa incertae sedis branch. Most class-4 coronins consist of the N-terminal coronin domain followed by two to three PH, four to five gelsolin, and a C-terminal VHP domain. The first representative of this subfamily has been identified in Dictyostelium discoideum and called villidin because of the homology of its gelsolin and VHP domains to villin . The homology of villidins WD-repeat region to coronin has been recognized later on [48, 49] suggesting villidins origin through a fusion of the coronin domain with villin. Villin is the founding member of a superfamily of proteins containing three to six gelsolin domains (reviewed in [48, 50]). Like villidin (class-4 coronins), villin, supervillin, and protovillin also contain a C-terminal VHP domain. Alignment of villin to the class-4 coronins gelsolin domains shows that the class-4 coronins have lost the first gelsolin domain of villin. The first gelsolin domain of villin is associated with dimerization, actin filament capping, nucleation, and bundling, and G-actin binding . Thus, class-4 coronins do not play a role in these activities via their gelsolin domains . However, villin contains three phospholipid-binding domains, two preceding the second gelsolin domain and one overlapping with the VHP domain. These phospholipid-binding domains are conserved in class-4 coronins and are most probably responsible for their association with internal membranes like Golgi-structures and ER-membranes .
At the origin of the Metazoa and Choanoflagellida branches another gene duplication event led to two distinct classes, class-1 coronins and class-2 coronins (Figure 7). The further evolution of the short coronins in the invertebrate branches is determined by species-specific gene-loss and gene-duplication events (Figure 2). This view is, however, based on the species whose genomes are available today and might change as soon as sequencing of more related species reveals subtypes of the class-1 and class-2 coronins in major invertebrate branches. At the origin of the vertebrates the two well-known whole-genome duplications (2R, ) resulted in several subtypes of both the class-1 and class-2 coronins. The subsequent third whole genome duplication in the fish-lineage [43, 44] led to even more gene duplicates. Subsequent to this boost of coronin homologs at the onset of the vertebrates branch-specific gene deletions happened, like the loss of the class-1B variants in fishes and the class-1A loss in birds (Figure 2).
The short coiled-coil region including the trimerization motif R-[VILM]-X-X-[VIL]-E is an accomplishment of the most ancient short coronin because it is found in coronins of all branches of the eukaryotic tree. It has been retained without major mutations for a long evolutionary time. This is exemplified by the fact that changes, which might lead to other oligomerization states, are species-specific or have been introduced in very recently separated branches.
The phylogenetic tree based on the coronin domains of 723 homologs from 358 species allowed grouping the coronin proteins into four classes: Class-1 (Type I) and class-2 (Type II) comprise short coronins and resulted from a gene duplication of a short coronin at the onset of the halozoans. Short coronins are characterized by an N-terminal coronin domain followed by a unique domain and a C-terminal short coiled-coil region. The coiled-coil domain of almost all short coronins contains a trimerization motif that must therefore have already existed in the last common ancestor of the eukaryotes. Class-3 (Type III) coronins comprise coronins with two coronin-domains arranged in tandem and have been found in species of all eukaryotic kingdoms that contain coronins. Class-4 (Type IV) coronins encode fusions of the coronin domain to villin and have been identified in Excavata and Opisthokonts although most of these species subsequently lost the class-4 homolog. Hence, the last common ancestor of the eukaryotes must have contained a short coronin and a class-3 coronin, and most probably a class-4 coronin.
Identification and annotation of the coronin family proteins
The coronin genes have been identified by TBLASTN searches against the sequenced eukaryotic genomes, which have been obtained via lists available from the diArk database [23, 58]. All hits were manually analyzed at the genomic DNA level. Datasets of predicted proteins produced by the sequencing consortia often miss homologs, and predicted proteins contain mispredicted exons and introns in many cases, necessitating manual assembly and annotation. The correct coding sequences were identified with the help of the multiple sequence alignments of all coronin proteins. As the amount of protein sequences increased (especially the number of sequences in taxa with few representatives), many of the initially predicted sequences were reanalyzed to correctly identify all exon borders. Where possible, EST data available from the NCBI EST database has been analyzed to help in the annotation process. In addition, coronin homologs from cDNA projects or single-gene analyses have been obtained by TBLASTN searches against the NCBI nr database . Gene structures have been reconstructed using WebScipio  as far as genomic sequence data was available. All sequence related data (names, corresponding species, GenBank ID's, alternative names, corresponding publications, domain predictions, gene structure reconstructions, and sequences) and references to genome sequencing centres are available through CyMoBase [60, 61].
Generating the multiple sequence alignment
The multiple sequence alignment of the coronin family has been built and extended during the process of annotating and assembling new sequences. The initial alignment has been generated from the first about 50 non-validated sequences obtained from NCBI using the ClustalW software with standard settings . During the following correction of the sequences (removing wrongly annotated sequences and filling gaps) the alignment has been adjusted manually. Subsequently, every newly predicted sequence has been preliminary aligned to its supposed closest relative using ClustalW, the aligned sequence added to the multiple sequence alignment of the coronins, and the coronin alignment adjusted manually during the subsequent sequence validation process. We have also retained the integrity of the primary sequence within the secondary structural elements that have been determined from the crystal structure (e.g. sequence gaps have only been introduced in known loop regions). Still, many gaps in sequences derived from low-coverage genomes remained. In those cases, the integrity of the exons surrounding the gaps has been maintained (gaps in the genomic sequence are reflected as gaps in the multiple sequence alignment). The unique and coiled-coil regions are completely divergent in sequence and length and were therefore aligned manually. The domain compositions of the short coronin, the class-3, and the class-4 coronins are different and regions outside the N-terminal coronin domain were only aligned within these groups. The C-terminal coronin domains of the class-3 coronins were separately included in the multiple sequence alignment of the coronins, in addition to being aligned as part of their class.
For calculating phylogenetic trees only full-length and partial sequences were included in the alignment. The phylogenetic trees were generated based on the conserved coronin domains (corresponding to amino acids 1-386 of HsCoro1A) using two different methods: 1. Maximum likelihood (ML) using the LG model with estimated proportion of invariable sites and bootstrapping (1,000 replicates) using RAxML . 2. Posterior probabilities were generated using MrBayes v3.1.2  with the MPI option . Two independent runs with 15,000,000 generations, four chains, and a random starting tree were computed using the mixed amino-acid option. From the 32,000th generation MrBayes used the Wag model . Using ProtTest , the LG model , which is, however, not implemented in MrBayes, was determined to provide a slightly better fit to the data than the Wag model. Trees were sampled every 1,000th generation and the first 25% of the trees were discarded as "burn-in" before generating a consensus tree.
Domain and motif prediction
Protein domains were predicted using the SMART  and Pfam  web server. The leucine zipper motifs have been identified using the Prosite database . The CA domains have been identified by visual inspection of the manual sequence alignment of the coronins and motif comparisons with CA domains of WASP family proteins available at CyMoBase (unpublished data, ). Graphical representations of the sequence patterns have been generated with WebLogo .
This work has been funded by grants KO 2251/3-1 and KO 2251/3-2 of the Deutsche Forschungs gemein schaft.
- de Hostos EL, Bradtke B, Lottspeich F, Guggenheim R, Gerisch G: Coronin, an actin binding protein of Dictyostelium discoideum localized to cell surface projections, has sequence similarities to G protein beta subunits. EMBO J. 1991, 10: 4097-4104.PubMedPubMed CentralGoogle Scholar
- Tardieux I, Liu X, Poupel O, Parzy D, Dehoux P, Langsley G: A Plasmodium falciparum novel gene encoding a coronin-like protein which associates with actin filaments. FEBS Lett. 1998, 441: 251-256. 10.1016/S0014-5793(98)01557-9.View ArticlePubMedGoogle Scholar
- Figueroa JV, Precigout E, Carcy B, Gorenflot A: Identification of a coronin-like protein in Babesia species. Ann N Y Acad Sci. 2004, 1026: 125-138. 10.1196/annals.1307.017.View ArticlePubMedGoogle Scholar
- Heil-Chapdelaine RA, Tran NK, Cooper JA: The role of Saccharomyces cerevisiae coronin in the actin and microtubule cytoskeletons. Curr Biol. 1998, 8: 1281-1284. 10.1016/S0960-9822(07)00539-8.View ArticlePubMedGoogle Scholar
- Suzuki K, Nishihata J, Arai Y, Honma N, Yamamoto K, Irimura T, Toyoshima S: Molecular cloning of a novel actin-binding protein, p57, with a WD repeat and a leucine zipper motif. FEBS Lett. 1995, 364: 283-288. 10.1016/0014-5793(95)00393-N.View ArticlePubMedGoogle Scholar
- de Hostos EL: A brief history of the coronin family. Subcell Biochem. 2008, 48: 31-40. 10.1007/978-0-387-09595-0_4.View ArticlePubMedGoogle Scholar
- Clemen CS, Rybakin V, Eichinger L: The coronin family of proteins. Subcell Biochem. 2008, 48: 1-5. 10.1007/978-0-387-09595-0_1.View ArticlePubMedGoogle Scholar
- Uetrecht AC, Bear JE: Coronins: the return of the crown. Trends Cell Biol. 2006, 16: 421-426. 10.1016/j.tcb.2006.06.002.View ArticlePubMedGoogle Scholar
- Smith TF: Diversity of WD-repeat proteins. Subcell Biochem. 2008, 48: 20-30. 10.1007/978-0-387-09595-0_3.View ArticlePubMedGoogle Scholar
- Neer EJ, Schmidt CJ, Nambudripad R, Smith TF: The ancient regulatory-protein family of WD-repeat proteins. Nature. 1994, 371: 297-300. 10.1038/371297a0.View ArticlePubMedGoogle Scholar
- de Hostos EL, Rehfuess C, Bradtke B, Waddell DR, Albrecht R, Murphy J, Gerisch G: Dictyostelium mutants lacking the cytoskeletal protein coronin are defective in cytokinesis and cell motility. J Cell Biol. 1993, 120: 163-173. 10.1083/jcb.120.1.163.View ArticlePubMedGoogle Scholar
- Cai L, Holoweckyj N, Schaller MD, Bear JE: Phosphorylation of coronin 1B by protein kinase C regulates interaction with Arp2/3 and cell motility. J Biol Chem. 2005, 280: 31913-31923. 10.1074/jbc.M504146200.View ArticlePubMedGoogle Scholar
- Maniak M, Rauchenberger R, Albrecht R, Murphy J, Gerisch G: Coronin involved in phagocytosis: dynamics of particle-induced relocalization visualized by a green fluorescent protein Tag. Cell. 1995, 83: 915-924. 10.1016/0092-8674(95)90207-4.View ArticlePubMedGoogle Scholar
- Ferrari G, Langen H, Naito M, Pieters J: A coat protein on phagosomes involved in the intracellular survival of mycobacteria. Cell. 1999, 97: 435-447. 10.1016/S0092-8674(00)80754-0.View ArticlePubMedGoogle Scholar
- Rybakin V, Stumpf M, Schulze A, Majoul IV, Noegel AA, Hasse A: Coronin 7, the mammalian POD-1 homologue, localizes to the Golgi apparatus. FEBS Lett. 2004, 573: 161-167. 10.1016/j.febslet.2004.07.066.View ArticlePubMedGoogle Scholar
- de Hostos EL: The coronin family of actin-associated proteins. Trends Cell Biol. 1999, 9: 345-350. 10.1016/S0962-8924(99)01620-7.View ArticlePubMedGoogle Scholar
- Appleton BA, Wu P, Wiesmann C: The crystal structure of murine coronin-1: a regulator of actin cytoskeletal dynamics in lymphocytes. Structure. 2006, 14: 87-96. 10.1016/j.str.2005.09.013.View ArticlePubMedGoogle Scholar
- Spoerl Z, Stumpf M, Noegel AA, Hasse A: Oligomerization, F-actin interaction, and membrane association of the ubiquitous mammalian coronin 3 are mediated by its carboxyl terminus. J Biol Chem. 2002, 277: 48858-48867. 10.1074/jbc.M205136200.View ArticlePubMedGoogle Scholar
- Oku T, Itoh S, Ishii R, Suzuki K, Nauseef WM, Toyoshima S, Tsuji T: Homotypic dimerization of the actin-binding protein p57/coronin-1 mediated by a leucine zipper motif in the C-terminal region. Biochem J. 2005, 387: 325-331. 10.1042/BJ20041020.View ArticlePubMedPubMed CentralGoogle Scholar
- Kammerer RA, Kostrewa D, Progias P, Honnappa S, Avila D, Lustig A, Winkler FK, Pieters J, Steinmetz MO: A conserved trimerization motif controls the topology of short coiled coils. Proc Natl Acad Sci USA. 2005, 102: 13891-13896. 10.1073/pnas.0502390102.View ArticlePubMedPubMed CentralGoogle Scholar
- Rybakin V, Clemen CS: Coronin proteins as multifunctional regulators of the cytoskeleton and membrane trafficking. Bioessays. 2005, 27: 625-632. 10.1002/bies.20235.View ArticlePubMedGoogle Scholar
- Morgan RO, Fernandez MP: Molecular phylogeny and evolution of the coronin gene family. Subcell Biochem. 2008, 48: 41-55. 10.1007/978-0-387-09595-0_5.View ArticlePubMedGoogle Scholar
- Odronitz F, Hellkamp M, Kollmar M: diArk--a resource for eukaryotic genome research. BMC Genomics. 2007, 8: 103-10.1186/1471-2164-8-103.View ArticlePubMedPubMed CentralGoogle Scholar
- Breathnach R, Chambon P: Organization and expression of eucaryotic split genes coding for proteins. Annu Rev Biochem. 1981, 50: 349-383. 10.1146/annurev.bi.50.070181.002025.View ArticlePubMedGoogle Scholar
- Odronitz F, Pillmann H, Keller O, Waack S, Kollmar M: WebScipio: an online tool for the determination of gene structures using protein sequences. BMC Genomics. 2008, 9: 422-10.1186/1471-2164-9-422.View ArticlePubMedPubMed CentralGoogle Scholar
- Keller O, Odronitz F, Stanke M, Kollmar M, Waack S: Scipio: using protein sequences to determine the precise exon/intron structures of genes and their orthologs in closely related species. BMC Bioinformatics. 2008, 9: 278-10.1186/1471-2105-9-278.View ArticlePubMedPubMed CentralGoogle Scholar
- Parfrey LW, Grant J, Tekle YI, Lasek-Nesselquist E, Morrison HG, Sogin ML, Patterson DJ, Katz LA: Broadly sampled multigene analyses yield a well-resolved eukaryotic tree of life. Syst Biol. 2010, 59: 518-533. 10.1093/sysbio/syq037.View ArticlePubMedPubMed CentralGoogle Scholar
- Reeb VC, Peglar MT, Yoon HS, Bai JR, Wu M, Shiu P, Grafenberg JL, Reyes-Prieto A, Rummele SE, Gross J, Bhattacharya D: Interrelationships of chromalveolates within a broadly sampled tree of photosynthetic protists. Mol Phylogenet Evol. 2009, 53: 202-211. 10.1016/j.ympev.2009.04.012.View ArticlePubMedGoogle Scholar
- Hampl V, Hug L, Leigh JW, Dacks JB, Lang BF, Simpson AG, Roger AJ: Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic "supergroups". Proc Natl Acad Sci USA. 2009, 106: 3859-3864. 10.1073/pnas.0807880106.View ArticlePubMedPubMed CentralGoogle Scholar
- Goode BL, Wong JJ, Butty AC, Peter M, McCormack AL, Yates JR, Drubin DG, Barnes G: Coronin promotes the rapid assembly and cross-linking of actin filaments and may link the actin and microtubule cytoskeletons in yeast. J Cell Biol. 1999, 144: 83-98. 10.1083/jcb.144.1.83.View ArticlePubMedPubMed CentralGoogle Scholar
- Liu SL, Needham KM, May JR, Nolen BJ: Mechanism of a Concentration-dependent Switch between Activation and Inhibition of Arp2/3 Complex by Coronin. J Biol Chem. 2011, 286: 17039-17046. 10.1074/jbc.M111.219964.View ArticlePubMedPubMed CentralGoogle Scholar
- Veltman DM, Insall RH: WASP family proteins: their evolution and its physiological implications. Mol Biol Cell. 2010, 21: 2880-2893. 10.1091/mbc.E10-04-0372.View ArticlePubMedPubMed CentralGoogle Scholar
- Vertessy BG, Toth J: Keeping uracil out of DNA: physiological role, structure and catalytic mechanism of dUTPases. Acc Chem Res. 2009, 42: 97-106. 10.1021/ar800114w.View ArticlePubMedPubMed CentralGoogle Scholar
- Gloss A, Rivero F, Khaire N, Muller R, Loomis WF, Schleicher M, Noegel AA: Villidin, a novel WD-repeat and villin-related protein from Dictyostelium, is associated with membranes and the cytoskeleton. Mol Biol Cell. 2003, 14: 2716-2727. 10.1091/mbc.E02-12-0827.View ArticlePubMedPubMed CentralGoogle Scholar
- Yonemura I, Mabuchi I: Heterogeneity of mRNA coding for Caenorhabditis elegans coronin-like protein. Gene. 2001, 271: 255-259. 10.1016/S0378-1119(01)00509-1.View ArticlePubMedGoogle Scholar
- Xavier CP, Rastetter RH, Stumpf M, Rosentreter A, Muller R, Reimann J, Cornfine S, Linder S, van Vliet V, Hofmann A, Morgan RO, Fernandez MP, Schroder R, Noegel AA, Clemen CS: Structural and functional diversity of novel coronin 1C (CRN2) isoforms in muscle. J Mol Biol. 2009, 393: 287-299. 10.1016/j.jmb.2009.07.079.View ArticlePubMedGoogle Scholar
- Asano S, Mishima M, Nishida E: Coronin forms a stable dimer through its C-terminal coiled coil region: an implicated role in its localization to cell periphery. Genes Cells. 2001, 6: 225-235. 10.1046/j.1365-2443.2001.00416.x.View ArticlePubMedGoogle Scholar
- Beck K, Gambee JE, Kamawal A, Bachinger HP: A single amino acid can switch the oligomerization state of the alpha-helical coiled-coil domain of cartilage matrix protein. EMBO J. 1997, 16: 3767-3777. 10.1093/emboj/16.13.3767.View ArticlePubMedPubMed CentralGoogle Scholar
- Oku T, Itoh S, Okano M, Suzuki A, Suzuki K, Nakajin S, Tsuji T, Nauseef WM, Toyoshima S: Two regions responsible for the actin binding of p57, a mammalian coronin family actin-binding protein. Biol Pharm Bull. 2003, 26: 409-416. 10.1248/bpb.26.409.View ArticlePubMedGoogle Scholar
- Cai L, Makhov AM, Bear JE: F-actin binding is essential for coronin 1B function in vivo. J Cell Sci. 2007, 120: 1779-1790. 10.1242/jcs.007641.View ArticlePubMedGoogle Scholar
- Gandhi M, Jangi M, Goode BL: Functional surfaces on the actin-binding protein coronin revealed by systematic mutagenesis. J Biol Chem. 2010, 285: 34899-34908. 10.1074/jbc.M110.171496.View ArticlePubMedPubMed CentralGoogle Scholar
- Van de Peer Y, Maere S, Meyer A: 2R or not 2R is not the question anymore. Nat Rev Genet. 2010, 11: 166-View ArticlePubMedGoogle Scholar
- Steinke D, Hoegg S, Brinkmann H, Meyer A: Three rounds (1R/2R/3R) of genome duplications and the evolution of the glycolytic pathway in vertebrates. BMC Biol. 2006, 4: 16-10.1186/1741-7007-4-16.View ArticlePubMedPubMed CentralGoogle Scholar
- Jaillon O, Aury JM, Brunet F, Petit JL, Stange-Thomann N, Mauceli E, Bouneau L, Fischer C, Ozouf-Costaz C, Bernot A, Nicaud S, Jaffe D, Fisher S, Lutfalla G, Dossat C, Segurens B, Dasilva C, Salanoubat M, Levy M, Boudet N, Castellano S, Anthouard V, Jubin C, Castelli V, Katinka M, Vacherie B, Biemont C, Skalli Z, Cattolico L, Poulain J, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , et al: Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature. 2004, 431: 946-957. 10.1038/nature03025.View ArticlePubMedGoogle Scholar
- Chan KT, Creed SJ, Bear JE: Unraveling the enigma: progress towards understanding the coronin family of actin regulators. Trends Cell Biol. 2011, 21: 481-488. 10.1016/j.tcb.2011.04.004.View ArticlePubMedPubMed CentralGoogle Scholar
- Odronitz F, Kollmar M: Drawing the tree of eukaryotic life based on the analysis of 2,269 manually annotated myosins from 328 species. Genome Biol. 2007, 8: R196-10.1186/gb-2007-8-9-r196.View ArticlePubMedPubMed CentralGoogle Scholar
- Berg JS, Powell BC, Cheney RE: A millennial myosin census. Mol Biol Cell. 2001, 12: 780-794.View ArticlePubMedPubMed CentralGoogle Scholar
- Archer SK, Claudianos C, Campbell HD: Evolution of the gelsolin family of actin-binding proteins as novel transcriptional coactivators. Bioessays. 2005, 27: 388-396. 10.1002/bies.20200.View ArticlePubMedGoogle Scholar
- Xavier CP, Eichinger L, Fernandez MP, Morgan RO, Clemen CS: Evolutionary and functional diversity of coronin proteins. Subcell Biochem. 2008, 48: 98-109. 10.1007/978-0-387-09595-0_9.View ArticlePubMedGoogle Scholar
- Khurana S, George SP: Regulation of cell structure and function by actin-binding proteins: villin's perspective. FEBS Lett. 2008, 582: 2128-2139. 10.1016/j.febslet.2008.02.040.View ArticlePubMedPubMed CentralGoogle Scholar
- Simpson AG, Inagaki Y, Roger AJ: Comprehensive multigene phylogenies of excavate protists reveal the evolutionary positions of "primitive" eukaryotes. Mol Biol Evol. 2006, 23: 615-625.View ArticlePubMedGoogle Scholar
- Keeling PJ: The endosymbiotic origin, diversification and fate of plastids. Philos Trans R Soc Lond B Biol Sci. 2010, 365: 729-748. 10.1098/rstb.2009.0103.View ArticlePubMedPubMed CentralGoogle Scholar
- Burki F, Shalchian-Tabrizi K, Pawlowski J: Phylogenomics reveals a new 'megagroup' including most photosynthetic eukaryotes. Biol Lett. 2008, 4: 366-369. 10.1098/rsbl.2008.0224.View ArticlePubMedPubMed CentralGoogle Scholar
- Burki F, Shalchian-Tabrizi K, Minge M, Skjaeveland A, Nikolaev SI, Jakobsen KS, Pawlowski J: Phylogenomics reshuffles the eukaryotic supergroups. PLoS One. 2007, 2: e790-10.1371/journal.pone.0000790.View ArticlePubMedPubMed CentralGoogle Scholar
- Nozaki H, Maruyama S, Matsuzaki M, Nakada T, Kato S, Misawa K: Phylogenetic positions of Glaucophyta, green plants (Archaeplastida) and Haptophyta (Chromalveolata) as deduced from slowly evolving nuclear genes. Mol Phylogenet Evol. 2009, 53: 872-880. 10.1016/j.ympev.2009.08.015.View ArticlePubMedGoogle Scholar
- Keeling PJ: Chromalveolates and the evolution of plastids by secondary endosymbiosis. J Eukaryot Microbiol. 2009, 56: 1-8. 10.1111/j.1550-7408.2008.00371.x.View ArticlePubMedGoogle Scholar
- Hackett JD, Yoon HS, Li S, Reyes-Prieto A, Rummele SE, Bhattacharya D: Phylogenomic analysis supports the monophyly of cryptophytes and haptophytes and the association of rhizaria with chromalveolates. Mol Biol Evol. 2007, 24: 1702-1713. 10.1093/molbev/msm089.View ArticlePubMedGoogle Scholar
- diArk - a resource for eukaryotic genome research. [http://www.diark.org]
- Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL: NCBI BLAST: a better web interface. Nucleic Acids Res. 2008, 36: W5-9. 10.1093/nar/gkn201.View ArticlePubMedPubMed CentralGoogle Scholar
- Odronitz F, Kollmar M: Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase). BMC genomics. 2006, 7: 300-10.1186/1471-2164-7-300.View ArticlePubMedPubMed CentralGoogle Scholar
- CyMoBase - a database for cytoskeletal and motor proteins. [http://www.cymobase.org]
- Thompson JD, Gibson TJ, Higgins DG: Multiple sequence alignment using ClustalW and ClustalX. Curr Protoc Bioinformatics. 2002, Chapter 2: Unit 2 3-PubMedGoogle Scholar
- Stamatakis A, Hoover P, Rougemont J: A rapid bootstrap algorithm for the RAxML Web servers. Syst Biol. 2008, 57: 758-771. 10.1080/10635150802429642.View ArticlePubMedGoogle Scholar
- Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.View ArticlePubMedGoogle Scholar
- Altekar G, Dwarkadas S, Huelsenbeck JP, Ronquist F: Parallel Metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference. Bioinformatics. 2004, 20: 407-415. 10.1093/bioinformatics/btg427.View ArticlePubMedGoogle Scholar
- Whelan S, Goldman N: A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001, 18: 691-699.View ArticlePubMedGoogle Scholar
- Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21: 2104-2105. 10.1093/bioinformatics/bti263.View ArticlePubMedGoogle Scholar
- Le SQ, Gascuel O: An improved general amino acid replacement matrix. Mol Biol Evol. 2008, 25: 1307-1320. 10.1093/molbev/msn067.View ArticlePubMedGoogle Scholar
- Letunic I, Doerks T, Bork P: SMART 6: recent updates and new developments. Nucleic Acids Res. 2009, 37: D229-232. 10.1093/nar/gkn808.View ArticlePubMedGoogle Scholar
- Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, Holm L, Sonnhammer EL, Eddy SR, Bateman A: The Pfam protein families database. Nucleic Acids Res. 2010, 38: D211-222. 10.1093/nar/gkp985.View ArticlePubMedGoogle Scholar
- Sigrist CJ, Cerutti L, de Castro E, Langendijk-Genevaux PS, Bulliard V, Bairoch A, Hulo N: PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Res. 2010, 38: D161-166. 10.1093/nar/gkp885.View ArticlePubMedGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14: 1188-1190. 10.1101/gr.849004.View ArticlePubMedPubMed CentralGoogle Scholar
- Letunic I, Bork P: Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics. 2007, 23: 127-128. 10.1093/bioinformatics/btl529.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.