Immunoglobulin heavy chains in medaka (Oryzias latipes)
© Magadán-Mompó et al; licensee BioMed Central Ltd. 2011
Received: 8 March 2011
Accepted: 15 June 2011
Published: 15 June 2011
Bony fish present an immunological system, which evolved independently from those of animals that migrated to land 400 million years ago. The publication of whole genome sequences and the availability of several cDNA libraries for medaka (Oryzias latipes) permitted us to perform a thorough analysis of immunoglobulin heavy chains present in this teleost.
We identified IgM and IgD coding ESTs, mainly in spleen, kidney and gills using published cDNA libraries but we did not find any sequence that coded for IgT or other heavy chain isotypes described in fish. The IgM - ESTs corresponded with the secreted and membrane forms and surprisingly, the latter form only presented two constant heavy chain domains. This is the first time that this short form of membrane IgM is described in a teleost. It is different from that identified in Notothenioid teleost because it does not present the typical splicing pattern of membrane IgM. The identified IgD-ESTs only present membrane transcripts, with Cμ1 and five Cδ exons. Furthermore, there are ESTs with sequences that do not have any VH which disrupt open reading frames.
A scan of the medaka genome using transcripts and genomic short reads resulted in five zones within a region on chromosome 8 with Cμ and Cδ exons. Some of these exons do not form part of antibodies and were at times interspersed, suggesting a recombination process between zones. An analysis of the ESTs confirmed that no antibodies are expressed from zone 3.
Our results suggest that the IGH locus duplication is very common among teleosts, wherein the existence of a recombination process explains the sequence homology between them.
Genome information of vertebrates is rapidly becoming available thanks to several full vertebrate genome projects. Such information is very useful for comparative and evolutionary biologists. Comparative genomic studies are helping to discover evolutionary mechanisms that underlie diversification of organisms [1, 2]. Therefore, information obtained from genomes is of great use for understanding the genetic basis of antibody diversity and the evolutionary divergences of the immunoglobulin locus in vertebrates . Immunoglobulin loci are organised into two main types called: "cluster" and "translocon". Cluster type organization is found in both light and heavy chain loci of cartilaginous fish [4, 5] There are many independent variable (VH), diversity (D), joining (JH) and constant (CH) segments sets [VH(D)JHCH] along wide areas of the genome. Therefore, diversity in these molecules is generated through synthesis of antibodies from each of these VH-D-JH-CH regions [6, 7]. In tetrapods and bony fish, the IGH locus configuration is translocon and it presents some specific characteristics. There are genomic segments for the variable regions of antibody heavy chains (VH) and these are followed by segments that code for: diversity (D), joining (JH), and segments that encode the heavy chain domains (CH). A rearranged VHDJH region spliced to CH segment is needed to generate an antibody [8, 9].
It is well established that all fishes have IGHM and other constant chain region genes in the 3' region. Dooley and Flajnik described genes that encoded the IgW (omega immunoglobulin isotype) and IgNAR (New Antigen Receptor) antibodies in the 3' region, for cartilaginous fish [10–12]. Most bony fish belong to the infraclass teleost, where we can find IgM, IgD [13–15] and IgT/IgZ . However, the IgT/IgZ have not been found in catfish . Teleost IgD is an antibody which generally has seven domains and some of these have experienced recent duplications . The IGHZ (of zebrafish) and IGHT (of rainbow trout) correspond to genes that code for antibodies (IgZ and IgT) with four immunoglobulin domains located upstream from the D and JH segments of IGHM. Furthermore, the exons that code for the constant region present their own D and JH segments, and resemble the organization of T cell receptor alpha and delta (TCR α and δ) loci . Other genes for antibodies found at the same location were described later, and may correspond to different forms of the same antibody [17, 19].
Another surprising feature found in some teleost IGH loci, such as in stickleback, catfish [14, 17] and medaka, is the presence of core block [VH(D)JHCH] duplications in the germline. Such presence is perhaps not widespread in teleosts because they were not found in zebrafish genome . The duplications present a high homology suggesting that they happened recently or perhaps there is a biological mechanism that maintains them.
This article presents a description of the antibodies in medaka, wherein antibody structure was deduced based on genomic and EST data. Five zones or regions that code for constant chain immunoglobulin domains have been found in genome, and each of these regions has exons for IgM and IgD. Medaka (Oryzias latipes), catfish (Ictalurus punctatus), zebrafish (Danio rerio) and stickleback (Gasterosteus aculeatus) represent a group of teleosts that have been widely used as animal models in various fields such as biology, medicine, environmental science and fisheries [20, 21]. There is ample information on zebrafish, catfish and stickleback immunoglobulin loci but this is the first time that work on medaka immunoglobulins is published.
Fish and sampling
Adult medaka (Oryzias latipes, strain HdrR belongs to the Southern Japanese population) specimens were kindly supplied by J. Cerdá (Institute of Marine Sciences of Barcelona, CSIC, and Aquaculture Centre). Fish were killed by overexposure to MS222 (Sigma Chemicals). Head kidney and spleen were removed aseptically and RNA was extracted immediately using the QIAmp RNA kit (QIAGEN) following manufacturer's instructions.
cDNA preparation, PCR and DNA sequencing
About 5 μg of total RNA was reverse transcripted into cDNA by using QIAGEN One Step RT-PCR kit and priming with 0.5 μM of Cδ6-antisense primer (5'- GGACTGTTGGAGGATTCATGTCTCACA-3') in a total volume of 50 μl.
Amplification of the IgD constant region was performed in a two-step PCR reaction. 5 μl of cDNA reaction mixture was amplified by thermal cycling in a total volume of 25 μl using Cμ1-sense (5'-CATTGACTTTCTCATGGACTCAGGGC-3') combined with Cδ6- antisense primer. Amplification was performed for 30 cycles at 95°C 30s, 65°C for 30s and 72°C for 90s, with a final elongation step at 72°C for 10 min. Due to a very low amplification product obtained from the first PCR, a second round was performed for 20 more cycles using the same primers and conditions. The amplified products were sequenced on an Applied Biosystems 3130 Genetic Analyzer. The Gepard (GEnome PAir -Rapid Dotter) program  was used to search for homologues with the genomic sequences and identify the IgD domains.
Medaka immunoglobulin expression using ESTs databases
Previously identified immunoglobulin constant heavy chain exons from stickleback  were used to search homologue sequences in the medaka ESTs database (http://www.shigen.nig.ac.jp/medaka). A total of 11 cDNA libraries generated from different tissues of HdrR-medaka were scanned (Additional file 1). ESTs encoding for IgM and/or IgD were retrieved. The medaka immunoglobulin ESTs can be found grouped into three clusters: a) CLSTF16513, with the 5' sequences encoding IgM and IgD, b) CLSTR12908 with 3' sequences for IgM and c) CLSTR18886 with 3' IgD sequences.
In order to identify the genomic zone or region that corresponds to each EST, an alignment was performed using the Lastz program available at the Galaxy website (http://main.g2.bx.psu.edu/) [23, 24]. To confirm results we performed the same analysis using recently released next generation RNA sequences (SRA023697) deposited in the Sequence Read Archive database of the NCBI (http://www.ncbi.nlm.nih.gov/sra). These alignments were visualized using the Tablet - Next Generation Sequence Assembly Visualization software (http://bioinf.scri.ac.uk/tablet/) .
Identification of the IGH locus
The complete genome of Oryzias latipes (assembly: HdrR, October 2005; version 56.1i) built in NCBI (http://www.ncbi.nlm.nih.gov) and Ensembl database platforms (http://www.ensembl.org/index.html) was examined to locate antibody genes. Previously published sequences from other IGHM teleost fish were used to identify genomic scaffolds and chromosomes that contained immunoglobulin genes. These sequences (scaffolds 146, 409 and 501, chromosome 8) were retrieved and analysed in detail using the Vector-NTI (Invitrogen). Two scaffolds were not assigned to any chromosome (scaffold 3172 and 1447) but were identified as harboring IGH gene segments and these scaffolds were observed to overlap on 400 nucleotides suggesting that they are contiguous (Additional File 2).
Identification of exons coding for CH domains was performed by aligning genomic sequences with previously published immunoglobulin mRNAs. Limits of unpublished antibodies were deduced following instructions in the software FGNESH (http://www.softberry.com) and Augustus (http://augustus.gobics.de/submission) . Messenger RNA predicted from the gene sequence was compared with O. latipes EST sequences from NCBI and http://www.shigen.nig.ac.jp/medaka, in order to confirm exon ends and analyse gene expression.
The heavy chain variable segments (VH) of medaka were located on the same scaffolds and chromosome. Several criteria were used to identify VH segments, including: a) the presence of recombination signal sequences (RSS) including the canonical "tattattgt" nonamer sequences (allowing 1 or 2 nucleotide mismatches) and corresponding heptamer sequences, b) the presence of AG and and GT splice sides flanking open reading frames, and c) pattern searches for identifying RSS with 23 bp spacers flanking the 3'end of the VH regions. We verified whether the read sequences corresponded to the VH regions .
D segments were identified by the presence of RSS 5' and RSS 3' . They were compared with O. latipes EST database in order to confirm their expression. The heavy chain joining (JH) segments were located by homology to published JH sequences. This was carried out by comparing a dot plot between published JH sequences and the 5' region of the IGHM (implementing a window of 30 nt and a match of 60%). RSS was used to detect the beginning of the JH exon while the presence of "GTA" was used to determine the end .
In order to resolve occasional mistakes and complete the gaps, all scaffolds retrieved (scaffolds 146, 409 and 501, chromosome 8) were aligned with the recently released genomic new generation sequences (DRA000220), deposited in the Sequence Read Archive database of the NCBI (http://www.ncbi.nlm.nih.gov/sra). The in silico analysis was carried out using the available tools at the Galaxy website (http://main.g2.bx.psu.edu/) and visualized with Tablet - Next Generation Sequence Assembly Visualization software (http://bioinf.scri.ac.uk/tablet/).
Comparative phylogenetic studies were carried out with the program MEGA5  using the algorithm to perform BLOSSUM alignments. The neighbour-joining and minimum evolution methods were then used to plot the phylogenetic trees (pair-wise deletion, JonesTaylor-Thornton matrix and enter range activated sites (gamma-number 2.5). The veracity of these trees was studied using the above-mentioned method and by executing 1000 replicate bootstrappings.
IgM accession numbers: X83372 Oncorhynchus mykiss (rainbow trout), AB2 17624 Takifugu rubripes (Fugu rubripes), AAQ14862 Sineperca chuatsi, AAF69488 Hippoglossus hippoglossus, A46538 Gadus morhua (Atlantic cod), AAO37747 Ornithorhynchus anatinus (platypus) and EU287910 and EU28791 1 Eublepharis macularius (leopard gecko). The G. aculeatus IGH sequences are in the supplementary information (Additional File 3) .
Immunoglobulins in medaka
A bioinformatic search of ESTs in the NBRP medaka database (http://www.shigen.nig.ac.jp/medaka/) was carried out in order to determine the kind of antibodies expressed by the teleost medaka. Previously published Cμ, Cδ, Cζ, sequences from G. aculeatus  were used as queries to identify the ESTs.
Tissue distribution for immunoglobulin ESTs
IgM - EST
IgD - EST
The secreted IgM form appears to be similar to those described in other teleostei , with four CH domains and a secretory tail. This IgM presents three cysteines for interchain bonds, one in the CH1 domain to establish a disulfide bond with the light chain, another in the CH3 domain to join heavy chains and finally, in the secretory tail, probably to form multimers (Additional File 4).
The study of IgD ESTs permitted us to deduce its structure. This is similar to those described in other teleostei, in which the first constant domain is Cμ1 followed by Cδ1. The Cδ6, Cδ7, TM1 and TM2 domains were present in the ESTs in all cases. All IgD domains expressed could not be described because the forward and reverse ESTs sequences did not overlap. Thus, we decided to perform a RT-PCR of head kidney and spleen mRNA with primers designed for the Cμ1 and Cδ6. A PCR product of approximately 1600 bp was obtained and its sequencing confirmed the presence of Cδ1, Cδ2, Cδ3, Cδ4 and Cδ6 domains (Additional File 5). There was no Cδ5 equivalent in all IgD transcripts sequenced.
Medaka IGH Genomic organization
We were able to elucidate the IGH genomic organization, despite finding several gaps, mainly between scaffolds and contigs, which prevent us from creating a complete contiguous annotation. Furthermore, we were able to complete some gaps and solve several contradictions found between ESTs and genomic sequences, using recently deposited next generation sequence data (DRA000220 and SRA026397) in the Sequence Read Archive Database (http://www.ncbi.nlm.nih.gov/sra).
Overall, Cμ and Cδ exons, D and JH segments (Figure 3) were identified in each of the five genomic zones. No Cδ exons were found and this is consistent with data obtained from ESTs analysis. As indicated in Figure 3 some exons were identified or corrected based on the analysis of ESTs and the next generation sequence data (SRA026397 and DRA000220). VH regions were found between zones (see Additional File 6 and 8).
The genomic region designated as zone 1 encodes seven JH segments followed by Cμ1, Cμ2, Cμ3 and Cμ4, and harbors exons that code for a transmembrane and cytoplasm domain. In this zone there are only four Cδ exons (Cδ1, Cδ2, Cδ6 and Cδ7) located 3 kb downstream of the nearest Cμ and are followed by transmembrane and cytoplasm exons. There is a gap between Cδ2 and Cδ6, where there is a high probability of finding the presence of Cδ exons in this first zone.
The remaining zones give us an idea of asymmetric duplications, that is, the presence of Cμ and Cδ exons with a changed configuration. In zone 2, just like in zone 1, the exons Cμ1, Cμ2, Cμ3 (also deduced from EST sequences) and Cμ4 appear after seven D segments and seven JH segments (Figure 3). Exons for transmembrane and cytoplasm domains are also present. At 3' of these exons, we find one Cδ2 exon without any other sequence coding for IgD antibody. Interestingly, about 5 kb downstream, we find D and JH segments followed by Cμ1 and Cδ1- Cδ2- Cδ3- Cδ4- Cδ6 exons again. Therefore, we can differentiate two genomic regions in this zone, namely; zone 2a at 5', and zone 2b at 3'. Both of them have exons to IgM and IgD.
The zone designated as zone 3 seems to be quite disorganized when compared with other zones. As shown in Figure 3, there are very few exons and this suggests that this zone may not generate functional antibodies. Conversely, zone 4 appears to be well structured and presents the highest number of exons. At the 5' region there are four Cμ exons, including Cμ2 with their transmembrane and cytoplasm coding exons (Figure 3). Surprisingly, domain Cμ4 and the transmembrane and cytoplasm exons are found to be duplicated. At the 3' region, there are 10 exons for IgD domains, some of which are repeated (Cδ2, Cδ3, Cδ4 and Cδ6). Between the last Cδ6 and the Cδ7 there are exons that code for IgM (Cμ2 and Cμ4) and finally, we find sequences for transmembrane and cytoplasm IgD domains.
At present, zone 5 is the least resolved genomic region. This is due to the presence of a gap of about 30 kb between scaffold 501 and the 146 junction. The identified sequences, D segments and Cδ exons, are found to be inverted. The IGHM might be missing due to the presence of the gap however; it is very probable that scaffolds 3172 and 1447 belong to the gap because they are not assigned to any chromosome and present sequences for IgM domains (See Figure 3). The above will be taken into account and from now on we will be referred to as zone5/x.
Correlation between ESTs and genomic sequences
IgM coding ESTs expressed by each genomic zone
On the subject of ESTs coding for IgM, we identified a total of 34 ESTs expressed from zone 1, where 21 corresponded to the secreted form and 13 to the membrane form (Table 2). Thirty-two ESTs were assigned to zone 2, with membrane (6 ESTs) and secreted (26 ESTs) forms. Eleven IgM membrane and five IgM secreted coding ESTs belonged to zone 4. Only 8 IgM ESTs (4 membrane and 4 secreted) were found to be expressed from zone 4 and, as expected due to its disorganized genomic structure, no EST from zone 3 was detected.
The analysis of cDNA libraries obtained from different tissues permitted the identification of Ig exons expressed in medaka (Oryzias latipes). ESTs coding for IgM and IgD were identified but no expression or genomic data was found for other isotypes in medaka.
In mammals, the production of secreted and membrane IgM forms involves alternative splicing. The transmembrane form is originated through a cryptic splice site located within Cμ4 that have the acceptor site at 3' of the TM1 exon . This pattern is manifested in Xenopus and cartilaginous fish too [31–33]. However, in teleosts the transmembrane IgM is comprised of the first three exons (Cμ1, Cμ2 and Cμ3) plus the transmembrane and cytoplasm exons [34, 35]. The splicing pattern of IgM appears consistent with exceptions only in a few species, for example, membrane IgM chains with different number of Cμ domains have been described in ancient fishes [30, 36], in which we observe the general rule followed in teleost fishes, Cμ3 - TM, as well as the mammalian pathway, Cμ4 - TM. However, in Siberian Sturgeon, the splicing pattern can result in a transmembrane immunoglobulin with four, two and half, or only one Cμ domain . Notothenioid teleosts membrane IgM transcripts likewise lack the Cμ3, and the Cμ2 is spliced to two short exons (RA and RB) creating an elongated extracellular membrane-proximal domain . Nevertheless, the splicing observed in medaka occurs between the end Cμ2 and TM1 and produces a membrane antigen receptor of only two constant immunoglobulin domains. This is the first time that in a typical teleost is described to have a short transmembrane IgM and indicates that other teleosts may have evolved to exhibit considerable diversity in IgM splicing. Such diversity may be due to a selection process or due to "genomic configurations" that led to the modification of the splicing machinery.
The medaka IgD transcripts studied correspond to the membrane form and, just as in other teleosts, are chimeric, with the inclusion of Cμ1 and six Cδ exons. The Cμ1 exon permits covalent association with light chains, this kind of splicing (Cμ1 to Cδ exon) is not only restricted to teleosts as it has recently been described in porcine IgD transcripts. One interesting feature is that medaka transcripts lack the canonical Cδ5 exon and this finding is confirmed in the genomic sequence, where IGHD loci seem to have been subjected to dramatic recombination events leading to loss of the Cδ5 exon. A high diversity in IGHD genes has been described in teleosts [39, 40]. Seven Cδ domains comprise the backbone of many bony fish delta chains, wherein a wide range of domain organization within fish lineages is observed. In the Japanese flounder (Paralichthys olivaceus)  and stickleback (Gasterosteus aculeatus) , the IGHD locus consists of the Cδ1-Cδ2-Cδ3-Cδ4-Cδ5-Cδ6-Cδ7-TM1-TM2 exons, in which the homology of domains CH2-CH5, CH3-CH6 and CH4-CH7 suggests that Cδ2-Cδ3-Cδ4 duplicated to generate Cδ5-Cδ6-Cδ7 [17, 39]. However, in Atlantic salmon (Salmo salar), grass carp (Ctenopharyngodon idella) and catfish (Ictallurus punctatus) a duplication of Cδ2-Cδ3-Cδ4 has been described [15, 34, 42]. In Atlantic cod (Gadus morhua) the IGHD locus has undergone rearrangement events leading to the loss of Cδ3, Cδ4, Cδ5 and Cδ6 exons with a tandem duplication of the Cδ1-Cδ2 region. It appears that diversification of IgD may be due to germline changes that are species specific rather than due to different splicing pattern as described in IgM. Therefore, only in sharks partly of IgD, like W heavy chain, is diversified through alternative splicing. Further studies are needed to understand the reason for this phenomenon and the biological/evolutive meaning of both mechanisms to generate antibody diversity.
Analysis of ESTs showed that there were atypical IgM and IgD transcripts (approx. 15%), which had stop codons interrupting the reading frames. Most of them lacked the VH region and contained a genomic sequence, named exon 0, at the 5' location, which is spliced directly to the constant exons. It is common to find sterile transcripts from light chain loci in teleosts, and these may be associated to the high frequency of enhancers in the IgL loci of bony fishes [43, 44]. Recently, unusual IgD transcripts have also been described in Salmo salar , wherein the VH and JH sequences are not obvious and include genomic sequences. In catfish , in which the Cδ1 is directly spliced to leader exon, which was shown to be functional and capable of mediating secretion of IgD from catfish B cells. The authors suggest the possibility that this secreted IgD functions as a pattern-recognition molecule. These results observed in the several teleost species suggest an evolutive and functional role for non-traditional VHDJH rearrangement and needs to be studied in the future. In medaka, the splicing between exon 0 and the rest of the exons indicate that all components of the immunoglobulin heavy chain, except the VH region, are needed for a specific process in the teleosts.
The ESTs encoding medaka IgM present differences in their Cμ nucleotide sequences, suggesting a duplicated IGH locus in medaka. Therefore, when we scanned the medaka genome with these ESTs we found a very complex locus, with five tandem duplicated Cμ. and Cδ genes separated by VH, D and JH segments. In other fishes we can find duplicated IGH loci, like in I. punctatus, G. aculeatus, S.salar [17, 19, 40, 45] or, like in zebrafish (Danio rerio), only one IGH copy . Duplicated segments in medaka showed a high DNA level homology for exons and introns. The most probable explanation is such duplications occurred recently and take place frequently. In the future, it would be of interest to identify the mechanism responsible for this genetic exchange. Preliminary data indicates the presence of short repeated sequences (SRS)s at the beginning of duplications suggesting their involvement in such exchange processes (data not shown).
The current medaka whole genome sequence draft presents a number of gaps that do not permit exact delineation of gene configuration. Just like in the case of other vertebrates, the IGH locus has regions that are quite difficult to sequence, due to the frequent presence of SRS. Additionally, the analysis of the medaka germline IGH locus gave rise to uncertainties which on the one hand suggested the lack of Cμ3 and on the other identified Cδ7 as a pseudogene. The database of ESTs and the recently released next generation sequence data from Illumina enabled us to confirm the presence of Cμ3 and Cδ7 as functional exons. However, the high sequence homology between the duplicated segments prevented us from providing a gap-free IGH locus annotation using this additional information.
Despite the medaka IGH locus having many genes, no genes for IgT/Z have been so far identified as has been the case in catfish . Furthermore, we found exons and even entire zones (Ex. zone 3) that were not expressed. It is difficult to explain the evolutive significance of the presence of exons, which are predicted to be functional (without stop codon or any other alteration in their sequences) but are not going to be expressed. Perhaps the screening and sequencing of EST libraries was not sensitive enough to detect mRNA in low concentrations. However, it seems improbable that zone 3 could generate a functional antibody. A possible explanation for sequence maintenance would be its relationship with the genetic locus structure itself. The high number of recombinations may determine that the predicted functional exons cannot generate antibodies in the medaka strain studied, even though antibodies were expressed by non-homologue recombination in other medaka fish. In order to verify this hypothesis, the sequencing of these loci in other fish strains of the same species should give us different haplotypes.
IGH locus duplications appear to be common in teleost fishes and should be favoured by natural selection. These observations indicate that these duplications may have arisen in a common ancestor teleost or are due to independent gene duplications that occurred in each specie through their specific phylogenetic history. The fact that many teleosts appear to harbor duplications may support the first hypothesis, however there are also data that suggest an independent evolution in different lineages. The high homology between different zones of the IGH locus (as exons as introns) indicates recent duplications processes. However, if they took place a long time ago, then recombinations events would be required to explain sequence maintenance. In medaka, such duplications and recombinations could explain the presence of immunoglobulin constant exons in germline IGH locus, which are apparently functional but are not expressed. The same reasoning can be applied in the case of VH segments, to explain high homology between members of the same family. Thus, all chromosome segments that contain the IGH locus would be subjected to such duplication and recombination processes.
Duplicated genes have been identified in many teleostean fishes and it has been suggested that species diversity might be related to large-scale independent gene duplications or to whole genome duplication in an ancient teleost [47, 48]. In the case of IGH locus several particular issues remain to be explored. The mechanism known as allelic exclusion prevents the production of more than one specificity in a single lymphoid cell, only one rearrangement product of immunoglobulin is transcribed and translates . Studies of the allelic exclusion of immunoglobulin genes have been performed in species in which a single IGH locus undergoes somatic rearrangement through the lymphocyte development. However, the mechanisms by which teleosts such as medaka, stickleback, catfish, salmo with several IGH locus duplications can exhibit allelic exclusion remains unknown. In medaka, there are at least four IGH duplications that are functional. This means that one cell has the possibility to produce four heavy chains at the same time and therefore could deviate substantially from the clonal selection theory. Eason et al. , identified different productive gene transcripts in isolated single peripheral blood lymphocytes from cartilaginous fish (Raja eglanteria), indicating the possibility of simultaneous immunoglobulin heavy chains expression from multiple different IGH loci in fishes. In cartilaginous fishes, the IGH locus is arranged in multiple independent clusters, thus indicating that the regulation of immunoglobulin expression could be very different from teleost fishes in which the IGH locus is typically in translocon configuration. The fundamental question regarding the establishment and maintenance of haplotype exclusion in a complex multi-cluster- translocon system such as found in medaka IGH locus remains unanswered today.
Further studies are required to a) understand whether IGH locus duplications involve additional biological mechanisms in the immune system and b) to gauge the potential evolutive advantages of such configurations to the generation of immunoglobulin diversity in these species.
The present study shows the genomic organization of the IGH locus in medaka that has genes for IgM and IgD however, no Cτ genes have been identified upstream of the Cμ region. This IGH locus is very complex, with five duplications that present high homology, being four of them functional. Our results suggest that the IGH locus duplication is very common among teleosts, wherein the existence of a recombination process explains the sequence homology between them.
The authors would like to thank Drs. J. Cerdá ( Institute of Marine Sciences of Barcelona, CSIC, and Aquaculture Centre) and J. Rotllant (Institute of Marine Sciences of Vigo, CSIC) for providing medaka fishes to perform these studies. We are also very grateful to Anastasia Zimmerman (Grice Marine Laboratory - College of Charleston) for a critical appreciation of the manuscript.
- Holland PWH: Gene duplication: Past, present and future. Semin Cell Dev Biol. 1999, 10 (5): 541-547. 10.1006/scdb.1999.0335.View ArticlePubMedGoogle Scholar
- Shimeld S, Holland P: Vertebrate innovations. Proc Natl Acad Sci USA. 2000, 97 (9): 4449-4452. 10.1073/pnas.97.9.4449.View ArticlePubMedPubMed CentralGoogle Scholar
- Hsu E, Pulham N, Rumfelt LL, Flajnik MF: The plasticity of immunoglobulin gene systems in evolution. Immunol Rev. 2006, 210 (1): 8-26. 10.1111/j.0105-2896.2006.00366.x.View ArticlePubMedPubMed CentralGoogle Scholar
- Criscitiello MF, Flajnik MF: Four primordial immunoglobulin light chain isotypes, including lambda and kappa, identified in the most primitive living jawed vertebrates. Eur J Immunol. 2007, 37 (10): 2683-2694. 10.1002/eji.200737263.View ArticlePubMedGoogle Scholar
- Flajnik MF, Kasahara M: Origin and evolution of the adaptive immune system: genetic events and selective pressures. Nature Reviews Genetics. 2010, 11 (1): 47-59.View ArticlePubMedPubMed CentralGoogle Scholar
- Harding FA, Amemiya CT, Litman RT, Cohen N, Litman GW: Two distinct immunoglobulin heavy chain isotypes in a primitive, cartilaginous fish, Raja erinacea. Nucleic Acids Res. 1990, 18 (21): 6369-6376. 10.1093/nar/18.21.6369.View ArticlePubMedPubMed CentralGoogle Scholar
- Harding FA, Cohen N, Litman GW: Immunoglobulin heavy chain gene organization and complexity in the skate, Raja erinacea. Nucleic Acids Res. 1990, 18 (4): 1015-1020. 10.1093/nar/18.4.1015.View ArticlePubMedPubMed CentralGoogle Scholar
- Tonegawa S: Somatic generation of antibody diversity. Nature. 1983, 302 (5909): 575-581. 10.1038/302575a0.View ArticlePubMedGoogle Scholar
- Tonegawa S: Proceedings: Determination of the number of antibody structural genes by DNA-RNA hybridization. Hoppe Seylers Z Physiol Chem. 1976, 357 (5): 617-PubMedGoogle Scholar
- Rumfelt LL, Lohr RL, Dooley H, Flajnik MF: Diversity and repertoire of IgW and IgM VH families in the newborn nurse shark. BMC Immunol. 2004, 5 (1): 8-10.1186/1471-2172-5-8.View ArticlePubMedPubMed CentralGoogle Scholar
- Ota T, Rast JP, Litman GW, Amemiya CT: Lineage-restricted retention of a primitive immunoglobulin heavy chain isotype within the Dipnoi reveals an evolutionary paradox. Proc Natl Acad Sci USA. 2003, 100 (5): 2501-2506. 10.1073/pnas.0538029100.View ArticlePubMedPubMed CentralGoogle Scholar
- Anderson MK, Strong SJ, Litman RT, Luer CA, Amemiya CT, Rast JP, Litman GW: A long form of the skate IgX gene exhibits a striking resemblance to the new shark IgW and IgNARC genes. Immunogenetics. 1999, 49 (1): 56-67. 10.1007/s002510050463.View ArticlePubMedGoogle Scholar
- Edholm ES, Bengte'n E, Stafford JL, Sahoo M, Taylor EB, Miller NW, Wilson M: Identification of two IgD+ B cell populations in channel catfish, Ictalurus punctatus. J Immunol. 2010, 185 (7): 4082-4094. 10.4049/jimmunol.1000631.View ArticlePubMedGoogle Scholar
- Bengte'n E, Quiniou S, Hikima J, Waldbieser G, Warr GW, Miller NW, Wilson M: Structure of the catfish IGH locus: analysis of the region including the single functional IGHM gene. Immunogenetics. 2006, 58 (10): 83-1-844Google Scholar
- Hordvik I: Identification of a novel immunoglobulin delta transcript and comparative analysis of the genes encoding IgD in Atlantic salmon and Atlantic halibut. Mol Immunol. 2002, 39 (1-2): 85-91. 10.1016/S0161-5890(02)00043-3.View ArticlePubMedGoogle Scholar
- Danilova N, Bussmann J, Jekosch K, Steiner LA: The immunoglobulin heavy-chain locus in zebrafish: identification and expression of a previously unknown isotype, immunoglobulin Z. Nat Immunol. 2005, 6 (3): 295-302. 10.1038/ni1166.View ArticlePubMedGoogle Scholar
- Gambón-Deza F, Sánchez-Espinel C, Magadán-Mompó S: Presence of an unique IgT on the IGH locus in three-spined stickleback fish (Gasterosteus aculeatus) and the very recent generation of a repertoire of VH genes. Dev Comp Immunol. 2010, 34 (2): 1-14-122View ArticleGoogle Scholar
- Hansen JD, Landis ED, Phillips RB: Discovery of a unique Ig heavy-chain isotype (IgT) in rainbow trout: Implications for a distinctive B cell developmental pathway in teleost fish. Proc Natl Acad Sci USA. 2005, 102 (19): 6919-6924. 10.1073/pnas.0500027102.View ArticlePubMedPubMed CentralGoogle Scholar
- Savan R, Aman A, Nakao M, Watanuki H, Sakai M: Discovery of a novel immunoglobulin heavy chain gene chimera from common carp (Cyprinus carpio L.). Immunogenetics. 2005, 57 (6): 458-463. 10.1007/s00251-005-0015-z.View ArticlePubMedGoogle Scholar
- Taniguchi Y, Takeda S, Furutani-Seiki M, Kamei Y, Todo T, Sasado T, Deguchi T, Kondoh H, Mudde J, Yamazoe M, Hidaka M, Mitani H, Toyoda A, Sakaki Y, Plasterk RH, Cuppen E: Generation of medaka gene knockout models by target-selected mutagenesis. Genome Biol. 2006, 7 (12): R1-16.View ArticleGoogle Scholar
- Ozato K, Wakamatsu Y: Developmental Genetics of Medaka. Development Growth and Differentiation. 1994, 36 (5): 437-443. 10.1111/j.1440-169X.1994.00437.x.View ArticleGoogle Scholar
- Krumsiek J, Arnold R, Rattei T: Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics. 2007, 23 (8): 1026-1028. 10.1093/bioinformatics/btm039.View ArticlePubMedGoogle Scholar
- Blankenberg D, Von K, Coraor N, Ananda G, Lazarus R, Mangan M, Nekrutenko A, Taylor J: Galaxy: a web-based genome analysis tool for experimentalists. Curr Protoc Mol Biol. 2010Google Scholar
- Goecks J, Nekrutenko A, Taylor J: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010, 11 (8):
- Milne I, Bayer M, Cardle L, Shaw P, Stephen G, Wright F, Marshall D: Tablet--next generation sequence assembly visualization. Bioinformatics. 2010, 26 (3): 401-402. 10.1093/bioinformatics/btp666.View ArticlePubMedPubMed CentralGoogle Scholar
- Stanke M, Steinkamp R, Waack S, Morgenstern B: AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res. 2004, 32 (Web Server issue): W309-W3 12.View ArticlePubMedPubMed CentralGoogle Scholar
- Jung D, Giallourakis C, Mostoslavsky R, Alt FW: Mechanism and control of V(D)J recombination at the immunoglobulin heavy chain locus. Annu Rev Immunol. 2006, 24: 541-570. 10.1146/annurev.immunol.23.021704.115830.View ArticlePubMedGoogle Scholar
- Lefranc MP, Pommié C, Kaas Q, Duprat E, Bosc N, Guiraudou D, Jean C, Ruiz M, Da Piédade I, Rouard M, Foulquier E, Thouvenin V, Lefranc G: IMGT unique numbering for immunoglobulin and T cell receptor constant domains and Ig superfamily C-like domains. Dev Comp Immunol. 2005, 29 (3): 185-203. 10.1016/j.dci.2004.07.003.View ArticlePubMedGoogle Scholar
- Kumar S, Nei M, Dudley J, Tamura K: MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief Bioinform. 2008, 9 (4): 299-306. 10.1093/bib/bbn017.View ArticlePubMedPubMed CentralGoogle Scholar
- Ross DA, Wilson MR, Miller NW, Clem LW, Warr GW, Miller NW, Warr GW: Evolutionary variation of immunoglobulin heavy chain RNA processing pathways origins effects and implications. Immunol Rev. 1998, 166 (1): 143-151. 10.1111/j.1600-065X.1998.tb01259.x.View ArticlePubMedGoogle Scholar
- Kokubu F, Hinds K, Litman R, Shamblott MJ, Litman GW: Complete structure and organization of immunoglobulin heavy chain constant region genes in a phylogenetically primitive vertebrate. EMBO J. 1988, 7 (7): 1979-1988.PubMedPubMed CentralGoogle Scholar
- Zhao Y, Pan-Hammarström Q, Yu S, Wertz N, Zhang X, Li N, Butler JE, Hammarström L: Identification of IgF, a hinge-region-containing Ig class, and IgD in Xenopus tropicalis. Proc Natl Acad Sci USA. 2006, 103 (32): 12087-12092. 10.1073/pnas.0600291103.View ArticlePubMedPubMed CentralGoogle Scholar
- Ohta Y, Flajnik M: IgD, like IgM, is a primordial immunoglobulin class perpetuated in most jawed vertebrates. Proc Natl Acad Sci USA. 2006, 103 (28): 10723-10728. 10.1073/pnas.0601407103.View ArticlePubMedPubMed CentralGoogle Scholar
- Clem L, Miller N, Warr G, Wilson M, Bengte'n E: Channel catfish immunoglobulins: repertoire and expression. Dev Comp Immunol. 2006, 30: 77-92. 10.1016/j.dci.2005.06.016.View ArticlePubMedGoogle Scholar
- Danilova N, Amemiya CT: Going adaptive: the saga of antibodies. Ann N Y Acad Sci. 2009, 1168: 130-155. 10.1111/j.1749-6632.2009.04881.x.View ArticlePubMedGoogle Scholar
- Wilson MR, Ross DA, Miller NW, Clem LW, Middleton DL, Warrt GW: Alternate pre-mRNA processing pathways in the production of membrane IgM heavy chains in holostean fish. Dev Comp Immunol. 1995, 19 (2): 165-177. 10.1016/0145-305X(94)00064-M.View ArticlePubMedGoogle Scholar
- Lundqvist M, Strömberg S, Bouchenot C, Pilström L, Boudinot P: Diverse splicing pathways of the membrane IgHM pre-mRNA in a Chondrostean, the Siberian sturgeon. Dev Comp Immunol. 2009, 33 (4): 507-515. 10.1016/j.dci.2008.10.009.View ArticlePubMedGoogle Scholar
- Coscia MR, Varriale S, Santi CD, Giacomelli S, Oreste U: Evolution of the Antarctic teleost immunoglobulin heavy chain gene. Mol Phylogenet Evol. 2010, 55 (1): 226-233. 10.1016/j.ympev.2009.09.033.View ArticlePubMedGoogle Scholar
- Hordvik I, Thevarajan J, Samdal I, Bastani N, Krossøy B: Molecular cloning and phylogenetic analysis of the Atlantic salmon immunoglobulin D gene. Scand J Immunol. 1999, 50 (2): 202-210. 10.1046/j.1365-3083.1999.00583.x.View ArticlePubMedGoogle Scholar
- Savan R, Aman A, Sato K, Yamaguchi R, Sakai M: Discovery of a new class of immunoglobulin heavy chain from fugu. Eur J Immunol. 2005, 35 (11): 3320-3331. 10.1002/eji.200535248.View ArticlePubMedGoogle Scholar
- Srisapoome P, Ohira T, Hirono I, Aoki T: Genes of the constant regions of functional immunoglobulin heavy chain of Japanese flounder, Paralichthys olivaceus. Immunogenetics. 56 (4): 292-300.
- Xiao F, Wang Y, Yan W, Chang M, Yao W, Xu Q, Wang X, Gao Q, Nie P: Ig heavy chain genes and their locus in grass carp Ctenopharyngodon idella. Fish Shellfish Immunol. 2010, 29 (4): 594-599. 10.1016/j.fsi.2010.06.004.View ArticlePubMedGoogle Scholar
- Edholm ES, Wilson M, Bengte'n E: Immunoglobulin light (IgL) chains in ectothermic vertebrates. Dev Comp Immunol. 2011, in pressGoogle Scholar
- Bao Y, Wang T, Guo Y, Zhao Z, Li N, Zhao Y: The immunoglobulin gene loci in the teleost Gasterosteus aculeatus. Fish Shellfish Immunol. 2010, 28 (1): 40-48. 10.1016/j.fsi.2009.09.014.View ArticlePubMedGoogle Scholar
- Yasuike M, Boer JD, Schalburg KRV, Cooper GA, McKinnel L, Messmer A, So S, Davidson WS, Koop BF: Evolution of duplicated IgH loci in Atlantic salmon, Salmo salar. BMC Genomics. 2010, 11: 486-View ArticlePubMedPubMed CentralGoogle Scholar
- Danilova N: Analysis of recombination signal sequences in zebrafish. Mol Immunol. 2005, 42 (10): 1243-1249. 10.1016/j.molimm.2004.11.022.View ArticlePubMedGoogle Scholar
- Christoffels A, Koh EGL, Chia JM, Brenner S, Aparicio S, Venkatesh B: Fugu genome analysis provides evidence for a whole-genome duplication early during the evolution of ray-finned fishes. Mol Biol Evol. 2004, 21 (6): 1146-1151. 10.1093/molbev/msh114.View ArticlePubMedGoogle Scholar
- Brunet FG: Gene loss and evolutionary rates following whole-genome duplication in teleost fishes. Mol Biol Evol. 2006, 23 (9): 1808-1816. 10.1093/molbev/msl049.View ArticlePubMedGoogle Scholar
- Bergman Y, Cedar H: A stepwise epigenetic process controls immunoglobulin allelic exclusion. Nature Reviews Immunology. 2004, 4 (10): 753-761.View ArticlePubMedGoogle Scholar
- Eason DD, Litman RT, Luer CA, Kerr W, Litman GW: Expression of individual immunoglobulin genes occurs in an unusual system consisting of multiple independent loci. Eur J Immunol. 2004, 34 (9): 255-1-2558View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.