We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact us so we can address the problem.
Post-duplication charge evolution of phosphoglucose isomerases in teleost fishes through weak selection on many amino acid sites
© Sato and Nishida; licensee BioMed Central Ltd. 2007
Received: 28 June 2007
Accepted: 29 October 2007
Published: 29 October 2007
The partitioning of ancestral functions among duplicated genes by neutral evolution, or subfunctionalization, has been considered the primary process for the evolution of novel proteins (neofunctionalization). Nonetheless, how a subfunctionalized protein can evolve into a more adaptive protein is poorly understood, mainly due to the limitations of current analytical methods, which can detect only strong selection for amino acid substitutions involved in adaptive molecular evolution. In this study, we employed a comparative evolutionary approach to this question, focusing on differences in the structural properties of a protein, specifically the electric charge, encoded by fish-specific duplicated phosphoglucose isomerase (Pgi) genes.
Full-length cDNA cloning, RT-PCR based gene expression analyses, and comparative sequence analyses showed that after subfunctionalization with respect to the expression organ of duplicate Pgi genes, the net electric charge of the PGI-1 protein expressed mainly in internal tissues became more negative, and that of PGI-2 expressed mainly in muscular tissues became more positive. The difference in net protein charge was attributable not to specific amino acid sites but to the sum of various amino acid sites located on the surface of the PGI molecule.
This finding suggests that the surface charge evolution of PGI proteins was not driven by strong selection on individual amino acid sites leading to permanent fixation of a particular residue, but rather was driven by weak selection on a large number of amino acid sites and consequently by steady directional and/or purifying selection on the overall structural properties of the protein, which is derived from many modifiable sites. The mode of molecular evolution presented here may be relevant to various cases of adaptive modification in proteins, such as hydrophobic properties, molecular size, and electric charge.
Proteins that arise through gene duplication can become novel proteins through fixation of beneficial mutations , but because beneficial mutations are generally rare, the partitioning of ancestral functions among duplicated genes by neutral evolution, or subfunctionalization, has been considered the primary process for the evolution of novel proteins [2–5]. To date, many duplicate genes have been demonstrated to evolve following this model of subfunctionalization, and this model thus has become widely accepted in the context of duplicated gene evolution [6–10].
Nonetheless, how a more adaptive or specialized protein property evolves after subfunctionalization is poorly understood, mainly due to the limited resolution power of current analytical methods, which seek to detect positive selection on individual amino acid substitutions involved in adaptive molecular evolution. Such methods can recognize substitutions expected to be driven by strong selection, many of which are usually function-altering substitutions at important amino acid sites, such as enzyme active sites or viral epitopes [e.g., ]. However, adaptive substitutions by relatively moderate or weak selection may not be recognized by these methods. Therefore, to detect adaptive protein evolution under a much wider range of selection pressure, a novel approach is required. In this study, we utilized a comparative evolutionary approach to this problem, focusing on differences in the high-dimensional properties of a protein, specifically the electric charge, encoded by a pair of duplicated genes.
As a model protein system for this study, we chose an important enzyme involved in glycolysis and gluconeogenesis, phosphoglucose isomerase (PGI; EC 188.8.131.52). The gene encoding this enzyme (Pgi) is present as a single copy in tetrapods, whereas two copies exist in most groups of ray-finned fishes [12–14]. The fact that these duplicated Pgi-1 and Pgi-2 genes in fishes are expressed in different organs [12, 15] implies that these fish-specific duplicate Pgi genes are subfunctionalized with respect to their expression, and are thus good candidates for the model of subfunctionalized genes. For ray-finned fishes, a reliable phylogenetic framework, which is essential for comparative evolutionary analyses, is available due to recent progress in molecular phylogenetic studies [16–21]. In addition, the basal lineages of ray-finned fishes, including Semionotiformes (gar fish) and Amiiformes (amia), have only one Pgi locus . This single-copy gene may be the direct descendant of the ancestral unduplicated Pgi in ray-finned fishes, and thus can be considered an appropriate outgroup gene for comparison between the duplicated Pgi genes.
The Pgi-1 and Pgi-2 genes, which were expected to be subfunctionalized in their expression, also differ in the net electric charge of their encoded proteins [12, 13, 15]. The electric charge of soluble proteins such as PGIs is a structural property brought by a large number of multiple amino acid residues and is involved in the adaptive evolution of several soluble proteins [e.g., [22–24]], such as an acquisition of protein thermostability [25–27]. Therefore, the evolution of electric charge in the duplicated PGI proteins is an interesting subject to investigate regarding the evolution of novel protein properties after subfunctionalization.
In this study, we examined first whether the spatial expression patterns of duplicated Pgi genes in ray-finned fishes are compatible with predictions based on the subfunctionalization model of duplicate gene evolution. Next, by focusing on the electric charges of the PGI proteins, we analyzed the underlying evolutionary process producing novel protein properties after gene duplication through ancestral sequence inference using the maximum likelihood (ML) method based on a reliable phylogenetic framework of ray-finned fishes, and also using three-dimensional (3-D) structural information on the protein.
Duplication and subfunctionalization of the Pgi genes in teleost fishes
A reverse transcriptase-polymerase chain reaction (RT-PCR)-based expression analysis showed that the Pgi gene in non-teleost ray-finned fishes was expressed in all tissues examined (Figure 1B), confirming that this gene is the direct descendant of the ancestral unduplicated Pgi with no tissue specificity, as in tetrapods [12, 13, 15]. In contrast, the teleost Pgi-1 gene was expressed mainly in internal organs, including the liver, heart, gill, brain, and kidney, and weakly in the muscle, whereas Pgi-2 was expressed mainly in the heart and muscle. The differential expression patterns of Pgi-1 and Pgi-2 support the concept of subfunctionalization [2, 3], which is the complementary loss of subsets in the expression organs of the ancestral gene. Thus, the Pgi gene family in ray-finned fishes is an appropriate model for studying molecular evolution after subfunctionalization.
Evolution of the electric charges of duplicated PGI proteins in teleost fishes
Biochemical parameters of vertebrate PGI proteins.
No. of hydrophilic charged residues
No. of amino acids
Isoelectric point (pI)
Positively charged (Arg+Lys)
Negatively charged (Asp+Glu)
Teleost fish PGI-1
Teleost fish PGI-2
Non-teleost fish PGI
Jawless fish PGI
The underlying process of the electric charge evolution of the PGI proteins can be inferred by ML sequence reconstructions  based on the recent ray-finned fish phylogeny . We applied this approach and show the results in Figure 2A. Our results suggest that after gene duplication, pI values in the PGI-1 clade gradually decreased, whereas those in the PGI-2 clade increased. Next, we assigned charge-changing substitutions (between Lys/Arg or Asp/Glu and other residues) to the tree branches based on pairwise sequence comparisons among the inferred ancestral sequences or between the extant and ancestral sequences along the tree topology. This result of assignment (Figure 2C) showed that the charge-changing substitutions were inferred to have occurred in excess (5–28) of that expected (3–5) for parsimonious evolution in electric charge differences between PGI-1 and PGI-2 (Figure 2C; see also Table 1). Figure 2C also shows that the charge-changing substitutions have occurred in both directions (either upward or downward) on most branches at various amino acid sites (76 sites shown in Figure 2B) [see Additional file 1: Table S1]. An analysis using parsimony yielded similar results [see Additional file 1: Fig. S3].
Statistical analyses of the spatial clustering of inferred amino acid substitutions
Analytically inferred and theoretically predicted numbers of charge-changing and charge-neutral substitutions.
Maximum likelihood-inferred numbers
The results of phylogenetic analysis, RT-PCR-based expression analysis, and sequence comparison of Pgi genes in teleost fishes suggest that after subfunctionalization of the duplicated Pgi genes in the ancestor of teleost fishes, the electric charges of the PGI-1 and PGI-2 proteins diverged. This evolution can be interpreted according to the sub-neofunctionalization model of gene evolution [2–5], which proposes that the partitioning of function between the duplicated genes alters the selective environment at each locus, resulting in structural fine-tuning or adaptation of the encoded proteins by positive selection. That is, the divergent evolution of the electric charges in the duplicated PGI isoforms is the consequence of specialization for the specific function (glycolysis or gluconeogenesis) or distinct cellular environment of tissues where each isoform is predominantly expressed (see Figure 1B), as suggested for other water-soluble proteins [22–24].
The present comparative evolutionary analysis implies that since the gene duplication event, the electric charges of the two PGI isoforms changed steadily through many charge-changing substitutions in both directions of charge change; only a few charged amino acid sites were specific to PGI-1 or PGI-2 (Figure 2). Such charge-changing substitutions concentrated at the surface of PGI molecule (Figure 3) were inferred to have occurred much more frequently than expected in the parsimonious evolution of electric charge difference between the two isoforms (see Figure 2C and Table 1). From these observations, two possible scenarios are proposed for the evolution of protein charge in duplicated PGI isoforms: protein charges in PGI-1 decreased gradually, while those in PGI-2 increased; alternatively, charge divergence between PGI-1 and PGI-2 was completed soon after the duplication and before the radiation of teleosts, followed by maintenance of the protein charges under purifying selection, while stochastic charge-changing substitutions by drift occurred among lineages. In either scenario, we can conclude that the surface charge evolution of PGI proteins was not driven by strong selection on individual amino acid sites leading to permanent fixation of a particular residue, but rather was driven by weak selection on a large number of amino acid sites and consequently by steady directional or purifying selection on the overall structural properties of the protein, which is derived from many modifiable sites. This mode of molecular evolution agrees with the understanding that most proteins are substantially tolerant of a broad spectrum of substitutions and thus may harbor many amino acid sites available for evolutionary modification . Our study provides the first plausible evidence of adaptive protein evolution through such selection.
The mode of molecular evolution proposed in this study would be difficult to find using existing methods that detect strong selection for particular substitutions. We applied such an analysis to identify positively selected sites after Pgi gene duplication using DIVERGE version 1.04 ; however, the results were not clear (data not shown). Further analysis using the program CODEML  did not detect the acceleration of the rate of nonsynonymous substitution, not showing selection for amino acid changes (estimated ω was 0.01 to 0.23). Even if, in general, a significant excess of amino acid change is detected, such methodology itself cannot rule out possible confounding effects, or alternative interpretations, particularly the relax of purifying selection . In a previous study, an analysis using the program HonNew  also failed to detect selection for charge-changing substitutions in teleost PGIs, leading to the conclusion that the charge change in the duplicate PGIs of teleosts may be selectively neutral . The mode of molecular evolution presented here, in which diverse evolutionary resolutions exist at the level of a primary sequence that corresponds to a certain selective pressure on a protein property, may be relevant to various cases of adaptive modification in proteins, such as hydrophobic properties, molecular size, and electric charge. This may be an important pathway underlying physiological adaptation, along with protein evolution by simple amino-acid changes, gene deletion or silencing, and possibly cis-regulatory changes [39, 41].
In this paper, we provide the evidence that relatively weak selection on a large number of amino acid sites drives the evolution of novel charge-state of duplicated phosphoglucose isomerases, which are subfunctionalized in teleost fishes. Such mode of adaptive molecular evolution, which was hardly recognizable by existing analytical methods aiming to detect strong selection on individual amino acid changes, may play a substantial role in the evolution of novel proteins.
Our data set contains representatives from divergent lineages of ray-finned fishes, as follows – basal non-teleost ray-finned fishes: Polypterus ornatipinnis (bichir), Acipenser ruthenus (sturgeon), Amia calva (amia), and Lepisosteus osseus (gar); teleosts: Osteoglossum bicirrhosum (arowana) and Anguilla anguilla (eel) from basal groups, Plecoglossus altivelis (smelt) and Danio rerio (zebrafish) from intermediate groups, and Mugil cephalus (mullet) and Fugu rubripes (fugu) from derived groups. Live specimens, which were obtained either from local shops or other investigators in Japan, were treated according to the ethical recommendations of the Ichthyological Society of Japan and the University of Tokyo.
Cloning and sequencing
Total RNA was extracted from fresh skeletal muscle and liver tissue using TRIzol reagent (Invitrogen) and reverse-transcribed into first-strand cDNA with oligo-dT adaptor primer using an RNA PCR kit (TaKaRa). Partial Pgi cDNA was amplified using PCR with vertebrate universal degenerate primers . The well amplified DNA fragments were purified using a MinElute gel extraction kit (Qiagen), ligated into the pGEM-T Easy Vector system (Promega), transmitted into competent E. coli (Competent High DH5a, Toyobo), and sequenced with an ABI PRISM 3100 (Applied Biosystems) using T7 or SP6 primers. The partial Pgi sequences were used to design gene-specific primers (GSPs) for RACE PCR [see Additional file 1: Table S2]; 3' RACE PCR was conducted with the sense GSP and M13 primer M4 (TaKaRa) and the first-strand cDNA as the template. For 5' RACE, double-stranded cDNA PCR libraries were synthesized from 1 μg of total RNA using the cDNA synthesis kit (M-MLV version; TaKaRa) combined with the cDNA PCR library kit (TaKaRa). Then, 5' RACE PCR was conducted with the antisense GSP and CA primer (TaKaRa). Subcloning and sequencing were performed as above.
The Pgi genes from 20 vertebrates were phylogenetically analyzed with the Bayesian and ML methods using the programs MrBayes 3.0B4  and PAUP 4.0b10 , respectively. The species used [GenBank accession numbers or Ensembl Transcript IDs of the Pgi gene(s)] were as follows: bichir (AB282684*), sturgeon (AB282688*), amia (AB282681*), gar (AB282687*), arowana (Pgi-1: AB282682* and Pgi-2: AB282683*), eel (Pgi-1: AB282685* and Pgi-2: AB282686*), smelt (Pgi-1: AB282690* and Pgi-2: AB282691*), zebrafish (Pgi-1: AJ306395 and Pgi-2: AJ306396), mullet (Pgi-1: AJ306392 and Pgi-2: AJ306393), fugu (Pgi-1: NEWSINFRUT00000145974 and AB282689*, and Pgi-2: NEWSINFRUT00000159975), Homo sapiens (human; K03515), Sus scrofa (pig; X07382), Oryctolagus cuniculus (rabbit; AF199601), Cricetulus griseus (hamster; Z37977), Mus musculus (mouse; M1422), Rattus norvegicus (rat; ENSRNOT00000032613), Gallus gallus (chicken; ENSGALT00000007948), Boiga kraepelini (snake; AJ306394), Bufo melanostictus (toad; AJ306397), and Paramyxine yangi (hagfish; AJ306391). Newly cloned sequences in this study (marked with asterisks) were named under the denomination of PGI isozymes . Bayesian and ML trees were constructed under the GTR + I + Γ model , which was selected as the best-fitting model of nucleotide substitution by hierarchical likelihood ratio tests (hLRTs) [45, 46] with 1100 base pairs (bp) of the Pgi coding region (excluding the third codon position) [see Additional file 1: Fig. S4]. The Bayesian posterior probabilities of the phylogeny and its branches were determined from 9901 trees. Support for heuristic ML analysis was assessed using 100 bootstrap replications.
The genomic regions around the Pgi locus (or loci) in the human, chicken, and zebrafish genomes were investigated and compared. Genomic data from the pufferfishes Fugu rubripes and Tetraodon nigroviridis were not useful in this analysis because the locations of their Pgi loci were not determined. Data on the neighborhood of the Pgi locus in the human and chicken genomes were obtained from the NCBI Mapviewer Web site . Twenty-seven protein-coding genes were identified around the human PGI locus, within a 1.8-Mb-long region on chromosome 19. The nucleotide sequences of these human genes were subjected to BLASTN searches against the zebrafish genome sequences using the Ensembl BLASTN search service . The matches detected with an E-value threshold of <10-3 were checked visually. Then, we selected identifiable genes described as putative orthologs of the queries. Their genomic location data were used to rebuild the synteny maps around the zebrafish Pgi loci.
Gene expression analysis
RT-PCR was performed for expression analysis of the Pgi genes. The primers used are described in [see Additional file 1: Table S3]. They were designed as follows: to distinguish the duplicate Pgi loci in teleosts, the 3' region of one primer from each primer pair was made to locate the differential nucleotide site between the two loci of the species concerned, and to avoid false amplification from genomic DNA contaminants, each primer pair was designed to span a Pgi exon/intron boundary considered conservative among vertebrates. Total RNA was extracted from liver, skeletal muscle, heart, gill filament, brain, and kidney (or gonad) tissues of fresh fish samples. RNA extraction, reverse-transcription into first-strand cDNA and PCR were performed in the same manner as mentioned in the Cloning and sequencing section. The thermal-cycle profile was as follows: 1 cycle at 94°C for 2 min; 30 cycles at 94°C for 30 sec, 60°C for 30 sec, and 72°C for 30 sec; followed by 1 cycle at 72°C for 7 min. As a positive control for gene expression, β-actin cDNA was amplified using the primers 5'-GACATGGAGAAGATCTGGCA-3' and 5'-TGATCCACATCTGCTGGAAGGT-3' (predicted product size = 834 bp), which were designed by Dr. Kaoru Kuriiwa of the National Museum of Nature and Science, Tokyo. These primer sequences were based on a highly conserved region of the β-actin gene in mangrove killifish, Rivulus marmoratus (GenBank accession number AF168615). The amplified DNA fragments were separated on a 2.0% L03 agarose gel (TaKaRa), stained with ethidium bromide, and visualized under UV light. GeneRuler™ 100 bp DNA Ladder Plus (MBI Fermentas) was used as a size marker for electrophoresis.
Charge evolution analysis
The ML inference of the ancestral sequences of Pgi genes was performed by BASEML  based on the phylogeny of ray-finned fishes using whole mitochondrial genome data . Tetrapods were excluded from this analysis because of their absence in this tree. Nucleotide sequence alignments of the coding region of Pgi cDNAs (1650 bp, without ambiguous regions) from 10 ray-finned fishes plus hagfish were used. The GTR + Γ  model was selected as the best fitting model by the hLRTs [see Additional file 1: Table S4]. The average overall accuracy of the reconstructed sequences (#1–#15) [see Additional file 1: Appendix] was 0.948 ± 0.003 SE. The pI values were estimated from the deduced amino acid sequences using the ProtParam tool . The solvent-accessible surface area (SASA) of each amino acid residue was estimated with GETAREA 1.1  for the dimeric PGI protein structure using a solvent radius of 1.4 Å (approximately the size of a water molecule). Rabbit PGI [PDB: 1XTB]  was used as a reference structure. The structural portion of the PGI composed of amino acid residues with more than 20 Å 2 SASA was considered "molecular surface." This boundary mostly agrees with other criteria based on the ratio of side-chain surface area to random coil value per residue . A three-dimensional graphical model of the PGI molecule was constructed using RasMol .
Calculation of the expected spatial distribution of amino acid substitutions
where f i is the normalized frequency and μ i is the relative mutability of each amino acid. The parameter f i was estimated separately for the surface and interior portions of the inferred common ancestral protein of PGI-1 and PGI-2 (node #5 in Figure 2C) to consider differential amino acid composition in different parts of the protein [see Additional file 1: Table S5]. Based on the resultant P ij , we estimated the theoretical ratio of the charge-changing substitutions to charge-neutral substitutions (ΣPcharge-changing: ΣPcharge-neutral) of the surface (r1:r2) and interior (r3:r4) portions of the PGI protein molecule under the assumption of random mutation.
According to the null hypothesis that all pairs of amino acid substitutions occur regardless of their spatial locations, the amino acid substitution events would be spatially distributed into the surface and interior portions of the PGI protein along the ratio of the numbers of amino acid substitution sites at the surface (132 sites) to the interior (80 sites) of the PGI protein since their gene duplication. Accounting for the spatial-differential amino acid composition as described above, the expected spatial distribution of amino acid substitutions shown in Table 2 was estimated based on the ratio of charge-changing substitutions in the molecular surface:charge-neutral substitutions in the molecular surface:charge-changing substitutions in the molecular interior:charge-neutral substitutions in the molecular interior = 132r1:132r2:80r3:80r4.
We thank H. Takeshima for the data and samples of smelt (ayu-fish), and J.G. Inoue, Y. Yamanoue, T.P. Satoh, K. Kuriiwa, Y. Hashiguchi, and lab members for aid and information. We also thank T. Miyadai of Fukui Prefectural University for a fugu specimen; A. Okamura of IRAGO Institute Co., Ltd., for a European eel specimen; and A. Murase, Y. Miyazaki, and Y. Tazaki of Tokyo University of Marine Science and Technology for a mullet specimen. This study was partially supported by grants-in-aid from the Japan Society for the Promotion of Science to M.N.
- Ohno S: Evolution by gene duplication. 1970, New York: Springer-VerlagView ArticleGoogle Scholar
- Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: Preservation of duplicate genes by complementary, degenerative mutations. Genetics. 1999, 151: 1531-1545.PubMed CentralPubMedGoogle Scholar
- Lynch M, Force A: The probability of duplicate gene preservation by subfunctionalization. Genetics. 2000, 154: 459-473.PubMed CentralPubMedGoogle Scholar
- He X, Zhang J: Rapid subfunctionalization accompanied by prolonged and substantial neofunctionalization in duplicate gene evolution. Genetics. 2005, 169: 1157-1164.PubMed CentralView ArticlePubMedGoogle Scholar
- Rastogi S, Liberles DA: Subfunctionalization of duplicated genes as a transition state to neofunctionalization. BMC Evol Biol. 2005, 5: 28-PubMed CentralView ArticlePubMedGoogle Scholar
- Prince VE, Pickett FB: Splitting pairs: the diverging fates of duplicated genes. Nat Rev Genet. 2002, 3: 827-837.View ArticlePubMedGoogle Scholar
- Zhang J: Evolution by gene duplication: an update. Trends Ecol Evol. 2003, 18: 292-298.View ArticleGoogle Scholar
- Yu WP, Brenner S, Venkatesh B: Duplication, degeneration and subfunctionalization of the nested synapsin-Timp genes in Fugu. Trends Genet. 2003, 19: 180-183.View ArticlePubMedGoogle Scholar
- de Souza FS, Bumaschny VF, Low MJ, Rubinstein M: Subfunctionalization of expression and peptide domains following the ancient duplication of the proopiomelanocortin gene in teleost fishes. Mol Biol Evol. 2005, 22: 2417-2427.View ArticlePubMedGoogle Scholar
- Tocchini-Valentini GD, Fruscoloni P, Tocchini-Valentini GP: Structure, function, and evolution of the tRNA endonucleases of Archaea: an example of subfunctionalization. Proc Natl Acad Sci USA. 2005, 102: 8933-8938.PubMed CentralView ArticlePubMedGoogle Scholar
- Bielawski JP, Yang Z: Maximum likelihood methods for detecting adaptive protein evolution. Statistical Methods in Molecular Evolution. Edited by: Nielsen R. 2005, New York: Springer-Verlag, 103-124.View ArticleGoogle Scholar
- Avise JC, Kitto GB: Phosphoglucose isomerase gene duplication in the bony fishes: an evolutionary history. Biochem Genet. 1973, 8: 113-132.View ArticlePubMedGoogle Scholar
- Kao HW, Lee SC: Phosphoglucose isomerases of hagfish, zebrafish, gray mullet, toad, and snake, with reference to the evolution of the genes in vertebrates. Mol Biol Evol. 2002, 19: 367-374.View ArticlePubMedGoogle Scholar
- Steinke D, Hoegg S, Brinkmann H, Meyer A: Three rounds (1R/2R/3R) of genome duplications and the evolution of the glycolytic pathway in vertebrates. BMC Biol. 2006, 4: 16-PubMed CentralView ArticlePubMedGoogle Scholar
- Dando PR: Duplication of the glucosephosphate isomerase locus in vertebrates. Biochem Physiol. 1980, 66: 373-378.Google Scholar
- Venkatesh B, Ning Y, Brenner S: Late changes in spliceosomal introns define clades in vertebrate evolution. Proc Natl Acad Sci USA. 1999, 96: 10267-10271.PubMed CentralView ArticlePubMedGoogle Scholar
- Venkatesh B, Erdmann MV, Brenner S: Molecular synapomorphies resolve evolutionary relationships of extant jawed vertebrates. Proc Natl Acad Sci USA. 2001, 98: 11382-1387.PubMed CentralView ArticlePubMedGoogle Scholar
- Inoue JG, Miya M, Tsukamoto K, Nishida M: Basal actinopterygian relationships: a mitogenomic perspective on the phylogeny of the "ancient fish". Mol Phylogenet Evol. 2003, 26: 110-120.View ArticlePubMedGoogle Scholar
- Miya M, Takeshima H, Endo H, Ishiguro NB, Inoue JG, Mukai T, Satoh TP, Yamaguchi M, Kawaguchi A, Mabuchi K, Shirai SM, Nishida M: Major patterns of higher teleostean phylogenies: a new perspective based on 100 complete mitochondrial DNA sequences. Mol Phylogenet Evol. 2003, 26: 121-138.View ArticlePubMedGoogle Scholar
- Kikugawa K, Katoh K, Kuraku S, Sakurai H, Ishida O, Iwabe N, Miyata T: Basal jawed vertebrate phylogeny inferred from multiple nuclear DNA-coded genes. BMC Biol. 2004, 2: 3-PubMed CentralView ArticlePubMedGoogle Scholar
- Lavoué S, Miya M, Inoue JG, Saitoh K, Ishiguro NB, Nishida M: Molecular systematics of the gonorynchiform fishes (Teleostei) based on whole mitogenome sequences: implications for higher-level relationships within the Otocephala. Mol Phylogenet Evol. 2005, 37: 165-177.View ArticlePubMedGoogle Scholar
- Frolow F, Harel M, Sussman JL, Mevarech M, Shoham M: Insights into protein adaptation to a saturated salt environment from the crystal structure of a halophilic 2Fe-2S ferredoxin. Nature Struct Biol. 1996, 3: 452-458.View ArticlePubMedGoogle Scholar
- Merritt TJS, Quattro JM: Evidence for a period of directional selection following gene duplication in a neurally expressed locus of triosephosphate isomerase. Genetics. 2001, 159: 689-697.PubMed CentralPubMedGoogle Scholar
- Zhang J, Zhang YP, Rosenberg HF: Adaptive evolution of a duplicated pancreatic ribonuclease gene in a leaf-eating monkey. Nat Genet. 2002, 30: 411-415.View ArticlePubMedGoogle Scholar
- Alsop E, Silver M, Livesay DR: Optimized electrostatic surfaces parallel increased thermostability: a structural bioinformatic analysis. Protein Eng. 2003, 16: 871-874.View ArticlePubMedGoogle Scholar
- Yano JK, Poulos TL: New understandings of thermostable and peizostable enzymes. Curr Opin Biotechnol. 2003, 14: 360-365.View ArticlePubMedGoogle Scholar
- Robinson-Rechavi M, Alibés A, Godzik A: Contribution of electrostatic interactions, compactness and quaternary structure to protein thermostability: lessons from structural genomics of Thermotoga maritima. J Mol Biol. 2006, 356: 547-557.View ArticlePubMedGoogle Scholar
- Amores A, Suzuki T, Yan YL, Pomeroy J, Singer A, Amemiya C, Postlethwait JH: Zebrafish hox clusters and vertebrate genome evolution. Science. 1998, 282: 1711-1714.View ArticlePubMedGoogle Scholar
- Taylor JS, Braasch I, Frickey T, Meyer A, Van de Peer Y: Genome duplication, a trait shared by 22000 species of ray-finned fish. Genome Res. 2003, 13: 382-390.PubMed CentralView ArticlePubMedGoogle Scholar
- Jaillon O, Aury JM, Brunet F, Petit JL, Stange-Thomann N, Mauceli E, Bouneau L, Fischer C, Ozouf-Costaz C, Bernot A, Nicaud S, Jaffe D, Fisher S, Lutfalla G, Dossat C, Segurens B, Dasilva C, Salanoubat M, Levy M, Boudet N, Castellano S, Anthouard V, Jubin C, Castelli V, Katinka M, Vacherie B, Biémont C, Skalli Z, Cattolico L, Poulain J, De Berardinis V, Cruaud C, Duprat S, Brottier P, Coutanceau JP, Gouzy J, Parra G, Lardier G, Chapple C, McKernan KJ, McEwan P, Bosak S, Kellis M, Volff JN, Guigó R, Zody MC, Mesirov J, Lindblad-Toh K, Birren B, Nusbaum C, Kahn D, Robinson-Rechavi M, Laudet V, Schachter V, Quétier F, Saurin W, Scarpelli C, Wincker P, Lander ES, Weissenbach J, Roest Crollius H: Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature. 2004, 431: 946-957.View ArticlePubMedGoogle Scholar
- Chiu CH, Dewar K, Wagner GP, Takahashi K, Ruddle F, Ledje C, Bartsch P, Scemama JL, Stellwag E, Fried C, Prohaska SJ, Stadler PF, Amemiya CT: Bichir hox A cluster sequence reveals surprising trends in ray-finned fish genomic evolution. Genome Res. 2004, 14: 11-17.PubMed CentralView ArticlePubMedGoogle Scholar
- Hoegg S, Brinkmann H, Taylor JS, Meyer A: Phylogenetic timing of the fish-specific genome duplication correlates with the diversification of teleost fish. J Mol Evol. 2004, 59: 190-203.View ArticlePubMedGoogle Scholar
- Lee JH, Jeffery CJ: The crystal structure of rabbit phosphoglucose isomerase complexed with D-sorbitol-6-phosphate, an analog of the open chain form of D-glucose-6-phosphate. Protein Sci. 2005, 14: 727-734.PubMed CentralView ArticlePubMedGoogle Scholar
- Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997, 13: 555-556.PubMedGoogle Scholar
- Bustamante CD, Nielsen R, Sawyer SA, Olsen KM, Purugganan MD, Hartl DL: Solvent accessibility and purifying selection within proteins of Escherichia coli and Salmonella enterica. Mol Biol Evol. 2000, 17: 301-308.View ArticlePubMedGoogle Scholar
- Choi SS, Vallender EJ, Lahn BT: Systematically assessing the influence of 3-dimensional structural context on the molecular evolution of mammalian proteomes. Mol Biol Evol. 2006, 23: 2131-2133.View ArticlePubMedGoogle Scholar
- Lynch M: Simple evolutionary pathways to complex proteins. Protein Sci. 2005, 14: 2217-2225.PubMed CentralView ArticlePubMedGoogle Scholar
- Gu X, Vander Velden K: DIVERGE: phylogeny-based analysis for functional-structural divergence of a protein family. Bioinformatics. 2002, 18: 500-501.View ArticlePubMedGoogle Scholar
- Hughes AL: Looking for Darwin in all the wrong places: the misguided quest for positive selection at the nucleotide sequence level. Heredity. 2007, 99: 364-373.View ArticlePubMedGoogle Scholar
- Zhang J: Rates of conservative and radical nonsynonymous nucleotide substitutions in mammalian nuclear genes. J Mol Evol. 2000, 50: 56-68.PubMedGoogle Scholar
- Hoekstra HE, Coyne JA: The locus of evolution: evo devo and the genetics of adaptation. Evolution. 2007, 61: 995-1016.View ArticlePubMedGoogle Scholar
- Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574.View ArticlePubMedGoogle Scholar
- Swofford DL: PAUP*. Phylogenetic analysis using parsimony (*and other methods), version 4.0b10 (Alvitec). 2002, Sinauer, Sunderland, MassachusettsGoogle Scholar
- Yang Z: Estimating the pattern of nucleotide substitution. J Mol Evol. 1994, 39: 105-111.PubMedGoogle Scholar
- Nylander JAA: MrModeltest v2. Program distributed by the author. 2004, Uppsala University, Evolutionary Biology CentreGoogle Scholar
- Posada D, Crandall KA: MODEL TEST: testing the model of DNA substitution. Bioinformatics Appl Note. 1998, 14: 817-818.View ArticleGoogle Scholar
- NCBI Map Viewer. [http://www.ncbi.nlm.nih.gov/mapview/]
- Ensembl BlastView. [http://www.ensembl.org/Multi/blastview]
- Gasteiger E, Hoogland C, Gattiker A, Duvaud S, Wilkins MR, Appel RD, Bairoch A: Protein identification and analysis tools on the ExPASy server. The Proteomics Protocols Handbook. Edited by: Walker JM. 2005, Totowa, NJ: Humana Press, 571-607.View ArticleGoogle Scholar
- Fraczkiewicz R, Braun W: Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules. J Comput Chem. 1998, 19: 319-333.View ArticleGoogle Scholar
- Sayle RA, Milner-White EJ: RASMOL: biomolecular graphics for all. Trends Biochem Sci. 1995, 20: 374-376.View ArticlePubMedGoogle Scholar
- Jones DT, Taylor WR, Thornton JM: The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci. 1992, 8: 275-282.PubMedGoogle Scholar
- Dayhoff MO, Schwartz RM, Orcutt BC: A model of evolutionary change in proteins. Atlas of Protein Sequence and Structure 5. Edited by: Dayhoff MO. 1978, Silver Springs, Maryland: National Biomedical Research Foundation, 345-352. Supp 3Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.