Tempo and mode of early gene loss in endosymbiotic bacteria from insects
© Delmotte et al; licensee BioMed Central Ltd. 2006
Received: 11 April 2006
Accepted: 18 July 2006
Published: 18 July 2006
Understanding evolutionary processes that drive genome reduction requires determining the tempo (rate) and the mode (size and types of deletions) of gene losses. In this study, we analysed five endosymbiotic genome sequences of the gamma-proteobacteria (three different Buchnera aphidicola strains, Wigglesworthia glossinidia, Blochmannia floridanus) to test if gene loss could be driven by the selective importance of genes. We used a parsimony method to reconstruct a minimal ancestral genome of insect endosymbionts and quantified gene loss along the branches of the phylogenetic tree. To evaluate the selective or functional importance of genes, we used a parameter that measures the level of adaptive codon bias in E. coli (i.e. codon adaptive index, or CAI), and also estimates of evolutionary rates (Ka) between pairs of orthologs either in free-living bacteria or in pairs of symbionts.
Our results demonstrate that genes lost in the early stages of symbiosis were on average less selectively constrained than genes conserved in any of the extant symbiotic strains studied. These results also extend to more recent events of gene losses (i.e. among Buchnera strains) that still tend to concentrate on genes with low adaptive bias in E. coli and high evolutionary rates both in free-living and in symbiotic lineages. In addition, we analyzed the physical organization of gene losses for early steps of symbiosis acquisition under the hypothesis of a common origin of different symbioses. In contrast with previous findings we show that gene losses mostly occurred through loss of rather small blocks and mostly in syntenic regions between at least one of the symbionts and present-day E. coli.
At both ancient and recent stages of symbiosis evolution, gene loss was at least partially influenced by selection, highly conserved genes being retained more readily than lowly conserved genes: although losses might result from drift due to the bottlenecking of endosymbiontic populations, we demonstrated that purifying selection also acted by retaining genes of greater selective importance.
The smallest genomes belong to bacterial obligate pathogens or intracellular symbionts of eukaryotes (e.g., Mollicutes, Rickettsiae, Spirochetes, Chlamydiae, and insect-associated gamma-proteobacteria). Phylogenetic analyses indicate that their small genome size is a derived state accompanying the transition to their specialized intracellular lifestyle. In other words, all these microbes certainly evolved from ancestors with larger genomes. One of the smallest genome described to date belongs to B. aphidicola, gamma-proteobacteria that maintain mutualistic endosymbiotic associations with aphids . Gamma-proteobacteria include other obligate partners of insects and command special attention for several reasons: first, genome reduction in this group has been extreme, yielding a ten fold range of genome sizes; second, free-living relatives of these symbionts include the model organism Escherichia coli (E. coli) for which extensive genetic information is available; third, gene order is extremely well conserved between symbionts of the same genus [2, 3] and rather well conserved over genome fragments between symbionts and free-living relatives . This allows the reconstruction of gene loss events at different evolutionary steps of symbiosis. The first genome sequence of B. aphidicola BAp  initiated comparative studies shedding light on the process of genome reduction in this endosymbiont [6, 7] but the recent sequencing of two additional B. aphidicola strains [2, 8] and of three endosymbionts closely related to Buchnera, i.e. Blochmannia floridanus and Blochmannia pennsylvanicus (the endosymbionts of carpenter ants, respectively Camponotus floridanus and Camponotus pennsylvanicus; [3, 9]), and Wigglesworthia glossinidia (the endosymbiont of the tsétsé fly, Glossina brevipalpis; ) considerably increase the scope for comparative analysis to reveal evolutionary patterns of gene loss. Moreover, it provides the opportunity for assessing the generality of the process of genome size reduction in three kinds of symbiotic lineages that have different gene content shaped by specific nutritional needs of their insect hosts.
The identification of long term evolutionary forces that drive genome shrinkage in endosymbionts is much debated. The trend toward large scale gene loss could reflect the inefficiency of natural selection at maintaining genes in the genomes of these cytoplasmically inherited bacteria. Indeed, the vertical partitioning of symbiotic lineages among hosts and their reduced population sizes strongly favour drift, a mechanism proposed to be responsible for the accumulation of mildly deleterious mutations in symbiotic genomes [6, 11]. The hypothesis of an increased fixation of deleterious mutations is supported by several lines of evidence: the general acceleration of evolutionary rates and massive AT enrichment [[12, 13]; but  for a selectionist interpretation], the increased proportion of non-synonymous mutations , the loss of adaptive codon bias  and the very low level of intraspecific polymorphism observed in bacterial populations [16, 17]. Because the host environment provides metabolites, many bacterial loci would become redundant thereby accumulating slightly deleterious mutations by a process known as Muller's ratchet, which eventually could lead to the functional inactivation of non essential genes. In fact, the DNA of a pseudogene may be completely deleted from the B. aphidicola genome in 40 to 60 My . Even some apparently beneficial genes (DNA repair, transcriptional regulation and replication initiation mechanisms) have been lost confirming that drift may play an important role in genome reduction [5, 9, 19].
If the ultimate forces leading to genome shrinkage in endosymbionts are still much debated, the proximal mechanisms responsible for DNA removal seem to be better understood [20, 21]. In particular, it has been suggested that DNA removal occurs because of mutational bias favouring deletions over insertions in bacterial genomes . This process which is apparently universal in prokaryotes could account for the scarcity of non coding DNA in most bacterial lineages. The loss of genes involved in DNA repair which are otherwise broadly conserved in bacteria might explain why deletion biases are not "corrected" and eventually lead to gene degradation and loss. Strikingly, such losses are a common characteristic of Buchnera, Wigglesworthia and Blochmannia, but also of Sitophilus orizae primary endosymbiont (a younger symbiont with a larger genome, where two of such genes are inactivated ) and even of free-living marine bacteria recently engaged in genome reduction . Chromosomal rearrangement which can lead to large deletion events as also contributes to DNA removal. Finally, obligate bacterial mutualists have lost the ability to incorporate foreign DNA which grants them a "one-way ticket to genome shrinkage".
Understanding the evolutionary processes that drive genome reduction requires determining the tempo (rate) and the mode (size and types of deletions) of gene losses. Detailed information is available on recent events of gene loss from comparing closely related symbionts: for example extreme genome stasis (conservation of gene order and gene repertoires) has been shown for three B. aphidicola genomes [2, 8] with an estimated tempo of gene loss in the range of 2–3 Myr per gene. Similar patterns emerge for Blochmannia , where a comparison between two species revealed a limited number of gene losses scattered along the genome with different rates for the two species (ca 0.6 Myr per gene for B. floridanus and ca 6 Myr per gene for B. penssylvanicus). These findings of gradual gene loss with some degree of specific variation is compatible with a phenomenon of drift, as seen above, mitigated by selective processes, since gene loss affects differential gene functions for different symbionts (e.g. B. aphidicola strains tend to conserve many genes involved in amino acid synthesis, while Blochmannia species tend to conserve a disproportionate number of genes involved in synthesis of cofactors).
In contrast, less is known about the rhythm and types of gene loss that occurred in the early steps of the acquisition of symbiosis. This is particularly critical, because gene losses that occurred in the ancestor of all B. aphidicola strains or Blochmannia species respectively were massive (estimated to >1000 genes lost ). Two recent comparative studies which reconstructed the hypothetical ancestral genome of B. aphidicola BAp and its free-living relatives have reached diverging conclusions on the organisation of gene losses. Moran and Mira  found that early gene loss involved deletions of large sets of contiguous genes including loci with unrelated functions, possibly due to recombination at repeated sequences [2, 26]. Such massive and non-oriented process would be the sign of strong genetic drift and weakened selection in the early stages of endosymbiosis [6, 11, 27, 28]. In contrast, Silva et al.  who focused on blocks of conserved synteny, found that genome shrinkage arose through multiple events of gene disintegration dispersed over the whole genome, which would be better explained by selection acting on individual genes even in these early steps. Even if larger deletions could not be ruled out, these authors insisted on the importance of selection for explaining the genome shrinkage observed in endosymbionts. Interestingly, recent genomic and experimental data support both scenarios: on the one hand, the presence of hundreds of pseudogenes scattered around the genomes of some pathogens recently sequenced [29–32] support the scenario whereby genome reduction occurs by slow erosion of individual genes. On the other hand, Nilsson et al.  have nicely showed that, even on a short evolutionary time scale, the disappearance of large stretches of DNA can be frequent in bacteria establishing in a constant environment. This experimental result suggests that large-scale deletions may occurred during the initial stages of genome reduction .
Characterizing the final set of genes of reduced genomes should help resolve this debate. Genome shrinkage is largely a lineage-specific process . Indeed, because evolution is a highly contingent process, it is difficult to predict the fate of each single lineage. Moreover, successive losses are not independent events because the loss of a gene from a genome may influence the types of losses tolerated in the future. Finally, identical functions can be achieved by non-orthologous genes in different lineages. For example, the comparison of genomes of closely related insect endosymbionts shows that they share only 50% of their protein-coding genes. Although important functions such as cell division processes, information storage and processing show a high conservation of their gene repertoires, remarkable differences exist for those genes that encode proteins involved in the cell envelope, flagellum biosynthesis and the metabolism of amino acids, nucleotides, or coenzymes [9, 34]. However, this does not necessarily mean that gene loss is a completely random process. In eukaryotes, Krylov et al.  recently investigated the relation between the propensity of a gene to be lost and its functional importance. Interestingly, these authors found significant relationships between the "propensity for gene loss" (PGL) and sequence substitution rate, gene dispensability, the number of protein interactions and the expression level of the gene. Thus, at least in eukaryotes, the likelihood of being lost seems to be an inverse relation of the biological importance of a gene. To date, this hypothesis has not been tested in prokaryotes, possibly because it is difficult to disentangle the respective contribution of gene loss and horizontal gene transfers in the evolution of bacterial genome size . However, we propose that the well characterized group of insect endosymbionts provides a good model system for investigating the relative importance of drift and purifying selection in the process of genome shrinkage.
In this study, we used five available endosymbiotic genome sequences of the gamma-proteobacteria – B. aphidicola BAp, B. aphidicola BSg, B. aphidicola BBp, Wigglesworthia glossinidia (Wgl) and Blochmannia floridanus (Bfl) – to test whether gene loss was related to the selective importance of genes. We used a parsimony method to reconstruct a minimal ancestral genome of insect endosymbionts and quantified gene loss along the branches of the phylogenetic tree. This allowed us to assess different loss events at different evolutionary scales and to estimate the likelihood that a gene be lost during the evolution of symbiosis. To evaluate the selective or functional importance of genes, we used a parameter that measures the level of adaptive codon bias in E. coli (i.e. codon adaptive index, or CAI), and also estimates of evolutionary rates (Ka) between either pairs of orthologs in free-living bacteria or in pairs of symbionts. These parameters are expected to be correlated with the selective pressure on the genes. We also studied the distribution of deletion sizes (in number of loci) in order to determine whether they occurred by small steps or through large deletions of multiple loci at different positions on the phylogenetic tree rooted at the last common ancestor of free living bacteria and modern endosymbionts.
Results and discussion
Single or multiple origin(s) of endosymbiosis?
It is worth noting that the same result was recently found in at least three other studies, applying different methods on a different set of orthologous protein-coding genes [9, 37, 38]. In particular, Lerat et al.  insisted on the complete lack of conflict between 205 orthologous genes they chose resulting in a fully resolved phylogeny where B. aphidicola BAp and Wiggleworthia grouped together. In this context, we believe that the monophyletic topology of these five endosymbionts is a good "working hypothesis" although our phylogenomic approach precludes classical bootstrapping testing of the robustness of the tree.
However, Herbeck et al.  using a similar approach with more taxa but only two genes, the conserved groEL and 16S rRNA, proposed a different scenario. They found various phylogenies that group Blochmannia and Wigglesworthia but separately from Buchnera, which leaded them to conclude that Blochmannia and Wigglesworthia represent an origin of primary endosymbiosis that is independent from that of Buchnera. Another recent study also cast in doubt the monophyletic character of the three endosymbiotic lineages . These authors found a strong discordance between a phylogeny based on concatenated conserved amino acid sequences and reconstructions based on gene order. The former supported a single origin, while the latter placed Blochmannia as closer relative to E. coli than Yersinia and any other symbiotic lineage. This comes from the fact that synteny is high between Blochmannia and E. coli . This result could still be compatible with a single origin for endosymbionts, provided that different levels of gene rearrangements occur among different lineages.
Although we cannot definitively conclude on the single/multiple origin of AT-rich endosymbionts, we have retained the monophyly of endosymbionts as a working hypothesis since it has been reached several times by different phylogenomic approaches, including our.
Gene loss and CAI of E. coli
Correlations coefficients (R) between Propensity of Gene Loss (PGL) and Codon Adaptive Index (CAI) or non-synonymous substitution rates (Ka). The analysis is restricted to the genes present in the LCA2 that have been lost at least one time during symbiosis (Figure 3). All coefficients are highly significant (non parametric correlation test of Spearman; P < 0.005). N: number of genes.
Gene loss and substitution rates
In consequence, we found significant positive correlations between PGL and the non-synonymous substitution rate for the five pairwise comparisons conducted (Table 1). Genes lost over the course of symbiosis evolution were faster evolving than average (in symbiont and free living bacteria).
Correlation (R) between CAI and Ka for several pairwise comparisons between free-living and endosymbiotic bacterial lineages. All coefficients are highly significant (non parametric correlation test of Spearman; P < 0.005).
Why some genes are retained and some are lost?
We have tried to determine the extent to which gene loss is predictable, by evaluating the correlations between some evolutionary parameters and the propensity of a gene to be lost. We showed that losses are concentrated on the genes that are evolutionary less constrained and of probably less selective importance (genes characterized by low adaptive codon bias in E. coli and high evolutionary rates in free-living and symbiotic species). The bias between propensities of loss for different categories of genes (genes of different CAI in E. coli and of different non-synonymous substitution rates) is particularly strong at the initial steps of the acquisition of symbiosis. We indeed demonstrated that at all stages of symbiosis, genome reduction was specifically targeted to genes of lesser selective importance, or less essential for the survival of the host/symbiont couple. It also is significant that symbionts retain some informative signal about the propensity of gene loss across subsequent diversification in their hosts: indeed, gene losses in later stages concern primarily sequences characterized by relatively high evolutionary rates in other symbiotic lineages. Late gene losses also primarily affect genes characterized by low CAI in E. coli. All this suggests that gene loss, even though governed by drift, was also partially influenced by selection both at ancient and recent stages of symbiosis evolution.
There is no a priori reason why non-synonymous substitution rate should be correlated with the propensity of a gene to be lost. Indeed, these two variables are measures of evolutionary conservation that capture substantially different aspects of evolution and one can imagine that functional constraints could be relaxed on a protein achieving an important function in the cell. Here, we have showed that PGL, sequence evolution rate and CAI are interdependent as already found in eukaryotes . Thus, we believe that the large prokaryotic genomes already available combined with data on gene dispensability, expression, and protein interactivity, open the possibility to new comparative studies testing the prevailing forces driving genome reduction.
A majority of deletions involving few genes in syntenic fragments
Validity of the results under alternative phylogenetic scenarios
We stress that even under the hypothesis of independent origins of endosymbionts, our conclusions on the links between gene dispensability and selective parameters (CAI, Ka) would be little affected. Indeed, a strong effect of CAI and Ka on the propensity of gene loss was observed even without considering the existence of LCA2 because it was observed for the losses occurring between LCA1 and each of the extant endosymbiotic lineages.
In contrast the reconstruction of deletion sizes is more dependent on the validity of our common ancestry of the five symbionts studied. Considering the reconstructed ancestor of the three B. aphidicola strains that are clearly monophyletic affected only marginally earlier studies based on single genome .
The particular originality of this work compared to that of precedent studies [6, 7] lies in its integration of the information on the selective pressures on genes together with more genome data. The larger genome data set allowed us to detect and characterize more ancient events of gene loss by including the reconstructed common ancestors of symbiotic lineages. We have shown that genes lost in the early stages of symbiosis are on average less selectively constrained than genes conserved in any of the extant symbiotic strains studied. This is shown by significant differences among the two types of genes of two parameters that can measure selective importance: non-synonymous evolutionary rates (in symbionts or in free-living enterics) and codon bias in E. coli. In addition, our reconstruction of deeper nodes allowed also a better description of deletion events at the different steps, in particular of their size distribution. Under the hypothesis of a common origin of different symbioses, gene losses would have been mostly occurring through rather small blocks, and in syntenic regions between at least one of the symbionts and present-day E. coli. Our study did not include two genomes from insect-associated endosymbionts that have just been completed, Blochmannia pennsylvanicus  and B. aphidicola BCc (host Cinara cedri, available soon). Studying these genomes will help to better reconstruct recent gene loss events, which occurred after the divergence of Blochmannia and B. aphidcola strains respectively. However, given their small genomes that include very few genes not already present in other complete sequences, their inclusion could not significantly change our conclusions on early patterns of gene loss.
The knowledge of more endosymbiotic genomes, particular of genomes of larger sizes (e.g. Sitophilus oryzae primary endosymbiont  will be of paramount interest for fully resolving several puzzles that remain to date. It will indeed provide a more robust phylogenetic scenario of symbiosis acquisition (in single or multiple events) and a finer knowledge on the rate and patterns of gene losses, which will allow disentangling mutational and selective pressures that modulate genome reduction.
Complete genomes were retrieved from EMBL database: Escherichia coli K12 (NC_000913), Salmonella typhimurium LT2 (NC_003197), Vibrio cholerae O1 biovar eltor str. N16961 (NC_002505, NC_002506), Yersinia pestis (NC_003143), Buchnera aphidicola of Baizongia pistaciae (NC_004545), Buchnera aphidicola of Schizaphis graminum (NC_004061), Buchnera aphidicola of Acyrthosiphon pisum (NC_002528), Wigglesworthia brevipalpis (NC_004344), Candidatus Blochmannia floridanus (NC_005061). The symbiotic strains will be referred as B. aphidicola BAp, B. aphidicola BSg, B. aphidicola BBp, Wigglesworthia (Wgl) and Blochmannia (Bfl).
Phylogenetic reconstruction of trees including endosymbiotic DNA sequences which are strongly AT-biased and evolve at relatively high rates is problematic when using classical models of DNA sequence evolution. To avoid this pitfall, we used the tree building method implemented in NHML3 which is based on a heterogeneous model accounting for unequal transition/transversion rates, unequal evolutionary rates among sequence sites and unequal base compositions of sequences . Maximum likelihood inference based on this model was applied to a trimmed alignment of 61 concatenated conserved protein-coding genes (19143 nucleotides) involved in translation. Trimming was done with GBLOCKS  in order to limit the data set to unambiguous well conserved parts of the alignments. Only the first two positions of codons, which are relatively less AT enriched  were retained for phylogenetic analysis. This yielded a most likely phylogenetic tree that grouped together the five endosymbiotic lineages (Figure 1) suggesting a unique origin of endosymbionts. As the calculation time of this phylogenetic method impeded bootstrap analysis, we decided to test alternative tree topologies, for example assuming several acquisitions of endosymbiosis. We also tested slightly different topologies resulting from the move of Yersinia pestis and Salmonella thyphimurium along the tree. Loglikelihood differences between all additional trees and the initial topology were tested using the loglikelihood ratio tests.
Reconstruction of endosymbiotic ancestors
Because it is difficult to assign orthology for structural RNA, we excluded from the analysis all non coding genes. To determine the set of orthologous coding genes between E. coli K12 and each of the nine bacteria, we performed reciprocal blasts with a cut off value of 10-4, retaining only those genes that were best hits in both comparisons. Applying a parsimony principle, the common ancestor of all endosymbionts was reconstructed as the sum of the orthologous coding genes present in at least one endosymbiotic lineage, E. coli and V. cholera. This led to removal of the few genes that were present in endosymbionts but have no orthologous equivalent in either E. coli or V. cholera. Pseudogenes of endosymbiotic genomes were included in the analysis and considered as lost genes.
Finally, this approach allowed the determination of a minimal free-living ancestor of endosymbionts and of their free-living relatives (LCA1) comprising 1983 conserved coding genes that have a low probability of being acquired by lateral gene transfer (LGT). Indeed, LGT generally involves uptake, from distantly related bacteria or phages , of genes that are absent from related bacteria. Since our method for reconstructing LCA1 was very conservative, we can speculate that the real last free-living bacteria that gave rise to endosymbionts contained more genes. Intermediary common ancestor to symbiotic lineages were defined as subsets of the initial 1983 CDS of LCA1: LCA2 corresponded to the sum of genes present in all endosymbiotic lineages, generating our common ancestry to all these symbionts, LCA3 to the sum of the genes present in all B. aphidicola lineages, and LCA4 to the sum of genes present in Blochmannia and Wigglesworthia lineages (Figure 1).
Quantifying gene loss
Gene loss was quantified using different approaches including both quantitative and qualitative variables.
A quantitative marker of gene loss: for each gene present in LCA1, a parameter called the propensity of a given gene to be lost (PGL) was calculated . This measure required the identification of gene loss along the branches of the phylogeny which was based on the reconstruction of last common ancestors of endosymbionts described above. The propensity of a gene to be lost was calculated as the ratio of the sum of the lengths of branches lacking a given gene to the sum of the lengths of all branches of the tree. This generated a quantitative parameter ranging from 0 (never lost) to 1 (lost from all the branches) that could be used to perform correlations with CAI and substitution rates (Tables 1 and 2).
Discrete categories of genes (Figure 3): we quantified gene loss by grouping genes present in LCA1 into three categories: (A) genes lost in all symbionts, presumably between LCA1 and LCA2; (B) genes present in LCA2 but lost in some of the symbiotic lineages; (C) genes kept in all endosymbionts.
CAI and substitution rate estimates
Our main objective was to examine correlations between patterns of gene loss (at different depths of the tree) and the functional importance of genes, in order to measure if losses, particularly in the initial stages of symbiosis could have been limited by rather precise constraints . To evaluate the level of functional importance of genes, we used two different parameters i) the level of adaptive codon bias (CAI) and ii) the rate of sequence evolution. The former parameter is correlated with the level of gene expression in E. coli , and more essential genes probably have higher levels of expression . Unfortunately CAI data are not available for genes of endosymbionts which show hardly any trace of adaptive bias [12, 15]. We therefore used the CAI of E. coli orthologs, calculated through the CODONW package .
In addition, we estimated synonymous (Ks) and non-synonymous (Ka) substitutions rates by performing pairwise comparisons of coding sequences. Estimates were calculated using Li's method  implemented in the diverge function from the GCG 10.2 package. Since Ks were likely saturated for many of the pairwise comparisons, we restricted our analysis to non-synonymous substitution rates (Ka). Pairwise estimates of non-synonymous substitution rates were conducted for two free-living species pairs (Eco-Stm, Eco-Ype) and four endosymbiotic species pairs (BAp-BSg, BAp-BBp, BSg-BBp, Bfl-Wgl; see Schaber et al.  for detailed results on substitution rates).
Finally, we looked at the relation between gene loss at different depths of the tree and the degree of functional constraint of genes (estimated by CAI or evolutionary rates) (Figures 3, 4, 5). The significance of the different relationships was tested using either the non-parametric Mann-Whitney tests or the Spearman correlation tests.
Reconstruction of gene deletions
Following Moran and Mira , we assumed that the LCA1 possessed the E. coli gene order. This is likely true for a majority of genes, given evidence discussed by these authors that most gene rearrangements occurred in the symbiont, and is also supported by our estimations that about 75% of the genes from chromosome I in V. cholerae are in syntenic fragments with E. coli. Synteny was defined between fragments of the LCA1 and individual symbiotic species if consecutive genes in the LCA1 corresponded to consecutive genes in a given symbiont. To reconstruct synteny between LCA1 and the ancestor of B. aphidicola strains (LCA3), we used the fact that gene order is conserved in that group and simply "filled the gaps" of B. aphidicola BAp with genes absent from this species but present in either of the two other B. aphidicola strains. It was not possible to establish the gene order in LCA2 given the major rearrangements between Blochmannia, Wigglesworthia, and Buchnera (LCA3). However, we assumed that if synteny was established between a fragment in LCA1 and genes from any of these three symbionts, this fragment must have been syntenic with LCA1 in the common ancestor of these symbionts (LCA2).
FD and JS were respectively supported by the Marie Curie Host Fellowships of the European Union Nr. MCFI-1999-01047 and MCFI-1999-01055. This work was supported by the contract HPMD-CT-2000-00056 from European Union and the grants BMC2003-00305 from the Ministerio de Educación y Ciencia (Spain) and Grupos03/04 from Conselleria d'Empresa, Universitat i Ciència (Generalitat Valenciana, Spain) to AM. We are grateful to Jacqui Shykoff for her critical review of the manuscript and her help in revising the english.
- Gil R, Sabater-Munoz B, Latorre A, Silva FJ, Moya A: Extreme genome reduction in Buchnera spp.: Toward the minimal genome needed for symbiotic life. Proc Natl Acad Sci. 2002, 99: 4454-4458. 10.1073/pnas.062067299.PubMed CentralView ArticlePubMedGoogle Scholar
- Tamas I, Klasson L, Canback B, Naslund AK, Eriksson AS, Wernegreen JJ, Sandstrom JP, Moran NA, Andersson SGE: 50 million years of genomic stasis in endosymbiotic bacteria. Science. 2002, 296: 2376-2379. 10.1126/science.1071278.View ArticlePubMedGoogle Scholar
- Degnan PH, Lazarus AB, Wernegreen JJ: Genome sequence of Blochmannia pennsylvanicus indicates parallel evolutionary trends among bacterial mutualists of insects. Genome Res. 2005, 15: 1023-1033. 10.1101/gr.3771305.PubMed CentralView ArticlePubMedGoogle Scholar
- Belda E, Moya A, Silva FJ: Genome rearrangement distances and gene order phylogeny in gamma-Proteobacteria. Mol Biol Evol. 2005, 22: 1456-1467. 10.1093/molbev/msi134.View ArticlePubMedGoogle Scholar
- Shigenobu S, Watanabe H, Hattori M, Sakaki Y, Ishikawa H: Genome sequence of the endocellular bacterial symbiont of aphids Buchnera sp. APS. Nature. 2000, 407: 81-86. 10.1038/35024074.View ArticlePubMedGoogle Scholar
- Moran NA, Mira A: The process of genome shrinkage in the obligate symbiont Buchnera aphidicola. Genome Biol. 2001, 2: RESEARCH0054-10.1186/gb-2001-2-12-research0054.PubMed CentralView ArticlePubMedGoogle Scholar
- Silva FJ, Latorre A, Moya A: Genome size reduction through multiple events of gene disintegration in Buchnera APS. Trends Genet. 2001, 17: 615-618. 10.1016/S0168-9525(01)02483-0.View ArticlePubMedGoogle Scholar
- van Ham RCHJ, Kamerbeek J, Palacios C, Rausell C, Abascal F, Bastolla U, Fernandez JM, Jimenez L, Postigo M, Silva FJ, Tamames J, Viguera E, Latorre A, Valencia A, Moran F, Moya A: Reductive genome evolution in Buchnera aphidicola. Proc Natl Acad Sci. 2003, 100: 581-586. 10.1073/pnas.0235981100.PubMed CentralView ArticlePubMedGoogle Scholar
- Gil R, Silva FJ, Zientz E, Delmotte F, Gonzalez-Candelas F, Latorre A, Rausell C, Kamerbeek J, Gadau J, Holldobler B, van Ham RCHJ, Gross R, Moya A: The genome sequence of Blochmannia floridanus: Comparative analysis of reduced genomes. Proc Natl Acad Sci. 2003, 100: 9388-9393. 10.1073/pnas.1533499100.PubMed CentralView ArticlePubMedGoogle Scholar
- Akman L, Yamashita A, Watanabe H, Oshima K, Shiba T, Hattori M, Aksoy S: Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidia. Nature Genetics. 2002, 32: 402-407. 10.1038/ng986.View ArticlePubMedGoogle Scholar
- Ochman H, Moran NA: Genes lost and genes found: Evolution of bacterial pathogenesis and symbiosis. Science. 2001, 292: 1096-1098. 10.1126/science.1058543.View ArticlePubMedGoogle Scholar
- Moran NA: Accelerated evolution and Muller's rachet in endosymbiotic bacteria. Proc Natl Acad Sci. 1996, 93: 2873-2878. 10.1073/pnas.93.7.2873.PubMed CentralView ArticlePubMedGoogle Scholar
- Clark MA, Moran NA, Baumann P: Sequence evolution in bacterial endosymbionts having extreme base compositions. Mol Biol Evol. 1999, 16: 1586-1598.View ArticlePubMedGoogle Scholar
- Rocha EP, Danchin A: Base composition bias might result from competition for metabolic resources. Trends Genet. 2002, 18: 291-294. 10.1016/S0168-9525(02)02690-2.View ArticlePubMedGoogle Scholar
- Rispe C, Delmotte F, van Ham RCHJ, Moya A: Mutational and selective pressures on codon and amino acid usage in Buchnera, endosymbiotic bacteria of aphids. Genome Res. 2004, 14: 44-53. 10.1101/gr.1358104.PubMed CentralView ArticlePubMedGoogle Scholar
- Funk DJ, Wernegreen JJ, Moran NA: Intraspecific variation in symbiont genomes: Bottlenecks and the aphid-buchnera association. Genetics. 2001, 157: 477-489.PubMed CentralPubMedGoogle Scholar
- Abbot P, Moran NA: Extremely low levels of genetic polymorphism in endosymbionts (Buchnera) of aphids (Pemphigus). Mol Ecol. 2002, 11: 2649-2660. 10.1046/j.1365-294X.2002.01646.x.View ArticlePubMedGoogle Scholar
- Gomez-Valero L, Soriano-Navarro M, Perez-Brocal V, Heddi A, Moya A, Garcia-Verdugo JM, Latorre A: Coexistence of Wolbachia with Buchnera aphidicola and a secondary symbiont in the aphid Cinara cedri. J Bacteriol. 2004, 186: 6626-6633. 10.1128/JB.186.19.6626-6633.2004.PubMed CentralView ArticlePubMedGoogle Scholar
- Andersson SGE, Zomorodipour A, Andersson JO, Sicheritz-Ponten T, Alsmark UCM, Podowski RM, Naslund AK, Eriksson AS, Winkler HH, Kurland CG: The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature. 1998, 396: 133-140. 10.1038/24094.View ArticlePubMedGoogle Scholar
- Andersson JO, Andersson SGE: Insights into the evolutionary process of genome degradation. Curr Opin Genet Dev. 1999, 9: 664-671. 10.1016/S0959-437X(99)00024-6.View ArticlePubMedGoogle Scholar
- Peterson SN, Fraser CM: The complexity of simplicity. Genome Biol. 2001, 2: COMMENT2002-10.1186/gb-2001-2-2-comment2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Mira A, Ochman H, Moran NA: Deletional bias and the evolution of bacterial genomes. Trends Genet. 2001, 17: 589-596. 10.1016/S0168-9525(01)02447-7.View ArticlePubMedGoogle Scholar
- Dale C, Wang B, Moran N, Ochman H: Loss of DNA recombinational repair enzymes in the initial stages of genome degeneration. Mol Biol Evol. 2003, 20: 1188-1194. 10.1093/molbev/msg138.View ArticlePubMedGoogle Scholar
- Dufresne A, Garczarek L, Partensky F: Accelerated evolution associated with genome reduction in a free-living prokaryote. Genome Biol. 2005, 6:Google Scholar
- Silva FJ, Latorre A, Moya A: Why are the genomes of endosymbiotic bacteria so stable?. Trends Genet. 2003, 19: 176-180. 10.1016/S0168-9525(03)00041-6.View ArticlePubMedGoogle Scholar
- Andersson SGE, Kurland CG: Reductive evolution of resident genomes. Trends Microbiol. 1998, 6: 263-268. 10.1016/S0966-842X(98)01312-2.View ArticlePubMedGoogle Scholar
- Ochman H: Genomes on the shrink. Proc Natl Acad Sci. 2005, 102: 11959-11960. 10.1073/pnas.0505863102.PubMed CentralView ArticlePubMedGoogle Scholar
- Wernegreen JJ: Genome evolution in bacterial endosymbionts of insects. Nat Rev Genet. 2002, 3: 850-861. 10.1038/nrg931.View ArticlePubMedGoogle Scholar
- Cole ST, Eiglmeier K, Parkhill J, James KD, Thomson NR, Wheeler PR, Honore N, Garnier T, Churcher C, Harris D, Mungall K, Basham D, Brown D, Chillingworth T, Connor R, Davies RM, Devlin K, Duthoy S, Feltwell T, Fraser A, Hamlin N, Holroyd S, Hornsby T, Jagels K, Lacroix C, Maclean J, Moule S, Murphy L, Oliver K, Quail MA, Rajandream MA, Rutherford KM, Rutter S, Seeger K, Simon S, Simmonds M, Skelton J, Squares R, Squares S, Stevens K, Taylor K, Whitehead S, Woodward JR, Barrell BG: Massive gene decay in the leprosy bacillus. Nature. 2001, 409: 1007-1011. 10.1038/35059006.View ArticlePubMedGoogle Scholar
- Lerat E, Ochman H: Recognizing the pseudogenes in bacterial genomes. Nucleic Acids Res. 2005, 33: 3125-3132. 10.1093/nar/gki631.PubMed CentralView ArticlePubMedGoogle Scholar
- Jin Q, Yuan ZH, Xu JG, Wang Y, Shen Y, Lu WC, Wang JH, Liu H, Yang J, Yang F, Zhang XB, Zhang JY, Yang GW, Wu HT, Qu D, Dong J, Sun LL, Xue Y, Zhao AL, Gao YS, Zhu JP, Kan B, Ding KY, Chen SX, Cheng HS, Yao ZJ, He BK, Chen RS, Ma DL, Qiang BQ, Wen YM, Hou YD, Yu J: Genome sequence of Shigella flexneri 2a: insights into pathogenicity through comparison with genomes of Escherichia coli K12 and O157. Nucleic Acids Res. 2002, 30: 4432-4441. 10.1093/nar/gkf566.PubMed CentralView ArticlePubMedGoogle Scholar
- Parkhill J, Dougan G, James KD, Thomson NR, Pickard D, Wain J, Churcher C, Mungall KL, Bentley SD, Holden MTG, Sebaihia M, Baker S, Basham D, Brooks K, Chillingworth T, Connerton P, Cronin A, Davis P, Davies RM, Dowd L, White N, Farrar J, Feltwell T, Hamlin N, Haque A, Hien TT, Holroyd S, Jagels K, Krogh A, Larsen TS, Leather S, Moule S, O'Gaora P, Parry C, Quail M, Rutherford K, Simmonds M, Skelton J, Stevens K, Whitehead S, Barrell BG: Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature. 2001, 413: 848-852. 10.1038/35101607.View ArticlePubMedGoogle Scholar
- Nilsson AI, Koskiniemi S, Eriksson S, Kugelberg E, Hinton JCD, Andersson DI: Bacterial genome size reduction by experimental evolution. Proc Natl Acad Sci. 2005, 102: 12112-12116. 10.1073/pnas.0503654102.PubMed CentralView ArticlePubMedGoogle Scholar
- Klasson L, Andersson SGE: Evolution of minimal-gene-sets in host-dependent bacteria. Trends Microbiol. 2004, 12: 37-43. 10.1016/j.tim.2003.11.006.View ArticlePubMedGoogle Scholar
- Krylov DM, Wolf YI, Rogozin IB, Koonin EV: Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. Genome Res. 2003, 13: 2229-2235. 10.1101/gr.1589103.PubMed CentralView ArticlePubMedGoogle Scholar
- Boussau B, Karlberg EO, Frank AC, Legault BA, Andersson SGE: Computational inference of scenarios for alpha-proteobacterial genome evolution. Proc Natl Acad Sci. 2004, 101: 9722-9727. 10.1073/pnas.0400975101.PubMed CentralView ArticlePubMedGoogle Scholar
- Lerat E, Daubin V, Moran NA: From gene trees to organismal phylogeny in prokaryotes: The case of the gamma-proteobacteria. Plos Biol. 2003, 1: 101-109. 10.1371/journal.pbio.0000019.View ArticleGoogle Scholar
- Canback B, Tamas I, Andersson SG: A phylogenomic study of endosymbiotic bacteria. Mol Biol Evol. 2004, 21: 1110-1122. 10.1093/molbev/msh122.View ArticlePubMedGoogle Scholar
- Herbeck JT, Degnan PH, Wernegreen JJ: Nonhomogeneous model of sequence evolution indicates independent origins of primary endosymbionts within the enterobacteriales (gamma-Proteobacteria). Mol Biol Evol. 2005, 22: 520-532. 10.1093/molbev/msi036.View ArticlePubMedGoogle Scholar
- Schaber J, Rispe C, Wernegreen J, Buness A, Delmotte F, Silva FJ, Moya A: Gene expression levels influence amino acid usage and evolutionary rates in endosymbiotic bacteria. Gene. 2005, 352: 109-117. 10.1016/j.gene.2005.04.003.View ArticlePubMedGoogle Scholar
- Pal C, Papp B, Hurst LD: Highly expressed genes in yeast evolve slowly. Genetics. 2001, 158: 927-931.PubMed CentralPubMedGoogle Scholar
- Dagan T, Blekhman R, Graur D: The "domino theory' of gene death: Gradual and mass gene extinction events in three lineages of obligate symbiotic bacterial pathogens. Mol Biol Evol. 2006, 23: 310-316. 10.1093/molbev/msj036.View ArticlePubMedGoogle Scholar
- Galtier N, Gouy M: Inferring pattern and process: Maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis. Mol Biol Evol. 1998, 15: 871-879.View ArticlePubMedGoogle Scholar
- Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000, 17: 540-552.View ArticlePubMedGoogle Scholar
- Daubin V, Moran NA, Ochman H: Phylogenetics and the cohesion of bacterial genomes. Science. 2003, 301: 829-832. 10.1126/science.1086568.View ArticlePubMedGoogle Scholar
- Sharp PM, Li WH: The codon Adaptation Index-a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987, 15: 1281-1295.PubMed CentralView ArticlePubMedGoogle Scholar
- Karlin S, Mrazek J: Predicted highly expressed genes of diverse prokaryotic genomes. J Bacteriol. 2000, 182: 5238-5250. 10.1128/JB.182.18.5238-5250.2000.PubMed CentralView ArticlePubMedGoogle Scholar
- CodonW. [http://sourceforge.net/projects/codonw/]
- Li WH: Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J Mol Evol. 1993, 36: 96-99. 10.1007/BF02407308.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.