Complex fate of paralogs
© Szklarczyk et al; licensee BioMed Central Ltd. 2008
Received: 04 November 2008
Accepted: 18 December 2008
Published: 18 December 2008
Thanks to recent high coverage mass-spectrometry studies and reconstructed protein complexes, we are now in an unprecedented position to study the evolution of biological systems. Gene duplications, known to be a major source of innovation in evolution, can now be readily examined in the context of protein complexes.
We observe that paralogs operating in the same complex fulfill different roles: mRNA dosage increase for more than a hundred cytosolic ribosomal proteins, mutually exclusive participation of at least 54 paralogs resulting in alternative forms of complexes, and 24 proteins contributing to bona fide structural growth. Inspection of paralogous proteins participating in two independent complexes shows that an ancient, pre-duplication protein functioned in both multi-protein assemblies and a gene duplication event allowed the respective copies to specialize and split their roles.
Variants with conditionally assembled, paralogous subunits likely have played a role in yeast's adaptation to anaerobic conditions. In a number of cases the gene duplication has given rise to one duplicate that is no longer part of a protein complex and shows an accelerated rate of evolution. Such genes could provide the raw material for the evolution of new functions.
Gene duplication can be a major source of innovation in evolution , providing redundancy and additional genetic material to build upon and differentiate. In general, eukaryotic genomes contain a large fraction of gene duplicates, with paralogs stemming not only from single gene or segmental duplications, but, in the case of S. cerevisiae, also from a Whole-Genome Duplication event that occurred approximately 100 mln years ago (WGD; [2, 3]). Genomic instability and massive gene loss promptly followed WGD and purged most of the newly formed gene copies from the yeast genome, retaining approximately 10% of them . Today, using multiple genomes of related fungal species with conserved synteny, we can unambiguously identify hundreds of gene pairs as WGD paralogs  in addition to normal small scale paralogs.
The identification of paralogs of WGD origin, in conjunction with the wealth of data on physical protein interactions and derived maps of protein complexes, puts us in an unprecedented position to test the fate of nascent duplicated genes and to potentially identify cases of duplication of whole complexes. Recently, it has been shown that, after gene duplication, protein interactions can be conserved [5, 6]. The data suggested that there exists a stepwise pathway of evolution for such functional modules , with duplications of homomeric interactions known to have a significant influence on the evolution of genes . Moreover, it is known that gene duplicates can be found less often among the core components of protein complexes compared to sparse regions of protein interaction network . For our study of the impact of gene duplication on protein complexes, we separated paralogs into two distinct, non-overlapping classes: genes that were duplicated at the WGD event, and non-WGD duplicates detectable by sequence similarity. Dubbed small scale duplications (SSD), these paralogs are the result of the most recent gene duplications, identified per event by employing a best bi-directional hit criterion on all yeast gene pairs (see Methods). From the analysis of the phylogenetic distribution and number of paralogs in related species, it appears that the time of duplication of SSD genes greatly predates the WGD event (see Methods). Both duplication types, WGD and SSD, cover together ~40% of yeast genes, providing a comprehensive overview of these evolutionary events. These two paralog types are already known to differ with respect to their expression pattern [8, 9] and synthetic lethality rate , by displaying different phenotypic effects when deleted  and occurrence across functional classes (e.g., stress responsive genes, ). Musso and colleagues  show that nearly half of WGD paralogs co-cluster in the same protein complex. Amoutzias and colleagues  indicate that whole genome duplication did not change the dimerization specificities of interacting homologs. Here, we show a much more detailed spectrum of evolutionary and functional fates of higher order protein complex subunits. This integrated overview, enables us to quantify the fates with respect to the duplication type and address questions related to protein specialization (subfunctionalization), as well as the emergence of novel functions related to complexes (neofunctionalization).
Our hypotheses were tested on various types of manually curated data: both complexes from MIPS consortium , and those annotated by SGD . To avoid a possible bias introduced by manual curation, we also use computationally derived maps of complexes [15, 16], reconstruction of which was possible owing to recent mass-spectrometry studies [17, 18]. Integration of these datasets allowed us to systematically study the fates of all gene duplicates which are involved in protein complexes.
The fates of duplicate genes in complexes
Intra-complex paralogs: retention is an important fate of paralogs within complexes
We observe a very strong preference for both duplicated proteins to function in the same module. Compared to a null model, where proteins are stochastically reshuffled between complexes, intra-complex paralogs are ~40-fold overrepresented (SGD modules, ). This preference is similar, and not statistically different for both duplication types (P = 0.97, chi-square test) and holds for other module definitions, including the computationally derived protein complexes from complex co-purification experiments (see additional file 1, Table S1). Paralog retention within the module is thus an important factor in shaping the map of protein complexes.
We thus recover the previously made observation that WGD and SSD paralogs are known to act within the ancestral protein complex after the duplication [7, 8]. Further analysis however revealed a wider spectrum of fates in which two paralogs can be involved in a single protein complex, as illustrated in the analysis of the essential yeast chromatin remodeling complex RSC. Owing to the availability of protein-protein interaction data [17, 18] we can distinguish between different modes of participation in a single complex. The first, a more "direct" bait-prey interaction mode, occurs when one protein was designated a bait and the other protein co-purified as a prey; this event is characterized by a high spoke value . The second type of interaction, a prey-prey interaction, can be detected when two prey proteins were co-purified with the same bait in two independent purification experiments and does not provide the evidence for the two proteins to co-occur in the same protein complex at a given time. Hence, we were able to specify the following intra-complex fates:
Ia. mRNA dosage effect
Paralogs in complexes.
46 (P < 10-3)
0.3 (P < 10-3)
110 dosage increase
24 interacting homologs
54 module variants
Mostly old duplications,
5× less WGD paralogs (P < 4*10-4)
Ib. Interacting homologs
This subclass consists of paralogs both present in a protein complex at the same time. Using protein-protein interaction data we identified 24 intra-complex paralogs with a bait-prey interaction type, signified by a high spoke value of least 5 (see Methods for details). This class is exemplified by the RSC3/RSC30 pair from RSC chromatin remodeling complex, known to form a stable heterodimer . This kind of relationship between paralogs is likely to result from an ancestral homodimer, where a paralogous replacement of the dimer's components took place . Strong positive co-expression (Spearman's correlation coefficient of 0.4), even though weaker than the tightly co-regulated CRPs, provides additional clues for simultaneous presence of both proteins in the functional module. Homomers undergoing this evolutionary route are probably the classic view on how two paralogs are involved in the same protein complex, as exemplified by the F1-ATPase alpha and beta subunits .
Ic. Module variants
This perhaps somewhat less explicitly recognized category embraces paralogs with a seemingly intrinsic contradiction: operating within the same "complex", yet never present together with only a prey-prey evidence for their interaction. Such mutually exclusive presence implies existence of different variants of the same complex. To assign proteins to the module variants class, we select intra-complex paralogs with no evidence of direct interaction. That includes paralogs never purified together, or with a negative spoke value (see Methods for details). Our analysis yields 54 intra-complex paralogs that belong to this class. Lower co-expression of these genes, likely resulting from the functional role undertaken by paralogs, confirms that these subunits are alternatively present in a module, thus not required to operate simultaneously (average co-expression Spearman's correlation 0.2, statistically different from other classes, one sided Wilcoxon ranked sum test, P < 0.02). More divergent expression also suggests a mechanism of control of complex activity by conditional assembly (analogous to just-in-time assembly for cell cycle complexes, ).
Type of intra-complex paralogs and viability of single-gene knockouts in rich medium.
Intra-complex duplication type
Deletion mutants of module variants exhibit differential growth patterns when cultured on various carbon sources.
Module variant paralogs
COX5A knockout: reduced fitness when no glucose
CIN8 knockout: unrestricted growth only on glycine
BUL1 knockout: reduced fitness on ethanol
DID4 knockout: severely reduced growth on lactate
NOT3 knockout: severely impaired growth on glycine
NOT5 knockout: growth severely impaired in all conditions tested
REG1 knockout: limited growth on glucose
Bi-complex paralogs: proteins functioning in different complexes
As opposed to intra-complex paralogs, where both proteins function in the same module, bi-complex paralogs each participate in distinct ones. Depending on the map of protein complexes, 44 or more genes fall into this category (see Table 1 and additional file 1, Table S2). We confirmed the lack of interaction between this type of paralogs with protein-complex purification data (only two out of 31 pairs were ever purified together, significantly less than intra-complex paralogs, Fisher exact test P < 4e-5, odds ratio 15).
Interestingly, for bi-complex paralogs, a significant difference between WGD and SSD duplicates can be seen. The majority of them are SSD duplicates (see additional file 1, Table S2). This strong bias, with SSD constituting more than 80% of the bi-complex class, contrasts with handful of WGD paralogs split between different complexes. We propose this to be an effect associated with the age of duplication. The lion's share of SSD paralogs not only predate the WGD event but are older than the divergence with S. pombe. While none of the eight post-S. pombe SSD duplications is bi-complex, three duplications are intra-complex (see additional file 1, Table S5), a hint that not the type of duplication (SSD versus WGD), but its age has a greater influence on the paralog's fate. Over extended evolutionary time since the ancient duplication of majority of SSD paralogs, many specialized (subfunctionalization), join or even established a new complex (neofunctionalization), ultimately leading to the bi-complex relationship. The conservative nature of interaction evolution after gene duplication is confirmed by the underrepresentation of bi-complex paralogs, compared to a null model where proteins are free to change their complex following the duplication (Table 1 and additional file 1, Table S2).
Examples of whole-complex duplications
Another case of whole-complex duplication involves a three-protein Sec61 complex (also referred to as a translocon, Figure 4b). This essential complex forms a channel in the ER membrane and mediates translocation of secretory and membrane proteins into the ER and also retrograde transport of misfolded proteins to the cytoplasm for degradation [29, 30]. The complex has duplicated in the course of evolution to form an Ssh1 translocon complex . The Ssh1 complex, a result of small scale duplications, also functions in co-translational import to the endoplasmic reticulum (an essential paralogous subunit Sec61p plays a post-translational role as well), and is required for normal growth rates.
Overhangs – lone paralogs
The final class of paralogs are overhangs, proteins without an association to a functional module, but with a paralog known to be involved in a protein complex (Figure 1). For SGD protein complexes, we found 58 such proteins, with no significant difference in contributions of WGD and SSD duplication types for most of the protein complex maps (see additional file 1, Table S3). Validation with TAP protein complex purification data shows virtually no association of overhangs with their paralog's module (average interaction spoke value for overhangs is 0.06 compared to 2.6 for their "in-module" partners). Additionally, compared to their paralogs, less functional data about overhangs is available. Perhaps predictably, 11/58 overhangs genes are unnamed genes (i.e. not described in a scientific publication), compared to all of their paralogs being named (Fisher's exact test, P < 0.01). Naming roughly reflects the state of our knowledge about the gene, and we further observe absence of annotation in Molecular Function (P < 0.02) and Biological Process (P < 0.01) classes of GO.
To further validate the role overhangs play in the cellular processes we counted the essential genes (inviable null mutants) among them. Even after excluding unnamed genes from this analysis, we have only 4/48 essential overhangs, compared to 17/58 of their in-module paralogs (Fisher's exact test P < 0.01, odds ratio 4.5). This corroborates with the hypothesis of Hart et al.  that essentiality is a product of the protein complex, rather than the individual protein. We conclude that overhangs play a much less important role in cell biology, at least in the rich medium conditions in which most of the functional studies are performed.
We observed that overhangs are less constrained by evolution on the sequence level. For WGD overhangs, we compared amino acid identity levels of paralogs against their Kluyveromyces waltii ortholog (there is a single ortholog in K. waltii, as this species diverged before the WGD event). The amino acid sequence of overhangs diverges significantly faster compared to their in-module paralogs (34% vs 40% global amino acid identity, one sided paired Wilcoxon signed rank test P < 0.02). We therefore conclude that being a part of the protein complex imposes certain constraints on divergence, and the process of orphaning coincides with an increased rate of sequence evolution.
A higher rate of protein sequence evolution and almost complete loss of interactions with an ancestral protein complex are manifestations of rapid functional divergence. The orphaned proteins are involved in different cellular processes: e.g., an overhang SSD1 (suppressor of SIT4 deletion, YDR293C), interacts with a TOR pathway, and functions in sustaining cell wall integrity , while its paralog DIS3 is a catalytic component of exosome , also involved in mitotic control . We measured the degree of function divergence of overhangs and their paralogs. Using semantic similarity based on Gene Ontologies (see Methods), genes were assigned values between 0 (for different function) and 1 (highly similar or identical function). We observe a rapid divergence of functionality for overhangs (additional file 1, Figure S1). This analysis hints to the overhangs as one of nature's methods to gene neofunctionalization.
Discussion and conclusion
For the paralogs participating in different complexes (bi-complex paralogs), we see a quantitative difference between duplicates of different age, with only a minority of bi-complex paralogs stemming from WGD. We attribute the higher representation of SSD paralogs to the time of the duplication. The mixture of functional data and the knowledge of their evolutionary history enabled us to reconstruct the evolutionary past of WGD paralogs. As bi-complex paralogs might have potentially undergone either neo- or subfunctionalization (see Figure 3a), we suggest, based on the examination of the association between complexes, that bi-complex paralogs could be examples of function specialization in the protein interaction network.
As observed in  there is no overrepresentation of whole modules being duplicated at the WGD event. A massive duplication is a unique opportunity for an organism to replicate components of its cellular machinery (e.g., protein complexes) and let it subsequently evolve independently, with each complex following its own evolutionary path. And even though it appears that gene pairs  and transcriptional network show features of partitioning into heavily intra-connected, but sparsely inter-connected clusters  at the protein complex level we did not observe large-scale duplications. Is it maybe that the ancestor of S. cerevisiae around 100 mln years ago had a chance to duplicate complexes as a whole, but missed the unique opportunity? Certainly the case of cytoplasmic ribosomes is an example of the ancestral yeast cell taking advantage of WGD event and doubling the subunit count in this protein complex. In fact, the completeness of the duplication of the cytoplasmic ribosome (both the small and large subunit) allowed the cell to maintain required equimolar concentrations of CRPs [37, 38] while doubling the gene repertoire, a goal not attainable by stepwise module growth and multiple small duplications.
In eukaryotes a single protein events (loss, duplication or gain) dominate the evolution of functional modules. Even though here we do not quantify the prokaryote/eukaryote difference, scientific literature indicates that multiple copies of a protein complex can be found in bacteria. In the case of Complex I submodules, homologs of some of the recruited proteins already performed a function together previous to their involvement in the new pathway, and were duplicated in parallel of shortly after each other. This type of modular evolution in prokaryotes includes a duplication of, sometimes sizable, complexes: we know that a formate hydrogenlyase complex (FHL) of E. coli is in close evolutionary relation to Complex I . Additionally, a duplication-prone FHL complex can be found in two copies in E. coli (FHL-1 and FHL-2), differing by only three subunits . This observations lead to the hypothesis that appearance of copies of protein complexes in prokaryotes may be associated with the operon structure. The whole module encoded by an operon could duplicate by means of a single, segmental duplication. Alternatively, related complexes could evolve independently in different bacterial species and then be brought together by the horizontal gene transfer of the whole operon. Either way separate, independently functioning copy of a module could, for example, become recruited as a submodule of a bigger protein complex . Interestingly, both E. coli FHL complexes are encoded by two operons, Hyf and Hyc .
Different fates of WGD paralogs involved in RSC complex.
WGD gene pair
n/n, synthetic lethal
mutually exclusive, both indispensable during fermentation, different deletion phenotypes
heterodimers, equal proportions, RSC30 duplicated post-WGD
n/n, phenotypic supression*
Overhangs do not, unlike their paralogs, participate in a protein complex. Direct interaction data confirm that overhangs do not seem to be associated with their paralog's protein complex. In our opinion features such as lower fraction of essential proteins or faster sequence evolution, make overhangs likely to be cases of neofunctionalization, initially working under relaxed evolutionary constraints. We hypothesize that overhangs released from the control of the ancestral protein complex, which are not purged from the genome (such as aforementioned SSD1/DIS3 pair), may form "seeds" for emerging complexes. This, accompanied with draft of additional subunits may to form novel complexes and ultimately become more embedded in the core cellular machinery.
WGD, SSD gene sets and the assessment of the age of duplications
WGD paralogs, genes that duplicated at the time of whole-genome duplication were taken from . SSD paralogs (Small Scale Duplications) represent the most recent, non-WGD gene duplications. The SSD list consists of best bi-directional hits, i.e., gene pairs (A, B), such that their alignment score is higher than alignments of A against any other gene in the genome, and higher than alignment of B with any other gene in the genome (self-alignments excluding). 87% of WGD genes pass the criterion of best bi-directional similarity and were excluded from the SSD dataset.
WGD and SSD types of paralogs both stem from the most recent duplication of a given gene. To determine whether SSD duplications preceded or followed the WGD event, it is enough to assess the phylogenetic distribution of paralogs in multiple fungi species. Using orthology data from  we established that SSD paralogs were present in two copies before the ancestor of yeast underwent the WGD event. More specifically, the analysis of the fungal gene trees  shows that among SSD paralogs which participate in complexes, out of 84 pairs only a single gene pair duplicated after the WGD event (RSC30/YHR054C) and only for eight SSD paralog pairs gene trees imply duplication after the divergence with S. pombe (see additional file 1, Table S5 and Methods).
From the initial analysis of our dataset of paralogs, it is apparent that genes involved in translation followed a distinct evolutionary route. Indeed, it is known from the literature that ribosomal protein sequences are highly constrained and many of the ribosomal protein pairs show exceptionally high levels of identity, likely subject to periodic gene conversion . Interestingly, almost all cytoplasmic ribosomal proteins (CRPs), in a stark contrast to mitochondrial RP, were retained in duplicate after the WGD event. What could be the raison d'être this massive duplication? The set of CRP paralogs has many distinguishing properties, such as (a) a very similar amino acid sequence to paralogs, (b) a high mRNA expression correlation between paralogs (Spearman's correlation coefficient 0.8, see Methods for details), (c) the whole functional class, with few exceptions, was duplicated at WGD. These features imply a low level of functional differentiation and possibly an mRNA dosage increase as an explanation for the retention of both duplicates in CRPs, although new provocative evidence suggests more functional divergence than expected . Nevertheless, such "Whole Ribosome Duplication" may signify the role the WGD event played in the evolution of anaerobic fermentation in yeast (compare with mitochondrial RP, additional file 1, Table S4).
Expression data were taken from the Gene Expression Omnibus (GEO) database  of the National Center for Biotechnology Information (NCBI), downloaded on 21 December 2006. Only multi-array datasets were considered, resulting in 357 microarray samples from 12 experiments, subsequently normalized (see additional file 1, Methods). We used Spearman's rank correlation coefficients to calculate the degree of co-expression between all gene pairs.
Multiple module definitions were used to avoid bias of a certain protein complex annotation and make sure that results obtained hold independent of various protein complex maps used. MIPS data on protein complexes were downloaded from The MIPS Comprehensive Yeast Genome Database (CYGD, http://mips.gsf.de/). The SGD GO complexes (in total 233 complexes, 1705 proteins) were generated by using the SGD GO component annotations (as of 9 May 2007) and then keeping only those components that have a GO description containing one of the following strings: complex, subunit, ribosome, proteasome, nucleosome, repairosome, degradosome, apoptosome, replisome, holoenzyme, snRNP. Only the lowest possible annotation level was maintained. Associations that where obtained from large scale experiments were removed.
We would like to thank Ken Wolfe and Gavin Conant for insightful comments. Authors are grateful to Like Fokkens and Jos Boekhorst for discussions, Martin Oti for co-expression dataset, Joanna Parmley for carefully reading the manuscript and Patrick Kemmeren for sharing protein complex data. We would also like to thank anonymous reviewers for their valuable comments. This work was supported by the Netherlands Genomics Initiative (Horizon programme).
- Ohno S: Evolution by Gene Duplication. 1970, London: Allen & UnwinView ArticleGoogle Scholar
- Wolfe KH, Shields DC: Molecular evidence for an ancient duplication of the entire yeast genome. Nature. 1997, 387: 708-13. 10.1038/42711.View ArticlePubMedGoogle Scholar
- Kellis M, Patterson N, Endrizzi M, Birren B, Lander ES: Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature. 2003, 423: 241-54. 10.1038/nature01644.View ArticlePubMedGoogle Scholar
- Byrne KP, Wolfe KH: The Yeast Gene Order Browser: Combining curated homology and syntenic context reveals gene fate in polyploid species. Genome Res. 2005, 15: 1456-146110. 10.1101/gr.3672305.PubMed CentralView ArticlePubMedGoogle Scholar
- Pereira-Leal JB, Levy ED, Kamp C, Teichmann SA: Evolution of protein complexes by duplication of homomeric interactions. Genome Biol. 2007, 8: R51-10.1186/gb-2007-8-4-r51.PubMed CentralView ArticlePubMedGoogle Scholar
- Pereira-Leal JB, Teichmann SA: Novel specificities emerge by stepwise duplication of functional modules. Genome Res. 2005, 15: 552-9. 10.1101/gr.3102105.PubMed CentralView ArticlePubMedGoogle Scholar
- Li L, Huang Y, Xia X, Sun Z: Preferential duplication in the sparse part of yeast protein interaction network. Mol Biol Evol. 2006, 23: 2467-73msl121. 10.1093/molbev/msl121.View ArticlePubMedGoogle Scholar
- Wapinski I, Pfeffer A, Friedman N, Regev A: Natural history and evolutionary principles of gene duplication in fungi. Nature. 2007, 449: 54-61. 10.1038/nature06107.View ArticlePubMedGoogle Scholar
- Musso G, Zhang Z, Emili A: Retention of protein complex membership by ancient duplicated gene products in budding yeast. Trends Genet. 2007, 23:Google Scholar
- Guan Y, Dunham MJ, Troyanskaya OG: Functional Analysis of Gene Duplications in Saccharomyces cerevisiae. Genetics. 2007, 175: 933-94310. 10.1534/genetics.106.064329.PubMed CentralView ArticlePubMedGoogle Scholar
- Hakes L, Pinney J, Lovell S, Oliver S, Robertson D: All duplicates are not equal: the difference between small-scale and genome duplication. Genome Biology. 2007, 8: R20910-10.1186/gb-2007-8-10-r209.View ArticleGoogle Scholar
- Amoutzias GD, Veron AS, Weiner J, Robinson-Rechavi M, Bornberg-Bauer E, Oliver SG, Robertson DL: One billion years of bZIP transcription factor evolution: conservation and change in dimerization and DNA-binding site specificity. Mol Biol Evol. 2007, 24: 827-35msl211. 10.1093/molbev/msl211.View ArticlePubMedGoogle Scholar
- Mewes HW, Amid C, Arnold R, Frishman D, Güldener U, Mannhaupt G, Münsterkötter M, Pagel P, Strack N, Stümpflen V, Warfsmann J, Ruepp A: MIPS: analysis and annotation of proteins from whole genomes. Nucleic Acids Res. 2004, 32: D41-4. 10.1093/nar/gkh092.PubMed CentralView ArticlePubMedGoogle Scholar
- Hong EL, Balakrishnan R, Dong Q, Christie KR, Park J, Binkley G, Costanzo MC, Dwight SS, Engel SR, Fisk DG, Hirschman JE, Hitz BC, Krieger CJ, Livstone MS, Miyasato SR, Nash RS, Oughtred R, Skrzypek MS, Weng S, Wong ED, Zhu KK, Dolinski K, Botstein D, Cherry JM: Gene Ontology annotations at SGD: new data sources and annotation methods. Nucleic Acids Res. 2008, 36: D577-81gkm909. 10.1093/nar/gkm909.PubMed CentralView ArticlePubMedGoogle Scholar
- Hart GT, Lee I, Marcotte E: A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality. BMC Bioinformatics. 2007, 8: 236-10.1186/1471-2105-8-236.PubMed CentralView ArticlePubMedGoogle Scholar
- Collins SR, Kemmeren P, Zhao X, Greenblatt JF, Spencer F, Holstege FCP, Weissman JS, Krogan NJ: Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae. Mol Cell Proteomics. 2007, 6: 439-50.View ArticlePubMedGoogle Scholar
- Gavin A, Aloy P, Grandi P, Krause R, Boesche M, Marzioch M, Rau C, Jensen LJ, Bastuck S, Dümpelfeld B, Edelmann A, Heurtier M, Hoffman V, Hoefert C, Klein K, Hudak M, Michon A, Schelder M, Schirle M, Remor M, Rudi T, Hooper S, Bauer A, Bouwmeester T, Casari G, Drewes G, Neubauer G, Rick JM, Kuster B, Bork P, et al: Proteome survey reveals modularity of the yeast cell machinery. Nature. 2006, 440: 631-6. 10.1038/nature04532.View ArticlePubMedGoogle Scholar
- Krogan NJ, Cagney G, Yu H, Zhong G, Guo X, Ignatchenko A, Li J, Pu S, Datta N, Tikuisis AP, Punna T, Peregrín-Alvarez JM, Shales M, Zhang X, Davey M, Robinson MD, Paccanaro A, Bray JE, Sheung A, Beattie B, Richards DP, Canadien V, Lalev A, Mena F, Wong P, Starostine A, Canete MM, Vlasblom J, Wu S, Orsi C, et al: Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature. 2006, 440: 637-64310. 10.1038/nature04670.View ArticlePubMedGoogle Scholar
- Planta RJ, Mager WH: The list of cytoplasmic ribosomal proteins of Saccharomyces cerevisiae. Yeast. 1998, 14: 471-7. 10.1002/(SICI)1097-0061(19980330)14:5<471::AID-YEA241>3.0.CO;2-U.View ArticlePubMedGoogle Scholar
- Kellis M, Birren BW, Lander ES: Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature. 2004, 428: 617-24. 10.1038/nature02424.View ArticlePubMedGoogle Scholar
- Angus-Hill ML, Schlichter A, Roberts D, Erdjument-Bromage H, Tempst P, Cairns BR: A Rsc3/Rsc30 Zinc Cluster Dimer Reveals Novel Roles for the Chromatin Remodeler RSC in Gene Expression and Cell Cycle Control. Molecular Cell. 2001, 7: 741-751. 10.1016/S1097-2765(01)00219-2.View ArticlePubMedGoogle Scholar
- Iwabe N, Kuma K, Hasegawa M, Osawa S, Miyata T: Evolutionary relationship of archaebacteria, eubacteria, and eukaryotes inferred from phylogenetic trees of duplicated genes. Proc Natl Acad Sci USA. 1989, 86: 9355-9. 10.1073/pnas.86.23.9355.PubMed CentralView ArticlePubMedGoogle Scholar
- de Lichtenberg U, Jensen LJ, Brunak S, Bork P: Dynamic Complex Formation During the Yeast Cell Cycle. Science. 2005, 307: 724-72710. 10.1126/science.1105103.View ArticlePubMedGoogle Scholar
- Mbonyi K, van Aelst L, Arguelles JC, Jans AW, Thevelein JM: Glucose-induced hyperaccumulation of cyclic AMP and defective glucose repression in yeast strains with reduced activity of cyclic AMP-dependent protein kinase. Mol Cell Biol. 1990, 10: 4518-4523.PubMed CentralView ArticlePubMedGoogle Scholar
- Hodge MR, Kim G, Singh K, Cumsky MG: Inverse regulation of the yeast COX5 genes by oxygen and heme. Mol Cell Biol. 1989, 9: 1958-1964.PubMed CentralView ArticlePubMedGoogle Scholar
- Burke PV, Raitt DC, Allen LA, Kellogg EA, Poyton RO: Effects of Oxygen Concentration on the Expression of Cytochrome c and Cytochrome c Oxidase Genes in Yeast. J Biol Chem. 1997, 272: 14705-1471210. 10.1074/jbc.272.23.14705.View ArticlePubMedGoogle Scholar
- Steinmetz LM, Scharfe C, Deutschbauer AM, Mokranjac D, Herman ZS, Jones T, Chu AM, Giaever G, Prokisch H, Oefner PJ, Davis RW: Systematic screen for human disease genes in yeast. Nat Genet. 2002, 31: 400-4.PubMedGoogle Scholar
- Hillenmeyer ME, Fung E, Wildenhain J, Pierce SE, Hoon S, Lee W, Proctor M, St Onge RP, Tyers M, Koller D, Altman RB, Davis RW, Nislow C, Giaever G: The chemical genomic portrait of yeast: uncovering a phenotype for all genes. Science. 2008, 320: 362-5. 10.1126/science.1150021.PubMed CentralView ArticlePubMedGoogle Scholar
- Romisch K: Surfing the Sec61 channel: bidirectional protein translocation across the ER membrane. J Cell Sci. 1999, 112: 4185-4191.PubMedGoogle Scholar
- Sommer T, Wolf DH: Endoplasmic reticulum degradation: reverse protein flow of no return. FASEB J. 1997, 11: 1227-33.PubMedGoogle Scholar
- Robb A, Brown JD: Protein transport: two translocons are better than one. Mol Cell. 2001, 8: 484-6. 10.1016/S1097-2765(01)00339-2.View ArticlePubMedGoogle Scholar
- Kaeberlein M, Guarente L: Saccharomyces cerevisiae MPT5 and SSD1 function in parallel pathways to promote cell wall integrity. Genetics. 2002, 160: 83-95.PubMed CentralPubMedGoogle Scholar
- Mitchell P, Petfalski E, Shevchenko A, Mann M, Tollervey D: The exosome: a conserved eukaryotic RNA processing complex containing multiple 3'-->5' exoribonucleases. Cell. 1997, 91: 457-66. 10.1016/S0092-8674(00)80432-8.View ArticlePubMedGoogle Scholar
- Noguchi E, Hayashi N, Azuma Y, Seki T, Nakamura M, Nakashima N, Yanagida M, He X, Mueller U, Sazer S, Nishimoto T: Dis3, implicated in mitotic control, binds directly to Ran and enhances the GEF activity of RCC1. EMBO J. 1996, 15: 5595-605.PubMed CentralPubMedGoogle Scholar
- van Noort V, Snel B, Huynen MA: Predicting gene function by conserved co-expression. Trends in Genetics. 2003, 19: 238-242. 10.1016/S0168-9525(03)00056-8.View ArticlePubMedGoogle Scholar
- Conant GC, Wolfe KH: Functional partitioning of yeast co-expression networks after genome duplication. PLoS Biol. 2006, 4: e109-10.1371/journal.pbio.0040109.PubMed CentralView ArticlePubMedGoogle Scholar
- Warner JR: The economics of ribosome biosynthesis in yeast. Trends Biochem Sci. 1999, 24: 437-40. 10.1016/S0968-0004(99)01460-7.View ArticlePubMedGoogle Scholar
- Planta RJ: Regulation of ribosome synthesis in yeast. Yeast. 1997, 13: 1505-18. 10.1002/(SICI)1097-0061(199712)13:16<1505::AID-YEA229>3.0.CO;2-I.View ArticlePubMedGoogle Scholar
- Li H, Pellegrini M, Eisenberg D: Detection of parallel functional modules by comparative analysis of genome sequences. Nat Biotechnol. 2005, 23: 253-60nbt1065. 10.1038/nbt1065.View ArticlePubMedGoogle Scholar
- Weiss H, Friedrich T, Hofhaus G, Preis D: The respiratory-chain NADH dehydrogenase (complex I) of mitochondria. Eur J Biochem. 1991, 197: 563-76. 10.1111/j.1432-1033.1991.tb15945.x.View ArticlePubMedGoogle Scholar
- Finel M: Organization and evolution of structural elements within complex I. Biochim Biophys Acta. 1998, 1364: 112-219593850. 10.1016/S0005-2728(98)00022-X.View ArticlePubMedGoogle Scholar
- Huynen MA, Gabaldón T, Snel B: Variation and evolution of biomolecular systems: searching for functional relevance. FEBS Lett. 2005, 579 (8): 1839-1845. 10.1016/j.febslet.2005.02.004.View ArticlePubMedGoogle Scholar
- Cairns BR, Lorch Y, Li Y, Zhang M, Lacomis L, Erdjument-Bromage H, Tempst P, Du J, Laurent B, Kornberg RD: RSC, an essential, abundant chromatin-remodeling complex. Cell. 1996, 87: 1249-60. 10.1016/S0092-8674(00)81820-6.View ArticlePubMedGoogle Scholar
- Scannell DR, Byrne KP, Gordon JL, Wong S, Wolfe KH: Multiple rounds of speciation associated with reciprocal gene loss in polyploid yeasts. Nature. 2006, 440: 341-5. 10.1038/nature04562.View ArticlePubMedGoogle Scholar
- Komili S, Farny NG, Roth FP, Silver PA: Functional Specificity among Ribosomal Proteins Regulates Gene Expression. Cell. 2007, 131: 557-571. 10.1016/j.cell.2007.08.037.PubMed CentralView ArticlePubMedGoogle Scholar
- Edgar R, Domrachev M, Lash AE: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30: 207-10. 10.1093/nar/30.1.207.PubMed CentralView ArticlePubMedGoogle Scholar
- Collins SR, Miller KM, Maas NL, Roguev A, Fillingham J, Chu CS, Schuldiner M, Gebbia M, Recht J, Shales M, Ding H, Xu H, Han J, Ingvarsdottir K, Cheng B, Andrews B, Boone C, Berger SL, Hieter P, Zhang Z, Brown GW, Ingles CJ, Emili A, Allis CD, Toczyski DP, Weissman JS, Greenblatt JF, Krogan NJ: Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map. Nature. 2007, 446: 806-10. 10.1038/nature05649.View ArticlePubMedGoogle Scholar
- He B, Chen P, Chen SY, Vancura KL, Michaelis S, Powers S: RAM2, an essential gene of yeast, and RAM1 encode the two polypeptide components of the farnesyltransferase that prenylates a-factor and Ras proteins. Proc Natl Acad Sci USA. 1991, 88: 11373-7. 10.1073/pnas.88.24.11373.PubMed CentralView ArticlePubMedGoogle Scholar
- Witter DJ, Poulter CD: Yeast geranylgeranyltransferase type-II: steady state kinetic studies of the recombinant enzyme. Biochemistry. 1996, 35: 10454-63. 10.1021/bi960500y.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.