Evolutionary rate depends on number of protein-protein interactions independently of gene expression level: Response
BMC Evolutionary Biology volume 4, Article number: 14 (2004)
A response to Fraser HB, Hirsh AE: Evolutionary rate depends on number of protein-protein interactions independently of gene expression level. BMC Evol Biol 2004, 4: 13
Evolving proteins are under selection for the ability to perform precise biochemical functions at minimal metabolic cost in a complex cellular environment. One way to investigate these selective pressures is to examine which factors influence the rate of protein sequence evolution. In a study published in 2002, Fraser et al. suggested that proteins that participate in more protein-protein interactions are under greater evolutionary constraint [1]. The basis for this claim was a weak but statistically significant correlation between a protein's rate of sequence evolution and its number of interaction partners, as measured by various studies of protein-protein interactions in yeast. However, subsequent studies found this correlation to be highly dependent on the particular choice of protein-protein interaction data set [2, 3].
We resolved this controversy by demonstrating that the correlation between evolutionary rate and the number of interaction partners is linked to a bias towards counting more interactions for abundant proteins. Abundant proteins evolve more slowly [5], and some studies are biased towards finding more protein-protein interactions for abundant proteins [6]. Only those data sets that are biased towards finding more interactions for abundant proteins suggest a correlation between evolutionary rate and the number of interaction partners (Figure 1). Some of our findings have subsequently been echoed by others [7].
Now, Fraser and Hirsh again argue for a meaningful connection between the number of interaction partners and evolutionary rate. We still cannot agree with their analysis. First, we note that the single data set they have re-analyzed is precisely the one that we identified as being the most biased (Figure 1). Their choice to count interactions only for the untagged proteins in mass-spectrometry studies not only fails to account for effects due to the choice of which protein to overexpress (as an interaction is inherently at least pairwise), but in fact increases the net bias in this data set. Fraser and Hirsh also use partial correlation statistics to argue that abundance does not account for all of the correlation. While it is true that some of the data sets still show a statistically significant partial correlation (as we noted in [4]), statistical tests are only as good as the data to which they are applied and are not a substitute for carefully inspecting the effects of biases in individual data sets. Figure 1 shows a direct linear relationship between the apparent correlation and the bias of the data set, and data sets with no bias show no correlation.
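To make the statistical point concrete, the following is a minimal sketch of how a shared dependence on abundance can by itself produce an apparent correlation between evolutionary rate and interaction count, and of what a rank-based partial correlation does with it. Everything here is an illustrative assumption rather than anything taken from the studies discussed: the data are synthetic, the variable names and effect sizes (0.6) are arbitrary, and the "detected interactions" variable is a continuous stand-in for a count. By construction there is no direct link between rate and interactions in this toy model.

```python
import numpy as np

def ranks(values):
    """Return 0-based ranks of a 1-D array (assumes no ties)."""
    order = np.argsort(values)
    r = np.empty(len(values), dtype=float)
    r[order] = np.arange(len(values))
    return r

def corr(x, y):
    """Pearson correlation coefficient of two 1-D arrays."""
    return float(np.corrcoef(x, y)[0, 1])

def partial_corr(x, y, z):
    """First-order partial correlation of x and y, controlling for z."""
    r_xy, r_xz, r_yz = corr(x, y), corr(x, z), corr(y, z)
    return (r_xy - r_xz * r_yz) / np.sqrt((1 - r_xz**2) * (1 - r_yz**2))

# Synthetic proteins (hypothetical numbers, chosen only for illustration).
rng = np.random.default_rng(0)
n = 2000
log_abundance = rng.normal(size=n)
# Abundant proteins evolve more slowly.
rate = -0.6 * log_abundance + rng.normal(size=n)
# A biased assay reports more interactions for abundant proteins;
# note that 'rate' does not enter this line at all.
detected = 0.6 * log_abundance + rng.normal(size=n)

# Work on ranks so that both statistics are Spearman-type.
rk_rate, rk_det, rk_ab = ranks(rate), ranks(detected), ranks(log_abundance)

raw = corr(rk_rate, rk_det)
partial = partial_corr(rk_rate, rk_det, rk_ab)

print(f"rank correlation, rate vs. detected interactions: {raw:.3f}")
print(f"partial rank correlation, controlling for abundance: {partial:.3f}")
```

In this idealized setting the raw correlation is clearly negative while the partial correlation is close to zero, because abundance is known exactly and the bias is simple and linear. Real data sets need not satisfy either condition, which is why the sketch should be read as an illustration of the confounding mechanism and not as a test of any particular data set.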
Fraser and Hirsh comment that some of our previous analysis was based on expression levels measured in an aneuploid strain of yeast. This is true but irrelevant, since we observe identical trends if we quantify abundance using codon adaptation index or expression levels from the microarray study preferred by Fraser and Hirsh (data not shown).
We readily acknowledge the possibility that there is a real connection between the number of interaction partners and evolutionary rate hidden in all the noise and biases. However, we feel that the appropriate null hypothesis is that there is no correlation, and we do not believe this null hypothesis has been convincingly disproven.
Hirsh and Fraser's original claim [1] rested on the idea that evolutionary constraints due to protein-protein interactions could be represented by a protein's total number of unique interaction partners. We suggest that if interactions do impose constraints on sequence evolution, these constraints are likely to depend on more subtle factors, such as the fraction of a protein's residues directly involved in an intermolecular contact or the total number of monomers present in a macromolecular complex. In fact, one study has investigated the effect of an interaction's type (transient or stable), although this analysis also failed to control for protein abundance [9].
The ultimate lesson of this controversy is that the complexities and interdependencies of protein evolutionary constraints must be properly controlled for. Many factors have now been investigated for their effects on protein evolutionary rate, and one of the interesting conclusions is that protein abundance has a far greater effect [5] than other, more intuitively appealing factors such as protein dispensability [10–12] or the number of interaction partners. The best studies have acknowledged this fact by carefully controlling for protein abundance (see for example [13]), and we suggest that this should become standard procedure in the future.
1. Fraser HB, Hirsh AE, Steinmetz LM, Scharfe C, Feldman MW: Evolutionary rate in the protein interaction network. Science. 2002, 296: 750-752. 10.1126/science.1068696.
2. Jordan IK, Wolf YI, Koonin EV: No simple dependence between protein evolution rate and the number of protein-protein interactions: only the most prolific interactors tend to evolve slowly. BMC Evol Biol. 2003, 3: 1. 10.1186/1471-2148-3-1.
3. Fraser HB, Wall DP, Hirsh AE: A simple dependence between protein evolution rate and the number of protein-protein interactions. BMC Evol Biol. 2003, 3: 11. 10.1186/1471-2148-3-11.
4. Bloom JD, Adami C: Apparent dependence of protein evolutionary rate on the number of interactions is linked to biases in protein-protein interactions data sets. BMC Evol Biol. 2003, 3: 21. 10.1186/1471-2148-3-21.
5. Pal C, Papp B, Hurst LD: Highly expressed genes in yeast evolve slowly. Genetics. 2001, 158: 927-931.
6. von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, Bork P: Comparative assessment of large-scale data sets of protein-protein interactions. Nature. 2002, 417: 399-403. 10.1038/nature750.
7. Hahn MW, Conant GC, Wagner A: Molecular evolution in large genetic networks: does connectivity equal constraint?. J Mol Evol. 2004, 58: 203-211. 10.1007/s00239-003-2544-0.
8. Fraser HB, Hirsh AE: Evolutionary rate depends on number of protein-protein interactions independently of gene expression level. BMC Evol Biol. 2004, 4: 13. 10.1186/1471-2148-4-13.
9. Teichmann SA: The constraints protein-protein interactions place on sequence divergence. J Mol Biol. 2002, 324: 399-407. 10.1016/S0022-2836(02)01144-0.
10. Jordan IK, Rogozin IB, Wolf YI, Koonin EV: Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. Genome Res. 2002, 12: 962-968. 10.1101/gr.87702. Article published online before print in May 2002.
11. Pal C, Papp B, Hurst LD: Rate of evolution and gene dispensability. Nature. 2003, 421: 496-498. 10.1038/421496b.
12. Yang J, Gu L, Li WH: Rate of protein evolution versus fitness effect of gene deletion. Mol Biol Evol. 2003, 20: 772-774. 10.1093/molbev/msg078.
13. Pal C, Papp B, Hurst LD: Does recombination rate affect the efficiency of purifying selection? the yeast genome provides a partial answer. Mol Biol Evol. 2001, 18: 2323-2001.