- Research article
- Open Access
Mutational dynamics of murine angiogenin duplicates
© Codoñer et al; licensee BioMed Central Ltd. 2010
- Received: 28 May 2010
- Accepted: 15 October 2010
- Published: 15 October 2010
Angiogenin (Ang) is a protein involved in angiogenesis by inducing the formation of blood vessels. The biomedical importance of this protein has come from findings linking mutations in Ang to cancer progression and neurodegenerative diseases. These findings highlight the evolutionary constraint on the Ang amino acid sequence. However, previous studies comparing human Angiogenin with homologs from other phylogenetically related organisms have led to the conclusion that Ang presents a striking variability. Whether this variability has an adaptive value per se remains elusive. Understanding why many functional Ang paralogs have been preserved in mouse and rat and identifying functional divergence mutations at these copies may explain the relationship between mutations and function. In spite of the importance of testing this hypothesis from the evolutionary and biomedical perspectives, this has not yet been accomplished. Here we test the main mutational dynamics driving the evolution and function of Ang paralogs in mammals.
We analysed the phylogenetic asymmetries between the different Ang gene copies in mouse and rat in the context of vertebrate Ang phylogeny. This analysis shows strong evidence in support of accelerated evolution in some murine Ang copies (mAng). This acceleration is not due to non-functionalisation because constraints on amino acid replacements remain strong. We identify many of the amino acid sites involved in signal localization and nucleotide binding by Ang to have evolved under diversifying selection. Compensatory effects of many of the mutations at these paralogs and their key structural location in or near important functional regions support a possible functional shift (functional divergence) in many Ang copies. Similarities between 3D-structural models for mAng copies suggest that their divergence is mainly functional.
We identify the main evolutionary dynamics shaping the variability of Angiogenin in vertebrates and highlight the plasticity of this protein after gene duplication. Our results suggest functional divergence among mAng paralogs. This puts forward mAng as a good candidate system for testing the functional plasticity of such an important protein, while stressing caution when using mouse as a model to infer the consequences of mutations in the single Ang copy of humans.
- Gene Duplication
- Duplication Event
- Adaptive Evolution
- Selective Constraint
- Amino Acid Site
Angiogenin (Ang) is a 14 kDa protein that belongs to the pancreatic ribonuclease A (RNase) superfamily [1–3], and is involved in angiogenesis by inducing the formation of blood vessels [3, 4]. Ang is over-expressed in tumoral cancer cells, and inhibition of Ang function through protein-protein interactions blocks the establishment, progression and metastasis of tumours in mice [6–11]. Ang may function as a tRNA-specific ribonuclease that binds to actin on the surface of endothelial cells; once bound, angiogenin is translocated to the nucleus, promoting the endothelial invasiveness necessary for blood vessel formation. The biomedical importance of this protein has recently been underlined by studies that have associated point mutations in Ang with neurodegenerative diseases such as amyotrophic lateral sclerosis [12–14].
Human Ang (hAng) has been widely studied and was the first angiogenin to be isolated, from human colon adenocarcinoma cells. Crystallization of the hAng protein in 1994 has been instrumental for many molecular and biomedical studies; however, little insight has been achieved regarding the structural and functional constraints on Ang mutational dynamics. Despite the important function of Ang, and therefore its expected evolutionary conservation, many research groups found this protein to be evolutionarily variable, probably linked to the divergent function between hAng and angiogenin from other organisms. For example, hAng exhibits a ribonucleolytic activity that is weaker than that of bovine pancreatic RNase A, around 10^5 to 10^6 times less efficient [17–19], probably due to a single amino acid substitution at position 117 of the protein. It cleaves preferentially on the 3' side of pyrimidines and follows a transphosphorylation/hydrolysis mechanism when inducing angiogenesis, differing from bovine pancreatic RNase A not only in magnitude but also in specificity. Whether the functional plasticity of angiogenin is correlated with an evolutionary plasticity remains to be tested. To conduct this test it is important to define the set of functional domains and amino acid sites that provide Ang with its function and to identify the evolutionary/functional potential of this protein, that is, its potential to evolve towards novel functions. Functional and comparative structural analyses have been paramount to unravel key sites for Ang function (see for example [16, 21–23]). Many of these studies have specifically assigned functions to particular amino acid sites within the Ang protein. For example, His13, Lys40 and His114 have been shown to be essential in the catalytic activity of Ang [24–26].
Mouse is the model used to study the implications of mutations at Angiogenin in some human illnesses, and studies of murine Angiogenin (mAng) have highlighted a number of other amino acid sites essential for its activity, including: i) the B1 binding site comprising Thr44 and Ser118; ii) the poorly conserved B2 binding site, which binds a purine ring on the opposite side of the scissile bond, with Glu108 being key at this functional domain [16, 26]; iii) the P2 site that facilitates, in conjunction with the B2 binding site, the binding of Ang to the nucleus of the cell, an activity attributed to the amino acids Arg5 and His8; and iv) another putative binding site that has been described to be required for Ang activity, covering the range of residues Asn59 to Asn68, residues Ala108 to Phe110 and residue Asn119. Previous studies pointed to the possible implication of some of these residues (for example residues Glu58 to Lys70) in the binding of Ang to the cell and in causing aggregation rather than purine binding as in the case of RNase A [28, 29]. In addition to these regions, there is a nuclear localization signal that spans amino acids Arg31-Leu35 of mAng.
Because gene duplication has generated variable copy numbers of mAng, mAng becomes a questionable model to infer the effect of mutations in hAng: the functional constraints on amino acid sites in Ang may have changed after gene duplication. This problem is magnified in mice, for which six different paralogous Ang genes have so far been described (mAng1 to mAng6), all resulting from tandem duplications of the hAng ortholog (mAng1). Only four out of the six copies (mAng1 to mAng4) have been tested for activity. Among these, mAng1, mAng3 and mAng4 present ribonucleolytic and angiogenic activity (Nobile et al., 1996; Fu et al., 1999; Crabtree et al., 2007a).
Sites involved in Ang activity have been identified through comparative structural analyses and functional data [31–33]. Moreover, mAng2 has been reported to lack angiogenic activity and has been considered to be a pseudo-gene . To date, no function has been reported for either mAng5 or mAng6. Recent studies have provided evidence for the action of diversifying selection post-dating the duplication events that gave rise to five of the mAng1 paralogs [34–37]. Adaptive evolution has also been found in the duplicated gene of Ang from rats (rAng)  and primates [38–40]. Aside from these studies, exhaustive analyses of evolutionary dynamics and structural constraints at this gene in mice remain unperformed.
Here we present an evolutionary study of the duplicated Ang genes in mice to identify amino acid regions that may have played key roles in its functional diversification. We test for the fixation of adaptive amino acid replacements after mAng duplication events to identify shifts in the functional constraints of amino acids after gene duplication and we explore the structural consequences of such shifts. We finally discuss the putative functional roles of the different Ang proteins in mouse based on our evolutionary analyses.
Evolutionary history of Ang
Among the most interesting observations from the tree is that repeated Ang gene duplication events occurred during the radiation of the murine and rat clades (Figure 1). These duplication events led to the asymmetric expansion of the Ang proteins in both these lineages, with the mouse lineage showing a substantial number of paralogs (6 paralogs corresponding to five duplication events) as compared to the rat lineage, which showed only two paralogs (Figure 1). Rat paralogs present symmetric branch lengths and hence equal evolutionary rates. Unlike rat, mouse paralogs show substantial differences in their evolutionary rates (for example, mAng2, mAng6 and mAng4 present long branches compared to their paralogs), hinting at their possible functional divergence after gene duplication. The nature and consequences of this functional divergence are elusive and more analyses are needed to determine whether such divergence led to neo-functionalisation or sub-functionalisation of the paralogous copies. Mice present large effective population sizes in comparison to humans and hence the probability of neo-functionalisation in mice is greater than in humans. Theoretical and population genetics data predict that in large populations strong constraints act against slightly deleterious mutations, hindering the subsequent fixation of compensatory mutations, and consequently the probability of sub-functionalisation is lower than that of neo-functionalisation. Regardless of the final outcome, the asymmetry between mouse paralogs points to the fixation of bursts of mutations by adaptive evolution, which may have driven the functional diversification of mAng copies. The asymmetry in mouse angiogenin paralogs is substantial, but what is the selective value of this asymmetry, and what changes have been essential for mAng functional diversification?
Diversifying selection has driven the evolution of mAng copies
To identify events of diversifying selection in the murine angiogenin, we applied the maximum likelihood based models implemented in the program CODEML of the PAML package. In addition, we applied the parsimony sliding window approach implemented in the program SWAPSC. This second approach was convenient for several reasons: i) The sliding window approach allows identifying regions of the proteins under diversifying selection, or other non-neutral selective constraints such as accelerated rates of evolution and mutation hotspots; and ii) SWAPSC accounts for nucleotide bias and for non-neutral evolution of synonymous sites.
Branch F (Figure 2), which leads to the ancestor of mAng3 and mAng5, was detected to be under positive selection using CODEML, as the estimated ω = ∞ was greater than one; however, synonymous changes were estimated to be 0 and therefore this result should be interpreted with care. In Branch A we found several sites under positive selection, including amino acids spanning the region between Met30 and Gly34. Importantly, this region includes the sites responsible for the nuclear localization of the protein. At Branch G several sites were found to be under positive selection using both approaches, including residues Ser52 to Gly62. Although the functional importance of some of these amino acids is as yet unknown, some of the residues detected to have undergone positive selection have been reported to be close to or directly involved in the nuclear localization region of mAng (for example amino acids included in the region Arg31 to Leu35). This could be an indication that functional divergence of mAng6 was due to positively selected amino acid replacements at these sites.
Inspecting the rest of the branches leading to the different duplicates, we found that branch H, which leads to mAng3 (Figure 2), seems to have been evolving under negative selection, with ω = 0.1912 and ω = 0.1329 for maximum likelihood and sliding window, respectively. In contrast, some branches, such as branch D, which leads to the ancestor of mAng3, mAng5 and mAng6, seem to have been evolving neutrally (ω = 0.951). Similar results were obtained for branch E (ω = 1.015), which corresponds to the mAng4 lineage, and branch I (ω = 0.964), which leads to mAng5. This neutral evolution points to the possible non-functionalization of these copies, but as mAng4 and mAng3 have been shown to be functional, we expect mAng5 to be functional as well; we show further analyses that point to this hypothesis as the most plausible one.
The case of branch C, mAng2 (Figure 2), was interesting because it seems to have been evolving at the same rate (ω = 0.618) as the rest of the species included in the phylogenetic analysis (ω = 0.556). It has been described that mAng2 is a possible pseudo-gene; however, we have not detected any evidence for relaxed constraints in this lineage in comparison with other lineages of the mammalian tree that could suggest neutral fixation of amino acid replacing nucleotide substitutions (that is, ω = 0.556 << 1). On the contrary, mAng2 seems to have undergone moderate purifying selection.
Gene expression is one of the main factors affecting evolutionary rates, with highly expressed genes being the most conserved. To account for this when comparing non-synonymous-to-synonymous rate ratios among mAng gene copies, we investigated the expression of each of the copies using the codon adaptation index (CAI) as a proxy for gene expression. CAI was calculated using the webpage http://www.cbib.u-bordeaux2.fr/pise/cai.html. The values of non-synonymous-to-synonymous nucleotide substitutions are not due to different expression levels of the gene copies because, on average, the different mAng copies presented similar expression levels (CAI was estimated to be 0.245, 0.235, 0.232, 0.247, 0.243 and 0.237 for mAng copies 1 to 6, respectively). These gene copies also presented expression levels similar to that of hAng (CAI = 0.262). Differences in evolutionary rates were therefore not due to differences in expression levels among duplicates.
The fact that these copies remain in the proteome of mouse argues against previous studies suggesting non-functionalisation. In addition, all post-duplication lineages presented similar intensities of selection except the pairs of post-duplication lineages F-G (leading to mAng6 and ancestor of mAng5-mAng3 respectively) and H-I that lead to mAng5 and mAng3, respectively (Figure 2). The elevated ω values are more consistent with shifts in the evolutionary rates after gene duplication and with the possible functional divergence of the resulting paralogous copies. In the first pair (F-G lineages), both post-duplication lineages underwent adaptive evolution (that is, ω > 1), indicating possible functional divergence type II (as defined in ). Functional divergence type II involves a change of the ancestral amino acid at a particular amino acid site of the protein after gene duplication. This replacement involves the fixation of two different residues in the post-duplication lineages and their high conservation after the speciation of each of the copies due to their different but equally important functional role in each of the paralogs. Conversely, both post-duplication lineages in the second pair (H-I) evolved under purifying selection, although mAng5 presented significantly accelerated rates of evolution compared to mAng3, indicating possible functional divergence type I (as defined by ). Unlike functional divergence type II, type I involves the fixation of a function-conferring residue mutation in one of the paralogs, where it becomes highly constrained, while this amino acid site evolves neutrally in the other, where amino acid replacements occur with no functional consequences.
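As an illustration of the quantity used above, the following is a minimal sketch of a codon adaptation index calculation, assuming the usual definition (geometric mean of each codon's relative adaptiveness with respect to a reference gene set). The function names, the reference set and the handling of codons absent from the reference are assumptions made here for illustration; this is not the implementation behind the web server used in the study.

```python
import math
from collections import Counter
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def split_codons(seq):
    """Split a coding sequence into complete codons (trailing bases dropped)."""
    seq = seq.upper()
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def relative_adaptiveness(reference_seqs):
    """w(codon) = count / highest count among its synonymous codons."""
    counts = Counter(c for s in reference_seqs for c in split_codons(s)
                     if c in CODON_TABLE)
    weights = {}
    for aa in set(AMINO) - {"*"}:
        family = [c for c, a in CODON_TABLE.items() if a == aa]
        top = max(counts[c] for c in family)
        for c in family:
            if top > 0 and counts[c] > 0:
                weights[c] = counts[c] / top
    return weights


def cai(seq, weights):
    """Geometric mean of w; Met, Trp and codons unseen in the reference are skipped."""
    logs = [math.log(weights[c]) for c in split_codons(seq)
            if c in weights and CODON_TABLE[c] not in "MW"]
    return math.exp(sum(logs) / len(logs)) if logs else float("nan")
```

Skipping codons that never occur in the reference set is a simplification chosen for brevity; other treatments (for example pseudocounts) would give slightly different values.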
Co-evolution between residues proximal to functional regions in Ang
Relaxed selection is a common phenomenon after gene duplication and it can take place in one or both copies of the gene because of gene redundancy [46, 47]. One of the gene copies may therefore accumulate deleterious mutations while the other copy remains under strong purifying selection to preserve the ancestral function. The most expected fate for one of the gene copies is non-functionalization followed by its disintegration within a few million years of evolution, depending on the effective population size of the organism. The two copies of a gene can persist in the genome either if the combined function of both paralogs performs the ancestral function (sub-functionalization) or if one copy reproduces the ancestral function while the other diverges towards other functions (neo-functionalisation). Survival of a pseudo-gene in the genome for long evolutionary periods is very unlikely, and therefore copies that remain are likely to be functional. However, evolution of gene copies after duplication can be very complex and up to twelve models have recently been proposed to account for all possible evolutionary scenarios. Based on this assumption, we examined whether the mAng gene copies that were kept in the genome underwent functional divergence after duplication. Functional divergence is likely to happen in two ways: i) classic functional divergence involves the accumulation of functionally innovative advantageous mutations in one of the gene copies [44, 45]; or alternatively ii) after gene duplication, functionally innovative but structurally destabilizing mutations may have become fixed once they have been compensated for by other mutations (compensatory co-evolution): in a normal physiological background the effect of both mutations is neutral, but the phenotypic advantage of the destabilizing mutation may be expressed under novel environmental conditions.
Residues in the Ang protein involved in intramolecular coevolution
Groups of coevolution
Detection of compensatory mutations
Identification of functionally important residues using co-evolutionary analyses
Conserved Sites (4Å close)
Putative Binding Site
H84, Q93, R95
K40, D41, I42, C92
T80, C81, K82, R121
S74, S75, N102
I56, N63, R70, I71, S72, K73, R101
D23, H84, Q93, R95, A96
K40, D41, C92
D23, H84, R95, A96, T97, R122
T44, T79, C81, K82, F120
D23, H84, G86, P91, Q93, R95, A96
C39, K40, D41, S87, C92
D23, H84, R95, A96, T97
T36, G86, W89, P90, P91, Q93
C39, K40, S87, C92
R21, D22, H47, Q77, R95, A96, T97, A98, G99, F100
V78, T79, T80, C81
Structural bases of amino acid variability in mAng
Root Mean Square Deviation (RMSD) between the modeled structures for murine ANG protein paralogs
mAng1 vs mAng2
mAng1 vs mAng3
mAng1 vs mAng4
mAng1 vs mAng5
mAng1 vs mAng6
hAng vs mAng1
Similarly to mAng2, mAng5 shows a structural departure from mAng1 in the active centre with evidence of activity blockage (Figure 4). This supports the previously observed poor angiogenic activity and the suggested involvement of mAng in other processes .
Finally, our selective constraints analyses show evidence of adaptive evolution in mAng6. The fact that our structural modelling detects a structurally different active centre in this protein compared to other protein copies where we detect adaptive evolution suggests functional divergence after the split between mAng6 and the remaining Ang copies. Furthermore, intra-molecular co-evolutionary analyses show evidence of compensatory mutation events located near important functional regions of the Ang protein. For example, Pro18 and Thr36, frequently identified as coevolving with many other residues in the structure, are probably responsible for the operability of the active site, as their location suggests a role in maintaining the proper orientation of sites His13 and Thr44, which belong to the active site. The rest of the coevolving pairs mostly surround the active site, which comprises amino acids His13, Thr44 and His114. Others are either included within or surrounding the nuclear peptide signal (Arg31-Leu35). These results therefore indicate that, in general, amino acid sites close to or included in important protein domains may have coevolved to maintain the structural features necessary for the proper functional activity of Angiogenin. We have also detected two groups of compensatory mutations in mAng after duplication. The importance of these sites is further enhanced by their location in or near amino acids Arg31 to Leu35, which have been described to be involved in the nuclear localization of the protein in mouse.
Even though all the duplicates in mouse are different from mAng1, and nothing has been described for mAng5 and mAng6, there is no reason to think that these copies are non-functional. Remarkably, the non-nuclear localization of mAng6 has been reported. The identification of positive selection in amino acids involved in the nuclear localization of the protein may be related to a functional shift of this mouse angiogenin copy. Moreover, mAng2 seems to be functional, in contrast to previous reports.
In conclusion, our analyses yield novel results that shed light on the mutational dynamics of Angiogenin when exploring diversifying natural selection and functional divergence. We present the structural and functional interpretations for these evolutionary dynamics and provide a list of amino acid sites that are likely to have a functional impact on the mAng copies and are suitable for further experimental testing. We also reject the previously supported non-functionalization of duplicated mAng based on evolutionary and structural analyses, and we invite researchers to investigate the roles and redundancies of duplicated Angiogenin genes in more detail.
DNA and protein sequences
Accession numbers for the DNA and protein sequences of the Angiogenin protein used in the analysis
A molecular evolutionary model was first fitted to the protein alignment using PROTTEST 1.0.6. Phylogenetic trees for proteins were obtained using PHYML with the best evolutionary model found in PROTTEST. Confidence in the nodes of the tree was tested by first building 1000 bootstrap pseudo-replicate alignments using the BOOTSTRAP program implemented in the PHYLIP package v3.67 (J. Felsenstein, U. Washington, freely available at http://evolution.genetics.washington.edu/phylip/getme.html). We then used PHYML to obtain 1000 trees from these replicates. The consensus tree was obtained using the majority rule approach implemented in the CONSENSE program in the PHYLIP package.
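The resampling idea behind the bootstrap step can be sketched as follows. This is a minimal illustration, not the PHYLIP or PHYML code: the function names are hypothetical, the tree inference itself is left out, and each replicate tree is assumed to be summarised as a set of clades (each clade a frozenset of taxon names).

```python
import random
from collections import Counter


def bootstrap_alignment(alignment, rng):
    """One pseudo-replicate: resample alignment columns with replacement."""
    n_cols = len(alignment[0])
    cols = [rng.randrange(n_cols) for _ in range(n_cols)]
    return ["".join(seq[c] for c in cols) for seq in alignment]


def bootstrap_replicates(alignment, n_replicates=1000, seed=0):
    """Build n_replicates pseudo-replicate alignments of the original length."""
    rng = random.Random(seed)
    return [bootstrap_alignment(alignment, rng) for _ in range(n_replicates)]


def clade_support(replicate_clades):
    """Fraction of replicate trees in which each clade appears."""
    counts = Counter(clade for clades in replicate_clades for clade in clades)
    n_trees = len(replicate_clades)
    return {clade: k / n_trees for clade, k in counts.items()}


def majority_rule(replicate_clades):
    """Clades present in more than half of the replicate trees."""
    return {c for c, s in clade_support(replicate_clades).items() if s > 0.5}
```

In practice each pseudo-replicate would be passed to the tree-building program, and the resulting trees converted to clade sets before calling `majority_rule`.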
Identification of selective constraints
To identify the main functional diversifying events in Ang during the evolutionary radiation of mammals we analyzed the change in the dynamics of synonymous (dS) and non-synonymous (dN) nucleotide replacements. In our study we assumed that synonymous replacements accumulate neutrally on average since they produce no amino acid replacements and are therefore not seen by selection. Taking into account this assumption, we estimated the intensity of selection by obtaining the ratio between dN and dS (ω = dN/dS). This ratio has been regarded as the most stringent way to identify selection, with ω = 1, ω < 1 and ω > 1 indicating neutral evolution, purifying selection and diversifying selection, respectively [58–60]. However, caution is required when measuring selection using this approach because the stability of RNA secondary structure as well as translational selection may impose constraints on synonymous sites, leading to lower dS values and consequently to inflated ω estimates [61–64].
To ameliorate the effects of these limitations, we tested for the presence of diversifying selection in two ways. First, we used maximum-likelihood models to identify selective constraints as implemented in the program CODEML of the PAML package v4.0. Using this approach, we compared a model assuming a homogeneous distribution of selective constraints along the protein and the phylogeny (model M0: one ω value for the entire tree and alignment) to a model assuming an independent ω for each lineage of the tree. These two nested models (the more complex model includes the parameters of the simpler model) were compared by the likelihood ratio test (LRT), with twice the difference between the log-likelihood values of the two models being compared to a χ² distribution with as many degrees of freedom as the number of branches in the tree minus one. Second, we used a parsimony-based approach robust to deviations from the assumption of neutrality of synonymous substitutions. This parsimony approach was based on a previously published sliding window procedure and is implemented in the program SWAPSC version 1.0. This program uses a statistically optimized window size to detect selective constraints in specific codon regions of the given alignment at a particular branch of the phylogenetic tree that shows the evolutionary history of the sequences under study.
Briefly, SWAPSC estimates the expected distribution of dS and dN by Li's method from simulated alignments, assuming a Poisson distribution of substitutions. A statistically optimum window size is then estimated that makes the detection of adaptive evolution independent of the window size. The empirical values of dS and dN obtained by using the optimal window size are contrasted with the expected distributions, and several hypotheses regarding the selective constraints acting on codon regions are tested. We obtained the simulated alignments needed for the analysis with the EVOLVER program implemented in the PAML package version 4.0, with the parameters estimated from the true sequence alignment after running the most appropriate codon-based model in PAML. Finally, we considered only regions and branches detected under adaptive evolution by both approaches as true positive results.
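The reading of ω described above can be sketched in a few lines. This is only an illustration: the tolerance `tol` around 1 is a hypothetical parameter introduced here, and a point estimate classified this way is not a statistical test. The case dS = 0 is handled explicitly because it yields an infinite ω, which is why such estimates need careful interpretation.

```python
import math


def omega(dN, dS):
    """Ratio of non-synonymous to synonymous rates; infinite when dS is 0."""
    if dS == 0:
        return math.inf if dN > 0 else math.nan
    return dN / dS


def regime(w, tol=0.05):
    """Label a point estimate of omega; tol is an illustrative tolerance."""
    if math.isnan(w):
        return "undefined"
    if w > 1 + tol:
        return "diversifying"
    if w < 1 - tol:
        return "purifying"
    return "neutral"
```

For example, `regime(omega(0.02, 0.10))` labels a branch with dN = 0.02 and dS = 0.10 (made-up values) as evolving under purifying selection.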
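The likelihood ratio test between the one-ratio model and the free-ratio model can be sketched as below. The log-likelihood values passed in would come from the CODEML output; the numbers in the usage note are invented for illustration. The χ² tail probability is computed with the closed-form expressions for integer degrees of freedom so that the sketch needs only the standard library.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df >= 1."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum_{i=0}^{df/2-1} (x/2)^i / i!
        term, total = 1.0, 1.0
        for i in range(1, df // 2):
            term *= half / i
            total += term
        return math.exp(-half) * total
    # Odd df: erfc(sqrt(x/2)) + exp(-x/2) * sum_i (x/2)^(i-1/2) / Gamma(i+1/2)
    total = 0.0
    for i in range(1, (df - 1) // 2 + 1):
        total += half ** (i - 0.5) / math.gamma(i + 0.5)
    return math.erfc(math.sqrt(half)) + math.exp(-half) * total


def likelihood_ratio_test(lnL_null, lnL_alt, n_branches):
    """Return (2*delta lnL, p-value) with df = number of branches - 1."""
    stat = 2.0 * (lnL_alt - lnL_null)
    return stat, chi2_sf(stat, n_branches - 1)
```

With made-up values, `likelihood_ratio_test(-1000.0, -997.0, 3)` gives a statistic of 6.0 on 2 degrees of freedom.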
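The general idea of scanning an alignment with a fixed window and comparing observed counts with a Poisson expectation can be sketched as follows. This is not the SWAPSC algorithm: the per-codon substitution counts, the expected rate per codon and the significance level are hypothetical inputs, the window size is taken as given rather than optimised, and no correction for overlapping windows is applied.

```python
import math


def poisson_sf(k, lam):
    """P(X >= k) for X ~ Poisson(lam)."""
    term = math.exp(-lam)  # P(X = 0)
    cdf = 0.0
    for i in range(k):
        cdf += term
        term *= lam / (i + 1)
    return max(0.0, 1.0 - cdf)


def window_scan(counts, window, expected_per_codon, alpha=0.05):
    """Return (start, observed, p) for windows with an excess of substitutions."""
    hits = []
    for start in range(len(counts) - window + 1):
        observed = sum(counts[start:start + window])
        p = poisson_sf(observed, expected_per_codon * window)
        if p < alpha:
            hits.append((start, observed, p))
    return hits
```

In an analysis of this kind the expected rate would be derived from simulated alignments rather than supplied by hand.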
Detection of intra-molecular co-evolution
To test for intra-molecular coevolution, we used a recently developed parametric model  implemented in the program CAPS v1 . The sensitivity of CAPS to identify coevolution between pairs of amino acid sites that are functionally linked has been shown to outperform other methods based on mutual information content or on other models of coevolution . We considered therefore the method to be appropriate for an accurate detection of co-evolution. This method has been applied in numerous case studies similar to the one here conducted [50, 71, 73, 74].
Briefly, CAPS compares the correlated variance of the evolutionary rates at 2 amino acid sites in a protein alignment, corrected by the time since the divergence of the 2 sequences they belong to. The algorithm estimates the synonymous nucleotide pairwise sequence divergence as a proxy for their divergence time. This method compares the amino acid transition probability scores between 2 sequences at 2 particular sites, using the blocks substitution matrix . The significance of the CAPS correlation values was assessed by randomly pairing sites of the alignment and building a distribution of correlation coefficients for 1,000,000 randomly paired sites against which we compared real correlation values. To correct for multiple tests and data non-independence CAPS performs a step-down permutation procedure  and corrects the probabilities for the correlation coefficients of co-evolving pairs of sites accordingly .
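The random-pairing step used to judge significance can be illustrated with a short sketch. It assumes that each alignment site has already been reduced to a vector of per-sequence-pair scores (a hypothetical input standing in for the time-corrected substitution scores), and it omits the step-down permutation correction; it is not the CAPS implementation.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient; 0.0 if either vector is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def null_threshold(site_vectors, n_pairs=10000, alpha=0.01, seed=0):
    """|r| exceeded by a fraction alpha of randomly paired sites."""
    rng = random.Random(seed)
    indices = range(len(site_vectors))
    null = []
    for _ in range(n_pairs):
        i, j = rng.sample(indices, 2)  # two distinct sites
        null.append(abs(pearson(site_vectors[i], site_vectors[j])))
    null.sort()
    return null[min(len(null) - 1, int((1 - alpha) * len(null)))]
```

A pair of sites would then be retained as a candidate if the absolute value of its correlation exceeds the threshold returned by `null_threshold`.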
For co-evolution analyses we used the protein-coding sequence of Ang and minimized type I error using a confidence value of 0.01. The structural PDB file for hAng (1ang, ) was used to identify the co-evolving amino acid positions in the structure (that is, all the amino acid positions in this study refer to their location in the hAng three-dimensional structure).
Molecular co-evolution between amino acids can be the result of their structural, functional, interaction, phylogenetic, or stochastic link. Disentangling the different types of coevolution is a difficult task, although a phylogenetic approach has been suggested as a feasible way to remove amino acid covariation due to stochastic noise. Distinguishing between structural, functional, and interaction co-evolution requires biological information in addition to the mathematical adjustments made by the method. Accordingly, we used correlated variation in the physico-chemical properties of the amino acids as a further filter in our co-evolutionary analyses.
Identifying Compensatory Mutational Dynamics in Angiogenin
The distance between two amino acids i and j was taken as the distance between their closest atoms:

$$d_{ij} = \min_{1 \le n \le N,\; 1 \le k \le K} \sqrt{(X_n - X_k)^2 + (Y_n - Y_k)^2 + (Z_n - Z_k)^2}$$

Here N is the number of atoms in amino acid i while K is that number in amino acid j. X, Y and Z represent the three-dimensional coordinates of the atoms corresponding to each of the amino acids. We considered two amino acids to contact each other when the distance between their closest atoms was equal to or less than 4Å.
Two mutations were considered to have compensated each other if, in addition to presenting the same phylogenetic pattern (that is, coevolving), they were located within 4Å of each other in the protein crystal structure. Two amino acid sites can also compensate each other indirectly. For example, if site "A" and site "B" are at more than 8Å distance but are surrounding (within 4Å) an important functional site "C", then changes at site "A" may affect site "C", which has to be compensated by changes at site "B". We also considered these cases to be in support of a compensatory relationship between sites "A" and "B". However, caution must be taken in making such assumptions because close amino acids, even though they are likely to influence one another, may not have a compensatory relationship. Conversely, amino acid sites distantly located in the protein structure may have indirect compensatory effects upon one another. The other limitation of this approach is that proteins can undergo dramatic conformational changes during their interactions with other proteins, which is not reflected in the crystal (static) structure of proteins. Under these circumstances, amino acids that are distantly located may interact and hence influence one another. Nonetheless, we adopted the very conservative view that amino acids interacting in our crystal structures are true interactors under the particular conditions in which the protein was crystallized. To identify compensatory relationships under our assumptions between amino acids at distances greater than 4Å, we searched for sites contacting both covarying amino acid sites in the structure and showing very low divergence levels in comparison with the rest of the molecule. We measured divergence levels per site by estimating the Poisson amino acid distances for each amino acid site in the multiple sequence alignments.
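The contact criterion described here (closest-atom distance of at most 4Å) can be sketched as follows. The function names and the representation of each residue as a list of (x, y, z) tuples are assumptions for illustration; parsing of the PDB file is left out.

```python
import math


def residue_distance(atoms_i, atoms_j):
    """Smallest Euclidean distance between any atom of i and any atom of j."""
    return min(math.dist(a, b) for a in atoms_i for b in atoms_j)


def in_contact(atoms_i, atoms_j, cutoff=4.0):
    """True when the closest atoms are within the cutoff (in angstroms)."""
    return residue_distance(atoms_i, atoms_j) <= cutoff
```

For two residues given as `[(0, 0, 0), (1, 0, 0)]` and `[(4, 0, 0), (9, 0, 0)]` (made-up coordinates), the closest atoms are the second atom of the first residue and the first atom of the second, so the residues count as being in contact.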
The level of divergence was compared to the distribution of divergence levels built using a pseudo-random sample of 1,000,000 amino acid site columns sampled with replacement from the alignment (one site could be sampled more than once).
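A minimal sketch of this comparison is given below. It assumes that the per-site Poisson distance is computed as -ln(1 - p), with p the proportion of sequence pairs that differ at the site; that per-site definition, the function names and the default number of samples are assumptions made for illustration rather than details taken from the analysis.

```python
import math
import random
from itertools import combinations


def site_poisson_distance(column):
    """-ln(1 - p), with p the fraction of sequence pairs differing at the site."""
    pairs = list(combinations(column, 2))
    p = sum(a != b for a, b in pairs) / len(pairs)
    return math.inf if p >= 1.0 else -math.log(1.0 - p)


def low_divergence_fraction(alignment, site, n_samples=10000, seed=0):
    """Fraction of resampled columns that are at most as divergent as `site`."""
    distances = [site_poisson_distance(col) for col in zip(*alignment)]
    rng = random.Random(seed)
    # Columns are drawn with replacement, so one site can be sampled repeatedly.
    sample = [distances[rng.randrange(len(distances))] for _ in range(n_samples)]
    return sum(d <= distances[site] for d in sample) / n_samples
```

A small returned fraction indicates that the site is among the most conserved columns of the alignment.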
Three-dimensional analysis in Angiogenin
The 3D-JIGSAW program [51, 52] was used to model the 3D structure of the different duplicates. To identify differences among the structures of the six mouse Angiogenin copies we used the program CCOMP, which measures the Root Mean Square Deviation between the different structures.
This work was supported by Science Foundation Ireland to M.A.F (04/YI1/M518). F.M.C. was supported by Marie Curie European Reintegration Grant FP7 actions (238885). S.A.L. is supported by an FPI PhD grant from the Spanish Ministerio de Ciencia y Tecnologia.
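For reference, the Root Mean Square Deviation between two sets of matched atoms can be sketched as below. The sketch assumes that the two structures have already been superposed and that atoms are listed in the same order; it does not reproduce the alignment or superposition steps of the program used in the study.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equally ordered coordinate lists."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(squared / len(coords_a))
```

Identical structures give an RMSD of 0, and larger values indicate greater average displacement between matched atoms.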
- Strydom DJ, Fett JW, Lobb RR, Alderman EM, Bethune JL, Riordan JF, Vallee BL: Amino acid sequence of human tumor derived angiogenin. Biochemistry. 1985, 24 (20): 5486-5494. 10.1021/bi00341a031.
- Beintema JJ, Breukelman HJ, Carsana A, Furia A: Evolution of vertebrates. Ribonucleases: Structure and Function. 1997, New York: Academic Press
- Adams SA, Subramanian V: The angiogenins: an emerging family of ribonuclease related proteins with diverse cellular functions. Angiogenesis. 1999, 3 (3): 189-199. 10.1023/A:1009015512200.
- Riordan JF: Structure and function of angiogenin. Ribonucleases: Structure and Function. 1997, New York: Academic Press
- Tello-Montoliu A, Patel JV, Lip GY: Angiogenin: a review of the pathophysiology and potential clinical applications. J Thromb Haemost. 2006, 4 (9): 1864-1874. 10.1111/j.1538-7836.2006.01995.x.
- Fett JW, Olson KA, Rybak SM: A monoclonal antibody to human angiogenin. Inhibition of ribonucleolytic and angiogenic activities and localization of the antigenic epitope. Biochemistry. 1994, 33 (18): 5421-5427. 10.1021/bi00184a010.
- Olson KA, French TC, Vallee BL, Fett JW: A monoclonal antibody to human angiogenin suppresses tumor growth in athymic mice. Cancer Res. 1994, 54 (17): 4576-4579.
- Olson KA, Fett JW, French TC, Key ME, Vallee BL: Angiogenin antagonists prevent tumor growth in vivo. Proc Natl Acad Sci USA. 1995, 92 (2): 442-446. 10.1073/pnas.92.2.442.
- Olson KA, Byers HR, Key ME, Fett JW: Prevention of human prostate tumor metastasis in athymic mice by antisense targeting of human angiogenin. Clin Cancer Res. 2001, 7 (11): 3598-3605.
- Piccoli R, Olson KA, Vallee BL, Fett JW: Chimeric anti-angiogenin antibody cAb 26-2F inhibits the formation of human breast cancer xenografts in athymic mice. Proc Natl Acad Sci USA. 1998, 95 (8): 4579-4583. 10.1073/pnas.95.8.4579.
- Kao RY, Jenkins JL, Olson KA, Key ME, Fett JW, Shapiro R: A small-molecule inhibitor of the ribonucleolytic activity of human angiogenin that possesses antitumor activity. Proc Natl Acad Sci USA. 2002, 99 (15): 10066-10071. 10.1073/pnas.152342999.
- Greenway MJ, Andersen PM, Russ C, Ennis S, Cashman S, Donaghy C, Patterson V, Swingler R, Kieran D, Prehn J, et al: ANG mutations segregate with familial and 'sporadic' amyotrophic lateral sclerosis. Nat Genet. 2006, 38 (4): 411-413. 10.1038/ng1742.
- Crabtree B, Thiyagarajan N, Prior SH, Wilson P, Iyer S, Ferns T, Shapiro R, Brew K, Subramanian V, Acharya KR: Characterization of human angiogenin variants implicated in amyotrophic lateral sclerosis. Biochemistry. 2007, 46 (42): 11810-11818. 10.1021/bi701333h.
- Wu D, Yu W, Kishikawa H, Folkerth RD, Iafrate AJ, Shen Y, Xin W, Sims K, Hu GF: Angiogenin loss-of-function mutations in amyotrophic lateral sclerosis. Ann Neurol. 2007, 62 (6): 609-617. 10.1002/ana.21221.
- Fett JW, Strydom DJ, Lobb RR, Alderman EM, Bethune JL, Riordan JF, Vallee BL: Isolation and characterization of angiogenin, an angiogenic protein from human carcinoma cells. Biochemistry. 1985, 24 (20): 5480-5486. 10.1021/bi00341a030.
- Acharya KR, Shapiro R, Allen SC, Riordan JF, Vallee BL: Crystal structure of human angiogenin reveals the structural basis for its functional divergence from ribonuclease. Proc Natl Acad Sci USA. 1994, 91 (8): 2915-2919. 10.1073/pnas.91.8.2915.
- Shapiro R, Riordan JF, Vallee BL: Characteristic ribonucleolytic activity of human angiogenin. Biochemistry. 1986, 25 (12): 3527-3532. 10.1021/bi00360a008.
- Shapiro R, Strydom DJ, Weremowicz S, Vallee BL: Sites of modification of human angiogenin by bromoacetate at pH 5.5. Biochem Biophys Res Commun. 1988, 156 (1): 530-536. 10.1016/S0006-291X(88)80874-X.
- Harper JW, Vallee BL: A covalent angiogenin/ribonuclease hybrid with a fourth disulfide bond generated by regional mutagenesis. Biochemistry. 1989, 28 (4): 1875-1884. 10.1021/bi00430a067.
- Russo N, Shapiro R, Acharya KR, Riordan JF, Vallee BL: Role of glutamine-117 in the ribonucleolytic activity of human angiogenin. Proc Natl Acad Sci USA. 1994, 91 (8): 2920-2924. 10.1073/pnas.91.8.2920.
- Shapiro R: Structural features that determine the enzymatic potency and specificity of human angiogenin: threonine-80 and residues 58-70 and 116-123. Biochemistry. 1998, 37 (19): 6847-6856. 10.1021/bi9800146.
- Holloway DE, Chavali GB, Hares MC, Baker MD, Subbarao GV, Shapiro R, Acharya KR: Crystallographic studies on structural features that determine the enzymatic specificity and potency of human angiogenin: Thr44, Thr80, and residues 38-41. Biochemistry. 2004, 43 (5): 1230-1241. 10.1021/bi035654+.
- Leonidas DD, Shapiro R, Subbarao GV, Russo A, Acharya KR: Crystallographic studies on the role of the C-terminal segment of human angiogenin in defining enzymatic potency. Biochemistry. 2002, 41 (8): 2552-2562. 10.1021/bi015768q.
- Shapiro R, Fox EA, Riordan JF: Role of lysines in human angiogenin: chemical modification and site-directed mutagenesis. Biochemistry. 1989, 28 (4): 1726-1732. 10.1021/bi00430a045.
- Shapiro R, Vallee BL: Site-directed mutagenesis of histidine-13 and histidine-114 of human angiogenin. Alanine derivatives inhibit angiogenin-induced angiogenesis. Biochemistry. 1989, 28 (18): 7401-7408. 10.1021/bi00444a038.
- Curran TP, Shapiro R, Riordan JF: Alteration of the enzymatic specificity of human angiogenin by site-directed mutagenesis. Biochemistry. 1993, 32 (9): 2307-2313. 10.1021/bi00060a023.
- Russo N, Acharya KR, Vallee BL, Shapiro R: A combined kinetic and modeling study of the catalytic center subsites of human angiogenin. Proc Natl Acad Sci USA. 1996, 93 (2): 804-808. 10.1073/pnas.93.2.804.
- Hallahan TW, Shapiro R, Vallee BL: Dual site model for the organogenic activity of angiogenin. Proc Natl Acad Sci USA. 1991, 88 (6): 2222-2226. 10.1073/pnas.88.6.2222.
- Hallahan TW, Shapiro R, Strydom DJ, Vallee BL: Importance of asparagine-61 and asparagine-109 to the angiogenic activity of human angiogenin. Biochemistry. 1992, 31 (34): 8022-8029. 10.1021/bi00149a036.
- Moroianu J, Riordan JF: Identification of the nucleolar targeting signal of human angiogenin. Biochem Biophys Res Commun. 1994, 203 (3): 1765-1772. 10.1006/bbrc.1994.2391.
- Nobile V, Vallee BL, Shapiro R: Characterization of mouse angiogenin-related protein: implications for functional studies on angiogenin. Proc Natl Acad Sci USA. 1996, 93 (9): 4331-4335. 10.1073/pnas.93.9.4331.
- Fu X, Roberts WG, Nobile V, Shapiro R, Kamps MP: mAngiogenin-3, a target gene of oncoprotein E2a-Pbx1, encodes a new angiogenic member of the angiogenin family. Growth Factors. 1999, 17 (2): 125-137. 10.3109/08977199909103521.
- Crabtree B, Holloway DE, Baker MD, Acharya KR, Subramanian V: Biological and structural features of murine angiogenin-4, an angiogenic protein. Biochemistry. 2007, 46 (9): 2431-2443. 10.1021/bi062158n.
- Brown WE, Nobile V, Subramanian V, Shapiro R: The mouse angiogenin gene family: structures of an angiogenin-related protein gene and two pseudogenes. Genomics. 1995, 29 (1): 200-206. 10.1006/geno.1995.1232.
- Strydom DJ: The angiogenins. Cell Mol Life Sci. 1998, 54 (8): 811-824. 10.1007/s000180050210.
- Singhania NA, Dyer KD, Zhang J, Deming MS, Bonville CA, Domachowske JB, Rosenberg HF: Rapid evolution of the ribonuclease A superfamily: adaptive expansion of independent gene clusters in rats and mice. J Mol Evol. 1999, 49 (6): 721-728. 10.1007/PL00006594.
- Cho S, Beintema JJ, Zhang J: The ribonuclease A superfamily of mammals and birds: identifying new members and tracing evolutionary histories. Genomics. 2005, 85 (2): 208-220. 10.1016/j.ygeno.2004.10.008.
- Osorio DS, Antunes A, Ramos MJ: Structural and functional implications of positive selection at the primate angiogenin gene. BMC Evol Biol. 2007, 7: 167. 10.1186/1471-2148-7-167.
- Zhang J, Rosenberg HF: Diversifying selection of the tumor-growth promoter angiogenin in primate evolution. Mol Biol Evol. 2002, 19 (4): 438-445.
- Wildman DE, Uddin M, Liu G, Grossman LI, Goodman M: Implications of natural selection in shaping 99.4% nonsynonymous DNA identity between humans and chimpanzees: enlarging genus Homo. Proc Natl Acad Sci USA. 2003, 100 (12): 7181-7188. 10.1073/pnas.1232172100.
- Shimodaira H, Hasegawa M: Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol Biol Evol. 1999, 16: 1114-1116.
- Lynch M, O'Hely M, Walsh B, Force A: The probability of preservation of a newly arisen gene duplicate. Genetics. 2001, 159 (4): 1789-1804.
- Moroianu J, Riordan JF: Nuclear translocation of angiogenin in proliferating endothelial cells is essential to its angiogenic activity. Proc Natl Acad Sci USA. 1994, 91 (5): 1677-1681. 10.1073/pnas.91.5.1677.
- Gu X: Maximum-likelihood approach for gene family evolution under functional divergence. Mol Biol Evol. 2001, 18 (4): 453-464.
- Gu X: Statistical methods for testing functional divergence after gene duplication. Mol Biol Evol. 1999, 16 (12): 1664-1674.
- Ohno S: Evolution by Gene duplication. 1970, Berlin: Springer-Verlag
- Nowak MA, Boerlijst MC, Cooke J, Smith JM: Evolution of genetic redundancy. Nature. 1997, 388 (6638): 167-171. 10.1038/40618.
- Lynch M, Conery JS: The evolutionary fate and consequences of duplicate genes. Science. 2000, 290 (5494): 1151-1155. 10.1126/science.290.5494.1151.
- Innan H, Kondrashov F: The evolution of gene duplication: classifying and distinguishing between models. Nat Rev Genet. 2010, 11: 97-108. 10.1038/nrg2689.
- Tully DC, Fares MA: Shifts in the selection-drift balance drive the evolution and epidemiology of foot-and-mouth disease virus. J Virol. 2009, 83 (2): 781-790. 10.1128/JVI.01500-08.
- Bates PA, Sternberg MJ: Model building by comparison at CASP3: using expert knowledge and computer automation. Proteins. 1999, 3: 47-54. 10.1002/(SICI)1097-0134(1999)37:3+<47::AID-PROT7>3.0.CO;2-F.
- Contreras-Moreira B, Bates PA: Domain fishing: a first step in protein comparative modelling. Bioinformatics. 2002, 18 (8): 1141-1142. 10.1093/bioinformatics/18.8.1141.
- Sicinska W, Rotkiewicz P: Computational analysis of the active sites in binary and ternary complexes of the vitamin D receptor. J Steroid Biochem Mol Biol. 2007, 103 (3-5): 305-309. 10.1016/j.jsbmb.2006.12.077.
- Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004, 5: 113. 10.1186/1471-2105-5-113.
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.
- Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005, 21 (9): 2104-2105. 10.1093/bioinformatics/bti263.
- Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52 (5): 696-704. 10.1080/10635150390235520.
- Akashi H: Inferring the fitness effects of DNA mutations from polymorphism and divergence data: statistical power to detect directional selection under stationarity and free recombination. Genetics. 1999, 151 (1): 221-238.
- Crandall KA, Kelsey CR, Imamichi H, Lane HC, Salzman NP: Parallel evolution of drug resistance in HIV: failure of nonsynonymous/synonymous substitution rate ratio to detect selection. Mol Biol Evol. 1999, 16 (3): 372-382.
- Sharp PM: In search of molecular darwinism. Nature. 1997, 385 (6612): 111-112. 10.1038/385111a0.
- Chamary JV, Parmley JL, Hurst LD: Hearing silence: non-neutral evolution at synonymous sites in mammals. Nat Rev Genet. 2006, 7 (2): 98-108. 10.1038/nrg1770.
- Mayrose I, Doron-Faigenboim A, Bacharach E, Pupko T: Towards realistic codon models: among site variability and dependency of synonymous and non-synonymous rates. Bioinformatics. 2007, 23 (13): i319-327. 10.1093/bioinformatics/btm176.
- Parmley JL, Chamary JV, Hurst LD: Evidence for purifying selection against synonymous mutations in mammalian exonic splicing enhancers. Mol Biol Evol. 2006, 23 (2): 301-309. 10.1093/molbev/msj035.
- Resch AM, Carmel L, Marino-Ramirez L, Ogurtsov AY, Shabalina SA, Rogozin IB, Koonin EV: Widespread positive selection in synonymous sites of mammalian genes. Mol Biol Evol. 2007, 24 (8): 1821-1831. 10.1093/molbev/msm100.
- Yang Z: PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24 (8): 1586-1591. 10.1093/molbev/msm088.
- Goldman N: Variance to mean ratio, R(t), for poisson processes on phylogenetic trees. Mol Phylogenet Evol. 1994, 3 (3): 230-239. 10.1006/mpev.1994.1025.
- Yang Z: Statistical properties of a DNA sample under the finite-sites model. Genetics. 1996, 144 (4): 1941-1950.
- Fares MA, Elena SF, Ortiz J, Moya A, Barrio E: A sliding window-based method to detect selective constraints in protein-coding genes and its application to RNA viruses. J Mol Evol. 2002, 55 (5): 509-521. 10.1007/s00239-002-2346-9.
- Fares MA: SWAPSC: sliding window analysis procedure to detect selective constraints. Bioinformatics. 2004, 20 (16): 2867-2868. 10.1093/bioinformatics/bth303.
- Li WH: Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J Mol Evol. 1993, 36 (1): 96-99. 10.1007/BF02407308.
- Fares MA, Travers SA: A novel method for detecting intramolecular coevolution: adding a further dimension to selective constraints analyses. Genetics. 2006, 173 (1): 9-23. 10.1534/genetics.105.053249.
- Fares MA, McNally D: CAPS: coevolution analysis using protein sequences. Bioinformatics. 2006, 22 (22): 2821-2822. 10.1093/bioinformatics/btl493.
- Travers SA, Fares MA: Functional coevolutionary networks of the Hsp70-Hop-Hsp90 system revealed through computational analyses. Mol Biol Evol. 2007, 24 (4): 1032-1044. 10.1093/molbev/msm022.
- Travers SA, Tully DC, McCormack GP, Fares MA: A study of the coevolutionary patterns operating within the env gene of the HIV-1 group M subtypes. Mol Biol Evol. 2007, 24 (12): 2787-2801. 10.1093/molbev/msm213.
- Henikoff S, Henikoff JG: Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA. 1992, 89 (22): 10915-10919. 10.1073/pnas.89.22.10915.
- Westfall P, Young S: Resampling-based multiple testing. 1993, New York: John Wiley & Sons
- Atchley WR, Wollenberg KR, Fitch WM, Terhalle W, Dress AW: Correlations among amino acid sites in bHLH protein domains: an information theoretic analysis. Mol Biol Evol. 2000, 17 (1): 164-178.
- Shoemaker BA, Zhang D, Thangudu RR, Tyagi M, Fong JH, Marchler-Bauer A, Bryant SH, Madej T, Panchenko AR: Inferred Biomolecular Interaction Server--a web server to analyze and predict protein interacting partners and binding sites. Nucleic Acids Res. 2010, 38 (Database issue): D518-24. 10.1093/nar/gkp842.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.