Empirical demonstration of environmental sensing in catalytic RNA: evolution of interpretive behavior at the origins of life

Lehman, Niles; Bernhard, Tess; Larson, Brian C; Robinson, Andrew JN; Southgate, Christopher CB

doi:10.1186/s12862-014-0248-2

Research article
Open access
Published: 04 December 2014

Empirical demonstration of environmental sensing in catalytic RNA: evolution of interpretive behavior at the origins of life

Niles Lehman¹,
Tess Bernhard²,
Brian C Larson¹,
Andrew JN Robinson³ &
…
Christopher CB Southgate³

BMC Evolutionary Biology volume 14, Article number: 248 (2014) Cite this article

2308 Accesses
10 Citations
4 Altmetric
Metrics details

Abstract

Background

The origins of life on the Earth required chemical entities to interact with their environments in ways that could respond to natural selection. The concept of interpretation, where biotic entities use signs in their environment as proxy for the existence of other items of selective value in their environment, has been proposed on theoretical grounds to be relevant to the origins and early evolution of life. However this concept has not been demonstrated empirically.

Results

Here, we present data that certain catalytic RNA sequences have properties that would enable interpretation of divalent cation levels in their environment. By assaying the responsiveness of two variants of the Tetrahymena ribozyme to the Ca²⁺ ion as a sign for the more catalytically useful Mg²⁺ ion, we show an empirical proof-of-principle that interpretation can be an evolvable trait in RNA, often suggested as a model system for early life. In particular we demonstrate that in vitro, the wild-type version of the Tetrahymena ribozyme is not interpretive, in that it cannot use Ca²⁺ as a sign for Mg²⁺. Yet a variant of this sequence containing five mutations that alter its ability to utilize the Ca²⁺ ion engenders a strong interpretive characteristic in this RNA.

Conclusions

We have shown that RNA molecules in a test tube can meet the minimum criteria for the evolution of interpretive behaviour in regards to their responses to divalent metal ion concentrations in their environment. Interpretation in RNA molecules provides a property entirely dependent on natural physico-chemical interactions, but capable of shaping the evolutionary trajectory of macromolecules, especially in the earliest stages of life’s history.

Background

The origins of life on the Earth can be described as a transition from an environment containing chemicals dominated by mass-action kinetics to one containing informational macromolecules dominated by selective forces and evolution [1],[2]. The latter phenomena involve complex molecular ‘behaviour’ whereby populations of molecules compete for reproductive success. The interaction of organisms with an ever-changing environment has been a key evolutionary factor since the earliest entities emerged that were subject to natural selection [3]. It is thus of great importance to the understanding of the origin of life to understand the nascent stages of this interaction.

For a genotype (not necessarily originally a complete genome) to interact with its environment, it must first sense, and then respond to, environmental states. Primordial genotypes would not have been adept at these actions, as they would have not had the benefit of extensive evolutionary tuning. We contend that the most characteristic form of this sensing in living organisms (as opposed to inert material responding mechanically) is an interpretive process involving a response to a sign. At the molecular level, signs are indicators of a more general environmental condition; in particular here we will focus on metal ions as indicative of the presence or absence of other ions in the environment as a consequence of causally related solubilities in an aqueous milieu.

The idea that interpretation, representation, or signification is a fundamental characteristic of living things is the basis of the field of theoretical biology known as biosemiotics (semiotics being the philosophical study of signs) [4]. Biosemiotics is a controversial area of study because it has tended to lack empirically testable propositions. Our focus here is on providing just such opportunities for testing. Originally genotypes would be poor at correctly interpreting the state of the environment, but this trait would be subject to natural selection. To help explore how this evolution could happen, what was first needed was a conceptually robust definition of interpretation that cannot be reduced to a fully adequate re-description in terms only of mechanistic causes and effects. Though the interactions are entirely mechanistic when considered at the molecular level of description, a fully adequate explanation of why such properties would be selected and maintained by selection requires, in addition, the semiotic level of description. Two of the present authors have proposed such a definition [5].

In an interpretation, the trigger for the change of state (the ‘sign’) is related to, but not obligatorily implied by, some aspect of the environment (the ‘object’) that would make such a change in state beneficial. For example, an amoeba will crawl up a chemical gradient of attractant molecules towards the latter’s source. The chemotactic response is adaptive because it increases the chance of the amoeba obtaining nutrition (by ingesting the bacterium that is the source of the attractant). The response is interpretive because it is not the chemical gradient that is of benefit to the amoeba, but the bacterium of which the attractant molecules are a sign. The amoeba’s target (the bacterium) is physically distinct from the sign (the chemical attractant). It is this separation between sign and object that distinguishes an interpretation from a direct adaptive response; i.e., interpretations are fallible. The amoeba may swim up a chemical gradient that has not, in a particular case, been produced by a potential source of nutrition. In general, interpretive behaviour may be adaptive in spite of its fallibility because on balance the benefits derived from obtaining information about the object without having to respond directly to the object outweigh the costs of misinterpretations. The amoeba benefits from responding the attractant gradient because it can thereby detect the presence of a bacterium at a distance.

The simplest conceivable scenario in which an adaptive interpretive response could occur would involve a two-state entity in a two-state environment [5]. Suppose that the environment can be ‘favourable’ (F) or ‘unfavourable’ (U) and the entity has two possible states, A and B. Suppose further that in environment F it is advantageous for the entity to be in state A, and in environment U it is advantageous for the entity to be in state B. The overall ‘fitness’ of the entity in this varying environment may be expressed, by analogy with game theory, in terms of a 2×2 payoff matrix (Figure 1). There are four possible outcomes in the matrix: O1 (environment F, entity state A), O2 (environment U, entity state A), O3 (environment F, entity state B), and O4 (environment U, entity state B). The total ‘payoff’ (overall fitness) for the entity is the sum of O1 to O4, weighted according to the relative probabilities of each of these outcomes. In the non-interpretive configuration of the entity, its state (A or B) varies independently of the state of the environment. An interpretive variant of the entity might be capable of, say, responding to some sign that indicates (fallibly) that the state of the environment is F, the response being a change from state B to state A. This variant may have a selective advantage over the wildtype because it will increase the time that it spends in the advantageous combination of environment F and state A. Such interpretive responsiveness will only be adaptive, however, if it is not outweighed by the costs of a misinterpretation. This cost will be a function of the degree of disadvantage entailed by the entity being in state A in environment U, and the probability of being so placed in such a relation to the environment by the fallible (i.e., less than perfect) correlation between the presence of the sign and environmental state F.

For responsiveness to the sign to have the potential to count as an interpretation it is thus necessary that there be both a potential advantage to the response in the appropriate environment, and a potential disadvantage if the response occurs in the inappropriate environment (a misinterpretation). This may be expressed in terms of three inequalities that together provide the minimum criteria for interpretive behaviour in a 2-environment, 2-states-of-an-entity system. The relevant inequalities are: O1 > O2 (in state A the entity is better off in the favourable rather that the unfavourable environment); O1 > O3 (in the favourable environment the entity is better off in state A than state B); O4 > O2 (in the unfavourable environment the entity is better off in state B than state A, hence the cost of a misinterpretation) [6]. In addition, by definition, the highest of the four possible outcomes must correspond to the nominally favourable environment, otherwise that environment cannot be said to be the favourable one.

With this theoretical framework in hand, it then becomes imperative to demonstrate these phenomena in an empirical system. An obvious choice to manifest interpretation experimentally is ribonucleic acid (RNA), because RNA molecules simultaneously possess evolvability and catalytic function, and the RNA World concept is currently generally accepted as a plausible stage of early life on the Earth [1],[7]. Specifically, we wished to test the hypothesis that a single biomolecule not part of a more complex cellular system may exhibit properties consistent with the minimum criteria for interpretive responsiveness. In this paper we report investigations of the properties of the group I self-splicing ribozyme from the ciliated protozoan Tetrahymena. As sign and object we use the availability of the divalent cations (M²⁺) which are not only prebiotically relevant but known to affect the catalytic behavior of ribozymes [8]-[10], and the response was a change in the catalytic rate of the ribozyme. We show that the cation dependency of the catalytic properties of a variant of this ribozyme could, under appropriate conditions in a hypothetical two-state environment, provide the basis on which the inequalities required as minimal conditions for interpretive responsiveness could be satisfied. We also demonstrate that such interpretive responsiveness can be an evolvable property in variants of this ribozyme, particularly in an origin-of-life context.

Results

Pick-up-the-tail assays

We chose two variants of the L–21 Tetrahymena ribozyme for study (Figure 2). The contemporary genotype (here, CG), an example of a present-day ‘wildtype’, is the L–21 derivative of the naturally occurring sequence obtained from T. thermophila and used in one of the first published examples of a catalytic RNA [11]. This ribozyme requires divalent cations for self-splicing or trans-splicing activity in vitro, and can utilize Mg²⁺ and/or Mn²⁺ but no other [8]. The ‘primitive variant’ (here, PV) is a five-error mutant of the wild-type ribozyme that was discovered through in vitro selection for activity in 10 mM CaCl₂ (clone 61 in ref. [12]). This variant has a catalytic efficiency in pure CaCl₂ that is five orders of magnitude less than that of the wild-type ribozyme (i.e., the CG) in MgCl₂, but its slight activity in Ca²⁺ is nonetheless notable because the ionic radius of Ca²⁺ is significantly larger than that of Mg²⁺, and the five mutations are required to open up the active site enough to confer detectable catalytic activity [10]. We tracked the ability of both the CG and PV versions of this ribozyme to catalyze phospho-ester transfer in four cation-specific buffer conditions. First we measured activity as a function of [MgCl₂] and noted that there was a point of maximum catalytic separation between the two variants at 0.5 mM MgCl₂ (Figure 3). From this we chose ‘high’ and ‘low’ concentrations of Mg to use in subsequent assays; these values were 25 mM and 0.5 mM respectively. These values have been shown before as being above and below, respectively, the midpoint of the Tetrahymena activity assay [8]. We chose 10 mM CaCl₂ as the concentration of this alternative divalent (here, the sign) because this quantity is known to enhance folding of the wild-type Tetrahymena ribozyme in the presence or absence of MgCl₂ [8] and is also the concentration at which the PV ribozyme was selected for activity in the absence of MgCl₂ [12]. These ionic concentrations provide the clearest distinction between the properties of the two ribozymes and thus should optimize our chances of detecting interpretive behavior as an empirical proof-of-principle.

As seen previously, reaction progress as a function of time showed a rapid increase in product initially, with a plateau after approximately one hour. We measured the initial slope of the rapid reaction as a rough estimate of the initial velocity of the reaction to first approach the question of how well the ribozymes were performing under the various salt conditions. Given the extents of reactions of the two ribozymes under the ionic conditions chosen, we obtained the best estimates of initial velocities at 5 minutes for both ribozyme variants (see next section).

Using these initial slopes, the PV ribozyme satisfied all three inequalities needed for interpretive behaviour (O1 > O2, O1 > O3, and O4 > O2) at all time points (e.g., Figure 3b). That is, in the favourable environment the PV ribozyme is better off in the presence than the absence of the ‘sign’ (Ca²⁺), and, notably, in the unfavourable environment the ribozyme is better off in the absence of the sign than in its presence, suggesting a potential evolutionary cost of misinterpretation (Table 1). We performed three replicates of each set of reactions and with the standard errors of the runs we used the Kruskal-Wallace rank-order statistic to determine that the ordering of the inequalities was marginally statistically significant for the PV at the P <0.1 level.

Table 1 Percent reactivity values of the CG and PV ribozymes using the initial velocities of the pick-up-the-tail assays

Full size table

The CG ribozyme also satisfied the three inequalities above. However, the highest performance of the CG ribozyme was in conditions of low [Mg²⁺] and the absence of the sign (Ca²⁺), i.e., outcome O4. By definition, for the CG ribozyme the favourable condition is therefore low rather than high [Mg²⁺], in which case the columns of the 2×2 matrix need to be reversed in order to assess whether the relevant inequalities are satisfied. When this is done the CG ribozyme does not meet the necessary criteria for interpretive behavior. This result does not depend on our choice of time points; the same trends are observed at any time up to 15 minutes (data not shown).

For the PV ribozyme, the O3 and O4 are relatively equal values. For this ribozyme, the ordinal ranking can be described as O1 > O3 ~ O4 > O2. This is because the O3 vs. O4 ranking depends on which sets of data replicates are used, but the order between these two cells does not affect analysis of interpretive behavior. Thus for this variant, the presence of Ca²⁺ more dramatically alters the differentiation between the favorable and unfavorable states, allowing the ribozyme to better distinguish the disparity in Mg²⁺ content and hence be interpretive of its environment.

Kinetic analyses with radiolabeled substrate

It was possible to estimate k _cat values from the assays performed with ³²P-labeled substrate under conditions of enzyme excess for all scenarios except for the PV in O2 and O4 where the extents of reaction were not significant even after one hour. These values are summarized in Table 2. Here it can be observed that there is general agreement with the satisfying of the inequalities that constitute minimum criteria for the possibility of interpretive behaviour. However, because two of the PV values were not determinable, and because the enzyme excess conditions do not emulate those expected to drive the evolution of this trait, we did not rely on these data for our analysis of interpretation.

Table 2 Rough estimates of k _cat values under conditions of enzyme excess

Full size table

These ³²P-labelling kinetic data were used to establish that the 5-minute time point was the optimal estimate of the initial velocities for the CG and PV ribozymes. From the extents of trans-splicing reactions of the ribozymes under the condition of enzyme excess over all four salt concentration combinations, we concluded that the CG consistently reacts approximately five times faster than the PV, and that the initial velocity values (where the extents of reaction are well differentiated yet below 50%) are best measured at 5 minutes, averaging over all four conditions (Figure 4). Moreover, the level of substrate cleavage under these assays was the highest for both ribozymes in the O1 condition, which was not the case for the CG ribozyme under the pick-up-the-tail assay that tracked the behavior of the ribozyme itself. This result influenced our understanding of why the PV ribozyme is interpretive but the CG is not (see Discussion).

Data analysis

An analysis of the total parameter space, and of the fitnesses of the PV ribozyme under study within that space, provides an opportunity to quantify interpretive behaviour of the ribozyme. As outlined in the Methods using equations (1), (2), and (3), the interpretive behaviour of the ribozyme can be quantified using an analysis of the total parameter space, defined by assigning values to three variables: the probability of high Mg²⁺ concentration (favourable environment), the probability of the presence of Ca²⁺ (presence of the sign) and the correlation, r, between the favourable environment and the presence of the sign. The proportion of total activity (fitness) of the ribozyme attributable to the property of interpretation is expressed by the interpretive index, I. If the total fitness of the ribozyme at a given point in parameter space is W _t, and the fitness that would obtain if there were no correlation (predictive value) between the presence of the sign and the favourable environment is W _c, then I = (W _t – W _c)/W _t .

Figure 5a shows that, for the PV ribozyme, there is a positive interpretive component of fitness provided that there is a correlation between the presence of the sign (Ca²⁺) and the occurrence of the favourable environment (high concentration of Mg²⁺). As expected, the interpretive component of fitness rises with the degree of correlation between these variables.

Figure 5b shows the effect of varying the probability of the sign (presence of 10 mM Ca²⁺) when the correlation between the sign and the presence of the favourable environment (25 mM Mg²⁺) is maximal. Note that for a given probability of the favourable environment the maximal interpretive benefit occurs when the probability of the favourable environment is equal to the probability of the sign. This is as expected because it is the condition under which false positive and false negative responses to the sign are minimized. At higher levels of probability of the favourable environment the interpretive benefit is limited by the occurrence of false negatives (absence of the sign in the presence of the favourable environment). At lower levels of probability of the favourable environment the interpretive benefit is limited by false positives (presence of the sign in the absence of the favourable environment). (Figure 5a shows that the maximization of interpretive benefit when the probabilities of the sign and favourable environment are equal holds at all degrees of correlation between them).

Of the three parameter spaces represented in Figure 5b the greatest interpretive benefit is available where the probability of the favourable environment is close to 0.3. Calculations (not shown) reveal that the maximum occurs at a probability of the favourable environment of 0.33. The relationship between interpretive benefit and the probability of the favourable environment is explored further in Figure 5c, which shows the maximum interpretive benefit available at different probabilities of the presence of a favourable environment (i.e., parameter spaces in which the probability of the sign is equal to the probability of the favourable environment). As expected, interpretive benefit increases with increasing correlation between the sign and the interpretive environment.

Discussion

The results of the assays above provide an empirical demonstration of interpretation as manifested at the molecular level. In addition, because the specific inequality ratios vary from one RNA sequence to another, we can infer that interpretation is an evolvable trait. When using the term ‘interpretation’ about a molecular property we are not suggesting any form of consciousness or deliberate choice; at the same time, from these results we do claim that a property more familiar in the behaviour of organisms is both definable and demonstrable at the molecular level [3].

We have shown that the PV ribozyme meets the necessary criteria for interpretive behavior. The maximum interpretive benefit (interpretive index) occurs at locations in parameter space at which the probabilities of the sign (presence of 10 mM Ca²⁺) and the presence of the favourable environment (25 mM Mg²⁺) are equal. This reflects the conditions under which the effects of false positive and false negative responses to the sign are minimized. The maximum interpretive benefit for the PV ribozyme at reaction times of 5 minutes occurs at the location in parameter space at which the probability of the favourable environment is 0.33. When the correlation between sign and favourable environment is maximal, and the probabilities of the sign and favourable environment are equal, the interpretive component of total fitness at this location in parameter space is 46% (Figure 5C). Even when the correlation is relatively low, r = 0.25, the interpretive component of total fitness is 17%. Levels of interpretive benefit sufficient to be subject to selection are also seen in less optimal areas of parameter space, in which the probabilities of the sign and favourable environment are not equal (Figure 5B). Importantly, the characteristics of the relationship between interpretive benefit and the parameter space variables illustrated in Figure 5 are properties of the whole system; i.e., they are ‘emergent’ properties of the 2 × 2 outcome matrix within the constraints defined by the values assigned to the parameter space variables. Therefore, although at a molecular level of description the physico-chemical interactions of a ribozyme molecule are adequate to explain its properties, a fully adequate explanation of a proto-biotic ribozyme sequence and structure might require reference to selection for interpretive behaviour, such an explanation necessarily involving reference to the ribozyme’s past or present relation to a probabilistically varying environment. We emphasize, however, that the PV variant of the Tetrahymena ribozyme represents merely a proof-of-principle in this regard: we are not suggesting that this particular ribozyme had any proto-biotic role.

In the absence of an analysis in terms of interpretation, the values observed for the artificially selected PV version of the Tetrahymena ribozyme in the payoff matrices would be somewhat surprising, because they show that the ribozyme does better in 0.5 mM MgCl₂ (O4) than in the same situation but with 10 mM CaCl₂ present (O2). This stands in contrast to the comparison between O1 and O3, where it is seen that the addition of CaCl₂ to a 25 mM MgCl₂ buffer is advantageous. From a molecular perspective this can be explained through the realization that high amounts of a poorer cofactor (Ca²⁺) solution can flood the catalytic site and out-compete the preferred cation [8]. Thus the results seen here cannot be explained by a simple cooperative binding effect of one metal ion on another. Similarly, for the CG ribozyme, its low activity in O1 can be attributed to the tendency of the Tetrahymena ribozyme to become trapped in unfavorable folded conformations at high, non-physiological concentrations of Mg²⁺ values [8],[14]. This can be detected in the pick-up-the-tail assays where the ribozyme is being tracked but is not seen in the kinetic analyses of ³²P-labeled substrate cleavage where there is vast enzyme excess (Figure 4).

But importantly, from the perspective of evolutionary fitness, the concept of interpretation adds a level of understanding of why certain patterns of cation utilization might be under selection. The concept of interpretation is therefore a significant and novel way of viewing evolutionary options in molecular systems. By considering that the sign (Ca²⁺) and the object (Mg²⁺) typically co-occur in biotic environments and in fact the former is typically at a concentration that is an order of magnitude higher than the latter (Table 3), an appreciation of a interpretive entity’s ability to exploit a chemically less proficient resource – and of the evolutionary advantage this brings – allows us to understand seemingly maladapted traits (e.g., here, O4 > O2). This would be true in other molecular (and organismal) systems as well. It is worth emphasizing that we are not simply re-expressing the behaviour of the Tetrahymena chemical system in new language, but rather that we are positing that interpretation is a general evolutionary strategy operative from the molecular level upwards.

Table 3 Comparison of the physical and geological characteristics of the two divalent cations under study as putative signs in this paper

Full size table

The evolutionary scenario that we envisage from these results describes at least three distinct temporal stages (Figure 6). The first stage would be that of primordial, nearly random, and simple RNA sequences not optimized for replicative activity by natural selection. These RNAs would take advantage of the dominant divalent cation in their environment (Ca²⁺) to facilitate catalysis [19]. The binding would have been weak and many RNA sequences would have been able to conform to the binding of the relatively large and diffuse Ca²⁺ ion; but any anionic backbone shielding and/or catalytic prowess provided by this ion would have been potentially beneficial. Here the Ca²⁺ ion would have been the object itself, and not a sign.

The second stage would have been the advent of ribozymes with interpretive behavior, and the PV is seen as offering a proof-of-principle analogy for the actual RNAs that could have evolved during this time period. Here, the advantage of using Ca²⁺ as a sign in environment where the presence of Ca²⁺ and Mg²⁺ was correlated would imbue such sequences with a fitness advantage as they could accommodate both ions but make use of the chemically more powerful Mg²⁺ ion when available. More specifically, the structural complexity of the RNA had reached a point where the number and conformation of its cation biding sites allowed for a simultaneous distinction between, yet rudimentary simultaneous usage of, both Ca²⁺ and Mg²⁺. The ribozyme can use the former as a sign for the latter, and we expect that simulation models will later show that this feature is enhanced in environments where the appearance of these two ions is correlated in time and space. At this evolutionary stage interpretive molecular behaviour would be advantageous and would be under positive selection.

The third stage would have been the evolution of highly specific ribozyme sequences (and their corresponding 3° structures) able to form highly specific and tight active sites that could make maximal use of the preferred Mg²⁺ ion. These sequences (e.g., the CG variant that is extant in biology today) would have the benefit of long periods of evolutionary optimization in a defined cellular environment where ion transporters could use energy to pump ions against an environmental concentration gradient and create an optimal internal milieu, such as one that is enriched in Mg²⁺ compared to Ca²⁺. Here the evolutionary pressure for this particular kind of molecule would be for maximal and controllable catalytic activity, rather than for the ability to take advantage of signs of the state of the external environment, and thus interpretive behaviour would have been diminished or lost as it would no longer confer any selective advantage. Of course the use of signs (more often referred to as messages or signals in biological contexts) remains an important aspect of intracellular and inter-cellular communication. Our point is that in certain circumstances there may be a selective benefit to interpretive behaviour at one stage of evolution which is then superseded by a more efficient non-interpretive strategy based on high specificity binding. It is even conceivable that biotic systems ultimately retained the use of primordial signs such as Ca²⁺ for intercellular messaging purposes where the ion (sign) now has an arbitrary relation to object (intracellular or extracellular state) that it signifies.

It may be the case that other biopolymers such as proteins could have experienced evolutionary pressures based on interpretive abilities. Yet the potential central role of RNA in early life as a consequence of its dual genotypic and phenotypic functionalities makes it an ideal candidate for searching for interpretive behaviour. We anticipate that this characteristic could be evident in other RNA systems, particularly RNAs with known alternative folded structures, such as riboswitches, as discussed previously [20]. It may also be the case that other signs and objects could be utilized by RNA in an interpretive fashion. The divalent cations Mg²⁺ and Ca²⁺ are both alkali earth metals with similar, but not identical ionic radii (Table 3), and thus display many of the same chemical properties such as aqueous solubility and mineralization with subtle but often critical differences. This made them likewise good candidates for sign/object partners, and the well-studied metal-dependence [8],[10],[12] of the Tetrahymena ribozyme made it a logical system in which to investigate interpretive behaviour. Other ions such as Mn²⁺ [8] and Fe²⁺ [21],[22] are also of interest in this regard, given the roles they can play in RNA-based catalysis.

In a system in which the definition of interpretation and the three inequalities discussed above are all satisfied, it is worth considering how such an interpreting entity might evolve further. Such further evolution might include: response to a different sign that is more highly correlated with the ‘object’ (the entity of which the sign is functioning as a sign); greater specificity of binding of the sign (reducing errors involving analogues of the sign); strategies for reducing the disadvantage of binding the sign when the object is not present; or, in some cases, enhanced capacity to recognize the object, such that recognition of the sign is no longer advantageous. (The latter scenario is the one that we envisage could explain the evolution of the current genotype, CG, of the Tetrahymena ribozyme from the primitive variant, PV.) We propose that it should be possible to observe each of these possibilities by performing targeted artificial evolution experiments on ribozymes in which interpretation has been demonstrated. From the last possibility it may be postulated that interpretations at the molecular level may be evolutionarily more primitive than high-specificity binding, and may have been more widespread than the latter in the early stages of the evolution of life.

Conclusions

Using variants of the Tetrahymena ribozyme, we have shown that they can meet the minimum criteria for the evolution of interpretive behaviour in regards to their responses to divalent metal ion concentrations in their environment. In particular, one variant is interpretive, while another (the contemporary sequence) is not, suggesting that an evolutionary transition can take place and that interpretiveness can be under the influence of selection, either directly in a highly variable or unpredictable environment, or indirectly as in a laboratory selection for a specific phenotype. Interpretation must take place through a physical-chemical interaction between larger genotype molecules and smaller sign molecules; most likely binding constants at catalytic, general folding, or specific allosteric sites on genotypic polymers will prove to be the critical factors. Computer simulations of such events and/or additional examples of putative interpretive behavior in other RNAs will be quite valuable. Interpretation in RNA molecules provides a distinct type of intermolecular interaction event that can shape the evolutionary trajectory of macromolecules, especially in the earliest stages of life’s history.

Methods

RNA preparation

The variants of the 393-nucleotide Tetrahymena ribozyme were transcribed in vitro from DNA templates as described previously [10]. The two ribozymes were purified by electrophoresis through 8% polyacrylamide / 8 M urea gels and resuspended in 0.1 mM EDTA at 10 pmol/μl concentration prior to use. The trans-splicing substrate for in vitro assays, the 25-mer RNA ‘S-1t’ [23]: 5′-GGCCCUCU•AAAUAAAUAAAUAAAUA-3′, was purchased commercially from IDT (Coralville, IA). Upon reaction with the Tetrahymena ribozyme, this molecule is spliced at the dot (•) such that the 3′ end is transferred to the 3′ end of the ribozyme and the 5′ end is liberated.

Ribozyme assays

The initial assays were performed in a ‘pick-up-the-tail’ format [12],[24] in which the reaction was followed by tracking the length increase of the ribozyme following addition of the 3′ end of the substrate molecule. Successful reaction between the L–21 variants of the Tetrahymena ribozyme and the S-1t substrate results in a length change from 393 to 410 nt, which can be followed electrophoretically. For assays, 0.5 μM ribozyme was incubated with 1.25 μM substrate for 0–60 min in four different reaction buffers, all of which were buffered to pH 8.0 with 30 mM EPPS. Previously we tracked the reactivity in 0.5–50 mM MgCl₂ after 195 min (Figure 4 in ref. [12]), and from this we opted to perform subsequent studies for shorter times (0–15 min; to address better the ribozymes’ initial velocities) in the following four reaction buffers (final concentrations): 25 mM MgCl₂ + 10 mM CaCl₂ (O1), 0.5 mM MgCl₂ + 10 mM CaCl₂ (O2), 25 mM MgCl₂ (O3), and 0.5 mM MgCl₂ (O4). These concentration values are similar to, but outside of the range of, those investigated earlier [8] for the CG ribozyme. (The existence of a Ca²⁺-competent mutant was not known when that study [8] was performed.) Reactions were stopped by the addition of enough EDTA to chelate stoichiometrically all divalent cations in solution, and the products were electrophoresed through 8% polyacrylamide / 8 M urea gels. The resulting bands were detected and quantified after staining with SybrGreen II on a Typhoon Trio⁺ imager and quantified using the ImageQuant software (GE Healthcare). The small but clear difference between the reactant (393 nt) and the product (410 nt) allowed the employment of SybrGreen staining with small error. All reactions were performed in triplicate.

Kinetic studies

While the assays described in the previous section were entirely satisfactory (see Results) and performed under conditions of substrate excess, which would be pre-biotically relevant if one considers that nascent RNA genotypes would be far outnumbered by any substrate or “food” molecules, we performed additional analyses under conditions of ribozyme excess for the two ribozyme variants under differing environmental conditions following established protocols [10],[12]. Here, the reaction was followed by tracking the length loss of the substrate following transposition of its 3′ end to the ribozyme and subsequent release of the 5′ portion, which was radiolabeled with γ-³²P[ATP] by the use of the Optikinase enzyme (USB Biochemicals, Cleveland, OH). For assays, 10 or 25 nM ribozyme was incubated with 1.0 nM substrate for 0–60 min in the four different reaction buffers O1–O4 described above. During this period, with this stoichiometry, less than 50% (and usually less than 20%) of the substrate has been converted to product such that single turnover conditions were maintained. Reactions were stopped by the addition of enough EDTA to chelate stoichiometrically all divalent cations in solution, and the products were electrophoresed through 16% polyacrylamide / 8 M urea gels and the resulting bands were detected by phosphorimaging on a Typhoon Trio⁺ imager and quantified using the ImageQuant software (GE Healthcare). The apparent rate constants k _obs were estimated by assuming that substrate binding was not rate limiting and that the reaction could be approximated with by first-order kinetics. While not strictly true for the Tetrahymena ribozyme under lower salt and higher temperature conditions (10 mM MgCl₂ and 50°C) [14], this assumption is likely a good one averaged over all four reaction conditions studied here where the salt concentrations are typically higher and the temperature lower (37°C). The values of k _obs were obtained by fitting the fraction substrate cleaved (f) data to the equation f = A{1–exp(−k _obst)}, where A is the asymptotic maximum extent of reaction. The pseudo-first-order rate constants k _cat were obtained estimating the y-intercept of plots of k _obs vs. k _obs/[E] for each ribozyme in each condition (O1–O4). Because the vast ribozyme excess did not emulate conditions expected for interpretive scenarios, these data were primarily used to establish the best time point to assay the ribozymes for interpretive behaviour using data from the previous section (see Results), and only secondarily to assess interpretive behaviour.

Parameter-space analysis

An analysis of the total parameter space, and of the fitnesses of the PV ribozyme under study within that space, provides an opportunity to quantify interpretive behaviour of the ribozyme. The parameter space is defined by assigning values to three variables: the probability of high Mg²⁺ concentration (favourable environment), the probability of the presence of Ca²⁺ (presence of the sign) and the correlation between the favourable environment and the presence of the sign.

The proportion of total activity (fitness) of the ribozyme attributable to the benefit of interpretation is calculated as follows. The total activity (fitness) of a ribozyme (W _t) at any location within this parameter space is given by the sum of the four experimentally measured outcomes in the 2 × 2 matrix (O1, O2, O3, and O4), weighted according to the probability of each outcome, p(O1), p(O2), p(O3), p(O4):

W_{t} = O 1 • p (O 1) + O 2 • p (O 2) + O 3 • p (O 3) + O 4 • p (O 4)

(1)

The component of total fitness attributable to interpretation at any location in the parameter space (the interpretive index) can be quantified by correcting for the activity calculated for the ribozyme expected in the absence of any interpretive benefit. This ‘control’ fitness (W _c) is given by the total fitness that would obtain if there were, in fact, no correlation between the presence of the sign and the favourable environment: i.e., if the probabilities of these were independent. If the probability of the sign is p(S) and the probability of the favourable environment is p(E), then:

W_{c} = O 1 • p (S) p (E) + O 2 • p (S) (1 - p (E)) + O 3 • (1 - p (S)) p (E) + O 4 • (1 - p (S)) (1 - p (E))

(2)

When the correlation, r, between the presence of the sign and the presence of the favourable environment is zero, the probability of their co-occurrence, p(O1), is given by p(S)•p(E). In that case W _{t =}W _c and the presence of the sign carries no predictive value with respect to the presence of the favourable environment_. When the correlation is maximal, r = 1, the probability of the co-occurrence of sign and favourable environment, p(O1) is equal to the lower of p(S) and p(E). Between this minimum and maximum, the probability of co-occurrence of sign and favourable environment is linearly related to the correlation coefficient, r. For any value of p(O1) within parameter space, the values of p(O2), p(O3) and p(O4) are determined by the fact that p(S) = p(O1) + p(O2) and p(E) = p(O1) + p(O3).

The proportion of total fitness attributable to interpretive benefit (the interpretive index, I) is then given by:

I = (W_{t} - W_{c}) / W_{t}

(3)

References

Joyce GF: RNA evolution and the origins of life. Nature. 1989, 338: 217-224. 10.1038/338217a0.
Article PubMed CAS Google Scholar
Pross A: Toward a general theory of evolution: extending Darwinian theory to inanimate matter. J Syst Chem. 2011, 2: 1-10.1186/1759-2208-2-1.
Article CAS Google Scholar
Larson BC, Jensen RP, Lehman N: The chemical origin of behavior in rooted in abiogenesis. Life. 2012, 2: 313-322. 10.3390/life2040313.
Article PubMed PubMed Central Google Scholar
Emmeche C, Kull K: Towards a Semiotic Biology: Life is the Action of Signs. 2011, Imperial College Press, London, UK
Book Google Scholar
Robinson A, Southgate C: A general definition of interpretation and its application to origin of life research. Biol Philos. 2010, 25: 163-181. 10.1007/s10539-009-9188-4.
Article Google Scholar
Leong TL, Yang ZR, Robinson AJN, Southgate CCB: Interpretation and the origin of life. Biol Theory. 2010, 5: 112-116. 10.1162/BIOT_a_00027.
Article Google Scholar
Bernhardt HS: The RNA world hypothesis: the worst theory of the early evolution of life (except for all the others). Biol Direct. 2012, 7: 23-10.1186/1745-6150-7-23.
Article PubMed CAS PubMed Central Google Scholar
Grosshans CA, Cech TR: Metal ion requirements for sequence-specific endoribonuclease activity of the Tetrahymena ribozyme. Biochemistry. 1989, 28: 6888-6894. 10.1021/bi00443a017.
Article PubMed CAS Google Scholar
Pan T, Long DM, Uhlenbeck OC: Divalent Metal Ions in RNA Folding and Catalysis. The RNA World, First Edition. Edited by: Gesteland RF, Atkins JF. 1993, Cold Spring Harbor Laboratories Press, Cold Spring Harbor, NY, 271-302.
Google Scholar
Cernak P, Madix RA, Kuo LY, Lehman N: Accomodation of Ca(II) ions for catalytic activity by a group I ribozyme. J Inorg Biochem. 2008, 102: 1495-1506. 10.1016/j.jinorgbio.2008.01.011.
Article PubMed CAS Google Scholar
Kruger K, Grabowski PJ, Zaug AJ, Sands J, Gottschling DE, Cech TR: Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena. Cell. 1982, 31: 147-157. 10.1016/0092-8674(82)90414-7.
Article PubMed CAS Google Scholar
Lehman N, Joyce GF: Evolution in vitro of an RNA enzyme with altered metal dependence. Nature. 1993, 361: 182-185. 10.1038/361182a0.
Article PubMed CAS Google Scholar
Guo F, Gooding AR, Cech TR: Structure of the Tetrahymena ribozyme: base triple sandwich and metal ion at the active site. Mol Cell. 2004, 16: 351-362.
PubMed CAS Google Scholar
Herschlag D, Cech TR: Catalysis of RNA cleavage by the Tetrahymena thermophila ribozyme. 1. Kinetic description of the reaction of an RNA substrate complementary to the active site. Biochemistry. 1990, 29: 10159-10171. 10.1021/bi00496a003.
Article PubMed CAS Google Scholar
Feig AL, Uhlenbeck OC: The Role of Metal Ions in RNA Biochemistry. The RNA World, Second Edition. Edited by: Gesteland RF, Cech TR, Atkins JF. 1999, Cold Spring Harbor Laboratories Press, Cold Spring Harbor, NY, 287-319.
Google Scholar
Ronov AB: Probable changes in the composition of sea water during the course of geological time. Sedimentology. 1967, 10: 25-43. 10.1111/j.1365-3091.1968.tb01909.x.
Article Google Scholar
de Ronde CEJ, Channer DMD, Faure K, Bray CJ, Spooner ETC: Fluid chemistry of Archean seafloor hydrothermal vents: implications for the composition of circa 3.2 Ga seawater. Geochim Cosmochem Acta. 1997, 61: 4025-4042. 10.1016/S0016-7037(97)00205-6.
Article CAS Google Scholar
Farrah H, Hatton D, Pickering WF: The affinity of metal ions for clay surfaces. Chem Geol. 1980, 28: 55-68. 10.1016/0009-2541(80)90035-2.
Article CAS Google Scholar
Yarus M: How many catalytic RNAs? Ions and the Cheshire cat conjecture. FASEB J. 1993, 7: 31-39.
PubMed CAS Google Scholar
Southgate C, Robinson A: Interpretation and the origin of life. Zygon. 2010, 245: 345-360.
Article Google Scholar
Athavale SS, Petrov AS, Hsiao C, Watkins D, Prickett CD, Gossett JJ, Lie L, Bowman JC, O’Neill E, Bernier CR, Hud NV, Wartell RM, Harvey SC, Williams LD: RNA folding and catalysis mediated by iron(II). PLoS ONE. 2012, 7: e38024-10.1371/journal.pone.0038024.
Article PubMed CAS PubMed Central Google Scholar
Hsiao C, Chou I-C, Okafor CD, Bowman JC, O’Neill EB, Athavale SS, Petrov AS, Hud NV, Wartell RM, Harvey SC, Williams LD: RNA with iron(II) as a cofactor catalyzes electron transfer. Nat Chem. 2013, 5: 525-528. 10.1038/nchem.1649.
Article PubMed CAS Google Scholar
Riley CA, Lehman N: Generalized RNA-directed recombination of RNA. Chem Biol. 2003, 10: 1233-1243. 10.1016/j.chembiol.2003.11.015.
Article PubMed CAS Google Scholar
Beaudry AA, Joyce GF: Directed evolution of an RNA enzyme. Science. 1992, 257: 635-641. 10.1126/science.1496376.
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We would like to thank N. Vaidya, J.A.M. Yeates, and D. Gofreed for help during preparation of this manuscript. This work was supported by a STARS grant from the Templeton Foundation to C.S., A.R., and N.L., the Oregon Space Grant Consortium, and the Center for Life in Extreme Environments at Portland State University.

Author information

Authors and Affiliations

Department of Chemistry, Portland State University, Portland, OR, USA
Niles Lehman & Brian C Larson
Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
Tess Bernhard
College of Humanities, University of Exeter, Exeter, UK
Andrew JN Robinson & Christopher CB Southgate

Authors

Niles Lehman
View author publications
You can also search for this author in PubMed Google Scholar
Tess Bernhard
View author publications
You can also search for this author in PubMed Google Scholar
Brian C Larson
View author publications
You can also search for this author in PubMed Google Scholar
Andrew JN Robinson
View author publications
You can also search for this author in PubMed Google Scholar
Christopher CB Southgate
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Niles Lehman.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

NL, TB, and BL performed the RNA preparations and assays described in this study. AR and CS developed the concept of interpretation and proposed its implementation in an origins-of-life context. NL conceived of the study, participated in its design and coordination, and drafted the manuscript. AR and CS helped to write the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Lehman, N., Bernhard, T., Larson, B.C. et al. Empirical demonstration of environmental sensing in catalytic RNA: evolution of interpretive behavior at the origins of life. BMC Evol Biol 14, 248 (2014). https://doi.org/10.1186/s12862-014-0248-2

Download citation

Received: 07 August 2014
Accepted: 19 November 2014
Published: 04 December 2014
DOI: https://doi.org/10.1186/s12862-014-0248-2

Empirical demonstration of environmental sensing in catalytic RNA: evolution of interpretive behavior at the origins of life