Next Article in Journal
Evaluation of a Custom Design Gene Panel as a Diagnostic Tool for Human Non-Syndromic Infertility
Next Article in Special Issue
Inosine in Biology and Disease
Previous Article in Journal
Non-Target Effects of dsRNA Molecules in Hemipteran Insects
Previous Article in Special Issue
Tryptophan Depletion Modulates Tryptophanyl-tRNA Synthetase-Mediated High-Affinity Tryptophan Uptake into Human Cells
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Did Amino Acid Side Chain Reactivity Dictate the Composition and Timing of Aminoacyl-tRNA Synthetase Evolution?

by
Tamara L. Hendrickson
*,
Whitney N. Wood
†,‡ and
Udumbara M. Rathnayake
†,§
Department of Chemistry, Wayne State University, 5101 Cass Avenue, Detroit, MI 48202, USA
*
Author to whom correspondence should be addressed.
These two authors contributed equally to this manuscript.
Current address: Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
§
Current address: Laboratory of Cellular and Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.
Submission received: 17 February 2021 / Revised: 5 March 2021 / Accepted: 10 March 2021 / Published: 12 March 2021
(This article belongs to the Special Issue tRNAs in Biology)

Abstract

:
The twenty amino acids in the standard genetic code were fixed prior to the last universal common ancestor (LUCA). Factors that guided this selection included establishment of pathways for their metabolic synthesis and the concomitant fixation of substrate specificities in the emerging aminoacyl-tRNA synthetases (aaRSs). In this conceptual paper, we propose that the chemical reactivity of some amino acid side chains (e.g., lysine, cysteine, homocysteine, ornithine, homoserine, and selenocysteine) delayed or prohibited the emergence of the corresponding aaRSs and helped define the amino acids in the standard genetic code. We also consider the possibility that amino acid chemistry delayed the emergence of the glutaminyl- and asparaginyl-tRNA synthetases, neither of which are ubiquitous in extant organisms. We argue that fundamental chemical principles played critical roles in fixation of some aspects of the genetic code pre- and post-LUCA.

1. Introduction

The Central Dogma, including machinery for mRNA-guided, ribosomal protein translation, had already been defined at the time of the last universal common ancestor (LUCA) [1,2]. Selection of the 20 commonly encoded amino acids had occurred as well, with selenocysteine and pyrrolysine being added to the protein code post-LUCA [3,4,5,6,7]. Consequently, at the time of LUCA, mechanisms existed to produce at least one aminoacylated tRNA per commonly coded amino acid, with these aa-tRNAs connecting the codons in the genetic code to the amino acids in proteins [8]. Nevertheless, LUCA putatively contained only eighteen specific, canonical aaRSs, divided into two classes (class I and class II) based on the structures of their active sites. These enzymes were each capable of specifically attaching their cognate amino acid to the corresponding cognate isoacceptor set of tRNAs. This set of canonical aaRSs included two copies of lysyl-tRNA synthetase, one in each class, a notable example of convergent evolution [9,10]. LUCA also contained two non-discriminating (ND) aaRSs: ND-aspartyl- and ND-glutamyl-tRNA synthetase [8]. These enzymes not only aminoacylated their cognate tRNAs, tRNAAsp and tRNAGlu, they also misacylated tRNAAsn and tRNAGln to produce Asp-tRNAAsn and Glu-tRNAGln. These misacylated tRNAs were converted to their cognate aminoacyl-tRNAs through the function of an amidotransferase, ADT [11].
In this article, we will focus on the emergence of six aaRSs—all of which were either absent, non-discriminating, or duplicated in the most likely LUCA—namely the glutamyl- aspartyl-, glutaminyl-, asparaginyl-, cysteinyl-, and lysyl-tRNA synthetases (ND-GluRS, ND-AspRS, GlnRS, AsnRS, CysRS, and LysRS). We will briefly summarize what is known and hypothesized about the early evolution of each of these enzymes and propose scenarios wherein amino acid side chain reactivity and/or the metabolic availability of similar amino acids might have contributed to divergent evolutionary patterns. Figure 1 focuses on the four aaRSs that were either duplicated or missing in LUCA (LysRS, CysRS, AsnRS, and GlnRS). Of these, only LysRS definitively evolved pre-LUCA; however, it emerged in two forms, the expected class II enzyme and as a class I enzyme as well [9,10]. GluRS and AspRS have been omitted from this figure because non-discriminating variants of these enzymes existed at the time of LUCA and were specifically tied to the delayed emergence of GlnRS and AsnRS [11]. In each case, we will look at the novel hypothesis that important correlations between amino acid reactivities and the emergence of each of these enzymes can be drawn. We will also discuss circumstantial evidence from modern enzymes that support this hypothesis.
A word about nomenclature: Throughout this article, we will use common abbreviations for each aaRS, using the standard three-letter codes for each amino acid followed by RS to indicate tRNA synthetase. For example, alanyl-tRNA synthetase will be abbreviated as AlaRS, and so on. GluRS and AspRS can be found in two forms, discriminating (D) and non-discriminating (ND), based on their tRNA substrate specificities. In these cases, they will be referred to as D-GluRS, D-AspRS, ND-GluRS, and ND-AspRS. ND-GluRS glutamylates both tRNAGlu and tRNAGln with glutamate to produce Glu-tRNAGlu and Glu-tRNAGln [12]. Similarly, ND-AspRS aspartylates both tRNAAsp and tRNAAsn to generate Asp-tRNAAsp and Asp-tRNAAsn [13]. We will also refer to non-standard amino acids using their assigned three letter codes as follows: selenocysteine (Sec), ornithine (Orn), homocysteine (Hcy), and pyrrolysine (Pyl).

2. Early aaRS Evolution

Extensive work has gone into predicting, modeling, and understanding evolutionary events in the emergence and fixation of the aaRSs (Reviewed in [8]). Briefly, the two aaRS classes are believed to have evolved in parallel (Figure 2) [14,15,16]; consistently, there are ten aaRSs in each class, with the unusual exception of LysRS, which is found in both classes [9]. Each class of aaRSs was derived from a single, class-specific, active site domain. Over time, these two domains duplicated and diverged into the different precursors of modern aaRS, capable of activating amino acids and transferring the amino acid to RNA microhelices [17]. These single domains eventually acquired additional domains to achieve recognition of full-length tRNAs, for further diversification, and to acquire other functions [18].
Early aaRSs would have been non-specific for both their amino acid and tRNA substrates. Most proteinogenic amino acids can be synthesized under prebiotic conditions [19,20,21]. For a specific aaRS to emerge, a stable source of its cognate substrate needed to be available. For this reason, aaRS evolution had to have been closely tied to the evolution of biosynthetic pathways for the different amino acids. This concept has been considered by others in depth [22,23,24,25,26]. It will not be discussed further here except when necessary to note the probable availability of competing non-cognate substrates with respect to specific aaRSs.
One challenge faced by many aaRSs was (and is) how to select a specific, cognate amino acid when challenged with very similar, non-cognate potential substrates. For example, an early IleRS likely aminoacylated tRNAs with both Ile and Val, and perhaps even Leu, Ala, and α-aminobutyric acid. In fact, many modern aaRSs, including IleRS, still cannot sufficiently discriminate between cognate and structurally similar non-cognate amino acids and consequently use a proofreading activity to remove non-cognate amino acids from their tRNAs [27]. These editing active sites were typically recruited to the aaRS as an appended or insertion domain; examples exist of free-standing editing enzymes as well [28]. Editing is discussed in more detail in the next section of this manuscript.
Amino acid and tRNA substrate selectivities would have become more specific over time as each enzyme evolved. By the time of LUCA, 16 of the 18 aaRSs had presumably achieved sufficient levels of substrate selectivity to allow for accurate translation of the genetic code [8,29]. ND-AspRS and ND-GluRS were the two exceptions: These two enzymes had achieved amino acid selectivity for Asp and Glu, respectively, even as they each aminoacylated tRNAs from two different isoacceptor sets (tRNAAsp/Asn and tRNAGlu/Gln, respectively) [11].
Modern aaRSs all catalyze the same two reactions to attach their cognate amino acids to their cognate tRNA(s) (Figure 3) [30]. In step 1, the cognate amino acid is activated via condensation with ATP and the loss of inorganic pyrophosphate (PPi). Next, the activated amino acid is transferred to the 3′ ribose on the cognate tRNA. Class I aaRSs typically aminoacylate the 2′ hydroxyl on this ribose, whereas class II enzymes favor the 3′ hydroxyl group [31]. In vivo, amino acid activation is presumably driven forward by the action of inorganic pyrophosphatase, which hydrolyzes PPi into two phosphate ions [32]. The second reaction, tRNA aminoacylation, is considered irreversible at least partly because elongation factor TU (EF-Tu) binds the aminoacyl-tRNA (aa-tRNA) product. Without EF-Tu, many aa-tRNAs are unstable, as they are sensitive to hydrolysis [33]. These reactions were fixed in both aaRS classes before LUCA.
By necessity, the aminoacyl-adenylate (aa-AMP) intermediate common to all tRNA aminoacylation reactions is highly reactive. The high electrophilicity of the amino acid carbonyl, coupled with AMP as a powerful leaving group, are essential to enable nucleophilic attack by the appropriate tRNA hydroxyl group. However, we will argue herein that this reactivity prevented ornithine and homocysteine from entering the genetic code and contributed to the delay of selenocysteine’s inclusion in the modern genetic code. Amino acid side chain reactivity will also be considered as an explanation for the late emergence of AsnRS and GlnRS post-LUCA.

3. Generalities on How the aaRSs Recognize Cognate and Reject Non-Cognate Amino Acids

Protein translation occurs with errors at specific codons with frequencies that range from approximately 1 in 3000 to as much as 1 in 28,000 for a specific codon [34,35]. The aaRSs are prime drivers of this accuracy in that they are the enzymes that correctly pair each amino acid to its corresponding tRNA. Consequently, each aaRS had to evolve an active site that was optimized for recognition of its cognate amino acid over structurally similar non-cognate metabolites. This discrimination is straightforward in some cases and difficult in others.
Consider IleRS: its active site is tailored to select isoleucine, and thus, it would be too hydrophobic and small to allow binding of an amino acid such as arginine, for example. On the other hand, this same active site would have a much harder time discriminating against the smaller, structurally similar valine. In fact, Linus Pauling argued that binding energies would differ by only ~1 kcal/mol when the difference in size between two potential substrates was only one methylene group [36]. Consistently, IleRS misactivates valine about once for every 180 times it correctly activates isoleucine and it also misacylates tRNAIle with valine. To address this challenge, IleRS contains a second, editing active site that corrects for these errors such that valine is misincorporated into proteins at isoleucine codons with a frequency of only 1 in 3000 [37,38].
This editing strategy was first described as a double-sieve by Alan Fersht: the first sieve rejects highly dissimilar amino acids but, by necessity, lets through structurally similar ones; then, the second sieve rejects the cognate aminoacyl-tRNA while binding to and hydrolyzing any non-cognate aminoacyl-tRNAs [39]. With IleRS, the first sieve is the aminoacylation active site that occasionally misactivates valine and produces Val-tRNAIle. The second sieve is the editing active site that hydrolyzes Val-tRNAIle but leaves the larger Ile-tRNAIle, the desired product, intact [37,38]. This strategy is used by a number of aaRSs to ensure accurate tRNA aminoacylation [40]. Free-standing editing enzymes have also been discovered [41]. Some aaRS active sites can also release misactivated aa-AMPs, which are subsequently hydrolyzed spontaneously or via intramolecular cyclization, a process that will be discussed further below [42].
This IleRS example highlights the challenges faced by aaRSs early in evolution. Emergence of a single active site that rejects structurally dissimilar amino acids would not always have been straightforward. Before editing domains were available, evolving enzymes would have been limited to discrimination by induced fit and steric hindrance, coupled with matched polarities between the active site and the desired substrate. To our knowledge, the role of chemical reactivity in different amino acids has not been considered with respect to how the genetic code was selected.

4. The Evolution of LysRS: Why Is Ornithine Not Part of the Genetic Code?

As discussed above, presumably the aaRSs showed substrate promiscuity early in their evolution, becoming more selective over time until they reached their modern levels of substrate specificity. Assuming this hypothesis is true, we are interested in exploring possible features that promoted the selection of one amino acid over another. With respect to LysRS, why was lysine chosen over ornithine? Both lysine and ornithine would have been metabolically available, the latter because it is a precursor to arginine and proline [43]. Furthermore, there are modern examples of other types of promiscuous enzymes that utilize both lysine and ornithine [44], demonstrating that this type of substrate duality is possible. Ornithine is a negligible inhibitor of both LysRS1 and LysRS2 with Ki values in the low mM range. LysRS2 (but not LysRS1) can aminoacylate tRNALys with ornithine, producing the misacylated Orn-tRNALys, at least with mM concentrations of ornithine [45]. The ability of modern LysRS2 to use ornithine as a substrate, albeit weakly, supports the hypothesis that both Lys and Orn were acceptable substrates for LysRS early in evolution. So, why and how was lysine selected over ornithine for inclusion in the genetic code?
A few possibilities suggest themselves as answers to this question. First, lysine and LysRS may have been paired and selected by chance, simply winning the evolutionary coin toss. While possible, this scenario is belied by the fact that both LysRS1 and LysRS2 separately developed the ability to discriminate against ornithine in favor of lysine. Second, the use of ornithine in the biosynthesis of arginine and proline might have limited ornithine availability. However, at least in modern organisms, ornithine is part of the urea cycle; is often used in the biosynthesis of antibiotics, siderophores, and polyamines; and free ornithine was recently discovered to bind to the ribosome to regulate polyamine biosynthesis [43]; all of which suggest that ornithine availability is rarely limited. Third, differences in the chemical reactivities of ornithine versus lysine might have favored lysine. Yet, the only difference between these two amino acids is in the length of the side chain, with lysine being one methylene group longer than ornithine. This small difference only allows for discrimination against ornithine at a rate of 1 in ~200, significantly worse than the 1 in 3000 errors typically quoted as necessary for cellular viability [46]. Consequently, any reasonable hypothesis would have to explain how this difference in length could lead to more robust rejection of ornithine by LysRS.
We propose that the shorter length of ornithine disfavored its uptake into the code because the Orn-AMP intermediate would be intrinsically less stable than the corresponding Lys-AMP. Under physiological conditions, approximately 0.1% of the ornithine and lysine side chain amines are deprotonated, neutral, and nucleophilic at a given time. AaRS-catalyzed activation of each amino acid as an aa-AMP would position these neutral amines for intramolecular nucleophilic attack onto their own carboxylic acids, now activated for attack by AMP, to produce the corresponding lactams (Figure 4). Cyclization would pull the deprotonation equilibrium forward, further favoring cyclization. These lactam products would be highly stable due to the newly formed amide bond and their formation would likely be spontaneous.
The cyclization reactions in Figure 4 highlight an important distinction between lysine and ornithine: The lysine lactam is a seven-membered ring, and therefore disfavored by approximately 250–500-fold (~3 kcal/mol), compared to the six-membered ring formed from ornithine [47,48]. We propose that this difference was sufficient to favor ornithine lactam formation over Orn-tRNALys biosynthesis, allowing lysine to claim a spot in the modern genetic code. In fact, modern LysRS2 has retained the ability to activate ornithine to Orn-AMP, which then cyclizes to the lactam [49]. To our knowledge, the ability of LysRS1 to make this lactam has never been studied. Given that amino acids in aminoacyl-tRNAs are activated at their carbonyl groups, lactam formation could even occur on Orn-tRNALys, further favoring the evolution of LysRS over an OrnRS. In fact, it is likely that cyclization to the ornithine lactam prohibited ornithine from consideration even earlier in evolution. In one effort to recapitulate how peptide bonds might have formed spontaneously under prebiotic conditions, lysine was readily incorporated into peptides and depsipeptides under conditions where ornithine and diaminobutyric acid (a serine isostere) cyclized to their corresponding lactams [23]. This important observation strongly suggests that amino acid side chain chemistry impacted evolution from the very beginning.
Importantly, as discussed in the next section, this same kind of side chain reactivity and cyclization can be used to explain how homocysteine might have been eliminated from the genetic code as well. The parallels between these two examples are striking, which, in our opinion, lends them further support.

5. The Cases against Homocysteine, Homoserine, and Selenocysteine

Differences in amino acid side chain chemistry can also be invoked to explain the exclusion of homocysteine (and possibly the late and only partial inclusion of selenocysteine) into the genetic code as well. Like ornithine, homocysteine is a metabolically available amino acid as it is a precursor to methionine and a downstream hydrolysis product from S-adenosyl methionine (SAM) methylation reactions. So, why did CysRS and MetRS specifically emerge instead of a homocysteinyl-tRNA synthetase (HcyRS) replacing one of these enzymes?
The side chain of homocysteine is nucleophilic, especially when deprotonated. Homocysteine’s side chain pKa is ~8, such that at physiological pH approximately 20% will be deprotonated to the highly nucleophilic sulfur anion. With the assumption that primordial CysRS and MetRS enzymes were sloppy and could activate homocysteine in addition to cysteine or methionine, respectively, then the immediate cyclization of the Hcy-AMP intermediate to the five-membered cyclic thiolactone would have been spontaneous (Figure 5A) and would have been favored over cyclization of Cys-AMP to the more strained four-membered ring by approximately 100-fold [48]. Thus, this deleterious cyclization reaction would have favored cysteine selectivity over homocysteine. Steric hindrance would have likely played an additional role in CysRS evolution. It is much easier for an enzyme active site to reject a larger amino acid, namely homocysteine, over the smaller cysteine via simple steric hindrance. Consequently, it is likely that both factors, the favorability of Hcy-AMP to cyclize and the use of steric hindrance to weaken recognition of Hcy, played roles in the emergence of CysRS over a HcyRS.
While the hydroxyl group of homoserine is significantly less nucleophilic than the thiols of cysteine and homocysteine, a similar cyclization reaction (not shown) to the homoserine lactone could also explain the absence of homoserine and a homoseryl-tRNA synthetase from the genetic code. In fact, homoserine is activated to Hse-AMP by modern LysRS2 [50], ValRS, and IleRS [51]. In all three cases, Hse-AMP is subsequently cyclized to the corresponding lactone, supporting this hypothesis. It has been proposed that these cyclization reactions are enzyme-catalyzed [52]; although, to our knowledge, mutations that disrupt this reaction without disrupting amino acid activation have never been identified, suggesting that these are examples of selective release and spontaneous cyclization instead.
MetRS would have faced a more difficult specificity challenge that is reminiscent of that faced by LysRS. Homocysteine is the metabolic precursor to methionine; it lacks the δ-methyl group and terminates in a thiol. Consequently, discrimination ability based on size would be limited, as predicted by Pauling [36] and demonstrated for IleRS [39] and other editing aaRSs (see above). However, Met-AMP would be dramatically more stable than Hcy-AMP because the sulfur in the side chain of Met is not nucleophilic in aqueous medium and thus not susceptible to degradation via cyclization (Figure 5B). (The sensitivity of methionine to oxidation will be discussed at the end of this article). Consequently, the favorability of the Hcy-AMP intramolecular cyclization reaction (Figure 5A) was the likely driving force behind MetRS acquiring specificity for methionine over homocysteine. Remarkably, support for this hypothesis is evident in modern MetRS. Modern MetRS enzymes still misactivate homocysteine to the Hcy-AMP, which cyclizes to the thiolactone [53]. It is likely that this cyclization reaction is an example of selective release and spontaneous cyclization rather than an enzyme-catalyzed editing reaction. MetRS actually releases some Met-AMP as well, suggesting that some release of Met-AMP is necessary to ensure release of Hcy-AMP to limit production Hcy-tRNAMet [54]. Thus, MetRS is a modern example of the use of selective release to achieve substrate selectivity, a trait that presumably emerged pre-LUCA during the selection of methionine over homocysteine as its amino acid substrate.
The use of selenocysteine in the proteome also fits into this picture. Selenocysteine was incorporated into the genetic code as the 21st amino acid post-LUCA [55,56,57]. This cysteine isostere is ribosomally incorporated into only a few proteins in various organisms across all domains of life, including humans. In these species, the UGA stop codon has been reassigned to selenocysteine. Selenocysteine is a useful amino acid with a lower pKa than cysteine (pKa = 5.4); it is fully deprotonated at physiological pH [56]. It also has a lower reduction potential, making it useful for enzyme-catalyzed redox reactions, and is resistant to permanent oxidative damage [56]. Given that the more electron-rich selenide anion is a stronger nucleophile than the cysteine sulfide anion, cyclization to the selenolactone would likely have competed with tRNA aminoacylation, even with the more strained, four-membered ring as a product. Thus, it is probable that cyclization to the selenolactone contributed to the delayed use of selenocysteine in biology. In fact, in modern organisms, Sec-tRNASec is biosynthesized indirectly after tRNA aminoacylation. This more convoluted approach to tRNA aminoacylation circumvents the formation of the highly unstable Sec-AMP to support this evolutionary scenario. Consistently, Sec-tRNASec would also be unstable, compared to Cys-tRNACys. In fact, it is impressive that the unusual chemical capabilities of selenocysteine were sufficient to overcome the challenges of stably attaching this amino acid to tRNASec in any organism.

6. The Post-LUCA Emergence of GlnRS and AsnRS

In the two preceding sections, we argued that the nucleophilicity of ornithine, homocysteine, homoserine, and selenocysteine directly impacted the exclusion of these amino acids from the standard genetic code. The emergence of GlnRS and AsnRS, post-LUCA, is harder to explain even as these two enzymes were clearly the last two to emerge and are the rarest of the 20 standard aaRSs [11]. GlnRS specifically appeared in eukarya; it is only present in a small subset of microorganisms, such as Escherichia coli, as a result of lateral gene transfer [58]. AsnRS emerged earlier than GlnRS and appeared in all three domains in life [59,60]. In organisms that are missing AsnRS and/or GlnRS, Asn-tRNAAsn and/or Gln-tRNAGln are biosynthesized indirectly, with amino acid formation occurring on the tRNA using Asp-tRNAAsn and Glu-tRNAGln as precursors (synthesized by non-discriminating versions of AspRS and GluRS) [61]. This indirect pathway was fixed at the time of LUCA [11].
The same chemistry used above to explain the exclusion of homocysteine and ornithine does not cleanly explain the very late evolution of AsnRS and GlnRS. The challenge is that the carboxylate side chains of aspartate and glutamate are much more nucleophilic than the carboxyamides of asparagine or glutamine. Thus, Asp-AMP and Glu-AMP would have been more prone to intramolecular cyclization than Asn-AMP and Gln-AMP. Any argument based on amino acid nucleophilicity would suggest that AspRS and GluRS should have emerged after AsnRS and GlnRS, instead of the other way around. We will discuss a few other possibilities here although none prove to be as satisfying as those presented above for non-coded amino acids with nucleophilic side chains.
Glutamine is commonly used as a source of nitrogen throughout metabolism and was available early in evolution [62]. Consequently, a lack of glutamine cannot be used to explain the delayed evolution of GlnRS. Glutamine has another unusual property: It is prone to spontaneous cyclization to pyroglutamate (pGlu, Figure 6) [63]. In multicellular and some single cellular eukaryotes, pyroglutamate can be found on the N-terminus of some peptides and proteins. It can form spontaneously or be enzymatically synthesized by the enzyme glutaminyl cyclase [64]. It can be removed via the action of pyroglutamyl peptidase [65]. If glutamine cyclization were to inadvertently occur on tRNAGln, then the resultant pGlu-tRNAGln ester bond would be stabilized (similar to acetylation of aminoacyl-tRNAs [66]); the α-nitrogen of pyroglutamate would be tied up in an amide bond such that it could not participate in ribosomal translation; and, critically, the tRNA would be permanently taken out of the translation cycle. Thus, the formation of pGlu-tRNAGln would have been energetically costly to the emerging protein translation apparatus. Pyroglutamate peptidase has been found in archaea that lack glutaminyl cyclase, and its function in these species is unknown [67]. One unexplored possibility is that it is a pGlu-tRNAGln hydrolase, such that it would free up tRNAGln for recycling back into translation.
At face value, the formation of pyroglutamate on tRNAGln seems like it could explain the delayed emergence of GlnRS. Pyroglutamate formation has been observed in assays of the archaeal amidotransferase GatDE that converts Glu-tRNAGln into Gln-tRNAGln [68,69], highlighting that Gln cyclization can occur on tRNAGln. However, there are weaknesses to this theory: Glutamate can also cyclize to pyroglutamate [63] and the indirect biosynthesis of Gln-tRNAGln proceeds through a γ-phosphorylglutamatyl-tRNAGln intermediate [70,71], which would be highly prone to cyclization to pGlu-tRNAGln with phosphate as a potent leaving group. Despite these challenges, both GluRS and the indirect pathway for Gln-tRNAGln biosynthesis were fixed pre-LUCA. Consequently, it seems unlikely that deleterious pyroglutamate formation was the main driver behind delaying the emergence of GlnRS. Nevertheless, it is possible that this cyclization reaction contributed to this delay to some extent.
The corresponding asparagine cyclization reaction would yield 4-oxoazetidine-2-carboxylic acid. To our knowledge, this four-membered ring has never been observed on tRNAAsn, suggesting its formation was not a strong deterrent of AsnRS emergence. Additionally, aspartic acid would be prone to this same cyclization reaction. Computational calculations suggest that cyclization of aspartic acid (and asparagine by analogy) to this azetidine is not favored [72]. Together, these observations suggest that asparagine cyclization was not a driving force in development of AsnRS. In fact, the delayed emergence of AsnRS probably has little to do with amino acid chemistry. Instead, it is more likely that indirect tRNA aminoacylation of tRNAAsn provided an early route to asparagine biosynthesis. It was not until the emergence of asparagine synthetase A (AS-A), which evolved from AspRS, that selection of an AsnRS could occur. Some modern organisms still lack AS-A and rely on indirect tRNA aminoacylation for asparagine biosynthesis even in the presence of a functional AsnRS [11,60].
All told, amino acid reactivities do not seem to be the major driving forces between the late fixation of GlnRS and AsnRS and other theories provide more satisfying possible explanations. For example, in Mycobacterium tuberculosis, indirect tRNA aminoacylation of tRNAAsn can be co-opted to introduce aspartate into proteins at asparagine codons when faced with antibiotic stress [73]. Perhaps this flexibility offered enough of a selective advantage to promote the retention of indirect tRNA aminoacylation over the emergence of AsnRS and GlnRS.

7. Conclusions

Herein, we have examined the possible impact of amino acid side chain chemistry on the emergence of the aaRSs pre- and post-LUCA. It seems probable that chemical reactivity prohibited homocysteine, homoserine, and ornithine from entering the genetic code and delayed and limited the use of selenocysteine to a few specialized examples. We have presented these chemical arguments with respect to fixation of the modern aaRSs. However, it is quite possible that the propensities of ornithine and homocysteine to cyclize when activated at their α-carbonyls eliminated these amino acids from an emerging genetic code much earlier in time, perhaps even during the RNA world stage of evolution. Chemical reactivity might have contributed to the delay of GlnRS post-LUCA, whereas AsnRS was likely delayed only by a need for a mechanism to synthesize asparagine. The advantages of substrate promiscuity during indirect tRNA aminoacylation might also have been a factor.
We have made these arguments using organic chemistry fundamentals paired with circumstantial evidence from phylogenetic analyses and studies on extant aaRSs and related enzymes. With all evolutionary theories for events that might have occurred pre-LUCA, it is difficult to imagine being able to prove the hypotheses proposed here. Computational calculations could provide energies of activation for the different reactions discussed. Different aa-AMPs could be synthesized and their rates of cyclization could be directly measured; these experiments are challenging, however, given the intrinsic lability of these intermediates. Finally, an archaeal pyroglutamate peptidase could be tested for pGlu-tRNAGln hydrolase activity. Experiments like these would offer further circumstantial support for these hypotheses.
We also chose to focus only on systems that show unusual phylogenetic patterns (e.g., LysRS, CysRS) or were missing in LUCA (GlnRS, AsnRS, Sec-tRNASec, maybe CysRS). Other chemical challenges could have impacted aaRS evolution pre-LUCA. Notably, oxidative damage might have posed a challenge that could have impacted cysteine, methionine, tryptophan and tyrosine availability and stability [74]. Nevertheless, the corresponding aaRSs, with the possible exception of CysRS, all evolved pre-LUCA.
To our knowledge, amino acid reactivity has been largely neglected from evolutionary considerations of the aaRSs. This omission has neglected the important role that chemistry likely played in early selection of these enzymes.

Author Contributions

T.L.H. developed the hypotheses. T.L.H., W.N.W., and U.M.R. researched the concept and wrote the manuscript together. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors thank Rebecca Alexander, Joseph Chihade, Michael Ibba, Karin Musier-Forsyth, and Lluis Ribas de Pouplana for helpful discussions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mushegian, A.R.; Koonin, E.V. A minimal gene set for cellular life derived by comparison of complete bacterial genomes. Proc. Natl. Acad. Sci. USA 1996, 93, 10268–10273. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Fox, G.E. Origin and evolution of the ribosome. Cold Spring Harb. Perspect. Biol. 2010, 2, a003483. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Böck, A.; Forchhammer, K.; Heider, J.; Leinfelder, W.; Sawers, G.; Veprek, B.; Zinoni, F. Selenocysteine: The 21st amino acid. Mol. Microbiol. 1991, 5, 515–520. [Google Scholar] [CrossRef]
  4. Copeland, P.R. Making sense of nonsense: The evolution of selenocysteine usage in proteins. Genome Biol. 2005, 6, 221. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Borrel, G.; Gaci, N.; Peyret, P.; O’Toole, P.W.; Gribaldo, S.; Brugere, J.-F. Unique characteristics of the pyrrolysine system in the 7th order of methanogens: Implications for the evolution of a genetic code expansion cassette. Archaea 2014, 2014, 374146. [Google Scholar] [CrossRef]
  6. Heinemann, I.U.; O’Donoghue, P.; Madinger, C.; Benner, J.; Randau, L.; Noren, C.J.; Söll, D. The appearance of pyrrolysine in tRNAHis guanylyltransferase by neutral evolution. Proc. Natl. Acad. Sci. USA 2009, 106, 21103–21108. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Brazil, R. The Alphabet Soup of Life. Available online: https://www.chemistryworld.com/features/why-are-there-20-amino-acids/3009378.article#/ (accessed on 9 February 2021).
  8. Ribas de Pouplana, L. The evolution of aminoacyl-tRNA synthetases: From dawn to LUCA. In Biology of Aminoacyl-tRNA Synthetases; Ribas de Pouplana, L., Kaguni, L.S., Eds.; Academic Press: Cambridge, MA, USA, 2020; Volume 48. [Google Scholar]
  9. Ibba, M.; Morgan, S.; Curnow, A.W.; Pridmore, D.R.; Vothknecht, U.C.; Gardner, W.; Lin, W.; Woese, C.R.; Soll, D. A euryarchaeal lysyl-tRNA synthetase: Resemblance to class I synthetases. Science 1997, 278, 1119–1122. [Google Scholar] [CrossRef] [PubMed]
  10. Ambrogelly, A.; Korencic, D.; Ibba, M. Functional annotation of class I lysyl-tRNA synthetase phylogeny indicates a limited role for gene transfer. J. Bacteriol. 2002, 184, 4594–4600. [Google Scholar] [CrossRef] [Green Version]
  11. Sheppard, K.; Söll, D. On the evolution of the tRNA-dependent amidotransferases, GatCAB and GatDE. J. Mol. Biol. 2008, 377, 831–844. [Google Scholar] [CrossRef] [Green Version]
  12. Curnow, A.W.; Hong, K.-W.; Yuan, R.; Kim, S.-I.; Martins, O.; Winkler, W.; Henkin, T.M.; Söll, D. Glu-tRNAGln amidotransferase: A novel heterotrimeric enzyme required for correct decoding of glutamine codons during translation. Proc. Natl. Acad. Sci. USA 1997, 94, 11819–11826. [Google Scholar] [CrossRef] [Green Version]
  13. Raczniak, G.; Becker, H.D.; Min, B.; Söll, D. A single amidotransferase forms asparaginyl-tRNA and glutaminyl-tRNA in Chlamydia trachomatis. J. Biol. Chem. 2001, 276, 45862–45867. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Eriani, G.; Delarue, M.; Poch, O.; Gangloff, J.; Moras, D. Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature 1990, 347, 203–206. [Google Scholar] [CrossRef] [PubMed]
  15. Rodin, S.N.; Ohno, S. Two types of aminoacyl-tRNA synthetases could be originally encoded by complementary strands of the same nucleic acid. Orig. Life Evol. Biosph. 1995, 25, 565–589. [Google Scholar] [CrossRef]
  16. Ribas de Pouplana, L.; Schimmel, P. Two classes of tRNA synthetases suggested by sterically compatible dockings on tRNA acceptor stem. Cell 2001, 104, 191–193. [Google Scholar] [CrossRef] [Green Version]
  17. Francklyn, C.; Schimmel, P. Enzymatic aminoacylation of an eight-base-pair microhelix with histidine. Proc. Natl. Acad. Sci. USA 1990, 87, 8655–8659. [Google Scholar] [CrossRef] [Green Version]
  18. Ribas de Pouplana, L.; Schimmel, P. A view into the origin of life: Aminoacyl-tRNA synthetases. Cell. Mol. Life Sci. 2000, 57, 865–870. [Google Scholar] [CrossRef] [PubMed]
  19. Wong, J.T.-F.; Ng, S.-K.; Mat, W.-K.; Hu, T.; Xue, H. Coevolution theory of the genetic code at age forty: Pathway to translation and synthetic life. Life 2016, 6, 12. [Google Scholar] [CrossRef] [Green Version]
  20. Miller, S.L. A production of amino acids under possible primitive earth conditions. Science 1953, 117, 528–529. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Higgs, P.G.; Pudritz, R.E. A thermodynamic basis for prebiotic amino acid synthesis and the nature of the first genetic code. Astrobiology 2009, 9, 483–490. [Google Scholar] [CrossRef]
  22. Fournier, G.P.; Andam, C.P.; Alm, E.J.; Gogarten, J.P. Molecular evolution of aminoacyl tRNA synthetase proteins in the early history of life. Orig. Life Evol. Biosph. 2011, 41, 621–632. [Google Scholar] [CrossRef] [Green Version]
  23. Frenkel-Pinter, M.; Haynes, J.W.; Martin, C.; Petrov, A.S.; Burcar, B.T.; Krishnamurthy, R.; Hud, N.V.; Leman, L.J.; Williams, L.D. Selective incorporation of proteinaceous over nonproteinaceous cationic amino acids in model prebiotic oligomerization reactions. Proc. Natl. Acad. Sci. USA 2019, 116, 16338–16346. [Google Scholar] [CrossRef] [Green Version]
  24. Xavier, J.C.; Hordijk, W.; Kauffman, S.; Steel, M.; Martin, W.F. Autocatalytic chemical networks at the origin of metabolism. Proc. R. Soc. B 2020, 287, 20192377. [Google Scholar] [CrossRef]
  25. Takenaka, A.; Moras, D. Correlation between equi-partition of aminoacyl-tRNA synthetases and amino-acid biosynthesis pathways. Nucleic Acids Res. 2020, 48, 3277–3285. [Google Scholar] [CrossRef]
  26. Gospodinov, A.; Kunnev, D. Universal codons with enrichment from GC to AU nucleotide composition reveal a chronological assignment from early to late along with LUCA formation. Life 2020, 10, 81. [Google Scholar] [CrossRef]
  27. Tawfik, D.S.; Gruic-Sovulj, I. How evolution shapes enzyme selectivity—lessons from aminoacyl-tRNA synthetases and other amino acid utilizing enzymes. FEBS J. 2020, 287, 1284–1305. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. An, S.; Musier-Forsyth, K. Trans-editing of Cys-tRNAPro by Haemophilus influenzae YbaK protein. J. Biol. Chem. 2004, 279, 42359–42362. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. O’Donoghue, P.; Luthey-Schulten, Z. On the evolution of structure in aminoacyl-tRNA synthetases. Microbiol. Mol. Biol. Rev. 2003, 67, 550–573. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Ibba, M.; Söll, D. Aminoacyl-tRNA synthesis. Annu. Rev. Biochem. 2000, 69, 617–650. [Google Scholar] [CrossRef] [PubMed]
  31. Weinger, J.S.; Strobel, S.A. Participation of the tRNA A76 hydroxyl groups throughout translation. Biochemistry 2006, 45, 5939–5948. [Google Scholar] [CrossRef] [Green Version]
  32. Wolfson, A.D.; Uhlenbeck, O.C. Modulation of tRNAAla identity by inorganic pyrophosphatase. Proc. Natl. Acad. Sci. USA 2002, 99, 5965–5970. [Google Scholar] [CrossRef] [Green Version]
  33. Peacock, J.R.; Walvoord, R.R.; Chang, A.Y.; Kozlowski, M.C.; Gamper, H.; Hou, Y.M. Amino acid-dependent stability of the acyl linkage in aminoacyl-tRNA. RNA 2014, 20, 758–764. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Loftfield, R.B.; Vanderjagt, D. The frequency of errors in protein biosynthesis. Biochem. J. 1972, 128, 1353–1356. [Google Scholar] [CrossRef]
  35. Freist, W.; Sternbach, H.; Cramer, F. Isoleucyl-tRNA synthetase from baker’s yeast and from Escherichia coli MRE 600. Discrimination of 20 amino acids in aminoacylation of tRNA(Ile)-C-C-A. Eur. J. Biochem. 1988, 173, 27–34. [Google Scholar] [CrossRef] [PubMed]
  36. Pauling, L. Festschrift fuer Prof. Dr. Arthur Stoll Siebzigsten; Stoll, A., Ed.; Birkhayser: Basel, Switzerland, 1958; pp. 597–602. [Google Scholar]
  37. Schmidt, E.; Schimmel, P. Mutational isolation of a sieve for editing in a transfer RNA synthetase. Science 1994, 264, 265–267. [Google Scholar] [CrossRef]
  38. Nureki, O.; Vassylyev, D.G.; Tateno, M.; Shimada, A.; Nakama, T.; Fukai, S.; Konno, M.; Hendrickson, T.L.; Schimmel, P.; Yokoyama, S. Enzyme structure with two catalytic sites for double-sieve selection of substrate. Science 1998, 280, 578–582. [Google Scholar] [CrossRef] [PubMed]
  39. Fersht, A.R.; Dingwall, C. Evidence for the double-sieve editing mechanism in protein synthesis. Steric exclusion of isoleucine by valyl-tRNA synthetases. Biochemistry 1979, 18, 2627–2631. [Google Scholar] [CrossRef] [PubMed]
  40. Hendrickson, T.L.; Schimmel, P. Transfer RNA-dependent amino acid discrimination by aminoacyl-tRNA synthetases. In Translational Mechanisms; Lapointe, J., Brakier-Gringas, L., Eds.; Kluwer Academic/Plenum Publishers: New York, NY, USA, 2003; pp. 34–64. [Google Scholar]
  41. Wong, F.C.; Beuning, P.J.; Silvers, C.; Musier-Forsyth, K. An isolated class II aminoacyl-tRNA synthetase insertion domain is functional in amino acid editing. J. Biol. Chem. 2003, 278, 52857–52864. [Google Scholar] [CrossRef] [Green Version]
  42. Hati, S.; Ziervogel, B.; Sternjohn, J.; Wong, F.C.; Nagan, M.C.; Rosen, A.E.; Siliciano, P.G.; Chihade, J.W.; Musier-Forsyth, K. Pre-transfer editing by class II prolyl-tRNA synthetase: Role of aminoacylation active site in "selective release" of noncognate amino acids. J. Biol. Chem. 2006, 281, 27862–27872. [Google Scholar] [CrossRef] [Green Version]
  43. Sivashanmugam, M.; Jaidev, J.; Umashankar, V.; Sulochana, K.N. Ornithine and its role in metabolic diseases: An appraisal. Biomed. Pharmacother. 2017, 86, 185–194. [Google Scholar] [CrossRef]
  44. Wang, S.; Liang, H.; Liu, L.; Jiang, X.; Wu, S.; Gao, H. Promiscuous enzymes cause biosynthesis of diverse siderophores in Shewanella oneidensis. Appl. Environ. Microbiol. 2020, 86. [Google Scholar] [CrossRef]
  45. Levengood, J.; Ataide, S.F.; Roy, H.; Ibba, M. Divergence in noncognate amino acid recognition between class I and class II lysyl-tRNA synthetases. J. Biol. Chem. 2004, 279, 17707–17714. [Google Scholar] [CrossRef] [Green Version]
  46. Perona, J.J.; Gruic-Sovulj, I. Synthetic and editing mechanisms of aminoacyl-tRNA synthetases. Top. Curr. Chem. 2013, 344, 1–41. [Google Scholar] [CrossRef]
  47. DeTar, D.F.; Brooks, W. Cyclization and polymerization of ω-(Bromoalkyl)dimethylamines. J. Org. Chem. 1978, 43, 2245–2248. [Google Scholar] [CrossRef]
  48. Galli, C.; Illuminati, G.; Mandolini, L.; Tamborra, P. Ring-closure reactions. 7. Kinetics and activation parameters of lactone formation in the range of 3- to 23-membered rings. J. Am. Chem. Soc. 1977, 99, 2591–2597. [Google Scholar] [CrossRef]
  49. Jakubowski, H. Misacylation of tRNALys with noncognate amino acids by lysyl-tRNA synthetase. Biochemistry 1999, 38, 8088–8093. [Google Scholar] [CrossRef]
  50. Jakubowski, H. Aminoacyl thioester chemistry of class II aminoacyl-tRNA synthetases. Biochemistry 1997, 36, 11077–11085. [Google Scholar] [CrossRef]
  51. Jakubowski, H.; Goldman, E. Editing of errors in selection of amino acids for protein synthesis. Microbiol. Rev. 1992, 56, 412–429. [Google Scholar] [CrossRef] [PubMed]
  52. Jakubowski, H. Quality control in tRNA charging—editing of homocysteine. Acta Biochim. Pol. 2011, 58, 149–163. [Google Scholar] [CrossRef] [Green Version]
  53. Jakubowski, H. Homocysteine editing, thioester chemistry, coenzyme A, and the origin of coded peptide synthesis. Life 2017, 7, 6. [Google Scholar] [CrossRef] [Green Version]
  54. Gillet, S.; Hountondji, C.; Schmitter, J.-M.; Blanquet, S. Covalent methionylation of Escherichia coli methionyl-tRNA synthethase: Identification of the labeled amino acid residues by matrix-assisted laser desorption-ionization mass spectrometry. Protein Sci. 1997, 6, 2426–2435. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Labunskyy, V.M.; Hatfield, D.L.; Gladyshev, V.N. Selenoproteins: Molecular pathways and physiological roles. Physiol. Rev. 2014, 94, 739–777. [Google Scholar] [CrossRef] [Green Version]
  56. Maroney, M.J.; Hondal, R.J. Selenium versus sulfur: Reversibility of chemical reactions and resistance to permanent oxidation in proteins and nucleic acids. Free Radic. Biol. Med. 2018, 127, 228–237. [Google Scholar] [CrossRef]
  57. Palioura, S.; Sherrer, R.L.; Steitz, T.A.; Söll, D.; Simonovic, M. The human SepSecS-tRNASec complex reveals the mechanism of selenocysteine formation. Science 2009, 325, 321–325. [Google Scholar] [CrossRef] [Green Version]
  58. Lamour, V.; Quevillon, S.; Diriong, S.; N’Guyen, V.C.; Lipinski, M.; Mirande, M. Evolution of the Glx-tRNA synthetase family: The glutaminyl enzyme as a case of horizontal gene transfer. Proc. Natl. Acad. Sci. USA 1994, 91, 8670–8674. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Shiba, K.; Motegi, H.; Yoshida, M.; Noda, T. Human asparaginyl-tRNA synthetase: Molecular cloning and the inference of the evolutionary history of Asx-tRNA synthetase family. Nucleic Acids Res. 1998, 26, 5045–5051. [Google Scholar] [CrossRef] [Green Version]
  60. Roy, H.; Becker, H.D.; Reinbolt, J.; Kern, D. When contemporary aminoacyl-tRNA synthetases invent their cognate amino acid metabolism. Proc. Natl. Acad. Sci. USA 2003, 100, 9837–9842. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  61. Rathnayake, U.M.; Wood, W.N.; Hendrickson, T.L. Indirect tRNA aminoacylation during accurate translation and phenotypic mistranslation. Curr. Opin. Chem. Biol. 2017, 41, 114–122. [Google Scholar] [CrossRef] [PubMed]
  62. Young, V.R.; Ajami, A.M. Glutamine: The emperor or his clothes? J. Nutr. 2001, 131, 2449S–2459S. [Google Scholar] [CrossRef] [PubMed]
  63. Kumar, A.; Bachhawat, A.K. Pyroglutamic acid: Throwing light on a lightly studied metabolite. Curr. Sci. 2012, 102, 288–297. [Google Scholar]
  64. Fischer, W.H.; Spiess, J. Identification of a mammalian glutaminyl cyclase converting glutaminyl into pyroglutamyl peptides. Proc. Natl. Acad. Sci. USA 1987, 84, 3628–3632. [Google Scholar] [CrossRef] [Green Version]
  65. Abraham, G.N.; Podell, D.N. Pyroglutamic acid. Non-metabolic formation, function in proteins and peptides, and characteristics of the enzymes effecting its removal. Mol. Cell Biochem. 1981, 38, 181–190. [Google Scholar] [CrossRef] [PubMed]
  66. Haenni, A.-L.; Chapeville, F. The behaviour of acetylphenylalanyl soluble ribonucleic acid in polyphenylalanine synthesis. Biochim. Biophys. Acta BBA Nucleic Acids Protein Synth. 1966, 114, 135–148. [Google Scholar] [CrossRef]
  67. Singleton, M.; Isupov, M.; Littlechild, J. X-ray structure of pyrrolidone carboxyl peptidase from the hyperthermophilic archaeon Thermococcus litoralis. Structure 1999, 7, 237–244. [Google Scholar] [CrossRef] [Green Version]
  68. Feng, L.; Sheppard, K.; Tumbula-Hansen, D.; Söll, D. Gln-tRNAGln formation from Glu-tRNAGln requires cooperation of an asparaginase and a Glu-tRNAGln kinase. J. Biol. Chem. 2005, 280, 8150–8155. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Rampias, T.; Sheppard, K.; Söll, D. The archaeal transamidosome for RNA-dependent glutamine biosynthesis. Nucleic Acids Res. 2010, 38, 5774–5783. [Google Scholar] [CrossRef]
  70. Wilcox, M. Gamma-glutamyl phosphate attached to glutamine-specific tRNA. A precursor of glutaminyl-tRNA in Bacillus subtilis. Eur. J. Biochem. 1969, 11, 405–412. [Google Scholar] [CrossRef] [PubMed]
  71. Wilcox, M. Gamma-phosphoryl ester of glu-tRNA-GLN as an intermediate in Bacillus subtilis glutaminyl-tRNA synthesis. Cold Spring Harb. Symp. Quant. Biol. 1969, 34, 521–528. [Google Scholar] [CrossRef]
  72. Chen, Z.-X.; Zhong, H.; Yu, H.-T. Theoretical study of fragmentation pathways and product distribution of deprotonated aspartic acid. Comput. Theor. Chem. 2017, 1099, 1–7. [Google Scholar] [CrossRef]
  73. Su, H.-W.; Zhu, J.-H.; Li, H.; Cai, R.-J.; Ealand, C.; Wang, X.; Chen, Y.-X.; Kayani, M.U.R.; Zhu, T.F.; Moradigaravand, D.; et al. The essential mycobacterial amidotransferase GatCAB is a modulator of specific translational fidelity. Nat. Microbiol. 2016, 1, 16147. [Google Scholar] [CrossRef] [PubMed]
  74. Berlett, B.S.; Stadtman, E.R. Protein oxidation in aging, disease, and oxidative stress. J. Biol. Chem. 1997, 272, 20313–20316. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Connections between aaRS evolutionary events and amino acid substrates. Amino acid chemistry might explain idiosyncrasies in the evolution of AsnRS, GlnRS, CysRS, and LysRS. The central ring shows the amino acid substrates for these aaRSs. LysRS existed in two evolutionarily distinct forms (LysRS1 and LysRS2, green) in LUCA; CysRS is missing from some methanogenic archaea and might have been fixated post-LUCA (magenta); GlnRS and AsnRS clearly emerged post-LUCA and are still missing in many microorganisms. The outer circle shows structurally similar amino acids that were available metabolically but were rejected from the genetic code (ornithine and homocysteine) or entered the code post-LUCA (selenocysteine).
Figure 1. Connections between aaRS evolutionary events and amino acid substrates. Amino acid chemistry might explain idiosyncrasies in the evolution of AsnRS, GlnRS, CysRS, and LysRS. The central ring shows the amino acid substrates for these aaRSs. LysRS existed in two evolutionarily distinct forms (LysRS1 and LysRS2, green) in LUCA; CysRS is missing from some methanogenic archaea and might have been fixated post-LUCA (magenta); GlnRS and AsnRS clearly emerged post-LUCA and are still missing in many microorganisms. The outer circle shows structurally similar amino acids that were available metabolically but were rejected from the genetic code (ornithine and homocysteine) or entered the code post-LUCA (selenocysteine).
Genes 12 00409 g001
Figure 2. Cartoon schematic of early aaRS evolution. (A) A schematic summary of early aaRS evolution. Very early aaRS precursors are predicted to have aminoacylated tRNA micro- or mini- helices with substrate promiscuity. One minihelix would be recognized simultaneously by class I and class II precursors. Gene duplication (i) would have allowed these early aaRS precursors to achieve greater substrate specificity as they (ii) acquired additional domains like the anticodon-binding domains shown here to become the ancestors of modern aaRSs. Figure adapted from [8,16]. (B) List of aaRSs that existed at LUCA versus those that emerged post-LUCA. Enzymes discussed in this article are highlighted in red. CysRS might have existed at LUCA or emerged post-LUCA; this possibility is noted with a question mark.
Figure 2. Cartoon schematic of early aaRS evolution. (A) A schematic summary of early aaRS evolution. Very early aaRS precursors are predicted to have aminoacylated tRNA micro- or mini- helices with substrate promiscuity. One minihelix would be recognized simultaneously by class I and class II precursors. Gene duplication (i) would have allowed these early aaRS precursors to achieve greater substrate specificity as they (ii) acquired additional domains like the anticodon-binding domains shown here to become the ancestors of modern aaRSs. Figure adapted from [8,16]. (B) List of aaRSs that existed at LUCA versus those that emerged post-LUCA. Enzymes discussed in this article are highlighted in red. CysRS might have existed at LUCA or emerged post-LUCA; this possibility is noted with a question mark.
Genes 12 00409 g002
Figure 3. Reactions catalyzed by all known aaRSs. (A) Amino acid activation. (B) tRNA aminoacylation. Aminoacylation occurs on either the 2′ or the 3′ hydroxyl group of the 3′ ribose on the tRNA. For clarity, only a single hydroxyl group is shown.
Figure 3. Reactions catalyzed by all known aaRSs. (A) Amino acid activation. (B) tRNA aminoacylation. Aminoacylation occurs on either the 2′ or the 3′ hydroxyl group of the 3′ ribose on the tRNA. For clarity, only a single hydroxyl group is shown.
Genes 12 00409 g003
Figure 4. Cyclization reactions of Lys-AMP and Orn-AMP to the corresponding lactams. These intramolecular cyclization reactions would occur only when the ornithine and lysine side chains are deprotonated. Ring formation would further drive deprotonation. Formation of the six-membered ring Orn-lactam is favored ~500:1 over that of the seven-membered ring in the Lys-lactam [47,48].
Figure 4. Cyclization reactions of Lys-AMP and Orn-AMP to the corresponding lactams. These intramolecular cyclization reactions would occur only when the ornithine and lysine side chains are deprotonated. Ring formation would further drive deprotonation. Formation of the six-membered ring Orn-lactam is favored ~500:1 over that of the seven-membered ring in the Lys-lactam [47,48].
Genes 12 00409 g004
Figure 5. Cys-AMP and Hcy-AMP, but not Met-AMP, can degrade via cyclization. (A). When deprotonated, both Cys-AMP and Hcy-AMP can cyclize to their respective thiolactones. The five-membered ring formed by homocysteine is favored over the four-membered ring formed by cysteine [48]. (B) The methionine side chain is not nucleophilic under physiological conditions and so a similar cyclization reaction will not occur.
Figure 5. Cys-AMP and Hcy-AMP, but not Met-AMP, can degrade via cyclization. (A). When deprotonated, both Cys-AMP and Hcy-AMP can cyclize to their respective thiolactones. The five-membered ring formed by homocysteine is favored over the four-membered ring formed by cysteine [48]. (B) The methionine side chain is not nucleophilic under physiological conditions and so a similar cyclization reaction will not occur.
Genes 12 00409 g005
Figure 6. Spontaneous cyclization of glutamine to pyroglutamate. This reaction can occur spontaneously in solution (not shown) or following aminoacylation of tRNAGln (shown).
Figure 6. Spontaneous cyclization of glutamine to pyroglutamate. This reaction can occur spontaneously in solution (not shown) or following aminoacylation of tRNAGln (shown).
Genes 12 00409 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Hendrickson, T.L.; Wood, W.N.; Rathnayake, U.M. Did Amino Acid Side Chain Reactivity Dictate the Composition and Timing of Aminoacyl-tRNA Synthetase Evolution? Genes 2021, 12, 409. https://0-doi-org.brum.beds.ac.uk/10.3390/genes12030409

AMA Style

Hendrickson TL, Wood WN, Rathnayake UM. Did Amino Acid Side Chain Reactivity Dictate the Composition and Timing of Aminoacyl-tRNA Synthetase Evolution? Genes. 2021; 12(3):409. https://0-doi-org.brum.beds.ac.uk/10.3390/genes12030409

Chicago/Turabian Style

Hendrickson, Tamara L., Whitney N. Wood, and Udumbara M. Rathnayake. 2021. "Did Amino Acid Side Chain Reactivity Dictate the Composition and Timing of Aminoacyl-tRNA Synthetase Evolution?" Genes 12, no. 3: 409. https://0-doi-org.brum.beds.ac.uk/10.3390/genes12030409

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop