Pyrrolysine (symbol Pyl or O), encoded by the "amber" stop codon UAG, is a proteinogenic amino acid that is used in some methanogenic archaea and in bacteria. It consists of lysine with a 4-methylpyrroline-5-carboxylate in amide linkage with the <sup>ε</sup>N of the lysine. Its pyrroline side-chain is similar to that of lysine in being basic and positively charged at neutral pH.
Genetics
Nearly all genes are translated using only 20 standard amino acid building blocks. Two unusual genetically encoded amino acids are selenocysteine and pyrrolysine. Pyrrolysine was discovered in 2002 at the active site of methyltransferase enzyme from a methane-producing archeon, Methanosarcina barkeri.
Catalytic function
The extra pyrroline ring is incorporated into the active site of several methyltransferases, where it is believed to rotate relatively freely. It is believed that the ring is involved in positioning and displaying the methyl group of methylamine for attack by a corrinoid cofactor. The proposed model is that a nearby carboxylic acid bearing residue, glutamate, becomes protonated, and the proton can then be transferred to the imine ring nitrogen, exposing the adjacent ring carbon to nucleophilic addition by methylamine. The positively charged nitrogen created by this interaction may then interact with the deprotonated glutamate, causing a shift in ring orientation and exposing the methyl group derived from the methylamine to the binding cleft where it can interact with corrinoid. In this way a net is transferred to the cofactor's cobalt atom with a change of oxidation state from +1 to +3. The methylamine-derived ammonia is then released, restoring the original imine.
Use in bioengineering
The tRNA-aaRS pair for pyrrolysine ("orthogonal pair") is independent of other synthetases and tRNAs in most organisms including Escherichia coli, and further possesses some flexibility in the range of amino acids processed (the aaRS accepts some different molecules that are structurally similar to pyrrolysine), making it an attractive tool to allow the placement of a possibly wide range of functional chemical groups at arbitrarily specified locations in modified proteins. For example, the system provided one of two fluorophores incorporated site-specifically within calmodulin to allow the real-time examination of changes within the protein by FRET spectroscopy, and site-specific introduction of a photocaged lysine derivative. (See Expanded genetic code)
The recognition of a tRNA by an aaRS is by its acceptor stem sequence. The pyrrolysine tRNA can be modified to have an acceptor stem of another tRNA, allowing a different aaRS to act on it. In 2024, it was reported a version modified to accept alanine can effectively suppress premature termination codons (all three of them) in human cell lines.
Evolution
The pylT (tRNA) and pylS (aa-tRNA synthase) genes are part of an operon of Methanosarcina barkeri, with homologues in other sequenced members of the Methanosarcinaceae family: M. acetivorans, M. mazei, and M. thermophila. Pyrrolysine-containing proteins are known to include monomethylamine methyltransferase (mtmB), dimethylamine methyltransferase (mtbB), and trimethylamine methyltransferase (mttB). Homologs of pylS and pylT have also been found in an Antarctic archaeon, Methanosarcina barkeri and a Gram-positive bacterium, Desulfitobacterium hafniense. The other genes of the Pyl operon mediate pyrrolysine biosynthesis, leading to description of the operon as a "natural genetic code expansion cassette".
A number of evolutionary scenarios have been proposed for the pyrrolysine system. The current (2022) view, given available sequences for tRNA and Pyl-tRNA (PylRS) synthase genes, is that:
- tRNA(Pyl) diverged from tRNA(Phe) some time between the divergence of the three domains (~LUCA) and the divergence of archaeal phyla, but was lost in non-archaeal lineages; It is now known that a wide range of prokaryotes have these two genes. It is now known that some non-methanogens also have these two genes, but the dating was not too far off. This is probably true based on the 2022 study, though the paper originally assumed a link to methanogenesis.
MttB is known to have several relatives without pyrrolysine in the MttB superfamily, all with high sequence similarity. Among all structures in the PDB as of 2023, the most similar (by structure and sequence identity) to MttB are the glycine betaine methyltransferase MtgB and a protein of unknown function 4YYC. MtmB is the third most similar structurally (among all PDB structures as of 2023) with a great decrease in similarity.
There has been no report of any member of the MtbB superfamily without pyrrolysine in literature as of 2025.
Non-methyltransferase function
The tRNA<sup>His</sup> guanylyltransferase gene Thg1 from the archaeon Methanosarcina acetivorans has a pyrrolysine residue. The gene works as usual if the pyrrolysine is substituted for other amino acids. In this case, the presence of Pyl results from simple neutral evolution. The insertion of Pyl into a protein requires no special signal in the mRNA, only a UAG codon, so it stands to reason that the barrier for changing a residue into Pyl in a protein sequence is the same as any other amino-acid substitution.
The bacterium Acetohalobium arabaticum has a serine dehydratase with two Pyl residues in addition to the regular methyltransferases. It also only expresses the Pyl machinery when trimethylamine is present. This conditional expression causes the bacterium to only produce a functional version of the serine dehydratase, the methyltransferases, and potentially the rest of its many proteins coded by a gene (about 20% of all its ORFs) with an in-frame TAG when trimethylamine is present. Although the incorporation of Pyl in those non-methyltransferase genes probably have no particular catalytic function, it has been adapted into a new way to regulate the production of protein products.
The prevalence of ORFs with a in-frame TAG is much smaller in archaeons, as expected for their "always-on" (constituent) expression of the Pyl machinery, at about 5%.
Potential for an alternative translation
The tRNA<sub>CUA</sub> can be charged with lysine in vitro by the concerted action of the M. barkeri Class I and Class II lysyl-tRNA synthetases (LysRS1 and LysRS2), neither of which recognizes pyrrolysine. Charging a tRNA<sub>CUA</sub> with lysine was originally hypothesized to be the first step in translating UAG amber codons as pyrrolysine, a mechanism analogous to that used for selenocysteine. More recent data favor direct charging of pyrrolysine on to the tRNA<sub>CUA</sub> by the protein product of the pylS gene, leading to the suggestion that the LysRS1:LysRS2 complex may participate in a parallel pathway designed to ensure that proteins containing the UAG codon can be fully translated using lysine as a substitute amino acid in the event of pyrrolysine deficiency. Further study found that the genes encoding LysRS1 and LysRS2 are not required for normal growth on methanol and methylamines with normal methyltransferase levels, and they cannot replace pylS in a recombinant system for UAG amber stop codon suppression.
