thumb|200px|right|The flow of information within a cell. DNA is first transcribed into RNA, which is subsequently translated into protein. (See [[Central dogma of molecular biology.)]]

thumb|400px|mRNA structure, approximately to scale for a human mRNA, where the median length of 3′UTR is 700 nucleotides

In molecular genetics, the three prime untranslated region (3′UTR) is the section of messenger RNA (mRNA) that immediately follows the translation termination codon. The 3′UTR often contains regulatory regions that post-transcriptionally influence gene expression.

During gene expression, an mRNA molecule is transcribed from the DNA sequence and is later translated into a protein. Several regions of the mRNA molecule are not translated into a protein including the 5′ cap, 5′ untranslated region, 3′ untranslated region and poly(A) tail. Regulatory regions within the 3′ untranslated region can influence polyadenylation, translation efficiency, localization, and stability of the mRNA. The 3′UTR contains binding sites for both regulatory proteins and microRNAs (miRNAs). By binding to specific sites within the 3′UTR, miRNAs can decrease gene expression of various mRNAs by either inhibiting translation or directly causing degradation of the transcript. The 3′UTR also has silencer regions which bind to repressor proteins and will inhibit the expression of the mRNA.

Many 3′UTRs also contain AU-rich elements (AREs). Proteins bind AREs to affect the stability or decay rate of transcripts in a localized manner or affect translation initiation. Furthermore, the 3′UTR contains the sequence AAUAAA that directs addition of several hundred adenine residues called the poly(A) tail to the end of the mRNA transcript. Poly(A) binding protein (PABP) binds to this tail, contributing to regulation of mRNA translation, stability, and export. For example, poly(A) tail bound PABP interacts with proteins associated with the 5′ end of the transcript, causing a circularization of the mRNA that promotes translation.

The 3′UTR can also contain sequences that attract proteins to associate the mRNA with the cytoskeleton, transport it to or from the cell nucleus, or perform other types of localization. In addition to sequences within the 3′UTR, the physical characteristics of the region, including its length and secondary structure, contribute to translation regulation. These diverse mechanisms of gene regulation ensure that the correct genes are expressed in the correct cells at the appropriate times.

Physical characteristics

The 3′UTR of mRNA has a great variety of regulatory functions that are controlled by the physical characteristics of the region. One such characteristic is the length of the 3′UTR, which in the mammalian genome has considerable variation. This region of the mRNA transcript can range from 60 nucleotides to about 4000. On average the length for the 3′UTR in humans is approximately 800 nucleotides, while the average length of 5′UTRs is only about 200 nucleotides.

Sequences within the 3′UTR also have the ability to degrade or stabilize the mRNA transcript. Modifications that control a transcript's stability allow expression of a gene to be rapidly controlled without altering translation rates. One group of elements in the 3′UTR that can help destabilize an mRNA transcript are the AU-rich elements (AREs). These elements range in size from 50 to 150 base pairs and generally contain multiple copies of the pentanucleotide AUUUA. Early studies indicated that AREs can vary in sequence and fall into three main classes that differ in the number and arrangement of motifs. Human transcripts possess 3′UTRs that are on average twice as long as other mammalian 3′UTRs. This trend reflects the high level of complexity involved in human gene regulation. In addition to length, the secondary structure of the 3′ untranslated region also has regulatory functions. Protein factors can either aid or disrupt folding of the region into various secondary structures. The most common structure is a stem-loop, which provides a scaffold for RNA binding proteins and non-coding RNAs that influence expression of the transcript.

Methods of study

Scientists use a number of methods to study the complex structures and functions of the 3′UTR. Even if a given 3′UTR in an mRNA is shown to be present in a tissue, the effects of localization, functional half-life, translational efficiency, and trans-acting elements must be determined to understand the 3′UTR's full functionality. Computational approaches, primarily by sequence analysis, have shown the existence of AREs in approximately 5 to 8% of human 3′UTRs and the presence of one or more miRNA targets in as many as 60% or more of human 3′UTRs. Software can rapidly compare millions of sequences at once to find similarities between various 3′UTRs within the genome. Experimental approaches have been used to define sequences that associate with specific RNA-binding proteins; specifically, recent improvements in sequencing and cross-linking techniques have enabled fine mapping of protein binding sites within the transcript. Induced site-specific mutations, for example those that affect the termination codon, polyadenylation signal, or secondary structure of the 3′UTR, can show how mutated regions can cause translation deregulation and disease. These types of transcript-wide methods should help our understanding of known cis elements and trans-regulatory factors within 3′UTRs.

Disease

thumb|330px|Diseases caused by different mutations within the 3′UTR

3′UTR mutations can be very consequential because one alteration can be responsible for the altered expression of many genes. Transcriptionally, a mutation may affect only the allele and genes that are physically linked. However, since 3′UTR binding proteins also function in the processing and nuclear export of mRNA, a mutation can also affect other unrelated genes. An expanded number of trinucleotide (CTG) repeats in the 3′UTR of the dystrophia myotonica protein kinase (DMPK) gene causes myotonic dystrophy.