thumb|330px|A simplified schematic of the loading of the eukaryotic pre-replication complex
A pre-replication complex (pre-RC) is a protein complex that forms at the origin of replication during the initiation step of DNA replication. Formation of the pre-RC is required for DNA replication to occur. Complete and faithful replication of the genome ensures that each daughter cell will carry the same genetic information as the parent cell. Accordingly, formation of the pre-RC is a very important part of the cell cycle.
Components
As organisms evolved and became increasingly more complex, so did their pre-RCs. The following is a summary of the components of the pre-RC amongst the different domains of life.
In bacteria, the main component of the pre-RC is DnaA. The pre-RC is complete when DnaA occupies all of its binding sites within the bacterial origin of replication (oriC). The particular sites on the oriC that DnaA binds to determines if the cell has a bORC (bacterial Origin Recognition Complex) or a pre-RC.
The archaeal pre-RC is very different from the bacterial pre-RC and can serve as a simplified model of the eukaryotic pre-RC. It is composed of a single origin recognition complex (ORC) protein, Cdc6/ORC1, and a homohexamer of the minichromosome maintenance (MCM) protein. Sulfolobus islandicus also uses a Cdt1 homologue to recognize one of its replication origins.
The eukaryotic pre-RC is the most complex and highly regulated pre-RC. In most eukaryotes it is composed of six ORC proteins (ORC1-6), Cdc6, Cdt1, and a heterohexamer of the six MCM proteins (MCM2-7). The MCM heterohexamer arguably arose via MCM gene duplication events and subsequent divergent evolution. The pre-RC of Schizosaccharomyces pombe (S. pombe) is notably different from that of other eukaryotes; Cdc6 is replaced by the homologous Cdc18 protein. Sap1 is also included in the S. pombe pre-RC because it is required for Cdc18 binding. The pre-RC of Xenopus laevis (X. laevis) also has an additional protein, MCM9, which helps load the MCM heterohexamer onto the origin of replication.
Recognition of the origin of replication
Recognition of the origin of replication is a critical first step in the formation of the pre-RC. In different domains of life this process is accomplished differently.
In prokaryotes, origin recognition is accomplished by DnaA. DnaA binds tightly to a 9-base pair consensus sequence in oriC; 5' – TTATCCACA – 3'. There are 5 such 9-bp sequences (R1-R5) and 4 non-consensus sequences (I1-I4) within oriC that DnaA binds with differential affinity. DnaA binds R4, R1, and R2 with high affinity and R5, I1, I2, I3, and R3 with lesser affinity.
Archaea have 1–3 origins of replication. The origins are generally AT-rich tracts that vary based on the archaeal species. The singular archaeal ORC protein recognizes the AT-rich tracts and binds DNA in an ATP-dependent fashion.
Eukaryotes typically have multiple origins of replication; at least one per chromosome. Saccharomyces cerevisiae (S. cerevisiae) is the only known eukaryote with a defined initiation sequence TTTTTATG/ATTTA/T. This initiation sequence is recognized by ORC1-5. ORC6 is not known to bind DNA in S. cerevisiae. Initiation sequences in S. pombe and higher eukaryotes are not well defined. However, the initiation sequences are generally either AT-rich or exhibit bent or curved DNA topology. The ORC4 protein is known to bind the AT-rich portion of the origin of replication in S. pombe using AT hook motifs. The mechanism of origin recognition in higher eukaryotes is not well understood but it is thought that the ORC1-6 proteins depend on unusual DNA topology for binding.
The pre-RC of archaea requires ORC binding of the origin. After this, Cdc6 and the MCM homohexameric complex bind in a sequential fashion.
Eukaryotes have the most complex pre-RC. After ORC1-6 bind the origin of replication, Cdc6 is recruited. Cdc6 recruits the licensing factor Cdt1 and MCM2-7. Cdt1 binding and ATP hydrolysis by the ORC and Cdc6 load MCM2-7 onto DNA. There is a stoichiometric excess of the MCM proteins over the ORC and Cdc6 proteins, indicating that there may be multiple MCM heterohexamers bound to each origin of replication.
In S. cerevisiae, CDKs prevent formation of the replication complex during late G1, S, and G2 phases by excluding MCM2-7 and Cdt1 from the nucleus, targeting Cdc6 for degradation by the proteasome, and dissociating ORC1-6 from chromatin via phosphorylation. Known mutations are in the ORC1, ORC4, ORC6, CDT1, and CDC6 genes.
