Design
1. Background
1.1 Background of miniToe Family
The endoribonuclease Csy4 from CRISPR family is the main role of miniToe system. Csy4 (Cas6f) is a 21.4 kDa protein which recognizes and cleaves a specific 22nt RNA hairpin. In type I and type III CRISPR systems, the specific Cas6 endoribonuclease splits the pre-crRNAs in a sequence-specific way to generate 60-nucleotide (nt) crRNA products in which segments of the repeat sequence flank the spacer (to target "foreign" nucleic acid sequence) [1]. Inactivation of the Cas proteins leads to a total loss of the immune mechanism function.
The Csy4 protein consists of an N-terminal ferredoxin-like domain and a C-terminal domain. This C-terminal domain is responsible for pre-crRNA recognition and binding. The Pre-crRNA target site adopts a stem-loop structure (the specific 22nt RNA hairpin) with five base pairs in A-form helical stem capped by GUAUA loop containing a sheared G11-A15 base pair and a bulged nucleotide U14. In the binding structure of Csy4-RNA complex, the RNA stem-loop straddles the β-hairpin formed by strands β6-7 of Csy4[2]. And once the Csy4/RNA complex formed, the structure will stay stable and hard to separate.
Fig.1-1 The structure of Csy4 Fig.1-2 The structure 22nt hairpin which can be recognized by Csy4 Fig.1-3 The Csy4/RNA complex.
2. THE FIRST SYSTEM-MINITOE TOOL
2.1 NEW METHOD ——MINITOE
Based on function of Csy4, we design a new cis-regulatory RNA element named miniToe which can be recognized by Csy4 [3]. The whole system works as a translational activator including three modular parts:1. A cis-repressive RNA (crRNA) to serve as translation suppressor by pairing with RBS as the critical part of miniToe structure.
2. A Csy4 site as a linker between cis-repressive RNA and RBS, which can be specifically cleaved upon Csy4 function.
3. A CRISPR endoribonuclease Csy4.
Fig.2-1 The structure of miniToe.
In the project, the superfolder green fluorescent protein (sfGFP) is the reporter gene to reflect output of our system under miniToe regulation, the expression of this gene is driven by a constitutive promoter and the RBS near cis-repressive RNA coding region with miniToe design.
First, the stability of miniToe structure is crucial. Hence, before wet experiment, we predicted the structure of full-length transcript of this circuit as well as miniToe target region [4][5].
Fig.2-2 The structural prediction of the circuit and miniToe.
To verify the feasibility and function of miniToe, the following circuit was designed for testing the function of miniToe structure in our system. An inducible promoter P tac controls the expression level of Csy4. The cis-repressive RNA coding sequence is inserted at the upstream of reporter (sfGFP) gene,which controlled by a constitutive promoter form Anderson family named J23119.
Fig.2-3 The two plasmids of miniToe test system.
Fig.2-4 The mechanism of miniToe system.
Fig.2-5 The result of first system.
2.2 The characteristics of miniToe
1. The Csy4 protein and target RNA have high binding affinity [1]. It shows that the miniToe may control the state of expression like in a digital-like way (ON/OFF). When the switch is at OFF state, the downstream gene expression is tightly blocked, the reaction grows very slow in the beginning but accelerating rapidly once the Csy4 protein truncates the cis-repressive RNA element.
2. Compare to the small RNA based riboswitch [6], the insertion of hairpin provides Csy4 with a recognition and cleavage site so that the Csy4 may enhance the steric hindrance effect between cis-repressive RNA and RBS when we need to release the cis-repressive RNA for opening the downstream gene expression, which could promote translation activation.
3. Compared to the toehold switch [7], miniToe does not need to redesign cis-repressive RNA case by case because the cis-repressive RNA is not paired with protein coding region.
3. The second system-miniToe family
3.1 miniToe family——The model help us go further
After testing first system, miniToe, the dry lab member explore more deeply!
After building the ODE model, we use it to simulate the dynamics of sfGFP. Comparing with the experimental data, it fits perfectly, which indicates that the model is reliable about first system. By analyzing the sensitivity of the GFP level in the system to cleavage rate by model, it is not difficult to predict that the cleavage rate has an influence in the expression of sfGFP. It means we may change the expression level of sfGFP if we employ different mutants of Csy4 proteins.
Fig.3-1 The ODE model for the first system.
Fig.3-2 The model about sensitivity analysis of the sfGFP.
3.2 The principles of designing mutants
For Csy4 mutants
1. Some key sites in the Csy4 are really crucial for keeping the stable of structure and maintain the functions of recognition and cleavage. Mutatios on those sites may result in serious influence on our system. By point mutation, we hope to get a library of mutants, which could provide several Csy4 mutant candidates with recognition and cleavage rates shows as a "ladder".
For hairpin mutants
2. We need to avoid breaking the recognition site for keep the function of cleavage as relative stable. If we break the key site G20, it may lead to damage of cleavage function.
3. The stability of secondary structure is vital, so we need to focus on each hairpin's Gibbs free energy during design.
4. The aim is to obtain different hairpins which have various activity for Csy4 recognition and RNA cleavage.
3.3 Models help us deeply!
Here a model helps to design the mutants of Csy4 and hairpin. According to the principles of design we mentioned above, four key problems is important in the miniToe model design:
1.Does the Csy4 dock correctly with the miniToe structure?
2.How about the binding ability between the Csy4 and miniToe structure?
3.How about the ability of cleavage between the Csy4 and miniToe structure?
4.Does cis-repressive release from the RBS?
For the Csy4 mutants, the molecular dynamics method was used as our tools. Based on Jiří Šponer’s work [8], we chose four significant symbols of mutants: the interaction matrix; the binding free energy, the distance of Ser151(OG)-G20(N2') and the RMSD of the cleaved-product complex. By comparing the difference between various Csy4 mutants with wild-type Csy4 by above four significant symbols, we finally design four Csy4 mutants: Csy4-Q104A, Csy4-Y176F, Csy4-F155A, and Csy4-H29A based on model prediction.
For the hairpin mutants, the bioinformatics and machine learning become our tools. By exploring the commonality of hairpin in the following step:
1. Firstly, we found the hairpin which is also Repeat Area [9] in CRISPR system and scoring them. 2. Secondly, with the help of the SVM algorithm[10], training a model to score the hairpin. 3. Finally, by using the model, five hairpins are selected: miniToe-1, miniToe-2, miniToe-3, miniToe-4, miniToe-5.
Click here for more details about model!
Fig.3-3 We have selected five Csy4s and six hairpins.
With the help of our model, 4 Csy4 mutants and 5 hairpin mutants are selected. We tested each mutant and got positive data supporting model prediction. Then we set up a function verification experiment with 5*6 combinations of Csy4 and hairpin including wild types. By testing all of them, 10 members work successfully as expectation. So, our second system, miniToe family, has 10 combinations which have been designed and selected.
Fig.3-4 The chart shows 30 combinations in experiments
4. The third system-miniToe polycistron
4.1 Why we do
Many applications of synthetic biology need to regulate the expression profile of multiple genes. Microorganisms with programmable and engineered metabolic pathways are employed as a reaction vessel to natural or unnatural products. It involves the introduction of several genes encoding the enzymes of a metabolic pathway [11][12]. Indeed, pathway optimization requires to adjust the expression of multiple genes at appropriately pattern, for example, the synthetic of poly-3-hydroxybutyrate and Mevalonate [13].
As has been done in the prokaryotes, grouping a cluster of genes into a single polycistron is a convenient method for regulating genes simultaneously. Thus, for the sake of tuning the expression pattern of genes within polycistron, we hope to develop a powerful regulation tool by the miniToe system. We name this system miniToe polycistron which contains several genes in one circuit with different miniToe design. Our aim for this part is achieving different expression profile of the genes by miniToe in polycistrons compared with normal polycistrons.
4.2 How we do
Fig.4-1 The mechanisms of miniToe polycistron
1. First, sfGFP and mCherry is used as a test system in bi-cistron circuit.
2. Then we selected some miniToe parts and inserted them between, before and behind sfGFP and mCherry. For example, for bi-cistron, then three miniToe parts will be inserted. for three genes in polycistron, then four miniToe parts will be inserted, and so on. The reasons why we design like this are below:
1). Without Csy4, the existence of miniToe structure in polycistron will inhibit the gene expression when we don’t want to open the switch. The cis-repressive RNA in miniToe has complementary sequence of adjacent RNA region (RBS) to prevent the binding of ribosomes.
2) Also, the RNA secondary structure of miniToe with Csy4 binding keeping at 3’ UTR after Csy4 cleavage is a protection mechanism to prevent RNA degradation. And this function is based on the high stability and affinity between Csy4 and target RNA structure [14].
3.)The function of each miniToe has specific recognition and cleavage rates, which will make it possible to regulate the gene flexibly.
All in all, miniToe polycistron system has two components, Csy4 and the circuit of polycistron. With Csy4 protein, the polycistron will be cut into several mRNA chains with RNA/Csy4 complex at the 3’ UTR. The capability of RNA degradation protection will be much stronger, because of the high stability and affinity of Csy4 binding, which increase energy threshold for RNA degradation from 3’ UTR. So, the RNA degradation rate will be much lower. For the 5’ end degradation, the Csy4 cut will leave a OH- at 5’ end. the cleavage capability of RNase E will be much lower because there is no pyrophosphate in the 5’ end. Qi’s work [14] has proved that OH-mRNAs exhibit higher gene expression than 5’ PPP-mRNAs.
It is worth mentioning that, our work this year is an improvement based on 2016 OUC-China. The comparison is in demonstrate page, the chapter 5. Click here to see more!
5. miniToe based Motility detection system
5.1 THE APPLICATION OF MINITOE——REGULATION on MOTA
As and translation regulation tool, MiniToe can also be used in application scenario of molecular mechanism research. Sometimes scientists may puzzle with the functions of certain gene or protein without in-depth study. One general method to study them is knock-out or knock-in methods. In this way, organisms show some phenotypic change related to particular gene. However, if we want to know better about the functions of the gene, we may need more tool to change gene expression at different levels.
This year, we use miniToe system to control the motions of Escherichia coli. We transformed the miniToe system into E. coli whose motility is regulated by the motor protein, MotA. MotA provides a channel for the proton gradient required for generation of torque [15]. ΔMotA strains (the motA-deletion strain) can synthesize flagella without function, because they are unable to generate the torque required for flagellar rotation. Expression of motA from a plasmid has been shown to restore motility in ΔmotA strains. Our miniToe system is expected to control MotA protein expression with different levels. We got a strain without motA by gene knock-out and then transform motA gene under the control of miniToe family. The E. coli may restore motility under our control and exhibit the application value and potential of our miniToe tool.
5.2 How we do
E. coli RP6666 (knocked out motA)lacks capacity of motion. To use miniToe system for translation control on motA, we constructed a circuit and put motA gene at the downstream of miniToe part, then transformed the plasmid into E. coli RP6666. By inducing the expression of Csy4, motA could be controlled indirectly, thus making E. coli RP6666 strain regain the capacity of motion.
Fig.5-1 The process of motility detection system
6. In the future
In conclusion, we have demonstrated the design of modular translational activators with CRISPR endoribonuclease Csy4 named miniTo and design four systems which are improved step by step.
In the future, there are some works to perfect project. First, we would like to enlarge project by finding more and more mutants. By finding and designing more mutants a larger library may be created which can enlarge the function of toolkit. Second, we have tested the ratio of regulation in miniToe polycistron. In the future, man-made setting can be used to make calibration curve which will help us to know project deeply.
REFERENCE
[1].Przybilski R, Richter C, Gristwood T, et al. Csy4 is responsible for CRISPR RNA processing in Pectobacterium atrosepticum.[J]. Rna Biology, 2011, 8(3):517-528.[2].Haurwitz R E, Jinek M, Wiedenheft B, et al. Sequence- and structure-specific RNA processing by a CRISPR endonuclease[J]. Science, 2010, 329(5997):1355-1358.
[3].Du P, Miao C, Lou Q, et al. Engineering Translational Activators with CRISPR-Cas System[J]. Acs Synthetic Biology, 2016, 5(1):74.
[4].Hofacker I L. Vienna RNA secondary structure server[J]. Nucleic Acids Research, 2003, 31(13):3429-3431.
[5].M. Zuker. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31 (13), 3406-3415, 2003.
[6].Mandal M, Breaker R R. Gene regulation by riboswitches.[J]. Nature Reviews Molecular Cell Biology, 2004, 5(6):451-63.
[7].Green A, Silver P, Collins J, et al. Toehold switches: de-novo-designed regulators of gene expression.[J]. Cell, 2014, 159(4):925-939.
[8].Estarellas C, Otyepka M, Koča J, et al. Molecular dynamic simulations of protein/RNA complexes: CRISPR/Csy4 endoribonuclease.[J]. Biochimica Et Biophysica Acta, 2015, 1850(5):1072-1090.
[9].Edgar R C. PILER-CR: Fast and accurate identification of CRISPR repeats[J]. Bmc Bioinformatics, 2007, 8(1):1-6.
[10].Schölkopf B, Tsuda K, Vert J P. Support Vector Machine Applications in Computational Biology[C]// MIT Press, 2004:71-92.
[11].Pfleger, B.F., et al., Combinatorial engineering of intergenic regions in operons tunes expression of multiple genes. Nat Biotechnol, 2006. 24(8): p. 1027-32.
[12].Xu, C., et al., Cellulosome stoichiometry in Clostridium cellulolyticum is regulated by selective RNA processing and stabilization. Nat Commun, 2015. 6: p. 6900.
[13].Smolke, C.D. and J.D. Keasling, Effect of gene location, mRNA secondary structures, and RNase sites on expression of two genes in an engineered operon. Biotechnol Bioeng, 2002. 80(7): p. 762-76.
[14].Qi L, Haurwitz R E, Shao W, et al. RNA processing enables predictable programming of gene expression[J]. Nature Biotechnology, 2012, 30(10):1002.
[15].Ravichandar J D, Bower A G, Julius A A, et al. Transcriptional control of motility enables directional movement ofEscherichia coliin a signal gradient[J]. Scientific Reports, 2017, 7(1).
Contact Us : oucigem@163.com | ©2018 OUC IGEM.All Rights Reserved. | …………