Line 1,264: | Line 1,264: | ||
− | <br /><br /><br /><br /> | + | <br /><br /><br /><br /><br /><br /> |
Revision as of 02:34, 18 October 2018
Overview
The aim of our project is to develop a better post-transcriptional regulation strategy and use it in monocistron and polycistron. Here we built models to design and predict our work.
miniToe —— a better transcriptional regulate strategy
To achieve a better post-transcriptional regulation strategy, we designed a system which is composed of an RNA endoribonuclease (Csy4) and an RNA module named miniToe. We modeled to describe the dynamics of the miniToe system and found a way to achieve different regulation level. The ODEs and molecular dynamics were two main tools to explore it. We used the ODEs to describe the reaction curve and the molecular dynamics in order to give some explanations to experimental data.
Below you can follow the several questions we point out to have a better understanding of model work and the miniToe system. We will discuss some structures of Csy4 in different stages (Q1), some structures of miniToe system in different stages (Q2), the reaction order and some key points of miniToe system (Q3), the simulation of ODEs model (Q4), some significant symbols in molecular dynamics (Q5) and the ways to different regulation levels (Q6).
Q1: What is the structure of Csy4?
Fig.1 The structure of Csy4 without the hairpin bound. (PDB ID: 4AL5, resolution 2.0 A)
Fig.2 The structure of Csy4 with the hairpin bound. (PDB ID: 4AL5, resolution 2.0 A)
Q2: What is the structure of miniToe?
1. A cis-repressive RNA (crRNA) serves as a translation suppressor by pairing with RBS and therefore constitutes the critical part of the miniToe structure.
2. A Csy4 site serves as a linker between cis-repressive RNA and RBS, which can be specifically cleaved by Csy4 enzyme.
3. Csy4 enzyme --- A CRISPR endoribonuclease.
Fig.3 The structure of miniToe.
Fig.4 The precursor complex of wild-type Csy4.
Fig.5 The product complex of wild-type Csy4.
Q3: What is the reaction process and key points of miniToe system?
Fig.6 The working process of miniToe system.
(1) The miniToe is produced and accumulated.
(2) Csy4 is produced after induced by IPTG.
(3) Csy4 binds to the miniToe structure and forms the Csy4-miniToe complex.
(4) Csy4 cleaves the special site and divides the miniToe structure into two parts: the Csy4-crRNA complex and the mRNA of sfGFP.
(5) sfGFP is produced.
From the description above, we can get four key problems in our system to make sure whether our system can work successfully:
(1) Can Csy4 dock correctly with the miniToe structure (hairpin)?
(2) How about the ability of binding between the Csy4 and miniToe structure (hairpin)?
(3) How about the ability of cleavage between the Csy4 and miniToe structure (hairpin)?
(4) Can cis-repressive RNA be released from the RBS successfully?
Q4: How about the simulation results of the ODEs model?
According to the work process, we built an ODEs model and simulated our miniToe system for 30h, the result can be seen in the Fig.7.
Fig.7 The dynamics of sfGFP by model prediction
Fig.8 The comparison between experimental data and simulation data.
Q5: How about simulation result of the molecular dynamics?
For the first key point, we have the interaction matrix to describe the molecular docking, and the heatmap of the matrix can be seen in Fig.9.
Fig.9 The heatmap of interaction matrix for wild-type Csy4.
For the second key point, we calculated the binding free Energy of Csy4/RNA complex. The result of binding free energy for wild-type Csy4 is .
For the third key point, we checked the distance of Ser151(OG)-G20(N2’), which is a key interaction in the active site of Csy4 to describe the ability of cleavage. The distance curve of Ser151(OG)-G20(N2’) for wild-type Csy4 can be seem in Fig.10.
Fig.10. The distance of Ser151(OG)-G20(N2’) in wild-type Csy4.
For the last key point, we used the RMSD of product to describe the release of crRNA. The result can be seen in the Fig.11.
Fig.11. The distance of Ser151(OG)-G20(N2') in wild-type Csy4
To see more details
Q6: How to achieve the goal of different regulate level?
Fig.12 The curve of sfGFP with the changing cleavage rate.
miniToe Family —— The way to fifferent regulate level
In the miniToe family, the protein and hairpin were mutated to meet the goal of the different regulation level. In this part, the model can help us design mutants. Importantly, we used different strategies to design the feature of Cys4 and the hairpin. For example, molecular dynamics played an important role in designing protein mutants, and the bioinformatics and machine learning supported us to find the hairpin mutants of our interest.
So here comes the another four key questions: how to design our Csy4 mutants (Q7), how does the method work in design and the result (Q8), the problem different from Csy4 designing when design the hairpin mutants and how to solve it (Q9) and the results of the mutants designing (10).
Q7: How to design the Csy4 mutants?
click to see more
click to hide
We found four important sites in wet lab, Gln104, Tyr176, Phe155 and His29, which play important roles in binding and cleavage in Csy4 structure.(Fig.13) Considering the existing 20 amino acids in nature, there were 80 mutants to be explored if we had only one site to be mutated.
Fig.13 The four important sites in Csy4.
In Q3, we have discussed four key points which can directly influence our miniToe system. In addition, according to the molecular dynamics results in Q5, we can describe the four key points through four significant symbols.
Now we are going to construct a logic line to show how to use the three main information above to design Csy4 mutants:
What we have proved through the experiment is that the wild-type Csy4 can work well with the miniToe system, which means that all the key points we have discussed before didn't affect the wild-type Csy4. The wild-type Csy4 can dock correctly with the miniToe structure and had a good ability to bind and cleave the miniToe structure. Finally, the crRNA can be released from the RBS. So we choose the wild-type Csy4 as a standard, and all Csy4 mutants can check the four points by comparing with wild-type Csy4.
Now for the four points in Q3 we have discussed the mathematical forms in Q5. And the most important thing is how to make a comparison between the mutant and wild-type Csy4 enzymes, which will be discussed in Q8.
Q8: How does the design methods work?
click to see more
click to hide
In Q7, we have discussed the full logic lines about how to design the Csy4 mutants. Here we will give the comparison method for the four key points in miniToe system. And we did this comparison between the mutant and wild-type Csy4 enzymes.
Now we have four mathematical forms including two curves, a numerical value, and a matrix. Four things can be divided into two kinds of data: the matrix and the numerical value. The interaction matrix and the curve can be regarded as a matrix because the curve is discrete, and the binding free energy is a numerical value.
We used Euclidean distance to describe the difference between the two matric:
We used the formula below to calculate the difference of binding free energy between the wild type and mutants:
According to description above, we defined four value to compare the four key points between the mutant and wild-type Cys4 enzymes:
,
,
,
.
By using the four values, five Csy4 mutants were designed and shown in table below.
Csy4
WT
0
0
0
0
Q104A
0.483
2483
9.48
30.82
Y176F
0.592
-382
11.61
40.62
F155A
0.233
-1627
13.41
35.71
H29A
0.173
833
15.29
316.22
To see more details
Q9: How to design the hairpin mutant?
click to see more
click to hide
The design of hairpin mutants is quite different from the Csy4 mutants due to the large library. In theory, except for the two cleaved sites, G20 and C21, we can generate 420 mutants totally.
Combining the bioinformatics and machine learning, we presented an algorithm to pre-processing our big mutation library. The flow chart of the pre-processing algorithm is shown below.
Fig.14 The flow chart of the pre-processing algorithm
The SVM model was training well and the results were shown below.
Fig.15 The training result
After training the SVM model, we used it to evaluate the hairpin mutants. We selected the hairpin mutants with high ranks to check the four key points. Finally, we determined the five hairpin mutants below. The following chart shows the DR-Score which is the evaluated results from the SVM model.
Hairpin-Mutant
miniToe1
76.6306
miniToe2
65.6278
miniToe3
66.7160
miniToe4
62.5537
miniToe5
52.9794
To see more details
Q10: How about the mutants designing results?
click to see more
click to hide
After designing the protein mutants and hairpin mutants, we tested them in the wet lab.
Fig.16 The experimental results of mutants.
And we try to gave a comparison between the special value we used before for evaluating the mutants and experimental results to check our model.
For the protein mutants, we gave a comparison between D3 and experimental results.
Fig.17 The comparison between model and experimental results for different protein mutants.
As we can see in the Fig.17, we can find the inner relationship between D3 and experimental results: the D3 value describes the difference in the ability of cleavage between the wild-type and mutants. A higher D3 value means a weaker cleavage ability.
For the hairpin mutants, we gave a comparison between DR-Score and experimental results.(Fig.18)
Fig.18 The comparison between model and experimental results for hairpin mutants.
We can find the inner relationship between DR-Score and experimental results except miniToe 1. It is reasonable because the machine learning is quite sensitive to the data amounts and the R2 is not 1 in our training result of SVM model.
Importantly, we tested 30 combinations of Csy4 enzymes and hairpins in wet lab.(Fig.19)
Fig.19 The heatmap results of our 30 combinations
miniToe polycistron —— Not just the monocistron
In our miniToe polycistron system, we build a coupled transcription-translation model considering several events in prokaryotes to get a deep understanding of polycistron. Then we simplify this model into a more flexible model to predict how the miniToe structure changes the relative expression level in polycistron.
Here I want to discuss the five problems to understand the model work: What does the coupled transcription-translation model tell us? (Q11) How about the flexible model? (Q12) How does the miniToe structure affect the ratio of expression level of cistrons in polycistron? (Q13) What is the main role of cleavage rate in miniToe polycistron system? (Q14) The proof of miniToe structure in polycistron.(Q15)
Q11: What does the coupled transcription-translation model tell us?
click to see more
click to hide
We have consturcted an coupled transcription-translation model to simulaiton the mRNA distribution in the origin time and final time of polycistron system. The Fig.20 and Fig.21 is the mRNA distribution in t=100s and t = 600s.
Fig.20 The heatmap results of our 30 combinations
Fig.21 The mRNA distribution in t=600s
The difference of two picture give us two information:
1.The different cistrons in diffrernt positions have different mRNA abundance.
2. The different cistrons in diffrernt positions have different translational time.
To see more details
Q12: How about the flexibel model we simplify?
click to see more
click to hide
We have simplify the coupled transcription-translation model into a more flexible model to describe the dynamics of polycistron. This model is including four mian part, take an bicistron as an example:
(1)The transcription of two CDSs region:
Here we divided the polycistron into two part with different transcription paraments
,
to deal with the problem of different mRNA abundance due to the premature termination. The two paraments
,
is totally sequence-dependent.
(2)The degradation of mRNA
The degradation of RNA also divided into two parts with different transcription paraments
,
to deal with the problem of different translational time for two mRNA. Each
can be divided into two part:
The
denotes the recoup item for the translational time difference and the
denotes the common degradation rate of mRNA.
(3)The translation of protein.
In the translation of two proteins, the two paraments used to describe the translation also should be different considering the translation coupling.
(4)The degradation of proteins
We don’t have too much discussion in the proteins degradation here.
This model is the simple forms of the coupled transcription-translation model, it keeps the easy form but also reflect the common phenomenon which will happen in the transcript and translation of polycistron including transcript polarity and translation coupling.
To see more details
Q13: How does the miniToe structure affect the ratio of expression level of cistrons in polycistron?
click to see more
click to hide
Our miniToe structure can be used to protect the 5' end and 3' end of mRNA, which can be used to control the half-life of mRNA to control the ratio of expression level for cistrons in polycistron. As the Fig.22 show, the Csy4/RNA complex can protect the 5' end of mRNA. By choosing the different hairpin which has the different binding capability, the mRNA will have the different half-life.
Fig.22 The mechanisms of miniToe polycistron
Q14: What is the main role of cleavage rate in miniToe polycistron system?
click to see more
click to hide
Using the model constructed by OUC-China 2016, we explore the relationship between the cleavage rate and the ratio of two proteins in stable level. Fig.23 is the result of it.
Fig.23 the relationship between the cleavage rate and the ratio of two proteins in stable level
From the Fig.23 we can find that the cleavage rate in our miniToe polycistron plays a role in changing the shape of product curve which have been proved in the first system, while having little effect in the ratio of two proteins in the stable level.
To see more details
Q15: How to prove the effect of our miniToe polycistron in the ?
click to see more
click to hide
Using the model constructed by OUC-China 2016, we explore the relationship between the cleavage rate and the ratio of two proteins in stable level. Fig.23 is the result of it.
Fig.23 the relationship between the cleavage rate and the ratio of two proteins in stable level
From the Fig.23 we can find that the cleavage rate in our miniToe polycistron plays a role in changing the shape of product curve which have been proved in the first system, while having little effect in the ratio of two proteins in the stable level.
Reference
Reference for miniToe Family
Reference for miniToe Polycistron
We found four important sites in wet lab, Gln104, Tyr176, Phe155 and His29, which play important roles in binding and cleavage in Csy4 structure.(Fig.13) Considering the existing 20 amino acids in nature, there were 80 mutants to be explored if we had only one site to be mutated.
Fig.13 The four important sites in Csy4.
In Q3, we have discussed four key points which can directly influence our miniToe system. In addition, according to the molecular dynamics results in Q5, we can describe the four key points through four significant symbols.
Now we are going to construct a logic line to show how to use the three main information above to design Csy4 mutants:
What we have proved through the experiment is that the wild-type Csy4 can work well with the miniToe system, which means that all the key points we have discussed before didn't affect the wild-type Csy4. The wild-type Csy4 can dock correctly with the miniToe structure and had a good ability to bind and cleave the miniToe structure. Finally, the crRNA can be released from the RBS. So we choose the wild-type Csy4 as a standard, and all Csy4 mutants can check the four points by comparing with wild-type Csy4.
Now for the four points in Q3 we have discussed the mathematical forms in Q5. And the most important thing is how to make a comparison between the mutant and wild-type Csy4 enzymes, which will be discussed in Q8.
Now we have four mathematical forms including two curves, a numerical value, and a matrix. Four things can be divided into two kinds of data: the matrix and the numerical value. The interaction matrix and the curve can be regarded as a matrix because the curve is discrete, and the binding free energy is a numerical value.
We used Euclidean distance to describe the difference between the two matric:
We used the formula below to calculate the difference of binding free energy between the wild type and mutants:
According to description above, we defined four value to compare the four key points between the mutant and wild-type Cys4 enzymes: , , , .
By using the four values, five Csy4 mutants were designed and shown in table below.
Csy4 | ||||
---|---|---|---|---|
WT | 0 | 0 | 0 | 0 |
Q104A | 0.483 | 2483 | 9.48 | 30.82 |
Y176F | 0.592 | -382 | 11.61 | 40.62 |
F155A | 0.233 | -1627 | 13.41 | 35.71 |
H29A | 0.173 | 833 | 15.29 | 316.22 |
Combining the bioinformatics and machine learning, we presented an algorithm to pre-processing our big mutation library. The flow chart of the pre-processing algorithm is shown below.
Fig.14 The flow chart of the pre-processing algorithm
The SVM model was training well and the results were shown below.
Fig.15 The training result
After training the SVM model, we used it to evaluate the hairpin mutants. We selected the hairpin mutants with high ranks to check the four key points. Finally, we determined the five hairpin mutants below. The following chart shows the DR-Score which is the evaluated results from the SVM model.
Hairpin-Mutant | |
---|---|
miniToe1 | 76.6306 |
miniToe2 | 65.6278 |
miniToe3 | 66.7160 |
miniToe4 | 62.5537 |
miniToe5 | 52.9794 |
After designing the protein mutants and hairpin mutants, we tested them in the wet lab.
Fig.16 The experimental results of mutants.
And we try to gave a comparison between the special value we used before for evaluating the mutants and experimental results to check our model.
For the protein mutants, we gave a comparison between D3 and experimental results.
Fig.17 The comparison between model and experimental results for different protein mutants.
As we can see in the Fig.17, we can find the inner relationship between D3 and experimental results: the D3 value describes the difference in the ability of cleavage between the wild-type and mutants. A higher D3 value means a weaker cleavage ability.
For the hairpin mutants, we gave a comparison between DR-Score and experimental results.(Fig.18)
Fig.18 The comparison between model and experimental results for hairpin mutants.
We can find the inner relationship between DR-Score and experimental results except miniToe 1. It is reasonable because the machine learning is quite sensitive to the data amounts and the R2 is not 1 in our training result of SVM model.
Importantly, we tested 30 combinations of Csy4 enzymes and hairpins in wet lab.(Fig.19)
Fig.19 The heatmap results of our 30 combinations
Fig.20 The heatmap results of our 30 combinations
Fig.21 The mRNA distribution in t=600s
1.The different cistrons in diffrernt positions have different mRNA abundance.
2. The different cistrons in diffrernt positions have different translational time.
To see more details
(1)The transcription of two CDSs region:
Here we divided the polycistron into two part with different transcription paraments , to deal with the problem of different mRNA abundance due to the premature termination. The two paraments , is totally sequence-dependent.
(2)The degradation of mRNA
The degradation of RNA also divided into two parts with different transcription paraments , to deal with the problem of different translational time for two mRNA. Each can be divided into two part:
The denotes the recoup item for the translational time difference and the denotes the common degradation rate of mRNA.
(3)The translation of protein.
In the translation of two proteins, the two paraments used to describe the translation also should be different considering the translation coupling.
(4)The degradation of proteins
We don’t have too much discussion in the proteins degradation here. This model is the simple forms of the coupled transcription-translation model, it keeps the easy form but also reflect the common phenomenon which will happen in the transcript and translation of polycistron including transcript polarity and translation coupling.
To see more details
Fig.22 The mechanisms of miniToe polycistron
Fig.23 the relationship between the cleavage rate and the ratio of two proteins in stable level
From the Fig.23 we can find that the cleavage rate in our miniToe polycistron plays a role in changing the shape of product curve which have been proved in the first system, while having little effect in the ratio of two proteins in the stable level. To see more details
Fig.23 the relationship between the cleavage rate and the ratio of two proteins in stable level
From the Fig.23 we can find that the cleavage rate in our miniToe polycistron plays a role in changing the shape of product curve which have been proved in the first system, while having little effect in the ratio of two proteins in the stable level.