Difference between revisions of "Team:OUC-China/polycistron"

 
(59 intermediate revisions by 2 users not shown)
Line 156: Line 156:
 
<br /><br />
 
<br /><br />
 
<h3>1.Current model for polycistron expression system </h3>
 
<h3>1.Current model for polycistron expression system </h3>
<br /> Before we modeling our system, we firstly give a short review on model of polycistron expression system. For the common model, they believe that the mRNA of different cistrons in different positions has the same abundance and if they have the same translation rate, the protein which produced by different cistrons is equal. But in the truth, the natural polycistron has many strategies in regulate the protein abundance such as the overleap or hairpin in 3’. And for the synthetic polycistron, many things just like the transcription polarity and translation coupling play important roles. Many of them control the protein by control the mRNA abundance. So a more precise model for polycistron is needed.[1] <br/ ><br />
+
<br /> Before we modeling our system, we firstly give a short review on the model of the polycistron expression system. For the common model, they believe that the mRNA of different cistrons in different positions has the same abundance and if they have the same translation rate, the protein which produced by different cistrons is equal if we do not consider the decay rate of mRNA. But in the truth, the natural polycistron has many strategies to regulate the protein abundance such as the overleap or hairpin in 3’. And for the synthetic polycistron, many things just like the transcription polarity and translation coupling play important roles. Many of them control the protein by control the mRNA abundance. So a more precise model for polycistron is needed[1]. <br/ ><br />
<h3>2.The coupled transcription-translation model for monocistron </h3>
+
<h3 id='werr'>2.The coupled transcription-translation model for polycistron </h3>
<br />In this part we will present a coupled transcription-translation model for the polycistron in prokaryotes. The model is based on the Andre S Riberio’s work, he presents a coupled transcription-translation model for monocistron. We have done some works to extend the model to use in the polycistron. <br /><br />
+
<br />In this part we will present a coupled transcription-translation model for the polycistron in prokaryotes. The model is based on the Andre S Riberio’s work[2], he presents a coupled transcription-translation model for monocistron. We have done some works to extend the model to use in the polycistron. <br /><br />
 
<h4 ><font size="4" color="#8FBC8F"> 2.1 The origin model for monocistron</font></h4>
 
<h4 ><font size="4" color="#8FBC8F"> 2.1 The origin model for monocistron</font></h4>
 
<br /> The origin model build by Andre S Riberio is a stochastic delayed differential equation model in sequence-level, and it can be divided into two mian part: the transcriptional part and the translational part.
 
<br /> The origin model build by Andre S Riberio is a stochastic delayed differential equation model in sequence-level, and it can be divided into two mian part: the transcriptional part and the translational part.
Line 815: Line 815:
 
</math>
 
</math>
 
     </div>  
 
     </div>  
<br /><br />(4)Back-translocation: <br />
+
<br /><br />(4)Back-translocation: <br /><br />
 
<div align="center">    <math>
 
<div align="center">    <math>
 
  <mrow>
 
  <mrow>
Line 1,012: Line 1,012:
 
                                         ) nucleotides.  U<sup>R</sup><sub>n</sub> , O<sup>R</sup><sub>n</sub> and  A<sup>R</sup><sub>n</sub> are the ribonucleic equivalent fo U<sub>n</sub>, O<sub>n</sub> and A<sub>n</sub> in transcriptional part, which has similarity meaning. <br/> <br/>
 
                                         ) nucleotides.  U<sup>R</sup><sub>n</sub> , O<sup>R</sup><sub>n</sub> and  A<sup>R</sup><sub>n</sub> are the ribonucleic equivalent fo U<sub>n</sub>, O<sub>n</sub> and A<sub>n</sub> in transcriptional part, which has similarity meaning. <br/> <br/>
 
 
<h4 ><font size="4" color="#8FBC8F"> 2.2 The model we improve for the polycistron</font></h4>   <br/>Now we have known the coupled transcription-translation model for bi-cistron, which is the simplest polycistron. <br/><br/>In order to extend it to use in the bi-cistron, we simplify add another translational part into the old model. So our new model have one translational part and two translational parts for two CDSs in the bi-cistron. Now the most important things are to build the relationship between two translational part.
+
<h4 ><font size="4" color="#8FBC8F"> 2.2 The model we improve for the polycistron</font></h4>   <br/>
  <br/><br/> The first thing we need to reconsidering that is to recalculate the initiation translation rate for the second CDS because this parameter is influence by translation coupling.
+
                                       
 +
                                       
 +
                                       
 +
      <div align="center"><img src="https://static.igem.org/mediawiki/2018/3/3b/T--OUC-China--p31.jpg" height="150"></div>
 +
<div align="center"><p>Fig.2-1 the organization of operon</p></div>                                 
 +
                                       
 +
                                       
 +
                                       
 +
                                       
 +
                                       
 +
                                       
 +
                                       
 +
                                        Now we have known the coupled transcription-translation model for monocistron. In order to extend it to use in the bi-cistron like Fig.2-1, we simply add another translational part into the old model. So our new model has one translational part and two translational parts for two CDSs in the bi-cistron. Now the most important things are to build the relationship between the two translational part.
 +
  <br/><br/> The first thing we need to reconsider that is to recalculate the initiation translation rate for the second CDS because this parameter is influenced by translation coupling.
 
                                       <br/>  <br/>For the translate rates of the second CDS, k<sub>2</sub>
 
                                       <br/>  <br/>For the translate rates of the second CDS, k<sub>2</sub>
, can be calculated by the following formula in statistical thermodynamics:<br />
+
, can be calculated by the following formula in statistical thermodynamics[3]:<br />
 
<div align="center"><math display='block'>
 
<div align="center"><math display='block'>
 
  <mrow>
 
  <mrow>
Line 1,053: Line 1,066:
 
                                         <br/>The formula is divided into two parts to describe the transcript coupling. The first part, r<sub>reinitiation</sub><sup>(2)</sup>
 
                                         <br/>The formula is divided into two parts to describe the transcript coupling. The first part, r<sub>reinitiation</sub><sup>(2)</sup>
 
                                         , showing that the ribosome terminates the translation of upstream CDS then dissociate and re-initiate the translation of downstream CDS, is called the ribosome re-initiation. The second part, e<sub>-&beta;&Delta;G<sub>total</sub></sub>
 
                                         , showing that the ribosome terminates the translation of upstream CDS then dissociate and re-initiate the translation of downstream CDS, is called the ribosome re-initiation. The second part, e<sub>-&beta;&Delta;G<sub>total</sub></sub>
, showing that the elongate along the upstream CDS and unfolding the mRNA structure which increase the expression of the upstream CDS, is called de novo ribosome initiation. The two kinds of initiation can be seen in the Fig.2-1.
+
, showing that the elongate along the upstream CDS and unfolding the mRNA structure which increases the expression of the upstream CDS is called de novo ribosome initiation. The two kinds of initiation can be seen in the Fig.2-2.
 
 
 
 
Line 1,060: Line 1,073:
 
<div align="center"><img src="https://static.igem.org/mediawiki/2018/9/94/T--OUC-China--p21.jpg" height="400">
 
<div align="center"><img src="https://static.igem.org/mediawiki/2018/9/94/T--OUC-China--p21.jpg" height="400">
 
</div>
 
</div>
<div align="center"><p>Fig.2-1 two kinds of initiation
+
<div align="center"><p>Fig.2-2 two kinds of initiation
 
</p></div>
 
</p></div>
 
     The first part in formula can be calculated by the following formula:
 
     The first part in formula can be calculated by the following formula:
Line 1,110: Line 1,123:
 
                                         refers to the intergenic distance dependence and the k<sub>p</sub>
 
                                         refers to the intergenic distance dependence and the k<sub>p</sub>
 
  refers to the proportionality constant between the ribosome assemble rate and the translation initiation rate.  
 
  refers to the proportionality constant between the ribosome assemble rate and the translation initiation rate.  
  <br/> For the  k<sub>reinitiation</sub>(d<sub>1,2</sub>) is proved that can be calculate by the formula following:<br /><br />
+
  <br/><br/> For the  k<sub>reinitiation</sub>(d<sub>1,2</sub>) is proved that can be calculate by the formula following:<br /><br />
 
<div align="center"><math>
 
<div align="center"><math>
 
  <mrow>
 
  <mrow>
Line 1,235: Line 1,248:
 
th CDS’s start codon while the  
 
th CDS’s start codon while the  
 
  x<sub>stop</sub>
 
  x<sub>stop</sub>
  refers to first nucleotides in i th CDS ‘s stop codon.  And it also points that the<math display='block'>
+
  refers to first nucleotides in i th CDS ‘s stop codon.  And it also points that the
 
                                         k<sub>p</sub>=10
 
                                         k<sub>p</sub>=10
 
  .
 
  .
<br/> The second part in formula can be calculated by the following formula:
+
 
 
+
                                            <br/><br/>The<math>
  Where the  refers to the free energy of folding for all inhibitory RNA structure that block the standby site, overlap with SD sequence, spacer region or the downstream footprint region of ribosome. The  refers to the free energy of all the other RNA structure except the inhibitory RNA structure. And the &Delta; can be calculated by the following formula: <br /> <div align="center">          <math>
+
 
  <mrow>
 
  <mrow>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>&#x0394;</mi><msub>
Line 1,250: Line 1,262:
 
   <mrow></mrow>
 
   <mrow></mrow>
 
   <mrow>
 
   <mrow>
     <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
+
     <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
 +
  </msup>
 +
  </mrow>
 +
</math>
 +
refers to the total binding free energy between the ribosome and 5’UTR, according to the equation:
 +
 +
<div align="center">            <math>
 +
<mrow>
 +
  <mi>&#x0394;</mi><msub>
 +
  <mi>G</mi>
 +
  <mrow>
 +
    <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
 +
  </msub>
 +
  <msup>
 +
  <mrow></mrow>
 +
  <mrow>
 +
    <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
 
   </msup>
 
   </msup>
 
   <mo>=</mo><mi>&#x0394;</mi><msub>
 
   <mo>=</mo><mi>&#x0394;</mi><msub>
Line 1,275: Line 1,303:
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>n</mi><mi>o</mi><mi>n</mi><mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
     <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
 
   </msub>
 
   </msub>
   <mo>&#x2212;</mo><mi>&#x0394;</mi><msub>
+
   </mrow>
 +
</math></div>
 +
<br/><br/>The <math>
 +
<mrow>
 +
  <mi>&#x0394;</mi><msub>
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
     <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi><mo>&#x2212;</mo><mi>r</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
  </msub>
+
  <msub>
+
  <mi>F</mi>
+
  <mrow>
+
    <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
 
   </msub>
 
   </msub>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
                </div>
+
refers to the free energy of folding for mRNA-rRNA complex, which is negative.
+
<br/>The <math>
<br/><br/>  Where the <math>
+
 
  <mrow>
 
  <mrow>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
     <mi>s</mi><mi>p</mi><mi>a</mi><mi>c</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
 
   </msub>
 
   </msub>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
refers to the free energy of folding for all inhibitory RNA structure that block the standby site, overlap with SD sequence, spacer region or the downstream footprint region of ribosome.<br/> The <math>
+
refers to the free energy for the non-optimal physical distance between SD sequence and the start codon, which is positive.
 +
<br/><br/>The <math>
 
  <mrow>
 
  <mrow>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>n</mi><mi>o</mi><mi>n</mi><mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
     <mi>s</mi><mi>t</mi><mi>a</mi><mi>r</mi><mi>t</mi></mrow>
 
   </msub>
 
   </msub>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
  refers to the free energy of all the other RNA structure except the inhibitory RNA structure. And the <math>
+
refers to the free energy for <math>
 
  <mrow>
 
  <mrow>
   <msub>
+
   <mi>t</mi><mi>R</mi><mi>N</mi><msup>
   <mi>F</mi>
+
   <mi>A</mi>
 
   <mrow>
 
   <mrow>
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
     <mi>f</mi><mi>M</mi><mi>E</mi><mi>T</mi></mrow>
   </msub>
+
   </msup>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
can be calculated by the following formula:
+
-start codon complex, which is negative.
<div align="center">           <math>
+
<br/><br/>The <math>
 
  <mrow>
 
  <mrow>
   <msub>
+
   <mi>&#x0394;</mi><msub>
   <mi>F</mi>
+
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
     <mi>s</mi><mi>tan</mi><mi>d</mi><mi>b</mi><mi>y</mi></mrow>
 
   </msub>
 
   </msub>
  <mo>=</mo><mfrac>
 
  <mn>1</mn>
 
  <mrow>
 
    <mn>1</mn><mo>+</mo><mi>C</mi><mo>&#x22C5;</mo><msup>
 
    <mi>e</mi>
 
    <mrow>
 
      <mi>&#x03B2;</mi><mo>&#x22C5;</mo><mi>&#x0394;</mi><msub>
 
      <mi>G</mi>
 
      <mrow>
 
        <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
 
      </msub>
 
      <msup>
 
      <mrow></mrow>
 
      <mrow>
 
        <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
 
      </msup>
 
      </mrow>
 
    </msup>
 
    </mrow>
 
  </mfrac>
 
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
                </div>
+
refers to the free energy for the , which is negative.
Which the <math>
+
<br/><br/>The <math>
<mi>C</mi>
+
</math>
+
is the ribosome-assisted unfolding coefficient. <math>
+
<mrow>
+
  <mi>C</mi><mtext>=</mtext><mn>0.81</mn></mrow>
+
</math>
+
in our study.
+
<br/>The<math>
+
 
  <mrow>
 
  <mrow>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
+
     <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
 
   </msub>
 
   </msub>
  <msup>
 
  <mrow></mrow>
 
  <mrow>
 
    <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
 
  </msup>
 
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
  refers to the total binding free energy between the ribosome and 5’UTR, according to the equation:
+
  refers to the free energy of folding for 5’UTR, which is negative.
+
<br/><br/>All these energies can be calculated by the NUPACK suit of energy with mFold 3.0 RNA energy parameter. And five energy can be seen vividly in the Fig.2-3.<br/>
<div align="center">           <math>
+
<div align="center"><img src="https://static.igem.org/mediawiki/2018/6/6e/T--OUC-China--p22.jpg" height="250"></div>
 +
<div align="center"><p>
 +
Fig.2-3 the five part of the total binding free energy</p></div>                                   
 +
                                       
 +
                                       
 +
<br/> <br/> The second part in formula can be calculated by the following formula:
 +
 
 +
  <br /><br /> <div align="center">           <math>
 
  <mrow>
 
  <mrow>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>&#x0394;</mi><msub>
Line 1,383: Line 1,384:
 
   <mrow></mrow>
 
   <mrow></mrow>
 
   <mrow>
 
   <mrow>
     <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
+
     <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
 
   </msup>
 
   </msup>
 
   <mo>=</mo><mi>&#x0394;</mi><msub>
 
   <mo>=</mo><mi>&#x0394;</mi><msub>
Line 1,405: Line 1,406:
 
     <mi>s</mi><mi>tan</mi><mi>d</mi><mi>b</mi><mi>y</mi></mrow>
 
     <mi>s</mi><mi>tan</mi><mi>d</mi><mi>b</mi><mi>y</mi></mrow>
 
   </msub>
 
   </msub>
 +
 +
</math>
 +
<math>
 +
 
   <mo>&#x2212;</mo><mi>&#x0394;</mi><msub>
 
   <mo>&#x2212;</mo><mi>&#x0394;</mi><msub>
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
     <mi>n</mi><mi>o</mi><mi>n</mi><mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
 
   </msub>
 
   </msub>
   </mrow>
+
   <mo>&#x2212;</mo><mi>&#x0394;</mi><msub>
</math></div>
+
<br/>The <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi><mo>&#x2212;</mo><mi>r</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
 +
  </msub>
 +
  <msub>
 +
  <mi>F</mi>
 +
  <mrow>
 +
    <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
 
   </msub>
 
   </msub>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
refers to the free energy of folding for mRNA-rRNA complex, which is negative.
+
                </div>
<br/>The <math>
+
 +
<br/><br/>  Where the <math>
 
  <mrow>
 
  <mrow>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>s</mi><mi>p</mi><mi>a</mi><mi>c</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
 
   </msub>
 
   </msub>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
refers to the free energy for the non-optimal physical distance between SD sequence and the start codon, which is positive.
+
refers to the free energy of folding for all inhibitory RNA structure that block the standby site, overlap with SD sequence, spacer region or the downstream footprint region of ribosome.<br/><br/> The <math>
<br/>The <math>
+
 
  <mrow>
 
  <mrow>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>&#x0394;</mi><msub>
 
   <mi>G</mi>
 
   <mi>G</mi>
 
   <mrow>
 
   <mrow>
     <mi>s</mi><mi>t</mi><mi>a</mi><mi>r</mi><mi>t</mi></mrow>
+
     <mi>n</mi><mi>o</mi><mi>n</mi><mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
 
   </msub>
 
   </msub>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
refers to the free energy for <math>
+
  refers to the free energy of all the other RNA structure except the inhibitory RNA structure. And the <math>
 
  <mrow>
 
  <mrow>
   <mi>t</mi><mi>R</mi><mi>N</mi><msup>
+
   <msub>
   <mi>A</mi>
+
   <mi>F</mi>
 
   <mrow>
 
   <mrow>
     <mi>f</mi><mi>M</mi><mi>E</mi><mi>T</mi></mrow>
+
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
   </msup>
+
   </msub>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
-start codon complex, which is negative.
+
can be calculated by the following formula:
<br/>The <math>
+
<div align="center">           <math>
 
  <mrow>
 
  <mrow>
   <mi>&#x0394;</mi><msub>
+
   <msub>
   <mi>G</mi>
+
   <mi>F</mi>
 
   <mrow>
 
   <mrow>
     <mi>s</mi><mi>tan</mi><mi>d</mi><mi>b</mi><mi>y</mi></mrow>
+
     <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
 
   </msub>
 
   </msub>
 +
  <mo>=</mo><mfrac>
 +
  <mn>1</mn>
 +
  <mrow>
 +
    <mn>1</mn><mo>+</mo><mi>C</mi><mo>&#x22C5;</mo><msup>
 +
    <mi>e</mi>
 +
    <mrow>
 +
      <mi>&#x03B2;</mi><mo>&#x22C5;</mo><mi>&#x0394;</mi><msub>
 +
      <mi>G</mi>
 +
      <mrow>
 +
        <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
 +
      </msub>
 +
      <msup>
 +
      <mrow></mrow>
 +
      <mrow>
 +
        <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
 +
      </msup>
 +
      </mrow>
 +
    </msup>
 +
    </mrow>
 +
  </mfrac>
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
refers to the free energy for the , which is negative.
+
                </div> <br />
<br/>The <math>
+
Which the <math>
 +
<mi>C</mi>
 +
</math>
 +
is the ribosome-assisted unfolding coefficient. <math>
 
  <mrow>
 
  <mrow>
   <mi>&#x0394;</mi><msub>
+
   <mi>C</mi><mtext>=</mtext><mn>0.81</mn></mrow>
  <mi>G</mi>
+
  <mrow>
+
    <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
  </msub>
+
  </mrow>
+
 
</math>
 
</math>
  refers to the free energy of folding for 5’UTR, which is negative.
+
  in our study.
<br/>All these energies can be calculated by the NUPACK suit of energy with mFold 3.0 RNA energy parameter. And five energy can be seen vividly in the Fig.2-2.
+
                                       
<div align="center"><img src="https://static.igem.org/mediawiki/2018/6/6e/T--OUC-China--p22.jpg" height="300"></div>
+
                                       
<div align="center"><p>
+
                                       
Fig.2-2 the five part of the total binding free energy</p></div>
+
<br /><br />
The second thing we need to reconsidering that is to recalculate the premature termination rate for the second CDS because this parameter is influence by transcription polarity. Even there is only one transcriptional part, the premature termination rate of the second CDS is higher than the first CDS because the Rho factor will bind to RNA in the intergenic regions to cause rho- dependent termination. And almost 80% of premature termination is caused by rho-dependent termination. We are going to use the queening theory to build the an model to describe it haven’t finish yet. <br/>
+
                                       
<h4 ><font size="4" color="#8FBC8F"> 2.3 Explore the mRNA abundance using the model </font></h4> We then constructed a polycistron which has two LacZ gene in the model. And then correcting the premature termination and premature termination rate of the second CDS. The other parameters can be find in Andre S Riberio’s work. And carrying out the simulation in StochPy and SGNSim. We get the distribution of mRNA at t=100s and t=600s in Fig2-3 and Fig 2-4.
+
                                       
<div align="center"><img src="2.3" height="450"></div>
+
                                       
<div align="center"><p>Fig.2-3</p></div>
+
The second thing we need to reconsider that is to recalculate the premature termination rate for the second CDS because this parameter is influenced by transcription polarity. Even there is only one transcriptional part, the premature termination rate of the second CDS is higher than the first CDS because the Rho factor will bind to RNA in the intergenic regions to cause rho-dependent termination. And almost 80% of premature termination is caused by Rho-dependent termination. We are going to use the Queuing Theory to build a model to describe it but the model hasn’t finished yet. <br/> <br/>
<div align="center"><img src="2.4" height="450"></div>
+
Here I will give the most essential idea of the unfinished model. We know the original model of queuing theory has assumed that the two cars will not have an accident and no car try to jump the queue. But we will regard the Rho-factor as the car which jumps the queue and the interaction between Rho-factor and RNA polymerase will be described as the accident. Once the waiting time for the first ribosome is increasing, it will be regarded as the happened of Rho-dependent termination.  So by breaking the two basic assumptions of queuing theory, we will have the description of transcriptional polarity. The Rho-dependent termination rate will be calculated as the expectation of the accident times per minutes while the new premature termination rate will be described as the sum of the original premature termination rate and the Rho-dependent termination rate. <br/> <br/>
<div align="center"><p>Fig.2-4</p></div>
+
<h4 ><font size="4" color="#8FBC8F"> 2.3 Explore the mRNA abundance using the model </font></h4> <br/> We constructed a polycistron which has two LacZ gene in the model. And then correcting the premature termination and premature termination rate of the second CDS. The other parameters can be found in Andre S Riberio’s work. And carrying out the simulation in StochPy and SGNSim. We get the distribution of mRNA at t=100s and t=600s in Fig2-4 and Fig 2-5.
The mRNA distribution in 100s stands for the origin time of mRNA distribution while the mRNA distribution in 600s stans for the finally state mRNA
+
<div align="center"><img src="https://static.igem.org/mediawiki/2018/f/fa/T--OUC-China--JCMXPOLY1.png" height="450"></div>
 +
<div align="center"><p>Fig.2-4 the distribution of mRNA at t=100s</p></div>
 +
<div align="center"><img src="https://static.igem.org/mediawiki/2018/9/9d/T--OUC-China--JCMXPOLY2g.png" height="450"></div>
 +
<div align="center"><p>Fig.2-5 the distribution of mRNA at t=600s</p></div>
 +
The mRNA distribution in 100s stands for the origin time of mRNA distribution while the mRNA distribution in 600s stans for the final state of mRNA distribution. Fig.2-4 shows that the mRNA of first CDSs is produced while the mRNA of the second CDSs haven't been produced now. Which shows that the mRNA of different cistrons in different positions has different translational time. Fig.2-5 shows that the there is a decrease between the mRNA abundance between two CDSs which suggest that the mRNA of different cistrons in different positions has different abundance.
 
 
  
Line 1,487: Line 1,515:
 
                                          
 
                                          
 
                                          
 
                                          
  <br />                                      
+
  <br />   <br />                                   
<h3>3.A flexible model for polycistron</h3>
+
<h3 id='wer'>3.A flexible model for polycistron</h3>
 
<br />
 
<br />
We have got some points from the coupled transcription-translation model: the mRNA of different cistrons in different positions has different abundance. This phenomenon may be caused by premature termination or something others, and this will result in the different protein level. And different protein level as caused by the different translation time.
+
We have got some points from the coupled transcription-translation model: the mRNA of different cistrons in different positions has different abundance. This phenomenon may be caused by premature termination or something others, and this will result in the different protein level. And different protein level also caused by the different translation time.
<br/>The coupled transcription-translation model is too complex and hard to operate. Here we propose a framework to explain the more realistic situation happened in the polycistron, but still hope it also can keep the simple forms. <div align="center"><img src="https://static.igem.org/mediawiki/2018/3/3b/T--OUC-China--p31.jpg" height="150"></div>
+
<br/><br/>The coupled transcription-translation model is too complex and hard to operate. Here we propose a framework to explain the more realistic situation happened in the polycistron but still hope it also can keep the simple forms.
<div align="center"><p>Fig.3-1 the organization of operon</p></div>
+
 
 
<br/><br/> Considering a polycistron like the Fig.3-1 shows, it contains a promoter, a 5’UTR and two CDSs which is separated by the intergenic regions, and following a terminator in the end.  
+
<br/><br/> Considering a polycistron like the Fig.2-1 shows, it contains a promoter, a 5’UTR and two CDSs which is separated by the intergenic regions, and following a terminator in the end.  
 
<br/><br/>The reaction can be described following four main steps:
 
<br/><br/>The reaction can be described following four main steps:
 
<br/><br/>(1)The transcription of two CDSs region:
 
<br/><br/>(1)The transcription of two CDSs region:
Line 1,741: Line 1,768:
 
                     </div>
 
                     </div>
 
<br/>In the translation of two proteins, the two paraments used to describe the translation also should be different considering the translation coupling. And we build a thermodynamic model to calculated it before in the 2.2, now we will give a conclusion for it.
 
<br/>In the translation of two proteins, the two paraments used to describe the translation also should be different considering the translation coupling. And we build a thermodynamic model to calculated it before in the 2.2, now we will give a conclusion for it.
+
<br /> <br />
+
                                        For the translate rates of the first CDS, k<sub>1</sub> , can be calculated by the following formula in statistical thermodynamics :
+
+
<div align='center'>
+
                         
<br/>For the translate rates of the second CDS,  
+
    <math display='block'>
, can be calculated by the following formula in statistical thermodynamics:
+
<div align="center"><math display='block'>
+
 
  <mrow>
 
  <mrow>
 
   <msub>
 
   <msub>
 
   <mi>k</mi>
 
   <mi>k</mi>
   <mn>2</mn>
+
   <mn>1</mn>
 
   </msub>
 
   </msub>
   <mo>&#x221D;</mo><msub>
+
   <mo>&#x221D;</mo><msup>
  <mi>r</mi>
+
  <mrow>
+
    <mi>r</mi><mi>e</mi><mi>i</mi><mi>n</mi><mi>i</mi><mi>t</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>i</mi><mi>o</mi><mi>n</mi></mrow>
+
  </msub>
+
  <msup>
+
  <mrow></mrow>
+
  <mrow>
+
    <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
+
  </msup>
+
  <mo>+</mo><msup>
+
 
   <mi>e</mi>
 
   <mi>e</mi>
 
   <mrow>
 
   <mrow>
Line 1,775: Line 1,790:
 
     <mrow></mrow>
 
     <mrow></mrow>
 
     <mrow>
 
     <mrow>
       <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
+
       <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
 
     </msup>
 
     </msup>
 
     </mrow>
 
     </mrow>
Line 1,781: Line 1,796:
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
 
+
                         
</div>
+
</div>
  <br/>The formula is divided into two parts to describe the transcript coupling. The first part, <math display='block'>
+
<br />
 +
<br/> <br/>For the translate rates of the second CDS, k<sub>2</sub>
 +
, can be calculated by the following formula in statistical thermodynamics:<br />
 +
<div align="center"><math display='block'>
 
  <mrow>
 
  <mrow>
 
   <msub>
 
   <msub>
 +
  <mi>k</mi>
 +
  <mn>2</mn>
 +
  </msub>
 +
  <mo>&#x221D;</mo><msub>
 
   <mi>r</mi>
 
   <mi>r</mi>
 
   <mrow>
 
   <mrow>
Line 1,795: Line 1,817:
 
     <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
 
     <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
 
   </msup>
 
   </msup>
   </mrow>
+
   <mo>+</mo><msup>
</math>
+
, showing that the ribosome terminates the translation of upstream CDS then dissociate and re-initiate the translation of downstream CDS, is called the ribosome re-initiation. The second part, <math display='block'>
+
<mrow>
+
  <msup>
+
 
   <mi>e</mi>
 
   <mi>e</mi>
 
   <mrow>
 
   <mrow>
Line 1,816: Line 1,834:
 
   </mrow>
 
   </mrow>
 
</math>
 
</math>
, showing that the elongate along the upstream CDS and unfolding the mRNA structure which increase the expression of the upstream CDS, is called de novo ribosome initiation. The two kinds of initiation can be seen in the Fig.2-1.
+
</div>
+
<br /><br /> The calculation method is discussed in 2.2.
+
+
+
<div align="center"><img src="2.1" height="450">
+
</div>
+
<div align="center"><p>Fig.2-1 two kinds of initiation
+
</p></div>
+
    The first part in formula can be calculated by the following formula:
+
<div align="center"><math display='block'>
+
<mrow>
+
  <msub>
+
  <mi>r</mi>
+
  <mrow>
+
    <mi>r</mi><mi>e</mi><mi>i</mi><mi>n</mi><mi>i</mi><mi>t</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>i</mi><mi>o</mi><mi>n</mi></mrow>
+
  </msub>
+
  <msup>
+
  <mrow></mrow>
+
  <mrow>
+
    <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
+
  </msup>
+
  <mtext>=</mtext><msub>
+
  <mi>k</mi>
+
  <mi>p</mi>
+
  </msub>
+
  <mo>&#x22C5;</mo><msub>
+
  <mi>k</mi>
+
  <mrow>
+
    <mi>r</mi><mi>e</mi><mi>i</mi><mi>n</mi><mi>i</mi><mi>t</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>i</mi><mi>o</mi><mi>n</mi></mrow>
+
  </msub>
+
  <mo stretchy='false'>(</mo><msub>
+
  <mi>d</mi>
+
  <mrow>
+
    <mn>1</mn><mo>,</mo><mn>2</mn></mrow>
+
  </msub>
+
  <mo stretchy='false'>)</mo><mo>&#x22C5;</mo><msup>
+
  <mi>e</mi>
+
  <mrow>
+
    <mo>&#x2212;</mo><mi>&#x03B2;</mi><mo>&#x22C5;</mo><mi>&#x0394;</mi><msub>
+
    <mi>G</mi>
+
    <mrow>
+
      <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
+
    </msub>
+
    <msup>
+
    <mrow></mrow>
+
    <mrow>
+
      <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
+
    </msup>
+
    </mrow>
+
  </msup>
+
  </mrow>
+
</math>
+
</div>
+
Where the <math display='block'>
+
<mrow>
+
  <msub>
+
  <mi>k</mi>
+
  <mrow>
+
    <mi>r</mi><mi>e</mi><mi>i</mi><mi>n</mi><mi>i</mi><mi>t</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>i</mi><mi>o</mi><mi>n</mi></mrow>
+
  </msub>
+
  <mo stretchy='false'>(</mo><msub>
+
  <mi>d</mi>
+
  <mrow>
+
    <mn>1</mn><mo>,</mo><mn>2</mn></mrow>
+
  </msub>
+
  <mo stretchy='false'>)</mo></mrow>
+
</math>
+
refers to the intergenic distance dependence and the <math display='block'>
+
<mrow>
+
  <msub>
+
  <mi>k</mi>
+
  <mi>p</mi>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the proportionality constant between the ribosome assemble rate and the translation initiation rate.
+
<br/> For the  <math display='block'>
+
<mrow>
+
  <msub>
+
  <mi>k</mi>
+
  <mrow>
+
    <mi>r</mi><mi>e</mi><mi>i</mi><mi>n</mi><mi>i</mi><mi>t</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>i</mi><mi>o</mi><mi>n</mi></mrow>
+
  </msub>
+
  <mo stretchy='false'>(</mo><msub>
+
  <mi>d</mi>
+
  <mrow>
+
    <mn>1</mn><mo>,</mo><mn>2</mn></mrow>
+
  </msub>
+
  <mo stretchy='false'>)</mo></mrow>
+
</math>
+
is proved that can be calculate by the formula following:
+
<div align="center"><math>
+
<mrow>
+
  <msub>
+
  <mi>k</mi>
+
  <mrow>
+
    <mi>r</mi><mi>e</mi><mi>i</mi><mi>n</mi><mi>i</mi><mi>t</mi><mi>i</mi><mi>a</mi><mi>t</mi><mi>i</mi><mi>o</mi><mi>n</mi></mrow>
+
  </msub>
+
  <mo stretchy='false'>(</mo><mi>d</mi><mo stretchy='false'>)</mo><mtext>=</mtext><mrow><mo>{</mo> <mtable columnalign='left'>
+
  <mtr>
+
    <mtd>
+
    <mtext>0</mtext><mtext>.0072</mtext><mo>&#x00B1;</mo><mtext>0</mtext><mtext>.0048</mtext><mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mtext>0</mtext><mo>&#x2264;</mo><mi>d</mi><mo>&#x2264;</mo><mn>25</mn>
+
    </mtd>
+
  </mtr>
+
  <mtr>
+
    <mtd>
+
    <mtext>0</mtext><mtext>.0220</mtext><mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mi>d</mi><mo>=</mo><mo>&#x2212;</mo><mn>4</mn>
+
    </mtd>
+
  </mtr>
+
  <mtr>
+
    <mtd>
+
    <mtext>0</mtext><mtext>.0072+0</mtext><mtext>.0004</mtext><mo>&#x22C5;</mo><mtext>(</mtext><mi>d</mi><mo>+</mo><mn>10</mn><mtext>)</mtext><mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mover>
+
      <mrow></mrow>
+
      <mrow></mrow>
+
    </mover>
+
    <mo>&#x2212;</mo><mtext>10</mtext><mo>&#x2264;</mo><mi>d</mi><mo>&#x2264;</mo><mo>&#x2212;</mo><mn>25</mn>
+
    </mtd>
+
  </mtr>
+
  </mtable>
+
  </mrow></mrow>
+
</math>
+
 
+
</div>
+
<br/>Where the<math display='block'>
+
<mrow>
+
  <mi>d</mi><mtext>=</mtext><msub>
+
  <mi>x</mi>
+
  <mrow>
+
    <mi>s</mi><mi>t</mi><mi>a</mi><mi>r</mi><mi>t</mi></mrow>
+
  </msub>
+
  <mo>&#x2212;</mo><msub>
+
  <mi>x</mi>
+
  <mrow>
+
    <mi>s</mi><mi>t</mi><mi>o</mi><mi>p</mi></mrow>
+
  </msub>
+
  <mo>&#x2212;</mo><mn>3</mn></mrow>
+
</math>
+
,  <math display='block'>
+
<mrow>
+
  <msub>
+
  <mi>x</mi>
+
  <mrow>
+
    <mi>s</mi><mi>t</mi><mi>a</mi><mi>r</mi><mi>t</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the first nucleotides in <math>
+
<mi>j</mi>
+
</math>
+
th CDS’s start codon while the <math display='block'>
+
<mrow>
+
  <msub>
+
  <mi>x</mi>
+
  <mrow>
+
    <mi>s</mi><mi>t</mi><mi>o</mi><mi>p</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
                                        refers to first nucleotides in th CDS ‘s stop codon.  And it also points that the k<sub>p</sub>=10
+
.
+
<br/> The second part in formula can be calculated by the following formula:
+
 
+
  Where the  refers to the free energy of folding for all inhibitory RNA structure that block the standby site, overlap with SD sequence, spacer region or the downstream footprint region of ribosome. The  refers to the free energy of all the other RNA structure except the inhibitory RNA structure. And the  can be calculated by the following formula: <div align="center">          <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
+
  </msub>
+
  <msup>
+
  <mrow></mrow>
+
  <mrow>
+
    <mo stretchy='false'>(</mo><mn>2</mn><mo stretchy='false'>)</mo></mrow>
+
  </msup>
+
  <mo>=</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi><mo>&#x2212;</mo><mi>r</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
  </msub>
+
  <mo>+</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>p</mi><mi>a</mi><mi>c</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  <mo>+</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>t</mi><mi>a</mi><mi>r</mi><mi>t</mi></mrow>
+
  </msub>
+
  <mo>+</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>tan</mi><mi>d</mi><mi>b</mi><mi>y</mi></mrow>
+
  </msub>
+
  <mo>&#x2212;</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>n</mi><mi>o</mi><mi>n</mi><mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  <mo>&#x2212;</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  <msub>
+
  <mi>F</mi>
+
  <mrow>
+
    <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
                </div>
+
+
<br/>  Where the <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the free energy of folding for all inhibitory RNA structure that block the standby site, overlap with SD sequence, spacer region or the downstream footprint region of ribosome. The <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>n</mi><mi>o</mi><mi>n</mi><mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
  refers to the free energy of all the other RNA structure except the inhibitory RNA structure. And the <math>
+
<mrow>
+
  <msub>
+
  <mi>F</mi>
+
  <mrow>
+
    <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
can be calculated by the following formula:
+
<div align="center">          <math>
+
<mrow>
+
  <msub>
+
  <mi>F</mi>
+
  <mrow>
+
    <mi>c</mi><mi>o</mi><mi>u</mi><mi>p</mi><mi>l</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  <mo>=</mo><mfrac>
+
  <mn>1</mn>
+
  <mrow>
+
    <mn>1</mn><mo>+</mo><mi>C</mi><mo>&#x22C5;</mo><msup>
+
    <mi>e</mi>
+
    <mrow>
+
      <mi>&#x03B2;</mi><mo>&#x22C5;</mo><mi>&#x0394;</mi><msub>
+
      <mi>G</mi>
+
      <mrow>
+
        <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
+
      </msub>
+
      <msup>
+
      <mrow></mrow>
+
      <mrow>
+
        <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
+
      </msup>
+
      </mrow>
+
    </msup>
+
    </mrow>
+
  </mfrac>
+
  </mrow>
+
</math>
+
                </div>
+
Which the <math>
+
<mi>C</mi>
+
</math>
+
is the ribosome-assisted unfolding coefficient. <math>
+
<mrow>
+
  <mi>C</mi><mtext>=</mtext><mn>0.81</mn></mrow>
+
</math>
+
in our study.
+
<br/>The<math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
+
  </msub>
+
  <msup>
+
  <mrow></mrow>
+
  <mrow>
+
    <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
+
  </msup>
+
  </mrow>
+
</math>
+
refers to the total binding free energy between the ribosome and 5’UTR, according to the equation:
+
+
<div align="center">            <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>t</mi><mi>o</mi><mi>t</mi><mi>a</mi><mi>l</mi></mrow>
+
  </msub>
+
  <msup>
+
  <mrow></mrow>
+
  <mrow>
+
    <mo stretchy='false'>(</mo><mn>1</mn><mo stretchy='false'>)</mo></mrow>
+
  </msup>
+
  <mo>=</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi><mo>&#x2212;</mo><mi>r</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
  </msub>
+
  <mo>+</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>p</mi><mi>a</mi><mi>c</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  <mo>+</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>t</mi><mi>a</mi><mi>r</mi><mi>t</mi></mrow>
+
  </msub>
+
  <mo>+</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>tan</mi><mi>d</mi><mi>b</mi><mi>y</mi></mrow>
+
  </msub>
+
  <mo>&#x2212;</mo><mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math></div>
+
<br/>The <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi><mo>&#x2212;</mo><mi>r</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the free energy of folding for mRNA-rRNA complex, which is negative.
+
<br/>The <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>p</mi><mi>a</mi><mi>c</mi><mi>i</mi><mi>n</mi><mi>g</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the free energy for the non-optimal physical distance between SD sequence and the start codon, which is positive.
+
<br/>The <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>t</mi><mi>a</mi><mi>r</mi><mi>t</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the free energy for <math>
+
<mrow>
+
  <mi>t</mi><mi>R</mi><mi>N</mi><msup>
+
  <mi>A</mi>
+
  <mrow>
+
    <mi>f</mi><mi>M</mi><mi>E</mi><mi>T</mi></mrow>
+
  </msup>
+
  </mrow>
+
</math>
+
-start codon complex, which is negative.
+
<br/>The <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>s</mi><mi>tan</mi><mi>d</mi><mi>b</mi><mi>y</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the free energy for the , which is negative.
+
<br/>The <math>
+
<mrow>
+
  <mi>&#x0394;</mi><msub>
+
  <mi>G</mi>
+
  <mrow>
+
    <mi>m</mi><mi>R</mi><mi>N</mi><mi>A</mi></mrow>
+
  </msub>
+
  </mrow>
+
</math>
+
refers to the free energy of folding for 5’UTR, which is negative.
+
<br/>All these energies can be calculated by the NUPACK suit of energy with mFold 3.0 RNA energy parameter. And five energy can be seen vividly in the Fig.2-2.
+
<div align="center"><img src="2.2" height="450"></div>
+
<div align="center"><p>
+
Fig.2-2 the five part of the total binding free energy</p></div>
+
The second thing we need to reconsidering that is to recalculate the premature termination rate for the second CDS because this parameter is influence by transcription polarity. Even there is only one transcriptional part, the premature termination rate of the second CDS is higher than the first CDS because the Rho factor will bind to RNA in the intergenic regions to cause rho- dependent termination. And almost 80% of premature termination is caused by rho-dependent termination. We are going to use the queening theory to build the an model to describe it haven’t finish yet. <br/>
+
+
 
 
 
 
Line 2,333: Line 1,883:
 
</math>
 
</math>
 
                 </div>
 
                 </div>
<br/> We don’t have too much discussion in the proteins degradation here.
+
<br/> We don’t have too much discussion in the proteins degradation here. But we still have something to remind. As the protein we used for our study, sfGFP and RFP, are quite stable exactly. The degradation rate almost equals to the dilution due to the growth diffusion of E. coli colonies.
This model is the simple forms of the coupled transcription-translation model, it keeps the easy form but also reflect the common phenomenon which will happened in the transcript and translation of polycistron including transcript polarity and translation coupling. 
+
<br/><br/>
 +
This model is the simple forms of the coupled transcription-translation model, it keeps the easy form but also reflect the common phenomenon which will happen in the transcript and translation of polycistron including transcript polarity and translation coupling. 
 
 
 
<br/> <br/>
 
<br/> <br/>
<h3> 4.Hou the miniToe structure affect the ratio of two CDSs  </h3> <br/>
+
<h3> 4.How the miniToe structure affect the ratio of two CDSs  </h3> <br/>
 
 
 
<div align="center"><img src="https://static.igem.org/mediawiki/2018/f/ff/T--OUC-China--p41.jpg" height="450"></div>
 
<div align="center"><img src="https://static.igem.org/mediawiki/2018/f/ff/T--OUC-China--p41.jpg" height="450"></div>
 
<div align="center"><p>Fig.4-1 The working processing of miniToe</p></div>
 
<div align="center"><p>Fig.4-1 The working processing of miniToe</p></div>
<br/>  miniToe polycistron system has two components, Csy4 and the circuit of polycistron. With Csy4 protein, the polycistron will be cut into several mRNA chains with RNA/Csy4 complex at the 3’ UTR as the Fig.4-1 showing. The capability of RNA degradation protection will be much stronger, because of the high stability and affinity of Csy4 binding, which increase energy threshold for RNA degradation from 3’ UTR. So, the RNA degradation rate will be much lower. For the 5’ end degradation, the Csy4 cut will leave a OH- at 5’ end. the cleavage capability of RNase E will be much lower because there is no pyrophosphate in the 5’ end. Qi’s work has proved that OH-mRNAs exhibit higher gene expression than 5’ PPP-mRNAs.  
+
<br/>  miniToe polycistron system has two components, Csy4 and the circuit of polycistron. With Csy4 protein, the polycistron will be cut into several mRNA chains with RNA/Csy4 complex at the 3’ UTR as the Fig.4-1 showing. The capability of RNA degradation protection will be much stronger, because of the high stability and affinity of Csy4 binding, which increases the energy threshold for RNA degradation from 3’ UTR. So, the RNA degradation rate will be much lower. For the 5’ end degradation, the Csy4 cut will leave an OH- at 5’ end. the cleavage capability of RNase E will be much lower because there is no pyrophosphate in the 5’ end. Qi’s work has proved that OH-mRNAs exhibit higher gene expression than 5’ PPP-mRNAs.  
   <br/><br/>So by inserting different hairpin which have different binding ability into the miniToe site, we can control the half-life time of mRNA for two CDS in bi-cistron.
+
   <br/><br/>So by inserting different hairpin which has different binding ability into the miniToe site, we can control the half-life time of mRNA for two CDS in bi-cistron.
 
<br/> <br/>
 
<br/> <br/>
 
 
 
 
<h3> 5.The main role of cleavage rate for miniToe polycistron </h3>
+
<h3 id='werrr'> 5.The main role of cleavage rate for miniToe polycistron </h3>
<br/> In our miniToe system, the cleavage rate plays an important role in different regulation level. But in the miniToe polycistron system, the cleavage will not influence our goals that change ratio of two CDSs. Now we will give a simple model which based on the OUC-China 2016 to explain the relationship between the cleavage rate and the ratio of two proteins in stable level.  
+
<br/> In our miniToe system, the cleavage rate plays an important role in different regulation level. But in the miniToe polycistron system, the cleavage will not influence our goals that change ratio of two CDSs. Now we will give a simple model which based on the OUC-China 2016 [4] to explain the relationship between the cleavage rate and the ratio of two proteins in stable level.  
<br/><br/> We know that the cleavage of Csy4 can open the switch and the cleavage can affect the product curve which we have been prove in the model work for first and second system. Now we will discuss how does the cleavage rate effect the radio of two CDSs.
+
<br/><br/> We know that the cleavage of Csy4 can open the switch and the cleavage can affect the product curve which we have been prove in the model work for first and second system. Now we will discuss how does the cleavage rate effect the ratio of two CDSs.
 
  <br/><br/>  The polycistron system model that OUC-China 2016 constructed is describe by the following reaction:
 
  <br/><br/>  The polycistron system model that OUC-China 2016 constructed is describe by the following reaction:
 
<br/><br/> (1)Translation of polycistron mRNA :
 
<br/><br/> (1)Translation of polycistron mRNA :
Line 2,585: Line 2,136:
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kr</td>
+
       <td>K<sub>r</sub></td>
 
       <td>The constant of Transcription</td>
 
       <td>The constant of Transcription</td>
 
       <td>100</td>
 
       <td>100</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kp1</td>
+
       <td>K<sub>p1</sub></td>
 
       <td>The translation rate of polycistron</td>
 
       <td>The translation rate of polycistron</td>
 
       <td>67.51</td>
 
       <td>67.51</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kd1</td>
+
       <td>K<sub>d1</sub></td>
 
       <td>The cleavage rate of Enzyme</td>
 
       <td>The cleavage rate of Enzyme</td>
 
       <td>0.185</td>
 
       <td>0.185</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kp11</td>
+
       <td>K<sub>p11</sub></td>
       <td>The translation rate of mRNA1</td>
+
       <td>The translation rate of mRNA<sub>1</sub></td>
 
       <td>4.005</td>
 
       <td>4.005</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kp12</td>
+
       <td>K<sub>p12</sub></td>
       <td>The translation rate of mRNA2</td>
+
       <td>The translation rate of mRNA<sub>2</sub></td>
 
       <td>4.005</td>
 
       <td>4.005</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kd0</td>
+
       <td>K<sub>d0</sub></td>
 
       <td>The degradation rate of polycistron mRNA</td>
 
       <td>The degradation rate of polycistron mRNA</td>
 
       <td>17.625</td>
 
       <td>17.625</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kd11</td>
+
       <td>K<sub>d11</sub></td>
       <td>The degradation rate of mRNA1</td>
+
       <td>The degradation rate of mRNA<sub>1</sub></td>
 
       <td>18.864</td>
 
       <td>18.864</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kd12</td>
+
       <td>K<sub>d12</sub></td>
       <td>The degradation rate of mRNA2</td>
+
       <td>The degradation rate of mRNA<sub>2</sub></td>
 
       <td>21.816</td>
 
       <td>21.816</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kdp1</td>
+
       <td>K<sub>dp1</sub></td>
       <td>The degradation rate of protein1</td>
+
       <td>The degradation rate of protein<sub>1</sub></td>
 
       <td>0.0625</td>
 
       <td>0.0625</td>
 
     </tr>
 
     </tr>
 
     <tr>
 
     <tr>
       <td>Kdp2</td>
+
       <td>K<sub>dp2</sub></td>
       <td>The degradation rate of protein2</td>
+
       <td>The degradation rate of protein<sub>2</sub></td>
 
       <td>0.01</td>
 
       <td>0.01</td>
 
     </tr>
 
     </tr>
 
   </tbody>
 
   </tbody>
 
</table><br/>
 
</table><br/>
By the reaction equation and parameters above, we explore the relationship between the cleavage rate and the ratio of two proteins in stable level. Fig.5-1 is the result.
+
By the reaction equation and parameters above, we explore the relationship between the cleavage rate and the ratio of two proteins in the stable level. Fig.5-1 is the result. <br /> <br />
 
 
 
<div align="center"><img src="https://static.igem.org/mediawiki/2018/8/8c/T--OUC-China--p51.jpg" height="450"></div>
 
<div align="center"><img src="https://static.igem.org/mediawiki/2018/8/8c/T--OUC-China--p51.jpg" height="450"></div>
 
<div align="center"><p>Fig.5-1 The relationship between the cleavage rate and the ratio of two proteins in stable level.</p></div>
 
<div align="center"><p>Fig.5-1 The relationship between the cleavage rate and the ratio of two proteins in stable level.</p></div>
  From the Fig.5-1, we can see that the ratio of two proteins in stable level change little with the changing cleavage rate in a large range, which prove the word we mentioned before.
+
  From the Fig.5-1, we can see that the ratio of two proteins in stable level change little with the changing cleavage rate in a large range, which prove the word we mentioned before. <br/><br/>
   In conclusion, the cleavage rate in our miniToe polycistron plays a role in changing the shape of product curve while have little effect in the ratio of two proteins in stable level.
+
   In conclusion, the cleavage rate in our miniToe polycistron plays an important role in changing the shape of the product curve while having little effect in the ratio of two proteins in a stable level.
 
<br/>
 
<br/>
 
<br/>
 
<br/>
<h3>  6. The model of miniToe System </h3>
+
<h3 id='werrrr'>  6. The provement of miniToe System </h3>
<br/> According to the all the model we discussed before, we can have an conclusion that our miniToe polycistron can be divided into two parts of miniToe system we discuss in the model work for our first system. Because we have proved that the cleavage rate in our miniToe polycistron plays a role in changing the shape of product curve while have little effect in the ratio of two proteins in stable level. And while for the  
+
<br/> According to the all the model we discussed before, we can have a conclusion that our miniToe polycistron can be divided into two parts of the miniToe system we discuss in the model work for our first system. The reasons are as following
 +
    <br /><br />
 +
    1) Firstly, we have proved that the cleavage rate in our miniToe polycistron plays a role in changing the shape of the product curve while having little effect on the ratio of two proteins in the stable level. Which give a base to used the polycistron model we discussed in Chapter 3.<br /><br />
 +
    2) Secondly, in the flexible model which is simplified from coupled transcription-translation model, we treat the two cistrons as two parts in order to a precious description of the dynamics of polycistron.
 
   <br/>
 
   <br/>
   <br/>So we can also prove our miniToe structure’s effect by giving the sensitivity analysis of our first system in Fig.6-1. <br/>
+
   <br/>
<div align="center"><img src="https://static.igem.org/mediawiki/2018/3/3d/T--OUC-China--p61.jpg" height="250"></div>
+
  According to the analyse below, our miniToe system can be easily modeled as the first system. So we can also prove our miniToe structure’s function by giving the sensitivity analysis of our first system in Fig.6-1.<br /><br />
<div align="center"><p>Fig.6-1 The numerical integration of sensitivities of parameters in 30h</p></div> <br/><br/>
+
                                       
And com
+
  <div align="center"><img src="https://static.igem.org/mediawiki/2018/c/c4/T--OUC-China--design3-2.png" height="300"></div>
 +
<div align="center"><p>Fig.6-1 the sensitivity analysis of our first system</p></div>      
 +
                                        <br /><br />As we can see in the Figure.6-1, the parameter k<sub>d3</sub> also sensitive to GFP level which means that you can change it to influence the product. By choosing the hairpin which can increase the k<sub>d3</sub>, we can design the ratio of expression level for two cistrons in polycistron.
 +
<br />  <br />                                     
 
<h3> 7. Future Work</h3>
 
<h3> 7. Future Work</h3>
<br/>1)To explore the relationship between free binding energy and mRNA degradation rate.
+
<br/>1)To explore the relationship between free binding energy of Csy4 and mRNA degradation rate.
<br/><br/>2)To finish the queening theory for the calculation of pre
+
<br/><br/>2)To finish the Queuing Theory for the calculation of Premature termination rate.
 +
<br/><br/> PS. I really want to finish this two parts before the wiki freeze permanently but this is quite hard for one person. I hope I can finish it as an after-iGEM work for the model part. And anyone who interests in this model can contact me with e-mail: chonpcaacpnohc@gmail.com. This e-mail address also for the correction of my model mistake.
 
<br/><br/><br/><br/><br/>
 
<br/><br/><br/><br/><br/>
 
 
+
<h3 id='aaa'>8.Reference</h3>
</div>
+
[1]. Quax T E, Wolf Y I, Koehorst J J, et al. Differential translation tunes uneven production of operon-encoded proteins[J]. Cell Reports, 2013, 4(5):938-944.
 +
<br /><br />
 +
[2].Mäkelä J, Lloyd-Price J, Yli-Harja O, et al. Stochastic sequence-level model of coupled transcription and translation in prokaryotes[J]. Bmc Bioinformatics, 2011, 12(1):121.
 +
                                        <br /><br />
 +
[3]. Tian T, Salis H M. A predictive biophysical model of translational coupling to coordinate and control protein expression in bacterial operons.[J]. Nucleic Acids Research, 2015, 43(14):7137-7151.
 +
                                        <br /><br />
 +
                                        [4]. <a href="https://2016.igem.org/Team:OUC-China/Model">https://2016.igem.org/Team:OUC-China/Model</aa>
 +
                                        <br /><br />
 +
 
 +
                                        <br /><br /><br /><br /><br /><br />
 +
                                       
 +
                                       
 +
                                       
 +
                                       
 +
                                       
 +
                                       
 +
                                        </div>
 
 
 
 

Latest revision as of 17:30, 5 December 2018

Team OUC-China: Main

polycistron

In our miniToe polycistron system, we build a coupled transcription-translation model considering several events in prokaryotes to get a deep understanding of polycistron. Then we simplify this model into a more flexible model to predict how the miniToe structure changes the relative expression level in polycistron.

1.Current model for polycistron expression system


Before we modeling our system, we firstly give a short review on the model of the polycistron expression system. For the common model, they believe that the mRNA of different cistrons in different positions has the same abundance and if they have the same translation rate, the protein which produced by different cistrons is equal if we do not consider the decay rate of mRNA. But in the truth, the natural polycistron has many strategies to regulate the protein abundance such as the overleap or hairpin in 3’. And for the synthetic polycistron, many things just like the transcription polarity and translation coupling play important roles. Many of them control the protein by control the mRNA abundance. So a more precise model for polycistron is needed[1].

2.The coupled transcription-translation model for polycistron


In this part we will present a coupled transcription-translation model for the polycistron in prokaryotes. The model is based on the Andre S Riberio’s work[2], he presents a coupled transcription-translation model for monocistron. We have done some works to extend the model to use in the polycistron.

2.1 The origin model for monocistron


The origin model build by Andre S Riberio is a stochastic delayed differential equation model in sequence-level, and it can be divided into two mian part: the transcriptional part and the translational part. The transcriptional part can be described by the following events:

(1)Initiation and promoter complex formation:

Pro+RNAp RNAp·Pro( τ OC )

(2)Promoter clearance:
RNAp·Pro+ U [1,RNAp+1] O 1 +Pro

(3)Elongation:
A n + U n+RNAp+1 O n+1 + U nRNAp + U R nRNAp

(4)Activation:
O n A n

(5)Pausing:
O n k p O n p O n 1/ τ p O n p

(6)Pause release due to collision:
O n p + A n2RNAp1 0.8 k m O n + A n2RNAp1 i

(7) Pause release by collision
O n p + A n2RNAp1 0.2 k m O n + A n2RNAp 1 p

(8)Arrest:
O n k ar O n ar O n 1/ τ ar O n ar

(9)Editing:
O n k ec O n correcting O n 1/ τ c O n correcting

(10)Premature termination:
O n k pre RNAp+ U [nRNAp,n+RNAp]

(11)Pyrophosporolysis:
O n + U nRNAp1 + U R nRNAp1 k pyro O n1 + U n+RNAp1

(12)Completion:
A n last k f RNAp+ U [ n last , n last RNAp]

(13) mRNA degradation:
A n last k dr ϕ

In the 13 reaction equations above, the Pro stands for the promoter region, the RNAp is RNA polymerase while the Pro-RNAp stands for the promoter which is occupied by the RNA polymerase. An, On and Un are standing for the n th nucleotides in the stage of activated, occupied and unoccupied. U[strat,end] stands for the nucleotides in the range from start number to end number in index. Onp, Onar and Oncorrecting represents the a paused, arrested and error correcting at position n. And due to the temporal steric, the RNAp will occupied about (2ΔRNAp+1) nucleotides. URn denotes transcribed ribonucleotides which are free.

The translation part can be described by the following events:

(1)Initiation:
Rib+ U R [1,n+Rib+1] k trans_init O 1 R +Ri b R

(2)Stepwise translocation:
A n3 R + U R [n+Rib3,n+Rib1] k tm O n2 R

O n2 R k tm O n1 R

O n1 R k tm O n R + U R [n+Rib2,n+Rib]

(3)Activation:
O n R k trans[A,B,C] A n R


(4)Back-translocation:

O n1 R + U R [n+Rib2,nRib] k bt A n3 R + U R [n+Rib3,n+Rib1]

(5)Drop-off:
O n R k drop Rib+ U R [nRib,n+Rib]

(6)Trans-translation:
R k u [Ri b R ]Rib

(7)Elongation completion:
A R n last k u [Ri b R ]Rib+ U [ n last , n last Rib] R + P prem

(8)Folding and activation:
P prem k fold P

(9)Protein degradation:
P k dec ϕ

In the 8 reaction equations above, the Rib stands for the free ribosome while the RibR represents to the ribosome which is binding to the RNA chain. RNAp represents to the footprint of ribosome. Every ribosome will occupied about ( 2Rib+1 ) nucleotides. URn , ORn and ARn are the ribonucleic equivalent fo Un, On and An in transcriptional part, which has similarity meaning.

2.2 The model we improve for the polycistron


Fig.2-1 the organization of operon

Now we have known the coupled transcription-translation model for monocistron. In order to extend it to use in the bi-cistron like Fig.2-1, we simply add another translational part into the old model. So our new model has one translational part and two translational parts for two CDSs in the bi-cistron. Now the most important things are to build the relationship between the two translational part.

The first thing we need to reconsider that is to recalculate the initiation translation rate for the second CDS because this parameter is influenced by translation coupling.

For the translate rates of the second CDS, k2 , can be calculated by the following formula in statistical thermodynamics[3]:
k 2 r reinitiation (2) + e βΔ G total (2)

The formula is divided into two parts to describe the transcript coupling. The first part, rreinitiation(2) , showing that the ribosome terminates the translation of upstream CDS then dissociate and re-initiate the translation of downstream CDS, is called the ribosome re-initiation. The second part, e-βΔGtotal , showing that the elongate along the upstream CDS and unfolding the mRNA structure which increases the expression of the upstream CDS is called de novo ribosome initiation. The two kinds of initiation can be seen in the Fig.2-2.

Fig.2-2 two kinds of initiation

The first part in formula can be calculated by the following formula:
r reinitiation (2) = k p k reinitiation ( d 1,2 ) e βΔ G total (1)
Where the kreinitiation(d1,2) refers to the intergenic distance dependence and the kp refers to the proportionality constant between the ribosome assemble rate and the translation initiation rate.

For the kreinitiation(d1,2) is proved that can be calculate by the formula following:

k reinitiation (d)={ 0.0072±0.0048 0d25 0.0220 d=4 0.0072+0.0004(d+10) 10d25

Where the d= x start x stop 3 , xstart refers to the first nucleotides in j th CDS’s start codon while the xstop refers to first nucleotides in i th CDS ‘s stop codon. And it also points that the kp=10 .

The Δ G total (1) refers to the total binding free energy between the ribosome and 5’UTR, according to the equation:
Δ G total (1) =Δ G mRNArRNA +Δ G spacing +Δ G start +Δ G standby Δ G mRNA


The Δ G mRNArRNA refers to the free energy of folding for mRNA-rRNA complex, which is negative.
The Δ G spacing refers to the free energy for the non-optimal physical distance between SD sequence and the start codon, which is positive.

The Δ G start refers to the free energy for tRN A fMET -start codon complex, which is negative.

The Δ G standby refers to the free energy for the , which is negative.

The Δ G mRNA refers to the free energy of folding for 5’UTR, which is negative.

All these energies can be calculated by the NUPACK suit of energy with mFold 3.0 RNA energy parameter. And five energy can be seen vividly in the Fig.2-3.

Fig.2-3 the five part of the total binding free energy



The second part in formula can be calculated by the following formula:

Δ G total (2) =Δ G mRNArRNA +Δ G spacing +Δ G start +Δ G standby Δ G noncoupling Δ G coupling F coupling


Where the Δ G coupling refers to the free energy of folding for all inhibitory RNA structure that block the standby site, overlap with SD sequence, spacer region or the downstream footprint region of ribosome.

The Δ G noncoupling refers to the free energy of all the other RNA structure except the inhibitory RNA structure. And the F coupling can be calculated by the following formula:
F coupling = 1 1+C e βΔ G total (1)

Which the C is the ribosome-assisted unfolding coefficient. C=0.81 in our study.

The second thing we need to reconsider that is to recalculate the premature termination rate for the second CDS because this parameter is influenced by transcription polarity. Even there is only one transcriptional part, the premature termination rate of the second CDS is higher than the first CDS because the Rho factor will bind to RNA in the intergenic regions to cause rho-dependent termination. And almost 80% of premature termination is caused by Rho-dependent termination. We are going to use the Queuing Theory to build a model to describe it but the model hasn’t finished yet.

Here I will give the most essential idea of the unfinished model. We know the original model of queuing theory has assumed that the two cars will not have an accident and no car try to jump the queue. But we will regard the Rho-factor as the car which jumps the queue and the interaction between Rho-factor and RNA polymerase will be described as the accident. Once the waiting time for the first ribosome is increasing, it will be regarded as the happened of Rho-dependent termination. So by breaking the two basic assumptions of queuing theory, we will have the description of transcriptional polarity. The Rho-dependent termination rate will be calculated as the expectation of the accident times per minutes while the new premature termination rate will be described as the sum of the original premature termination rate and the Rho-dependent termination rate.

2.3 Explore the mRNA abundance using the model


We constructed a polycistron which has two LacZ gene in the model. And then correcting the premature termination and premature termination rate of the second CDS. The other parameters can be found in Andre S Riberio’s work. And carrying out the simulation in StochPy and SGNSim. We get the distribution of mRNA at t=100s and t=600s in Fig2-4 and Fig 2-5.

Fig.2-4 the distribution of mRNA at t=100s

Fig.2-5 the distribution of mRNA at t=600s

The mRNA distribution in 100s stands for the origin time of mRNA distribution while the mRNA distribution in 600s stans for the final state of mRNA distribution. Fig.2-4 shows that the mRNA of first CDSs is produced while the mRNA of the second CDSs haven't been produced now. Which shows that the mRNA of different cistrons in different positions has different translational time. Fig.2-5 shows that the there is a decrease between the mRNA abundance between two CDSs which suggest that the mRNA of different cistrons in different positions has different abundance.

3.A flexible model for polycistron


We have got some points from the coupled transcription-translation model: the mRNA of different cistrons in different positions has different abundance. This phenomenon may be caused by premature termination or something others, and this will result in the different protein level. And different protein level also caused by the different translation time.

The coupled transcription-translation model is too complex and hard to operate. Here we propose a framework to explain the more realistic situation happened in the polycistron but still hope it also can keep the simple forms.

Considering a polycistron like the Fig.2-1 shows, it contains a promoter, a 5’UTR and two CDSs which is separated by the intergenic regions, and following a terminator in the end.

The reaction can be described following four main steps:

(1)The transcription of two CDSs region:
k 1 mRN A 1 k 2 mRN A 2


Here we divided the polycistron into two part with different transcription paraments k 1 , k 2 to deal with the problem of different mRNA abundance due to the premature termination. The two paraments k 1 , k 2 is totally a sequence-dependent as we discuss before.

(2)The degradation of mRNA
mRN A 1 k d 1 ϕ mRN A 2 k d 2 ϕ

The degradation of RNA also divided into two parts with different transcription paraments k d1 , k d2 to deal with the problem of different translational time for two mRNA. Each k di can be divided into two part:
k di = k d k recoup

The k recoup denotes the recoup item for the translational time difference and the k d denotes the common degradation rate of mRNA.

(3)The translation of protein.
mRN A 1 k p 1 mRN A 1  + Protein 1 mRN A 2 k p 2 mRN A 2  + Protein 2

In the translation of two proteins, the two paraments used to describe the translation also should be different considering the translation coupling. And we build a thermodynamic model to calculated it before in the 2.2, now we will give a conclusion for it.

For the translate rates of the first CDS, k1 , can be calculated by the following formula in statistical thermodynamics :
k 1 e βΔ G total (1)



For the translate rates of the second CDS, k2 , can be calculated by the following formula in statistical thermodynamics:
k 2 r reinitiation (2) + e βΔ G total (2)


The calculation method is discussed in 2.2.

(4)The degradation of proteins
Protein 1 kd p 1 ϕ Protein 2 kd p 2 ϕ

We don’t have too much discussion in the proteins degradation here. But we still have something to remind. As the protein we used for our study, sfGFP and RFP, are quite stable exactly. The degradation rate almost equals to the dilution due to the growth diffusion of E. coli colonies.

This model is the simple forms of the coupled transcription-translation model, it keeps the easy form but also reflect the common phenomenon which will happen in the transcript and translation of polycistron including transcript polarity and translation coupling.

4.How the miniToe structure affect the ratio of two CDSs


Fig.4-1 The working processing of miniToe


miniToe polycistron system has two components, Csy4 and the circuit of polycistron. With Csy4 protein, the polycistron will be cut into several mRNA chains with RNA/Csy4 complex at the 3’ UTR as the Fig.4-1 showing. The capability of RNA degradation protection will be much stronger, because of the high stability and affinity of Csy4 binding, which increases the energy threshold for RNA degradation from 3’ UTR. So, the RNA degradation rate will be much lower. For the 5’ end degradation, the Csy4 cut will leave an OH- at 5’ end. the cleavage capability of RNase E will be much lower because there is no pyrophosphate in the 5’ end. Qi’s work has proved that OH-mRNAs exhibit higher gene expression than 5’ PPP-mRNAs.

So by inserting different hairpin which has different binding ability into the miniToe site, we can control the half-life time of mRNA for two CDS in bi-cistron.

5.The main role of cleavage rate for miniToe polycistron


In our miniToe system, the cleavage rate plays an important role in different regulation level. But in the miniToe polycistron system, the cleavage will not influence our goals that change ratio of two CDSs. Now we will give a simple model which based on the OUC-China 2016 [4] to explain the relationship between the cleavage rate and the ratio of two proteins in stable level.

We know that the cleavage of Csy4 can open the switch and the cleavage can affect the product curve which we have been prove in the model work for first and second system. Now we will discuss how does the cleavage rate effect the ratio of two CDSs.

The polycistron system model that OUC-China 2016 constructed is describe by the following reaction:

(1)Translation of polycistron mRNA :
k r mRNA

(2)The translation of polycistron mRNA:
mRNA K p1 mRNA+Protei n 1 +Protei n 2

(3)Cleavage by Enzyme:
mRNA K d1 mRN A 1 +mRN A 2

(4)The translation of two cleaved mRNA
mRN A 1 K p11 mRN A 1 +Protei n 1 mRN A 2 K p12 mRN A 2 +Protei n 2

(5)The degradation of all species
mRNA K d0 ϕ mRN A 1 K d11 ϕ mRN A 2 K d12 ϕ Protei n 1 K dp1 ϕ Protei n 2 K dp2 ϕ


We use the parameters in following table estimated by the OUC-China 2016:
Symbol Definition Value
Kr The constant of Transcription 100
Kp1 The translation rate of polycistron 67.51
Kd1 The cleavage rate of Enzyme 0.185
Kp11 The translation rate of mRNA1 4.005
Kp12 The translation rate of mRNA2 4.005
Kd0 The degradation rate of polycistron mRNA 17.625
Kd11 The degradation rate of mRNA1 18.864
Kd12 The degradation rate of mRNA2 21.816
Kdp1 The degradation rate of protein1 0.0625
Kdp2 The degradation rate of protein2 0.01

By the reaction equation and parameters above, we explore the relationship between the cleavage rate and the ratio of two proteins in the stable level. Fig.5-1 is the result.

Fig.5-1 The relationship between the cleavage rate and the ratio of two proteins in stable level.

From the Fig.5-1, we can see that the ratio of two proteins in stable level change little with the changing cleavage rate in a large range, which prove the word we mentioned before.

In conclusion, the cleavage rate in our miniToe polycistron plays an important role in changing the shape of the product curve while having little effect in the ratio of two proteins in a stable level.

6. The provement of miniToe System


According to the all the model we discussed before, we can have a conclusion that our miniToe polycistron can be divided into two parts of the miniToe system we discuss in the model work for our first system. The reasons are as following

1) Firstly, we have proved that the cleavage rate in our miniToe polycistron plays a role in changing the shape of the product curve while having little effect on the ratio of two proteins in the stable level. Which give a base to used the polycistron model we discussed in Chapter 3.

2) Secondly, in the flexible model which is simplified from coupled transcription-translation model, we treat the two cistrons as two parts in order to a precious description of the dynamics of polycistron.

According to the analyse below, our miniToe system can be easily modeled as the first system. So we can also prove our miniToe structure’s function by giving the sensitivity analysis of our first system in Fig.6-1.

Fig.6-1 the sensitivity analysis of our first system



As we can see in the Figure.6-1, the parameter kd3 also sensitive to GFP level which means that you can change it to influence the product. By choosing the hairpin which can increase the kd3, we can design the ratio of expression level for two cistrons in polycistron.

7. Future Work


1)To explore the relationship between free binding energy of Csy4 and mRNA degradation rate.

2)To finish the Queuing Theory for the calculation of Premature termination rate.

PS. I really want to finish this two parts before the wiki freeze permanently but this is quite hard for one person. I hope I can finish it as an after-iGEM work for the model part. And anyone who interests in this model can contact me with e-mail: chonpcaacpnohc@gmail.com. This e-mail address also for the correction of my model mistake.




8.Reference

[1]. Quax T E, Wolf Y I, Koehorst J J, et al. Differential translation tunes uneven production of operon-encoded proteins[J]. Cell Reports, 2013, 4(5):938-944.

[2].Mäkelä J, Lloyd-Price J, Yli-Harja O, et al. Stochastic sequence-level model of coupled transcription and translation in prokaryotes[J]. Bmc Bioinformatics, 2011, 12(1):121.

[3]. Tian T, Salis H M. A predictive biophysical model of translational coupling to coordinate and control protein expression in bacterial operons.[J]. Nucleic Acids Research, 2015, 43(14):7137-7151.

[4]. https://2016.igem.org/Team:OUC-China/Model







Contact Us : oucigem@163.com | ©2018 OUC IGEM.All Rights Reserved.
banner banner banner banner banner