Line 96: | Line 96: | ||
<article> | <article> | ||
− | In our project we introduce RNA interference (RNAi) and translation repression with small interfering (si)RNAs as an alternative to CRISPR/Cas. To use siRNA as silencing agents for the gene-of -interest we propose a two-step design process. At first potential siRNAs for prokaryotic organisms must be designed. In the second step the silencing effect of these siRNAs can be validated by our siRNA vector system <a href"https://2018.igem.org/Team:Bielefeld-CeBiTec/siRNA">Tace.</a> To facilitate the initial siRNA design step, we developed a siRNA construction tool, which | + | In our project we introduce RNA interference (RNAi) and translation repression with small interfering (si)RNAs as an alternative to CRISPR/Cas. To use siRNA as silencing agents for the gene-of -interest we propose a two-step design process. At first potential siRNAs for prokaryotic organisms must be designed. In the second step the silencing effect of these siRNAs can be validated by our siRNA vector system <a href="https://2018.igem.org/Team:Bielefeld-CeBiTec/siRNA">Tace.</a> To facilitate the initial siRNA design step, we developed a siRNA construction tool, which finds possible siRNAs for a given gene sequence and calculates their gene silencing probabilities. It consists of the three modules: siRNAs for RNAi, siRNA, and check siRNA. Obtained siRNAs are perfectly compatible with our siRNA vector system. To the best of our knowledge, this is the first tool dedicated to predicting customized siRNA for application in prokaryotes. This Python tool comes in two versions: a command line application and an easy-to-use graphical interface. |
</article> | </article> | ||
Line 103: | Line 103: | ||
<article> | <article> | ||
− | siRNAs are small single- or double-stranded RNAs with an average length of 21-25 nucleotides. They are non-coding RNAs which | + | siRNAs are small single- or double-stranded RNAs with an average length of 21-25 nucleotides. They are non-coding RNAs which bind a specific complementary coding mRNA and silence its function. During eukaryotic RNAi, siRNAs are loaded to Argonaute proteins, which carry out the repression, either by blocking mRNA translation or by degrading the mRNA (Siomi and Siomi, 2009). More detailed information on both possible siRNAs mechanisms is found <a href="https://2018.igem.org/Team:Bielefeld-CeBiTec/siRNA">here.</a> |
</article> | </article> | ||
Line 142: | Line 142: | ||
<article> | <article> | ||
− | In 2012 the <a href="https://2012.igem.org/Team:SYSU-Software/Models#pp2">iGEM team SYSU-Software</a> integrated an siRNA cDNA designer as a small part | + | In 2012 the <a href="https://2012.igem.org/Team:SYSU-Software/Models#pp2">iGEM team SYSU-Software</a> integrated an siRNA cDNA designer as a small part of their project. siRNAs designed with this tool were applicable in eukaryotic organisms. They included two different design methods: Tom Tuschl’s method and Rational siRNA design. |
</article> | </article> | ||
Line 148: | Line 148: | ||
<img class="figure sixty" src="https://static.igem.org/mediawiki/2018/c/c6/T--Bielefeld-CeBiTec--Tom_Tuschl_small_vk.png"> | <img class="figure sixty" src="https://static.igem.org/mediawiki/2018/c/c6/T--Bielefeld-CeBiTec--Tom_Tuschl_small_vk.png"> | ||
<figcaption> | <figcaption> | ||
− | <b>Figure 1:</b> Structure of an siRNA designed with Tom Tuschl's method. Both siRNA have a characteristic 'TT' overhang at the 3'-terminus. | + | <b>Figure 1:</b> Structure of an siRNA designed with Tom Tuschl's method. Both siRNA have a characteristic 'TT' overhang at the 3'-terminus (Elbashir et al., 2001). |
</figcaption> | </figcaption> | ||
</figure> | </figure> | ||
<article> | <article> | ||
− | Tom Tuschl’s method focuses mainly on the existence of 5’ and 3’ ‘TT’ overhangs (Figure | + | Tom Tuschl’s method focuses mainly on the existence of 5’ and 3’ ‘TT’ overhangs (Figure 1)(Elbashir et al., 2001). These are not compatible with overhangs and scaffold sequences necessary for the prokaryotic mechanisms. Therefore, we decided to use the Ui-Tei rules as an alternative design method (Naito and Ui-Tei, 2012). Furthermore, we adapted the Rational siRNA design since it was more suitable for our application (Reynolds et al., 2004). Both design rules apply only to the 19nt long target binding sequence. |
</article> | </article> | ||
Line 160: | Line 160: | ||
<article> | <article> | ||
− | By a systematic analysis of 180 eukaryotic siRNAs Reynolds et al. identified eight criteria that are important for | + | By a systematic analysis of 180 eukaryotic siRNAs Reynolds et al. identified eight criteria that are important for their functionality (Reynolds et al., 2004). Each criterion gets a score that is either positive or negative, corresponding to its effect on the siRNA. All siRNA candidates that have a score above six are potential highly functional siRNAs. |
</article> | </article> | ||
<table id="t01" class="centern" style="margin-top:30px; margin-bottom:30px;"> | <table id="t01" class="centern" style="margin-top:30px; margin-bottom:30px;"> | ||
− | <caption style="line-height:1.5; text.align:left;"><b>Table 1:</b>Rational siRNA design criteria with corresponding score</caption> | + | <caption style="line-height:1.5; text.align:left;"><b>Table 1:</b>Rational siRNA design criteria with corresponding score (Reynolds et al., 2004)</caption> |
<tr> | <tr> | ||
<th>Rule</th> | <th>Rule</th> | ||
Line 174: | Line 174: | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td>At least 3 'A/ | + | <td>At least 3 'A/T' bases at positions 15-19</td> |
− | <td>+1 (for each 'A | + | <td>+1 (for each 'A' or 'T')</td> |
</tr> | </tr> | ||
<tr> | <tr> | ||
Line 182: | Line 182: | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td>An 'A' | + | <td>An 'A' at position 3</td> |
<td>+1</td> | <td>+1</td> | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td>An 'A' | + | <td>An 'A' at position 19</td> |
<td>+1</td> | <td>+1</td> | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td> | + | <td>A 'T' at position 19</td> |
<td>+1</td> | <td>+1</td> | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td> | + | <td>An 'A' or 'T' at positin 19</td> |
<td>-1</td> | <td>-1</td> | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td>A | + | <td>An 'A', 'C' or 'T' at position 13</td> |
<td>-1</td> | <td>-1</td> | ||
</tr> | </tr> | ||
Line 205: | Line 205: | ||
<article> | <article> | ||
− | The melting | + | The melting temperature Tm is calculated as follows (Kibbe, 2007): |
$$ T_m = 79.8 + (18.5 * log_{10}[Na^+]) + (58.4 * [\text{G/C content}]) \\+ (11.8 * [\text{G/C content}]^2) - \left(\frac{820}{\text{[G/C content]}}\right)$$ | $$ T_m = 79.8 + (18.5 * log_{10}[Na^+]) + (58.4 * [\text{G/C content}]) \\+ (11.8 * [\text{G/C content}]^2) - \left(\frac{820}{\text{[G/C content]}}\right)$$ | ||
Line 216: | Line 216: | ||
<article> | <article> | ||
− | Ui-Tei et al analyzed 62 eukaryotic siRNAs and identified four design rules for effective siRNAs (Ui-Tei, 2004). Only siRNAs fulfilling all four criteria are considered functional siRNAs. | + | Ui-Tei <i>et al.</i> analyzed 62 eukaryotic siRNAs and identified four design rules for effective siRNAs (Ui-Tei, 2004). Only siRNAs fulfilling all four criteria are considered functional siRNAs. |
</article> | </article> | ||
Line 222: | Line 222: | ||
<li>An ‘A’ or ‘T’ at position 19</li> | <li>An ‘A’ or ‘T’ at position 19</li> | ||
<li>A ‘G’ or ‘C’ at position 1</li> | <li>A ‘G’ or ‘C’ at position 1</li> | ||
− | <li>At least five | + | <li>At least five ‘T’ or ‘A’ residues from positions 13 to 19</li> |
<li>No ‘GC’ stretch more than 9nt long</li> | <li>No ‘GC’ stretch more than 9nt long</li> | ||
</ol> | </ol> | ||
Line 230: | Line 230: | ||
<article> | <article> | ||
− | Not only the sequences of | + | Not only the sequences of potential effective siRNAs are to be determined and returned by the tool, but also the probability with which they are effective. This probability can be calculated with the help of Bayes’ theorem by calculating probabilities of dependent events. The following calculations and formulas are based on Takasaki (2009). |
</br> | </br> | ||
− | The initial hypothesis is that the given siRNA effectively silences an mRNA. To perform the calculations a prior probability is necessary. The prior probability for effective gene silencing of mammalian genes can be obtained from former siRNA experiments and is approximately 0.1 (Takasaki, 2009). Since we have no data on prokaryotic siRNAs, we use the same prior probability for our | + | The initial hypothesis is that the given siRNA effectively silences an mRNA. To perform the calculations a prior probability is necessary. The prior probability for effective gene silencing of mammalian genes can be obtained from former siRNA experiments and is approximately 0.1 (Takasaki, 2009). Since we have no data on prokaryotic siRNAs, we use the same prior probability for our predictions. </br> |
The gene silencing probability can be described as: | The gene silencing probability can be described as: | ||
Line 241: | Line 241: | ||
− | \(P^{eff}\) is the prior probability | + | \(P^{eff}\) is the prior probability and set to0.1 as mentioned above. The siRNA sequence is represented by \(X\), where \(X_1, X_2 ... X_n\) belong to the possible bases adenine, guanine, cytosine and thymine at each position n. As \(P(X|eff)\) is the probability, that the given siRNA sequence will effectively silence if the bases belong to the frequent bases of common effective siRNAs, it is computed as the product of the probabilities that a particular base is located at a particular position of the siRNA: |
$$ P(X|eff) = \prod_{i=1}^{19} q_{x_i^n}^{eff} \qquad (2)$$ | $$ P(X|eff) = \prod_{i=1}^{19} q_{x_i^n}^{eff} \qquad (2)$$ | ||
</article> | </article> | ||
<article> | <article> | ||
− | \(q_{x_i^n}^{eff}\) indicates how likely the occurrence of base \(n\) is at position \(i\) based on known effective siRNAs. It can also be called frequency ratio of \(n\) at position \(i\). The last element \(P(X)\) of formula \((1)\) is the possibility that \(X\) will effectively silence the target sequence. It is the sum of the probability that \(X\) is effective if its nucleotides are found in effective siRNAs | + | \(q_{x_i^n}^{eff}\) indicates how likely the occurrence of base \(n\) is at position \(i\) based on known effective siRNAs. It can also be called frequency ratio of \(n\) at position \(i\). The last element \(P(X)\) of formula \((1)\) is the possibility that \(X\) will effectively silence the target sequence. It is the sum of the probability that \(X\) is effective if its nucleotides are found in effective siRNAs and the probability that \(X\) is effective if its nucleotides are found in ineffective siRNAs. Both probabilities are weighted with the prior probabilities \(P^{eff}\) and \(P^{inf} = 1-P^{eff}\). |
$$ P(X) = P^{eff} P(X|eff)+P^{inf} P(X|inf) \qquad (3)$$ | $$ P(X) = P^{eff} P(X|eff)+P^{inf} P(X|inf) \qquad (3)$$ | ||
</article> | </article> | ||
<article> | <article> | ||
− | \(P(X|inf)\) is calculated | + | \(P(X|inf)\) is calculated similarly to \(P(X|eff)\) and is the probability that \(X\) will effectively silence if the nucleotides belong to the frequent nucleotides of common ineffective siRNAs. |
$$ P(X|inf) = \prod_{i=1}^{19} q_{x_i^n}^{inf} \qquad (4)$$ | $$ P(X|inf) = \prod_{i=1}^{19} q_{x_i^n}^{inf} \qquad (4)$$ | ||
Line 274: | Line 274: | ||
<article> | <article> | ||
− | For the frequency ratios 833 effective and 847 ineffective siRNAs from previous publications were analyzed. For each nucleotide, the probability of occurrence was determined for each position of the siRNA. Different models were taken into account in the calculation. First of all, the occurrence of the different nucleotides at positions 1 to 19 can be considered independently. The probabilities for each position are then calculated independently. However, the occurrences of the nucleotides can also be considered | + | For the frequency ratios 833 effective and 847 ineffective siRNAs from previous publications were analyzed. For each nucleotide, the probability of occurrence was determined for each position of the siRNA. Different models were taken into account in the calculation. First of all, the occurrence of the different nucleotides at positions 1 to 19 can be considered independently. The probabilities for each position are then calculated independently. However, the occurrences of the nucleotides can also be considered as dependent on each other. This means that the occurrence of a nucleotide depends on the nucleotide at the position before. For the calculation of dependent probabilities, the Simple Markow Model was used. It has been found that the resulting silence probability is most accurate when the frequency ratios of the effective siRNAs are calculated dependent and the frequency ratios of the ineffective siRNAs are calculated independent. All frequency ratios can be looked up <a href="https://static.igem.org/mediawiki/2018/5/51/T--Bielefeld-CeBiTec--frequency_ratios_vk.pdf" style="padding-right:0; margin-right:0;">here</a>. |
</br> | </br> | ||
− | + | In combination with the frequency ratios it is now possible to calculate the silencing probability for the 19 bp long binding site of siRNAs. | |
</article> | </article> | ||
Revision as of 21:24, 17 October 2018
siRCon - A siRNA Constructor
siRNAS short introduction
siRNA overhangs and scaffolds
Choosing appropriate design methods
Rational siRNA design
Rule | Score |
---|---|
30%-52% G/C content | +1 |
At least 3 'A/T' bases at positions 15-19 | +1 (for each 'A' or 'T') |
Absence of internal repeats (\(T_m \lt 20\)) | +1 |
An 'A' at position 3 | +1 |
An 'A' at position 19 | +1 |
A 'T' at position 19 | +1 |
An 'A' or 'T' at positin 19 | -1 |
An 'A', 'C' or 'T' at position 13 | -1 |
Ui-Tei rule
- An ‘A’ or ‘T’ at position 19
- A ‘G’ or ‘C’ at position 1
- At least five ‘T’ or ‘A’ residues from positions 13 to 19
- No ‘GC’ stretch more than 9nt long
Calculating silencing probability
Check siRNA
Command line application
The command line application can be obtained directly here or downloaded from our GitHub repository. For the execution of this command line application Python 2.7 needs to be installed.
Graphical Interface usage
As the command line application, the graphical interface version can either be downloaded directly here, or via our GitHub repository.
In the graphical interface, the modules are divided into different tabs (Figure 6). The last tab contains usage and copyright information.
1. siRNA for RNAi
- Insert gene sequence
- Choose Tace vector system (optionally)
- Constructions of siRNAs
- View resulting siRNAs (sense and antisense sequence) and their corresponding probability
- Decide if siRNAs should be saved with MicC scaffold (only if Tace is not used)
- Save results as FASTA file
2. siRNA for silencing
- Insert gene sequence
- Choose Tace vector system (optionally)
- Constructions of siRNAs
- View resulting siRNAs (sense and antisense sequence) and their corresponding probability
- Decide if siRNAs should be saved with MicC scaffold (only if Tace is not used)
- Decide if siRNAs should be saved with OmpA scaffold (only if Tace is not used)
- Save results as FASTA file
3. Check siRNA
- Insert gene sequence
- Insert siRNA sequences
- Choose method the siRNA was constructed for (siRNA for RNAi or siRNA for silencing)
- Choose if siRNA was constructed for Tace (optionally)
- Validation of entered siRNA for given target gene sequences
- View results
- Save results (optionally)
Outlook
Elbashir, S.M., Harborth, J., Lendeckel, W., Yalcin, A., Weber, K., and Tuschl, T. (2001). Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature 411: 494–498.
Foley, P.L., Hsieh, P., Luciano, D.J., and Belasco, J.G. (2015). Specificity and evolutionary conservation of the Escherichia coli RNA pyrophosphohydrolase RppH. J. Biol. Chem. 290: 9478–9486.
Kibbe, W.A. (2007). OligoCalc: an online oligonucleotide properties calculator. Nucleic Acids Res 35: W43–W46.
Na, D., Yoo, S.M., Chung, H., Park, H., Park, J.H., and Lee, S.Y. (2013). Metabolic engineering of Escherichia coli using synthetic small regulatory RNAs. Nat. Biotechnol. 31: 170–174.
Naito, Y. and Ui-Tei, K. (2012). siRNA Design Software for a Target Gene-Specific RNA Interference. Front Genet 3.
Reynolds, A., Leake, D., Boese, Q., Scaringe, S., Marshall, W.S., and Khvorova, A. (2004). Rational siRNA design for RNA interference. Nature Biotechnology 22: 326–330.
Siomi, H. and Siomi, M.C. (2009). On the road to reading the RNA-interference code. Nature 457: 396–404.
Takasaki, S. (2009). Selecting effective siRNA target sequences by using Bayes’ theorem. Computational Biology and Chemistry 33: 368–372.
Foley, P.L., Hsieh, P., Luciano, D.J., and Belasco, J.G. (2015). Specificity and evolutionary conservation of the Escherichia coli RNA pyrophosphohydrolase RppH. J. Biol. Chem. 290: 9478–9486.
Kibbe, W.A. (2007). OligoCalc: an online oligonucleotide properties calculator. Nucleic Acids Res 35: W43–W46.
Na, D., Yoo, S.M., Chung, H., Park, H., Park, J.H., and Lee, S.Y. (2013). Metabolic engineering of Escherichia coli using synthetic small regulatory RNAs. Nat. Biotechnol. 31: 170–174.
Naito, Y. and Ui-Tei, K. (2012). siRNA Design Software for a Target Gene-Specific RNA Interference. Front Genet 3.
Reynolds, A., Leake, D., Boese, Q., Scaringe, S., Marshall, W.S., and Khvorova, A. (2004). Rational siRNA design for RNA interference. Nature Biotechnology 22: 326–330.
Siomi, H. and Siomi, M.C. (2009). On the road to reading the RNA-interference code. Nature 457: 396–404.
Takasaki, S. (2009). Selecting effective siRNA target sequences by using Bayes’ theorem. Computational Biology and Chemistry 33: 368–372.