# Team:Edinburgh UG/Modelling Collaboration

Edinburgh iGEM 2018

# Modelling Collaboration

## Introduction

Team Vilnius-Lithuania aims to use the PURE cell free system [1] in order to integrate proteins into the membrane of the liposome from the inside. The BamA complex is responsible for integrating these proteins; OmpA and lgA hence in order to ensure quick integration BamA needs to be consistently present at high levels throughout the expression of OmpA and lgA. This mechanistic model aims to examine the simultaneous expression of these proteins and compare the results obtained from different starting parameters, primarily different starting volumes of bamA RNA in order to quantify the effectiveness of an initial addition of RNA to ensure fast production of BamA.

## Mass Action Equations

Mass Action Equations are commonly used to represent chemical reactions and provide a starting point for mechanistic modelling of a variety of phenomena. The laws of mass action state that the rate of any chemical reaction is proportional to the product of the masses of the reacting substances, with each mass raised to a power equal to the coefficient that occurs in the chemical equation [2]. The mass action equations in Figure 1 can be used to represent protein expression:

Each of these equations is used in triplicate to represent expression of BamA, OmpA and lgA respectively and from these mass action equations a system of ordinary differential equations can be derived.

## Ordinary Differential Equations

The model uses a simple set of ordinary differential equations (ODEs):

In order to solve this system it is first necessary to derive values for all the parameters used:

copiesBamA, copiesOmpA, copieslgA - Number of plasmid copies - In order to take into account the effect of different starting masses of DNA for BamA, OmpA and lgA it is neccesary to calculate the number of plasmids present from which each protein may be expressed. DNA added to the PURE system should be between 25 and 1000ng per reaction [1] and hence it was decided to screen over a number of values in this range. A single base pair has mass of 650 Daltons [3] hence it is possible to calculate mass in kDa of each plasmid with its particular insert when each plasmid length is known [4,5]:

• peT2Ab with BamA - 5089.5 kDa
• pRSETb with OmpA - 2550.95 kDa
• pRSETb with lgA - 2490.8 kDa

The conversion 1 ng = 6.022*1017 kDa allows the calculation of the number of plasmids present for a particular number of ng of DNA added (as seen below in Table 1):

Table 1. Number of Plasmid Copies

peT2AB with BamA
DNA added (ng) Number of Copies
25 2.985*1015
250 2.985*1016
1000 1.183*1017

pRSETb with OmpA
DNA added (ng) Number of Copies
25 5.902*1015
250 5.902*1016
1000 2.361*1017

pRSETb with lgA
DNA added (ng) Number of Copies
25 6.044*1015
250 6.044*1016
1000 2.418*1017

Although it would have been desirable to screen over a greater number of values within the 25-1000ng range the resulting increase in size of the already large parameter space made this intractable.

trBamA, trOmpA, trlgA - Transcription Rate - Transcription is known to occur at approximately 60 nucleotides per second [6]. Length of each gene was found to be 2430 nucleotides for BamA [7], 1038 nucleotides for OmpA [8] and 945 nucleotides for lgA [9]. Computing the transcription rate per minute for each of BamA, OmpA and lgA:

• trBamA - (60/2430)∗60 = 1.48 mRNAs per minute
• trOmpA - (60/1038)∗60 = 3.47 mRNAs per minute
• trlgA - (60/945)∗60 = 3.81 mRNAs per minute

degmRNA BamA, degmRNA OmpA, degmRNA lgA - mRNA Degradation Rate - Average mRNA half-life is known to be approximately 5 minutes however to reflect variation in this number between different mRNAs, we screen here over half-lives of 1, 5, 10 and 15 minutes for each protein [10]. Degradation rate is calculable from half-life using the formula; deg = ln(2)/halflife [11]. Using this formula degradation rates per minute are calculated as:

• 1 minute half-life - 0.69 mRNAs per minute
• 5 minute half-life - 0.14 mRNAs per minute
• 10 minute half-life - 0.07 mRNAs per minute
• 15 minute half-life - 0.05 mRNAs per minute

trlBamA, trlOmpA, trllgA - Translation Rate - Translation is known to occur at approximately 20 amino acids per second [12]. Length of each protein was found for BamA [7], OmpA [8] and lgA [9] by searching through protein databases. Here we show the calculation used to find the translation rate per minute for BamA, OmpA and lgA:

• trlBamA - (20/810)∗60 = 1.48 proteins per minute
• trlOmpA - (20/346)∗60 = 3.47 proteins per minute
• trllgA - (20/315)∗60 = 3.81 proteins per minute

degBamA, degOmpA, deglgA - Protein Degradation Rate - Protein half-life was determined using ProtParam Tool [13]. ProtParam uses the N-end rule [14] to determine protein half-life, the estimates given for each of BamA, OmpA and lgA are (>)10hrs in E.coli and 30hrs in reticulocytes hence an average 20hr half-life. In order to reflect the inexact nature of these computationally derived half-lives, we screen over possible half-lives of 10, 20 and 30 hours for each of BamA, OmpA and lgA. Applying the same degradation rate formula as previously deg = ln(2)/halflife [11] this yields degredation rates per minute of:

• 10 hour half-life - 1.16 * 10 -3 proteins per minute
• 20 hour half-life - 5.78 * 10 -4 proteins per minute
• 30 hour half-life - 3.85 * 10 -4 proteins per minute

## Starting Conditions

In order to examine the effects of higher initial mass of BamA RNA 6 different values were screened over. The guidance for the PURE system [1] suggests addition of RNA between 1 and 5 ug. Using the masses of sense and anti-sense strands of BamA RNA in kDA [15] (830.382 and 820.8 respectively) and conversion 1 ug = 6.022*1020 kDa the number of RNA molecules added can be calculated using the formula (ug added *(6.022*1020)/((830.382 + 820.8)/2)))) as seen below in table 2:

Table 2. Initial RNA Levels

0 0
1 7.29*1017
2 1.46*1018
3 2.19*1018
4 2.92*1018
5 3.65*1018

## Results

The primary aim of this model is to identify parameters leading to fast and consistently high levels of BamA under conditions of co-expression of BamA, OmpA and lgA. In order to identify those particular parameters leading to these conditions it is first prudent to examine some general trends. Figure 2 displays the minimum, maximum and average levels of mRNAs and protein for BamA, OmpA and lgA. The value at each time point was obtained from the range of values at the same time point across each run of the model.

Table 3 summarizes this data in the form of number of molecules of mRNA and protein for each average, minimum and maximum plot and each of BamA, OmpA and lgA after 2 hours.

Table 3. Protein Levels After 2 Hours

Protein Number of Molecules
Average
BamA 2.8*1020
OmpA 2.78*1021
lgA 3.44*1021
Minimum
BamA 1.95*1018
OmpA 2.13*1019
lgA 2.63*1019
Maximum
BamA 1.28*1021
OmpA 1.28*1022
lgA 1.58*1022

In the average mRNA graph in Figure 2 we can see ompA and lgA mRNA present in rapidly increasingly levels until an equilibrium is reached (around 1 hour), the increased rate and higher equilibrium value of lgA mRNA are due to its shorter gene length and hence faster mRNA synthesis than OmpA. Average bamA mRNA starts of at comparatively high levels due to the initial additions of bamA mRNA however the mass of DNA present in the average case is not sufficient to maintain these high mRNA levels and hence bamA mRNA drops to an equilibrium level around 25% of OmpA and lgA mRNA.

Minimum mRNA levels occur when no addition of initial mRNA is made as well as when screening over low DNA masses and fast mRNA degredation rates. In this area of the parameter space mRNA for all 3 proteins quickly reach low equilibrium levels characterized primarily by effect of gene length on speed of mRNA synthesis.

Maximum mRNA levels are reached with large additions of bamA mRNA and large masses of DNA. The resulting synthesis of bamA mRNA maintains a high equilibrium value throughout the entire model run. mRNA of OmpA and lgA in the maximal case quickly increases to a high equilibrium value after around 100 minutes.

By comparing the mRNA levels in average, minimum and maximum cases we can observe as might be expected that increasing mass of DNA means that equilibrium levels of mRNA take longer to reach and occur at a higher value.

In average, minimum and maximum cases the protein expression of OmpA, lgA and BamA follows the same trend which is unaffected by fluctuations in mRNA level. Each protein is expressed at a rate primarily proportional to its length and to a magnitude primarily dependent on the mass of available DNA.

Figure 3 displays the distribution of number of BamA molecules present at 20 minute intervals, appendix B contains analogous plots for OmpA and lgA.

At each 20 minute interval we can observe the distribution of BamA expression stretches to the right as more protein is synthesized. Within the initial 40 minutes the distribution is heavily grouped together as there has not been enough time for BamA to be present in high levels. Between 60 and 100 minutes the distribution spreads out and settles into distinct groups corresponding to particular parameter settings which increasingly move to the right as more protein is synthesized.

Figure 4 shows the distribution of number of bamA mRNA molecules present at 20 minute intervals, appendix B contains analogous plots for OmpA and lgA.

Distribution of bamA mRNA is extremely spread out initially during the first 40 minutes due to the variation in starting conditions between runs and rapid initial changes in mRNA level in cases of low initial mRNA and very high DNA masses or cases of high initial mRNA and very low DNA masses. By 120 minutes the distribution has settled in a number of distinct peaks corresponding to parameter sets leading to a particular bamA mRNA equilibrium value.

Figure 5 displays average levels of BamA expression per combination of starting conditions.

By examining the model results broken down by starting conditions we can make recommendations about how best to proceed in the wet lab. From Figure 5 we can see by comparing BamA expression with 25 nanograms DNA and 0 micrograms RNA added compared to when RNA is added for the same mass of DNA there is a much increased rate of initial BamA synthesis as a result of mRNA addition. The increased rate of BamA synthesis due to mRNA addition is present in a lesser extent as higher initial masses of DNA are used. When using the maximum DNA mass of 1000 nanograms addition of mRNA in the quantities supported by the PURE system make little effect to protein expression.

## Sensitivity Analysis

Sensitivity Analysis is a vital part of the modelling work flow, by analyzing which parameters contribute most to the uncertainty of the models output we can recognize those parameters that are most important to fine tune from wet lab results.

Fourier Amplitude Sensitivity Testing (FAST) is a computationally efficient method to calculate variance based sensitivity indices used here via the SALib library [16]. Intuitively variance represents the spread of a set of numbers. FAST indices represent the proportion of the output variance of the model attributable to a particular variable and its interactions. Focusing on BamA expression as the protein of interest total order FAST sensitivity indices were calculated using the BamA protein level each 20 minutes as the model output.

Number of BamA plasmid copies contributes most to output variance over the whole time span due to having an extremely high value and having a major effect on where mRNA equilibrium occurs. bamA mRNA degredation rate is considerably faster than BamA degredation rate - with mRNA halflife of the order of minutes and protein halflife the order of hours - hence the greater FAST index.

## Conclusion

Whilst the addition of bamA mRNA does increase the rate of protein expression this effect is proportional to the mass of initial DNA. Initial mass of BamA DNA determines at how high an expression level the equilibrium for BamA expression is reached. In order for addition of mRNA to give the most benefits it should be used where the mass of expressed protein does not need to be particularly high but where the protein needs to quickly present. Due to the shorter size of proteins lgA and OmpA compared to BamA it is not necessary to have maximal expression of these proteins to attain them in an equivalent quantity to BamA. In cases where maximal expression level of BamA is desired (by using 1000 nanograms DNA) the addition of mRNA in the quantities supported by the PURE system produces little change to BamA expression and hence may not be justified given the extra wet lab work incurred.

## References

1. PURE cell free system - https://international.neb.com/products/e6800-purexpress-invitro-protein-synthesis-kit
2. A. B. Kudryavtsev, R. F. Jameson, and W. Linert. The Law of Mass Action. Springer Science & Business Media, 2001.
3. IDT Molecular Facts and Figures - http://sfvideo.blob.core.windows.net/sitefinity/docs/default-source/biotech-basics/molecular-facts-and-figures.pdf?sfvrsn=4563407_4
6. Transcription Rate - http://book.bionumbers.org/what-is-faster-transcription-or-translation/
7. BamA - https://www.uniprot.org/uniprot/P0A940
8. OmpA - https://www.uniprot.org/uniprot/P0A910
9. lgA - https://www.uniprot.org/uniprot/A6Y9S5