Design
In biological aspect, BNU-China wants to solve the questions of plasmids’ genetic and structural instability, which directly cause strain’s degeneration and then weaken the function of target genes. These instability and degeneration-related influence can be clearly valued by observing expression-related phenomenon, like the amount of production and enzyme activity.
Design: Specific gene circuit:
We firstly design the gene loops described in figure 2. When engineered strains produce the wanted product, the downstream growth factor will be expressed and accelerate the growth of strains. In opposite situation, the growth factor will not be activated and strains will relatively get the growth inhibition.
In order to use specific experiments to verify our first idea, we separate the loop into three modules. Module 1 is growth factor module aiming at give strains the growth advantage. Module 2 includes a valuable product, a specific allosteric protein, and a negtive promoter. General relationship in module 2 can be clarified by that in β-galactosidase, lac1 repressor and lac promoter. Module 3 is where the product can be generated.
Module 1
According to literature, expression of glucose dehydrogenase (gdh) can significantly enhance the growth rate of Escherichia coli and Bacillus subtilis. In Bacillus subtilis, when consumed same amount of glucose, the dry weight of the gdh-overexpressed strain is larger. That is to say, gdh-overexpression significantly increases the metabolic efficiency of glucose1-3. Gdh works in the branch of the pentose phosphate pathway. It converts glucose to gluconate, and then gluconate is phosphorylated into 6-P-gluconate and enter the pentose phosphate pathway (Figure 3). This metabolic process generate products includes NADPH, which significantly increases the cell's reducing power and helps cells fight against oxygen free radicals, and different carbon number compounds (C3 to C7), which play an important role in cell damage repair, energy supply and so on (Figure 4). (For example, product ribose-5-phosphate is a precursor to synthetic DNA and RNA, and product glyceraldehyde triphosphate is an important intermediate in glycolysis.)
Module 2
There are some allosteric proteins in E. coli. They can works together with other genes, forming an operon to perform certain function. One of the operons, emrRAB (also known as mprA or marRAB), generally has resistance to some bacteria-harmful uncoupler. emrR, the repressor, can be expressed and bind to its own promoter and inhibit the express of emrA and emrB. When emrR bind with uncoupler like DNP, it is no longer able to bind with its promoter and open or enhance the expression of emrA and emrB. Then cell membrane’s permeability to uncoupler can increase allowing the bacteria to pump uncouplers out of the cell. According to literature, salicylic acid (SA) is also one of the allosteric inhibitors of emrR.
Module 3: pchBA producing salicylic acid
Salicylic acid (SA) is an important organic product that can be used in the production of cosmetics, preservatives, medical supplies, and so on. Nowadays, commonly used industrial synthetic method to produce SA is the Kolbe-Schmitt method, which is a three-step reaction using phenol as raw material. However, this method costs a lot of energy and quite a long time, and phenol has a carcinogenic effect, which is very harmful to the human body and the environment. Therefore, it is promising to replace chemical synthesis method by biosynthesis method.
In nature, only few strains can produce SA, and they are difficult to be isolated or be screened. Recent years, some laboratories have succeed in SA’s biosynthesis in E.coli by utilizing the shikimate synthesis pathway, with a maximum yield of up to 1 g/L. Chorismate, an intermediate product in the pathway, is catalyzed to iso-branched acid by isochorismate synthase (ICS), and then catalyzed to SA by iso-branched pyruvate lyase (IPL) (Fig.7).
Based on the research results from literature, we have the idea of applying our anti-degeneration loop to produce SA in order to increase production and reduce costs. It will undoubtedly have a positive effect on the industrialized biosynthesis of SA.
Previous experiments and literature have shown that both Pseudomonas putida and Pseudomonas fluorescens contains natural ICS and IPL genes and both genes can express and work in E.coli. Meanwhile, we found pchBA (Part: BBa_J45319), an IPL-ICS genes extracted from Pseudomonas fluorescens, in iGEM2018kit. So we use this pchBA and the positive-feedback loop mentioned above to design the pathway showed in figure 8. When pchBA is correctly expressed, SA is produced in the bacteria and makes the downstream gdh expressed. The expression of gdh can give bacteria growth advantage, so that the bacteria can produce larger amount of SA. In this way, we construct a positive feedback, allowing a vigorously growth and higher production to bacteria.
Design: universal gene circuit:
We have also realized that our genetic loop has an unavoidable defect: If we apply this pathway to prevent degradation of an engineered bacterial exogenous metabolic pathway, then we must find a repressor protein. The repressor protein can specifically bind to the promoter sequence to prevent transcription and change in conformation with the fermentation product to leave the specific sequence. However, the proteins present in nature are limited, so it is difficult to specifically find a repressor protein for a certain product molecule. And redesigning the repressor protein and then synthesizing is very complicated and cumbersome. Although many studies have provided good methods of finding, designing or engineering repressor proteins, their practical applications are still not very convenient, in other words, the existing gene loops do not have good universality.
Based on the above discussion, the existing experimental results and HP’s research, we can get the following conclusions: (1) The yield of salicylic acid will not be significantly enhanced using previous gene loop, but the response of emrR protein to salicylic acid is very sensitive, so salicylic acid can be used as a signal molecule; (2) The existing gene loop is not universal, and the number of the existing transducers that can respond to the product – the repressor proteins is limited; (3)In industrial production, E. coli is commonly used to produce a number of protein and peptide products.
In response to the above conclusions, we have designed a new universal anti-degradation loop for use in E. coli (Principle in Figure 9).
By embodying the above ideas, we have designed the following possible integrated pathways.
Before verifying the overall function of the intergrate plasmid, it is necessary to verify the operation of each part. So we separate them into the following modules. Module 4 includes Stop-Start Codon “TGATG”, Module 5 includes cI repressor and its promoter, Module 6 includes T7 RNA polymerase and its promoter.
Module 4: Stop-Start Codon “TGATG”
TGATG is one of the core components in our system. Although made up of five simple bases, it has great functional advantages.
TGATG could form a special unit named “Stop-Start Codon”, which means stop codon(UGA) is in close proximity to start codon(AUG). There are many similar combinations: UGA combines with AUG(UGAAUG/UGAUG), UAA combines with AUG(UAAAUG/UAAUG), UAG combines with AUG(UAGAUG). As for TGATG we use in SAM, the stop codon overlap with start codon.
In the process of translation, ribosome ought to drop at stop codon, but it has some possibility that the ribosome could directly identify start codon and move on to translate the next gene, which means we could save a promoter.
Actually, this phenomenon is not common. According to Vivian, The transcription ratio of the downstream gene and the upstream gene is about 1:5-1:10. Multiple factors impact the data, such as the strain of the bacteria, the plasmid backbone, the gene itself... and so on.
Module 5 : cI repressor and its promoter
One of the keys to our project is how to couple the target gene to the expression of growth-promoting factors, and how to avoid minor variations in the regulatory part bringing a huge impact on the entire system. Initially our idea was to use a large number of operon regulation systems in prokaryotes—usually a small molecule substrate corresponding to a repressor. When a small molecule substrate is present, the repressor dissociates from the operon region that is located upstream of the promoter so that downstream products can be expressed. Therefore, the gene catalyzing the synthesis of the small molecule substrate and the target gene are expressed in tandem by a specific DNA sequence, and the repressor is integrated into the genome for constant expression, so that simultaneous expression of the growth-promoting factor when the target product is expressed can be achieved. Thus, the strain carrying the target gene is given a growth advantage.
However, this approach has two important disadvantages. First, building such a system is cumbersome and difficult. This is mainly reflected in the fact that there is often more than one gene for synthesizing a small molecule, and transferring these genes into the bacteria not only brings a large metabolic burden to the recipient bacteria, but also makes the whole system complicated and hard to predict. Second, small molecular substances are often easily transported across the membrane, which may make it possible that the bacteria that cannot correctly synthesize such a signal molecule (that is, the bacteria whose target gene is not normally expressed) obtains the same growth advantage as the bacteria that normally express the target gene. This invalidates the entire system.
Therefore, we hope that the signal conversion part of the system is as simple, efficient and leak-free as possible, which makes us think of transcription activators. These proteins tend to be of low molecular weight and are not normally transported extracellularly. It can promote the expression of downstream target genes by interacting with the transcription factors originally present in the cells, without the need to transfer additional proteins to help it. After reviewing the literature, we found that there is a cI protein in λ phage, which can achieve both the promotion and inhibition of a same promoter by binding DNA sequences at different positions on a specific promoter. Thus, a DNA sequence that binds to a cI protein resulting in transcriptional activation is fused to other promoters and subjected to some artificial design, such that the cI protein significantly enhances expression of the downstream product of the promoter. According to this principle, the protein has even been applied to the construction of a prokaryotic two-hybrid system, and its effect is expected to be significant.
Module 6 : T7 RNA polymerase and its promoter
The T7final part we constructed contains the lactose operon, GFP, T7 ploymerase, T7 promoter, GDH, and ampicillin resistance genes, of which GDH and T7 ploymerase are coupled to express by TGATG structure. Our experiments focus on how to use positive feedback regulation to ensure the expression of the target gene on the plasmid by the growth-promoting effect of gdh. The specific mechanism is to couple GFP to T7 RNA Polymerase gene by TGATG structure. That is, to express GFP, the corresponding T7RNA Polymerase gene will also be expressed. But since the ribosome will have a certain probability of detaching from the mRNA when it encounters the TGA stop codon of GFP during translation, the translation of the T7RNA Polymerase gene will be less than the translation of the GFP gene, at the same time, the T7RNA Polymerase produced specifically activates the T7promoter downstream of the plasmid, allowing the gdh gene behind it to be expressed, and gdh promotes the growth of the bacterium. The result is that in the bacteria the more GFP is produced, the more T7 RNA Polymerase is expressed, and more T7 promoters will be activated to express more gdh which enables the high-yield GFP-producing strains to gain growth advantages. Thereby the yield of the strains increases. When high-yielding bacterium gain growth advantages, in the process of continuous selection, the degraded strains will gradually lose the competitive advantage. Thereby we can achieve the purpose of controlling the degradation of strains. The advantage of our module is that it can be used to select strains without antibiotics thereby reducing the use of antibiotics, and can be applied to a wide range of industrial production such as food and medicine.