Difference between revisions of "Team:Marburg/Design"

Revision as of 23:28, 17 October 2018

Design

Careful planning puts you ahead in the long run; hurry and scurry puts you further behind
-- King Solomon

What does creating task specific lab strains, building a novel golden-gate based toolbox and envisioning a synthetic metabolic pathway have in common?
All these approaches require intense theoretical groundwork. Many hours have to be spent with screening literature, summarizing the current state of knowledge and finally, developing own ideas. Constructs have to be designed and experiments have to be planned.
All this preliminary work has to be completed before the lab work can be truly started. In addition to our initial theoretical work, we routinely iterated the steps: designing, building, testing and learning throughout the whole project.

On this page, we want to provide you the theoretical groundwork that we did to design our projects. We want to demonstrate that we followed Synbio design principles and conceived our project based on the current state of knowledge.

Design of the Marburg Collection

We created a novel golden-gate based cloning toolbox consisting of 123 parts. Here we describe the design of all relevant features of this toolbox. We provide instruction on how to use the connectors and the thought behind the selection of specific fusion sites.

Overview of the Marburg Collection

A major goal of our project was to create a collection of characterized parts for use in V. natriegens to provide other iGEM teams, as well as the whole scientific community a toolbox for the rational design of metabolic pathways, genetic circuits or any other DNA construct.
Our toolbox was initially conceived for being used in V. natriegens. However, while designing the Marburg Collection, we realized that a toolbox with maximum flexibility can easily be used in more than one bacterial species. Because alternative bacterial chassis, apart from E. coli, are gaining increasing importance (Kim et al. 2016. ) , a toolbox that is compatible with more than one bacterial species enables scientists to work with organisms that have the exact properties needed for specific applications. Our vision is to establish the Marburg Collection as the first broad-host range golden-gate-based cloning toolbox.

**Figure 1: Hierarchical cloning is facilitated by subsequent Golden Gate reactions.**
Basic building blocks like promoters or terminators are stored in LVL0 plasmids. Parts from each category of our collection can be chosen to built LVL 1 plasmids harboring a single transcription unit. Up to five transcription units can be assembled into a LVL2 plasmid.

We investigated existing bacterial toolboxes like EcoFlex (Moore et al. 2016.) and iGEMs PhytoBrick system, which is closely related to the GoldenBraid cloning (Sarrion-Perdigones et al. 2011.) . All these toolboxes are based on predefined vectors that preset the origin of replication and the antibiotic resistance cassette thus preventing their use in organisms that are not compatible with either feature. We wanted to give scientists the freedom of choice to select oris and resistance cassettes that suit their organisms and applications best. This flexibility is possible because our toolbox does not rely on classical “backbones” but instead functions by the complete de novo assembly of at least eight basic genetic parts.

All basic parts like promoters or resistance parts are stored in LVL0 plasmids. The assembly of a plasmid comprising a single transcription unit is done by assembling at least eight parts resulting in one LVL1 plasmid. One to five LVL1 plasmids can then be used for a subsequent round of assembly to obtain a multigene LVL2 plasmid which could already harbor a full synthetic metabolic pathway consisting of up to five enzymes. Our toolbox even allows more rounds of assembly, each combining up to five constructs of the previous level. This enormous cloning capacity results theoretically in an infinite number of transcription units that can be assembled in a small amount of time.

**Figure 2: Hierarchical cloning is facilitated by subsequent Golden Gate reactions.**
Basic building blocks like promoters or terminators are stored in LVL0 plasmids. Parts from each category of our collection can be chosen to built LVL1 plasmids harboring a single transcription unit. Up to five transcription units can be assembled into a LVL2 plasmid.

An additional layer of flexibility in the construction of LVL2 plasmids is added to our toolbox by our 5’- and 3’- Connectors which flank the transcription units in LVL1 plasmids. These connectors provide the fusion sites for the assembly of LVL2 plasmids. By selecting the correct connectors, the user can define the order as well as the orientation of each transcription unit. The ability to assign an orientation to a transcription unit is supposed to reduce the influence of neighboring unit on each other caused by transcriptional read through and DNA supercoiling. Moreover, our connectors are designed to function as genetic insulators. They consist of 300 bp “neutral DNA” flanked by strong transcriptional terminators to separate each transcription unit in a LVL2 plasmid. We believe that this novel feature is an important step towards building genetic constructs in a rational and predictable manner.

Construction of novel part entry vectors

All LVL0 parts have to be stored in plasmids to allow for amplification and long term storage. To create new LVL0 parts, a PCR product or annealed oligos are cloned into a part entry vector. This vector harbours the resistance and ori that are required for selection and propagation. Furthermore, part entry vectors can be designed in a way that they contain a dropout. This dropout can be a transcription unit for a marker that generates a visible output. The first golden-gate-based toolbox MoClo (Weber et al. 2011.) used a LacZ alpha transcription unit which can be used for blue white screening in many E. colii> cloning strains. This concept was also adapted by iGEMs PhytoBrick system. During the cloning of LVL0 parts, this dropout is replaced by the desired part. When the cloning reaction is transformed into a suitable E. coli strain and the cells are plated on agar plates with supplemented IPTG and X-Gal . Colonies transformed with the religated entry plasmid appear blue while white colonies most probably contain the correctly assembled plasmid. The LVL0 part entry vector in iGEMs PhytoBrick system has been designed as described and can be used for blue-white screening.
(BBa_P10500). We appreciate the approach of using part entry plasmids with dropouts but, for two reasons, we think that LacZ is not an optimal reporter. First, blue white screening requires the two expensive chemicals IPTG and X-Gal which have to be added to the agar plates. Second, blue-white screening is restricted to E.coli strains with an incomplete laci> operon that is complemented by the LacZ alpha fragment that is expressed from the plasmid (Langley et al. 1975.) . Consequently blue white screening is not compatible with a V. natriegens wild type strain

(Link to Improve).

**Figure 3: BBa_K2560002 was designed as a novel part entry vector**
A sfGFP dropout cassette is flanked by BsmBI recognition sites. During the cloning of LVL0 parts with BsmBI, this dropout cassette is replaced by the desired part

We overcame both drawbacks by replacing the LacZ alpha dropout of (BBa_P10500) by a RFP and sfGFP dropout and thus created the two parts BBa_K2560001and BBa_K2560002, respectively. Like (BBa_P10500), our two new dropout parts are located in a derivative of pSB1C3 which is extended with two BsaI recognition sites. The dropout parts are flanked by BsmBI recognition sites that are required for replacing the dropout by the desired LVL0 part. The resulting LVL0 parts are classified as PhytoBricks and are RFC 10 and RFC 5 compatible, assuming that the part itself does not contain prohibited recognition sites. The resulting plasmids can be used to clone and store all LVL0 parts. However, for resistance parts a different approach enables more convenient LVL1 cloning. In the LVL1 golden-gate-reaction, at least eight parts are combined in a single tube together with the required enzymes. All parts, except the resistance part, are stored in the previously described pSB1C3 derivate which confers a chloramphenicol resistance. Obviously, the plasmid that provides the new resistance cassette for the LVL1 plasmid (e.g. a Cas9 plasmid with kanamycin resistance), has to contain the same resistance cassette (kanamyicin in this example). Because cloning is never 100 % efficient, re-ligation of LVL0 plasmids is a common event, and in case of the LVL0 resistance part, results in false positive colonies which do not contain the desired LVL1 plasmid.
We developed a solution for this problem by creating the novel resistance entry vectors BBa_K2560005 and BBa_K2560006. These plasmids contain a RFP and sfGFP dropout, respectively, and a chloramphenicol resistance cassette that is flanked by BsaI and BsmBI recognition sites. When a new LVL0 resistance part is cloned, the chloramphenicol resistance is replaced by the new antibiotic resistance marker resulting in a RFP or sfGFP expressing plasmid with the respective resistance marker. When using these LVL0 parts for LVL1 cloning, the re-ligated resistance parts yield colonies with a visually detectable phenotype. As a result, correct plasmids can be easily identified, even for inefficient LVL1 clonings with < 10 % efficiency.

Choice of fusion sites

Golden-gate based cloning relies upon the use of type IIs restriction endonucleases like BsaI, BsmBI or BpiI. In comparison to the commonly used type II restriction enzymes (e.g. EcoRI or PstI) they also recognize a specific DNA sequence but cleave outside of their recognition sequence (Pingoud and Jeltsch, 2001) . The golden-gate-cloning method is taking advantage of this property. A single enzyme can be used to create various single-stranded overhangs that match in a predefined order and finally lead to the correctly assembled plasmid (Pingoud and Jeltsch, 2001) .
When we started to design the Marburg Collection, we carefully investigated which fusion sites should be used. The fusion sites do not only set the order in which the single parts will be assembled but also affect the assembly efficiency and determine if a newly designed toolbox is compatible with already existing collections, so that parts can be shared easily. For us, the most important and decisive argument was that we wanted to be compatible with as many other toolboxes as possible. It is our strong belief that scientists all over the world should agree on one set of fusion sites to ensure complete interchangeability between different toolboxes. The toolboxes of MoClo (Weber et al. 2011) , Loop Assembly (Pollak et al. 2018) and the PhytoBrick system already use a common set of fusion sites. We decided to adapt these fusion sites for all parts that build the transcription unit (Promoter, RBS , CDS , Terminator, Tags). Fusion sites for parts that are novel to our system (Connectors, Oris, Resistance cassettes) had to be newly designed by us because these parts did not exist in the other toolboxes.

We applied the following design principles to obtain optimal fusion sites. Firstly, the newly designed fusion sites must neither be identical to already existing fusion sites nor be palindromic to prevent assembly in a wrong order. Secondly, the fusion sites should not consist of bases that represent a portion of the recognition sequence of a restriction enzyme. If, for example, a fusion site with the sequence GGTC was used, and the sequence of the downstream part starts with TC, a BsaI recognition site would be reconstituted. So all fusion sites that would result in a partial recognition sequence of either BsaI, BsmBI or any of the enzymes that are used in the Biobrick cloning, are excluded. Lastly, the remaining candidates were sorted according to their GC content and the fusion sites with the highest GC content were chosen.
To make design of new parts as simple as possible, we created a collection of overhangs that can be copied from table 1 and pasted to the sequence specific part of a primer to create new LVL0 parts. These primers contain the cut sites for integration into the part entry vector as well as the predefined fusion sites that are required for correctly assembling LVL1 plasmids. In some cases these overhangs contain additional bases that will be discussed in the following chapter.

View table with primer overhangs

Part Category	Fwd Overhang	Rev Overhang
1 5’ Connector	AAGGTCTCGCTCGAACACGTCTCGNNNN	GGAGTGAGGGAGACCAA
2 Promoter	AACGTCTCGCTCGGGAG	TACTTGAGGGAGACGAA
3 RBS	AACGTCTCGCTCGTACTAGAG	TAATCAATGTGAGGGAGACGAA
4 CDS	AACGTCTCGCTCGAATG	GCTTTGAGGGAGACGAA
5 Terminator	AACGTCTCGCTCGGCTTAA	CGCTTGAGGGAGACGAA
6 3’ Connector	AAGGTCTCGCTCGCGCT	*NNNN*GGAGACGAGCTTGAGGGAGACCAA
7 Ori	AACGTCTCGCTCGAGCT	TGCTTGAGGGAGACGAA
8 Resistance	AACGTCTCGCTCGTGCTT	AACATGAGGGAGACGAA
4x N-Tag	AACGTCTCGCTCGAATG	GGGATGTGAGGGAGACGAA
4y N-tagged CDS	AACGTCTCGCTCGGATG	GCTTTGAGGGAGACGAA
5a C-Tag	AACGTCTCGCTCGGCTTTA	GGGTATGAGGGAGACGAA
5b Terminator	AACGTCTCGCTCGGGTAA	CGCTTGAGGGAGACGAA

Table 1: Primer overhangs for creating new LVL0 parts. Overhangs have to be added to the 5’ end of the primer. Reverse overhangs have to be added as reverse complement. Underlined bases represent BsmBI recognition sites. Bolt bases indicate fusion sites for LVL1 assembly. Bold and underlined bases show BsaI recognition sites. N bases written in bold and italic show fusion sites for lvl1 assembly and have to be custom-designed.

Additional bases between parts

The fusion sites of most golden-gate-based cloning methods create a four base pair scar that is referred to as the fusion site. The fusion sites are the feature that makes toolboxes compatible with each other. Between some parts, additional bases are required for different reasons. These bases were chosen carefully to achieve best performance of the respective part. Please note that these additional bases are not a strict requirement to use or being compatible with our toolbox but we recommend them for the design of additional parts.
The first additional bases were incorporated between the Promoter and RBS part. The fusion site is TACT and AGAG was added additionally. The sequence between promoter and RBS that results by using our suggested overhangs form the same scar that is created if the the parts were assembled with 3A Assembly (Knight, 2003) . This means that the distance between promoter and RBS is not changed and therefore we do not expect negative effects in transcription or translation. Moreover, we hope that creating the same scar with a different method will make our experimental data more comparable to the data acquired with plasmids assembled with 3A Assembly in previous iGEM projects. The next bases were integrated between RBS and CDS parts. The fusion site, which was adapted from the PhytoBrick system, AATG and TAATC was added upstream of it. Previous work has shown that the sequence between a RBS and the start codon dramatically affects the expression of the desired protein (Lentini et al. 2013). A spacer length of six base pairs was shown to result in the strongest expression. When comparing different bases in a six bp spacer, the experimental data indicate significant differences. The 3A Assembly scar results in 50 % expression strength compared to the sequence TAATCT which was referred to as the reference (Lentini et al. 2013). We chose to use the spacer sequence which is expected to result in highest expression as we think that a system that is designed to enable strongest expression can be easily adapted for low expression by using weak RBS or promoters while going to the opposite direction might be more difficult. Unfortunately, we could not adapt the exact “reference sequence” because the first A in the fusion site AATG is already part of the spacer. Eventually, we used the first five bases of the strongest spacer (Lentini et al. 2013) upstream of the fusion site.

**Figure 4: Additional bases and fusion sites ensure correct spacing and allow tags**
Between some parts, additional base pairs were integrated to ensure correct spacing and to maintain the triplet code. We expanded our toolbox by providing N- and C- terminal tags by creating novel fusions and splitting the CDS and terminator part, respectively.

Close attention has to be paid to fusion sites that connect two sequences which are translated like the CDS part or N- and C- terminal Tags. A fusion site in our system consists of four bases which would result in disrupting the triplet code. To prevent mistranslated proteins, two additional bases have to be added to create a six bp scar that results in two translated amino acids. These can be seen as linkers between the joined CDS parts or tags. We decided to preferably use amino acids that are abundant in natural or synthetic flexible linkers like glycine and serine (Chen et al. 2012) . Flexible linkers have been shown to improve the performance of epitope tags in Saccharomyces cerevisiae (Sabourin et al. 2007) . Therefore we added the bases GG upstream of the fusion site between 4x (N-Tag) and 4y ( CDS ) which results in glycine and methionine (methionine is preset by the fusion site) and the bases TA downstream of the fusion site between 4 ( CDS ) and 5a (C-Tag) resulting in an alanine and leucine linker. To allow for the optional use of C-terminal tags, a CDS part must not possess a stop codon. Therefore an additional linker has to be introduced between 4 ( CDS ) and 5 (Terminator) as well as between 5a (C-Tag) and 5b (Terminator) resulting in an alanine-STOP and glycine-STOP, respectively.

Design of Connectors

One key feature of the Marburg Collection are the connectors that provide our toolbox with the required flexibility. Our design is inspired by the “Dueber Toolbox”, a golden-gate-based cloning method designed for applications in Saccharomyces cerevisiae (Lee et al. 2015) . To our knowledge, the “Dueber Toolbox” is the first cloning system that performs assembly through all levels using restriction sites located on independent basic parts, termed connectors, instead of destination plasmids for level 1 and level 2 plasmids (Weber et al. 2011) .

**Figure 5: Default setup of connectors for building a LVL2 plasmid consisting of 5 transcription units**
The connectors match in the following pattern: 5'Con(N) connects to 3'Con(N-1)

Using connectors instead of destination plasmids enables the user to freely choose antibiotic resistance parts and oris. To achieve the flexibility to choose one out of four resistances and one out of three oris, like it is possible in our toolbox, twelve plasmids would be required for building LVL1 plasmids. For building LVL2 plasmids, LVL1 plasmids with different fusion sites are needed. In our system, this is also achieved through the connectors in contrast to the destination plasmids which are used in other cloning methods.

In already existing toolboxes, the combinatorics that come with choosing oris and resistances would be multiplied by the number of required positional vectors which provide the fusion sites for LVL2 assembly. A toolbox with five possible positions, four resistances and three oris would require 60 LVL1 destination plasmids. To enable inversion of transcription units, a built-in feature in our toolbox, the previous number would be multiplied by two, resulting in 120 required LVL1 destination plasmids.
Unsurprisingly, this theoretical toolbox would not be convenient to be built and used.

The Marburg Collection presents a novel approach that achieves maximum flexibility with a minimum of required components. We provide two sets of connectors. The first set, the “short connectors” solely provide the fusion sites for subsequent assembly. The fusion sites are identical to the ones that are used in building LVL1 plasmids to avoid having to design a complete new set of fusion sites. This also enables using LVL0 ori and resistance parts in LVL2 plasmids. Inversion of individual transcription units was also achieved by designing fusion sites that match in reverse order.

**Figure 6: Inversion of transcription units is feasible with our connectors.**
To achieve inversion, the respective transcription unit has to be built with connectors that match the neighboring transcription units. The function by following pattern: 3'Con(N)inv connects to 3'Con(N-1) and 5'Con(N)inv connects to 5'Con(N+1)

The 5’ and 3’ connectors can be chosen independently to build a LVL1 plasmid. This allows the user to combine any number of LVL1 plasmids between one and five into a LVL2 plasmid without end linkers that are required in other toolboxes (Andreou and Nakayama, 2018). An unusual example is shown in figure 7 to demonstrate the number of possibilities. The only logic that has to be followed for designing multigene constructs is to follow the formula 5'Con(N) connects to 3'Con(N-1). In the default set up TU(N) is built with 5'Con(N) and 3'Con(N) This will result in a LVL2 plasmid harboring five transcription units.
However, any other combination that follows the previously given rule will result in a functionally assembled LVL2 plasmid.

**Figure 7: Unusual examples are displayed to show the range of possibilities when using the Marburg Collection**

Design of insulators

In addition to the “short connectors”, we designed a set of “long connectors” that function as genetic insulators. They fulfill the same basic functions as the short connectors, providing fusion sites for subsequent cloning. Additionally, they consist of 300 bp “neutral DNA” flanked by two strong transcriptional terminators.
The purpose of these long connectors is to reducing cross interaction between neighbouring transcription units, mainly transcriptional read through, a phenomenon that was described previously (Mairhofer et al. 2015) . The design of “neutral DNA” started with generating 300 bp of random DNA which do not possess recognition sites of BsaI, BsmBI and any of the enzymes used in Biobrick Assembly, using a self-made Matlab script. These sequences were analyzed with the reverse mode of the R2oDNA designer (Casini et al. 2014) . The R2oDNA designer can take a DNA sequence as input and quantifies the extent of secondary structures, repeats and forbidden sequence motifs, such as promoter or RBS motifs. Because the genome sequence of V. natriegens is not included in the tool, a BLAST search was performed manually with the sequences to check for homologies with the genome of V. natriegens. The top hit was considered as a fourth score to quantify the quality of a spacer sequence. At the end of this process, four scores were assigned to every sequence, each describing a different characteristic. We decided that spacer sequences with decent scores in each category are better suited as insulators than sequences with a superior score in one and inferior scores in other categories.

**Figure 8: Design of our insulating connectors**
300 bp neutral DNA are flanked by two strong terminators. A BsmBI site for level 2 cloning is incorporated at the 5'end

We developed a Matlab script to find the “best” spacer sequences by increasing quantiles and picking spacers that fall into the lowest quantiles for all four categories. For example, the best sequence was found to be among the best 28 % of all tested sequences in terms of secondary structure, repeats, forbidden motifs and homology to the genome of V. natriegens.

The selected sequences were flanked by synthetic transcriptional terminators that were developed in the lab of Christopher A. Voigt (Chenet al. 2013.) . Most terminators were described to be unidirectional (Chenet al. 2013.) and therefore the orientation of the flanking terminators had to be considered. Our priority was to prevent transcription into the respective transcription unit from the upstream sequence. Therefore each spacer sequence was equipped with a strong terminator at the 3’ and 5’ end of the 5’ connector in forward and reverse orientation, respectively.
We expect to achieve the best reduction of crosstalk between neighbouring transcription units by combining our insulators in combination with connectors that facilitate inversion of an individual transcription unit, thus providing a large step towards the reliable and predictable design of synthetic circuits.

Selection of parts

In addition to the connectors that form the scaffold of our toolbox, we evaluated which parts should be integrated into the Marburg Collection. As a starting point, we focused on parts that are already commonly used in E. coli to demonstrate their compatibility with our new chassis V. natriegens. Therefore we integrated all 20 promoters of the Anderson Library, four RBS from the Community Collection and the most frequently used reporters and terminators. We also included two well established inducible promoters, three widely used oris and the four resistance parts covering the antibiotics used by iGEM.
In addition to this standard set of parts, our toolbox provides N- and C-terminal fluorescence and epitope tags to facilitate fluorescence microscopy and protein purification and degradation tags that can be used to finetune protein levels. Lastly, we submitted all project related parts that can be used to perform genome engineering via CRISPR/Cas9 or natural competence and all enzymes that were used in our metabolic engineering project.

Summary of the features of the Marburg Collection

The Marburg Collection is designed to be the most flexible golden-gate-based toolbox for prokaryotes. The high degree of flexibility of our toolbox is achieved by the de novo construction of plasmids instead of using entry vectors like most other toolboxes. A novel core feature are the connectors that provide the fusion sites for subsequent cloning steps and can be used to set the order and orientation of each individual transcription unit. A set of newly designed insulators, consisting of “neutral DNA”, and two transcriptional terminators, are designed to minimize crosstalk between neighbouring transcription units.

Strain Engineering

In Strain Engineering we modified V. natriegens to establish three different strains which were designed based on commonly used E. coli lab strains to accelerate daily lab work. Therefore, we designed a new workflow for fast genomic modificationsin V. natriegens to further speed up our project.

Methods

We designed a workflow for fast genomic modifications in V. natriegens using the Flp/FRT system for excision of antibiotics markers and combining this technique with the natural competence of V. natriegens for the uptake of linear DNA from their environment. Therefore, we utilized the Marburg Collection to assemble the desired constructs for introducing the Flp recombinase and TfoX into the V. natriegens cells.

Natural Competence

In our project we wanted to build many different strains. For this we have to make many genomic changes of the wildtype of Vibrio natriegens. Mutations must be introduced in some genes, others must be deleted or inserted. In order to think about a good method that works in V. natriegens and with which we can make many modifications very quickly, we looked up into the literature. We found a great publication from the group of Ankur B. Dalia published 2017. They have shown that V. natriegens is natural competent and that it is possible to perform a natural transformation with this organism. This method is a good choice for many multiplex genome editing in a short time. They showed successful that it was possible to integrate a linear PCR product with a antibiotic cassette that replace the gene of interest. Besides, they have managed to perform a cotransformation. This fact was very interesting to us and we knew immediately that if we can reproduce it, then that's the method we want to use.
It was possible for us to get in contact with the group leader and asked him for his strain, he had used for the transformation. In this strain, Tfox is expressed under the control of an IPTG inducible promoter. It is not yet known how the natural Tfox is induced therefore, this method was resorted.
After receiving the strain, we were able to start testing it to see if the method works in our hands. Before we start in the lab, several things have to be considered: Transformation is dependent on the length of homologous sequence. It was shown that the transformation efficiency is much higher with 3kb homologous regions than for example with 0.5kB but she is still high. In a cotransformation, however, it is important that the homologous regions are chosen to be slightly larger. There was a big difference between the efficiency of the transformation. For example, the efficiency is 6-fold lower with 1kb-flanks than with 3kb-flanks (Daliaet al.2017) . Moreover, you have to remember that you have to make multiple iterative cycles to get the complexity of genome edits in the population (Daliaet al.2014) . In each cycle it is important to change the antibiotic marker. An advantage is that the efficient in V. natriegens ist still high and you only need a low amount of the unselected product (25-50ng/108 CFU) (Daliaet al.2017).

To get in touched with the multiplex genome editing by natural transformation, we construct a linear DNA Fragment to transform. We have chosen an antibiotic resistance for selection, which has to be different to the resistance marker on the plasmid in the strain. Using a PCR we construct at first the chloramphenicol cassette with the frt sites (Link: frt) and fuse this on the upstream and downstream regions of the gene we want to delete in order to integrate it by homologous recombination (Figure). In this case it was the dns. It was previously shown that cells with genetic modifications on this site did not developed unfavorable phenotype (Daliaet al.2017). To use the strain after completion of the modifications it is important to get the plasmid out. At first we inoculate a culture without the antibiotic in order to lose the plasmid with the tfox gene. Every 12h 10μl were plated out and with 10µl a new culture is inoculate. By doing that we don´t get the plasmid out, so we have to think about another technique for getting rid of the plasmid. For that we looked up again into the literature. We could find, that Ankur B. Dalia wrote that the plasmid is very stable and you can use the strain without antibiotic. Then we contacted him again to pray for help. He told us that it was possible to get the plasmid out with another strategy. This makes us very relieved, because if the plasmid is still in the modified strain, which would be critical for using the strain for cloning, due to marker and Ori incompatibility. With his proposal to grow a culture with the add of IPTG and than to screen for clones who are no longer carbenicillin resistance, we are able to get the pMMB-tfox Plasmid out of the strains. After it was possible for us to use natural transformation we start to build our own plasmid. Therefore we use the Marburg Collection (Link: Part Collection), to make this great part good and easy accessible for the scientific community. Therefore we build a level 0 and a level 1 Plasmid and integrated it into the wildtype of V. natriegens.

The Flp/frt system

In order to establish the Flp/frt system in V. natriegens for fast genomic deletions two components are needed. First we need to introduce genomic FRT sites for the desired modifications and second a FLP recombinase needs to be introduced into the V. natriegens cells. By attaching frt sites to the chloramphenicol resistance cassette that will be used for creating deletions, we will introduce the genomic FRT sites that are needed. Therefore, the resistance cassette from pYTK will be amplified using Primers with frt overhangs. Depending which kind of gene, we want to delete, the upstream and downstream homologous regions with a length of 500 to 3000 bp will also be amplified attaching frt overhangs to the upstream reverse and downstream forward primers. This way, upstream fragment, chloramphenicol resistance cassette and downstream fragment can be assembled to large linear fragments by performing fusion PCRs. Using the natural transformation method we will transform these linear fragments to the V. natriegens cells. By homologous recombination, the fragments can integrate into the genome. This Insertion will be proved by colony PCR. The second component for the Flp/frt system is introducing a functional FLP recombinase into the cells. We will involve the pBR-FLP plasmid that was used to establish the Flp/frt system in other Vibrio species like V. cholerae (Blokesch 2012) into the Marburg toolbox using primers with compatible overhangs. Assembling different parts of this toolbox to a plasmid with a full transcriptional unit, we will create a plasmid encoding the FLP recombinase. Because we already use all inducible promoters of the Marburg toolbox for inducing tfox or the T7 system , we decided to use a consecutive promoter for the flp recombinase. The plasmid will be assembled using K2560007 as a consecutive promoter, K2560010 as an RBS, K2560011 and K2560012 as connectors, K2560271 as the coding sequence of FLP, K2560046 as an ori, K2560057 as backbone with a kanamycin resistance and K2560035 as a terminator.

Introducing this plasmid into V. natriegens and expressing flp, the recombinase will recognize the integrated frt sites that are in the same orientation. By Flp mediated recombination, the resistance cassette will be cut out of the genome, creating the deletion of the desired gene.
Afterwards the strain will be cultivated without chloramphenicol and screened for chloramphenicol sensitive clones. The excision needs to be proved by colony PCR. The next effort is to get rid of the Flp plasmid and its kanamycin resistance because we want our strains to be compatible with all your constructs. Due to its high growth rate, V. natriegens loses plasmids, with some exceptions like pMMB-tfox (Hayes et al., 2017) , very fast if there is no selection pressure by antibiotics in the medium. The Flp encoding plasmid will be selected out, cultivating the created strain without kanamycin and screening for kanamycin sensitive colonies on plate. A colony PCR, negative for Flp will prove the loss of the plasmid and the strain is ready for the usage.

Integration

content

Strains

There are different E. coli strains for a variety of applications. In every lab diverse strains are stored in freezers for methods like cloning, protein expression and protein interaction. And all of them are being used. For our project Vibrigens, we decided to establish the world’s fastest three lab strains suited for specific needs based on our V. natriegens wildtype strain. Here we describe how we designed our three strains VibriClone, VibriExpress and VibriInteract based on the genotype of commonly used E. coli strains.

VibriClone

To create a cloning strain that fulfills all the wishes of the scientific society, we have to think of a good strategy. A cloning strain is a kind of a tool in the laboratory day. We asked ourselves which features we needed urgently. The currently used chassis for cloning is E. coli. We started to look in the genotype, to see what modifications are important. Then we think about which one we would like to take over. Our choice was to knock out the exonucleases so that it would be possible to transform DNA without degrading. In addition, we would like to insert a point mutation in recA, that the homologous recombination between plasmids and into the genome. Moreover, we also want to make it possible to store in the fridge. Therefore we insert the katG gene encodes an enzyme with an catalase and peroxidase activity into the genome. Furthermore, we also wanted to make the blue-and-white selection possible for our VibriClone, we can achieve that by inserting the lactose Operon in the genome.

Exonucleases

In the cloning strain of E. coli, the exonuclease EndA was knocked out. This is very important for such a strain, otherwise the exonucleases degrade the DNA to be transformed. EndA makes double-strand breaks in duplex DNA und is the most abundant DNase activity in E. coli (Dürwaldet al.1968). In the present of the exonuclease no colonies were formed during transformation (Kanekoet al.2010) . It was recognized that by only the complete loss of host endA there were no degradation after 30h at 39°C. In addition, it was also found out that it creates a structural stability to give transformants and that higher molecular weight genomic DNA bands were recovered from the supernatant and it leads to certain discrete bands in digestion (Itayaet al.2012) . V. cholerae has two extracellular nucleases: Xds and Dns. The Dns is homologous to EndA from E. coli. Both proteins have been shown to be secreted into the medium. These two would degrade the free DNA, which is the source of a natural transformation. Because of this the method would not work or work very poorly. To enable the uptake of intact DNA during the competency process, the functions would have to be extinguished. It was shown that when you delete the nucleases, the transformation process works well. In a mutant in which the dns was knocked out, transformation frequency is more than hundred times higher than WT and when xds was knocked out the transformation frequency was 2,5-fold increased. This result showed that it is good if both nucleases are knocked out, but that the dns is the most important one (Itayaet al.2012) . In addition, you get through the deletion of the two nucleases a cleaner preparations of DNA. We wanted our cloning strain to have all the benefits, so it was very important to us that the exonucleases be knocked out. Otherwise, transformation efficiency would be very low and you don´t get clear preparations of DNA and that would not be great for a cloning strain. We started by searching for homologous genes of endA of E. coli and dns and xds of V. cholera. We were able to find out by blast analysis that in Vibrio natriegens there are two exonucleases homologous to those of E. coli and V. cholera. They are called Dns and ExeM.

In the alignment of the V. natriegens dns with the V. cholera dns and E. coli endA you can conclude that the three proteins are highly conserved among the three strains (see Alignment). The intermediate region and the C-terminus are highly conserved in all three proteins, while the N-terminus is variable. This suggests that the parts that are highly conserved are important for the function of nucleases. It can be concluded that the dns of V. natriegens has a very similar function to the two exonucleases of the other two organisms.

The most parts of the two proteins ExeM from V. natriegens and Xds from V. cholera are conserved, but both the N- and the C-Terminal vary a lot from each other. This suggests that the function of these two proteins has a similar function. Thus we can assume that it is very probable that the ExeM protein in V. natriegens behaves like the Xds from V. cholera. Because of this it is very important to delete this protein otherwise the DNA would be degraded. In 2016, Matthew T Weinstock showed that the quality of isolated plasmid DNA could be improved by deleting dns in V. natriegens (Weinstocket al.2016) . In addition, the growth and viability of cells with a deletion of dns in rich medium should not be impaired (Daliaet al.2017) .

After we made all these thoughts, we were able to start and plan the deletion of the exonucleases Dns and ExeM. We constructed cassettes to erase the genes. To first get to know the method and to check whether the natural transformation works as it has already been shown in our hands, we have started to knock out the dns. For this we constructed a cassette with a chloramphenicol resistance and the upstream and downstream region of the dns gene. To use the natural transformation it is enough to use 1000bp flanks but for the co transformation it is much better to use 3000bo flanks. By using Frt sequences between die single fragments, the antibiotics cassette can be excised from the flippase after integration. Because this knockout worked, we started planning for VibriClone. Our decision was made to perform the deletions with co-transformations. This was a big advantage because you only need one selected product. As selected product we chose the dns, due to the fact, it was shown that a deletion and insertion of an antibiotics cassette at this locus is not harmfulto the cells. This has not been shown for the other parts. The deletion of exeM is done by an unselected product. Therefore we create the upstream and downstream regions of this gene and take another gene that we would like to insert (katG) and build an unselected cassette by using fusion PCRs.
With this procedure, we are able to knock out the exonucleases and thus obtain a higher transformation efficiency.

RecA

The classic allosterically regulated enzyme RecA is involved in the regulation of the SOS System and the homologous recombination (McGrewet al.2003) . If you think about what you need for a cloning strain, it quickly comes to your mind that homologous recombination in the genome and between plasmids is not only unnecessary but also undesirable.
In E. coli, a so-called recA1 mutation was performed. This mutation limits the recombination of an introduced plasmid with the genome and between plasmids thereby the stability of the plasmids are ensured. Another advantage is that the exogenous plasmid DNA is left unchanged by the modification of RecA. This allows the purification of this bacterial culture plasmid to allow high fidelity PCR amplification of the original plasmid sequence. The modification in this gene was a point mutation from G to A at position 720. Thereby the glycin at the place 160 chance to aspartic acid. We use this model to modify our strain VibriClone.
At position 158 in recA of V. natriegens is a glycine. Through an alignment, we have found out that this is homologous to that in E. coli. To get the same kind of function loss, this glyin needs to be changed to an aspartic acid. For the modification of recA we build an unselected cassette from the gene with a point mutation. To achieve that two primers bind at the future mutation place, one forward and one reverse were created. In addition two more primers with a gap of 3000 bp to the other both were ordered. The results are two fragments which end at the point mutation. By a fusion PCR with only the outside primers, you get the hole fragment containing the recA with the desired mutation.
All in all with these mutation we make VibriClone fine for all of us to use for cloning without recombineering of an introduced plasmid with the genome and between two plasmids.

KatG

Another feature, which would improve the applicability of V. natriegens as a cloning chassis, would be to engineer cold resistance, due to fact that we all are used to store our plates of E.coli in the fridge. It has been shown for other vibrio species that cold sensitivity is triggered by toxic peroxide present in media, which is normally removed for example in E.coli by reactive oxygen species detoxification systems. Therefore our approach is to introduce such a system into our cloning strain, in order to transfer that tolerance, which enables keeping the working procedures from E.coli. It was shown, that catalase supplemented on agar plates could improve the viability under cold conditions. Nevertheless a more usable approach would be to include this tolerance within the strain itself. Several approaches exist to realize this goal, for instance increasing the glutathione pool within the cell or by introducing a plasmid with an encoded catalase, such as KatG, KatE or the ahpCF operon.
We decided on introducing KatG into the genome, due to previous results, which were reported by Weinstock et al. They could show that KatG heterologous expressed from plasmid extends the time for fridge storage up to 3 weeks, which would suit for the needs for a cloning host. For integrating the katG into the genome, we combined introducing katG with the deletion of the second nuclease by utilizing 3000 bp homologous overhangs of the upstream and downstream flanking genomic region of exeM. These overhangs are fused to the katG transcription unit, that is amplified from the E.coli DH5𝛼 genome, by fusion PCR and subsequently transformed via natural transformation as a cotransformation.

The lac operon

The lac operon of E. coli consists of the three genes lacI, lacA and lacY. In the cloning strain, the fact that the β-galactokinase cleaves X-Gal into a blue dye became apparent. The lacZ gene encodes the β-galactokinase which consists of two parts: the lacZ-alpha and the lacZ-omega. It has been shown that if the alpha fragment was deleted, the omega fragment is not functional. By introduced the alpha fragment on a plasmid, alpha-complementation occurs. (Link alpha complementation) We also wanted to make the blue-and-white selection possible for our VibriClone, as it is in the DH5𝛼 strain of E. coli. Therefore we checked at first which lac genes are present in the wildtype of V. natriegens. We found the lacA gene on chromosom 1 and the lacZ and lacI gene on chromosom 2.
As you can see, several parts are missing or different in all three proteins (see Alignment). What indicates that the lac genes are not functional in V. natriegens, which makes sense as lactose can not be found in its natural habitat and therefore genes for lactose metabolism are not needed and no selection pressure is present. That's the reason why we decided to amplify the lac Operon from DH5𝛼 that has no lacZ alpha subunit.

Next we considered whether the wildtype lacZ from V. natriegens has a functional alpha subunit. If it exists, the 𝛼-complementation would not work anymore. To check this, we aligned the alpha subunit with the lacZ gene of V. natriegens (Figure 5). You can see that many parts of the subunit are missing, but a few sequences are conserved. To avoid that it can come to a complementation, we wanted to delete the part in our VibriClone. Since it appears that the entire lacZ may be nonfunctional, we decided to bring the amplified lactose operon on this place.
To achieve that we build a cassette consist of the upstream and downstream regions of lacZ and the lactose operon from DH5𝛼. The lactose operon of E. coli has a size of 6326bp. Since that is very large we decided to use smaller homologous flanks. It was shown that the natural co-transformation work well with 1000bp flanks, so we use this size. Otherwise the linear DNA Fragment has a size of ~12000bp. Thus, it is possible for us to introduce the lactose operon by cotransformation as an unselected product with the other described products.

With this kind of genome modification, it is possible for the entire scientific community to immediately recognize a successful transformation with VibriClone. This would save not only through the fast growth rate but also through the method of cloning an incredible amount of working time.

Outlook

After developing the first cloning strain of Vibrio natriegens to fully replace E. coli as the most used cloning host in synthetic biology, we asked ourselves what would be the next step. VibriClone provides a starting point for further improvements for instance in case of special cloning application. When we created this strain we completed a full round of the design build test cycle and as we want to bring engineering into biology we started with the next cycle by designing VibriClone 2.0.
Therefore several different strains should be established in parallel to full fill the demand for more specialized applications. For E. coli several different strains exist, which are utilized for example for different sized constructs (TOP10), for allowing genome engineering tool, such as MAGE (mutS deficient), for in vivo mutagenesis, for toxic gene cloning or for preventing unwanted methylation.
In the beginning of our project we did next generation sequencing for our Vibrio natriegens strain and performed bioinformatic analysis and identified several other candidates for deletion and other genome engineering methods. One design feature for our 2.0 strain would include deleting methylase and restriction modification systems, which are present in our genome. In common cloning strains for E.Coli (dH5𝛼, Top10) restriction modification systems are deleted (Δ(mrr-hsdRMS-mcrBC) to enable cloning with DNA from different sources and to protect the DNA from endogenous restriction enzymes. That's why a desirable goal for the next generation of our cloning strain, would be to design a strain, similar to E.coli, which is deficient for all for those modification systems.

When we performed next generation sequencing in the beginning of our project for our wild type strain, we subsequently did a bioinformatic analysis and genome annotation. By utilizing these information we were able to identified several methylases and restriction modification systems.

Type I restriction - modification system subunit M	EC 2 . 1 . 1 . 72
Type I restriction . modification system , specificity subunit S	EC 3. 1.21.3
Type I restriction modification system restriction subunit R	EC 3 . 1 . 21 . 3
Type III restriction - modification system methylation subunit	EC 2.1 . 1.72

We are sure, that more methylases and restriction modification systems can be found in the genome of Vibrio natriegens by utilizing and combining different bioinformatic tools such as the rebase database.
Additionally we found several phage associated sequences and it has been shown that deleting phage related regions could make a bacteria more genetically accessible. In general a genome reduction could have advantages for a cloning host, as it has been shown that it could increase genome stability, reduce mutation rates and improved recombinational stability for cloning repetitive DNA sequences.
The most ambitious idea for our VibriClone 2.0 strain would be to free up chromosome II for utilization as a low copy plasmid in order to clone big DNA fragments or whole genomes and thereby establishing a synthetic chromosome for challenging cloning applications. In the future this could enable cloning of small of hole small genomes, such as the mycoplasma genome in order to make the genome engineering process easier and especially much faster, compared to common methods, where whole genome cloning heavily relies on yeast as a cloning chassis. Combining the fast growth of Vibrio natriegens with powerful genome engineering methods like Crispr Cas, MAGE, CAGE, the ability for natural transformation and MuGENT, would allow rapid genome prototyping and could open up complete new possibilities for building up synthetic genomes from scratch. This would enable engineering customizable microbes with complete new properties specifically tailored for the individual needs for a broad range of applications.

VibriExpress

Sometimes it is really hard to part with old things and adopt to new ones. But as scientists, it is our responsibility to try out new and unexplored things in order to advance progress and improve on already known. To convince you all, to start working with a new chassis we will show that all commonly used techniques are transferable from E. coli to V. natriegens, without much effort. Since it is important to overexpress and purify proteins in heterologous systems, a fast growing protein expression strain is required. Before starting to overexpress a protein, you need to make important decisions: “What kind of system would be the best?”, “Which chassis could be the most effective” or “Do I need a chassis at all, what about cell-free-expressions systems?”
There is not the one and only right way to overexpress a protein. For each protein, other aspects have to be considered in order to make a good decision. These decisions depend on different characteristics of the desired protein, like its solubility, if it is a bacterial protein or a eukaryotic one or if it is toxic or not.
In E. coli, several expression systems are established to cover a broad spectrum, but the most commonly used is the T7 expression system. To enable scientists, to overexpress their proteins much faster and purify a higher protein yield, we decided to establish the T7 system in the world’s fastest growing organism V. natriegens and did further analysis to learn which genetic modifications are needed to use its full potential. As a model, we were guided by the E. coli strain BL21 (DE3). This E. coli strain is often used for heterologous protein overexpression. In our opinion, the most important features of BL21 (DE3) are the genomic integration of the T7 RNA Polymerase gene and the deletion of proteases like Lon and OmpT to improve the protein yield by decreasing the protein degradation (Dunn & Studier 1983) , (Kim et al. 2015) . We will try two different strategies to create a genomic T7 cassette integration (Figure 2). Because it was shown, that the deletion of dns is not harmful ( Hesek et al. 2016) , we decided to integrate the sequence of the T7 cassette at the endogenous locus of dns. In parallel, we will integrate the T7 cassette in one of our successfully characterized integration sites.
Looking at the genotype of E. coli BL21 we noticed that lacI localizes at the upstream site of the T7 RNA Polymerase gene (Kim et al. 2015) . LacI is the negative regulator of the lac operon (Gilbert & Muller-Hill 1966) . In case of the T7 system, the lacUV5 promoter controls the T7 RNA Polymerase expression ( Studier & Moffatt 1986) . By integrating the repressor LacI at the same time into the V. natriegens genome we will reduce the leakiness of the lacUV5 promoter, as it was done in E. coli. Additionally we checked if there are genes encoding proteins similar to Lon or OmpT, because these two were deleted in E. coli BL21 (DE3) to minimize the degradation of proteins (Dunn & Studier 1983) ,(Kim et al. 2015) . By BLAST analysis, we could find a Lon protease with a high degree of similarity to the Lon protease of E. coli in V. natriegens (Figure 1). We could not find any OmpT homologs neither by BLAST analysis nor by an ompT annotation in the genome of V. natriegens. Because of the high similarity of the Lon proteases to each other, it is likely that the Lon of V. natriegens fulfils a similar function like the Lon protease in E. coli. This convinced us to delete the lon of V. natriegens.

As already mentioned we will follow two different strategies to create the T7 integration. For the first strategy, we will fuse the T7 cassette. Amplified from the BL21 genome to a chloramphenicol resistance cassette and to 1000-bp up- and downstream homologous regions to dns. Since it is possible to create two or more genomic modifications, using only one selected product in natural co-transformation of linear fragments (Hayes et al. 2017) , we will use another fragment with homologous up- and downstream regions of lon but without a selection marker in between, to delete lon.
For the second strategy, we will integrate the T7 cassette to one of our characterized integration sites. For this reason, we will fuse the T7 cassette to 1000 bp long up- and downstream homologous regions to integration site 19 (int19). For the deletion of lon, we will use the same linear fragment as in strategy one. As a selected product with an antibiotics marker in between we will utilize the deletion cassette as for Δdns in VibriClone.

As we will attach FRT sites to the antibiotic cassette by PCR, we will be able to remove the selection marker after integration of the selected product, using a plasmid that encodes a Flip recombinase (link FLP recombinase). After the genomic modifications are proved by PCRs, the T7 system needs to be tested and VibriExpress to be characterized. First of all a growth curve of this new strain has to be measured in comparison to E. coli BL21 to show you, that V. antigens is still faster than E. coli. Next, we will prove, that the cells are healthy after several genomic modifications by microscopic analysis. To show you, that changing from E. coli to V. natriegens will be easy and to prove, that tools like the T7 system are transferable without much effort, we will test a construct containing a T7 promoter in both organisms and compare the results to each other. We are convinced that the T7 system in VibriXpress will deliver a much higher amount of protein in a shorter time. This way we would like to speed up your scientific progress. In 100 years of research with E. coli, BL21 is not the only used strain with a T7 expression system. Strains like Lemo21(DE3) (commercialized by NEB) are designed for fine-tuning of T7 expression especially for difficult targets. This fine-tuning is achieved due to the variation of the lysozyme (lysY) level, a gene encoding a variant of phage T7 lysozyme. The T7 Lysozyme is an inhibitor of T7 RNA Polymerase (Moffatt & Studier 1987) , (Zhang & Studier 1997) . By reducing the basal T7 RNA Polymerase activity toxic genes can be expressed in the same cell under control of a T7 promoter. Because of its ability to inhibit transcription the T7 lysozyme is often used for controlling expression in systems based on T7 RNA Polymerase ( Studier & Moffatt 1986).

In strains like Lemo21(DE3) the level of lysozyme can be optimized by inducing LysY with L-rhamnose to alleviate inclusion body formation or inhibitory effects from toxic proteins. In future, we will need to engineer this kind of specialized strains for troubleshooting as well in V. natriegens.
In addition, more research is necessary to study V. natriegens specific proteases, their function and their strength. Of course, we could not find an aspartyl protease similar to OmpT of E. coli, but this does not rule out that V. natriegens possesses other proteases that are crucial for protein degradation during overexpression. Finding highly active proteases and deleting them would further improve the amount of produced protein.
What about all the scientists that would like to use other expression systems than T7? As we mentioned before the T7 system is not always the first choice for protein overexpression. Other commonly used promoters like the synthetic trc promoter or T5 promoter are recognized from E. coli RNA Polymerases and do not require a genomic integration of a phage RNA polymerase (Erfle et al. 1985) , (Gentz & Bujard 1985) . We could show, that these promoters are recognized from the V. natriegens RNA polymerase as well (Link metabolic engineering), enabling you to use the potential of this impressing organism in a variety of tools and benefitting of its speed. This result, together with all promising facts about V. natriegens like the higher solubility of some proteins that are difficult to purify and the simple transferability of E. coli tools to this organism, have strengthen our vision that V. natriegens is the perfect chassis to accelerate scientific progress in synthetic biology.

VibriInteract

Protein interaction studies are known to be time-consuming. First, all the plasmids required need to be cloned and co-transformed into the interaction strain. Second, you have to cultivate those co-transformants and test them on inductive plates. Now imagine how much time you could save using a much faster growing organism, accelerating all steps required for protein interaction studies. And imagine this time, could be used in a meaningful way, to get new results and speed up scientific progress.

For the design of VibriInteract we had a look at the Genotype of the E. Coli strains used in Bacterial Two Hybrid Assays which are commercially available from Euromedex. It is also possible to use any other strain deficient of/from cya.

DHM1: F-, cya-854, recA1, endA1, gyrA96 (Nal r), thi1, hsdR17, spoT1, rfbD1, glnV44(AS).

BTH101: F-, cya-99, araD139, galE15, galK16, rpsL1 (Str r), hsdR2, mcrA1, mcrB1.

The genotypes of the strains recommended and provided by Euromedex showed three common genetic modifications:

F-: Both strains do not contain the F-plasmid which encodes the mechanism for conjugation in E. Coli (Willetts & Skurray, 1980) . V. natriegens has not been described to possess such a plasmid needed for conjugation. For this reason, we did not consider this part for VibriInteract.
hsdR17/hsdR2: HsdR is a restriction endonuclease found in E. coli of the EcoKI restriction-modification system. Endogenous sites get methylated by methylases. HsdR restricts foreign DNA by recognizing sites which or unmodified (Sain & Murray, 1980) . Point mutations in this gene lead to a nonfunctional protein. By this, non methylated DNA is no longer degraded. This is important for strains that are designed to carry foreign DNA like plasmids e.g. cloning strains like VIBRICLONE. For VibriInteract this would be a useful alteration compared to the wildtype since our strain has to carry not only one but two plasmids at the same time. When searching for Type I restriction enzyme EcoKI R proteins we find several in V. natriegens. One of them, BA894_04835 shows a high degree of similarity when doing protein alignments using ClustalX 2.0.10. This could be used for later modifications in our strain but we did not choose it as our highest priority since it is not necessarily needed for functioning protein interaction studies based on B2H.

Figure 1:
cya-854/cya-99: CyaA is a key regulator of the mal and the lac operon (click here to reread our description about CyaA (link)). Both strains recommended for a Bacterial Two-Hybrid assay in E. coli are deficient of this protein. We decided that this genetic modification our highest priority for the VibriInteract strain since we wanted to use complementation of CyaA as proof of protein interaction. We identified a homologous protein by doing a BLASTP search of CyaA from E. coli in V. natriegens, BA894_1512. By protein alignment using ClustalX 2.0.10 we see that the two proteins showed a high degree of similarity. Therefore. we hypothesized that it serves the same or a similar function as the E. coli protein thus we decided to delete this gene using natural transformation for insertion of our deletion cassettes and the Flp recombinase to get rid of the antibiotic resistance cassette used for selection of insertion clones. This deletion is the start of the development of a strain which can be used to check for the interaction of proteins.

Figure 2:

Examinationof V. natriegens as chassis for protein interaction studies

Before we started to establish our strain, we had to verify if the wildtype strain of V. natriegens is suited to work in an interaction study assay based on the B2H system. For this purpose, we tested if our strain possesses a functional mal or lac operon by plating wildtype on diverse media which were:

LBv2 agar plates supplemented with IPTG and Xgal used for Blue-White Screening if the lac operon is functional.
M63 minimal medium agar plates supplemented with maltose to test if the mal operon is functional.
MacConkey agar plates to test if the mal operon is functional.

Characterization of VibriInteract

After obtaining our final strain we have to characterize it, because why would anyone use a new strain? With our characterization we want to show you how easy it is to use VibriInteract. Therefore, we designed experiments to demonstrate the benefits of V. natriegens and how little changes you have to undergo to use VibriInteract instead of the commonly used E. coli strain BTH101. First, we did growth curves, to verify that VibriInteract is faster than BTH101. Second, we did microscopic studies to verify that the cells of our strain still look healthy. Last and most important, we did a Protein Interaction study to verify that the pUT18 and pKT25 plasmids of the commercially sold B2H kit are working in our strain.

Outlook

For the future, we would like modify genes also modified in our cloning strain, which are recA, katG, dns and exeM. Those modification would help the foreign plasmids used for the B2H Assay to be retained in the cell. Read more about in the VibriClone part of our Strain Engineering subgroup. It could also be beneficial to eliminate the restriction endonuclease homologous to HsdR in E. coli. By this, foreign DNA is less likely to be degraded. Additionally, we would like to replace the non-functional lacZ gene with the lac operon of a wildtype E. coli strain like K12 MG1655. Hereby, we could use blue-white screening for our assays. This screening method is much faster than the selection on maltose minimal medium and would accelerate the assay even more. Another possible alteration of our strain could be the insertion of a reporter gene under control of an promoter which can be induced by the cAMP/CAP complex. This reporter gene could be an antibiotic resistance, a toxin or the luciferase. Using an antibiotic resistance one could use positive selection, since cells can grow on this specific antibiotic when proteins of interest are interacting. With a toxin under a cAMP/CAP dependent promoter one could use negative selection, since cells can no longer grow when proteins are interacting. The last possible reporter gene lux is giving an output signal through luminescence which could be measured.

We strongly believe, that the creation of VibriInteract is taking the community a step closer to establishing V. natriegens as the new standard chassis organism in synthetic biology.

CRISPR/Cas9

Performing genome engineering in bacteria using CRISPR/Cas9 usually requires the presence of three components in a cell at the same time, namely: Cas9, gRNA and a donor template (Wasels et al. 2017). Therefore we aimed to design a convenient system that allows easy adaption of CRISPR/Cas9 to target various sequences.

**Figure 1: LVL1-pTet-Cas9.**
The LVL1 plasmid with p15A origin, pTET promotor, Cas9 CDS and tretacyclin-resisstence is shown.

Preparation of Cas9
A Cas9 and dCas9 LVL0 part was created as a CDS part in our Marburg Collection. The cas9 sequence was PCR amplified from pCas9CR4 (Reisch and Prather 2017). To make this sequence compatible with our toolbox and the iGEM registry, we removed a BsmBI and EcoRI recognition site in the parts sequence creating the Cas9 LVL0 part BBa_K2560047. For further experiments, a dCas9 part could prove useful and therefore we induced mutations the nuclease loop in the RuvC1 (D10A) & HNH (H840A) subunits creating BBa_K2560054.

To establish CRISPR/Cas9 in V. natriegens, we conceived a Cas9 plasmid where the expression is tightly controlled by an inducible promoter. We expect this to help during cloning and transformation steps, as the toxicity of Cas9 expression is reduced in the absence of inducer. We chose pTet for this purpose. This decision is based on the characterization of this part that showed tight control and a 40 fold induction of this promoter when induced with ATc (Link to Results) . Finally, we created LVL1-pTet-Cas9, a plasmid with the Cas9 CDS under control of pTet (figure 1). We chose to built this plasmid with p15A as the ori and a kanamycin resistance cassette. This enables cotransformation of this plasmid together with a second plasmid harboring the gRNA cassette.

**Figure 2: gRNA entry Vector.**
The gRNA entry vector contained the scaffold of the gRNA and a sfGFP drop out for insertion of spacer sequences.

Construction of a gRNA entry Vector
To enable simple cloning of gRNAs, we constructed BBa_K2560305 an entry vector for gRNA spacer (figure 2). The spacer is defined as the 20 bp at the 5’ end of the gRNA that confers sequence specificity to the target sequence in the genome. This parts design is based on BBa_K2457002 a part which harbors a gRNA targeting LacZ of E. coli. To achieve superior cloning flexibility we incorporated the approach of using a sfGFP dropout similar to the part entry vector of the Marburg Collection BBa_K2560002. Analogous to the cloning of small LVL0 parts, new spacer sequences can easily be cloned by annealing of oligos and subsequent ligation into the gRNA entry vector using a golden-gate reaction. This drop out helps to distinguish correctly assembled plasmids from the religated entry plasmid. The succesfully cloned plasmid then contains a gRNA expression cassette as well as a pMB1 ori and the chloramphenicol resistance cassette.

The two plasmids LVL1-pTet-Cas9 and BBa_K2560305 after integration of a spacer are compatible and can be cotransformed. As a third component a donor template has to be added to the transformation.

**Figure 3: LVL2 plasmid.**
Both the pTET-Cas9 LVL1 plasmid and the gRNA entry vector assembled in one LVL2 plasmid.

Assembly of LVL2 plasmid
We realized that cotransformation of three components into V. natriegens might be challenging and therefore, we designed a plasmid that is a combination of the previously designed LVL1-pTet-Cas9 and the gRNA entry vector BBa_K2560305. Using our toolbox, we converted the LVL1 construct into a LVL2 plasmid to remove the BsmBI recognition sites that are present in the LVL1 plasmid. Subsequently, we integrated the gRNA transcription unit including the sfGFP dropout into the 5’ Connector via Gibson assembly. Finally, we created a plasmid that possesses a Cas9 CDS with pTet and the gRNA entry sequence to facilitate easy cloning (figure 3).

In most bacterial CRISPR/Cas9 approaches, a two plasmid system is used separating Cas9 and gRNA on two different plasmids (Wasels et al. 2017). The reason is that during the process of gRNA cloning, a functional Cas9 transcription unit is present in the same cell with the correctly cloned gRNA, resulting in double strand breaks (DSB) leading to the death of the cells.

Our approach to overcome this challenge is to use E. coli as cloning chassis for the construction of the Cas9-gRNA plasmid which is then transformed in V. natriegens together with a donor template to perform genome engineering. This is possible because the spacer sequence targeting the genome of V. natriegens should not match any sequence in the genome of E. coli and therefore we do not expect creation of DSB.

Designing of donor templates
Different approaches for the design of donor templates are described in literature. The mutation can be introduced either as dsDNA in a linear form or integrated into a plasmid or as ssDNA (Jiang et al. 2015).

We decided to use ssDNA because they can be easily ordered from companies offering synthesis. As a prove of concept we decided to tackle the dns gene on chromosome 1 of V. natriegens, as this mutation has already been shown not to be lethal and to significantly increase DNA yields of plasmid preparations (Lee et al. 2017). We used two ssDNA oligos each targeting the leading or lagging strand. The oligos consists of the desired mutation flanked by 50 bp long homolougous sequences flanking the position of the introduced DSB. The last three nucleotides at both ends are connected via phosphorothioate-bonds providing the stabilization of the donor oligonucleotides and protection against digestion (Reisch and Prather 2017).

The donor template was designed to exchange four basepairs at the beginning of the dns gene with a three basepair stop-codon (figure 4). In this way, the knock-out is ensured through the premature translation termination as well as a frameshift in the sequence. The spacer sequences targeting the genomic sequence was designed using the CRISPR design tool of Geneious which utilizes the algorithm developed by Doensch et al. (2014).

**Figure 4: Alligment of the donor template and the *V. natriegens* genome.**
The alligment shows that the the donor template differ in two nucleotides from the dns-CDS of *V. natriegens*. The first nucelotide leads to the conversion of an aminoacid to a stopcodon whereas the second, missing nucleotide leads to a frameshift.

Metabolic Engineering

With Vibrigens we aim to speed up synthetic biology in all of its facettes. For that we designed a workflow for accelerated metabolic engineering in high-throughput. We use the Marburg toolbox to construct whole pathway libraries, benefit from natural transformation to introduce it into our chassis, enable fast product screening with biosensors and implemented directed evolution for further pathway optimization.

Workflow for Metabolic Engineering and Design of Pathway and Enzymes

Our workflow

One of the central elements of synthetic biology is the Design-Build-Test-Learn cycle. It illustrates the workflow of how systems like metabolic pathways, genetic circuits or anything else are constructed and evaluated. The first step is to design the system basing on the existent knowledge about the system. Then the system is built and its functionality is tested. The results of these tests describe the ‘Learning’ part of the cycle. They are used to make an improved design. Finally, a new round in the DBTL cycle can be made and the system gets improved in each round of the cycle. For our approach we decided to establish a workflow for metabolic engineering, which bases on the DBTL cycle and combines the existing principles of metabolic engineering with the advantages of the fast-growing organism V. natriegens. First of all, we use the Marburg Toolbox to construct a library of pathways with different combinations of promoters, ribosome binding sites, coding regions, enzyme ratios and plasmid copy numbers. Next, we transform the plasmids for each pathway version together with the plasmid for the biosensor into our chassis. When the pathway is now performed, the cells produce a fluorescence or luminescence signal and its intensity correlates to the 3-hydroxypropionate ( 3HPA-) concentration. By using FACS we can select for the most efficient pathway versions. By using multiplex automated genome engineering (MAGE), refeeding 3HPA into the central metabolism and repetition of FACS selection we have a cycle of directed evolution-guided pathway optimization. The strains with the most promising pathway variants can then be isolated and their pathway composition elucidated. Thus, we get information about designing principles for our pathway and can propose an improved design for a new round in the cycle of metabolic engineering.

Figure of our implemented workflow of metabolic engineering. To accelerate metabolic engineering we implemented a workflow based on the DBTL-cycle. Starting by building a library of pathway variants, they all can be introduced into <i>V. natriegens</i> as test chassis. By using biosensors FACS-directed selection of promising variants is possible. Cultivation and repeating of FACS leads to directed evolution and finally the most efficient strains can be analyzes to investigate the genetic composition. These finding can be feeded into the Learning-part of the DBTL-cycle thereby closing the workflow. Repeating the workflow continiously increases pathway efficiency and enables high-throuput metabolic engineering. — **Figure 1: Our implemented workflow of metabolic engineering**
To accelerate metabolic engineering we implemented a workflow based on the DBTL-cycle. Starting by building a library of pathway variants, they all can be introduced into *V. natriegens* as test chassis. By using biosensors FACS-directed selection of promising variants is possible. Cultivation and repeating of FACS leads to directed evolution and finally the most efficient strains can be analyzes to investigate the genetic composition. These finding can be feeded into the Learning-part of the DBTL-cycle thereby closing the workflow. Repeating the workflow continiously increases pathway efficiency and enables high-throuput metabolic engineering.

Our pathway

As mentioned in the description, there are many ways to produce 3HPA. We decided for a pathway, basing on the conversion of acetyl-CoA via malonyl-CoA to 3HPA. According to (Valdehuesa et al. 2013) who evaluated many pyruvate-derived production pathways from a thermodynamical point of view, this is one of the most efficient routes for 3HPA production. We decided for this route for several reasons. First it bases on glucose degradation and V. natriegens has an unbeaten glucose-uptake rate, enabling a high glucose consumption rate. Second there is no vitamin B12 necessary, the only cofactors which are involved are NADPH and biotin, both of them naturally occurring in the organism. Third there is theoretically just one further enzyme needed to complete the pathway. All enzymes necessary for the production of malonyl-CoA are still present in V. natriegens and we just have to integrate the last enzyme, malonyl-CoA reductase. Nevertheless, overexpression of acetyl-CoA carboxylase increases productivity, so we took both enzymes into consideration. (Kildegaard et al. 2016) proposed, that there are many ways to further optimize the pathway and direct the flow to a high product titer. Hence, implementing many variations of this pathway into V. natriegens and testing them in a fast manner promises a high gain of knowledge about pathway optimization and general principles of metabolic engineering.

text — **Figure 2: Pathway for the production of 3-hydroxypropionic acid**
The pathway starts with pyruvate as product of glycolysis and other central metabolisms. Pyruvate is dehydrogenated by pyruvate dehydrogenase to acetyl-CoA. Acetyl-CoA gets carboxylated by Acc to form malonyl-CoA. Malonyl-CoA is finally reduced to 3-hydroxypropionic acid by Mcr. Acc and Mcr are the two enzyme, we planned to express heterologously. Acc = Acetyl-CoA carboxylase, Mcr = Malonyl-CoA reductase

Choice of enzymes

For the core pathway, we need to implement two enzymes, acetyl-CoA carboxylase (Acc) and malonyl-CoA reductase (Mcr). For Acc there are several candidates, which have still been tested for synthetic metabolisms ((Kildegaard et al. 2016); (Liu et al. 2013); (Vidra et al. 2017)). Our plan was to first test enzymes from different organisms and then optimize expression profiles. To benefit from the fast engineering workflow in V. natriegens, we decided to use enzymes from different organisms. We planned to use the native Acc from V. natriegens, because it is still present in the cells and we prevent unnecessary burden by overexpressing a heterologous Acc. The native Acc consists of four subunits together with an associated biotin ligase. The genetic organization is the same like in E. coli, which is shown in figure 3. Nevertheless, further looks into literature revealed, that heterologous expression of pathway enzymes could be more promising because housekeeping Accs often are strictly regulated while foreign genes have different regulation motifs. Hence, we also used the Acc from E. coli together with its associated biotin ligase (BirA). These enzymes have already been tested for 3HPA production (Liu et al. 2013). Additionaly, we decided to test the enzyme from Synechococcus elongatus. Since cyanobacteria require higher amounts of fatty acids to form thylakoid membranes, they might have more active enzymes. The S. elongatus Acc consists of four subunits, just like those from E. coli and V. natriegens. For this enzyme it was also recommended to use it with a biotin ligase, because it needs a covalently bound biotin to be activ. Unfortunately Accs consisting of four subunits form a very unstable complex and unbalanced expression levels of its subunits lead to gene repression (Abdel-Hamid et al. 2006). To circumvent this issue, we also chose the Acc from C. glutamicum. In this organism, AccB and AccC are fused to one subunit, as well as AccA and AccD. Balancing of two subunits instead of four is much easier, therefore this enzyme is beneficial for us. Gande et al., 2007 still characterized this enzyme and confirmed its applicability for synthetic biology. Since C. glutamicum is a gram-positive bacterium, we decided to codon optimize the genes for a smoother translation and coexpressed the associated biotin ligase (Accession numbers NCgl0670, NCgl0678, NCgl0679).

Genetic organization of acc genes in E. coli and comparison to Accs from other strains. In E. coli Acc is encoded by the genes accA, accB, accC, accD and the associated gene birA. In V. natriegens and S. elongatus there is the same genetic organization. Acc from C. glutamicum is encoded by only two genes, accBC and accD. — **Figure 3: Genetic organisation of *acc* genes in *E. coli* and comparison to Accs from other strains**
In *E. coli* Acc is encoded by the genes *accA*, *accB*, *accC*, *accD* and the associated gene *birA*. In *V. natriegens* and *S. elongatus* there is the same genetic organization. Acc from *C. glutamicum* is encoded by only two genes, *accBC* and *accD*.

The second enzyme in our pathway is the malonyl-CoA reductase (Mcr). Since this enzyme only occurs in autotrophic organisms, performing the 3-hydroxypropionate bicycle, there aren’t many candidates for our enzyme choice. In literature Mcr from Chloroflexus aurantiacus has mostly been used, because it is easy to purify, has a high activity and is still active at moderate temperatures (Hügler et al. 2002) ; (Liu et al. 2017). We decided to use this enzyme and test it with different expression levels and Acc/Mcr ratios. However, recently it was shown, that a dissection of the enzyme strongly increases productivity (Liu et al. 2013). In general, Mcr has two functional domains, the C-terminal domain catalyzes the reduction of malonyl-CoA to 3-oxopropanoate and the N-terminal domain catalyzes the further reduction to 3HPA. In the mentioned publication there wasn’t an exact explanation for the increased productivity of the dissected enzymes. Our hypothesis is, that the complete enzyme, which has its temperature optimum at 60°C, has a suboptimal protein folding at 37°C while the dissected enzymes are smaller and less bulky and therefore can fold easier. Deeper investigations of enzyme behavior additionally revealed that the C-terminal domain is much slower than the N-terminal domain and thereby the C-domain slows down the overall reaction (Liu et al. 2013). Therefore, we decided to use the complete enzyme of Mcr as well as the dissected enzymes McrC and McrN. There is also another Mcr from Sulfolobus tokodaii, which only catalyzes the rate-limiting step from malonyl-CoA to 3-oxopropanoic acid. Since McrC has the same function, we planned to coexpress Mcr from S. tokodaii together with McrN from C. aurantiacus and test, if the activity increases. All in all, we planned to use the acetyl-CoA carboxylase from E. coli, V. natriegens, C. glutamicum and S. elongatus, the Mcr from S. tokodaii and the Mcr, McrC and McrN from C. aurantiacus. First, we wanted to test enzyme activity and afterwards use the Marburg toolbox to build combinations of enzymes and expression levels.

The figure shows the composition of malonyl-CoA reductase and its splitted versions. We will test three versions: The whole Mcr, the splitted version McrN + McrC and a combination of McrN and Mcr from S. tokodaii — **Figure 4: Composition of malonyl-CoA reductase and its splitted versions**
It was published that splitting of Mcr leads to increased activity, therefore beside the whole enzyme we also want to test the splitted version McrN + McrC and a combination of McrN and Mcr from *S. tokodaii*

Rational Plasmid Design and Pathway Library

Rational design of plasmids

Synthetic biology proceeded significantly in recent years and we have much knowledge about rational design of circuits and pathways. Nevertheless, failures and unexpected outcomes slowdown scientific progress. To tackle this issue, we decided to follow two approaches. On the one hand we planned to model our pathway and predict a – theoretically – optimal pathway, which we can then construct and test. And on the other hand, we decided to build additionally to this particular pathway a whole library of many variations of this pathway with little differences. In this way we create many producer strains in parallel and speed up the metabolic engineering workflow. Our rational approach bases on a Michaelis-Menton-based metabolic analysis. For more details see our metabolic model. Since there are no stochiometric flux data about V. natriegens yet, we used the data from E. coli, which has a very comparable metabolome to V. natriegens (citation needed). For the model we only considered acc from C. glutamicum and mcr from C. aurantiacus, because they were from our point of view the most promising ones. The model predicted, that Acc is the rate limiting enzyme in the pathway and has to be expressed ten times stronger than Mcr. We used the promoter characterization data, obtained from our part collection team (For more details see the section about characterization of the promoters of our Marburg toolbox) and introduced them into the model. The resulting prediction was, that we have to use the promoter J23106 for acc and J23109 for mcr. Therefore, we designed the pathway plasmid according to this model. We built the plasmid based on the Marburg Toolbox with 4 transcriptional units. For the ori we chose ColEI, which has a broad host range and a high copy number in V. natriegens and E. coli (Jain et al. 2013), [Link zu qPCR von Carlos]). We decided for that one, because V. natriegens shall be a chassis for testing pathway variants, which then maybe will be transferred into other strains, therefore a broad host range vector is reasonable. For accBC and accD we chose the promoter J23106 and for mcr the promoter J23109 according to the model. For birA we also chose J23109 as promoter, because BirA is only relevant for activation of Acc and not for direct reaction catalysis. For ribosome binding sites and terminators, we used the same for each coding region, because otherwise we would have fluctuations which weren’t considered in the model. An illustration of the plasmid is shown in figure 5.

The scheme shows the composition of the level 2 plasmid, containing four transcriptional units for accBC, accD, birA and mcr. We chose the promoters according to the metabolic model [LINK] to achieve optimized expression strengths. accBC consists of 5'-Con1, J23106, B0030, accBC, B0015 and 3'-Con1, accD consists of 5'-Con2, J23106, B0030, accD, B0015 and 3'-Con2, birA consists of 5'-Con3, J23109, B0030, birA, B0015 and 3'-Con3, mcr consists of 5'-Con4, J23109, B0030, mcr, B0015 and 3'-Con5. — **Figure 5: Composition of our rationally designed pathway plasmid with the Marburg Collection**
The scheme shows the composition of the level 2 plasmid, containing four transcriptional units for *accBC*, *accD*, *birA* and *mcr*. We chose the promoters according to the metabolic model [LINK] to achieve optimized expression strengths.

Pathway library

For the second approach of pathway design, where we build a whole library, we also used the Marburg Toolbox [Marburg Toolbox soll ein Link zur Marburg Toolbox sein] to combine different promoters, ribosome binding sites and coding regions. In theory, the toolbox enables the plasmid assembly of 21780 different part combinations per transcriptional unit (22 promoters x 5 rbs x 6 terminators x 3 oris x 11 resistance cassettes; connectors weren’t considered). For a plasmid containing accBC, accD, birA and mcr that would make 6 x 1012 possible constructs, but assembly of so many versions is irrational. The better way is to select a few parts which are combined and then for the most promising versions one can think of more combinations. Therefore, we focused on eight promoters and three ribosome binding sites with different strengths. Our idea was to build pathway plasmids containing the whole pathway, as well as plasmids containing only the acetyl-CoA carboxylase genes or only the mcr gene, which can then be cotransformed (Table 1). To enable cotransformation, we used the oris ColEI and p15a, which are compatible to each other.

Table 1: Plasmid categories for our pathway library

Plasmid Category	Inserts	Description
I	mcr	These plasmids contain the whole mcr gene from C. aurantiacus and can be cotransformed with each ColEI plasmid, because they have a p15a ori.
II	mcrN mcrC	These plasmids contain the dissected mcr fragments from C. aurantiacus and can be cotransformed with each ColEI plasmid, because they have a p15a ori.
III	mcrN mcrSt	These plasmids contain the N-terminal mcr fragments from C. aurantiacus and the mcr from S. tokodaii. They can be cotransformed with each ColEI plasmid, because they have a p15a ori.
IV	accBC accD birA	These plasmids contain all required genes for the acetyl-CoA carboxylase from C. glutamicum and can be cotransformed with each p15a plasmid, because they have a ColEI ori.
V	accBC accD birA mcr	These plasmids contain all genes for the complete pathway. They combine the Acc from C. glutamicum with Mcr from C. aurantiacus.
VI	accBC accD birA mcrN mcrC	These plasmids contain all genes for the complete pathway. They combine the Acc from C. glutamicum with the dissected Mcr from C. aurantiacus.
VII	accBC accD birA mcrN mcrSt	These plasmids contain all genes for the complete pathway. They combine the Acc from C. glutamicum with the N-terminal Mcr-domain from C. aurantiacus and Mcr from S. tokodaii.

The idea behind the plasmids with the whole pathway is, that the cells only have to maintain one particular plasmid. The strategy for the single-enzyme plasmids is that by cotransformation different combinations can be made to increase the number of possible pathway variants. The different plasmid types can be seen in table 1, all in all we designed the construction of 1446 plasmids for our pathway library. For plasmids of category I (see table 1), we combined eight promoters with three ribosome binding sites, a kanamycin resistance gene and a p15a ori, giving us 24 constructs. For plasmids of category II and III we used the same promoters and ribosome binding sites for both transcriptional units, leading to 576 plasmids respectively. For plasmids of category IV we restricted our selection to three promoters and three ribosome binding sites. Since a misbalance of accBC- and accD expression levels leads to occlusion bodies and decreased activity we decided to always use the same promoters and rbs for both genes. In this category we get 81 plasmid variants. For categories V, VI and VII, containing the whole pathway, the combinatorial possibilities were very high, so we decided to further restrict the selection by always coupling strong promoters to strong rbs, medium promoter to medium rbs and weak promoter to weak rbs. Therefore, we got a total of 27 plasmids for category V and 81 plasmids for category VI and VII, respectively. We also considered to integrate one or several genes, therefore we planned to search for integration sites and to characterize them. [Link zu Bauersachs Teil]

Biosensors and Product Screening

Our biosensors

The design of our biosensors was largely guided by the goal of having as little impact on the host metabolism, while simultaneously being simple and easy to implement. We chose to design all parts to be compatible with the Marburg toolbox, RFC10, RFC25, andRFC1000, to make it as easy as possible for us and future users to implement it and adapt it to new requirements.

**Figure 6: Scheme of the 3-HPA sensor**
This part enables creation of new LVL0 parts by replacing the parts sequence with the desired insert. The sfGFP expression cassette is flanked by BsmBI recognition sites

Annotation of the location for the regulatory sequences upstream of the HdpR coding sequence was insufficient for us to precisely remove the activator binding site or the constitutive promoter driving HdpR expression. So, we included an about 200bp long region upstream of the HdpR, which should be enough to include all regulatory motifs. Also included is the HdpR coding sequence with its own ribosomal binding site (RBS). and promoter. This results in a part that includes the whole sensing function, but does not quite fit our scheme of clearly defined functional units. Since expression of any part cloned behind the regulatory region is directly controlled by it, and the part serves as both, a promoter and RBS. It is basically an inducible promoter fused to an RBS. Thus, we decided to give the part the overhangs of an RBS. on the downstream and a promoter on the upstream end.

In order to obtain the desired region from the P. putida KT2440 genome, we planned to amplify it via PCR and then isolate the fragment by gel purification of the corresponding band. Into the primer-design we factored in the overhangs we needed to make it Marburg toolbox compatible. So, we gave them non-binding overhangs which automatically added the necessary BsmBI cut sites.

**Figure 7: Scheme of the malonyl-CoA sensor**
This sensor consists of two unique level 0 parts that combined with the lux operon as reporter constitute the biosensor

For the malonyl-CoA sensor, our proceeding was very similar. Just that we knew the exact location of all functional parts, except of the location of the promoter, form the 2015 paper by Liu et al and from annotations in the published genome (Liu et al. 2015). Therefore, we could implement it fully into the Marburg toolbox, by simply treating the FapR coding sequence as coding sequence. Not that straightforwardly was the design of the fapO regulator binding site. Since we did not know the promoter strength, and the mechanism of regulation was independent of the promoter sequence, we decided to implement a promoter from the Andersen library, J23100. By placing it right in front of the fapO site, we created a construct that, in theory, should give us a FapR controlled promoter. For the amplification of the individual parts from the B. subtilis 168 genome, again, we used primers sporting non-binding regions, adding the necessary overhangs for our toolbox.

For future projects, other sensors could also be employed. In a publication from 2016, Rogers et al. propose an alternative sensor for 3HPA with a lower perception threshold (Rogerset al.2016). We choose not to implement it in our design because of its more complicated mechanism that requires the implementation of more genes. Intriguingly, it works by first converting 3HPA to acrylate via a pathway of three heterologous enzymes. The acrylate is then sensed by a transcription factor, similarly to how our sensors described above work. They also report its successful implementation and application in increasing the 3HPA titer up to 23-fold. It is conceivable that with a little more effort our system could yield comparable results. Off cause, s we have the additional benefit of having a much shorter iteration interval.

**Figure 8: Biosensors for malonyl-CoA and 3-hydroxypropionate**
The malonyl-CoA Sensor FapR is a dimeric protein which binds to the fapO operator, thereby inhibiting expression of the reporter gene. When malonyl-CoA is bound, the regulator dissociates from the DNA and gene expression can occur. The Sensor for 3-hydroxypropionate works reversely. In the absence of 3HPA the 3HPA sensor HpdR binds to its operator, allowing the polymerase to read the HpdR promoter.

Further product sceening

The usage of biosensors for product screening is a fast and cheap method enabling high throughput analysis of pathway libraries. However, to further characterize the pathway variants and implemented enzymes we planned to measure enzyme activities. In that way we can quantitatively benchmark each enzyme and establish in vitro assays as a prototyping engineering approach. For the malonyl-CoA reductase, it is easy to measure reaction velocity, because the enzyme converts NADPH into NAD+ and both compounds have different absorbances at 345nm (De Ruyck et al. 2007). Based on that we designed an enzyme assay, where we use cell extract from Mcr producing cells or directly purified Mcr and mixed it with a defined amount of NADPH and MgCl2. By measuring the absorbance, one can determine if NADPH concentration changes. By adding malonyl-CoA to the reaction mix we then start the reaction and are able to observe a decrease in absorbance because the reaction consumes NADPH. Here the starting concentrations of malonyl-CoA and NADPH have to be high enough to ensure, that nearly each Mcr molecule in the mixture is saturated and the reaction goes proportional to enzyme concentration. By determining the slope of the linear decrease of absorbance it is possible to calculate the specific activity. The specific activity is a value which characterizes the ability of enzymes, to convert a substrate into a product. It is defined as the moles of product formed by an enzyme in a given amount of time under given conditions per milligram of total proteins, the unit is µmol min-1mg-1. To calculate the specific activity, one needs the slope of the measured absorbance. This value defines the change of absorbance per time. By using the Lambert-Beer law it is possible to calculate the amount of NADPH converted into NADP+ per time (Lambert and Anding, 1760).

ε is 3400 M-1cm-1 and d is 1cm. By dividing with time and enzyme volume, we get the final formula to calculate the NADPH conversion per time.

Since one conversion of malonyl-CoA corresponds to two conversions of NADPH, we can calculate, how much malonyl-CoA is consumed per time. This value is defined as enzyme activity with the unit µmol/min and depends on the enzyme amount. By taking the protein amount into consideration, the specific activity can be calculated:

For Acc the enzyme assay must be redesigned because carboxylation of acetyl-CoA doesn’t consume NADPH or other photometrically measurable compounds. Therefore, we decided to do a coupled enzyme assay, where we not only add Acc (or Acc containing cell extract) into the reaction mixture but also purified Mcr as indicator enzyme which metabolizes malonyl-CoA as product of Acc and thereby converts NADPH to NAD+. It is crucial to add large amounts of Mcr into the mixture to ensure, that Acc is the rate-determining enzyme and not Mcr. For the Acc activity assay we planned to mix purified Acc or cell extract with ATP, MgCl2, NaHCO3, purified Mcr and NADPH. Our strategy was to start the reaction by adding Acetyl-CoA as precursor for all reaction steps and then measure absorbance loss just like in the Mcr assay. The calculations for the specific activity are also the same. Beside evaluating enzyme performances, we also wanted to specifically detect and quantify 3HPA in our cells. We planned to do that via HPLC and mass spectrometry. Establishing a HPLC/Mass spectrometry protocol is also essential for calibrating our biosensors. The identification of malonyl-CoA as intermediate of our pathway is not so challenging because the CoA appendix lends it a larger mass of 853.12g/mol, which is specific for malonyl-CoA. For 3HPA the specific identification is more difficult because V. natriegens is able to fermentatively produce lactate, a structural isomer of 3HPA. To distinguish between 3HPA and lactate we chose to use a triple quadrupole mass spectrometer. In such a device two quadrupole mass analyzers are toggled in a row and separated by a third quadrupole, responsible for the collision-induced dissociation of ions. Our strategy was to cultivate cells, performing the 3HPA pathway or at least one of its reaction steps and then lysate the cells and measure lactate- as well as 3HPA-concentrations in the cell extract and also in the medium, because there are no hints if 3HPA stays in the cell or is released into the medium.

Directed Evolution and Outlook

Directed evolution

Metabolic Engineering offers many possibilities to create suitable pathways but it consists of more than just building them. Pathway optimization is also crucial for efficient productivity. In our workflow for metabolic engineering we want to implement directed evolution to drive metabolic fluxes towards 3HPA production. We elaborated two strategies to do that. Our first strategy was to implement genes for the conversion of 3HPA into succinate. In that way, we lead our product back into the central metabolism. By deleting all other genes responsible for succinate production, the cells are forced to use the 3HPA bypass to get succinate, which is needed in the TCA cycle. The big advantage of using V. natriegens for that purpose is that its short doubling time increases the number of cell divisions per time and thereby increases the number of mutations in a given time frame. Our second strategy is to use MAGE [Link zu project→description→metabolic engineering→Directed evolution→MAGE erklärung] together with our biosensors and fluorescence-activated cell sorting ( FACS ), to cause various genome modifications and rapidly select for cells with increased 3HPA productivity (Figure 9). We looked into literature and found 16 possible genome modifications for 3HPA production made by other groups. Table 2 shows a list of all modifications. Our plan was to order oligos for each of these modifications, leading to upregulation, downregulation, deletion or site-directed mutagenesis of the respective gene. The aimed modification for each gene is shown in the table. For the next step we decided to use natural transformation for integration of these oligos into the strain. Because each cell will take up different numbers and different combinations of oligos, we will get a population of genomic variations [Link zu Project → Design → Strainies → Mugent]. The genetically diverse population can then be put into FACS and we can select for cells with the strongest signal and thereby the best productivity. Finally, the whole cycle of MAGE and FACS can be repeated, leading to continuously improved pathways. We believe that the combination of MAGE- FACS with the fast growth of V. natriegens will speed up the process of metabolic engineering and pathway optimization significantly!

Outlook

In our project we try to accelerate the process of metabolic engineering in all of its facets and thereby find a suitable pathway variant to achieve high 3HPA productivity. Nevertheless, there will still be many possibilities to further improve the production of 3HPA in V. natriegens. Table 2 shows a list of 16 genome modifications, that were made in other organisms with the same or a similar pathway to direct the flow towards 3HPA. Some of these modifications target for instance NADPH homeostasis. During glycolysis cells produce NADH while the production of 3HPA depends on NADPH. Therefore, a misbalance between NADH and NADPH could arise, but overexpression of pntAB (nicotinamide nucleotide transhydrogenase) or gapA (NADP+-dependent glycerol-3-phosphate dehydrogenase) has been shown to stabilize cofactor balance (See table for references). Another possibility to enrich the NADPH pool is to downregulate glycolysis while upregulating pentose phosphate pathway, which produces NADPH instead of NADH. This can be achieved by deletion of NAD+-dependent gapA or deletion of pfkA (phosphofructokinase). Another critical point in the 3HPA route are side reactions. Pyruvate can be reduced to lactate or decarboxylated to acetate, what could be suppressed by deletion of ldhA (lactate dehydrogenase) or pox (pyruvate decarboxylase). The flow from pyruvate to acetyl-CoA can be increased by overexpression of pdh (pyruvate dehydrogenase). For acetyl-CoA there are also side reactions, for instance hydrolysis to acetate. This reaction can be suppressed by deletion of pta (phosphotransacetylase) and ackA (acetate kinase) or by overexpression of ald6 (aldehyde dehydrogenase) and acs (acetyl-CoA synthase) for the reverse reaction to acetyl-CoA. Malonyl-CoA can also go into side reactions for fatty acid biosynthesis instead of the reduction to 3HPA. However, downregulation of fabAB (fatty acid biosynthesis) or inducible inactivation of fabAB has been showed to increase 3HPA titer. Additionally it has also been shown that site-directed mutagenesis of acc and mcr genes increases product titer (Kildegaard et al. 2016); (Liu et. al. 2016).

Table 2: Genome modifications for further pathway optimization

Gene	Organism	Function	Modification	Reference
*gapA*	S. cerevisiae	GAPDH (NADP-dependent)	Overexpression	(Chen et. al. 2014)
*mis1*	S. cerevisia	malate synthase	Deletion	(Chen et. al. 2014)
*pox*	E. coli	pyruvate decarboxylase	Deletion	(Dittrich et. al. 2008)
*acc1*	S. cerevisiae	acetyl-Coa carboxylase	Site-directed mutagenesis	(Kildegaard et al. 2016)
*ald6*	S. cerevisiae	aldehyde dehydrogenase	Overexpression	(Kildegaard et al. 2016)
*pdh*	S. cerevisiae	pyruvate dehydrogenase	Overexpression	(Kildegaard et al. 2016)
*acs*	E. coli	acetyl-CoA synthase	Overexpression	(Lin et. al. 2006)
*bauA*	From P. putida into E. coli	beta-ala-pyruvate aminotransferase	Complementation	(Liu et. al. 2016)
*panD*	E. coli	aspartate-1-decarboxylase	Deletion	(Liu et. al. 2016)
*ackA*	E. coli BL21 (DE3)	acetate kinase	Deletion	(Rathnasingh et al. 2012)
*ldhA*	E. coli BL21 (DE3)	lactate dehydrogenase	Deletion	(Rathnasingh et al. 2012)
*pta*	E. coli BL21 (DE3)	phosphotransacetylase	Deletion	(Rathnasingh et al. 2012)
*sucAB*	E. coli BL21 (DE3)	2-oxo-ketoglutarate dehydrogenase	Deletion	(Rathnasingh et al. 2012)
*gapA*	C. glutamicum	GADPH	Deletion	(Siedler et. al. 2013)
*pfkA*	C. glutamicum	phosphofructokinase	Deletion	(Siedler et. al. 2013)
*fabAB*	E. coli	fatty acid biosynthesis	Downregulation	(Yang et. al. 2015)
*pntAB*	E. coli BL21 (DE3)	nicotinamide nucleotide tranhydrogenase	Overexpression	(Rathnasingh et al. 2012)
*mcrC*	E. coli (from C. aurantiacus)	malonyl-CoA reductase	Site directed mutagenesis	(Liu et. al. 2016)

We chose the production of 3HPA as application to show the functionality of the Marburg toolbox and as proof-of-principle for our metabolic engineering workflow. However, our goal is not to produce high amounts of 3HPA, but rather to show that V. natriegens can be used as test chassis for rapid testing of many pathway variations to find the best one. The next step would therefore be to evaluate, if the pathway with its genetic composition can be transmitted to other organisms to see if the route still works. Besides that, there are many other compounds of medical and economical relevance, which still can’t be produced biotechnologically. The production of the anticancer drug taxol involves for instance 19 enzymes and is therefore a very challenging pathway with lot of bottlenecks and tunable reactions (Croteau et al., 2006). We think it would be beneficial to use V. natriegens as chassis for testing this challenging pathway or parts of it. Its short doubling time and genetic accessibility will accelerate the process of enzyme engineering and finetune expression levels or gene combinations. Another crucial aspect, that has to be tackled is the universality of our workflow. We involved biosensors for product screening but for many metabolites there are still no known sensor proteins. There are three possibilities to circumvent this issue. The first is to find new biosensors, what is complicated, because many compounds, especially drugs for medical purposes don’t occur in the nature, therefore it is unlikely that there are still proteins binding specifically to these compounds. The second possibility would be to engineer proteins to become drug-specific sensors. The third possibility is to upscale LC-MS techniques to enable larger sets of measurements and faster screening. LC-MS has an unbeaten accuracy but its bottleneck is the velocity. The average time needed for one sample lies between 1 and 10 minutes, making measurements of large sets of samples impossible. Furthermore LC-MS requires high amounts of buffer. However, as mentioned in the Labautomation-section [soll ein Link sein], the company Labcyte developed a system to use acoustic waves for pipetting nanoliter droplets into microtiter plates in a very fast manner. Now they invented an acoustic mass spectrometer, which combines acoustic nanodroplet formation with mass spectrometry. The device is able to measure 3 samples per second without the need of a mobile phase. Such a device could be used instead of biosensor-based flow cytometry to broader the applicability of our metabolic engineering workflow.

References

Abdel-Hamid, A.M., Cronan, J.E., 2007. Coordinate expression of the acetyl coenzyme A carboxylase genes, accB and accC, is necessary for normal regulation of biotin synthesis in Escherichia coli. J. Bacteriol. 189, 369–376. https://doi.org/10.1128/JB.01373-06

Chen, Y., Bao, J., Kim, I.-K., Siewers, V., Nielsen, J., 2014. Coupled incremental precursor and co-factor supply improves 3-hydroxypropionic acid production in Saccharomyces cerevisiae. Metab. Eng. 22, 104–109. https://doi.org/10.1016/j.ymben.2014.01.005

Croteau, R., Ketchum, R.E.B., Long, R.M., Kaspera, R., Wildung, M.R., 2006. Taxol biosynthesis and molecular genetics. Phytochem. Rev. https://doi.org/10.1007/s11101-005-3748-2

De Ruyck, J., Famerée, M., Wouters, J., Perpète, E.A., Preat, J., Jacquemin, D., 2007. Towards the understanding of the absorption spectra of NAD(P)H/NAD(P)+ as a common indicator of dehydrogenase enzymatic activity. Chem. Phys. Lett. 450, 119–122. https://doi.org/10.1016/j.cplett.2007.10.092

Dittrich, C.R., Vadali, R. V., Bennett, G.N., San, K.-Y., 2008. Redistribution of Metabolic Fluxes in the Central Aerobic Metabolic Pathway of E. coli Mutant Strains with Deletion of the ackA-pta and poxB Pathways for the Synthesis of Isoamyl Acetate. Biotechnol. Prog. 21, 627–631. https://doi.org/10.1021/bp049730r

Gande, R., Dover, L.G., Krumbach, K., Besra, G.S., Sahm, H., Oikawa, T., Eggeling, L., 2007. The two carboxylases of Corynebacterium glutamicum essential for fatty acid and mycolic acid synthesis. J. Bacteriol. 189, 5257–5264. https://doi.org/10.1128/JB.00254-07

Hügler, M., Menendez, C., Schägger, H., Fuchs, G., 2002. Malonyl-coenzyme A reductase from Chloroflexus aurantiacus, a key enzyme of the 3-hydroxypropionate cycle for autotrophic CO(2) fixation. J. Bacteriol. 184, 2404–10. https://doi.org/10.1128/JB.184.9.2404-2410.2002

Jain, A., Srivastava, P., 2013. Broad host range plasmids. FEMS Microbiol. Lett. 348, 87–96. https://doi.org/10.1111/1574-6968.12241

Kildegaard, K.R., Jensen, N.B., Schneider, K., Czarnotta, E., Özdemir, E., Klein, T., Maury, J., Ebert, B.E., Christensen, H.B., Chen, Y., Kim, I.K., Herrgård, M.J., Blank, L.M., Forster, J., Nielsen, J., Borodina, I., 2016. Engineering and systems-level analysis of Saccharomyces cerevisiae for production of 3-hydroxypropionic acid via malonyl-CoA reductase-dependent pathway. Microb. Cell Fact. 15. https://doi.org/10.1186/s12934-016-0451-5

Lambert, J.H., Anding, E., 1760. Photometrie. Photometria, sive De mensura et gradibus luminis, colorum et umbrae (1760) : Lambert, Johann Heinrich, Internet Archive, Photometria, sive De mensura et gradibus luminis, colorum et umbrae. Eberhardt Klett, Augsburg.

Lin, H., Castro, N.M., Bennett, G.N., San, K.-Y., 2006. Acetyl-CoA synthetase overexpression in Escherichia coli demonstrates more efficient acetate assimilation and lower acetate accumulation: a potential tool in metabolic engineering. Appl. Microbiol. Biotechnol. 71, 870–874. https://doi.org/10.1007/s00253-005-0230-4

Liu, C., Ding, Y., Zhang, R., Liu, H., Xian, M., Zhao, G., 2016. Functional balance between enzymes in malonyl-CoA pathway for 3-hydroxypropionate biosynthesis. Metab. Eng. 34, 104–111. https://doi.org/10.1016/j.ymben.2016.01.001

Liu, C., Wang, Q., Xian, M., Ding, Y., Zhao, G., 2013. Dissection of Malonyl-Coenzyme A Reductase of Chloroflexus aurantiacus Results in Enzyme Activity Improvement. PLoS One 8, 1–8. https://doi.org/10.1371/journal.pone.0075554

Liu, D., Xiao, Y., Evans, B.S., Zhang, F., 2015. Negative Feedback Regulation of Fatty Acid Production Based on a Malonyl-CoA Sensor-Actuator. ACS Synthetic Biology 4, 132-40 http://www.ncbi.nlm.nih.gov/pubmed/24377365

Rathnasingh, C., Raj, S.M., Lee, Y., Catherine, C., Ashok, S., Park, S., 2012. Production of 3-hydroxypropionic acid via malonyl-CoA pathway using recombinant Escherichia coli strains. J. Biotechnol. 157, 633–640. https://doi.org/10.1016/j.jbiotec.2011.06.008

Rogers, J.K. & Church, G.M., 2016. Genetically encoded sensors enable real-time observation of metabolite production. Proc. Natl. Acad. Sci. U.S.A. 113, 2388-93. https://doi.org/10.1073/pnas.1600375113

Siedler, S., Lindner, S.N., Bringer, S., Wendisch, V.F., Bott, M., 2013. Reductive whole-cell biotransformation with Corynebacterium glutamicum: improvement of NADPH generation from glucose by a cyclized pentose phosphate pathway using pfkA and gapA deletion mutants. Appl. Microbiol. Biotechnol. 97, 143–152. https://doi.org/10.1007/s00253-012-4314-7

Valdehuesa, K.N.G., Liu, H., Nisola, G.M., Chung, W.J., Lee, S.H., Park, S.J., 2013. Recent advances in the metabolic engineering of microorganisms for the production of 3-hydroxypropionic acid as C3 platform chemical. Appl. Microbiol. Biotechnol. 97, 3309–3321. https://doi.org/10.1007/s00253-013-4802-4

Vidra, A., Németh, Á., 2017. Bio-based 3-hydroxypropionic Acid : A Review. period. Polytech. Chem. Eng. 1–11. Yang, Y., Lin, Y., Li, L., Linhardt, R.J., Yan, Y., 2015. Regulating malonyl-CoA metabolism via synthetic antisense RNAs for enhanced biosynthesis of natural products. Metab. Eng. 29, 217–226. https://doi.org/10.1016/j.ymben.2015.03.018

@@ Line 406: / Line 406: @@
 <div class="btn_expand">Strains</div>
 	    	<div class="content">
+<p>There are different <i>E. coli</i> strains for a variety of applications. In every lab diverse strains are stored in freezers for methods like cloning, protein expression and protein interaction. And all of them are being used. For our project Vibrigens, we decided to establish the world’s fastest three lab strains suited for specific needs based on our <i>V. natriegens</i> wildtype strain. Here we describe how we designed our three strains VibriClone, VibriExpress and VibriInteract based on the genotype of  commonly used <i>E. coli</i> strains.</p>
 <div style="display:flex;justify-content: space-around; align-items: center;">