Team:Vilnius-Lithuania-OG/Design Regulatory

Collaborations

Regulatory
CAT-Seq Design

Introduction

While Catalytic Activity Sequencing began as a method for catalytic biomolecule activity recording, we have also create a way to adjust CAT-Seq to record activities of regulatory part libraries. In addition to the activities, cross-interactions of different regulatory parts can also be measured.

We highly suggest to read the main design section of CAT-Seq first, as we will not go over the main design scheme here and will only build upon it further.

Adjusting CAT-Seq system for Regulatory Sequences

Just like in the main CAT-Seq design page, we again divide the Regulatory CAT-Seq design page into two parts:

1. Design description - an in depth explanation of how a particular part of our system works, how it should be used in real-life situation and why such design choices were adopted.

2. Proof of Concept - a story about how we applied the design using specific components and parts to build Regulatory CAT-Seq and demonstrate that it works as intended.

General description

As already established, the Catalytic Activity Sequencing records the activity of different catalytic biomolecules in separate water droplets simultaneously. That is achieved with the help of Substrate Nucleotides. In turn, unique catalytic biomolecules produce different amounts of Product Nucleotides, which can then be quantified by adding them to their own amplified template DNA and later sequencing those molecules.

Yet, the activity of the catalytic biomolecule is not the only aspect that can influence the amount of Product Nucleotides that are produced. If all of the droplets would contain the same catalytic biomolecule, but each droplet would have a different concentration of that biomolecule, we would in result get different amounts of Product Nucleotides. For example, droplets with large amount of biomolecules may produce a large number of Product Nucleotides and vice-versa.

Using such logic, we can hypothesise that any sequence element that can adjust the expression of the catalytic biomolecule can be measured and its influence on the Product Nucleotide - recorded. In other words, this influence is the activity, or strength of regulatory element.

Proof of concept

To demonstrate that CAT-Seq can be applied to record the activity and cross interactions of regulatory sequences we have chosen two main objects: Ribosome Binding Sites and synthetic Toehold Switches.

1. Regulatory part activity recording

Ribosome binding sites (RBS) are sequences that are responsible for recruiting the ribosomes during the initiation of protein translation. RBS can have different sequences, and depending on those sequences, the frequency of ribosome binding rate to the RBS will be different. In this way, gene of interest translation rate can be controlled in a constitutive manner.

Having a large RBS library with known activities would allow to precisely pick an RBS sequence with a specific activity, while engineering sensitive and complex genetic circuits. Yet, it is not easy to screen large libraries of RBS, as each individual known sequence must be selectively placed in a physically separated compartment and then measured.

With CAT-Seq, such activity recording becomes much easier. While keeping all the workflow from the main Catalytic Activity Sequencing design, all we need to do is to change the way we prepare the library. Firstly, instead of using a library of catalytic biomolecules, we can use a single esterase enzyme. Then, specific or random RBS sequence library can be prepared and attached to the esterase enzyme sequence. After that, every step of CAT-Seq method is the same.

After the library is encapsulated into droplets, depending on the RBS sequence strength some droplets may have different amounts of the esterase. For example, the droplets with more esterase will produce more Product Nucleotides and vice-versa. After the amplification and sequencing steps we can learn two main things about the measured library - the RBS sequence and its relative RBS strength in the form of Product Nucleotide concentration for every library variant screened.

For our proof of concept experiments, we have chosen to measure multiple already established RBS sequences from the Registry of Standard Biological Parts. First, we measured the activities of RBS using standard methods as a control. Next, we have measured the same constructs using the CAT-Seq system.

2. Regulatory part interaction recording

Next, we want to give an example on how to record regulatory part interactions using CAT-Seq. In this case, we will be using Toehold Switches.

The Toehold Switch systems are composed of two RNA strands referred to as the Switch and Trigger. The Switch RNA contains the coding sequence of the gene being regulated. The Switch RNA forms a hairpin structure that includes the RBS site. While the hairpin structure is formed, the translation is inhibited. The Trigger RNA is a molecule that can selectively bind to the Switch RNA region and expose the gene RBS site for ribosomes. Once that happens, the protein translation can be initiated.

While these riboregulators are extremely useful in engineering synthetic biology circuits, there is still a huge lack of orthogonal Toehold Switch and Trigger pairs. The orthogonal pairs are needed in order to create complex circuits, which require a large amount of logic functions. Yet, finding properly working pairs of Toehold Switches is still difficult, as in-silico methods still require more experimental data to generate well-working Toehold Switch pairs from scratch. Moreover, creating a Toehold Switch pair that is not only working by itself, but also does not cross interact with other pairs is an even more daunting task.

To give an example, if we had 50 working Toeholds and wanted to see how they would interact with each other using currently available methods, we would have to do 2500 separate measurements. For that reason, we want to demonstrate that CAT-Seq allows to measure regulatory part cross-interactions simultaneously and with ease.

Once again, only the first part of general CAT-Seq design needs to be changed - the library preparation. Instead of using a catalytic biomolecule library, a single enzyme is used. Then, libraries need to be prepared - one for Toehold Switches and another for Trigger RNA. The Trigger RNA libraries also require a separate T7 promoter for RNA expression. Then, both of those libraries must be ligated to the enzyme DNA fragment. Resulting library contains fragments which have the same catalytic biomolecule, yet each of them have a random combination of a specific Toehold Switch and RNA Trigger.

After the encapsulation, the droplets which have the correct Trigger RNA and Toehold Switch combination (in which Trigger RNA binds to that specific Toehold Switch), can produce catalytic biomolecules. In turn, those catalytic biomolecules can produce the Product Nucleotides.

In other droplets, where the Trigger RNA binds to the Toehold Switch with weaker affinity, there are also less catalytic biomolecules and Product Nucleotides produced. Finally, the droplets in which the Trigger RNA does not bind to the Toehold Switch, there are no biomolecules or product nucleotides produced.

After the Amplification and Sequencing steps, from each unique sequenced single molecule we can receive the information about the Toehold Switch and Trigger RNA sequence pairs, and also the information about how that regulatory pair interacted. In this case the interaction stands for how strongly the specific Trigger RNA bound to the Toehold Switch.

For our proof of concept, we have chosen multiple Toehold Switches which were already described in literature. We have first prepared a small library of different combinations of Toehold Switches and Trigger RNAs. Then, we measured those interactions using standard, low-throughput methods. Lastly, we have used CAT-Seq to determine those cross-interactions in high-throughput.


In this way, CAT-Seq can be applied to conveniently record the activities of large number of regulatory parts and at the same time assess their cross-interactions.