Team:MIT/Design

Design
System Design Overview
Our goal was to port the S. mutans ComCDE system into HEK293T mammalian cells. We wish to equip mammalian cells with the ability to sense and disrupt S. mutans biofilm formation. ComCDE is a two-component signaling (TCS) system, a type of bacterial quorum sensing system). The sensing system has two main components: ComD (a histidine kinase receptor protein) and ComE (a response regulator), whose native functions are described in detail in the ComCDE page. ComE binds to a promoter sequence. ComD is activated by the Competence-Stimulating Peptide (CSP). Below, we describe key design choices, including the modifications made to the native pieces of the ComCDE pathway to optimize its expression and functionality in mammalian cells.
ComD Design
In order to adapt ComD to mammalian cells, we made several modifications. First, we chose ComD amino acid sequences from three common strains of S. mutans native to the oral cavity (UA159, SM6, and NN2025) and used the IDT codon optimization tool to human optimize the DNA sequence of each protein. Second, we fused CD4 signaling peptide (BBa_I712009) to the 5’ prime end of comD in order to guide comD to expression on the membrane of HEK cells. Finally, we fused the coding sequence of mKO2, a fluorescent protein, 3’ prime so that we could easily track the location of the comD protein.
ComE Design
The ComE protein has two main structural domains - the acceptor domain and the effector domain. The acceptor domain is a protein domain unique to ComE which contains the aspartic acid that is the target of phosphorylation. The effector domain binds to DNA. It is characterized as a LytR domain and is widely conserved across several subgroups of TCS response regulators but is not extremely well understood.1

In order to adapt ComE to mammalian cells, we made several modifications. First, we chose ComE amino acid sequences from three common strains of S. mutans native to the oral cavity (UA159, SM6, and NN2025) and used the IDT codon optimization tool to human optimize the DNA sequence of each protein. Additionally, we fused ComE C terminal to a nuclear localization sequence (NLS), VP16 activating domain, and a nuclear export sequence (NES). ComE’s native S. mutans don’t have a nucleus, so we believed that adding an NLS and NES would assist in shuttling the protein to and from the mammalian nucleus, to increase interactions with DNA (in the nucleus) and ComD (in the cell membrane). Since transcription machinery differs in prokaryotes and eukaryotes, we also fused the VP16 transcription activating domain to the ComE C terminal to ensure that ComE would activate transcription upon binding DNA. The NLS, VP16, and NES sequences were taken from Systematic Transfer of Prokaryotic Sensors and Circuits to Mammalian Cells (Weiss and Voigt et al., 2014. DOI: 10.1021/sb5002856).
Constitutively Active ComE: We utilized two approaches to design constitutively active ComE control proteins, so we could experimentally validate that ComE was driving the correct promoter once ported into HEK cells. This validation isolates the ComE promoter interaction so that we could test for ComE and promoter functionality without needing CSP-ComD functional interactions.

1.) Our first approach was to make a variation of each of the three ComE proteins with a D60E phosphomimetic amino acid substitution, in order to mimic the constitutively phosphorylated state of ComE. The 60th amino acid is the aspartic acid phosphorylation site, and glutamic acid is a typical aspartate phosphomimetic. This approach was previously used in S. mutans ComE studies.2

2.) The second approach was to isolate the effector region of ComE. In some response regulators, the acceptor portion blocks the effector portion from binding DNA in the deactivated state, so isolating the effector region can create a mimic of the activated protein.
Competence Stimulating Peptide (CSP) Design
We designed a synthetic CSP fused to a fluorophore (Alexa-fluor 405) to allow for visualization of ligand binding process with ComD.
ComE Binding Promoter Design
The ComE binding site typically includes 37bp with two 11bp direct repeat site with consensus sequence TCBTAAAYSGT.
1.) To design a comE binding promoter, we chose 3 comE binding sites with different binding affinity from comE downstream gene’s promoter sequence. Specifically, the high, medium and low-affinity binding sequences are from comC, ftf and gtfc promoter with IC50 value 2.9 ± 0.4, 11 ± 1.0 and 15 ± 3.8 (10-9 M) respectively3. In each engineered promoter, we fused three repeated comE binding sites at 5’ prime and a minCMV promoter at 5’ prime, generating 3 comE binding promoter with 3 levels of binding affinity. We included three repeated comE binding site to increase the likelihood that comE binds to the promoter because comE have to shuttle back and forth between nucleus and cytoplasm. Also, multiple binding site is needed because comE is activated as oligomer. MinCMV promoter is included at 3’ prime so that when comE-VP16 binds to promoter, VP16 will recruit the transcription complex to minCMV promoter and activates downstream gene transcription of actuation protein.

2.) The second approach was to isolate the effector region of ComE. In some response regulators, the acceptor portion blocks the effector portion from binding DNA in the deactivated state, so isolating the effector region can create a mimic of the activated protein.