BBrickIt
BioBrick parts and standard assembly
A BioBrick part can be defined as a DNA sequence (encoding a particular biological function) that conforms to a set of standard rules. The Registry of Standard Biological Parts consists of over 2000 standard parts, each of which can be assembled according to BioBrick assembly standards to create multi-component parts of increasing complexity and functionality (Shetty et al., 2008).
There are different assembly standards, namely BioBrick Standard Assembly (RFC 10), BioBrick BB-2 Assembly (RFC 21), Silver/Fusion Bricks (RFC 23), Freiburg Fusion Protein (RFC 25) and BioBrick MoClo (RFC 1000). (Registry of Standard Biological Parts, iGEM; Knight, 2007; Knight, 2008; Anderson et al., 2009; Philips et al., 2006; Grünberg et al., 2009).
The BioBrick assembly standard RFC 10 is the most widely used assembly standard (Registry of Standard Biological Parts, iGEM). Each year hundreds of iGEM teams all over the world contribute and submit BioBrick compatible parts to the registry. The BioBrick standard has a set of stringent requirements for a part to be RFC 10 compatible:
Should not contain Illegal and Avoidable sites
Allowed sequences within BioBrick parts include any DNA sequence
which does not contain the following sequences:
Illegal
EcoRI (gaattc)
XbaI (tctaga)
SpeI (actagt)
PstI (ctgcag)
Avoidable
NotI (gcggccgc)
Should have appropriate Prefix and Suffix
Each BioBrick part must contain the following sequence as suffix downstream of the 3' end of the part:
T ACTAGT A GCGGCCG CTGCAG
The prefix used for coding regions is different from the sequence for non-coding BioBrick parts. This is to allow the construction of ribosomal binding sequences 5' of
coding regions.
1. For non-coding Biobrick parts this prefix must be placed 5' upstream of
the part:
GAATTC GCGGCCGC T TCTAGA G
2. For BioBrick parts coding for proteins, the following sequence should be placed 5' upstream of the start codon (ATG) of the coding region:
GAATTC GCGGCCGC T TCTAG
Should have appropriate terminating sequence
All parts containing start codons other than ATG must be changed to possess ATG as the start codon. Though not mandatory, it is highly recommended, that all coding regions terminate with in-frame TAATAA stop codons, replacing other stop codons (TGA, TAG) if needed.
To ensure that the biological part conforms to these standards, researchers have to carry out multiple checks and modifications, some of which include:
- Checking for presence of illegal and avoidable restriction sites in the part.
- Modification of part sequence to replace illegal and avoidable restriction sites, depending on chassis organism.
- Addition of suitable prefix according to standards described above.
- Addition of suitable suffix according to standards described above.
BBrickIt: The one-stop solution to your BioBricking needs
Based on our own experience and a survey that we conducted with iGEM 2018 teams (results can be found here) we found that one has to alternate between different programmes to achieve all requirements to construct a BioBrick.
We found ourselves alternating between different programmes such NEB Cutter, various codon optimization tools and the iGEM Parts Registry. Although this process is repetitive and tedious (more so when creating/submitting many parts), it also is very important for the sequence that is generated to be free of errors, the chances of which increases when done manually.
We sought to create an open source online tool which would enable users to check, modify and validate their parts according to the RFC 10 standard, all in a single programme thereby making the entire process simpler, faster and error-free. We believe that this tool will be particularly helpful to first-time iGEM participants.
BioBrick your part and more!
Identify and validate part for presence/absence of illegal restriction sites
Replace sequence according to chassis (E. coli, B. subtilis and S. cerevisiae)
Add required Prefix and suffix
Key considerations while designing the software
Open source
Ease of use
Single programme for all analysis and design
Error-free sequence modification
Codon-optimization for common chassis organisms
User guide
We have created a user guide that outlines the step-by-step procedure for using this software. The user guide can be accessed here.
BBrickIt Software Beta Testing
Several iGEM teams tested our software along the way.
Many teams asked us to make a comprehensive user manual for this software, which we did and shared with all of them! Also, initially, we uploaded a .exe file on Github, which worked only on the Windows operating system. As MacOS is also widely used, the MacOS version of BBrickIt is also available in the GitHub repository.
Target audience: The iGEM community!
Feedback from iGEM teams
Extremely happy to bypass the hectic job of having to think of prefix and suffix sites, scars, illegal sites etc. before planning to clone anything in the standard vectors
The tool is easy to use and well outlined. This software would have helped us catch that we initially forgot to add the BioBrick prefix and suffix to our gene sequence!
“BBrickIt is a useful tool for iGEM teams in producing complete BioBricked part and is also very efficient.”
Methodology
The initial idea for this software came to us while we were shuttling between various software tools in order to identify the compatibility of our sequence as per the BioBrick RFC 10 standard. We shuffled between various tools for performing tasks like identifying illegal restriction sites, reverse complementing the sequence, and manually adding prefix and suffix sequences. We then decided to create a simple, user-friendly GUI that can perform all these functions and specifically look for sites as per BioBricking standards.
The programme also suggests a codon-optimized sequence in case the input sequence is not RFC 10-compatible. Currently the programme works for three commonly used chassis organisms, Escherichia coli, Bacillus subtilis and Saccharomyces cerevisiae.
BBrickIt was coded using the Python language, and the GUI was created using the Tkinter library that is provided along with it. This is an open source programme and its source code can be found here.
References