Team:Rice/Software/Step7

Structural Constraint Details

Take the output from previous step. Predict the secondary structure of all candidate ribosomes and discard those that form any kind of hairpin structure within the ASD region.

Function: RNAfoldcentroidseq


This function takes in a sequence as an input, then makes use of the program RNAfold to predict the most probable secondary structure resulting from the primary structure. The option using centroid ensemble is shown to be more reliable compared to the default option[2]. The output is a bracket/dot form (Click here for more details.) of the sequence that indicates whether each base is part of any secondary structure.


Take the remaining library after energy constraints have been applied. Predict the secondary structure of all candidate ribosomes and discard those that form any kind of hairpin structure within the ASD region.

Function: secstructurelib


This function takes in a library of sequences of ASD and SD, put each ASD in the corresponding position in the 16s rRNA, and use RNAfoldcentroidseq to return a library that contains all the secondary structures of all candidate 16s rRNAs.


Function: ASD_2rystructure_narrow


This function takes in a library, run the library through secstructurelib to get a representative library of secondary structure, and filter out those with secondary structures in the ASD region. The whole pipeline of processing the input library is shown in the figure below.

Figure S4. Elimination of sequences that lead to secondary structure complications.