Difference between revisions of "Team:Uppsala/Transcriptomics/Bioinformatics"

Line 246: Line 246:
 
                             <!-- Here you put your paragraphs -->  
 
                             <!-- Here you put your paragraphs -->  
 
                             <p>Because the sequencing itself runs pooled samples containing both the barcoded cultured- and control-group samples, the data produced needs to be demultiplexed i.e separated into files containing the reads from respective groups. Because the barcodes used to fingerprint each group is made up of its own base sequence, this also had to be removed or ”trimmed” from the data, leaving us with the pure mRNA sequences. This was achieved using a free nanopore community tool called porechop.</p><br>
 
                             <p>Because the sequencing itself runs pooled samples containing both the barcoded cultured- and control-group samples, the data produced needs to be demultiplexed i.e separated into files containing the reads from respective groups. Because the barcodes used to fingerprint each group is made up of its own base sequence, this also had to be removed or ”trimmed” from the data, leaving us with the pure mRNA sequences. This was achieved using a free nanopore community tool called porechop.</p><br>
                             <h3>Genome alignment</h3>
+
                             <h3>Genome Alignment</h3>
 
<p>The base sequences needs to be aligned to the reference genome of the sequenced species in question for the downstream data analysis. This is important because we want to know where each sequence actually lies in the genome and which genes they correspond to. Genome alignment was done using another community tool called minimap2.</p>
 
<p>The base sequences needs to be aligned to the reference genome of the sequenced species in question for the downstream data analysis. This is important because we want to know where each sequence actually lies in the genome and which genes they correspond to. Genome alignment was done using another community tool called minimap2.</p>
  
Line 267: Line 267:
  
 
<div class="card-holder">
 
<div class="card-holder">
<h3>Gene counting</h3>
+
<h3>Gene Counting</h3>
  
 
                       </div>
 
                       </div>
Line 300: Line 300:
 
<h2>Result</h2>
 
<h2>Result</h2>
  
<h3>Validating our transcriptomics pipeline</h3>
+
<h3>Validating our Transcriptomics Pipeline</h3>
 
<p>The transcriptomics pipeline was tried out and validated using read files available from the internet. The files consisted of two datasets of <i>E. Coli</i> (triplicates) cultured in regular LB and a sugar solution respectively.</p><br><br>
 
<p>The transcriptomics pipeline was tried out and validated using read files available from the internet. The files consisted of two datasets of <i>E. Coli</i> (triplicates) cultured in regular LB and a sugar solution respectively.</p><br><br>
 
                      
 
                      
Line 401: Line 401:
 
<br><br>
 
<br><br>
  
<h3>Analyzing our own sequencing data</h3>
+
<h3>Analyzing Our Own Sequencing Data</h3>
 
<p><b>Table 1</b>: The first few genes as a result of the differential gene expression analysis seen in <b>Figure 6</b> together with their  
 
<p><b>Table 1</b>: The first few genes as a result of the differential gene expression analysis seen in <b>Figure 6</b> together with their  
 
promotor sequence and function in the organism.</p>
 
promotor sequence and function in the organism.</p>

Revision as of 20:09, 17 October 2018





<