Difference between revisions of "Team:Uppsala/Transcriptomics/Bioinformatics"

Line 333: Line 333:
 
<h3>Demultiplexing and adapter trimming</h3>
 
<h3>Demultiplexing and adapter trimming</h3>
 
<p>Because the sequencing itself runs pooled samples containing both the barcoded cultured- and control-group samples, the data produced needs to be demultiplexed i.e separated into files containing the reads from respective groups. Because the barcodes used to fingerprint each group is made up of its own base sequence, this also had to be removed or ”trimmed” from the data, leaving us with the pure mRNA sequences. This was achieved using a free nanopore community tool called porechop.</p>
 
<p>Because the sequencing itself runs pooled samples containing both the barcoded cultured- and control-group samples, the data produced needs to be demultiplexed i.e separated into files containing the reads from respective groups. Because the barcodes used to fingerprint each group is made up of its own base sequence, this also had to be removed or ”trimmed” from the data, leaving us with the pure mRNA sequences. This was achieved using a free nanopore community tool called porechop.</p>
 +
 +
<h3>Genome alignment</h3>
 +
<p>The base sequences needs to be aligned to the reference genome of the sequenced species in question for the downstream data analysis. This is important because we want to know where each sequence actually lies in the genome and which genes they correspond to. Genome alignment was done using another community tool called minimap2.</p>
 +
 +
<h3>Gene counting</h3>
 +
<p>Gene counting basically means that you count how many times each mRNA sequence (aligned over a gene from the previous step) occurs. This in turn directly correlates to the amount of up- or down-regulation of that particular gene. A lot of different tools were available for gene counting but ”featureCounts” was chosen through galaxy.</p>
  
  

Revision as of 17:05, 15 October 2018