Difference between revisions of "Team:Uppsala/Transcriptomics/Bioinformatics"

Line 12: Line 12:
 
         </style>
 
         </style>
  
 +
    </head>
 +
    <body>
  
 
 
 
<style>
 
 
 
            :root{
 
                --primary: #8b1a32;
 
                --secondary:#969696;
 
                --tertiary: #f15025;
 
                --whiteish: #fcf7ff;
 
                --light-blue:#cde6f5;
 
                --alt-secondary: #554640;
 
 
            }
 
 
 
            html{
 
                width: 100%;
 
                height: 100%;
 
            }
 
            *{
 
                margin: 0;
 
                padding: 0;
 
                box-sizing: border-box;
 
            }
 
           
 
            .parallax {
 
                /* The image used */
 
                background-image: url("https://static.igem.org/mediawiki/2018/9/99/T--Uppsala--Transcriptomics-HEADER_2.jpeg");
 
 
                /* Set a specific height */
 
                min-height: 99vh;
 
 
                /* Create the parallax scrolling effect */
 
                background-attachment: fixed;
 
                background-position: center;
 
                background-repeat: no-repeat;
 
                background-size: cover;
 
                /*background: linear-gradient(to bottom, transparent 90%);*/
 
            }
 
           
 
            .blur-box {
 
            background-color: var(--whiteish);
 
            box-shadow: 0 0 10px 10px var(--whiteish);
 
            }
 
 
            .sub-header{
 
 
                /* Set a specific height */
 
                height:20vw;
 
                min-height: 150px;
 
               
 
 
                /* Create the parallax scrolling effect */
 
                /*background-attachment: fixed;*/
 
                background-position: center;
 
                background-repeat: no-repeat;
 
                background-size: cover;
 
               
 
             
 
            }
 
           
 
            .sub-header h1{
 
                font-size: 60px;
 
                text-align: center;
 
                position:absolute;
 
                left:0;
 
                right:0;
 
                margin-top:6.5% ;
 
                margin-left:auto;
 
                margin-right: auto;
 
                color:white;
 
                bottom: px;
 
            }
 
 
/* TODO: CHANGE */
 
            #blue{
 
                background-image: url(redbanner.jpg);
 
            }
 
 
            h1, h2{
 
 
                color: #661325;
 
            }
 
            h1{
 
                margin-top:2em;
 
                margin-bottom: 0.5em;
 
            }
 
            #first-title{
 
                top:100px;
 
                margin-top: 0;
 
            }
 
           
 
            .content{
 
                position:relative;
 
             
 
                background-color: var(--whiteish);
 
            }
 
 
            .content-text{
 
                margin-top:2em;
 
                min-width: 400px;
 
                width:70%;
 
                position: relative;
 
                margin: auto;
 
                /*background-color: #8c7cff;*/
 
 
            }
 
 
 
            .scroll-pointer{
 
                top:85%;
 
                position:absolute;
 
                /*margin-left:auto;
 
                margin-right:auto;*/
 
                width: 100%;
 
                padding: 0;
 
            }
 
 
            .scroll-pointer img{
 
                color: var(--primary) ;
 
 
            }
 
 
            .center-icon{
 
                width:100px;
 
                position: relative;
 
                display: block;
 
                margin-right: auto;
 
                margin-left: auto;
 
                opacity: 0.8;
 
            }
 
.center {
 
    display: block;
 
    margin-left: auto;
 
    margin-right: auto;
 
    width: 50%;
 
}
 
 
 
 
</style>
 
  
  
Line 296: Line 155:
  
  
 
 
 
 
 
 
    </head>
 
  
 
     <div class="body">
 
     <div class="body">
Line 320: Line 172:
 
                  
 
                  
  
<h1>Bioinformatics</h1>
+
<div class="card-holder">
 +
    <div class="content-card-heading">
 +
        <h1>Bioinformatics</h1>
 +
    </div>
 +
 
 +
 
 
   
 
   
 
<p>After a successful sequencing run, you are left with raw data containing millions and millions (and millions) of lines of base sequences, all of which need to be processed and interpreted. This is where the interdisciplinary field of bioinformatics comes in. A vast range of software tools is available, tailored to different kinds of analysis and to the different sequencing methods used.<br><br>
Line 332: Line 189:
 
<h3>Demultiplexing and adapter trimming</h3>
 
<h3>Demultiplexing and adapter trimming</h3>
 
<p>Because the sequencing run uses pooled samples containing both the barcoded cultured-group and control-group samples, the data produced needs to be demultiplexed, i.e. separated into files containing the reads from the respective groups. Because the barcodes used to fingerprint each group are themselves base sequences, they also had to be removed, or "trimmed", from the data, leaving us with the pure mRNA sequences. This was achieved using a free nanopore community tool called Porechop.</p>
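<p>The idea of demultiplexing and trimming can be illustrated with a minimal Python sketch. It is not how Porechop works internally: Porechop matches adapters and barcodes by alignment, whereas this sketch only compares exact prefixes. The barcode sequences, the group names and the <code>demultiplex</code> function are hypothetical and chosen purely for illustration.</p>

```python
# Hypothetical barcode sequences; real nanopore barcodes are longer, and
# Porechop finds them by alignment rather than by exact prefix comparison.
BARCODES = {
    "cultured": "ACGTAC",
    "control": "TGCATG",
}


def demultiplex(reads, barcodes=BARCODES):
    """Sort reads into groups by their leading barcode and trim the barcode off."""
    groups = {name: [] for name in barcodes}
    groups["unassigned"] = []
    for read in reads:
        for name, barcode in barcodes.items():
            if read.startswith(barcode):
                # Keep only the sequence that follows the barcode.
                groups[name].append(read[len(barcode):])
                break
        else:
            # No barcode matched, so the read cannot be assigned to a group.
            groups["unassigned"].append(read)
    return groups


pooled = ["ACGTACGGGTTT", "TGCATGAAACCC", "ACGTACTTTT", "NNNNGGGG"]
result = demultiplex(pooled)
```

<p>Here <code>result</code> maps each group name to its trimmed reads, with reads that carry no recognised barcode collected under <code>"unassigned"</code>.</p>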
 +
                </div>
  
 
<!--Start of template with side picture -->
Line 351: Line 209:
  
 
                     </div>
 
                     </div>
 +
             
 
                  
 
                  
 
                 <!--End of template with side picture -->
 
                 <!--End of template with side picture -->
Line 360: Line 219:
 
<h3>Gene counting</h3>
 
<h3>Gene counting</h3>
 
<p>Gene counting means counting how many reads (mRNA sequences aligned to a gene in the previous step) map to each gene. The count reflects that gene's expression level, so comparing counts between the cultured and control groups indicates how strongly the gene is up- or down-regulated. Many tools are available for gene counting; we chose "featureCounts", run through Galaxy.</p>
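<p>As a rough illustration of the counting step, the Python sketch below assigns aligned reads to genes by coordinate overlap. It is a simplified stand-in for featureCounts, not its actual implementation. The gene names, the coordinates and the <code>count_reads</code> function are assumptions made for this example, and reads that overlap more than one gene are simply skipped.</p>

```python
from collections import Counter

# Hypothetical gene annotations: name -> (start, end), 1-based inclusive.
GENES = {"geneA": (100, 500), "geneB": (600, 900)}


def count_reads(alignments, genes=GENES):
    """Count aligned reads per gene; a read counts only if it overlaps exactly one gene."""
    counts = Counter({name: 0 for name in genes})
    for start, end in alignments:
        # Two intervals overlap when each one starts before the other ends.
        hits = [
            name
            for name, (g_start, g_end) in genes.items()
            if start <= g_end and end >= g_start
        ]
        if len(hits) == 1:
            counts[hits[0]] += 1
    return counts


# Each alignment is the (start, end) position of one read on the reference.
alignments = [(120, 300), (450, 520), (610, 700), (950, 1000), (480, 650)]
counts = count_reads(alignments)
```

<p>Running the same count on the cultured and control samples gives two count tables whose per-gene ratio is the fold change examined in the later analysis.</p>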
 +
                      </div>
  
 
<!--Start of template with side picture -->
Line 389: Line 249:
 
<p>The transcriptomics pipeline was tested and validated using publicly available read files. The files consisted of two datasets of E. coli (in triplicate) cultured in regular LB and in a sugar solution, respectively.</p><br><br>
  
 +
                      </div>
 
<!--Start of template with side picture -->
 
                 <div class="card-holder">  
 
                 <div class="card-holder">  
Line 410: Line 271:
 
                 <!--End of template with side picture -->
 
                 <!--End of template with side picture -->
 
<br><br>
 
<br><br>
 
+
                </div>
 
<!--Start of template with side picture -->
 
                 <div class="card-holder">  
 
                 <div class="card-holder">  
Line 434: Line 295:
  
 
<p>Searching for the genes in the NCBI database showed that the most highly expressed gene in the sugar-cultured E. coli is involved in a type of sugar system, which supports the conclusion that the pipeline was working as intended.</p><br><br>
 
+
                </div>
 
<!--Start of template with side picture -->
 
                 <div class="card-holder">  
 
                 <div class="card-holder">  
Line 457: Line 318:
 
<br><br>
 
<br><br>
  
<p>Our own runs unfortunately did not produce results as good as those seen above. The effect of the major issues with sequencing, and with generating enough data, can be seen in figure 4. Judging by the adjusted p-values, even though genes can be identified, their statistical significance is very uncertain (the commonly accepted threshold is an adjusted p-value &lt; 0.05). No up- or down-regulation with a fold change of interest could be identified either. Both the lack of any major fold change and the low significance can most likely be attributed to too little data being generated in the prior sequencing step. For these reasons, no gene could be identified even as a candidate.</p>
+
                    <p>Our own runs unfortunately did not produce results as good as those seen above. The effect of the major issues with sequencing, and with generating enough data, can be seen in figure 4. Judging by the adjusted p-values, even though genes can be identified, their statistical significance is very uncertain (the commonly accepted threshold is an adjusted p-value &lt; 0.05). No up- or down-regulation with a fold change of interest could be identified either. Both the lack of any major fold change and the low significance can most likely be attributed to too little data being generated in the prior sequencing step. For these reasons, no gene could be identified even as a candidate.</p>
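<p>To make the role of the adjusted p-value concrete, the Python sketch below applies the Benjamini-Hochberg correction to a handful of raw p-values. Which correction method the tools in our pipeline applied is not stated here, so Benjamini-Hochberg is an assumption, as are the example p-values and the function name <code>benjamini_hochberg</code>.</p>

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    n = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(n), key=lambda i: p_values[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down so that adjusted values stay monotone.
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical raw p-values for four genes.
p_values = [0.01, 0.04, 0.03, 0.20]
adjusted = benjamini_hochberg(p_values)
significant = [q < 0.05 for q in adjusted]
```

<p>In this example three raw p-values lie below 0.05, but only one gene remains below the threshold after adjustment, which illustrates why the adjusted value is the one to judge by.</p>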
 +
                   
 +
 +
                   
 +
                   
 +
                </div>
  
 
                 <!-- HERE ENDS THE PORTION WHERE YOU PUT IN YOUR CONTENT-->
 
                 <!-- HERE ENDS THE PORTION WHERE YOU PUT IN YOUR CONTENT-->
Line 464: Line 332:
 
         </div>
 
         </div>
 
     </div>
 
     </div>
 +
                 
 +
                    </body>
 
</html>
 
</html>

Revision as of 18:55, 15 October 2018