Difference between revisions of "Team:Uppsala/Transcriptomics/Barcoding-Library Preparation"

 
(3 intermediate revisions by 3 users not shown)
Line 209: Line 209:
 
                 <div class="card-holder">
 
                 <div class="card-holder">
  
                 <h1> Barcoding and Library Preparation </h>
+
                 <h1 id="Bar"> Barcoding and Library Preparation </h1>
  
 
                
 
                
  
                     <p id"Bar">Each sequencing technology has its own mechanism of sequencing. Third generation of sequencing uses pores through which nucleic acid strand is pulled through to read the genetic information. In order to assure that nucleic acids pass the pore at correct speed and orientation, adaptors with motor proteins need to be attached to ends of cDNA molecules. Motor protein then anneals to the pore and pulls the molecule through (Oxford Nanopore, 2018). <br><br>
+
                     <p>Each sequencing technology has its own mechanism of sequencing. Third generation of sequencing uses pores through which nucleic acid strand is pulled through to read the genetic information. In order to assure that nucleic acids pass the pore at correct speed and orientation, adaptors with motor proteins need to be attached to ends of cDNA molecules. Motor protein then anneals to the pore and pulls the molecule through [1]. <br><br>
  
                         Other the adaptors, barcodes are also attached to the cDNA sample. In our application, two different samples are sequenced simultaneously using one flow cell. In order to distinguish which molecule belongs to what sample, barcodes (short DNA fragments with known sequence) are ligated to the cDNA. Subsequent bioinformatic analysis allows sorting the reads according to the barcodes and assign them into two distinct samples.  
+
                         In addition to adaptors, barcodes are also attached to the cDNA sample. In our application, two different samples are sequenced simultaneously using one flow cell. In order to distinguish which molecule belongs to what sample, barcodes (short DNA fragments with known sequence) are ligated to the cDNA. Subsequent bioinformatic analysis allows sorting of the reads according to the barcodes and assign them into two distinct samples.  
 
                     </p>  
 
                     </p>  
  
 
                     <h2 id="Exp">Experiment</h2>
 
                     <h2 id="Exp">Experiment</h2>
  
                     <p>Initially, cDNA is treated with Ultra II End Prep (NEB), which performs end-repair and tailing. End-prep assures that all fragments end in blunt ends and that there are no overhangs, end-tailing adds non-template dAMP to 3´end, which is complementary with dT on barcodes, which are ligated in the subsequent step using Blunt/TA ligase. After barcode ligation, adaptors are ligated using Quick T4 Ligation Kit. Library is then ready to be loaded into the flow cell after passing through the checkpoint. </p>
+
                     <p>Initially, cDNA is treated with Ultra II End Prep (NEB), which performs end-repair and tailing. End-prep assures that all fragments end in blunt ends and that there are no overhangs, end-tailing adds non-template dAMP to 3'end, which is complementary with dT on barcodes, which are ligated in the subsequent step using Blunt/TA ligase. After barcode ligation, adaptors are ligated using Quick T4 Ligation Kit. Library is then ready to be loaded into the flow cell after passing through the checkpoint. </p>
  
 
                     <h2 id="Res">Results</h2>
 
                     <h2 id="Res">Results</h2>
                     <p>At this stage, the limited amount of material limits ways of assuring that the library preparation was successful. Under normal circumstances, it would be possible to check quality of cDNA library with Nanodrop. But due to rather small volume and concentration, it was decided that only quantity will be measured using Qubit ( (as Nanodrop has shown to not being very accurate below concentrations of 30 ng/µl). Table 1 shows the usual yield in various steps. </p><br><br>
+
                     <p>At this stage, the limited amount of material limits ways of assuring that the library preparation was successful. Under normal circumstances, it would be possible to check quality of cDNA library with Nanodrop. But due to rather small volume and concentration, it was decided that only quantity will be measured using Qbit (as Nanodrop has shown to not being very accurate below concentrations of 30 ng/µl). Table 1 shows the usual yield in various steps. </p><br><br>
  
  
Line 301: Line 301:
 
                     <p>It was not certain whether the barcodes and adaptors were attached properly. If some of these steps fails, all the subsequent part would most likely fail as well and therefore lead to low sequencing throughput. In order to test if the library preparation has been done correctly, we prepared a library of standard lambda phage DNA (provided with the kit for troubleshooting). Table 2 shows the yields at the various steps. Interestingly, large amount of material is lost at the adaptor ligation step. </p>
 
                     <p>It was not certain whether the barcodes and adaptors were attached properly. If some of these steps fails, all the subsequent part would most likely fail as well and therefore lead to low sequencing throughput. In order to test if the library preparation has been done correctly, we prepared a library of standard lambda phage DNA (provided with the kit for troubleshooting). Table 2 shows the yields at the various steps. Interestingly, large amount of material is lost at the adaptor ligation step. </p>
  
                    <h4>Result</h4>
+
                 
  
  
Line 366: Line 366:
  
  
                     <p>Graph shown in figure 2 shows that reads are of high quality. In our actual sequencing runs, reads were always of very low quality. This result suggests that low quality / amount of passed reads is most likely due to input material (cDNA library) rather than to the library preparation itself. </p>
+
                     <p>Graph shown in figure 2 shows that reads are of high quality. In our actual sequencing runs, reads were always of very low quality. This result suggests that low quality/amount of passed reads is most likely due to input material (cDNA library) rather than to the library preparation itself. </p>
  
                     <h4>Discussion</h4>
+
                      
  
  
Line 377: Line 377:
 
                     <p>Library preparation is a complex procedure involving multiple enzymes and purification steps. Decreased efficiency of library preparation can be due to malfunctioning of any of the steps. The major issue in prepared libraries has been low sequencing throughput and low quality of reads. We have therefore tested if the issue is somehow connected to our samples or to the actual library prep. Since preparing library from supplied phage DNA was successful (high quality reads, decent throughput), we concluded that the issue was in fact in our input material. This has later proven to be true due to RNA contamination of the libraries as described in cDNA synthesis. <br><br>
 
                     <p>Library preparation is a complex procedure involving multiple enzymes and purification steps. Decreased efficiency of library preparation can be due to malfunctioning of any of the steps. The major issue in prepared libraries has been low sequencing throughput and low quality of reads. We have therefore tested if the issue is somehow connected to our samples or to the actual library prep. Since preparing library from supplied phage DNA was successful (high quality reads, decent throughput), we concluded that the issue was in fact in our input material. This has later proven to be true due to RNA contamination of the libraries as described in cDNA synthesis. <br><br>
  
                     Even with RNA contamination as the potential explanation for low quality reads (RNA is being sequenced using algorithm for DNA and therefore the bases are not being recognized) the problem of low throughput persisted. Major losses are seen during the library prep (up to 75%). According to Oxford Nanopore, this loss is expected, Question is whether it would be worth to increase input material above the recommendation of a manufacturer to achieve higher throughput. <br><br>
+
                     Even with RNA contamination as the potential explanation for low quality reads (RNA is being sequenced using algorithm for DNA and therefore the bases are not being recognized) the problem of low throughput persisted. Major losses are seen during the library prep (up to 75%). According to Oxford Nanopore, this loss is expected, Question is whether it would be worth to increase input material above the recommendation of the manufacturer to achieve higher throughput. <br><br>
  
                     Sequencing using Oxford Nanopore has been used mainly for long fragments of genomic DNA. In our application we aimed to sequenced very short reads (average about 1 kb) of cDNA. As this application is relatively new, we assume the process might not be fully optimized (eg. retention of small fragments by beads, amount of input library, etc.) for our application. <br><br>
+
                     Sequencing using Oxford Nanopore has been used mainly for long fragments of genomic DNA. In our application we aimed to sequence very short reads (average about 1 kb) of cDNA. As this application is relatively new, we assume the process might not be fully optimized (eg. retention of small fragments by beads, amount of input library, etc.) for our application. <br><br>
  
 
                     Additional troubleshooting would need to be performed to adjust the protocols provided by Oxford Nanopore to our application, which was unfortunately not possible in the course of this project due to budgetary and time restrictions. </p>
 
                     Additional troubleshooting would need to be performed to adjust the protocols provided by Oxford Nanopore to our application, which was unfortunately not possible in the course of this project due to budgetary and time restrictions. </p>

Latest revision as of 16:13, 3 December 2018