Overview

The final step in our detection method consists of measuring and analyzing DNA molecules that have been selectively tagged for sequencing. We have developed such a method based on our designed fusion protein dxCas9-Tn5 (Picelli et al., 2014; Hu et al., 2018). This page describes how we designed this novel approach, how it was tested, and which applications the method can be extended to.

Targeted DNA enrichment during library preparation

We focused on the Oxford Nanopore Technologies (ONT) sequencing platform as the core technology for obtaining DNA sequences from samples. On this platform, DNA molecules pass through a protein pore embedded in an electrically resistant membrane, and the changes in electrical signal caused by this passage are measured. These changes in ionic current are sequence dependent, and software tools from ONT convert the signal into DNA sequences. On one sequencing device (flow cell), more than 2000 active pores can be generating data at the same time. The amount of data produced by this type of sequencing is very large, so efficient data analysis is crucial for filtering and comparing the measurements (Jain, Olsen, Paten & Akeson, 2016).
To control the passage of each molecule through the pore and facilitate the determination of specific sequences, the DNA molecules to be sequenced are attached to a so-called motor protein. This protein regulates the rate at which DNA moves through the nanopore and is indispensable for this type of sequencing. During library preparation with ONT, an enzyme called transposase is used to randomly fragment the DNA and add DNA adapters. In a subsequent step, these DNA adapters are ligated to another DNA molecule that is attached to the motor protein (Jain, Olsen, Paten & Akeson, 2016). We aimed to replace the fragmentation step of this reaction with a newly designed fusion protein that makes adapter integration sequence specific. Thus, during library preparation, specific DNA sequences are preferentially tagged with adapters and enriched for sequencing, as shown in figure 1.

**Figure 1.** Differences between random library preparation with the transposase from the ONT library preparation kit (left) and targeted library preparation with our fusion protein (right). The principle behind targeted library preparation is to enrich certain DNA targets before the sequencing run.

Enrichment for specific DNA sequences allows more detailed studies and the processing of a higher number of samples while generating less data. To make multiplexing easier, we developed a software tool that generates the specific DNA adapter sequences needed for ONT sequencing. Each generated sequence can be directly linked to a sample identifier. To explore and use this tool, please visit our improvement page.
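As a rough illustration of how sample-specific adapter sequences could be generated, the Python sketch below builds a set of DNA barcodes that differ from one another in at least a minimum number of positions and assigns one to each sample identifier. It is not the tool described above: the function names, the barcode length, the distance threshold and the GC-content filter are all assumptions chosen for the example.

```python
import random

BASES = "ACGT"


def hamming(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def gc_fraction(seq):
    """Fraction of G and C bases in a sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def generate_barcodes(sample_ids, length=12, min_distance=4, seed=0,
                      max_tries=100000):
    """Assign each sample ID a random barcode (hypothetical scheme).

    Candidates are kept only if their GC content is between 40% and 60%
    and they differ from every accepted barcode in >= min_distance bases.
    """
    rng = random.Random(seed)  # fixed seed keeps the mapping reproducible
    barcodes = []
    tries = 0
    while len(barcodes) < len(sample_ids):
        tries += 1
        if tries > max_tries:
            raise RuntimeError("not enough barcodes found; relax the constraints")
        candidate = "".join(rng.choice(BASES) for _ in range(length))
        if not 0.4 <= gc_fraction(candidate) <= 0.6:
            continue
        if all(hamming(candidate, b) >= min_distance for b in barcodes):
            barcodes.append(candidate)
    return dict(zip(sample_ids, barcodes))


if __name__ == "__main__":
    # Sample names are placeholders for illustration only.
    for sample, barcode in generate_barcodes(["sample_A", "sample_B", "sample_C"]).items():
        print(sample, barcode)
```

Keeping a minimum distance between barcodes means that a few sequencing errors in the barcode region are less likely to cause a read to be assigned to the wrong sample.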

Proof of principle

Targeted integration of DNA adapters during library preparation and sequencing with ONT MinION

We constructed, expressed and purified the fusion protein dxCas9-Tn5. We demonstrated that this protein can integrate DNA adapters at a specific target (using a gRNA for dxCas9), with the human erythropoietin coding sequence (EPO cds) as substrate (to see these results, please visit our demonstration page). After obtaining this result, we performed the integration reaction with DNA adapters that are compatible with ONT sequencing. These adapters were integrated at the specific EPO cds DNA target. The sample in the reaction with our fusion protein contained two molecules: EPO cds (the target of the fusion protein) and a different molecule (not targeted by the sgRNA) to assess any off-target adapter integration.
After this integration reaction, the sample was used for library preparation for ONT sequencing. The protocol was started at its second step, as our fusion protein replaced the first reaction. After sequencing, the reads were aligned to both DNA molecules as references. This experiment showed that the target EPO cds was enriched for sequencing compared with the negative control DNA that was not targeted (89 unique aligned reads against 0 unique aligned reads, respectively). This result indicates sample enrichment towards EPO cds.
As further proof of targeted integration, figure 2 shows the specific alignment of one of the unique reads from this experiment to the reference EPO cds DNA used.
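The comparison of unique aligned reads per reference described above can be sketched in a few lines of Python. This is a minimal example, not the analysis pipeline we used: it assumes the alignments are available in PAF, the tab-separated format written by aligners such as minimap2, in which column 1 is the read name, column 6 the reference name and column 12 the mapping quality. The reference names `EPO_cds` and `control` and the toy records are invented for illustration.

```python
from collections import defaultdict


def count_unique_reads(paf_lines, min_mapq=0):
    """Return {reference name: number of distinct reads aligned to it}."""
    reads_per_ref = defaultdict(set)
    for line in paf_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 12:
            continue  # skip blank or malformed lines
        read_id, ref_name, mapq = fields[0], fields[5], int(fields[11])
        if mapq >= min_mapq:
            # A set counts each read once, even with several alignments.
            reads_per_ref[ref_name].add(read_id)
    return {ref: len(reads) for ref, reads in reads_per_ref.items()}


if __name__ == "__main__":
    # Toy PAF records (invented): read1 aligns twice to the same reference.
    toy = [
        "read1\t900\t10\t890\t+\tEPO_cds\t1000\t50\t930\t850\t880\t60",
        "read1\t900\t10\t400\t+\tEPO_cds\t1000\t50\t440\t380\t390\t60",
        "read2\t700\t0\t700\t-\tEPO_cds\t1000\t100\t800\t680\t700\t60",
        "read3\t500\t0\t100\t+\tcontrol\t1200\t0\t100\t60\t100\t3",
    ]
    print(count_unique_reads(toy))
    print(count_unique_reads(toy, min_mapq=10))
```

Raising `min_mapq` discards ambiguous alignments, which gives a more conservative read count for the non-targeted control.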

For more information and discussion on these experiments, please visit our sequencing results.

Further improvements and prospective applications

We obtained a proof of principle indicating that library preparation can be enriched towards a specific DNA sequence. However, the method still needs to be optimized before it can be used for detection applications. One of the major improvements to be made is the optimization of the integration protocol for our fusion protein. Buffer composition must be adjusted to favour the reaction kinetics of both Cas9 binding to the target and Tn5 integration. Another improvement is the design of optimal adapters compatible with ONT library preparation. This could be done in consultation with sequencing experts from ONT, because the specific adapter sequences are not publicly known for corporate reasons. For our prospective collaboration with Oxford Nanopore Technologies, please visit our entrepreneurship page.
This method was designed and developed while searching for a solution for the detection of gene doping, but its application scope is much broader than that. The ability to enrich samples for specific DNA molecules prior to sequencing can be used in research areas that require highly detailed information on DNA mutations. Such studies could range from single point mutations to large structural variations or copy number alterations, and could include detection of viral infections, fetal DNA screening and food safety monitoring (Gabrieli, Sharim, Michaeli & Ebenstein, 2017).