4.7 Article

Adaptive sequencing using nanopores and deep learning of mitochondrial DNA

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 4, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac251

Keywords

nanopore-sequencing; selective-sequencing; deep-learning; real-time; classification

Funding

  1. EdmondJ. Safra Center for Bioinformaticsat Tel-Aviv University
  2. Tel Aviv University Innovation Laboratories (TILabs)
  3. Djerassi Elias Institute of Oncology
  4. Gertner Institute
  5. ZiminInstitute

Ask authors/readers for more resources

This study demonstrates an alternative method of software-based selective sequencing in real time by combining nanopore sequencing and deep learning. The results show that deep learning can accurately classify the first 200 nucleotides in a raw nanopore sequencing signal format. The uniqueness of this method is its ability to distinguish two groups of DNA without a labeled reference.
Nanopore sequencing is an emerging technology that reads DNA by utilizing a unique method of detecting nucleic acid sequences and identifies the various chemical modifications they carry. Deep learning has increased in popularity as a useful technique to solve many complex computational tasks. 'Adaptive sequencing' is an implementation of selective sequencing, intended for use on the nanopore sequencing platform. In this study, we demonstrated an alternative method of software-based selective sequencing that is performed in real time by combining nanopore sequencing and deep learning. Our results showed the feasibility of using deep learning for classifying signals from only the first 200 nucleotides in a raw nanopore sequencing signal format. This was further demonstrated by comparing the accuracy of our deep learning classification model across data from several human cell lines and other eukaryotic organisms. We used custom deep learning models and a script that utilizes a 'Read Until' framework to target mitochondrial molecules in real time from a human cell line sample. This achieved a significant separation and enrichment ability of 2.3-fold. In a series of very short sequencing experiments (10, 30 and 120 min), we identified genomic and mitochondrial reads with accuracy above 90%, although mitochondrial DNA comprised only 0.1% of the total input material. The uniqueness of our method is the ability to distinguish two groups of DNA even without a labeled reference. This contrasts with studies that required a well-defined reference, whether of a DNA sequence or of another type of representation. Additionally, our method showed higher correlation to the theoretically possible enrichment factor, compared with other published methods. We believe that our results will lay the foundation for rapid and selective sequencing using nanopore technology and will pave the approach for clinical applications that use nanopore sequencing data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available