4.8 Article

Nanopore Data Analysis: Baseline Construction and Abrupt Change-Based Multilevel Fitting

Journal

ANALYTICAL CHEMISTRY
Volume 93, Issue 34, Pages 11710-11718

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.1c01646

Keywords

-

Funding

  1. National Science Foundation (CBET) [2041340, 2022374]
  2. National Institutes of Health [R21CA240220]
  3. Our Health in Our Hands ANU Grand Challenge
  4. Australian Research Council
  5. Div Of Chem, Bioeng, Env, & Transp Sys
  6. Directorate For Engineering [2041340, 2022374] Funding Source: National Science Foundation

Ask authors/readers for more resources

This study introduces an analysis platform with four baseline fitting methods adaptable to various nanopore traces for extracting events effectively. The performance was tested with stable and fluctuating current profiles, showing that event count increased significantly with increasing fitting robustness for fluctuating profiles. Event turning points were clustered using the dbscan method, with subsequent segmentation and iterative refinement to deduce final event levels.
Solid-state nanopore technology delivers single-molecule resolution information, and the quality of the deliverables hinges on the capability of the analysis platform to extract maximum possible events and fit them appropriately. In this work, we present an analysis platform with four baseline fitting methods adaptive to a wide range of nanopore traces (including those with a step or abrupt changes where pre-existing platforms fail) to maximize extractable events (2x improvement in some cases) and multilevel event fitting capability. The baseline fitting methods, in the increasing order of robustness and computational cost, include arithmetic mean, linear fit, Gaussian smoothing, and Gaussian smoothing and regressed mixing. The performance was tested with ultra-stable to vigorously fluctuating current profiles, and the event count increased with increasing fitting robustness prominently for vigorously fluctuating profiles. Turning points of events were clustered using the dbscan method, followed by segmentation into preliminary levels based on abrupt changes in the signal level, which were then iteratively refined to deduce the final levels of the event. Finally, we show the utility of clustering for multilevel DNA data analysis, followed by the assessment of protein translocation profiles.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available