4.7 Article

Halcyon: an accurate basecaller exploiting an encoder-decoder model with monotonic attention

向作者/读者索取更多资源

Recent advancements in nanopore sequencing technology have enabled cost-effective long-read sequencing, making precise DNA sequence analysis increasingly important. Researchers have developed a new basecaller, Halcyon, incorporating neural network techniques and monotonic-attention mechanisms to learn semantic correspondences between nucleotides and signal levels. Evaluation with human whole-genome sequencing data showed that Halcyon outperformed existing basecallers and achieved competitive performance against the latest basecallers from Oxford Nanopore Technologies.
Motivation: In recent years, nanopore sequencing technology has enabled inexpensive long-read sequencing, which promises reads longer than a few thousand bases. Such long-read sequences contribute to the precise detection of structural variations and accurate haplotype phasing. However, deciphering precise DNA sequences from noisy and complicated nanopore raw signals remains a crucial demand for downstream analyses based on higher-quality nanopore sequencing, although various basecallers have been introduced to date. Results: To address this need, we developed a novel basecaller, Halcyon, that incorporates neural-network techniques frequently used in the field of machine translation. Our model employs monotonic-attention mechanisms to learn semantic correspondences between nucleotides and signal levels without any pre-segmentation against input signals. We evaluated performance with a human whole-genome sequencing dataset and demonstrated that Halcyon outperformed existing third-party basecallers and achieved competitive performance against the latest Oxford Nanopore Technologies' basecallers.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据