4.6 Article

Empirical evaluation of methods for de novo genome assembly

Journal

PEERJ COMPUTER SCIENCE
Volume -, Issue -, Pages -

Publisher

PEERJ INC
DOI: 10.7717/peerj-cs.636

Keywords

DNA sequences; De novo assembly; De-Bruijn-Graph; Overlap-Layout-Consensus; String-Graph based assembly

Funding

  1. Basic Science Research Program through the National Research Foundation of Korea (NRF) - Ministry of Education [NRF2019R1F1A1064019]

Ask authors/readers for more resources

This article explores the rapid growth of high-throughput sequencing projects and new read-assembly algorithms driven by next-generation sequencing technologies like Ion Torrent, Illumina, and PacBio. Genome research has led to the creation of high-quality reference genomes for various organisms, with de novo assembly playing a crucial role in gene discovery and other studies. The article provides a thorough comparison of different assembly algorithms to aid new users in understanding them better.
Y Technologies for next-generation sequencing (NGS) have stimulated an exponential rise in high-throughput sequencing projects and resulted in the development of new read-assembly algorithms. A drastic reduction in the costs of generating short reads on the genomes of new organisms is attributable to recent advances in NGS technologies such as Ion Torrent, Illumina, and PacBio. Genome research has led to the creation of high-quality reference genomes for several organisms, and de novo assembly is a key initiative that has facilitated gene discovery and other studies. More powerful analytical algorithms are needed to work on the increasing amount of sequence data. We make a thorough comparison of the de novo assembly algorithms to allow new users to clearly understand the assembly algorithms: overlap-layout-consensus and de-Bruijn-graph, string-graph based assembly, and hybrid approach. We also address the computational efficacy of each algorithm's performance, challenges faced by the assembly tools used, and the impact of repeats. Our results compare the relative performance of the different assemblers and other related assembly differences with and without the reference genome. We hope that this analysis will contribute to further the application of de novo sequences and help the future growth of assembly algorithms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available