4.8 Article

Linear time complexity de novo long read genome assembly with GoldRush

Related references

Note: Only some of the references are listed; download the original article for the complete reference information.
Article Biotechnology & Applied Microbiology

Telomere-to-telomere assembly of diploid chromosomes with Verkko

Mikko Rautiainen et al.

Summary: The Telomere-to-Telomere consortium has achieved the first complete sequence of a human genome. The authors combine long Nanopore sequencing reads with a high-resolution assembly graph to resolve repeat sequences, and automate the process in the Verkko pipeline. The result is a phased, diploid assembly with many chromosomes assembled from end to end. This advance is crucial for constructing comprehensive pangenome databases and for chromosome-scale comparative genomics.

NATURE BIOTECHNOLOGY (2023)

Article Biochemistry & Molecular Biology

Ensembl 2022

Fiona Cunningham et al.

Summary: Ensembl is unique in its flexible infrastructure for access to genomic data and annotation. The project has focused on expediting annotation of new assemblies via the Ensembl Rapid Release platform, which delivered its greatest annual number of newly annotated genomes. It also developed a new method for comparative analyses and annotated non-vertebrate eukaryotes for the first time.

NUCLEIC ACIDS RESEARCH (2022)

Article Multidisciplinary Sciences

The complete sequence of a human genome

Sergey Nurk et al.

Summary: The Telomere-to-Telomere (T2T) Consortium has presented a complete sequence of a human genome, T2T-CHM13, which covers the whole genome except for the Y chromosome. This new sequence includes gapless assemblies, error corrections in previous references, and nearly 200 million base pairs of additional sequence with gene predictions, including protein coding genes. The completion of important regions allows for further studies on genetic variations and functions.

SCIENCE (2022)

Article Microbiology

Nanopore long-read-only metagenomics enables complete and high-quality genome reconstruction from mock and complex metagenomes

Lei Liu et al.

Summary: NanoPhase is an open-source tool that enables the reconstruction of reference-quality genomes from complex metagenomes using only Nanopore long reads. It improves the quality and contiguity of MAGs, allowing for a more comprehensive investigation of target microbiomes.

MICROBIOME (2022)

Article Biochemical Research Methods

ntHash2: recursive spaced seed hashing for nucleotide sequences

Parham Kazemi et al.

Summary: ntHash2 is a fast algorithm for spaced seed hashing that can be integrated into various bioinformatics tools for efficient sequence analysis in genome research. It is faster than previous versions and conventional hashing algorithms, and also improves the uniformity of hash distribution.

BIOINFORMATICS (2022)

Article Multidisciplinary Sciences

Semi-automated assembly of high-quality diploid human reference genomes

Erich D. Jarvis et al.

NATURE (2022)

Article Microbiology

Systematic benchmarking of nanopore Q20+ kit in SARS-CoV-2 whole genome sequencing

Junhong Luo et al.

Summary: Whole genome sequencing provides key information for the prevention, control, and tracking of COVID-19. Nanopore sequencing is a rapid and simple long-read sequencing technology. Combining the LSK112 kit with the R10.4 flow cell improves sequencing accuracy.

FRONTIERS IN MICROBIOLOGY (2022)

Article Biotechnology & Applied Microbiology

Comparison of structural variants detected by PacBio-CLR and ONT sequencing in pear

Yueyuan Liu et al.

Summary: This study comprehensively analyzed and compared structural variations (SVs) in the pear genome using different long-read platforms. The results showed that NanoVar had the highest sensitivity in detecting SVs at low sequencing depth. Several genes and transcription factors related to phenotypic differences between pear varieties were identified through SV detection.

BMC GENOMICS (2022)

Article Biochemical Research Methods

LongStitch: high-quality genome assembly correction and scaffolding using long reads

Lauren Coombe et al.

Summary: LongStitch is a scalable pipeline that corrects and scaffolds draft genome assemblies using only long reads. It incorporates multiple tools developed by the group and runs in up to three stages: initial assembly correction (Tigmint-long), followed by two incremental scaffolding stages (ntLink and ARKS-long). Tested on various organisms, it consistently improved assembly contiguity compared to other tools, and it is expected to benefit a wide variety of de novo genome assembly projects.

BMC BIOINFORMATICS (2021)

Article Biochemistry & Molecular Biology

Effective sequence similarity detection with strobemers

Kristoffer Sahlin

Summary: K-mer-based methods are commonly sensitive to variable mutation rates, while strobemers, with combinations determined by a hash function, provide more evenly distributed sequence matches and are less sensitive to different mutation rates. Strobemers also outperform traditional methods in sequence match coverage.

GENOME RESEARCH (2021)

Article Biotechnology & Applied Microbiology

phasebook: haplotype-aware de novo assembly of diploid genomes from long reads

Xiao Luo et al.

Summary: Haplotype-aware diploid genome assembly is essential in various disciplines, and phasebook, a novel de novo approach, outperforms other methods in terms of haplotype coverage while maintaining competitive performance in assembly errors and contiguity.

GENOME BIOLOGY (2021)

Article Biochemistry & Molecular Biology

Application of long-read sequencing to the detection of structural variants in human cancer genomes

Yoshitaka Sakamoto et al.

Summary: Long-read sequencing technologies have significantly advanced cancer genomics research by enabling precise detection of structural variants (SVs) and unveiling their complex structures, as well as revealing epigenomic information surrounding SV loci. This provides a new opportunity for better understanding disease development and drug development.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2021)

Article Biotechnology & Applied Microbiology

Straglr: discovering and genotyping tandem repeat expansions using whole genome long-read sequences

Readman Chiu et al.

Summary: Tandem repeat (TR) expansion is the underlying cause of over 40 neurological disorders, and long-read sequencing technology offers an exciting avenue for detecting TR expansions. The software tool Straglr allows for targeted genotyping and novel expansion detection, showing potential for investigating disease-associated TR expansions using long-read sequencing.

GENOME BIOLOGY (2021)

Article Biochemical Research Methods

Fast and accurate long-read assembly with wtdbg2

Jue Ruan et al.

NATURE METHODS (2020)

Article Biochemical Research Methods

ntJoin: Fast and lightweight assembly-guided scaffolding using minimizer graphs

Lauren Coombe et al.

BIOINFORMATICS (2020)

Article Biotechnology & Applied Microbiology

Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes

Kishwar Shafin et al.

NATURE BIOTECHNOLOGY (2020)

Article Multidisciplinary Sciences

Mismatch-tolerant, alignment-free sequence classification using multiple spaced seeds and multiindex Bloom filters

Justin Chu et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2020)

Article Multidisciplinary Sciences

Highly accurate long-read HiFi sequencing data for five complex genomes

Ting Hon et al.

SCIENTIFIC DATA (2020)

Article Medicine, Research & Experimental

Will long-read sequencing technologies replace short-read sequencing technologies in the next 10 years?

Boluwatife A. Adewale

AFRICAN JOURNAL OF LABORATORY MEDICINE (2020)

Article Biotechnology & Applied Microbiology

Haplotype threading: accurate polyploid phasing from long reads

Sven D. Schrinner et al.

GENOME BIOLOGY (2020)

Article Biotechnology & Applied Microbiology

Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies

Arang Rhie et al.

GENOME BIOLOGY (2020)

Article Genetics & Heredity

Benchmarking of long-read correction methods

Juliane C. Dohm et al.

NAR GENOMICS AND BIOINFORMATICS (2020)

Article Biochemical Research Methods

ntEdit: scalable genome sequence polishing

Rene L. Warren et al.

BIOINFORMATICS (2019)

Article Biotechnology & Applied Microbiology

Assembly of long, error-prone reads using repeat graphs

Mikhail Kolmogorov et al.

NATURE BIOTECHNOLOGY (2019)

Article Biochemical Research Methods

Resolving repeat families with long reads

Philipp Bongartz

BMC BIOINFORMATICS (2019)

Article Biotechnology & Applied Microbiology

LRScaf: improving draft genomes using long noisy reads

Mao Qin et al.

BMC GENOMICS (2019)

Article Biotechnology & Applied Microbiology

Performance of neural network basecalling tools for Oxford Nanopore sequencing

Ryan R. Wick et al.

GENOME BIOLOGY (2019)

Article Biochemical Research Methods

Versatile genome assembly evaluation with QUAST-LG

Alla Mikheenko et al.

BIOINFORMATICS (2018)

Article Biochemical Research Methods

Minimap2: pairwise alignment for nucleotide sequences

Heng Li

BIOINFORMATICS (2018)

Article Biochemical Research Methods

ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers

Lauren Coombe et al.

BMC BIOINFORMATICS (2018)

Article Biochemical Research Methods

Tigmint: correcting assembly errors using linked reads from large molecules

Shaun D. Jackman et al.

BMC BIOINFORMATICS (2018)

Article Biochemistry & Molecular Biology

Fast and accurate de novo genome assembly from long uncorrected reads

Robert Vaser et al.

GENOME RESEARCH (2017)

Article Biochemistry & Molecular Biology

ABySS 2.0: resource-efficient assembly of large genomes using a Bloom filter

Shaun D. Jackman et al.

GENOME RESEARCH (2017)

Article Biochemical Research Methods

ntHash: recursive nucleotide hashing

Hamid Mohamadi et al.

BIOINFORMATICS (2016)

Article Biochemistry & Molecular Biology

Chromosome-scale shotgun assembly using an in vitro method for long-range linkage

Nicholas H. Putnam et al.

GENOME RESEARCH (2016)

Article Biochemical Research Methods

BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs

Felipe A. Simão et al.

BIOINFORMATICS (2015)

Article Biochemical Research Methods

Sealer: a scalable gap-closing application for finishing draft genomes

Daniel Paulino et al.

BMC BIOINFORMATICS (2015)

Article Biotechnology & Applied Microbiology

Comparison of the two major classes of assembly algorithms: overlap-layout-consensus and de-bruijn-graph

Zhenyu Li et al.

BRIEFINGS IN FUNCTIONAL GENOMICS (2012)

Review Genetics & Heredity

Repetitive DNA and next-generation sequencing: computational challenges and solutions

Todd J. Treangen et al.

NATURE REVIEWS GENETICS (2012)

Article Genetics & Heredity

Review of General Algorithmic Features for Genome Assemblers for Next Generation Sequencers

Bilal Wajid et al.

GENOMICS PROTEOMICS & BIOINFORMATICS (2012)

Article Genetics & Heredity

Repetitive Elements May Comprise Over Two-Thirds of the Human Genome

A. P. Jason de Koning et al.

PLOS GENETICS (2011)

Article Biochemical Research Methods

How repetitive are genomes?

Bernhard Haubold et al.

BMC BIOINFORMATICS (2006)

Article Biochemical Research Methods

PatternHunter: faster and more sensitive homology search

B Ma et al.

BIOINFORMATICS (2002)