4.8 Article

VeChat: correcting errors in long reads using variation graphs

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Genetics & Heredity

Enhancing Long-Read-Based Strain-Aware Metagenome Assembly

Xiao Luo et al.

Summary: Microbial communities are highly diverse and involve multiple strains, making it challenging to accurately decipher their composition. This study proposes the use of MetaBooster and MetaBooster-HiFi pipelines for strain-aware metagenome assembly from long-read sequencing data, which outperform current methods in terms of relevant assembly criteria.

FRONTIERS IN GENETICS (2022)

Article Biotechnology & Applied Microbiology

Strainline: full-length de novo viral haplotype reconstruction from noisy long reads

Xiao Luo et al.

Summary: Researchers have introduced a novel approach called Strainline that allows for the assembly of viral haplotypes from noisy long reads without a reference genome. Benchmarking on simulated and real datasets of varying complexity and diversity confirms the novelty and superiority of Strainline.

GENOME BIOLOGY (2022)

Article Biochemical Research Methods

PBSIM2: a simulator for long-read sequencers with a novel generative model of quality scores

Yukiteru Ono et al.

Summary: Researchers introduced a new generative model for quality scores to capture characteristics of errors in reads for long-read sequencers, and evaluated that their simulator successfully simulates reads consistent with real reads.

BIOINFORMATICS (2021)

Article Multidisciplinary Sciences

Scalable long read self-correction and assembly polishing with multiple sequence alignment

Pierre Morisse et al.

Summary: Third-generation sequencing technologies can sequence long reads of tens of kbp, but with high error rates, requiring self-correction. CONSENT is a new self-correction method that combines multiple sequence alignment and local de Bruijn graphs, offering faster computation and improved performance on ultra-long reads.

SCIENTIFIC REPORTS (2021)

Article Multidisciplinary Sciences

Pangenomics enables genotyping of known structural variants in 5202 diverse genomes

Jouni Siren et al.

Summary: Giraffe is a pangenome short-read mapper that efficiently maps to a collection of haplotypes threaded through a sequence graph. It speeds up mapping to thousands of human genomes and enables improved accuracy in genome-wide genotyping, ultimately enhancing genomic analyses. This tool facilitates a more comprehensive characterization of variation and has the potential to benefit various genomic studies.

SCIENCE (2021)

Article Multidisciplinary Sciences

Towards complete and error-free genome assemblies of all vertebrate species

Arang Rhie et al.

Summary: The Vertebrate Genome Project and the international Genome 10K consortium have collaborated to generate high-quality genome assemblies for 16 species representing six major vertebrate lineages, leading to new biological discoveries. Long-read sequencing technologies are essential for maximizing genome quality, and addressing complex repeats and haplotype heterozygosity are crucial for reducing assembly errors and improving completeness of reference genomes. The lessons learned from this project have paved the way for the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all known vertebrate species.

NATURE (2021)

Article Genetics & Heredity

Whole-genome sequencing with long reads reveals complex structure and origin of structural variation in human genetic variations and somatic mutations in cancer

Akihiro Fujimoto et al.

Summary: This study utilized long-read sequencing technology to perform whole-genome sequencing on Japanese liver cancer patients and identified germline and somatic structural variations. By comparing to the chimpanzee genome, events causing insertions and deletions were correctly inferred, and differences in mechanisms of somatic mutations in liver cancers compared to germline variations were observed.

GENOME MEDICINE (2021)

Article Biotechnology & Applied Microbiology

phasebook: haplotype-aware de novo assembly of diploid genomes from long reads

Xiao Luo et al.

Summary: Haplotype-aware diploid genome assembly is essential in various disciplines, and phasebook, a novel de novo approach, outperforms other methods in terms of haplotype coverage while maintaining competitive performance in assembly errors and contiguity.

GENOME BIOLOGY (2021)

Article Biotechnology & Applied Microbiology

Complete, closed bacterial genomes from microbiomes using nanopore sequencing

Eli L. Moss et al.

NATURE BIOTECHNOLOGY (2020)

Article Biochemical Research Methods

Fast and accurate long-read assembly with wtdbg2

Jue Ruan et al.

NATURE METHODS (2020)

Article Biochemical Research Methods

yacrd and fpa: upstream tools for long-read genome assembly

Pierre Marijon et al.

BIOINFORMATICS (2020)

Article Biotechnology & Applied Microbiology

Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes

Kishwar Shafin et al.

NATURE BIOTECHNOLOGY (2020)

Article Multidisciplinary Sciences

Telomere-to-telomere assembly of a complete human X chromosome

Karen H. Miga et al.

NATURE (2020)

Review Genetics & Heredity

Long-read human genome sequencing and its applications

Glennis A. Logsdon et al.

NATURE REVIEWS GENETICS (2020)

Article Biochemical Research Methods

metaFlye: scalable long-read metagenome assembly using repeat graphs

Mikhail Kolmogorov et al.

NATURE METHODS (2020)

Article Biotechnology & Applied Microbiology

Haplotype threading: accurate polyploid phasing from long reads

Sven D. Schrinner et al.

GENOME BIOLOGY (2020)

Article Biotechnology & Applied Microbiology

Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies

Arang Rhie et al.

GENOME BIOLOGY (2020)

Article Biochemical Research Methods

FLAS: fast and high-throughput algorithm for PacBio long-read self-correction

Ergude Bao et al.

BIOINFORMATICS (2019)

Article Microbiology

CAMISIM: simulating metagenomes and microbial communities

Adrian Fritz et al.

MICROBIOME (2019)

Article Biotechnology & Applied Microbiology

Assembly of long, error-prone reads using repeat graphs

Mikhail Kolmogorov et al.

NATURE BIOTECHNOLOGY (2019)

Article Biochemical Research Methods

Full-length de novo viral quasispecies assembly through variation graph construction

Jasmijn A. Baaijens et al.

BIOINFORMATICS (2019)

Article Multidisciplinary Sciences

Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing

Peter Edge et al.

NATURE COMMUNICATIONS (2019)

Article Biochemical Research Methods

Versatile genome assembly evaluation with QUAST-LG

Alla Mikheenko et al.

BIOINFORMATICS (2018)

Article Biochemical Research Methods

Minimap2: pairwise alignment for nucleotide sequences

Heng Li

BIOINFORMATICS (2018)

Article Biochemical Research Methods

Hybrid correction of highly noisy long reads using a variable-order de Bruijn graph

Pierre Morisse et al.

BIOINFORMATICS (2018)

Article Biochemical Research Methods

fastp: an ultra-fast all-in-one FASTQ preprocessor

Shifu Chen et al.

BIOINFORMATICS (2018)

Article Biotechnology & Applied Microbiology

Nanopore sequencing and assembly of a human genome with ultra-long reads

Miten Jain et al.

NATURE BIOTECHNOLOGY (2018)

Article Biotechnology & Applied Microbiology

Variation graph toolkit improves read mapping by representing genetic variation in the reference

Erik Garrison et al.

NATURE BIOTECHNOLOGY (2018)

Article Biotechnology & Applied Microbiology

High-quality genome sequences of uncultured microbes by assembly of read clouds

Alex Bishara et al.

NATURE BIOTECHNOLOGY (2018)

Article Multidisciplinary Sciences

High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries

Chirag Jain et al.

NATURE COMMUNICATIONS (2018)

Article Biochemical Research Methods

Accurate self-correction of errors in long reads using de Bruijn graphs

Leena Salmela et al.

BIOINFORMATICS (2017)

Review Biochemistry & Molecular Biology

Genome graphs and the evolution of genome inference

Benedict Paten et al.

GENOME RESEARCH (2017)

Article Biochemistry & Molecular Biology

De novo assembly of viral quasispecies using overlap graphs

Jasmijn A. Baaijens et al.

GENOME RESEARCH (2017)

Article Biochemistry & Molecular Biology

Fast and accurate de novo genome assembly from long uncorrected reads

Robert Vaser et al.

GENOME RESEARCH (2017)

Article Biochemistry & Molecular Biology

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation

Sergey Koren et al.

GENOME RESEARCH (2017)

Article Biotechnology & Applied Microbiology

DESMAN: a new tool for de novo extraction of strains from metagenomes

Christopher Quince et al.

GENOME BIOLOGY (2017)

Article Biochemical Research Methods

Modelling haplotypes with respect to reference cohort variation graphs

Yohei Rosen et al.

BIOINFORMATICS (2017)

Article Biochemical Research Methods

Edlib: a C/C plus plus library for fast, exact sequence alignment using edit distance

Martin Sosic et al.

BIOINFORMATICS (2017)

Article Biochemical Research Methods

Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences

Heng Li

BIOINFORMATICS (2016)

Article Biochemical Research Methods

LoRDEC: accurate and efficient long read error correction

Leena Salmela et al.

BIOINFORMATICS (2014)

Article Biochemical Research Methods

proovread: large-scale high-accuracy PacBio correction through iterative short read consensus

Thomas Hackl et al.

BIOINFORMATICS (2014)

Article Biochemistry & Molecular Biology

Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations

Francesca Di Giallonardo et al.

NUCLEIC ACIDS RESEARCH (2014)

Article Biochemical Research Methods

Multiple sequence alignment using partial order graphs

C Lee et al.

BIOINFORMATICS (2002)