4.5 Article

Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing

Related references

Note: Only part of the references are listed.
Article Plant Sciences

A chromosome scale tomato genome built from complementary PacBio and Nanopore sequences alone reveals extensive linkage drag during breeding

Willem M. J. van Rengs et al.

Summary: In this study, we used PacBio HiFi and ONT Nanopore sequencing to assemble the tomato genome and discovered structural variations and linkage drag associated with virus resistance genes. The results were validated through chromosome conformation capture and marker studies, demonstrating the effectiveness of long read technologies in generating near-complete genome sequences.

PLANT JOURNAL (2022)

Article Biochemical Research Methods

The SAMBA tool uses long reads to improve the contiguity of genome assemblies

Aleksey V. Zimin et al.

Summary: Third-generation sequencing technologies generate long reads, which are valuable for resolving complex repeats. An upgrade strategy for existing assemblies is to use long-read data to fill gaps and improve contiguity.

PLOS COMPUTATIONAL BIOLOGY (2022)

Article Biochemical Research Methods

Long-read mapping to repetitive reference sequences using Winnowmap2

Chirag Jain et al.

Summary: Approximately 5-10% of the human genome is inaccessible due to the presence of repetitive sequences. Existing long-read mappers often yield incorrect alignments and variant calls within repetitive sequences. To address this issue, a new long-read mapping method called Winnowmap2 was developed, which is more tolerant of structural variation and more sensitive to paralog-specific variants within repeats.

NATURE METHODS (2022)

Article Multidisciplinary Sciences

The complete sequence of a human genome

Sergey Nurk et al.

Summary: The Telomere-to-Telomere (T2T) Consortium has presented a complete sequence of a human genome, T2T-CHM13, which covers the whole genome except for the Y chromosome. This new sequence includes gapless assemblies, error corrections in previous references, and nearly 200 million base pairs of additional sequence with gene predictions, including protein coding genes. The completion of important regions allows for further studies on genetic variations and functions.

SCIENCE (2022)

Article Genetics & Heredity

High-quality Arabidopsis thaliana Genome Assembly with Nanopore and HiFi Long Reads

Bo Wang et al.

Summary: This study successfully assembled a high-quality and almost complete genome of Arabidopsis thaliana using multiple advanced sequencing technologies. The new genome assembly contains more information compared to the previous reference genome, providing valuable insights into the global pattern of centromeric polymorphisms and the genetic and epigenetic features in plants.

GENOMICS PROTEOMICS & BIOINFORMATICS (2022)

Article Biology

DENTIST-using long reads for closing assembly gaps at high accuracy

Arne Ludwig et al.

Summary: In this study, we present DENTIST, a sensitive, highly accurate, and automated pipeline method for closing gaps in short-read assemblies with long error-prone reads. Through tests on real genomic data, we demonstrate that DENTIST achieves higher accuracy and similar sensitivity compared to previous methods.

GIGASCIENCE (2022)

Article Biochemical Research Methods

Liftoff: accurate mapping of gene annotations

Alaina Shumate et al.

Summary: Advancements in DNA sequencing and computational methods have led to a significant increase in high-quality genome assemblies for many species. To annotate gene features in these genomes, a common strategy is to map genes from a previously annotated reference genome to new or improved assemblies. The tool Liftoff can accurately map genes between the same or closely related species, ensuring high sequence identity and preserving gene structure.

BIOINFORMATICS (2021)

Review Biology

Significantly improving the quality of genome assemblies through curation

Kerstin Howe et al.

Summary: Genome sequence assemblies are crucial for understanding biology, but achieving error-free assemblies remains a challenge. Assembly evaluation and curation play a key role in reducing errors and improving assembly quality. Insights gained from curation can lead to significant improvements in genome assembly.

GIGASCIENCE (2021)

Article Biochemical Research Methods

Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm

Haoyu Cheng et al.

Summary: hifiasm is a novel assembler that utilizes long high-fidelity sequence reads to accurately represent haplotype information, outperforming existing tools in haplotype-resolved assembly on various datasets, including a hexaploid genome dataset.

NATURE METHODS (2021)

Article Multidisciplinary Sciences

The genetic and epigenetic landscape of the Arabidopsis centromeres

Matthew Naish et al.

Summary: The study used long-read sequencing to assemble the Arabidopsis genome and resolve all five centromeres. It found that centromeres consist of megabase-scale tandemly repeated satellite arrays and are invaded by retrotransposons. The centromeres are evolving through cycles of satellite homogenization and retrotransposon-driven diversification.

SCIENCE (2021)

Article Biology

Twelve years of SAMtools and BCFtools

Petr Danecek et al.

Summary: SAMtools and BCFtools are widely used tools for processing high-throughput sequencing data, with a history of 12 years of continuous development and improvement. These packages have been utilized in various software projects and genomic pipelines and are freely available on GitHub.

GIGASCIENCE (2021)

Article Multidisciplinary Sciences

Towards complete and error-free genome assemblies of all vertebrate species

Arang Rhie et al.

Summary: The Vertebrate Genome Project and the international Genome 10K consortium have collaborated to generate high-quality genome assemblies for 16 species representing six major vertebrate lineages, leading to new biological discoveries. Long-read sequencing technologies are essential for maximizing genome quality, and addressing complex repeats and haplotype heterozygosity are crucial for reducing assembly errors and improving completeness of reference genomes. The lessons learned from this project have paved the way for the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all known vertebrate species.

NATURE (2021)

Article Biochemical Research Methods

Cooler: scalable storage for Hi-C data and other genomically labeled arrays

Nezar Abdennur et al.

BIOINFORMATICS (2020)

Article Biochemical Research Methods

Identifying and removing haplotypic duplication in primary genome assemblies

Dengfeng Guan et al.

BIOINFORMATICS (2020)

Review Genetics & Heredity

Long-read human genome sequencing and its applications

Glennis A. Logsdon et al.

NATURE REVIEWS GENETICS (2020)

Article Multidisciplinary Sciences

Genome of Solanum pimpinellifolium provides insights into structural variants during tomato breeding

Xin Wang et al.

NATURE COMMUNICATIONS (2020)

Article Biotechnology & Applied Microbiology

Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies

Arang Rhie et al.

GENOME BIOLOGY (2020)

Letter Biotechnology & Applied Microbiology

CRISPResso2 provides accurate and rapid genome editing sequence analysis

Kendell Clement et al.

NATURE BIOTECHNOLOGY (2019)

Article Multidisciplinary Sciences

Genetic compensation triggered by mutant mRNA degradation

Mohamed A. El-Brolosy et al.

NATURE (2019)

Article Biotechnology & Applied Microbiology

Assembly of long, error-prone reads using repeat graphs

Mikhail Kolmogorov et al.

NATURE BIOTECHNOLOGY (2019)

Article Biotechnology & Applied Microbiology

Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome

Aaron M. Wenger et al.

NATURE BIOTECHNOLOGY (2019)

Article Biochemical Research Methods

Integrating Hi-C links with assembly graphs for chromosome-scale assembly

Jay Ghurye et al.

PLOS COMPUTATIONAL BIOLOGY (2019)

Article Genetics & Heredity

The complex architecture and epigenomic impact of plant T-DNA insertions

Florian Jupe et al.

PLOS GENETICS (2019)

Article Biotechnology & Applied Microbiology

RaGOO: fast and accurate reference-guided scaffolding of draft genomes

Michael Alonge et al.

GENOME BIOLOGY (2019)

Article Biotechnology & Applied Microbiology

Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline

Shujun Ou et al.

GENOME BIOLOGY (2019)

Article Biochemical Research Methods

Minimap2: pairwise alignment for nucleotide sequences

Heng Li

BIOINFORMATICS (2018)

Article Biochemical Research Methods

Accurate detection of complex structural variations using single-molecule sequencing

Fritz J. Sedlazeck et al.

NATURE METHODS (2018)

Article Biotechnology & Applied Microbiology

HiGlass: web-based visual exploration and analysis of genome interaction maps

Peter Kerpedjiev et al.

GENOME BIOLOGY (2018)

Article Biochemical Research Methods

CAMSA: a tool for comparative analysis and merging of scaffold assemblies

Sergey S. Aganezov et al.

BMC BIOINFORMATICS (2017)

Article Biotechnology & Applied Microbiology

Scaffolding of long read assemblies using long range contact information

Jay Ghurye et al.

BMC GENOMICS (2017)

Article Biochemistry & Molecular Biology

Bypassing Negative Epistasis on Yield in Tomato Imposed by a Domestication Gene

Sebastian Soyk et al.

Article Biotechnology & Applied Microbiology

A comparative evaluation of genome assembly reconciliation tools

Hind Alhakami et al.

GENOME BIOLOGY (2017)

Article Biochemical Research Methods

Assemblytics: a web analytics tool for the detection of variants from an assembly

Maria Nattestad et al.

BIOINFORMATICS (2016)

Article Biochemistry & Molecular Biology

1,135 Genomes Reveal the Global Pattern of Polymorphism in Arabidopsis thaliana

Carlos Alonso-Blanco et al.

Article Biochemistry & Molecular Biology

Heatmapper: web-enabled heat mapping for all

Sasha Babicki et al.

NUCLEIC ACIDS RESEARCH (2016)

Article Biotechnology & Applied Microbiology

Modification of plant regeneration medium decreases the time for recovery of Solanum lycopersicum cultivar M82 stable transgenic lines

Sarika Gupta et al.

PLANT CELL TISSUE AND ORGAN CULTURE (2016)

Article Biochemical Research Methods

The Sequence Alignment/Map format and SAMtools

Heng Li et al.

BIOINFORMATICS (2009)

Article Biochemistry & Molecular Biology

The Making of a Compound Inflorescence in Tomato and Related Nightshades

Zachary B. Lippman et al.

PLOS BIOLOGY (2008)

Article Biochemical Research Methods

Assembly reconciliation

Aleksey V. Zimin et al.

BIOINFORMATICS (2008)

Article Biochemical Research Methods

WindowMasker:: window-based masker for sequenced genomes

A Morgulis et al.

BIOINFORMATICS (2006)

Article Plant Sciences

In silico screening of a saturated mutation library of tomato

N Menda et al.

PLANT JOURNAL (2004)

Article Biotechnology & Applied Microbiology

Versatile and open software for comparing large genomes

S Kurtz et al.

GENOME BIOLOGY (2004)