Article

GAVISUNK: genome assembly validation via inter-SUNK distances in Oxford Nanopore reads

Related references

Note: Only some of the references are listed here; see the original article for the complete reference list.
Article Multidisciplinary Sciences

The complete sequence of a human genome

Sergey Nurk et al.

Summary: The Telomere-to-Telomere (T2T) Consortium presents a complete sequence of a human genome, T2T-CHM13, which covers the whole genome except for the Y chromosome. The new sequence includes gapless assemblies, corrects errors in previous references, and adds nearly 200 million base pairs of sequence with gene predictions, including protein-coding genes. Completing these regions enables further studies of genetic variation and function.

SCIENCE (2022)

Article Multidisciplinary Sciences

Segmental duplications and their variation in a complete human genome

Mitchell R. Vollger et al.

Summary: This study presents a comprehensive view of human segmental duplication (SD) organization using a complete human genome. SDs account for nearly one-third of the sequence added in the complete genome, and they show evolutionary differences and structural diversity between humans and other primates.

SCIENCE (2022)

Article Biochemical Research Methods

Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm

Haoyu Cheng et al.

Summary: hifiasm is a novel assembler that utilizes long high-fidelity sequence reads to accurately represent haplotype information, outperforming existing tools in haplotype-resolved assembly on various datasets, including a hexaploid genome dataset.

NATURE METHODS (2021)

Article Multidisciplinary Sciences

Haplotype-resolved diverse human genomes and integrated analysis of structural variation

Peter Ebert et al.

Summary: Through the use of long-read and strand-specific sequencing technologies, this study has achieved the de novo assembly of high-quality haplotype-resolved human genomes without the need for parent-child trio data. The research identified various forms of genetic variation, including structural variants and SV hotspots, and revealed the mechanisms of SV formation while providing SV candidates for adaptive selection within the human population.

SCIENCE (2021)

Article Multidisciplinary Sciences

The structure, function and evolution of a complete human chromosome 8

Glennis A. Logsdon et al.

Summary: The study completed the linear assembly of human chromosome 8 using long-read sequencing technologies, resolving five long-standing gaps, including regions important for disease risk. The research revealed that the centromeric alpha-satellite sequence is generally methylated, and conducted comparative analysis of centromeres in chimpanzee, orangutan and macaque. The study estimates that the mutation rate of centromeric satellite DNA is accelerated compared to unique portions of the genome.

NATURE (2021)

Article Multidisciplinary Sciences

Towards complete and error-free genome assemblies of all vertebrate species

Arang Rhie et al.

Summary: The Vertebrate Genomes Project (VGP) and the international Genome 10K consortium have collaborated to generate high-quality genome assemblies for 16 species representing six major vertebrate lineages, leading to new biological discoveries. Long-read sequencing technologies are essential for maximizing genome quality, and addressing complex repeats and haplotype heterozygosity is crucial for reducing assembly errors and improving the completeness of reference genomes. The lessons learned have paved the way for the VGP's international effort to generate high-quality, complete reference genomes for all known vertebrate species.

NATURE (2021)

Article Biochemical Research Methods

TandemTools: mapping long reads and assessing/improving assembly quality in extra-long tandem repeats

Alla Mikheenko et al.

BIOINFORMATICS (2020)

Article Multidisciplinary Sciences

Telomere-to-telomere assembly of a complete human X chromosome

Karen H. Miga et al.

NATURE (2020)

Review Genetics & Heredity

Long-read human genome sequencing and its applications

Glennis A. Logsdon et al.

NATURE REVIEWS GENETICS (2020)

Review Multidisciplinary Sciences

Array programming with NumPy

Charles R. Harris et al.

NATURE (2020)

Article Multidisciplinary Sciences

The ENCODE Blacklist: Identification of Problematic Regions of the Genome

Haley M. Amemiya et al.

SCIENTIFIC REPORTS (2019)

Article Biochemistry & Molecular Biology

HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies

Peter Edge et al.

GENOME RESEARCH (2017)

Article Biochemistry & Molecular Biology

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation

Sergey Koren et al.

GENOME RESEARCH (2017)

Article Biochemical Research Methods

A fast, lock-free approach for efficient parallel counting of occurrences of k-mers

Guillaume Marcais et al.

BIOINFORMATICS (2011)

Article Multidisciplinary Sciences

Diversity of Human Copy Number Variation and Multicopy Genes

Peter H. Sudmant et al.

SCIENCE (2010)

Article Biochemistry & Molecular Biology

DupMasker: A tool for annotating primate segmental duplications

Zhaoshi Jiang et al.

GENOME RESEARCH (2008)