4.6 Article

Improving variant calling using population data and deep learning

Related references

Note: Only some of the references are listed here; download the original article for the complete reference information.

Article Biochemistry & Molecular Biology

GENCODE: reference annotation for the human and mouse genomes in 2023

Adam Frankish et al.

Summary: GENCODE provides high quality gene and transcript annotation for the human and mouse genomes, supported by experimental data, serving as a reference for genome biology and clinical genomics. The consortium generates data, develops tools and carries out analyses to support the identification and annotation of transcript structures and their function.

NUCLEIC ACIDS RESEARCH (2023)

Article Multidisciplinary Sciences

Management of validation of HPLC method for determination of acetylsalicylic acid impurities in a new pharmaceutical product

Malgorzata Kowalska et al.

Summary: The study focuses on validating a method for determining the content of salicylic acid and unknown impurities in new pharmaceutical tablets. HPLC was used for separation, and the method was confirmed to be linear, precise, and accurate through validation tests.

SCIENTIFIC REPORTS (2022)

Article Biochemical Research Methods

indelPost: harmonizing ambiguities in simple and complex indel alignments

Kohei Hagiwara et al.

Summary: Small insertions and deletions (indels) in nucleotide sequence can be represented differently by mapping algorithms and variant callers, especially for complex indels. This can lead to ambiguity and incomplete allele representation, potentially causing critical misannotation of variant effect. The Python library indelPost was introduced to address these issues by harmonizing ambiguities for both simple and complex indels through realignment and read-based phasing, showing improved performance over specialized tools for complex indel analysis.

BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

Pan-human consensus genome significantly improves the accuracy of RNA-seq analyses

Benjamin Kaminow et al.

Summary: The study explores the consensus genome as a potential successor of the Human Reference Genome and evaluates its impact on the accuracy of RNA-seq read alignment. The findings suggest that using consensus genomes can significantly reduce mapping errors in certain cases, and incorporating more specific genomic variations has limited utility in improving accuracy.

GENOME RESEARCH (2022)

Article Biochemistry & Molecular Biology

High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios

Marta Byrska-Bishop et al.

Summary: The 1000 Genomes Project (1kGP) is the largest open resource of whole-genome sequencing (WGS) data. A new high-coverage WGS release of the 1kGP cohort improves the sensitivity and accuracy of variant calls, making the resource more valuable for association studies.

Article Biotechnology & Applied Microbiology

Fully phased human genome assembly without parental data using single-cell strand sequencing and long reads

David Porubsky et al.

Summary: This study introduces a reference-free workflow for diploid de novo genome assembly, combining single-cell strand sequencing with continuous long-read or high-fidelity sequencing data. By employing this strategy, the researchers successfully generated completely phased de novo genome assemblies for each haplotype of an individual of Puerto Rican descent, demonstrating high accuracy and contiguity with low switch error rates. A comparison of Oxford Nanopore Technologies and Pacific Biosciences phased assemblies identified regions that are preferential sites of contig breaks, regardless of sequencing technology or phasing algorithms.

NATURE BIOTECHNOLOGY (2021)

Article Multidisciplinary Sciences

Exome sequencing and analysis of 454,787 UK Biobank participants

Joshua D. Backman et al.

Summary: In this study, whole-exome sequencing was used to identify gene-trait associations in 454,787 individuals, revealing 564 distinct genes with significant trait associations. Rare variant associations were enriched in loci from genome-wide association studies (GWAS) but most were independent of common variant signals.

NATURE (2021)

Article Multidisciplinary Sciences

Haplotype-resolved diverse human genomes and integrated analysis of structural variation

Peter Ebert et al.

Summary: Through the use of long-read and strand-specific sequencing technologies, this study has achieved the de novo assembly of high-quality haplotype-resolved human genomes without the need for parent-child trio data. The research identified various forms of genetic variation, including structural variants and SV hotspots, and revealed the mechanisms of SV formation while providing SV candidates for adaptive selection within the human population.

SCIENCE (2021)

Article Biotechnology & Applied Microbiology

A unified haplotype-based method for accurate and comprehensive variant calling

Daniel P. Cooke et al.

Summary: Octopus is a variant caller that uses a polymorphic Bayesian genotyping model capable of modeling different experimental designs within a unified haplotype-aware framework. It accurately calls germline variants in individuals as well as low-frequency somatic variants, while producing fewer false positives than other methods. Octopus also outputs realigned evidence BAM files to assist with validation and interpretation.

NATURE BIOTECHNOLOGY (2021)

Article Genetics & Heredity

Long-read sequencing of 3,622 Icelanders provides insight into the role of structural variants in human diseases and other traits

Doruk Beyter et al.

Summary: Analysis of long-read sequencing data from 3,622 Icelanders identifies a set of high-confidence structural variants and provides insights into their effect on human traits and diseases.

NATURE GENETICS (2021)

Article Multidisciplinary Sciences

Rare variant contribution to human disease in 281,104 UK Biobank exomes

Quanli Wang et al.

Summary: The study reveals a significant contribution of rare variants to common disease, with a large number of gene-phenotype associations detected through gene-based collapsing analysis that cannot be identified in single-variant association tests. Rare variants are also significantly enriched for loss-of-function-mediated traits and approved drug targets.

NATURE (2021)

Article Genetics & Heredity

Advancing human genetics research and drug discovery through exome sequencing of the UK Biobank

Joseph D. Szustakowski et al.

Summary: The UK Biobank Exome Sequencing Consortium is a successful collaborative project between UK Biobank and biopharmaceutical companies, providing valuable rare coding variation resources for drug discovery. The project has strengthened academic and industry ties and promoted interaction and learning within the wider research community.

NATURE GENETICS (2021)

Review Genetics & Heredity

Towards population-scale long-read sequencing

Wouter De Coster et al.

Summary: Long-read sequencing technologies have advanced to the point where they can be applied to variant detection at a population scale. New computational tools have led to the emergence of population-scale studies in the past two years, with many more expected in the future. The review covers recent developments, challenges, experimental design guidance, as well as strategies for variant validation and genotyping.

NATURE REVIEWS GENETICS (2021)

Article Biotechnology & Applied Microbiology

Reference flow: reducing reference bias using multiple population genomes

Nae-Chyun Chen et al.

Summary: This paper introduces reference flow, an alignment method that improves alignment accuracy and reduces reference bias by using multiple population reference genomes. Compared with the graph aligner vg, reference flow achieves similar accuracy and bias avoidance while using only 14% of the memory footprint and running at 5.5 times the speed.

GENOME BIOLOGY (2021)

Article Genetics & Heredity

Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era

Nikita Abramovs et al.

FRONTIERS IN GENETICS (2020)

Article Multidisciplinary Sciences

The mutational constraint spectrum quantified from variation in 141,456 humans

Konrad J. Karczewski et al.

NATURE (2020)

Editorial Material Genetics & Heredity

The road ahead in genetics and genomics

Amy L. McGuire et al.

NATURE REVIEWS GENETICS (2020)

Article Biochemical Research Methods

Accurate, scalable cohort variant calls using DeepVariant and GLnexus

Taedong Yun et al.

Summary: The study presents an open-source cohort-calling method using DeepVariant and GLnexus to optimize analysis-ready cohort-level variants, showing superior results compared to GATK Best Practices in the 1000 Genomes Project samples.

BIOINFORMATICS (2020)

Article Medicine, General & Internal

Detection of Pathogenic Variants With Germline Genetic Testing Using Deep Learning vs Standard Methods in Patients With Prostate Cancer and Melanoma

Saud H. AlDubayan et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2020)

Editorial Material Biochemistry & Molecular Biology

The Missing Diversity in Human Genetic Studies

Giorgio Sirugo et al.

Article Biotechnology & Applied Microbiology

Best practices for benchmarking germline small-variant calls in human genomes

Peter Krusche et al.

NATURE BIOTECHNOLOGY (2019)

Article Genetics & Heredity

Clinical use of current polygenic risk scores may exacerbate health disparities

Alicia R. Martin et al.

NATURE GENETICS (2019)

Article Biotechnology & Applied Microbiology

Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome

Aaron M. Wenger et al.

NATURE BIOTECHNOLOGY (2019)

Article Biochemical Research Methods

Strelka2: fast and accurate calling of germline and somatic variants

Sangtae Kim et al.

NATURE METHODS (2018)

Article Biotechnology & Applied Microbiology

A universal SNP and small-indel variant caller using deep neural networks

Ryan Poplin et al.

NATURE BIOTECHNOLOGY (2018)

Article Multidisciplinary Sciences

Comparison of three variant callers for human whole genome sequencing

Anna Supernat et al.

SCIENTIFIC REPORTS (2018)

Article Biochemical Research Methods

VarMatch: robust matching of small variant datasets using flexible scoring schemes

Chen Sun et al.

BIOINFORMATICS (2017)

Article Genetics & Heredity

The time and place of European admixture in Ashkenazi Jewish history

James Xue et al.

PLOS GENETICS (2017)

Article Multidisciplinary Sciences

Extensive sequencing of seven human genomes to characterize benchmark reference materials

Justin M. Zook et al.

SCIENTIFIC DATA (2016)

Article Multidisciplinary Sciences

A global reference for human genetic variation

David M. Altshuler et al.

NATURE (2015)

Article Biochemistry & Molecular Biology

ClinVar: public archive of relationships among sequence variation and human phenotype

Melissa J. Landrum et al.

NUCLEIC ACIDS RESEARCH (2014)

Article Genetics & Heredity

Rare-Variant Association Testing for Sequencing Data with the Sequence Kernel Association Test

Michael C. Wu et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2011)

Article Genetics & Heredity

A framework for variation discovery and genotyping using next-generation DNA sequencing data

Mark A. DePristo et al.

NATURE GENETICS (2011)

Article Genetics & Heredity

Genetic similarities within and between human populations

D. J. Witherspoon et al.

GENETICS (2007)