4.6 Article

A statistical boosting framework for polygenic risk scores based on large-scale genotype data

Related references

Note: Only part of the references are listed.
Article Mathematical & Computational Biology

Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank

Ruilin Li et al.

Summary: This paper presents a scalable and efficient algorithm for fitting a Cox proportional hazard model, which is demonstrated to be effective on large-scale and high-dimensional data.

BIOSTATISTICS (2022)

Article Genetics & Heredity

Breast and prostate cancer risk: The interplay of polygenic risk, rare pathogenic germline variants, and family history

Emadeldin Hassanin et al.

Summary: This study aimed to investigate the joint influence of polygenic risk scores (PRS), rare pathogenic germline variants (PVs), and family history on the risk of breast cancer and prostate cancer. The results showed that PRS alone provides a meaningful risk gradient, similar to PVs in moderate risk genes, and acts as a risk modifier when considering high-risk genes. Including family history along with PV and PRS further improves cancer risk stratification.

GENETICS IN MEDICINE (2022)

Article Health Care Sciences & Services

Deselection of base-learners for statistical boosting-with an application to distributional regression

Annika Stroemer et al.

Summary: A new procedure for enhanced variable selection for component-wise gradient boosting is presented, addressing the issue of overfitting in high-dimensional data. Optimal final models are achieved by deselecting unimportant base-learners, improving variable selection and prediction performance.

STATISTICAL METHODS IN MEDICAL RESEARCH (2022)

Article Oncology

The importance of ethnicity: Are breast cancer polygenic risk scores ready for women who are not of White European origin?

D. Gareth Evans et al.

Summary: PRS based on White European ethnicity for breast cancer risk assessment may overestimate risk for women of other ethnicities, particularly those of Black and Jewish origin. Developing ethnicity-specific PRS is urgently needed to expand the application of this technology.

INTERNATIONAL JOURNAL OF CANCER (2022)

Article Genetics & Heredity

Significant sparse polygenic risk scores across 813 traits in UK Biobank

Yosuke Tanigawa et al.

Summary: We conducted a systematic assessment of polygenic risk score (PRS) prediction for over 1,500 traits using genetic and phenotype data from the UK Biobank. We found that sparse PRS models showed significant incremental predictive performance and that the number of genetic variants selected in the model correlated with predictive performance. However, the transferability of sparse PRS models trained on European individuals to non-European individuals in the UK Biobank was limited.

PLOS GENETICS (2022)

Article Genetics & Heredity

Statistical learning for sparser fine-mapped polygenic models: The prediction of LDL-cholesterol

Carlo Maj et al.

Summary: This study proposes and applies a three-step strategy based on existing statistical learning methods to derive sparse models for genome-wide data with a polygenic signal. By using marginal screening, fine-mapping, and statistical boosting, this approach selects and fits multivariable regression models, improving the prediction performance and sparsity of polygenic risk scores.

GENETIC EPIDEMIOLOGY (2022)

Article Genetics & Heredity

Genetics of 35 blood and urine biomarkers in the UK Biobank

Nasa Sinnott-Armstrong et al.

Summary: The study evaluated the genetic basis of 35 blood and urine laboratory measurements in the UK Biobank, identifying 1,857 loci associated with at least one trait. Through Mendelian randomization analysis, 51 causal relationships were discovered, including previously known agonistic effects of urate on gout and cystatin C on stroke. Finally, by developing polygenic risk scores and building 'multi-PRS' models, genetic risk stratification for common diseases was improved.

NATURE GENETICS (2021)

Article Genetics & Heredity

The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation

Samuel A. Lambert et al.

NATURE GENETICS (2021)

Article Multidisciplinary Sciences

Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets

Carla Marquez-Luna et al.

Summary: Polygenic risk prediction is a widely investigated topic with promising clinical applications. The method LDpred-funct leverages trait-specific functional priors to improve prediction accuracy across 21 heritable traits in the UK Biobank. Incorporating functional information has shown promise for enhancing polygenic risk prediction accuracy of complex traits.

NATURE COMMUNICATIONS (2021)

Article Biochemical Research Methods

Randomized boosting with multivariable base-learners for high-dimensional variable selection and prediction

Christian Staerk et al.

Summary: The study introduces three extensions of statistical boosting algorithms, allowing for multi-variable updates in base-learners selection, random preselection, and adaptive preselection based on predictive performance history. These approaches lead to sparser and more interpretable prediction models with competitive performance.

BMC BIOINFORMATICS (2021)

Article Multidisciplinary Sciences

Improved genetic prediction of complex traits from individual-level data or summary statistics

Qianqian Zhang et al.

Summary: The researchers have developed eight prediction tools that allow users to specify the heritability model, showing substantial improvement in predicting complex traits.

NATURE COMMUNICATIONS (2021)

Review Biochemical Research Methods

Tutorial: a guide to performing polygenic risk score analyses

Shing Wan Choi et al.

NATURE PROTOCOLS (2020)

Article Multidisciplinary Sciences

Biophysical ambiguities prevent accurate genetic prediction

Xianghua Li et al.

NATURE COMMUNICATIONS (2020)

Article Biochemical Research Methods

LDpred2: better, faster, stronger

Florian Prive et al.

BIOINFORMATICS (2020)

Article Multidisciplinary Sciences

Polygenic prediction via Bayesian regression and continuous shrinkage priors

Tian Ge et al.

NATURE COMMUNICATIONS (2019)

Article Multidisciplinary Sciences

Improved polygenic prediction by Bayesian multiple regression on summary statistics

Luke R. Lloyd-Jones et al.

NATURE COMMUNICATIONS (2019)

Review Cardiac & Cardiovascular Systems

PCSK9 inhibitors: clinical evidence and implementation

Marc S. Sabatine

NATURE REVIEWS CARDIOLOGY (2019)

Article Biochemical Research Methods

Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr

Florian Prive et al.

BIOINFORMATICS (2018)

Article Health Care Sciences & Services

Lack Of Diversity In Genomic Databases Is A Barrier To Translating Precision Medicine Research Into Practice

Latrice G. Landry et al.

HEALTH AFFAIRS (2018)

Article Statistics & Probability

Boosting for statistical modelling: A non-technical introduction

Andreas Mayr et al.

STATISTICAL MODELLING (2018)

Article Multidisciplinary Sciences

The UK Biobank resource with deep phenotyping and genomic data

Clare Bycroft et al.

NATURE (2018)

Article Genetics & Heredity

Polygenic scores via penalized regression on summary statistics

Timothy Shin Heng Mak et al.

GENETIC EPIDEMIOLOGY (2017)

Article Medicine, General & Internal

Risks of Breast, Ovarian, and Contralateral Breast Cancer for BRCA1 and BRCA2 Mutation Carriers

Karoline B. Kuchenbaecker et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2017)

Article Allergy

The environment, epigenome, and asthma

Ivana V. Yang et al.

JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY (2017)

Article Mathematical & Computational Biology

Probing for Sparse and Fast Variable Selection with Model-Based Boosting

Janek Thomas et al.

COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE (2017)

Article Computer Science, Information Systems

Approaches to Regularized Regression - A Comparison between Gradient Boosting and the Lasso

Tobias Hepp et al.

METHODS OF INFORMATION IN MEDICINE (2016)

Article Genetics & Heredity

Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores

Bjarni J. Vilhjalmsson et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2015)

Article Genetics & Heredity

Efficient Bayesian mixed-model analysis increases association power in large cohorts

Po-Ru Loh et al.

NATURE GENETICS (2015)

Article Biochemical Research Methods

PRSice: Polygenic Risk Score software

Jack Euesden et al.

BIOINFORMATICS (2015)

Article Genetics & Heredity

Inference of the Genetic Architecture Underlying BMI and Height with the Use of 20,240 Sibling Pairs

Gibran Hemani et al.

AMERICAN JOURNAL OF HUMAN GENETICS (2013)

Article Mathematical & Computational Biology

Incorporating group correlations in genome-wide association studies using smoothed group Lasso

Jin Liu et al.

BIOSTATISTICS (2013)

Review Medicine, General & Internal

Lipoprotein(a): resurrected by genetics

F. Kronenberg et al.

JOURNAL OF INTERNAL MEDICINE (2013)

Article Health Care Sciences & Services

Prediction intervals for future BMI values of individual children - a non-parametric approach by quantile boosting

Andreas Mayr et al.

BMC MEDICAL RESEARCH METHODOLOGY (2012)

Review Cardiac & Cardiovascular Systems

Blood pressure and human genetic variation in the general population

Pankaj Arora et al.

CURRENT OPINION IN CARDIOLOGY (2010)

Editorial Material Genetics & Heredity

Hints of hidden heritability in GWAS

Greg Gibson

NATURE GENETICS (2010)

Article Genetics & Heredity

Common SNPs explain a large proportion of the heritability for human height

Jian Yang et al.

NATURE GENETICS (2010)

Review Biochemistry & Molecular Biology

A genetic perspective on coeliac disease

Gosia Trynka et al.

TRENDS IN MOLECULAR MEDICINE (2010)

Article Statistics & Probability

High-dimensional generalized linear models and the lasso

Sara A. van de Geer

ANNALS OF STATISTICS (2008)

News Item Multidisciplinary Sciences

Personal genomes: The case of the missing heritability

Brendan Maher

NATURE (2008)

Article Mathematical & Computational Biology

Group additive regression models for genomic data analysis

Yihui Luan et al.

BIOSTATISTICS (2008)

Article Statistics & Probability

Boosting algorithms:: Regularization, prediction and model fitting

Peter Buehlmann et al.

STATISTICAL SCIENCE (2007)

Article Computer Science, Interdisciplinary Applications

Relaxed lasso

Nicolai Meinshausen

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2007)

Article Mathematical & Computational Biology

Nonparametric pathway-based regression models for analysis of genomic data

Zhi Wei et al.

BIOSTATISTICS (2007)

Article Statistics & Probability

Sparsity oracle inequalities for the Lasso

Florentina Bunea et al.

ELECTRONIC JOURNAL OF STATISTICS (2007)

Article Statistics & Probability

Regularization and variable selection via the elastic net

H Zou et al.

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY (2005)

Article Statistics & Probability

Boosting with the L2 loss:: Regression and classification

P Bühlmann et al.

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2003)

Article Statistics & Probability

Greedy function approximation: A gradient boosting machine

JH Friedman

ANNALS OF STATISTICS (2001)