4.1 Article

Boosting for statistical modelling: A non-technical introduction

Journal

STATISTICAL MODELLING
Volume 18, Issue 3-4, Pages 365-384

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/1471082X17748086

Keywords

variable selection; high-dimensional data; model choice; statistical learning

Funding

  1. Deutsche Forschungsgemeinschaft (DFG) [SCHM 2966/1-2]
  2. Interdisciplinary Center for Clinical Research (IZKF) of the Friedrich-Alexander-University Erlangen-Nurnberg [J49]

Ask authors/readers for more resources

Boosting algorithms were originally developed for machine learning but were later adapted to estimate statistical models-offering various practical advantages such as automated variable selection and implicit regularization of effect estimates. The interpretation of the resulting models, however, remains the same as if they had been fitted by classical methods. Boosting, hence, allows to use an advanced machine learning scheme to estimate various types of statistical models. This tutorial aims to highlight how boosting can be used for semi-parametric modelling, what practical implications follow from the design of the algorithm and what kind of drawbacks data analysts have to expect. We illustrate the application of boosting in the analysis of a stunting score from children in India and a high-dimensional dataset of tumour DNA to develop a biomarker for the occurrence of metastases in breast cancer patients.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available