☆ 4.7 Article

Reducing variability of breast cancer subtype predictors by grounding deep learning models in prior knowledge

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

Journal

COMPUTERS IN BIOLOGY AND MEDICINE

Volume 138, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.compbiomed.2021.104850

Keywords

Applied computing; Genomics; Bioinformatics; Transcriptomics

Funding

William and Linda Frost Fund
College of Science and Math at California Polytechnic State University, San Luis Obispo
Doris A. Howell Foundation-CSUPERB Research Scholars Program
Cal Poly Digital Transformation Hub by Amazon Web Services

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Deep learning neural networks have shown improved performance in breast cancer subtype classification, but face issues of parameter diversity and distribution mismatch. Embedding prior knowledge from literature can enhance predictive models, particularly in breast cancer research which offers abundant knowledge on subtype biomarkers and genetic signatures.

Deep learning neural networks have improved performance in many cancer informatics problems, including breast cancer subtype classification. However, many networks experience underspecificationwheremultiplecombinationsofparametersachievesimilarperformance, bothin training and validation. Additionally, certain parameter combinations may perform poorly when the test distribution differs from the training distribution. Embedding prior knowledge from the literature may address this issue by boosting predictive models that provide crucial, in-depth information about a given disease. Breast cancer research provides a wealth of such knowledge, particularly in the form of subtype biomarkers and genetic signatures. In this study, we draw on past research on breast cancer subtype biomarkers, label propagation, and neural graph machines to present a novel methodology for embedding knowledge into machine learning systems. We embed prior knowledge into the loss function in the form of inter-subject distances derived from a well-known published breast cancer signature. Our results show that this methodology reduces predictor variability on state-of-the-art deep learning architectures and increases predictor consistency leading to improved interpretation. We find that pathway enrichment analysis is more consistent after embedding knowledge. This novel method applies to a broad range of existing studies and predictive models. Our method moves the traditional synthesis of predictive models from an arbitrary assignment of weights to genes toward a more biologically meaningful approach of incorporating knowledge.

Reducing variability of breast cancer subtype predictors by grounding deep learning models in prior knowledge

Journal

COMPUTERS IN BIOLOGY AND MEDICINE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Reducing variability of breast cancer subtype predictors by grounding deep learning models in prior knowledge

Journal

COMPUTERS IN BIOLOGY AND MEDICINE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper