4.3 Article

A Hybrid Deep Learning Model for Predicting Molecular Subtypes of Human Breast Cancer Using Multimodal Data

Journal

IRBM
Volume 43, Issue 1, Pages 62-74

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.irbm.2020.12.002

Keywords

Breast cancer subtypes; Deep learning; Prediction; Multimodal fusion; Image filtering

Funding

  1. Sichuan Agricultural University Innovation Training Program project [201910626061]

Ask authors/readers for more resources

In this study, a Hybrid DL model based on multimodal data is proposed for the prediction of breast cancer subtypes. By combining patient's gene modality data with image modality data and fusing features using weighted linear aggregation, a more accurate and efficient prediction of breast cancer subtypes is achieved.
Background: The prediction of breast cancer subtypes plays a key role in the diagnosis and prognosis of breast cancer. In recent years, deep learning (DL) has shown good performance in the intelligent prediction of breast cancer subtypes. However, most of the traditional DL models use single modality data, which can just extract a few features, so it cannot establish a stable relationship between patient characteristics and breast cancer subtypes. Dataset: We used the TCGA-BRCA dataset as a sample set for molecular subtype prediction of breast cancer. It is a public dataset that can be obtained through the following link: https://portal .gdc .cancer. gov /projects /TCGA-BRCA Methods: In this paper, a Hybrid DL model based on the multimodal data is proposed. We combine the patient's gene modality data with image modality data to construct a multimodal fusion framework. According to the different forms and states, we set up feature extraction networks respectively, and then we fuse the output of the two feature networks based on the idea of weighted linear aggregation. Finally, the fused features are used to predict breast cancer subtypes. In particular, we use the principal component analysis to reduce the dimensionality of high-dimensional data of gene modality and filter the data of image modality. Besides, we also improve the traditional feature extraction network to make it show better performance. Results: The results show that compared with the traditional DL model, the Hybrid DL model proposed in this paper is more accurate and efficient in predicting breast cancer subtypes. Our model achieved a prediction accuracy of 88.07% in 10 times of 10-fold cross-validation. We did a separate AUC test for each subtype, and the average AUC value obtained was 0.9427. In terms of subtype prediction accuracy, our model is about 7.45% higher than the previous average. (c) 2021 AGBM. Published by Elsevier Masson SAS. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available