Article

Semi-Supervised Approach for EGFR Mutation Prediction on CT Images

Journal

MATHEMATICS
Volume 10, Issue 22

Publisher

MDPI
DOI: 10.3390/math10224225

Keywords

semi-supervised learning; adversarial training; generative adversarial networks; medical image analysis; genotype prediction

Funding

  1. National Funds through the Portuguese funding agency, FCT (Foundation for Science and Technology), Portugal [LA/P/0063/2020, 2021.05767.BD]

Deep learning methods in medical imaging have shown promising results but require large annotated datasets. This study proposes a semi-supervised learning approach that combines labelled and unlabelled data to improve the efficiency of models that predict mutation status in lung cancer.
Deep learning methods in medical imaging have delivered promising results; however, the success of such models relies heavily on large, properly annotated datasets. Annotating medical images is laborious, expensive, and time-consuming. The difficulty is greater for mutation status labels, since these require additional exams (usually biopsies). Raw images without annotations, on the other hand, are collected extensively as part of clinical routine. This work investigated methods that could mitigate the scarcity of labelled data by using both labelled and unlabelled data to improve the efficiency of predictive models. A semi-supervised learning (SSL) approach was developed to predict epidermal growth factor receptor (EGFR) mutation status in lung cancer in a less invasive manner using 3D CT scans. The proposed approach combines a variational autoencoder (VAE) with adversarial training, with the intention that features learned from unlabelled data to discriminate between images can help in the classification task. To incorporate both labelled and unlabelled images, a traditional variational autoencoder was extended with adversarial training. With the developed method, the best-performing model achieved a mean AUC of 0.701 with only 14% of the training data labelled. This SSL approach improved discrimination ability by nearly 7 percentage points over a fully supervised model developed with the same amount of labelled data, confirming the advantage of such methods when few annotated examples are available.
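
The idea of training on labelled and unlabelled scans together can be illustrated with a generic semi-supervised objective: a VAE loss (reconstruction plus KL divergence) computed on every sample, plus a classification loss computed only on the labelled subset. The sketch below is a minimal illustration under stated assumptions and is not the authors' implementation. The function names, the weights `beta` and `gamma`, and the use of a squared-error reconstruction term are all hypothetical, and the adversarial term described in the abstract is deliberately omitted.

```python
import numpy as np

def kl_to_standard_normal(mu, logvar):
    """Per-sample KL( N(mu, exp(logvar)) || N(0, I) ), summed over latent dims."""
    return -0.5 * np.sum(1.0 + logvar - mu**2 - np.exp(logvar), axis=1)

def binary_cross_entropy(p, y, eps=1e-7):
    """Per-sample binary cross-entropy for predicted probabilities p and labels y."""
    p = np.clip(p, eps, 1.0 - eps)
    return -(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))

def semi_supervised_loss(x, x_recon, mu, logvar, p_mut, y, labelled,
                         beta=1.0, gamma=1.0):
    """Hypothetical combined objective (assumption, not the paper's loss).

    x, x_recon : (n, d) flattened inputs and reconstructions
    mu, logvar : (n, k) latent Gaussian parameters from the encoder
    p_mut      : (n,) predicted probability of the mutation class
    y          : (n,) labels; entries for unlabelled samples are ignored
    labelled   : (n,) boolean mask, True where a label is available
    """
    # Unsupervised part: uses every sample, labelled or not.
    recon = np.sum((x - x_recon) ** 2, axis=1)
    kl = kl_to_standard_normal(mu, logvar)
    unsupervised = np.mean(recon + beta * kl)

    # Supervised part: uses only the labelled subset.
    if labelled.any():
        supervised = np.mean(binary_cross_entropy(p_mut[labelled], y[labelled]))
    else:
        supervised = 0.0
    return unsupervised + gamma * supervised
```

With a mask in which only a small fraction of entries is `True`, the unlabelled samples still shape the latent representation through the reconstruction and KL terms, which is the general mechanism that semi-supervised approaches of this kind rely on.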
