4.6 Article

Uncertainty-Aware Semi-Supervised Method Using Large Unlabeled and Limited Labeled COVID-19 Data

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3462635

Keywords

Semi-supervised learning; generative adversarial networks; COVID-19; supervised learning; deep learning

Funding

  1. MINECO/FEDER [RTI2018-098913-B100, CV20-45250, A-TIC-080UGR18]

Ask authors/readers for more resources

This article introduces a semi-supervised classification method using limited labeled data, relying on Sobel edge detection and Generative Adversarial Networks (GANs) to automate COVID-19 diagnosis. Experimental results demonstrate that the proposed method significantly outperforms supervised learning methods in cases where labeled data is scarce.
The new coronavirus has caused more than one million deaths and continues to spread rapidly. This virus targets the lungs, causing respiratory distress which can be mild or severe. The X-ray or computed tomography (CT) images of lungs can reveal whether the patient is infected with COVID-19 or not. Many researchers are trying to improve COVID-19 detection using artificial intelligence. Our motivation is to develop an auto-maticmethod that can cope with scenarios inwhich preparing labeled data is time consuming or expensive. In this article, we propose a Semi-supervised Classification using Limited Labeled Data (SCLLD) relying on Sobel edge detection and Generative Adversarial Networks (GANs) to automate the COVID-19 diagnosis. The GAN discriminator output is a probabilistic value which is used for classification in this work. The proposed system is trained using 10,000 CT scans collected from Omid Hospital, whereas a public dataset is also used for validating our system. The proposed method is compared with other state-of-the-art supervised methods such as Gaussian processes. To the best of our knowledge, this is the first time a semi-supervised method for COVID-19 detection is presented. Our system is capable of learning from a mixture of limited labeled and unlabeled datawhere supervised learners fail due to a lack of sufficient amount of labeled data. Thus, our semi-supervised training method significantly outperforms the supervised training of Convolutional Neural Network (CNN) when labeled training data is scarce. The 95% confidence intervals for our method in terms of accuracy, sensitivity, and specificity are 99.56 +/- 0.20%, 99.88 +/- 0.24%, and 99.40 +/- 0.18%, respectively, whereas intervals for the CNN (trained supervised) are 68.34 +/- 4.11%, 91.2 +/- 6.15%, and 46.40 +/- 5.21%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available