4.6 Article

An Image Turing Test on Realistic Gastroscopy Images Generated by Using the Progressive Growing of Generative Adversarial Networks

Journal

JOURNAL OF DIGITAL IMAGING
Volume 36, Issue 4, Pages 1760-1769

Publisher

SPRINGER
DOI: 10.1007/s10278-023-00803-2

Keywords

Generative adversarial networks; Gastroscopy image; Synthetic image; Image Turing test; Progressive growing of generative adversarial networks

Ask authors/readers for more resources

Generative adversarial networks (GAN) in medicine are valuable techniques for producing high-quality gastrointestinal images. This study used the progressive growing of GAN (PGGAN) to generate realistic images and investigated its limitations. The accuracy and sensitivity of endoscopists in distinguishing real and synthetic images were not significantly different. Real images with the anatomical landmark pylorus had higher detection sensitivity. However, GANs need improvement in representing rugal folds and mucous membrane texture.
Generative adversarial networks (GAN) in medicine are valuable techniques for augmenting unbalanced rare data, anomaly detection, and avoiding patient privacy issues. However, there were limits to generating high-quality endoscopic images with various characteristics, such as peristalsis, viewpoints, light sources, and mucous patterns. This study used the progressive growing of GAN (PGGAN) within the normal distribution dataset to confirm the ability to generate high-quality gastrointestinal images and investigated what barriers PGGAN has to generate endoscopic images. We trained the PGGAN with 107,060 gastroscopy images from 4165 normal patients to generate highly realistic 512(2) pixel-sized images. For the evaluation, visual Turing tests were conducted on 100 real and 100 synthetic images to distinguish the authenticity of images by 19 endoscopists. The endoscopists were divided into three groups based on their years of clinical experience for subgroup analysis. The overall accuracy, sensitivity, and specificity of the 19 endoscopist groups were 61.3%, 70.3%, and 52.4%, respectively. The mean accuracy of the three endoscopist groups was 62.4 [Group I], 59.8 [Group II], and 59.1% [Group III], which was not considered a significant difference. There were no statistically significant differences in the location of the stomach. However, the real images with the anatomical landmark pylorus had higher detection sensitivity. The images generated by PGGAN showed highly realistic depictions that were difficult to distinguish, regardless of their expertise as endoscopists. However, it was necessary to establish GANs that could better represent the rugal folds and mucous membrane texture.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available