4.6 Article

An Image Turing Test on Realistic Gastroscopy Images Generated by Using the Progressive Growing of Generative Adversarial Networks

期刊

JOURNAL OF DIGITAL IMAGING
卷 36, 期 4, 页码 1760-1769

出版社

SPRINGER
DOI: 10.1007/s10278-023-00803-2

关键词

Generative adversarial networks; Gastroscopy image; Synthetic image; Image Turing test; Progressive growing of generative adversarial networks

向作者/读者索取更多资源

Generative adversarial networks (GAN) in medicine are valuable techniques for producing high-quality gastrointestinal images. This study used the progressive growing of GAN (PGGAN) to generate realistic images and investigated its limitations. The accuracy and sensitivity of endoscopists in distinguishing real and synthetic images were not significantly different. Real images with the anatomical landmark pylorus had higher detection sensitivity. However, GANs need improvement in representing rugal folds and mucous membrane texture.
Generative adversarial networks (GAN) in medicine are valuable techniques for augmenting unbalanced rare data, anomaly detection, and avoiding patient privacy issues. However, there were limits to generating high-quality endoscopic images with various characteristics, such as peristalsis, viewpoints, light sources, and mucous patterns. This study used the progressive growing of GAN (PGGAN) within the normal distribution dataset to confirm the ability to generate high-quality gastrointestinal images and investigated what barriers PGGAN has to generate endoscopic images. We trained the PGGAN with 107,060 gastroscopy images from 4165 normal patients to generate highly realistic 512(2) pixel-sized images. For the evaluation, visual Turing tests were conducted on 100 real and 100 synthetic images to distinguish the authenticity of images by 19 endoscopists. The endoscopists were divided into three groups based on their years of clinical experience for subgroup analysis. The overall accuracy, sensitivity, and specificity of the 19 endoscopist groups were 61.3%, 70.3%, and 52.4%, respectively. The mean accuracy of the three endoscopist groups was 62.4 [Group I], 59.8 [Group II], and 59.1% [Group III], which was not considered a significant difference. There were no statistically significant differences in the location of the stomach. However, the real images with the anatomical landmark pylorus had higher detection sensitivity. The images generated by PGGAN showed highly realistic depictions that were difficult to distinguish, regardless of their expertise as endoscopists. However, it was necessary to establish GANs that could better represent the rugal folds and mucous membrane texture.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据