Review

Original Research Skin cancer classification via convolutional neural networks: systematic review of studies involving human experts

Journal

EUROPEAN JOURNAL OF CANCER
Volume 156, Pages 202-216

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ejca.2021.06.049

Keywords

Skin cancer classification; Digital biomarkers; Convolutional neural network(s); Artificial intelligence; Machine learning; Deep learning; Dermatology; Malignant melanoma

Funding

  1. Federal Ministry of Health, Berlin, Germany
  2. NIH/NCI Cancer Center Support Grant [P30 CA008748]

The study systematically analyzed the research status of reader studies involving melanoma and found that CNN-based classifiers demonstrated superior or at least equivalent performance compared with clinicians. However, it was noted that most studies were conducted in highly artificial settings, with test sets not representing the full range of patient populations and melanoma subtypes encountered in clinical practice.
Background: Multiple studies have compared the performance of artificial intelligence (AI)-based models for automated skin cancer classification to human experts, thus setting the cornerstone for a successful translation of AI-based tools into clinicopathological practice.

Objective: The objective of the study was to systematically analyse the current state of research on reader studies involving melanoma and to assess their potential clinical relevance by evaluating three main aspects: test set characteristics (holdout/out-of-distribution data set, composition), test setting (experimental/clinical, inclusion of metadata) and representativeness of participating clinicians.

Methods: PubMed, Medline and ScienceDirect were screened for peer-reviewed studies published between 2017 and 2021 and dealing with AI-based skin cancer classification involving melanoma. The search terms skin cancer classification, deep learning, convolutional neural network (CNN), melanoma (detection), digital biomarkers, histopathology and whole slide imaging were combined. Based on the search results, only studies that considered direct comparison of AI results with clinicians and had a diagnostic classification as their main objective were included.

Results: A total of 19 reader studies fulfilled the inclusion criteria. Of these, 11 CNN-based approaches addressed the classification of dermoscopic images; 6 concentrated on the classification of clinical images, whereas 2 dermatopathological studies utilised digitised histopathological whole slide images.

Conclusions: All 19 included studies demonstrated superior or at least equivalent performance of CNN-based classifiers compared with clinicians. However, almost all studies were conducted in highly artificial settings based exclusively on single images of the suspicious lesions. Moreover, test sets mainly consisted of holdout images and did not represent the full range of patient populations and melanoma subtypes encountered in clinical practice.

© 2021 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
