4.6 Article

Boosting few-shot confocal endomicroscopy image recognition with feature-level MixSiam

期刊

BIOMEDICAL OPTICS EXPRESS
卷 14, 期 3, 页码 1054-1071

出版社

Optica Publishing Group
DOI: 10.1364/BOE.478832

关键词

-

向作者/读者索取更多资源

This paper proposes a feature-level MixSiam method based on the traditional Siamese network for learning the discriminative features of probe-based confocal laser endomicroscopy (pCLE) images for gastrointestinal (GI) tumor classification. The method consists of two stages: self-supervised learning (SSL) and few-shot learning (FS). In the SSL stage, a feature mixing approach is introduced to enhance the adaptation of the Siamese structure to the intra-class variance of the pCLE dataset. In the FS stage, a pre-trained model obtained through SSL is used as the base learner to enable rapid generalization to other pCLE classification tasks with limited labeled data. Experimental results demonstrate the effectiveness of the proposed method in improving the classification of pCLE images for different stages of tumor development.
As an emerging early diagnostic technology for gastrointestinal diseases, confocal laser endomicroscopy lacks large-scale perfect annotated data, leading to a major challenge in learning discriminative semantic features. So, how should we learn representations without labels or a few labels? In this paper, we proposed a feature-level MixSiam method based on the traditional Siamese network that learns the discriminative features of probe-based confocal laser endomicroscopy (pCLE) images for gastrointestinal (GI) tumor classification. The proposed method is divided into two stages: self-supervised learning (SSL) and few-shot learning (FS). First, in the self-supervised learning stage, the novel feature-level-based feature mixing approach introduced more task-relevant information via regularization, facilitating the traditional Siamese structure can adapt to the large intra-class variance of the pCLE dataset. Then, in the few-shot learning stage, we adopted the pre-trained model obtained through self-supervised learning as the base learner in the few-shot learning pipeline, enabling the feature extractor to learn richer and more transferable visual representations for rapid generalization to other pCLE classification tasks when labeled data are limited. On two disjoint pCLE gastrointestinal image datasets, the proposed method is evaluated. With the linear evaluation protocol, feature-level MixSiam outperforms the baseline by 6% (Top-1) and the supervised model by 2% (Top1), which demonstrates the effectiveness of the proposed feature-level-based feature mixing method. Furthermore, the proposed method outperforms the previous baseline method for the few-shot classification task, which can help improve the classification of pCLE images lacking large-scale annotated data for different stages of tumor development.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据