☆ 3.8 Proceedings Paper

DT-MIL: Deformable Transformer for Multi-instance Learning on Histopathological Image

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII (2021)

期刊

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII

卷 12908, 期 -, 页码 206-216

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

DOI: 10.1007/978-3-030-87237-3_20

关键词

Deformable transformer; Multi-instance learning; Key-value attention; Histopathological image analysis

类别

Acoustics Computer Science, Artificial Intelligence Engineering, Biomedical Medicine, General & Internal Microscopy Radiology, Nuclear Medicine & Medical Imaging

资金

National Key R&D Program of China [2018YFC2000702]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The article introduces a novel embedded-space MIL model, based on deformable transformer (DT) architecture and convolutional layers, termed DT-MIL, which outperforms other MIL models by generating the bag representation in a fully trainable way, representing the bag with a high-level and nonlinear combination of all instances, and encoding the position relationship and context information during bag embedding phase.

Learning informative representations is crucial for classification and prediction tasks on histopathological images. Due to the huge image size, whole-slide histopathological image analysis is normally addressed with multi-instance learning (MIL) scheme. However, the weakly supervised nature of MIL leads to the challenge of learning an effective whole-slide-level representation. To tackle this issue, we present a novel embedded-space MIL model based on deformable transformer (DT) architecture and convolutional layers, which is termed DT-MIL. The DT architecture enables our MIL model to update each instance feature by globally aggregating instance features in a bag simultaneously and encoding the position context information of instances during bag representation learning. Compared with other state-of-the-art MIL models, our model has the following advantages: (1) generating the bag representation in a fully trainable way, (2) representing the bag with a high-level and nonlinear combination of all instances instead of fixed pooling-based methods (e.g. max pooling and average pooling) or simply attention-based linear aggregation, and (3) encoding the position relationship and context information during bag embedding phase. Besides our proposed DT-MIL, we also develop other possible transformer-based MILs for comparison. Extensive experiments show that our DT-MIL outperforms the state-of-the-art methods and other transformer-based MIL architectures in histopathological image classification and prediction tasks. An open-source implementation of our approach can be found at https://github.com/yfzon/DT-MIL.

DT-MIL: Deformable Transformer for Multi-instance Learning on Histopathological Image

期刊

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

DT-MIL: Deformable Transformer for Multi-instance Learning on Histopathological Image

期刊

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文