4.7 Article

Lymph Node Metastasis Prediction From Whole Slide Images With Transformer-Guided Multiinstance Learning and Knowledge Transfer

Journal

IEEE TRANSACTIONS ON MEDICAL IMAGING
Volume 41, Issue 10, Pages 2777-2787

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMI.2022.3171418

Keywords

Transformers; Feature extraction; Lymph nodes; Metastasis; Image analysis; Thyroid; Task analysis; Whole slide image analysis; multi-instance learning; transformer; knowledge distillation

Funding

  1. Fundamental Research Funds for the Central Universities [20720190012, 20720210121]

Ask authors/readers for more resources

In this paper, a transformer-guided framework is proposed to predict lymph node metastasis of papillary thyroid carcinoma from whole slide histopathological images (WSIs). The framework incorporates a lightweight feature extractor, clustering-based instance selection, and a transformer-MIL module to improve accuracy. Additionally, an attention-based mutual knowledge distillation paradigm is utilized to enhance the training of the model. Experimental results demonstrate that the proposed approach outperforms state-of-the-art methods in terms of AUC.
The gold standard for diagnosing lymph node metastasis of papillary thyroid carcinoma is to analyze the whole slide histopathological images (WSIs). Due to the large size of WSIs, recent computer-aided diagnosis approaches adopt the multi-instance learning (MIL) strategy and the key part is how to effectively aggregate the information of different instances (patches). In this paper, a novel transformer-guided framework is proposed to predict lymph node metastasis from WSIs, where we incorporate the transformer mechanism to improve the accuracy from three different aspects. First, we propose an effective transformer-based module for discriminative patch feature extraction, including a lightweight feature extractor with a pruned transformer (Tiny-ViT) and a clustering-based instance selection scheme. Next, we propose a new Transformer-MIL module to capture the relationship of different discriminative patches with sparse distribution on WSIs and better nonlinearly aggregate patch-level features into the slide-level prediction. Considering that the slide-level annotation is relatively limited to training a robust Transformer-MIL, we utilize the pathological relationship between the primary tumor and its lymph node metastasis and develop an effective attention-based mutual knowledge distillation (AMKD) paradigm. Experimental results on our collected WSI dataset demonstrate the efficiency of the proposed Transformer-MIL and attention-based knowledge distillation. Our method outperforms the state-of-the-art methods by over 2.72% in AUC (area under the curve).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available