3.8 Proceedings Paper

Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images

出版社

IEEE
DOI: 10.1109/ICCV48922.2021.00398

关键词

-

资金

  1. BWH Pathology
  2. Nvidia GPU Grant Program
  3. NIGMS [R35GM138216]
  4. NSF

向作者/读者索取更多资源

This paper introduces a Multimodal Co-Attention Transformer (MCAT) framework that learns a dense co-attention mapping between WSIs and genomic features in an embedding space. By reducing the space complexity of WSI bags, this method demonstrates superior performance in multiple instance learning.
Survival outcome prediction is a challenging weakly-supervised and ordinal regression task in computational pathology that involves modeling complex interactions within the tumor microenvironment in gigapixel whole slide images (WSIs). Despite recent progress in formulating WSIs as bags for multiple instance learning (MIL), representation learning of entire WSIs remains an open and challenging problem, especially in overcoming: 1) the computational complexity of feature aggregation in large bags, and 2) the data heterogeneity gap in incorporating biological priors such as genomic measurements. In this work, we present a Multimodal Co-Attention Transformer (MCAT) framework that learns an interpretable, dense co-attention mapping between WSIs and genomic features formulated in an embedding space. Inspired by approaches in Visual Question Answering (VQA) that can attribute how word embeddings attend to salient objects in an image when answering a question, MCAT learns how histology patches attend to genes when predicting patient survival. In addition to visualizing multimodal interactions, our co-attention transformation also reduces the space complexity of WSI bags, which enables the adaptation of Transformer layers as a general encoder backbone in MIL. We apply our proposed method on five different cancer datasets (4,730 WSIs, 67 million patches). Our experimental results demonstrate that the proposed method consistently achieves superior performance compared to the state-of-the-art methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据