☆ 4.7 Article

Federated learning for computational pathology on gigapixel whole slide images

MEDICAL IMAGE ANALYSIS (2022)

Journal

MEDICAL IMAGE ANALYSIS

Volume 76, Issue -, Pages -

Publisher

ELSEVIER

DOI: 10.1016/j.media.2021.102298

Keywords

Federated learning; Pathology; Computational pathology; Whole slide imaging; Split learning

Funding

BWH Pathology
NIH National Institute of General Medical Sciences (NIGMS) [R35GM138216]
Google Cloud Research Grant
Nvidia GPU Grant Program
NSF

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Deep learning-based computational pathology algorithms show great potential in various tasks, but require large high-quality training data. Collaborative integration of medical data from multiple institutions can enhance model performance, although privacy concerns and data sharing complexities remain challenging.

Deep Learning-based computational pathology algorithms have demonstrated profound ability to excel in a wide array of tasks that range from characterization of well known morphological phenotypes to predicting non human-identifiable features from histology such as molecular alterations. However, the development of robust, adaptable and accurate deep learning-based models often rely on the collection and time-costly curation large high-quality annotated training data that should ideally come from diverse sources and patient populations to cater for the heterogeneity that exists in such datasets. Multi-centric and collaborative integration of medical data across multiple institutions can naturally help overcome this challenge and boost the model performance but is limited by privacy concerns among other difficulties that may arise in the complex data sharing process as models scale towards using hundreds of thousands of gigapixel whole slide images. In this paper, we introduce privacy-preserving federated learning for gigapixel whole slide images in computational pathology using weakly-supervised attention multiple instance learning and differential privacy. We evaluated our approach on two different diagnostic problems using thousands of histology whole slide images with only slide-level labels. Additionally, we present a weakly-supervised learning framework for survival prediction and patient stratification from whole slide images and demonstrate its effectiveness in a federated setting. Our results show that using federated learning, we can effectively develop accurate weakly-supervised deep learning models from distributed data silos without direct data sharing and its associated complexities, while also preserving differential privacy using randomized noise generation. We also make available an easy-to-use federated learning for computational pathology software package: http://github.com/mahmoodlab/HistoFL . (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )

Federated learning for computational pathology on gigapixel whole slide images

Journal

MEDICAL IMAGE ANALYSIS

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Federated learning for computational pathology on gigapixel whole slide images

Journal

MEDICAL IMAGE ANALYSIS

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper