Article

Federated learning for preserving data privacy in collaborative healthcare research

Journal

DIGITAL HEALTH
Volume 8

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/20552076221134455

Keywords

Federated learning; deep learning; data; security; privacy

Funding

  1. National Institute of General Medical Sciences (NIGMS) of the National Institutes of Health [K23GM140268, R01GM110240]
  2. Thomas H. Maren Fund
  3. National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) of the National Institutes of Health [K01DK120784]
  4. UF Research [AWD09459]
  5. Gatorade Trust, University of Florida
  6. National Science Foundation CAREER award [1750192]
  7. National Institute on Aging (NIA) [P30AG028740, R01AG05533]
  8. National Institute of Biomedical Imaging and Bioengineering (NIBIB) [1R21EB027344]
  9. National Center for Advancing Translational Sciences, Clinical and Translational Science Award [UL1TR000064]

Abstract

Generalizability, external validity, and reproducibility are high priorities for artificial intelligence applications in healthcare. Traditional approaches to addressing these elements involve sharing patient data between institutions or practice settings, which can compromise data privacy (individuals' right to prevent the sharing and disclosure of information about themselves) and data security (simultaneously preserving confidentiality, accuracy, fidelity, and availability of data). This article describes insights from real-world implementation of federated learning techniques that offer opportunities to maintain both data privacy and availability via collaborative machine learning that shares knowledge, not data. Local models are trained separately on local data. As they train, they send local model updates (e.g., coefficients or gradients) for consolidation into a global model. In some use cases, global models outperform local models on new, previously unseen local datasets, suggesting that collaborative learning from a greater number of examples, including a greater number of rare cases, may improve predictive performance. Even when sharing model updates rather than data, privacy leakage can occur when adversaries perform property or membership inference attacks, which can be used to ascertain information about the training set. Emerging techniques mitigate risk from adversarial attacks, allowing investigators to maintain both data privacy and availability in collaborative healthcare research. When data heterogeneity between participating centers is high, personalized algorithms may offer greater generalizability by improving performance on data from centers with proportionately smaller training sample sizes. Properly applied, federated learning has the potential to optimize the reproducibility and performance of collaborative learning while preserving data security and privacy.
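The training loop the abstract describes (local training, sharing of model updates rather than data, consolidation into a global model, and clipping/noising of updates as one mitigation against inference attacks) can be illustrated with a minimal sketch. The sketch below uses plain NumPy and synthetic data from three hypothetical centers; the logistic model, the federated-averaging rule, and the Gaussian-noise step are illustrative assumptions, not the implementations evaluated in the article.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def local_update(global_w, X, y, lr=0.1, epochs=5):
    """Train a local logistic model and return only the weight delta."""
    w = global_w.copy()
    for _ in range(epochs):
        grad = X.T @ (sigmoid(X @ w) - y) / len(y)  # logistic-loss gradient
        w -= lr * grad
    return w - global_w  # the "model update": knowledge leaves, data stays

def privatize(update, clip=1.0, noise_std=0.01):
    """Clip and noise an update (a differential-privacy-style mitigation
    against property/membership inference on shared updates)."""
    scale = min(1.0, clip / max(np.linalg.norm(update), 1e-12))
    return update * scale + rng.normal(0.0, noise_std, size=update.shape)

# Synthetic data for three hypothetical centers with unequal sample
# sizes and shifted feature distributions (data heterogeneity).
true_w = np.array([1.0, -2.0, 0.5, 0.0, 1.5])
centers = []
for n, shift in [(800, 0.0), (300, 0.5), (100, -0.5)]:
    X = rng.normal(shift, 1.0, size=(n, len(true_w)))
    y = (rng.uniform(size=n) < sigmoid(X @ true_w)).astype(float)
    centers.append((X, y))

# Federated averaging: each round, centers train locally and the server
# combines their privatized updates, weighted by local sample size.
global_w = np.zeros(len(true_w))
sizes = np.array([len(y) for _, y in centers], dtype=float)
for _ in range(20):  # communication rounds
    updates = [privatize(local_update(global_w, X, y)) for X, y in centers]
    global_w += sum((n / sizes.sum()) * u for n, u in zip(sizes, updates))

print("global model coefficients:", np.round(global_w, 3))
```

Weighting each center's update by its sample size recovers the standard federated-averaging rule. The personalization mentioned for highly heterogeneous centers would go one step further, for example by fine-tuning the consolidated global weights locally at each center rather than deploying identical global weights everywhere.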

