☆ 4.7 Article

Detecting model misconducts in decentralized healthcare federated learning

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS (2022)

期刊

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS

卷 158, 期 -, 页码 -

出版社

ELSEVIER IRELAND LTD

DOI: 10.1016/j.ijmedinf.2021.104658

关键词

Model Misconducts; Federated Learning; Predictive Modeling; Electronic Health Record; Blockchain Distributed Ledger Technology

类别

Computer Science, Information Systems Health Care Sciences & Services Medical Informatics

资金

U.S. National Institutes of Health [R00HG009680, R01HL136835, R01GM118609, R01HG011066, U24LM013755]
Graduate Division San Diego Matching Fellowship
San Diego Biomedical Informatics Education & Research (SABER) NIH National Library of Medicine (NLM) [T15LM011271]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study aims to propose an algorithm-agnostic approach to detect model misconduct in cross-institutional collaborations and apply it to federated machine learning on genomic/healthcare data. The results show that the proposed method has a high recall rate with low computational cost, effectively identifying misconduct.

Background: To accelerate healthcare/genomic medicine research and facilitate quality improvement, researchers have started cross-institutional collaborations to use artificial intelligence on clinical/genomic data. However, there are real-world risks of incorrect models being submitted to the learning process, due to either unforeseen accidents or malicious intent. This may reduce the incentives for institutions to participate in the federated modeling consortium. Existing methods to deal with this model misconduct issue mainly focus on modifying the learning methods, and therefore are more specifically tied with the algorithm. Basic Procedures: In this paper, we aim at solving the problem in an algorithm-agnostic way by (1) designing a simulator to generate various types of model misconduct, (2) developing a framework to detect the model misconducts, and (3) providing a generalizable approach to identify model misconducts for federated learning. We considered the following three categories: Plagiarism, Fabrication, and Falsification, and then developed a detection framework with three components: Auditing, Coefficient, and Performance detectors, with greedy parameter tuning. Main Findings: We generated 10 types of misconducts from models learned on three datasets to evaluate our detection method. Our experiments showed high recall with low added computational cost. Our proposed detection method can best identify the misconduct on specific sites from any learning iteration, whereas it is more challenging to precisely detect misconducts for a specific site and at a specific iteration. Principal Conclusions: We anticipate our study can support the enhancement of the integrity and reliability of federated machine learning on genomic/healthcare data.

Detecting model misconducts in decentralized healthcare federated learning

期刊

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS

出版社

ELSEVIER IRELAND LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Detecting model misconducts in decentralized healthcare federated learning

期刊

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS

出版社

ELSEVIER IRELAND LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文