Article

Enhancing the reliability and accuracy of AI-enabled diagnosis via complementarity-driven deferral to clinicians

Journal

NATURE MEDICINE
Volume -, Issue -, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41591-023-02437-x

A collaboration system, CoDoC, integrates decisions from human experts and AI to optimize screening and triaging. CoDoC outperforms clinician-only and AI-only baselines in breast cancer screening and TB triaging workflows, reducing false positives and clinician workload. The sustained performance gains of CoDoC across various settings and AI systems highlight its potential for future clinical applications.
A collaboration system helps to integrate decisions between human experts and AI to optimize screening and triaging and to reduce clinicians' workload. Predictive artificial intelligence (AI) systems based on deep learning have been shown to achieve expert-level identification of diseases in multiple medical imaging settings, but they can make errors in cases accurately diagnosed by clinicians, and vice versa. We developed Complementarity-Driven Deferral to Clinical Workflow (CoDoC), a system that can learn to decide between the opinion of a predictive AI model and a clinical workflow. CoDoC enhances accuracy relative to clinician-only or AI-only baselines in clinical workflows that screen for breast cancer or tuberculosis (TB). For breast cancer screening, compared with double reading with arbitration in a screening program in the UK, CoDoC reduced false positives by 25% at the same false-negative rate, while achieving a 66% reduction in clinician workload. For TB triaging, compared with standalone AI and clinical workflows, CoDoC achieved a 5–15% reduction in false positives at the same false-negative rate for three of five commercially available predictive AI systems. To facilitate the deployment of CoDoC in new clinical settings, we present results showing that CoDoC's performance gains are sustained across several axes of variation (imaging modality, clinical setting and predictive AI system) and discuss the limitations of our evaluation and where further validation would be needed. We provide an open-source implementation to encourage further research and application.
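To make the idea of learning when to defer concrete, here is a minimal Python sketch. It is not the authors' open-source implementation and makes strong simplifying assumptions: the AI model outputs a score in [0, 1], the clinician's opinion is a binary call, and deferral is decided per score bin by comparing raw accuracy on a small tuning set. The function names (`fit_deferral_bins`, `combined_decision`), the bin count, the 0.5 threshold and the toy data are all hypothetical.

```python
from typing import List, Sequence


def fit_deferral_bins(
    scores: Sequence[float],
    clinician_preds: Sequence[int],
    labels: Sequence[int],
    n_bins: int = 4,
    threshold: float = 0.5,
) -> List[bool]:
    """Return one flag per AI-score bin; True means "defer to the clinician".

    Assumption: a bin is deferred when the clinician was correct more often
    than the thresholded AI score on the tuning cases falling in that bin.
    """
    ai_correct = [0] * n_bins
    clin_correct = [0] * n_bins
    for s, c, y in zip(scores, clinician_preds, labels):
        b = min(int(s * n_bins), n_bins - 1)  # index of the score bin
        ai_correct[b] += int((s >= threshold) == bool(y))
        clin_correct[b] += int(bool(c) == bool(y))
    return [clin_correct[b] > ai_correct[b] for b in range(n_bins)]


def combined_decision(
    score: float,
    clinician_pred: int,
    defer_bins: Sequence[bool],
    threshold: float = 0.5,
) -> bool:
    """Use the clinician's call in deferred bins, the AI's call elsewhere."""
    b = min(int(score * len(defer_bins)), len(defer_bins) - 1)
    if defer_bins[b]:
        return bool(clinician_pred)
    return score >= threshold


# Hypothetical tuning set: AI scores, clinician calls, ground-truth labels.
scores = [0.05, 0.10, 0.40, 0.45, 0.55, 0.60, 0.90, 0.95]
clinician = [0, 0, 1, 0, 0, 1, 1, 0]
labels = [0, 0, 1, 0, 0, 1, 1, 1]

defer = fit_deferral_bins(scores, clinician, labels)
print(defer)
```

In this toy data the AI is unreliable only for mid-range scores, so only the two middle bins are deferred. Cases in the outer bins never reach the clinician, which illustrates, in a very simplified way, how a deferral scheme of this kind can reduce clinician workload.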
