4.7 Article

The need to separate the wheat from the chaff in medical informatics Introducing a comprehensive checklist for the (self)-assessment of medical AI studies

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Mathematical & Computational Biology

The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation

Davide Chicco et al.

Summary: In this manuscript, the authors reaffirm the robustness of the Matthews correlation coefficient (MCC) as a metric for summarizing classifier performance. They compare MCC to other metrics such as balanced accuracy, bookmaker informedness, and markedness, and demonstrate situations where MCC provides a more informative response. The authors also identify three exceptions where bookmaker informedness may be more appropriate, but overall suggest MCC as a standard measure for evaluating binary classifications in various fields.

BIODATA MINING (2021)

Review Ophthalmology

Reporting guidelines for artificial intelligence in healthcare research

Hussein Ibrahim et al.

Summary: Reporting guidelines are structured tools that specify the minimum information required by researchers when reporting a study. Using AI reporting guidelines specific to studies involving AI interventions can enhance the quality of research by focusing on study design, while supplementing them with AI-specific guidance from available guidelines.

CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY (2021)

Article Health Care Sciences & Services

External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb

Kym I. E. Snell et al.

Summary: Rules-of-thumb for sample size in external validation of clinical prediction models may not be precise, with factors like LP distribution affecting precision of performance estimates. A tailored simulation-based approach can offer more flexibility and reliability in determining sample size requirements for validation.

JOURNAL OF CLINICAL EPIDEMIOLOGY (2021)

Review Health Care Sciences & Services

Diagnostic accuracy of deep learning in medical imaging: a systematic review and meta-analysis

Ravi Aggarwal et al.

Summary: This study evaluated the diagnostic accuracy of deep learning algorithms in identifying pathology in medical imaging, finding good performance in ophthalmology, respiratory disease and breast imaging, but noting high heterogeneity between studies and extensive variation in methodology, outcome measures and terminology. The development of artificial intelligence-specific guidelines, particularly STARD, is recommended to address these issues.

NPJ DIGITAL MEDICINE (2021)

Article Computer Science, Artificial Intelligence

Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans

Michael Roberts et al.

Summary: Many machine learning-based approaches have been developed for the prognosis and diagnosis of COVID-19 from medical images. However, a systematic review found that current studies have methodological flaws, preventing their potential clinical utility. Recommendations are provided to address these issues for higher-quality model development.

NATURE MACHINE INTELLIGENCE (2021)

Article Health Care Sciences & Services

Clinician checklist for assessing suitability of machine learning applications in healthcare

Ian Scott et al.

Summary: The article discusses the application of machine learning algorithms in healthcare and presents a checklist of 10 questions to assist clinicians in evaluating the readiness of algorithms for routine care. The checklist aims to address key issues such as validity, utility, feasibility, safety, and ethical use of algorithms.

BMJ HEALTH & CARE INFORMATICS (2021)

Review Medical Laboratory Technology

Systematic review and meta-analysis of within-subject and between-subject biological variation estimates of 20 haematological parameters

Abdurrahman Coskun et al.

CLINICAL CHEMISTRY AND LABORATORY MEDICINE (2020)

Review Computer Science, Artificial Intelligence

Automated machine learning: Review of the state-of-the-art and opportunities for healthcare

Jonathan Waring et al.

ARTIFICIAL INTELLIGENCE IN MEDICINE (2020)

Editorial Material Medicine, General & Internal

Challenges to the Reproducibility of Machine Learning Models in Health Care

Andrew L. Beam et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2020)

Article Computer Science, Information Systems

MINIMAR (MINimum Information for Medical Al Reporting): Developing reporting standards for artificial intelligence in health care

Tina Hernandez-Boussard et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2020)

Article Medical Informatics

As if sand were stone. New concepts and metrics to probe the ground on which to build trustable AI

Federico Cabitza et al.

BMC MEDICAL INFORMATICS AND DECISION MAKING (2020)

Article Health Care Sciences & Services

Is Artificial Intelligence Better Than Human Clinicians in Predicting Patient Outcomes?

Joon Lee

JOURNAL OF MEDICAL INTERNET RESEARCH (2020)

Editorial Material Biochemistry & Molecular Biology

Minimum information about clinical artificial intelligence modeling: the MI-CLAIM checklist

Beau Norgeot et al.

NATURE MEDICINE (2020)

Editorial Material Medicine, General & Internal

Reporting of artificial intelligence prediction models

Gary S. Collins et al.

LANCET (2019)

Review Medicine, General & Internal

Machine Learning in Medicine

Alvin Rajkomar et al.

NEW ENGLAND JOURNAL OF MEDICINE (2019)

Review Computer Science, Artificial Intelligence

Causability and explainability of artificial intelligence in medicine

Andreas Holzinger et al.

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY (2019)

Review Health Care Sciences & Services

A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models

Evangelia Christodoulou et al.

JOURNAL OF CLINICAL EPIDEMIOLOGY (2019)

Review Radiology, Nuclear Medicine & Medical Imaging

Sample-Size Determination Methodologies for Machine Learning in Medical Imaging Research: A Systematic Review

Indranil Balki et al.

CANADIAN ASSOCIATION OF RADIOLOGISTS JOURNAL-JOURNAL DE L ASSOCIATION CANADIENNE DES RADIOLOGISTES (2019)

Article Medicine, General & Internal

Three myths about risk thresholds for prediction models

Laure Wynants et al.

BMC MEDICINE (2019)

Article Medicine, General & Internal

Key challenges for delivering clinical impact with artificial intelligence

Christopher J. Kelly et al.

BMC MEDICINE (2019)

Letter Medicine, General & Internal

Extension of the CONSORT and SPIRIT statements

Xiaoxuan Liu et al.

LANCET (2019)

Article Multidisciplinary Sciences

Machine learning algorithm validation with a limited sample size

Andrius Vabalas et al.

PLOS ONE (2019)

Article Medicine, General & Internal

PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies

Robert F. Wolff et al.

ANNALS OF INTERNAL MEDICINE (2019)

Article Medicine, General & Internal

Calibration: the Achilles heel of predictive analytics

Ben van Calster et al.

BMC MEDICINE (2019)

Review Medical Informatics

Artificial Intelligence Versus Clinicians in Disease Diagnosis: Systematic Review

Jiayi Shen et al.

JMIR MEDICAL INFORMATICS (2019)

Article Computer Science, Information Systems

Predictive analytics in health care: how can we know it works?

Ben Van Calster et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2019)

Article Computer Science, Information Systems

Does health informatics have a replication crisis?

Enrico Coiera et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2018)

Review Multidisciplinary Sciences

Opportunities and obstacles for deep learning in biology and medicine

Travers Ching et al.

JOURNAL OF THE ROYAL SOCIETY INTERFACE (2018)

Article Multidisciplinary Sciences

Artificial intelligence faces reproducibility crisis

Matthew Hutson

SCIENCE (2018)

Review Urology & Nephrology

Reporting and Interpreting Decision Curve Analysis: A Guide for Investigators

Ben Van Calster et al.

EUROPEAN UROLOGY (2018)

Article Medicine, General & Internal

Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data

Milena A. Gianfrancesco et al.

JAMA INTERNAL MEDICINE (2018)

Editorial Material Medicine, General & Internal

Ensuring Fairness in Machine Learning to Advance Health Equity

Alvin Rajkomar et al.

ANNALS OF INTERNAL MEDICINE (2018)

Editorial Material Medicine, General & Internal

Machine learning in medicine: Addressing ethical challenges

Effy Vayena et al.

PLOS MEDICINE (2018)

Article Radiology, Nuclear Medicine & Medical Imaging

DeepLesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning

Ke Yan et al.

JOURNAL OF MEDICAL IMAGING (2018)

Article Health Care Sciences & Services

Artificial intelligence powers digital medicine

Alexander L. Fogel et al.

NPJ DIGITAL MEDICINE (2018)

Editorial Material Medicine, General & Internal

Unintended Consequences of Machine Learning in Medicine

Federico Cabitza et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2017)

Article Medical Informatics

Imbalanced target prediction with pattern discovery on clinical data repositories

Tak-Ming Chan et al.

BMC MEDICAL INFORMATICS AND DECISION MAKING (2017)

Article Urology & Nephrology

Medical big data: promise and challenges

Choong Ho Lee et al.

KIDNEY RESEARCH AND CLINICAL PRACTICE (2017)

Editorial Material Health Care Sciences & Services

Prediction models need appropriate internal, internal-external, and external validation

Ewout W. Steyerberg et al.

JOURNAL OF CLINICAL EPIDEMIOLOGY (2016)

Article Multidisciplinary Sciences

MIMIC-III, a freely accessible critical care database

Alistair E. W. Johnson et al.

SCIENTIFIC DATA (2016)

Article Cardiac & Cardiovascular Systems

Machine Learning in Medicine

Rahul C. Deo

CIRCULATION (2015)

Article Health Care Sciences & Services

The precision recall curve overcame the optimism of the receiver operating characteristic curve in rare diseases

Brice Ozenne et al.

JOURNAL OF CLINICAL EPIDEMIOLOGY (2015)

Review Anesthesiology

A classification of chronic pain for ICD-11

Rolf-Detlef Treede et al.

Article Computer Science, Artificial Intelligence

Missing data in medical databases: Impute, delete or classify?

Federico Cismondi et al.

ARTIFICIAL INTELLIGENCE IN MEDICINE (2013)

Article Medicine, General & Internal

Comparison of imputation methods for missing laboratory data in medicine

Akbar K. Waljee et al.

BMJ OPEN (2013)

Article Mathematical & Computational Biology

Toward open sharing of task-based fMRI data: the OpenfMRI project

Russell A. Poldrack et al.

FRONTIERS IN NEUROINFORMATICS (2013)

Article Computer Science, Information Systems

Leakage in Data Mining: Formulation, Detection, and Avoidance

Shachar Kaufman et al.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2012)

Article Computer Science, Interdisciplinary Applications

Measuring the prediction error. A comparison of cross-validation, bootstrap and covariance penalty methods

Simone Borra et al.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2010)

Article Cardiac & Cardiovascular Systems

Randomized Clinical Trials and Observational Studies Guidelines for Assessing Respective Strengths and Limitations

Edward L. Hannan

JACC-CARDIOVASCULAR INTERVENTIONS (2008)

Article Medicine, General & Internal

Why most published research findings are false

JPA Ioannidis

PLOS MEDICINE (2005)

Article Public, Environmental & Occupational Health

The cross-sectional cohort study - An underutilized design

JI Hudson et al.

EPIDEMIOLOGY (2005)

Article Health Care Sciences & Services

External validation is necessary in, prediction research: A clinical example

SE Bleeker et al.

JOURNAL OF CLINICAL EPIDEMIOLOGY (2003)

Article Medical Laboratory Technology

LOINC, a universal standard for identifying laboratory observations: A 5-year update

CJ McDonald et al.

CLINICAL CHEMISTRY (2003)

Article Meteorology & Atmospheric Sciences

Summarizing multiple aspects of model performance in a single diagram.

KE Taylor

JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES (2001)