4.8 Article

Use and misuse of the receiver operating characteristic curve in risk prediction

Journal

CIRCULATION
Volume 115, Issue 7, Pages 928-935

Publisher

LIPPINCOTT WILLIAMS & WILKINS
DOI: 10.1161/CIRCULATIONAHA.106.672402

Keywords

cardiovascular diseases; epidemiology; follow-up studies; prevention; risk factors; statistics; risk

Funding

  1. NCI NIH HHS [CA-47988] Funding Source: Medline
  2. NHLBI NIH HHS [HL-43851] Funding Source: Medline

Ask authors/readers for more resources

The c statistic, or area under the receiver operating characteristic ( ROC) curve, achieved popularity in diagnostic testing, in which the test characteristics of sensitivity and specificity are relevant to discriminating diseased versus nondiseased patients. The c statistic, however, may not be optimal in assessing models that predict future risk or stratify individuals into risk categories. In this setting, calibration is as important to the accurate assessment of risk. For example, a biomarker with an odds ratio of 3 may have little effect on the c statistic, yet an increased level could shift estimated 10- year cardiovascular risk for an individual patient from 8% to 24%, which would lead to different treatment recommendations under current Adult Treatment Panel III guidelines. Accepted risk factors such as lipids, hypertension, and smoking have only marginal impact on the c statistic individually yet lead to more accurate reclassification of large proportions of patients into higher- risk or lower- risk categories. Perfectly calibrated models for complex disease can, in fact, only achieve values for the c statistic well below the theoretical maximum of 1. Use of the c statistic for model selection could thus naively eliminate established risk factors from cardiovascular risk prediction scores. As novel risk factors are discovered, sole reliance on the c statistic to evaluate their utility as risk predictors thus seems ill-advised.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available