4.5 Article

Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

期刊

STATISTICAL SCIENCE
卷 29, 期 4, 页码 640-661

出版社

INST MATHEMATICAL STATISTICS
DOI: 10.1214/13-STS450

关键词

Advantage learning; bias-variance trade-off; model misspecification; personalized medicine; potential outcomes; sequential decision-making

资金

  1. NIH [R37 AI031789, R01 CA051962, R01 CA085848, P01 CA142538, T32 HL079896]

向作者/读者索取更多资源

In clinical practice, physicians make a series of treatment decisions over the course of a patient's disease based on his/her baseline and evolving characteristics. A dynamic treatment regime is a set of sequential decision rules that operationalizes this process. Each rule corresponds to a decision point and dictates the next treatment action based on the accrued information. Using existing data, a key goal is estimating the optimal regime, that, if followed by the patient population, would yield the most favorable outcome on average. Q- and A-learning are two main approaches for this purpose. We provide a detailed account of these methods, study their performance, and illustrate them using data from a depression study.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据