☆ 4.5 Article

Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

STATISTICAL SCIENCE (2014)

期刊

STATISTICAL SCIENCE

卷 29, 期 4, 页码 640-661

出版社

INST MATHEMATICAL STATISTICS

DOI: 10.1214/13-STS450

关键词

Advantage learning; bias-variance trade-off; model misspecification; personalized medicine; potential outcomes; sequential decision-making

类别

Statistics & Probability

资金

NIH [R37 AI031789, R01 CA051962, R01 CA085848, P01 CA142538, T32 HL079896]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

In clinical practice, physicians make a series of treatment decisions over the course of a patient's disease based on his/her baseline and evolving characteristics. A dynamic treatment regime is a set of sequential decision rules that operationalizes this process. Each rule corresponds to a decision point and dictates the next treatment action based on the accrued information. Using existing data, a key goal is estimating the optimal regime, that, if followed by the patient population, would yield the most favorable outcome on average. Q- and A-learning are two main approaches for this purpose. We provide a detailed account of these methods, study their performance, and illustrate them using data from a depression study.

Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

期刊

STATISTICAL SCIENCE

出版社

INST MATHEMATICAL STATISTICS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

期刊

STATISTICAL SCIENCE

出版社

INST MATHEMATICAL STATISTICS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文