4.5 Article

Estimating tree-based dynamic treatment regimes using observational data with restricted treatment sequences

期刊

BIOMETRICS
卷 79, 期 3, 页码 2260-2271

出版社

WILEY
DOI: 10.1111/biom.13754

关键词

constrained optimization; dynamic treatment regime; observational studies; tree-based reinforcement learning; viable decision rules

向作者/读者索取更多资源

A dynamic treatment regime is a series of decision rules that guide treatment based on an individual's static and time-varying status. However, there are often restrictions on treatment sequences when analyzing observational data. To address this challenge, a restricted tree-based reinforcement learning method is proposed, which searches for an interpretable DTR under user-specified restrictions to achieve the best treatment outcomes.
A dynamic treatment regime (DTR) is a sequence of decision rules that provide guidance on how to treat individuals based on their static and time-varying status. Existing observational data are often used to generate hypotheses about effective DTRs. A common challenge with observational data, however, is the need for analysts to consider restrictions on the treatment sequences. Such restrictions may be necessary for settings where (1) one or more treatment sequences that were offered to individuals when the data were collected are no longer considered viable in practice, (2) specific treatment sequences are no longer available, or (3) the scientific focus of the analysis concerns a specific type of treatment sequences (eg, stepped-up treatments). To address this challenge, we propose a restricted tree-based reinforcement learning (RT-RL) method that searches for an interpretable DTR with the maximum expected outcome, given a (set of) user-specified restriction(s), which specifies treatment options (at each stage) that ought not to be considered as part of the estimated tree-based DTR. In simulations, we evaluate the performance of RT-RL versus the standard approach of ignoring the partial data for individuals not following the (set of) restriction(s). The method is illustrated using an observational data set to estimate a two-stage stepped-up DTR for guiding the level of care placement for adolescents with substance use disorder.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据