期刊
BIOMETRICS
卷 79, 期 3, 页码 2260-2271出版社
WILEY
DOI: 10.1111/biom.13754
关键词
constrained optimization; dynamic treatment regime; observational studies; tree-based reinforcement learning; viable decision rules
A dynamic treatment regime is a series of decision rules that guide treatment based on an individual's static and time-varying status. However, there are often restrictions on treatment sequences when analyzing observational data. To address this challenge, a restricted tree-based reinforcement learning method is proposed, which searches for an interpretable DTR under user-specified restrictions to achieve the best treatment outcomes.
A dynamic treatment regime (DTR) is a sequence of decision rules that provide guidance on how to treat individuals based on their static and time-varying status. Existing observational data are often used to generate hypotheses about effective DTRs. A common challenge with observational data, however, is the need for analysts to consider restrictions on the treatment sequences. Such restrictions may be necessary for settings where (1) one or more treatment sequences that were offered to individuals when the data were collected are no longer considered viable in practice, (2) specific treatment sequences are no longer available, or (3) the scientific focus of the analysis concerns a specific type of treatment sequences (eg, stepped-up treatments). To address this challenge, we propose a restricted tree-based reinforcement learning (RT-RL) method that searches for an interpretable DTR with the maximum expected outcome, given a (set of) user-specified restriction(s), which specifies treatment options (at each stage) that ought not to be considered as part of the estimated tree-based DTR. In simulations, we evaluate the performance of RT-RL versus the standard approach of ignoring the partial data for individuals not following the (set of) restriction(s). The method is illustrated using an observational data set to estimate a two-stage stepped-up DTR for guiding the level of care placement for adolescents with substance use disorder.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据