期刊
APPLIED MATHEMATICS AND OPTIMIZATION
卷 81, 期 3, 页码 685-710出版社
SPRINGER
DOI: 10.1007/s00245-018-9485-x
关键词
Continuous-time Markov decision processes; Piecewise deterministic Markov decision processes; Exponential utility; Dynamic programming
资金
- Royal Society [IE160503]
We consider a piecewise deterministic Markov decision process, where the expected exponential utility of total (nonnegative) cost is to be minimized. The cost rate, transition rate and post-jump distributions are under control. The state space is Borel, and the transition and cost rates are locally integrable along the drift. Under natural conditions, we establish the optimality equation, justify the value iteration algorithm, and show the existence of a deterministic stationary optimal policy. Applied to special cases, the obtained results already significantly improve some existing results in the literature on finite horizon and infinite horizon discounted risk-sensitive continuous-time Markov decision processes.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据