4.7 Article

Adaptive policies for perimeter surveillance problems

期刊

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
卷 283, 期 1, 页码 265-278

出版社

ELSEVIER
DOI: 10.1016/j.ejor.2019.11.004

关键词

Applied probability; Stochastic processes; Uncertainty modelling; OR in defence

资金

  1. EPSRC [EP/L015692/1]

向作者/读者索取更多资源

We consider the problem of sequentially choosing observation regions along a line, with an aim of maximising the detection of events of interest. Such a problem may arise when monitoring the movements of endangered or migratory species, detecting crossings of a border, policing activities at sea, and in many other settings. In each case, the key operational challenge is to learn an allocation of surveillance resources which maximises successful detection of events of interest. We present a combinatorial multi-armed bandit model with Poisson rewards and a novel filtered feedback mechanism - arising from the failure to detect certain intrusions - where reward distributions are dependent on the actions selected. Our solution method is an upper confidence bound approach and we derive upper and lower bounds on its expected performance. We prove that the gap between these bounds is of constant order, and demonstrate empirically that our approach is more reliable in simulated problems than competing algorithms. (C) 2019 The Authors. Published by Elsevier B.V.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据