4.4 Article

The Knowledge-Gradient Policy for Correlated Normal Beliefs

Journal

INFORMS JOURNAL ON COMPUTING
Volume 21, Issue 4, Pages 599-613

Publisher

INFORMS
DOI: 10.1287/ijoc.1080.0314

Keywords

simulation: design of experiments; decision analysis: sequential; statistics; Bayesian

Ask authors/readers for more resources

We consider a Bayesian ranking and selection problem with independent normal rewards and a correlated multivariate normal belief on the mean values of these rewards. Because this formulation of the ranking and selection problem models dependence between alternatives' mean values, algorithms may use this dependence to perform efficiently even when the number of alternatives is very large. We propose a fully sequential sampling policy called the knowledge-gradient policy, which is provably optimal in some special cases and has bounded suboptimality in all others. We then demonstrate how this policy may be applied to efficiently maximize a continuous function on a continuous domain while constrained to a fixed number of noisy measurements.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available