Related references
Note: Only part of the references are listed.
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
Jean-Yves Audibert et al.
THEORETICAL COMPUTER SCIENCE (2009)
The nonstochastic multiarmed bandit problem
P Auer et al.
SIAM JOURNAL ON COMPUTING (2003)
Finite-time analysis of the multiarmed bandit problem
P Auer et al.
MACHINE LEARNING (2002)
Computer go: An AI oriented survey
B Bouzy et al.
ARTIFICIAL INTELLIGENCE (2001)