On the Optimal Amount of Experimentation in Sequential Decision Problems
Abstract
We provide a tight bound on the amount of experimentation under the optimal strategy in sequential decision problems. We show the applicability of the result by providing a bound on the cut-off in a one-arm bandit problem.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2009
- DOI:
- 10.48550/arXiv.0907.2002
- arXiv:
- arXiv:0907.2002
- Bibcode:
- 2009arXiv0907.2002R
- Keywords:
-
- Mathematics - Probability;
- Mathematics - Statistics;
- 62C10;
- 60G99;
- 93E35