On the Optimal Amount of Experimentation in Sequential Decision Problems

doi:10.48550/arXiv.0907.2002

We provide a tight bound on the amount of experimentation under the optimal strategy in sequential decision problems. We show the applicability of the result by providing a bound on the cut-off in a one-arm bandit problem.

Publication: arXiv e-prints

Pub Date: July 2009

DOI: 10.48550/arXiv.0907.2002

arXiv: arXiv:0907.2002

Bibcode: 2009arXiv0907.2002R

Keywords: Mathematics - Probability; Mathematics - Statistics; 62C10; 60G99; 93E35
