Abstract
We provide a tight bound on the amount of experimentation under the optimal strategy in sequential decision problems. We show the applicability of the result by providing a bound on the cut-off in a one-arm bandit problem.
Original language | English |
---|---|
Pages (from-to) | 381-385 |
Number of pages | 5 |
Journal | Statistics and Probability Letters |
Volume | 80 |
Issue number | 5-6 |
DOIs | |
State | Published - 2010 |