Some results for the two armed bandit problem
Author:
P.W. Jones
DOI:
10.1080/02331887608801311
Publication Frequency:
6 issues per year
Subjects:
Mathematical Statistics;
Statistical Theory & Methods;
Statistics;
Statistics for the Biological Sciences;
Stochastic Models & Processes;
Abstract
The paper is concerned with the optimal dynamic programming approach to the solution of the two-armed bandit problem with beta priors for the two unknown probabilities. Some properties of the objective function are obtained and a conjecture concerning the design is made. The suboptimal one-step-ahead design is considered and is shown to have the same properties as the optimal design only in certain special cases.
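
As an illustration only, the two designs named in the abstract can be sketched in a few lines of Python. The abstract does not state the objective function, so this sketch assumes it is the expected number of successes over a fixed horizon of n trials, and it assumes the one-step-ahead design always plays the arm with the larger current posterior mean. The function names optimal_value and one_step_ahead_value are hypothetical and do not come from the paper.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def optimal_value(a1, b1, a2, b2, n):
    """Expected successes in n remaining trials under the optimal design.

    Arm i has a Beta(ai, bi) prior on its unknown success probability.
    ASSUMPTION: the objective is the expected total number of successes.
    """
    if n == 0:
        return 0.0
    p1 = a1 / (a1 + b1)  # posterior mean of arm 1
    p2 = a2 / (a2 + b2)  # posterior mean of arm 2
    # Value of playing each arm now, then continuing optimally.
    q1 = (p1 * (1 + optimal_value(a1 + 1, b1, a2, b2, n - 1))
          + (1 - p1) * optimal_value(a1, b1 + 1, a2, b2, n - 1))
    q2 = (p2 * (1 + optimal_value(a1, b1, a2 + 1, b2, n - 1))
          + (1 - p2) * optimal_value(a1, b1, a2, b2 + 1, n - 1))
    return max(q1, q2)


@lru_cache(maxsize=None)
def one_step_ahead_value(a1, b1, a2, b2, n):
    """Expected successes when every trial uses the one-step-ahead rule.

    ASSUMPTION: the rule plays the arm with the larger posterior mean,
    breaking ties in favour of arm 1.
    """
    if n == 0:
        return 0.0
    p1 = a1 / (a1 + b1)
    p2 = a2 / (a2 + b2)
    if p1 >= p2:
        return (p1 * (1 + one_step_ahead_value(a1 + 1, b1, a2, b2, n - 1))
                + (1 - p1) * one_step_ahead_value(a1, b1 + 1, a2, b2, n - 1))
    return (p2 * (1 + one_step_ahead_value(a1, b1, a2 + 1, b2, n - 1))
            + (1 - p2) * one_step_ahead_value(a1, b1, a2, b2 + 1, n - 1))


if __name__ == "__main__":
    # Compare the two designs for uniform priors on both arms.
    for n in (1, 2, 5, 10):
        print(n, optimal_value(1, 1, 1, 1, n), one_step_ahead_value(1, 1, 1, 1, n))
```

Under these assumptions the optimal value can never fall below the one-step-ahead value, and comparing the two for different priors and horizons gives a rough sense of when the simpler design loses little.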