Contextual bandit learning with predictable rewards

Alekh Agarwal, Miroslav Dudik, Satyen Kale, John Langford, and Robert E. Schapire

Details

Publication typeInproceedings
Published inProceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS-12)
> Publications > Contextual bandit learning with predictable rewards