Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Sample-efficient Nonstationary-policy Evaluation for Contextual Bandits

Miroslav Dudík, Dumitru Erhan, John Langford, and Lihong Li

Details

Publication typeInproceedings
Published inProceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI-12)
> Publications > Sample-efficient Nonstationary-policy Evaluation for Contextual Bandits