"Contextual Bandit Learning with Predictable Rewards"

Alekh Agarwal et al. (2012)
a service of Schloss Dagstuhl - Leibniz Center for Informatics