"Safe Policy Improvement by Minimizing Robust Baseline Regret."

Mohammad Ghavamzadeh, Marek Petrik, Yinlam Chow (2016)

Details and statistics

DOI:

access: open

type: Conference or Workshop Paper

metadata version: 2021-01-21

a service of  Schloss Dagstuhl - Leibniz Center for Informatics