"Generalized Optimistic Q-Learning with Provable Efficiency."

Grigory Neustroev, Mathijs Michiel de Weerdt (2020)

Details and statistics

DOI: 10.5555/3398761.3398868

access: open

type: Conference or Workshop Paper

metadata version: 2022-07-26
