"Reinforcement learning from human reward: Discounting in episodic tasks."

W. Bradley Knox, Peter Stone (2012)

Details and statistics

DOI: 10.1109/ROMAN.2012.6343862

access: closed

type: Conference or Workshop Paper

metadata version: 2017-05-26

a service of  Schloss Dagstuhl - Leibniz Center for Informatics