"O2TD: (Near)-Optimal Off-Policy TD Learning."

Bo Liu et al. (2017)

Details and statistics

DOI:

access: open

type: Informal or Other Publication

metadata version: 2019-05-10

a service of  Schloss Dagstuhl - Leibniz Center for Informatics