"Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient ..."

Shixiang Gu et al. (2017)
a service of Schloss Dagstuhl - Leibniz Center for Informatics