"Imitation Learning via Off-Policy Distribution Matching."

Ilya Kostrikov, Ofir Nachum, Jonathan Tompson (2020)