"On the Global Convergence Rates of Softmax Policy Gradient Methods."

Jincheng Mei et al. (2020)