default search action
BibTeX record journals/corr/abs-2302-01275
@article{DBLP:journals/corr/abs-2302-01275, author = {Ted Moskovitz and Brendan O'Donoghue and Vivek Veeriah and Sebastian Flennerhag and Satinder Singh and Tom Zahavy}, title = {ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs}, journal = {CoRR}, volume = {abs/2302.01275}, year = {2023}, url = {https://doi.org/10.48550/arXiv.2302.01275}, doi = {10.48550/ARXIV.2302.01275}, eprinttype = {arXiv}, eprint = {2302.01275}, timestamp = {Thu, 09 Feb 2023 16:11:17 +0100}, biburl = {https://dblp.org/rec/journals/corr/abs-2302-01275.bib}, bibsource = {dblp computer science bibliography, https://dblp.org} }
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.