 | 2010 |
| 4 |  | Hamid Reza Maei,
Csaba Szepesvári,
Shalabh Bhatnagar,
Richard S. Sutton:
Toward Off-Policy Learning Control with Function Approximation.
ICML 2010: 719-726 |
| 2009 |
| 3 |  | Richard S. Sutton,
Hamid Reza Maei,
Doina Precup,
Shalabh Bhatnagar,
David Silver,
Csaba Szepesvári,
Eric Wiewiora:
Fast gradient-descent methods for temporal-difference learning with linear function approximation.
ICML 2009: 125 |
| 2 |  | Hamid Reza Maei,
Csaba Szepesvári,
Shalabh Bhatnagar,
Doina Precup,
David Silver,
Richard S. Sutton:
Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation.
NIPS 2009: 1204-1212 |
| 2008 |
| 1 |  | Richard S. Sutton,
Csaba Szepesvári,
Hamid Reza Maei:
A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation.
NIPS 2008: 1609-1616 |