 | 2012 |
| 9 |  | Tetsuro Morimura,
Masashi Sugiyama,
Hisashi Kashima,
Hirotaka Hachiya,
Toshiyuki Tanaka:
Parametric Return Density Estimation for Reinforcement Learning
CoRR abs/1203.3497: (2012) |
| 2010 |
| 8 |  | Tetsuro Morimura,
Masashi Sugiyama,
Hisashi Kashima,
Hirotaka Hachiya,
Toshiyuki Tanaka:
Nonparametric Return Distribution Approximation for Reinforcement Learning.
ICML 2010: 799-806 |
| 7 |  | Tetsuro Morimura,
Masashi Sugiyama,
Hisashi Kashima,
Hirotaka Hachiya,
Toshiyuki Tanaka:
Parametric Return Density Estimation for Reinforcement Learning.
UAI 2010: 368-375 |
| 6 |  | Masashi Sugiyama,
Hirotaka Hachiya,
Hisashi Kashima,
Tetsuro Morimura:
Least Absolute Policy Iteration--A Robust Approach to Value Function Approximation.
IEICE Transactions 93-D(9): 2555-2565 (2010) |
| 5 |  | Takamitsu Matsubara,
Tetsuro Morimura,
Jun Morimoto:
Adaptive Step-size Policy Gradients with Average Reward Metric.
Journal of Machine Learning Research - Proceedings Track 13: 285-298 (2010) |
| 4 |  | Tetsuro Morimura,
Eiji Uchibe,
Junichiro Yoshimoto,
Jan Peters,
Kenji Doya:
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
Neural Computation 22(2): 342-376 (2010) |
| 2009 |
| 3 |  | Masashi Sugiyama,
Hirotaka Hachiya,
Hisashi Kashima,
Tetsuro Morimura:
Least absolute policy iteration for robust value function approximation.
ICRA 2009: 2904-2909 |
| 2 |  | Tetsuro Morimura,
Eiji Uchibe,
Junichiro Yoshimoto,
Kenji Doya:
A Generalized Natural Actor-Critic Algorithm.
NIPS 2009: 1312-1320 |
| 2008 |
| 1 |  | Tetsuro Morimura,
Eiji Uchibe,
Junichiro Yoshimoto,
Kenji Doya:
A New Natural Policy Gradient by Stationary Distribution Metric.
ECML/PKDD (2) 2008: 82-97 |