dblp.uni-trier.dewww.dagstuhl.dewww.uni-trier.de

Tetsuro Morimura Coauthor index pubzone.org

List of publications from the DBLP Bibliography Server - FAQ
Ask others: ACM DL/Guide - CiteSeerX - CSB - MetaPress - Google - Bing - Yahoo

DBLP keys2012
9Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTetsuro Morimura, Masashi Sugiyama, Hisashi Kashima, Hirotaka Hachiya, Toshiyuki Tanaka: Parametric Return Density Estimation for Reinforcement Learning CoRR abs/1203.3497: (2012)
2010
8Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTetsuro Morimura, Masashi Sugiyama, Hisashi Kashima, Hirotaka Hachiya, Toshiyuki Tanaka: Nonparametric Return Distribution Approximation for Reinforcement Learning. ICML 2010: 799-806
7Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTetsuro Morimura, Masashi Sugiyama, Hisashi Kashima, Hirotaka Hachiya, Toshiyuki Tanaka: Parametric Return Density Estimation for Reinforcement Learning. UAI 2010: 368-375
6Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMasashi Sugiyama, Hirotaka Hachiya, Hisashi Kashima, Tetsuro Morimura: Least Absolute Policy Iteration--A Robust Approach to Value Function Approximation. IEICE Transactions 93-D(9): 2555-2565 (2010)
5Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTakamitsu Matsubara, Tetsuro Morimura, Jun Morimoto: Adaptive Step-size Policy Gradients with Average Reward Metric. Journal of Machine Learning Research - Proceedings Track 13: 285-298 (2010)
4Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Jan Peters, Kenji Doya: Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning. Neural Computation 22(2): 342-376 (2010)
2009
3Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMasashi Sugiyama, Hirotaka Hachiya, Hisashi Kashima, Tetsuro Morimura: Least absolute policy iteration for robust value function approximation. ICRA 2009: 2904-2909
2Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya: A Generalized Natural Actor-Critic Algorithm. NIPS 2009: 1312-1320
2008
1Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya: A New Natural Policy Gradient by Stationary Distribution Metric. ECML/PKDD (2) 2008: 82-97

Coauthor Index

1Kenji Doya [1] [2] [4]
2Hirotaka Hachiya [3] [6] [7] [8] [9]
3Hisashi Kashima [3] [6] [7] [8] [9]
4Takamitsu Matsubara [5]
5Jun Morimoto [5]
6Jan Peters [4]
7Masashi Sugiyama [3] [6] [7] [8] [9]
8Toshiyuki Tanaka [7] [8] [9]
9Eiji Uchibe [1] [2] [4]
10Junichiro Yoshimoto [1] [2] [4]

Last update Sun Jun 3 16:06:10 2012 CET by the DBLP TeamThis material is Open Data Data released under the ODC-BY 1.0 license — See also our legal information page