 | 2012 |
| 21 |  | Tetsuro Morimura,
Masashi Sugiyama,
Hisashi Kashima,
Hirotaka Hachiya,
Toshiyuki Tanaka:
Parametric Return Density Estimation for Reinforcement Learning
CoRR abs/1203.3497: (2012) |
| 20 |  | Tingting Zhao,
Hirotaka Hachiya,
Gang Niu,
Masashi Sugiyama:
Analysis and improvement of policy gradient estimation.
Neural Networks 26: 118-129 (2012) |
| 19 |  | Hirotaka Hachiya,
Masashi Sugiyama,
Naonori Ueda:
Importance-weighted least-squares probabilistic classifier for covariate shift adaptation with application to human activity recognition.
Neurocomputing 80: 93-101 (2012) |
| 2011 |
| 18 |  | Masashi Sugiyama,
Makoto Yamada,
Manabu Kimura,
Hirotaka Hachiya:
On Information-Maximization Clustering: Tuning Parameter Selection and Analytic Solution.
ICML 2011: 65-72 |
| 17 |  | Tingting Zhao,
Hirotaka Hachiya,
Gang Niu,
Masashi Sugiyama:
Analysis and Improvement of Policy Gradient Estimation.
NIPS 2011: 262-270 |
| 16 |  | Makoto Yamada,
Taiji Suzuki,
Takafumi Kanamori,
Hirotaka Hachiya,
Masashi Sugiyama:
Relative Density-Ratio Estimation for Robust Distribution Comparison.
NIPS 2011: 594-602 |
| 15 |  | Hirotaka Hachiya,
Jan Peters,
Masashi Sugiyama:
Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning.
Neural Computation 23(11): 2798-2832 (2011) |
| 2010 |
| 14 |  | Hirotaka Hachiya,
Masashi Sugiyama:
Feature Selection for Reinforcement Learning: Evaluating Implicit State-Reward Dependency via Conditional Mutual Information.
ECML/PKDD (1) 2010: 474-489 |
| 13 |  | Tetsuro Morimura,
Masashi Sugiyama,
Hisashi Kashima,
Hirotaka Hachiya,
Toshiyuki Tanaka:
Nonparametric Return Distribution Approximation for Reinforcement Learning.
ICML 2010: 799-806 |
| 12 |  | Tetsuro Morimura,
Masashi Sugiyama,
Hisashi Kashima,
Hirotaka Hachiya,
Toshiyuki Tanaka:
Parametric Return Density Estimation for Reinforcement Learning.
UAI 2010: 368-375 |
| 11 |  | Masashi Sugiyama,
Ichiro Takeuchi,
Taiji Suzuki,
Takafumi Kanamori,
Hirotaka Hachiya,
Daisuke Okanohara:
Least-Squares Conditional Density Estimation.
IEICE Transactions 93-D(3): 583-594 (2010) |
| 10 |  | Masashi Sugiyama,
Hirotaka Hachiya,
Hisashi Kashima,
Tetsuro Morimura:
Least Absolute Policy Iteration--A Robust Approach to Value Function Approximation.
IEICE Transactions 93-D(9): 2555-2565 (2010) |
| 9 |  | Masashi Sugiyama,
Ichiro Takeuchi,
Taiji Suzuki,
Takafumi Kanamori,
Hirotaka Hachiya,
Daisuke Okanohara:
Conditional Density Estimation via Least-Squares Density Ratio Estimation.
Journal of Machine Learning Research - Proceedings Track 9: 781-788 (2010) |
| 8 |  | Takayuki Akiyama,
Hirotaka Hachiya,
Masashi Sugiyama:
Efficient exploration through active learning for value function approximation in reinforcement learning.
Neural Networks 23(5): 639-648 (2010) |
| 2009 |
| 7 |  | Hirotaka Hachiya,
Jan Peters,
Masashi Sugiyama:
Efficient Sample Reuse in EM-Based Policy Search.
ECML/PKDD (1) 2009: 469-484 |
| 6 |  | Masashi Sugiyama,
Hirotaka Hachiya,
Hisashi Kashima,
Tetsuro Morimura:
Least absolute policy iteration for robust value function approximation.
ICRA 2009: 2904-2909 |
| 5 |  | Takayuki Akiyama,
Hirotaka Hachiya,
Masashi Sugiyama:
Active Policy Iteration: Efficient Exploration through Active Learning for Value Function Approximation in Reinforcement Learning.
IJCAI 2009: 980-985 |
| 4 |  | Hirotaka Hachiya,
Takayuki Akiyama,
Masashi Sugiyama,
Jan Peters:
Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
Neural Networks 22(10): 1399-1410 (2009) |
| 2008 |
| 3 |  | Hirotaka Hachiya,
Takayuki Akiyama,
Masashi Sugiyama,
Jan Peters:
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation.
AAAI 2008: 1351-1356 |
| 2 |  | Masashi Sugiyama,
Hirotaka Hachiya,
Christopher Towell,
Sethu Vijayakumar:
Geodesic Gaussian kernels for value function approximation.
Auton. Robots 25(3): 287-304 (2008) |
| 2007 |
| 1 |  | Masashi Sugiyama,
Hirotaka Hachiya,
Christopher Towell,
Sethu Vijayakumar:
Value Function Approximation on Non-Linear Manifolds for Robot Motor Control.
ICRA 2007: 1733-1740 |