| 2010 |
| 4 |  | Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiyama: Efficient exploration through active learning for value function approximation in reinforcement learning. Neural Networks 23(5): 639-648 (2010) |
| 2009 |
| 3 |  | Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiyama: Active Policy Iteration: Efficient Exploration through Active Learning for Value Function Approximation in Reinforcement Learning. IJCAI 2009: 980-985 |
| 2 |  | Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters: Adaptive importance sampling for value function approximation in off-policy reinforcement learning. Neural Networks 22(10): 1399-1410 (2009) |
| 2008 |
| 1 |  | Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters: Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation. AAAI 2008: 1351-1356 |