![]() | ![]() |
Ask others: ACM DL/Guide -
- CSB - MetaPress - Google - Bing - Yahoo
| 91 | Hirotaka Hachiya, Jan Peters, Masashi Sugiyama: Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning. Neural Computation 23(11): 2798-2832 (2011) | |
| 60 | Hirotaka Hachiya, Jan Peters, Masashi Sugiyama: Efficient Sample Reuse in EM-Based Policy Search. ECML/PKDD (1) 2009: 469-484 | |
| 49 | Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters: Adaptive importance sampling for value function approximation in off-policy reinforcement learning. Neural Networks 22(10): 1399-1410 (2009) | |
| 47 | Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters: Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation. AAAI 2008: 1351-1356 |
Selection of 4 from 120 records - Jan Peters has 114 coauthors
Last update 2012-09-10 CET by the DBLP Team —
Content released under the ODC-BY 1.0 license — See also our legal information page