 | 2012 |
| 19 |  | Jérémy Fix,
Matthieu Geist:
Monte-Carlo Swarm Policy Search.
ICAISC (SIDE-EC) 2012: 75-83 |
| 2011 |
| 18 |  | Bruno Scherrer,
Matthieu Geist:
Recursive Least-Squares Learning with Eligibility Traces.
EWRL 2011: 115-127 |
| 17 |  | Edouard Klein,
Matthieu Geist,
Olivier Pietquin:
Batch, Off-Policy and Model-Free Apprenticeship Learning.
EWRL 2011: 285-296 |
| 16 |  | Matthieu Geist,
Bruno Scherrer:
ℓ1-Penalized Projected Bellman Residual.
EWRL 2011: 89-101 |
| 15 |  | Olivier Pietquin,
Matthieu Geist,
Senthilkumar Chandramohan:
Sample Efficient On-Line Learning of Optimal Dialogue Policies with Kalman Temporal Differences.
IJCAI 2011: 1878-1883 |
| 14 |  | Senthilkumar Chandramohan,
Matthieu Geist,
Fabrice Lefèvre,
Olivier Pietquin:
User Simulation in Dialogue Systems Using Inverse Reinforcement Learning.
INTERSPEECH 2011: 1025-1028 |
| 13 |  | Lucie Daubigney,
Milica Gasic,
Senthilkumar Chandramohan,
Matthieu Geist,
Olivier Pietquin,
Steve Young:
Uncertainty Management for On-Line Optimisation of a POMDP-Based Large-Scale Spoken Dialogue System.
INTERSPEECH 2011: 1301-1304 |
| 12 |  | Matthieu Geist,
Olivier Pietquin:
Managing Uncertainty within KTD.
Journal of Machine Learning Research - Proceedings Track 16: 157-168 (2011) |
| 11 |  | Olivier Pietquin,
Matthieu Geist,
Senthilkumar Chandramohan,
Hervé Frezza-Buet:
Sample-efficient batch reinforcement learning for dialogue management optimization.
TSLP 7(3): 7 (2011) |
| 2010 |
| 10 |  | Matthieu Geist,
Olivier Pietquin:
Statistically linearized least-squares temporal differences.
ICUMT 2010: 450-457 |
| 9 |  | Matthieu Geist,
Olivier Pietquin:
Eligibility traces through colored noises.
ICUMT 2010: 458-465 |
| 8 |  | Senthilkumar Chandramohan,
Matthieu Geist,
Olivier Pietquin:
Optimizing spoken dialogue management with fitted value iteration.
INTERSPEECH 2010: 86-89 |
| 7 |  | Matthieu Geist,
Olivier Pietquin:
Revisiting Natural Actor-Critics with Value Function Approximation.
MDAI 2010: 207-218 |
| 6 |  | Senthilkumar Chandramohan,
Matthieu Geist,
Olivier Pietquin:
Sparse Approximate Dynamic Programming for Dialog Management.
SIGDIAL Conference 2010: 107-115 |
| 5 |  | Matthieu Geist,
Olivier Pietquin:
Kalman Temporal Differences.
J. Artif. Intell. Res. (JAIR) 39: 483-532 (2010) |
| 4 |  | Matthieu Geist,
Olivier Pietquin,
Gabriel Fricout:
Différences temporelles de Kalman. Cas déterministe.
Revue d'Intelligence Artificielle 24(4): 423-443 (2010) |
| 2009 |
| 3 |  | Matthieu Geist,
Olivier Pietquin,
Gabriel Fricout:
Kernelizing Vector Quantization Algorithms.
ESANN 2009 |
| 2 |  | Matthieu Geist,
Olivier Pietquin,
Gabriel Fricout:
Tracking in Reinforcement Learning.
ICONIP (1) 2009: 502-511 |
| 2008 |
| 1 |  | Matthieu Geist,
Olivier Pietquin,
Gabriel Fricout:
Bayesian Reward Filtering.
EWRL 2008: 96-109 |