| 2009 | | |
|---|---|---|
| 1 | Peter Vamplew, Richard Dazeley, Ewan Barker, Andrei Kelarev: Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks. Australasian Conference on Artificial Intelligence 2009: 340-349 | |

| No. | Coauthor | Publications |
|---|---|---|
| 1 | Richard Dazeley | [1] |
| 2 | Andrei Kelarev (A. V. Kelarev) | [1] |
| 3 | Peter Vamplew | [1] |
Data released under the ODC-BY 1.0 license.