![]() | ![]() |
| 2007 | ||
|---|---|---|
| 1 | Roy Fox, Moshe Tennenholtz: A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs. AAAI 2007: 553-558 | |
| 1 | Moshe Tennenholtz | [1] |
Data released under the ODC-BY 1.0 license — See also our legal information page