| 2007 | ||
|---|---|---|
| 1 | Yugo Hasegawa, Satoko Takada, Hidehiro Nakano, Shuichi Arai, Arata Miyauchi: A reinforcement learning method using a dynamic reinforcement function based on action selection probability. Systems and Computers in Japan 38(7): 1-11 (2007) | |

| | Coauthor | Publications |
|---|---|---|
| 1 | Shuichi Arai | [1] |
| 2 | Arata Miyauchi | [1] |
| 3 | Hidehiro Nakano | [1] |
| 4 | Satoko Takada | [1] |

Data released under the ODC-BY 1.0 license.