 | 2011 |
| 8 |  | Seiji Ishihara,
Harukazu Igarashi:
Policy Gradient Reinforcement Learning with Environmental Dynamics and Action-Values in Policies.
KES (1) 2011: 120-130 |
| 2008 |
| 7 |  | Harukazu Igarashi,
K. Nakamura,
Seiji Ishihara:
Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks.
IJCNN 2008: 46-52 |
| 6 |  | Seiji Ishihara,
Harukazu Igarashi:
Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State Values in Policies.
PRICAI 2008: 164-174 |
| 2006 |
| 5 |  | Seiji Ishihara,
Harukazu Igarashi:
A Task Decomposition Algorithm Using Mixtures of Normal Distributions for Classification Problems.
HIS 2006: 28 |
| 4 |  | Seiji Ishihara,
Harukazu Igarashi:
Applying the policy gradient method to behavior learning in multiagent systems: The pursuit problem.
Systems and Computers in Japan 37(10): 101-109 (2006) |
| 2005 |
| 3 |  | Seiji Ishihara,
Harukazu Igarashi:
A Task Decomposition Algorithm Using Radial Basis Functions for Classification Problems.
DICTA 2005: 2 |
| 2003 |
| 2 |  | Seiji Ishihara,
Harukazu Igarashi:
Policy Gradient Methods in Multi-Agent Systems.
HIS 2003: 789-798 |
| 1998 |
| 1 |  | Seiji Ishihara,
Takashi Nagano:
A Modular Type Network for Incremental Learning.
ICONIP 1998: 1651-1654 |