"Reinforcement Learning to Create Value and Policy Functions Using Minimax ..."

Kei Takada, Hiroyuki Iizuka, Masahito Yamamoto (2020)

Details and statistics

DOI: 10.1109/TG.2019.2893343

access: closed

type: Journal Article

metadata version: 2020-04-09

a service of  Schloss Dagstuhl - Leibniz Center for Informatics