"Q-Learning with Dynamic Rewards Table Applied to the SONET/SDH Ring Problem."

Thiago Henrique Freire de Oliveira, Adrião Duarte Dória Neto, Jorge Dantas de Melo (2018)
a service of Schloss Dagstuhl - Leibniz Center for Informatics