![]() |
Ask others: ACM DL/Guide -
- CSB - MetaPress - Google - Bing - Yahoo
| 2 | Nigel Tao, Jonathan Baxter, Lex Weaver: A Multi-Agent Policy-Gradient Approach to Network Routing. ICML 2001: 553-560 | |
| 1 | Lex Weaver, Nigel Tao: The Optimal Reward Baseline for Gradient-Based Reinforcement Learning. UAI 2001: 538-545 |
Selection of 2 from 2 records - Nigel Tao has 2 coauthors
Copyright © 2009-12-28 by Michael Ley (ley@uni-trier.de)