dblp.uni-trier.dewww.dagstuhl.dewww.uni-trier.de

Mohammad Ghavamzadeh Coauthor index pubzone.org

List of publications from the DBLP Bibliography Server - FAQ
Ask others: ACM DL/Guide - CiteSeerX - CSB - MetaPress - Google - Bing - Yahoo

DBLP keys2011
26Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAlexandra Carpentier, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, Peter Auer: Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits. ALT 2011: 189-203
25Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMatthew W. Hoffman, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos: Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization. EWRL 2011: 102-114
24no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLVictor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh, Bruno Scherrer: Classification-based Policy Iteration with a Critic. ICML 2011: 1049-1056
23no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Alessandro Lazaric, Rémi Munos, Matthew W. Hoffman: Finite-Sample Analysis of Lasso-TD. ICML 2011: 1177-1184
22Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLVictor Gabillon, Mohammad Ghavamzadeh, Alessandro Lazaric, Sébastien Bubeck: Multi-Bandit Best Arm Identification. NIPS 2011: 2222-2230
21Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Gheshlaghi Azar, Rémi Munos, Mohammad Ghavamzadeh, Hilbert J. Kappen: Speedy Q-Learning. NIPS 2011: 2411-2419
2010
20Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAlessandro Lazaric, Mohammad Ghavamzadeh: Bayesian Multi-Task Reinforcement Learning. ICML 2010: 599-606
19Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAlessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos: Analysis of a Classification-based Policy Iteration Algorithm. ICML 2010: 607-614
18Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAlessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos: Finite-Sample Analysis of LSTD. ICML 2010: 615-622
17Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Alessandro Lazaric, Odalric-Ambrym Maillard, Rémi Munos: LSTD with Random Projections. NIPS 2010: 721-729
16Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLOdalric-Ambrym Maillard, Rémi Munos, Alessandro Lazaric, Mohammad Ghavamzadeh: Finite-sample Analysis of Bellman Residual Minimization. Journal of Machine Learning Research - Proceedings Track 13: 299-314 (2010)
2009
15Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLShalabh Bhatnagar, Richard S. Sutton, Mohammad Ghavamzadeh, Mark Lee: Natural actor-critic algorithms. Automatica 45(11): 2471-2482 (2009)
2008
14Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Fitted Q-Iteration: Application to Planning. EWRL 2008: 55-68
13Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Policy Iteration. NIPS 2008: 441-448
2007
12Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Yaakov Engel: Bayesian actor-critic algorithms. ICML 2007: 297-304
11Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLShalabh Bhatnagar, Richard S. Sutton, Mohammad Ghavamzadeh, Mark Lee: Incremental Natural Actor-Critic Algorithms. NIPS 2007
10Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Sridhar Mahadevan: Hierarchical Average Reward Reinforcement Learning. Journal of Machine Learning Research 8: 2629-2669 (2007)
2006
9Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Yaakov Engel: Bayesian Policy Gradient Algorithms. NIPS 2006: 457-464
8Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Sridhar Mahadevan, Rajbala Makar: Hierarchical multi-agent reinforcement learning. Autonomous Agents and Multi-Agent Systems 13(2): 197-229 (2006)
2005
7Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLIon Muslea, Virginia Dignum, Daniel D. Corkill, Catholijn M. Jonker, Frank Dignum, Silvia Coradeschi, Alessandro Saffiotti, Dan Fu, Jeff Orkin, William Cheetham, Kai Goebel, Piero P. Bonissone, Leen-Kiat Soh, Randolph M. Jones, Robert E. Wray III, Matthias Scheutz, Daniela Pucci de Farias, Shie Mannor, Georgios Theocharous, Doina Precup, Bamshad Mobasher, Sarabjot S. Anand, Bettina Berendt, Andreas Hotho, Hans W. Guesgen, Michael T. Rosenstein, Mohammad Ghavamzadeh: The Workshop Program at the Nineteenth National Conference on Artificial Intelligence. AI Magazine 26(1): 103-108 (2005)
2004
6Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Sridhar Mahadevan: Learning to Communicate and Act Using Hierarchical Reinforcement Learning. AAMAS 2004: 1114-1121
2003
5no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Sridhar Mahadevan: Hierarchical Policy Gradient Algorithms. ICML 2003: 226-233
2002
4Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Sridhar Mahadevan: A multiagent reinforcement learning algorithm by dynamically merging markov decision processes. AAMAS 2002: 845-846
3no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Sridhar Mahadevan: Hierarchically Optimal Average Reward Reinforcement Learning. ICML 2002: 195-202
2001
2Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLRajbala Makar, Sridhar Mahadevan, Mohammad Ghavamzadeh: Hierarchical multi-agent reinforcement learning. Agents 2001: 246-253
1no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMohammad Ghavamzadeh, Sridhar Mahadevan: Continuous-Time Hierarchical Reinforcement Learning. ICML 2001: 186-193

Coauthor Index

1Sarabjot S. Anand (Sarabjot Singh Anand, Sarab S. Anand) [7]
2Peter Auer [26]
3Mohammad Gheshlaghi Azar [21]
4Bettina Berendt [7]
5Shalabh Bhatnagar [11] [15]
6Piero P. Bonissone [7]
7Sébastien Bubeck [22]
8Alexandra Carpentier [26]
9William Cheetham [7]
10Silvia Coradeschi [7]
11Daniel D. Corkill [7]
12Frank Dignum (F. P. M. Dignum) [7]
13Virginia Dignum [7]
14Yaakov Engel [9] [12]
15Amir Massoud Farahmand [13] [14]
16Daniela Pucci de Farias [7]
17Dan Fu [7]
18Victor Gabillon [22] [24]
19Kai Goebel [7]
20Hans W. Guesgen (Hans Werner Guesgen) [7]
21Matthew W. Hoffman [23] [25]
22Andreas Hotho [7]
23Randolph M. Jones [7]
24Catholijn M. Jonker [7]
25Hilbert J. Kappen (Bert Kappen) [21]
26Alessandro Lazaric [16] [17] [18] [19] [20] [22] [23] [24] [25] [26]
27Mark Lee [11] [15]
28Sridhar Mahadevan [1] [2] [3] [4] [5] [6] [8] [10]
29Odalric-Ambrym Maillard [16] [17]
30Rajbala Makar [2] [8]
31Shie Mannor [7] [13] [14]
32Bamshad Mobasher [7]
33Rémi Munos [16] [17] [18] [19] [21] [23] [25] [26]
34Ion Muslea [7]
35Jeff Orkin [7]
36Doina Precup [7]
37Michael T. Rosenstein [7]
38Alessandro Saffiotti [7]
39Bruno Scherrer [24]
40Matthias Scheutz [7]
41Leen-Kiat Soh [7]
42Richard S. Sutton [11] [15]
43Csaba Szepesvári [13] [14]
44Georgios Theocharous [7]
45Robert E. Wray III [7]

Last update Wed May 30 22:34:44 2012 CET by the DBLP TeamThis material is Open Data Data released under the ODC-BY 1.0 license — See also our legal information page