Edward Hughes 0001

Person information

affiliation: DeepMind Technologies Limited, London, United Kingdom

2020 – today

2024
[c21]
https://dblp.org/rec/conf/atal/ZhaoB0ZNTTT24
Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, Milind Tambe:
Towards Zero Shot Learning in Restless Multi-armed Bandits. AAMAS 2024: 2618-2620
[c20]
https://dblp.org/rec/conf/icml/0001DPBMSSR24
Edward Hughes, Michael D. Dennis, Jack Parker-Holder, Feryal M. P. Behbahani, Aditi Mavalankar, Yuge Shi, Tom Schaul, Tim Rocktäschel:
Position: Open-Endedness is Essential for Artificial Superhuman Intelligence. ICML 2024
[c19]
https://dblp.org/rec/conf/icml/BruceDEPS0LMSAA24
Jake Bruce, Michael D. Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal M. P. Behbahani, Stephanie C. Y. Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott E. Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel:
Genie: Generative Interactive Environments. ICML 2024
[c18]
https://dblp.org/rec/conf/ijcai/ZhaoB0ZNTTT24
Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, Milind Tambe:
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization. IJCAI 2024: 321-329
[i33]
https://dblp.org/rec/journals/corr/abs-2402-15391
Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal M. P. Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott E. Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel:
Genie: Generative Interactive Environments. CoRR abs/2402.15391 (2024)
[i32]
https://dblp.org/rec/journals/corr/abs-2404-16244
Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks, Verena Rieser, Hasan Iqbal, Nenad Tomasev, Ira Ktena, Zachary Kenton, Mikel Rodriguez, Seliem El-Sayed, Sasha Brown, Canfer Akbulut, Andrew Trask, Edward Hughes, A. Stevie Bergman, Renee Shelby, Nahema Marchal, Conor Griffin, Juan Mateos-Garcia, Laura Weidinger, Winnie Street, Benjamin Lange, Alex Ingerman, Alison Lentz, Reed Enger, Andrew Barakat, Victoria Krakovna, John Oliver Siy, Zeb Kurth-Nelson, Amanda McCroskery, Vijay Bolina, Harry Law, Murray Shanahan, Lize Alberts, Borja Balle, Sarah de Haas, Yetunde Ibitoye, Allan Dafoe, Beth Goldberg, Sébastien Krier, Alexander Reese, Sims Witherspoon, Will Hawkins, Maribeth Rauh, Don Wallace, Matija Franklin, Josh A. Goldstein, Joel Lehman, Michael Klenk, Shannon Vallor, Courtney Biles, Meredith Ringel Morris, Helen King, Blaise Agüera y Arcas, William Isaac, James Manyika:
The Ethics of Advanced AI Assistants. CoRR abs/2404.16244 (2024)
[i31]
https://dblp.org/rec/journals/corr/abs-2406-00392
Jonathan Cook, Chris Lu, Edward Hughes, Joel Z. Leibo, Jakob N. Foerster:
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning. CoRR abs/2406.00392 (2024)
[i30]
https://dblp.org/rec/journals/corr/abs-2406-04268
Edward Hughes, Michael Dennis, Jack Parker-Holder, Feryal M. P. Behbahani, Aditi Mavalankar, Yuge Shi, Tom Schaul, Tim Rocktäschel:
Open-Endedness is Essential for Artificial Superhuman Intelligence. CoRR abs/2406.04268 (2024)
2023
[c17]
https://dblp.org/rec/conf/icml/BauerBBBBCCCDGG23
Jakob Bauer, Kate Baumli, Feryal M. P. Behbahani, Avishkar Bhoopchand, Nathalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Satinder Singh, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei M. Zhang:
Human-Timescale Adaptation in an Open-Ended Task Space. ICML 2023: 1887-1935
[i29]
https://dblp.org/rec/journals/corr/abs-2301-07608
Adaptive Agent Team, Jakob Bauer, Kate Baumli, Satinder Baveja, Feryal M. P. Behbahani, Avishkar Bhoopchand, Nathalie Bradley-Schmieg, Michael Chang, Natalie Clay, Adrian Collister, Vibhavari Dasagi, Lucy Gonzalez, Karol Gregor, Edward Hughes, Sheleem Kashem, Maria Loks-Thompson, Hannah Openshaw, Jack Parker-Holder, Shreya Pathak, Nicolas Perez Nieves, Nemanja Rakicevic, Tim Rocktäschel, Yannick Schroecker, Jakub Sygnowski, Karl Tuyls, Sarah York, Alexander Zacherl, Lei Zhang:
Human-Timescale Adaptation in an Open-Ended Task Space. CoRR abs/2301.07608 (2023)
[i28]
https://dblp.org/rec/journals/corr/abs-2305-00768
Udari Madhushani, Kevin R. McKee, John P. Agapiou, Joel Z. Leibo, Richard Everett, Thomas W. Anthony, Edward Hughes, Karl Tuyls, Edgar A. Duéñez-Guzmán:
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas. CoRR abs/2305.00768 (2023)
[i27]
https://dblp.org/rec/journals/corr/abs-2310-14526
Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, Milind Tambe:
Towards Zero Shot Learning in Restless Multi-armed Bandits. CoRR abs/2310.14526 (2023)
2022
[j4]
https://dblp.org/rec/journals/aicom/GempABBBCDVDEEH22
Ian Gemp, Thomas W. Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome T. Connor, Vibhavari Dasagi, Bart De Vylder, Edgar A. Duéñez-Guzmán, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Pérolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov, Zhe Wang, Karl Tuyls:
Developing, evaluating and scaling learning agents in multi-agent environments. AI Commun. 35(4): 271-284 (2022)
[i26]
https://dblp.org/rec/journals/corr/abs-2203-00715
Avishkar Bhoopchand, Bethanie Brownfield, Adrian Collister, Agustin Dal Lago, Ashley Edwards, Richard Everett, Alexandre Fréchette, Yanko Gitahy Oliveira, Edward Hughes, Kory W. Mathewson, Piermaria Mendolicchio, Julia Pawar, Miruna Pislar, Alex Platonov, Evan Senter, Sukhdeep Singh, Alexander Zacherl, Lei M. Zhang:
Learning Robust Real-Time Cultural Transmission without Human Data. CoRR abs/2203.00715 (2022)
[i25]
https://dblp.org/rec/journals/corr/abs-2205-06760
Michael Bradley Johanson, Edward Hughes, Finbarr Timbers, Joel Z. Leibo:
Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning. CoRR abs/2205.06760 (2022)
[i24]
https://dblp.org/rec/journals/corr/abs-2209-10958
Ian Gemp, Thomas W. Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome T. Connor, Vibhavari Dasagi, Bart De Vylder, Edgar A. Duéñez-Guzmán, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Pérolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov, Zhe Wang, Karl Tuyls:
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments. CoRR abs/2209.10958 (2022)
2021
[c16]
https://dblp.org/rec/conf/atal/Bakker0WGIL021
Michiel A. Bakker, Richard Everett, Laura Weidinger, Iason Gabriel, William S. Isaac, Joel Z. Leibo, Edward Hughes:
Modelling Cooperation in Network Games with Spatio-Temporal Complexity. AAMAS 2021: 1455-1457
[c15]
https://dblp.org/rec/conf/nips/StrouseMBHE21
DJ Strouse, Kevin R. McKee, Matt M. Botvinick, Edward Hughes, Richard Everett:
Collaborating with Humans without Human Data. NeurIPS 2021: 14502-14515
[i23]
https://dblp.org/rec/journals/corr/abs-2102-02274
Pol Moreno, Edward Hughes, Kevin R. McKee, Bernardo Ávila Pires, Théophane Weber:
Neural Recursive Belief States in Multi-Agent Reinforcement Learning. CoRR abs/2102.02274 (2021)
[i22]
https://dblp.org/rec/journals/corr/abs-2102-06911
Michiel A. Bakker, Richard Everett, Laura Weidinger, Iason Gabriel, William S. Isaac, Joel Z. Leibo, Edward Hughes:
Modelling Cooperation in Network Games with Spatio-Temporal Complexity. CoRR abs/2102.06911 (2021)
[i21]
https://dblp.org/rec/journals/corr/abs-2103-04982
Kevin R. McKee, Edward Hughes, Tina O. Zhu, Martin J. Chadwick, Raphael Koster, Antonio García Castañeda, Charlie Beattie, Thore Graepel, Matthew M. Botvinick, Joel Z. Leibo:
Deep reinforcement learning models the emergent dynamics of human cooperation. CoRR abs/2103.04982 (2021)
[i20]
https://dblp.org/rec/journals/corr/abs-2110-08176
DJ Strouse, Kevin R. McKee, Matt M. Botvinick, Edward Hughes, Richard Everett:
Collaborating with Humans without Human Data. CoRR abs/2110.08176 (2021)
2020
[j3]
https://dblp.org/rec/journals/aamas/TuylsPLHELSG20
Karl Tuyls, Julien Pérolat, Marc Lanctot, Edward Hughes, Richard Everett, Joel Z. Leibo, Csaba Szepesvári, Thore Graepel:
Bounds and dynamics for empirical game theoretic analysis. Auton. Agents Multi Agent Syst. 34(1): 7 (2020)
[j2]
https://dblp.org/rec/journals/ai/BardFCBLSPDMHDM20
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling:
The Hanabi challenge: A new frontier for AI research. Artif. Intell. 280: 103216 (2020)
[j1]
https://dblp.org/rec/journals/ai/BachrachEHLLLJC20
Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel:
Negotiating team formation using deep reinforcement learning. Artif. Intell. 288: 103356 (2020)
[c14]
https://dblp.org/rec/conf/atal/HughesAELBB20
Edward Hughes, Thomas W. Anthony, Tom Eccles, Joel Z. Leibo, David Balduzzi, Yoram Bachrach:
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games. AAMAS 2020: 538-547
[c13]
https://dblp.org/rec/conf/atal/McKeeGMDHL20
Kevin R. McKee, Ian Gemp, Brian McWilliams, Edgar A. Duéñez-Guzmán, Edward Hughes, Joel Z. Leibo:
Social Diversity and Social Preferences in Mixed-Motive Reinforcement Learning. AAMAS 2020: 869-877
[c12]
https://dblp.org/rec/conf/iclr/BalduzziCAGHLPG20
David Balduzzi, Wojciech M. Czarnecki, Tom Anthony, Ian Gemp, Edward Hughes, Joel Z. Leibo, Georgios Piliouras, Thore Graepel:
Smooth markets: A basic mechanism for organizing gradient-based learners. ICLR 2020
[c11]
https://dblp.org/rec/conf/iclr/MullerORTPLHMLH20
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Pérolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Rémi Munos:
A Generalized Training Approach for Multiagent Learning. ICLR 2020
[c10]
https://dblp.org/rec/conf/nips/YangLFS0Z20
Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes, Hongyuan Zha:
Learning to Incentivize Other Learning Agents. NeurIPS 2020
[i19]
https://dblp.org/rec/journals/corr/abs-2001-04678
David Balduzzi, Wojciech M. Czarnecki, Thomas W. Anthony, Ian M. Gemp, Edward Hughes, Joel Z. Leibo, Georgios Piliouras, Thore Graepel:
Smooth markets: A basic mechanism for organizing gradient-based learners. CoRR abs/2001.04678 (2020)
[i18]
https://dblp.org/rec/journals/corr/abs-2002-02325
Kevin R. McKee, Ian Gemp, Brian McWilliams, Edgar A. Duéñez-Guzmán, Edward Hughes, Joel Z. Leibo:
Social Diversity and Social Preferences in Mixed-Motive Reinforcement Learning. CoRR abs/2002.02325 (2020)
[i17]
https://dblp.org/rec/journals/corr/abs-2003-00799
Edward Hughes, Thomas W. Anthony, Tom Eccles, Joel Z. Leibo, David Balduzzi, Yoram Bachrach:
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games. CoRR abs/2003.00799 (2020)
[i16]
https://dblp.org/rec/journals/corr/abs-2006-06051
Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes, Hongyuan Zha:
Learning to Incentivize Other Learning Agents. CoRR abs/2006.06051 (2020)
[i15]
https://dblp.org/rec/journals/corr/abs-2010-09054
Raphael Köster, Kevin R. McKee, Richard Everett, Laura Weidinger, William S. Isaac, Edward Hughes, Edgar A. Duéñez-Guzmán, Thore Graepel, Matthew M. Botvinick, Joel Z. Leibo:
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences. CoRR abs/2010.09054 (2020)
[i14]
https://dblp.org/rec/journals/corr/abs-2010-10380
Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel:
Negotiating Team Formation Using Deep Reinforcement Learning. CoRR abs/2010.10380 (2020)
[i13]
https://dblp.org/rec/journals/corr/abs-2012-08630
Allan Dafoe, Edward Hughes, Yoram Bachrach, Tantum Collins, Kevin R. McKee, Joel Z. Leibo, Kate Larson, Thore Graepel:
Open Problems in Cooperative AI. CoRR abs/2012.08630 (2020)

2010 – 2019

2019
[c9]
https://dblp.org/rec/conf/atal/WangHFCDL19
Jane X. Wang, Edward Hughes, Chrisantha Fernando, Wojciech M. Czarnecki, Edgar A. Duéñez-Guzmán, Joel Z. Leibo:
Evolving Intrinsic Motivations for Altruistic Behavior. AAMAS 2019: 683-692
[c8]
https://dblp.org/rec/conf/atal/LeiboPHWMDSDG19
Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. AAMAS 2019: 1099-1107
[c7]
https://dblp.org/rec/conf/atal/EcclesHKWL19
Tom Eccles, Edward Hughes, János Kramár, Steven Wheelwright, Joel Z. Leibo:
The Imitation Game: Learned Reciprocity in Markov games. AAMAS 2019: 1934-1936
[c6]
https://dblp.org/rec/conf/iclr/BahdanauHLHHKG19
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Seyed Arian Hosseini, Pushmeet Kohli, Edward Grefenstette:
Learning to Understand Goal Specifications by Modelling Reward. ICLR (Poster) 2019
[c5]
https://dblp.org/rec/conf/icml/FoersterSHBDWBB19
Jakob N. Foerster, H. Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew M. Botvinick, Michael Bowling:
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning. ICML 2019: 1942-1951
[c4]
https://dblp.org/rec/conf/icml/JaquesLHGOSLF19
Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Çaglar Gülçehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas:
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning. ICML 2019: 3040-3049
[c3]
https://dblp.org/rec/conf/isalalife/SunehagLLMHL0EG19
Peter Sunehag, Guy Lever, Siqi Liu, Josh Merel, Nicolas Heess, Joel Z. Leibo, Edward Hughes, Tom Eccles, Thore Graepel:
Reinforcement Learning Agents acquire Flocking and Symbiotic Behaviour in Simulated Ecosystems. ALIFE 2019: 103-110
[i12]
https://dblp.org/rec/journals/corr/abs-1901-08162
Ishita Dasgupta, Jane X. Wang, Silvia Chiappa, Jovana Mitrovic, Pedro A. Ortega, David Raposo, Edward Hughes, Peter W. Battaglia, Matthew M. Botvinick, Zeb Kurth-Nelson:
Causal Reasoning from Meta-reinforcement Learning. CoRR abs/1901.08162 (2019)
[i11]
https://dblp.org/rec/journals/corr/abs-1902-00506
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling:
The Hanabi Challenge: A New Frontier for AI Research. CoRR abs/1902.00506 (2019)
[i10]
https://dblp.org/rec/journals/corr/abs-1903-00742
Joel Z. Leibo, Edward Hughes, Marc Lanctot, Thore Graepel:
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research. CoRR abs/1903.00742 (2019)
[i9]
https://dblp.org/rec/journals/corr/abs-1903-08082
Tom Eccles, Edward Hughes, János Kramár, Steven Wheelwright, Joel Z. Leibo:
Learning Reciprocity in Complex Sequential Social Dilemmas. CoRR abs/1903.08082 (2019)
[i8]
https://dblp.org/rec/journals/corr/abs-1908-09453
Marc Lanctot, Edward Lockhart, Jean-Baptiste Lespiau, Vinícius Flores Zambaldi, Satyaki Upadhyay, Julien Pérolat, Sriram Srinivasan, Finbarr Timbers, Karl Tuyls, Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Paul Muller, Timo Ewalds, Ryan Faulkner, János Kramár, Bart De Vylder, Brennan Saeta, James Bradbury, David Ding, Sebastian Borgeaud, Matthew Lai, Julian Schrittwieser, Thomas W. Anthony, Edward Hughes, Ivo Danihelka, Jonah Ryan-Davis:
OpenSpiel: A Framework for Reinforcement Learning in Games. CoRR abs/1908.09453 (2019)
[i7]
https://dblp.org/rec/journals/corr/abs-1909-12823
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Pérolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Rémi Munos:
A Generalized Training Approach for Multiagent Learning. CoRR abs/1909.12823 (2019)
2018
[c2]
https://dblp.org/rec/conf/iclr/BahdanauHLHKG18
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Pushmeet Kohli, Edward Grefenstette:
Jointly Learning "What" and "How" from Instructions and Goal-States. ICLR (Workshop) 2018
[c1]
https://dblp.org/rec/conf/nips/HughesLPTDCDZMK18
Edward Hughes, Joel Z. Leibo, Matthew Phillips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion improves cooperation in intertemporal social dilemmas. NeurIPS 2018: 3330-3340
[i6]
https://dblp.org/rec/journals/corr/abs-1803-08884
Edward Hughes, Joel Z. Leibo, Matthew G. Philips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion resolves intertemporal social dilemmas. CoRR abs/1803.08884 (2018)
[i5]
https://dblp.org/rec/journals/corr/abs-1806-01946
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Pushmeet Kohli, Edward Grefenstette:
Learning to Follow Language Instructions with Adversarial Reward Induction. CoRR abs/1806.01946 (2018)
[i4]
https://dblp.org/rec/journals/corr/abs-1810-08647
Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Çaglar Gülçehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas:
Intrinsic Social Motivation via Causal Influence in Multi-Agent RL. CoRR abs/1810.08647 (2018)
[i3]
https://dblp.org/rec/journals/corr/abs-1811-01458
Jakob N. Foerster, H. Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew M. Botvinick, Michael Bowling:
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning. CoRR abs/1811.01458 (2018)
[i2]
https://dblp.org/rec/journals/corr/abs-1811-05931
Jane X. Wang, Edward Hughes, Chrisantha Fernando, Wojciech M. Czarnecki, Edgar A. Duéñez-Guzmán, Joel Z. Leibo:
Evolving intrinsic motivations for altruistic behavior. CoRR abs/1811.05931 (2018)
[i1]
https://dblp.org/rec/journals/corr/abs-1812-07019
Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. CoRR abs/1812.07019 (2018)
