default search action

combined dblp search
author search
venue search
publication search

ask others

Dhruva Tirumala

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/scirobotics/HaarnojaMLHTHWTSHBHBHTSBCSG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/scirobotics/HaarnojaMLHTHWTSHBHBHTSBCSG24
Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Jan Humplik, Markus Wulfmeier, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, Francesco Nori, Raia Hadsell, Nicolas Heess:
Learning agile soccer skills for a bipedal robot with deep reinforcement learning. Sci. Robotics 9(89) (2024)
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/TirumalaLCHHLMH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TirumalaLCHHLMH24
Dhruva Tirumala, Thomas Lampe, José Enrique Chen, Tuomas Haarnoja, Sandy H. Huang, Guy Lever, Ben Moran, Tim Hertweck, Leonard Hasenclever, Martin A. Riedmiller, Nicolas Heess, Markus Wulfmeier:
Replay across Experiments: A Natural Extension of Off-Policy RL. ICLR 2024
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-02425
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-02425
Dhruva Tirumala, Markus Wulfmeier, Ben Moran, Sandy H. Huang, Jan Humplik, Guy Lever, Tuomas Haarnoja, Leonard Hasenclever, Arunkumar Byravan, Nathan Batchelor, Neil Sreendra, Kushal Patel, Marlon Gwira, Francesco Nori, Martin A. Riedmiller, Nicolas Heess:
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning. CoRR abs/2405.02425 (2024)
2023
[j2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/VezzaniTWRAMHHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/VezzaniTWRAMHHH23
Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin A. Riedmiller:
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration. Trans. Mach. Learn. Res. 2023 (2023)
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/collas/GalashovMTTNCP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/collas/GalashovMTTNCP23
Alexandre Galashov, Jovana Mitrovic, Dhruva Tirumala, Yee Whye Teh, Timothy Nguyen, Arslan Chaudhry, Razvan Pascanu:
Continually learning representations at scale. CoLLAs 2023: 534-547
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-13653
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-13653
Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Markus Wulfmeier, Jan Humplik, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, Francesco Nori, Raia Hadsell, Nicolas Heess:
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning. CoRR abs/2304.13653 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-15951
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-15951
Dhruva Tirumala, Thomas Lampe, José Enrique Chen, Tuomas Haarnoja, Sandy H. Huang, Guy Lever, Ben Moran, Tim Hertweck, Leonard Hasenclever, Martin A. Riedmiller, Nicolas Heess, Markus Wulfmeier:
Replay across Experiments: A Natural Extension of Off-Policy RL. CoRR abs/2311.15951 (2023)
2022
[j1]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/TirumalaGNHPSDC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/TirumalaGNHPSDC22
Dhruva Tirumala, Alexandre Galashov, Hyeonwoo Noh, Leonard Hasenclever, Razvan Pascanu, Jonathan Schwarz, Guillaume Desjardins, Wojciech Marian Czarnecki, Arun Ahuja, Yee Whye Teh, Nicolas Heess:
Behavior Priors for Efficient Reinforcement Learning. J. Mach. Learn. Res. 23: 221:1-221:68 (2022)
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/collas/SalterWTHRHR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/collas/SalterWTHRHR22
Sasha Salter, Markus Wulfmeier, Dhruva Tirumala, Nicolas Heess, Martin A. Riedmiller, Raia Hadsell, Dushyant Rao:
MO2: Model-Based Offline Options. CoLLAs 2022: 902-919
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/RaoSHWZVTAMHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/RaoSHWZVTAMHH22
Dushyant Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala, Yusuf Aytar, Josh Merel, Nicolas Heess, Raia Hadsell:
Learning transferable motor skills with hierarchical latent mixture policies. ICLR 2022
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-01947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-01947
Sasha Salter, Markus Wulfmeier, Dhruva Tirumala, Nicolas Heess, Martin A. Riedmiller, Raia Hadsell, Dushyant Rao:
MO2: Model-Based Offline Options. CoRR abs/2209.01947 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-13743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-13743
Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin A. Riedmiller:
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration. CoRR abs/2211.13743 (2022)
2021
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/GarneloCLTOGHB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/GarneloCLTOGHB21
Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi:
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity. AAMAS 2021: 1501-1503
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WulfmeierRHLAHN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WulfmeierRHLAHN21
Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala, Noah Y. Siegel, Nicolas Heess, Martin A. Riedmiller:
Data-efficient Hindsight Off-policy Option Learning. ICML 2021: 11340-11350
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04041
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04041
Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi:
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity. CoRR abs/2110.04041 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-05062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-05062
Dushyant Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala, Yusuf Aytar, Josh Merel, Nicolas Heess, Raia Hadsell:
Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies. CoRR abs/2112.05062 (2021)
2020
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SongASCSRNALTHB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SongASCSRNALTHB20
H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin A. Riedmiller, Matthew M. Botvinick:
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control. ICLR 2020
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-15588
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-15588
Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala, Noah Y. Siegel, Nicolas Heess, Martin A. Riedmiller:
Data-efficient Hindsight Off-policy Option Learning. CoRR abs/2007.15588 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-14274
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14274
Dhruva Tirumala, Alexandre Galashov, Hyeonwoo Noh, Leonard Hasenclever, Razvan Pascanu, Jonathan Schwarz, Guillaume Desjardins, Wojciech Marian Czarnecki, Arun Ahuja, Yee Whye Teh, Nicolas Heess:
Behavior Priors for Efficient Reinforcement Learning. CoRR abs/2010.14274 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GalashovJHTSDCT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GalashovJHTSDCT19
Alexandre Galashov, Siddhant M. Jayakumar, Leonard Hasenclever, Dhruva Tirumala, Jonathan Schwarz, Guillaume Desjardins, Wojciech M. Czarnecki, Yee Whye Teh, Razvan Pascanu, Nicolas Heess:
Information asymmetry in KL-regularized RL. ICLR (Poster) 2019
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/MerelAPTLTHW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MerelAPTLTHW19
Josh Merel, Arun Ahuja, Vu Pham, Saran Tunyasuvunakool, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Greg Wayne:
Hierarchical Visuomotor Control of Humanoids. ICLR (Poster) 2019
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-07438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-07438
Dhruva Tirumala, Hyeonwoo Noh, Alexandre Galashov, Leonard Hasenclever, Arun Ahuja, Greg Wayne, Razvan Pascanu, Yee Whye Teh, Nicolas Heess:
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL. CoRR abs/1903.07438 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-01240
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-01240
Alexandre Galashov, Siddhant M. Jayakumar, Leonard Hasenclever, Dhruva Tirumala, Jonathan Schwarz, Guillaume Desjardins, Wojciech M. Czarnecki, Yee Whye Teh, Razvan Pascanu, Nicolas Heess:
Information asymmetry in KL-regularized RL. CoRR abs/1905.01240 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-12238
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-12238
H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin A. Riedmiller, Matthew M. Botvinick:
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control. CoRR abs/1909.12238 (2019)
2018
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-09656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-09656
Josh Merel, Arun Ahuja, Vu Pham, Saran Tunyasuvunakool, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Greg Wayne:
Hierarchical visuomotor control of humanoids. CoRR abs/1811.09656 (2018)
2017
[c2]
- view
  - electronic edition @ mindmodeling.org (archived)
  - details & citations
- export record
  dblp key:
  - conf/cogsci/WangKSLTMBKB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cogsci/WangKSLTMBKB17
Jane Wang, Zeb Kurth-Nelson, Hubert Soyer, Joel Z. Leibo, Dhruva Tirumala, Rémi Munos, Charles Blundell, Dharshan Kumaran, Matt M. Botvinick:
Learning to reinforcement learn. CogSci 2017
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ppopp/BalajiTL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ppopp/BalajiTL17
Vignesh Balaji, Dhruva Tirumala, Brandon Lucia:
POSTER: An Architecture and Programming Model for Accelerating Parallel Commutative Computations via Privatization. PPoPP 2017: 431-432
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1709-09491
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-09491
Vignesh Balaji, Dhruva Tirumala, Brandon Lucia:
Flexible Support for Fast Parallel Commutative Updates. CoRR abs/1709.09491 (2017)
2016
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/WangKTSLMBKB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/WangKTSLMBKB16
Jane X. Wang, Zeb Kurth-Nelson, Dhruva Tirumala, Hubert Soyer, Joel Z. Leibo, Rémi Munos, Charles Blundell, Dharshan Kumaran, Matthew M. Botvinick:
Learning to reinforcement learn. CoRR abs/1611.05763 (2016)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.