Georg Ostrovski
2020 – today
- 2024
- [j4] Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney:
An Analysis of Quantile Temporal-Difference Learning. J. Mach. Learn. Res. 25: 163:1-163:47 (2024)
- 2023
- [c14] Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, Rémi Munos:
Quantile Credit Assignment. ICML 2023: 24517-24531
- [c13] Evgenii Nikishin, Junhyuk Oh, Georg Ostrovski, Clare Lyle, Razvan Pascanu, Will Dabney, André Barreto:
Deep Reinforcement Learning with Plasticity Injection. NeurIPS 2023
- [i18] Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney:
An Analysis of Quantile Temporal-Difference Learning. CoRR abs/2301.04462 (2023)
- [i17] Evgenii Nikishin, Junhyuk Oh, Georg Ostrovski, Clare Lyle, Razvan Pascanu, Will Dabney, André Barreto:
Deep Reinforcement Learning with Plasticity Injection. CoRR abs/2305.15555 (2023)
- 2022
- [j3] Çaglar Gülçehre, Srivatsan Srinivasan, Jakub Sygnowski, Georg Ostrovski, Mehrdad Farajtabar, Matthew Hoffman, Razvan Pascanu, Arnaud Doucet:
An empirical study of implicit regularization in deep offline RL. Trans. Mach. Learn. Res. 2022 (2022)
- [c12] Miruna Pislar, David Szepesvari, Georg Ostrovski, Diana L. Borsa, Tom Schaul:
When should agents explore? ICLR 2022
- [c11] Tom Schaul, André Barreto, John Quan, Georg Ostrovski:
The Phenomenon of Policy Churn. NeurIPS 2022
- [i16] Tom Schaul, André Barreto, John Quan, Georg Ostrovski:
The Phenomenon of Policy Churn. CoRR abs/2206.00730 (2022)
- [i15] Çaglar Gülçehre, Srivatsan Srinivasan, Jakub Sygnowski, Georg Ostrovski, Mehrdad Farajtabar, Matt Hoffman, Razvan Pascanu, Arnaud Doucet:
An Empirical Study of Implicit Regularization in Deep Offline RL. CoRR abs/2207.02099 (2022)
- 2021
- [c10] Clare Lyle, Mark Rowland, Georg Ostrovski, Will Dabney:
On the Effect of Auxiliary Tasks on Representation Dynamics. AISTATS 2021: 1-9
- [c9] Will Dabney, Georg Ostrovski, André Barreto:
Temporally-Extended ε-Greedy Exploration. ICLR 2021
- [c8] Georg Ostrovski, Pablo Samuel Castro, Will Dabney:
The Difficulty of Passive Learning in Deep Reinforcement Learning. NeurIPS 2021: 23283-23295
- [i14] Clare Lyle, Mark Rowland, Georg Ostrovski, Will Dabney:
On The Effect of Auxiliary Tasks on Representation Dynamics. CoRR abs/2102.13089 (2021)
- [i13] Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa:
Return-based Scaling: Yet Another Normalisation Trick for Deep RL. CoRR abs/2105.05347 (2021)
- [i12] Miruna Pislar, David Szepesvari, Georg Ostrovski, Diana Borsa, Tom Schaul:
When should agents explore? CoRR abs/2108.11811 (2021)
- [i11] Georg Ostrovski, Pablo Samuel Castro, Will Dabney:
The Difficulty of Passive Learning in Deep Reinforcement Learning. CoRR abs/2110.14020 (2021)
- 2020
- [i10] Will Dabney, Georg Ostrovski, André Barreto:
Temporally-Extended ε-Greedy Exploration. CoRR abs/2006.01782 (2020)
2010 – 2019
- 2019
- [c7] Steven Kapturowski, Georg Ostrovski, John Quan, Rémi Munos, Will Dabney:
Recurrent Experience Replay in Distributed Reinforcement Learning. ICLR (Poster) 2019
- [i9] Tom Schaul, Diana Borsa, David Ding, David Szepesvari, Georg Ostrovski, Will Dabney, Simon Osindero:
Adapting Behaviour for Learning Progress. CoRR abs/1912.06910 (2019)
- 2018
- [c6] Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Gheshlaghi Azar, David Silver:
Rainbow: Combining Improvements in Deep Reinforcement Learning. AAAI 2018: 3215-3222
- [c5] Will Dabney, Georg Ostrovski, David Silver, Rémi Munos:
Implicit Quantile Networks for Distributional Reinforcement Learning. ICML 2018: 1104-1113
- [c4] Georg Ostrovski, Will Dabney, Rémi Munos:
Autoregressive Quantile Networks for Generative Modeling. ICML 2018: 3933-3942
- [i8] Georg Ostrovski, Will Dabney, Rémi Munos:
Autoregressive Quantile Networks for Generative Modeling. CoRR abs/1806.05575 (2018)
- [i7] Will Dabney, Georg Ostrovski, David Silver, Rémi Munos:
Implicit Quantile Networks for Distributional Reinforcement Learning. CoRR abs/1806.06923 (2018)
- 2017
- [c3] Georg Ostrovski, Marc G. Bellemare, Aäron van den Oord, Rémi Munos:
Count-Based Exploration with Neural Density Models. ICML 2017: 2721-2730
- [i6] Georg Ostrovski, Marc G. Bellemare, Aäron van den Oord, Rémi Munos:
Count-Based Exploration with Neural Density Models. CoRR abs/1703.01310 (2017)
- [i5] Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Daniel Horgan, Bilal Piot, Mohammad Gheshlaghi Azar, David Silver:
Rainbow: Combining Improvements in Deep Reinforcement Learning. CoRR abs/1710.02298 (2017)
- [i4] Karl Tuyls, Julien Pérolat, Marc Lanctot, Georg Ostrovski, Rahul Savani, Joel Z. Leibo, Toby Ord, Thore Graepel, Shane Legg:
Symmetric Decomposition of Asymmetric Games. CoRR abs/1711.05074 (2017)
- 2016
- [j2] Alex Graves, Greg Wayne, Malcolm Reynolds, Tim Harley, Ivo Danihelka, Agnieszka Grabska-Barwinska, Sergio Gomez Colmenarejo, Edward Grefenstette, Tiago Ramalho, John P. Agapiou, Adrià Puigdomènech Badia, Karl Moritz Hermann, Yori Zwols, Georg Ostrovski, Adam Cain, Helen King, Christopher Summerfield, Phil Blunsom, Koray Kavukcuoglu, Demis Hassabis:
Hybrid computing using a neural network with dynamic external memory. Nat. 538(7626): 471-476 (2016)
- [c2] Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos:
Increasing the Action Gap: New Operators for Reinforcement Learning. AAAI 2016: 1476-1483
- [c1] Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Rémi Munos:
Unifying Count-Based Exploration and Intrinsic Motivation. NIPS 2016: 1471-1479
- [i3] Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Rémi Munos:
Unifying Count-Based Exploration and Intrinsic Motivation. CoRR abs/1606.01868 (2016)
- 2015
- [j1] Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin A. Riedmiller, Andreas Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis:
Human-level control through deep reinforcement learning. Nat. 518(7540): 529-533 (2015)
- [i2] Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos:
Increasing the Action Gap: New Operators for Reinforcement Learning. CoRR abs/1512.04860 (2015)
- 2013
- [i1] Georg Ostrovski, Sebastian van Strien:
Payoff Performance of Fictitious Play. CoRR abs/1308.4049 (2013)
last updated on 2024-10-07 22:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license