Gheorghe Comanici
2020 – today
- 2024
- [c13] Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera-Paredes, Petar Velickovic, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner:
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search. IJCAI 2024: 6985-6993
- 2023
- [i7] Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera-Paredes, Petar Velickovic, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner:
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search. CoRR abs/2311.03583 (2023)
- [i6] Kate Baumli, Satinder Baveja, Feryal M. P. Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, Lei Zhang:
Vision-Language Models as a Source of Rewards. CoRR abs/2312.09187 (2023)
- 2022
- [i5] Gheorghe Comanici, Amelia Glaese, Anita Gergely, Daniel Toyama, Zafarali Ahmed, Tyler Jackson, Philippe Hamel, Doina Precup:
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning. CoRR abs/2204.10374 (2022)
- 2021
- [c12] Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, Doina Precup:
Temporally Abstract Partial Models. NeurIPS 2021: 1979-1991
- [i4] Daniel Toyama, Philippe Hamel, Anita Gergely, Gheorghe Comanici, Amelia Glaese, Zafarali Ahmed, Tyler Jackson, Shibl Mourad, Doina Precup:
AndroidEnv: A Reinforcement Learning Platform for Android. CoRR abs/2105.13231 (2021)
- [i3] André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver, Doina Precup:
The Option Keyboard: Combining Skills in Reinforcement Learning. CoRR abs/2106.13105 (2021)
- [i2] Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, Doina Precup:
Temporally Abstract Partial Models. CoRR abs/2108.03213 (2021)
- 2020
- [c11] Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, David Abel, Doina Precup:
What can I do here? A Theory of Affordances in Reinforcement Learning. ICML 2020: 5243-5253
- [i1] Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, David Abel, Doina Precup:
What can I do here? A Theory of Affordances in Reinforcement Learning. CoRR abs/2006.15085 (2020)
2010 – 2019
- 2019
- [c10] André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver, Doina Precup:
The Option Keyboard: Combining Skills in Reinforcement Learning. NeurIPS 2019: 13031-13041
- 2015
- [c9] Sherry Shanshan Ruan, Gheorghe Comanici, Prakash Panangaden, Doina Precup:
Representation Discovery for MDPs Using Bisimulation Metrics. AAAI 2015: 3578-3584
- [c8] Sherry Shanshan Ruan, Gheorghe Comanici, Prakash Panangaden, Doina Precup:
Representation Discovery for MDPs Using Bisimulation Metrics. AAAI 2015: 4202-4203
- [c7] Gheorghe Comanici, Doina Precup, Prakash Panangaden:
Basis refinement strategies for linear value function approximation in MDPs. NIPS 2015: 2899-2907
- 2012
- [c6] Cosmin Paduraru, Doina Precup, Joelle Pineau, Gheorghe Comanici:
An Empirical Analysis of Off-policy Learning in Discrete MDPs. EWRL 2012: 89-102
- [c5] Gheorghe Comanici, Prakash Panangaden, Doina Precup:
On-the-Fly Algorithms for Bisimulation Metrics. QEST 2012: 94-103
- 2011
- [c4] Gheorghe Comanici, Doina Precup:
Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics. AAAI 2011: 325-330
- [c3] Gheorghe Comanici, Doina Precup:
Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics. ALA 2011: 85-99
- [c2] Gheorghe Comanici, Doina Precup:
Basis function discovery using spectral clustering and bisimulation metrics. AAMAS 2011: 1079-1080
- 2010
- [c1] Gheorghe Comanici, Doina Precup:
Optimal policy switching algorithms for reinforcement learning. AAMAS 2010: 709-714
last updated on 2024-10-21 21:28 CEST by the dblp team
all metadata released as open data under CC0 1.0 license