


Остановите войну!
for scientists:
Adam Gleave
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2022
- [i12]Adam Gleave, Geoffrey Irving:
Uncertainty Estimation for Language Reward Models. CoRR abs/2203.07472 (2022) - [i11]Joar Skalse, Matthew Farrugia-Roberts, Stuart Russell, Alessandro Abate, Adam Gleave:
Invariance in Policy Optimisation and Partial Identifiability in Reward Learning. CoRR abs/2203.07475 (2022) - [i10]Adam Gleave, Sam Toyer:
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning. CoRR abs/2203.11409 (2022) - [i9]Erik Jenner, Adam Gleave:
Preprocessing Reward Functions for Interpretability. CoRR abs/2203.13553 (2022) - 2021
- [j1]Antonin Raffin, Ashley Hill, Adam Gleave, Anssi Kanervisto, Maximilian Ernestus, Noah Dormann:
Stable-Baselines3: Reliable Reinforcement Learning Implementations. J. Mach. Learn. Res. 22: 268:1-268:8 (2021) - [c4]Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike:
Quantifying Differences in Reward Functions. ICLR 2021 - 2020
- [c3]Adam Gleave, Michael Dennis, Cody Wild, Neel Kant, Sergey Levine, Stuart Russell:
Adversarial Policies: Attacking Deep Reinforcement Learning. ICLR 2020 - [i8]Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike:
Quantifying Differences in Reward Functions. CoRR abs/2006.13900 (2020) - [i7]Pedro Freire, Adam Gleave, Sam Toyer, Stuart Russell:
DERAIL: Diagnostic Environments for Reward And Imitation Learning. CoRR abs/2012.01365 (2020) - [i6]Eric J. Michaud
, Adam Gleave, Stuart Russell:
Understanding Learned Reward Functions. CoRR abs/2012.05862 (2020)
2010 – 2019
- 2019
- [i5]Adam Gleave, Michael Dennis, Neel Kant, Cody Wild, Sergey Levine, Stuart Russell:
Adversarial Policies: Attacking Deep Reinforcement Learning. CoRR abs/1905.10615 (2019) - 2018
- [i4]Adam Gleave, Oliver Habryka:
Multi-task Maximum Entropy Inverse Reinforcement Learning. CoRR abs/1805.08882 (2018) - [i3]Sören Mindermann, Rohin Shah, Adam Gleave, Dylan Hadfield-Menell:
Active Inverse Reward Design. CoRR abs/1809.03060 (2018) - [i2]Aaron Tucker, Adam Gleave, Stuart Russell:
Inverse reinforcement learning for video games. CoRR abs/1810.10593 (2018) - 2017
- [c2]Adam Gleave
, Christian Steinruecken:
Making Compression Algorithms for Unicode Text. DCC 2017: 441 - [i1]Adam Gleave, Christian Steinruecken:
Making compression algorithms for Unicode text. CoRR abs/1701.04047 (2017) - 2016
- [c1]Ionel Gog, Malte Schwarzkopf, Adam Gleave, Robert N. M. Watson, Steven Hand:
Firmament: Fast, Centralized Cluster Scheduling at Scale. OSDI 2016: 99-115
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
load content from web.archive.org
Privacy notice: By enabling the option above, your browser will contact the API of web.archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
Tweets on dblp homepage
Show tweets from on the dblp homepage.
Privacy notice: By enabling the option above, your browser will contact twitter.com and twimg.com to load tweets curated by our Twitter account. At the same time, Twitter will persistently store several cookies with your web browser. While we did signal Twitter to not track our users by setting the "dnt" flag, we do not have any control over how Twitter uses your data. So please proceed with care and consider checking the Twitter privacy policy.
last updated on 2022-04-21 22:46 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint