Jan Leike

Person information

affiliation: Anthropic PBC, San Francisco, CA, USA
affiliation (former): OpenAI, San Francisco, CA, USA
affiliation (PhD): Australian National University, Canberra, ACT, Australia
affiliation: University of Freiburg, Germany

2020 – today

2025
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GaoTTGTRSL025
Leo Gao, Tom Dupré la Tour, Henk Tillman, Gabriel Goh, Rajan Troll, Alec Radford, Ilya Sutskever, Jan Leike, Jeffrey Wu:
Scaling and evaluating sparse autoencoders. ICLR 2025
[i48]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-18837
Mrinank Sharma, Meg Tong, Jesse Mu, Jerry Wei, Jorrit Kruthoff, Scott Goodfriend, Euan Ong, Alwin Peng, Raj Agarwal, Cem Anil, Amanda Askell, Nathan Bailey, Joe Benton, Emma Bluemke, Samuel R. Bowman, Eric Christiansen, Hoagy Cunningham, Andy Dau, Anjali Gopal, Rob Gilson, Logan Graham, Logan Howard, Nimit Kalra, Taesung Lee, Kevin Lin, Peter Lofgren, Francesco Mosconi, Clare O'Hara, Catherine Olsson, Linda Petrini, Samir Rajani, Nikhil Saxena, Alex Silverstein, Tanya Singh, Theodore R. Sumers, Leonard Tang, Kevin K. Troy, Constantin Weisser, Ruiqi Zhong, Giulio Zhou, Jan Leike, Jared Kaplan, Ethan Perez:
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming. CoRR abs/2501.18837 (2025)
[i47]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-16797
Erik Jones, Meg Tong, Jesse Mu, Mohammed Mahfoud, Jan Leike, Roger B. Grosse, Jared Kaplan, William Fithian, Ethan Perez, Mrinank Sharma:
Forecasting Rare Language Model Behaviors. CoRR abs/2502.16797 (2025)
[i46]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-10965
Samuel Marks, Johannes Treutlein, Trenton Bricken, Jack Lindsey, Jonathan Marcus, Siddharth Mishra-Sharma, Daniel M. Ziegler, Emmanuel Ameisen, Joshua Batson, Tim Belonax, Samuel R. Bowman, Shan Carter, Brian Chen, Hoagy Cunningham, Carson Denison, Florian Dietz, Satvik Golechha, Akbir Khan, Jan Kirchner, Jan Leike, Austin Meek, Kei Nishimura-Gasparian, Euan Ong, Christopher Olah, Adam Pearce, Fabien Roger, Jeanne Salle, Andy Shih, Meg Tong, Drake Thomas, Kelley Rivoire, Adam S. Jermyn, Monte MacDiarmid, Tom Henighan, Evan Hubinger:
Auditing language models for hidden objectives. CoRR abs/2503.10965 (2025)
[i45]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-05410
Yanda Chen, Joe Benton, Ansh Radhakrishnan, Jonathan Uesato, Carson Denison, John Schulman, Arushi Somani, Peter Hase, Misha Wagner, Fabien Roger, Vladimir Mikulik, Samuel R. Bowman, Jan Leike, Jared Kaplan, Ethan Perez:
Reasoning Models Don't Always Say What They Think. CoRR abs/2505.05410 (2025)
[i44]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-10139
Jiaxin Wen, Zachary Ankner, Arushi Somani, Peter Hase, Samuel Marks, Jacob Goldman-Wetzler, Linda Petrini, Henry Sleight, Collin Burns, He He, Shi Feng, Ethan Perez, Jan Leike:
Unsupervised Elicitation of Language Models. CoRR abs/2506.10139 (2025)
[i43]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-16245
Cole Wyeth, Marcus Hutter, Jan Leike, Jessica Taylor:
Limit-Computable Grains of Truth for Arbitrary Computable Extensive-Form (Un)Known Games. CoRR abs/2508.16245 (2025)
2024
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LightmanKBEBLLS24
Hunter Lightman, Vineet Kosaraju, Yuri Burda, Harrison Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman, Ilya Sutskever, Karl Cobbe:
Let's Verify Step by Step. ICLR 2024
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/icml/BurnsIKBGACEJLS24
Collin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker, Leo Gao, Leopold Aschenbrenner, Yining Chen, Adrien Ecoffet, Manas Joglekar, Jan Leike, Ilya Sutskever, Jeffrey Wu:
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision. ICML 2024
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04093
Leo Gao, Tom Dupré la Tour, Henk Tillman, Gabriel Goh, Rajan Troll, Alec Radford, Ilya Sutskever, Jan Leike, Jeffrey Wu:
Scaling and evaluating sparse autoencoders. CoRR abs/2406.04093 (2024)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-00215
Nat McAleese, Rai Michael Pokorny, Juan Felipe Ceron Uribe, Evgenia Nitishinskaya, Maja Trebacz, Jan Leike:
LLM Critics Help Catch LLM Bugs. CoRR abs/2407.00215 (2024)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-13692
Jan Hendrik Kirchner, Yining Chen, Harri Edwards, Jan Leike, Nat McAleese, Yuri Burda:
Prover-Verifier Games improve legibility of LLM outputs. CoRR abs/2407.13692 (2024)
2023
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-20050
Hunter Lightman, Vineet Kosaraju, Yura Burda, Harri Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman, Ilya Sutskever, Karl Cobbe:
Let's Verify Step by Step. CoRR abs/2305.20050 (2023)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09390
Collin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker, Leo Gao, Leopold Aschenbrenner, Yining Chen, Adrien Ecoffet, Manas Joglekar, Jan Leike, Ilya Sutskever, Jeff Wu:
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision. CoRR abs/2312.09390 (2023)
2022
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/nips/Ouyang0JAWMZASR22
Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F. Christiano, Jan Leike, Ryan Lowe:
Training language models to follow instructions with human feedback. NeurIPS 2022
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-08102
Matthew Rahtz, Vikrant Varma, Ramana Kumar, Zachary Kenton, Shane Legg, Jan Leike:
Safe Deep RL in 3D Environments using Human Feedback. CoRR abs/2201.08102 (2022)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-02155
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F. Christiano, Jan Leike, Ryan Lowe:
Training language models to follow instructions with human feedback. CoRR abs/2203.02155 (2022)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-05802
William Saunders, Catherine Yeh, Jeff Wu, Steven Bills, Long Ouyang, Jonathan Ward, Jan Leike:
Self-critiquing models for assisting human evaluators. CoRR abs/2206.05802 (2022)
2021
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/natmi/PrunklAAWLD21
Carina E. A. Prunkl, Carolyn Ashurst, Markus Anderljung, Helena Webb, Jan Leike, Allan Dafoe:
Institutionalizing ethics in AI through broader impact requirements. Nat. Mach. Intell. 3(2): 104-110 (2021)
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Gleave0LRL21
Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike:
Quantifying Differences in Reward Functions. ICLR 2021
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-11039
Carina E. A. Prunkl, Carolyn Ashurst, Markus Anderljung, Helena Webb, Jan Leike, Allan Dafoe:
Institutionalising Ethics in AI through Broader Impact Requirements. CoRR abs/2106.11039 (2021)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-03374
Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Pondé de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-Voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Joshua Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob McGrew, Dario Amodei, Sam McCandlish, Ilya Sutskever, Wojciech Zaremba:
Evaluating Large Language Models Trained on Code. CoRR abs/2107.03374 (2021)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-10862
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul F. Christiano:
Recursively Summarizing Books with Human Feedback. CoRR abs/2109.10862 (2021)
2020
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/icml/ReddyDLLL20
Siddharth Reddy, Anca D. Dragan, Sergey Levine, Shane Legg, Jan Leike:
Learning Human Objectives by Evaluating Hypothetical Behavior. ICML 2020: 8020-8029
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ArmstrongLOL20
Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg:
Pitfalls of Learning a Reward Function Online. IJCAI 2020: 1592-1600
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-13654
Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg:
Pitfalls of learning a reward function online. CoRR abs/2004.13654 (2020)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13900
Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike:
Quantifying Differences in Reward Functions. CoRR abs/2006.13900 (2020)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-09153
David Krueger, Tegan Maharaj, Jan Leike:
Hidden Incentives for Auto-Induced Distributional Shift. CoRR abs/2009.09153 (2020)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-06709
David Krueger, Jan Leike, Owain Evans, John Salvatier:
Active Reinforcement Learning: Observing Rewards at a Cost. CoRR abs/2011.06709 (2020)

2010 – 2019

2019
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BahdanauHLHHKG19
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Seyed Arian Hosseini, Pushmeet Kohli, Edward Grefenstette:
Learning to Understand Goal Specifications by Modelling Reward. ICLR (Poster) 2019
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-05652
Siddharth Reddy, Anca D. Dragan, Sergey Levine, Shane Legg, Jan Leike:
Learning Human Objectives by Evaluating Hypothetical Behavior. CoRR abs/1912.05652 (2019)
2018
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/tcs/LeikeH18
Jan Leike, Marcus Hutter:
On the computability of Solomonoff induction and AIXI. Theor. Comput. Sci. 716: 28-49 (2018)
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BahdanauHLHKG18
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Pushmeet Kohli, Edward Grefenstette:
Jointly Learning "What" and "How" from Instructions and Goal-States. ICLR (Workshop) 2018
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/nips/IbarzLPILA18
Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei:
Reward learning from human preferences and demonstrations in Atari. NeurIPS 2018: 8022-8034
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/tacas/LeikeH18
Jan Leike, Matthias Heizmann:
Geometric Nontermination Arguments. TACAS (2) 2018: 266-283
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-01946
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Pushmeet Kohli, Edward Grefenstette:
Learning to Follow Language Instructions with Adversarial Reward Induction. CoRR abs/1806.01946 (2018)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06521
Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei:
Reward learning from human preferences and demonstrations in Atari. CoRR abs/1811.06521 (2018)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-07871
Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg:
Scalable agent alignment via reward modeling: a research direction. CoRR abs/1811.07871 (2018)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-05979
Miljan Martic, Jan Leike, Andrew Trask, Matteo Hessel, Shane Legg, Pushmeet Kohli:
Scaling shared model governance via model splitting. CoRR abs/1812.05979 (2018)
2017
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/atal/LamontALH17
Sean Lamont, John Aslanides, Jan Leike, Marcus Hutter:
Generalised Discount Functions applied to a Monte-Carlo AIμ Implementation. AAMAS 2017: 1589-1591
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/AslanidesLH17
John Aslanides, Jan Leike, Marcus Hutter:
Universal Reinforcement Learning Algorithms: Survey and Experiments. IJCAI 2017: 1403-1410
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/LeikeLOH17
Jan Leike, Tor Lattimore, Laurent Orseau, Marcus Hutter:
On Thompson Sampling and Asymptotic Optimality. IJCAI 2017: 4889-4893
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChristianoLBMLA17
Paul F. Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei:
Deep Reinforcement Learning from Human Preferences. NIPS 2017: 4299-4307
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LamontALH17
Sean Lamont, John Aslanides, Jan Leike, Marcus Hutter:
Generalised Discount Functions applied to a Monte-Carlo AImu Implementation. CoRR abs/1703.01358 (2017)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/AslanidesLH17
John Aslanides, Jan Leike, Marcus Hutter:
Universal Reinforcement Learning Algorithms: Survey and Experiments. CoRR abs/1705.10557 (2017)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1706-03741
Paul F. Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei:
Deep reinforcement learning from human preferences. CoRR abs/1706.03741 (2017)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-09883
Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg:
AI Safety Gridworlds. CoRR abs/1711.09883 (2017)
2016
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/aistats/FilanLH16
Daniel Filan, Jan Leike, Marcus Hutter:
Loss Bounds and Time Complexity for Speed Priors. AISTATS 2016: 1394-1402
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/tacas/HeizmannDGLMSP16
Matthias Heizmann, Daniel Dietsch, Marius Greitschus, Jan Leike, Betim Musa, Claus Schätzle, Andreas Podelski:
Ultimate Automizer with Two-track Proofs - (Competition Contribution). TACAS 2016: 950-953
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/uai/LeikeLOH16
Jan Leike, Tor Lattimore, Laurent Orseau, Marcus Hutter:
Thompson Sampling is Asymptotically Optimal in General Environments. UAI 2016
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/uai/LeikeTF16
Jan Leike, Jessica Taylor, Benya Fallenstein:
A Formal Solution to the Grain of Truth Problem. UAI 2016
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeLOH16
Jan Leike, Tor Lattimore, Laurent Orseau, Marcus Hutter:
Thompson Sampling is Asymptotically Optimal in General Environments. CoRR abs/1602.07905 (2016)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/FilanHL16
Daniel Filan, Marcus Hutter, Jan Leike:
Loss Bounds and Time Complexity for Speed Priors. CoRR abs/1604.03343 (2016)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/Leike16
Jan Leike:
Exploration Potential. CoRR abs/1609.04994 (2016)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeTF16
Jan Leike, Jessica Taylor, Benya Fallenstein:
A Formal Solution to the Grain of Truth Problem. CoRR abs/1609.05058 (2016)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH16
Jan Leike, Matthias Heizmann:
Geometric Nontermination Arguments. CoRR abs/1609.05207 (2016)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/Leike16a
Jan Leike:
Nonparametric General Reinforcement Learning. CoRR abs/1611.08944 (2016)
2015
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH15
Jan Leike, Matthias Heizmann:
Ranking Templates for Linear Loops. Log. Methods Comput. Sci. 11(1) (2015)
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/agi/DaswaniL15
Mayank Daswani, Jan Leike:
A Definition of Happiness for Reinforcement Learning Agents. AGI 2015: 231-240
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/aldt/EverittLH15
Tom Everitt, Jan Leike, Marcus Hutter:
Sequential Extensions of Causal and Evidential Decision Theory. ADT 2015: 205-221
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/alt/LeikeH15
Jan Leike, Marcus Hutter:
Solomonoff Induction Violates Nicod's Criterion. ALT 2015: 349-363
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/alt/LeikeH15a
Jan Leike, Marcus Hutter:
On the Computability of Solomonoff Induction and Knowledge-Seeking. ALT 2015: 364-378
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/colt/LeikeH15
Jan Leike, Marcus Hutter:
Bad Universal Priors and Notions of Optimality. COLT 2015: 1244-1259
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/tacas/HeizmannDLMP15
Matthias Heizmann, Daniel Dietsch, Jan Leike, Betim Musa, Andreas Podelski:
Ultimate Automizer with Array Interpolation - (Competition Contribution). TACAS 2015: 455-457
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/uai/LeikeH15
Jan Leike, Marcus Hutter:
On the Computability of AIXI. UAI 2015: 464-473
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/DaswaniL15
Mayank Daswani, Jan Leike:
A Definition of Happiness for Reinforcement Learning Agents. CoRR abs/1505.04497 (2015)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/EverittLH15
Tom Everitt, Jan Leike, Marcus Hutter:
Sequential Extensions of Causal and Evidential Decision Theory. CoRR abs/1506.07359 (2015)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH15a
Jan Leike, Marcus Hutter:
Solomonoff Induction Violates Nicod's Criterion. CoRR abs/1507.04121 (2015)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH15b
Jan Leike, Marcus Hutter:
On the Computability of Solomonoff Induction and Knowledge-Seeking. CoRR abs/1507.04124 (2015)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH15c
Jan Leike, Marcus Hutter:
Bad Universal Priors and Notions of Optimality. CoRR abs/1510.04931 (2015)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH15d
Jan Leike, Marcus Hutter:
On the Computability of AIXI. CoRR abs/1510.05572 (2015)
2014
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/alt/LeikeH14
Jan Leike, Marcus Hutter:
Indefinitely Oscillating Martingales. ALT 2014: 321-335
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/tacas/LeikeH14
Jan Leike, Matthias Heizmann:
Ranking Templates for Linear Loops. TACAS 2014: 172-186
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/vmcai/LeikeT14
Jan Leike, Ashish Tiwari:
Synthesis for Polynomial Lasso Programs. VMCAI 2014: 434-452
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH14
Jan Leike, Matthias Heizmann:
Ranking Templates for Linear Loops. CoRR abs/1401.5338 (2014)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/HeizmannHLP14
Matthias Heizmann, Jochen Hoenicke, Jan Leike, Andreas Podelski:
Linear Ranking for Linear Lasso Programs. CoRR abs/1401.5347 (2014)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/Leike14
Jan Leike:
Ranking Function Synthesis for Linear Lasso Programs. CoRR abs/1401.5351 (2014)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH14a
Jan Leike, Matthias Heizmann:
Geometric Series as Nontermination Arguments for Linear Lasso Programs. CoRR abs/1405.4413 (2014)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeH14b
Jan Leike, Marcus Hutter:
Indefinitely Oscillating Martingales. CoRR abs/1408.3169 (2014)
2013
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/atva/HeizmannHLP13
Matthias Heizmann, Jochen Hoenicke, Jan Leike, Andreas Podelski:
Linear Ranking for Linear Lasso Programs. ATVA 2013: 365-380
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeikeT13
Jan Leike, Ashish Tiwari:
Synthesis for Polynomial Lasso Programs. CoRR abs/1311.4046 (2013)
