default search action

combined dblp search
author search
venue search
publication search

ask others

Jonathan Uesato

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2022
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/fat/WeidingerURGHMG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/fat/WeidingerURGHMG22
Laura Weidinger, Jonathan Uesato, Maribeth Rauh, Conor Griffin, Po-Sen Huang, John Mellor, Amelia Glaese, Myra Cheng, Borja Balle, Atoosa Kasirzadeh, Courtney Biles, Sasha Brown, Zac Kenton, Will Hawkins, Tom Stepleton, Abeba Birhane, Lisa Anne Hendricks, Laura Rimell, William Isaac, Julia Haas, Sean Legassick, Geoffrey Irving, Iason Gabriel:
Taxonomy of Risks posed by Language Models. FAccT 2022: 214-229
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/RauhMUHWWDGIGIH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RauhMUHWWDGIGIH22
Maribeth Rauh, John Mellor, Jonathan Uesato, Po-Sen Huang, Johannes Welbl, Laura Weidinger, Sumanth Dathathri, Amelia Glaese, Geoffrey Irving, Iason Gabriel, William Isaac, Lisa Anne Hendricks:
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models. NeurIPS 2022
2021
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/WelblGUDMHAKCH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WelblGUDMHAKCH21
Johannes Welbl, Amelia Glaese, Jonathan Uesato, Sumanth Dathathri, John Mellor, Lisa Anne Hendricks, Kirsty Anderson, Pushmeet Kohli, Ben Coppin, Po-Sen Huang:
Challenges in Detoxifying Language Models. EMNLP (Findings) 2021: 2447-2469
[c10]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/BerradaDDSBUGK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BerradaDDSBUGK21
Leonard Berrada, Sumanth Dathathri, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Jonathan Uesato, Sven Gowal, M. Pawan Kumar:
Make Sure You're Unsure: A Framework for Verifying Probabilistic Specifications. NeurIPS 2021: 11136-11147
2020
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WengDUXGSK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WengDUXGSK20
Tsui-Wei Weng, Krishnamurthy (Dj) Dvijotham, Jonathan Uesato, Kai Xiao, Sven Gowal, Robert Stanforth, Pushmeet Kohli:
Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control. ICLR 2020
[c8]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DathathriDKRUBS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DathathriDKRUBS20
Sumanth Dathathri, Krishnamurthy Dvijotham, Alexey Kurakin, Aditi Raghunathan, Jonathan Uesato, Rudy Bunel, Shreya Shankar, Jacob Steinhardt, Ian J. Goodfellow, Percy Liang, Pushmeet Kohli:
Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming. NeurIPS 2020
2019
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/Moosavi-Dezfooli19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/Moosavi-Dezfooli19
Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Jonathan Uesato, Pascal Frossard:
Robustness via Curvature Regularization, and Vice Versa. CVPR 2019: 9078-9086
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/GowalDSBQUAMK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/GowalDSBQUAMK19
Sven Gowal, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Chongli Qin, Jonathan Uesato, Relja Arandjelovic, Timothy Arthur Mann, Pushmeet Kohli:
Scalable Verified Training for Provably Robust Image Classification. ICCV 2019: 4841-4850
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/QinDOBSGUSK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/QinDOBSGUSK19
Chongli Qin, Krishnamurthy (Dj) Dvijotham, Brendan O'Donoghue, Rudy Bunel, Robert Stanforth, Sven Gowal, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli:
Verification of Non-Linear Specifications for Neural Networks. ICLR (Poster) 2019
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/UesatoKSERADHK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/UesatoKSERADHK19
Jonathan Uesato, Ananya Kumar, Csaba Szepesvári, Tom Erez, Avraham Ruderman, Keith Anderson, Krishnamurthy (Dj) Dvijotham, Nicolas Heess, Pushmeet Kohli:
Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures. ICLR (Poster) 2019
[c3]
- view
- export record
  dblp key:
  - conf/nips/AlayracUHFSK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/AlayracUHFSK19
Jean-Baptiste Alayrac, Jonathan Uesato, Po-Sen Huang, Alhussein Fawzi, Robert Stanforth, Pushmeet Kohli:
Are Labels Required for Improving Adversarial Robustness? NeurIPS 2019: 12192-12202
2018
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/UesatoOKO18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/UesatoOKO18
Jonathan Uesato, Brendan O'Donoghue, Pushmeet Kohli, Aäron van den Oord:
Adversarial Risk and the Dangers of Evaluating Against Weak Attacks. ICML 2018: 5032-5041
2017
[c1]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/DevlinUBSMK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/DevlinUBSMK17
Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
RobustFill: Neural Program Learning under Noisy I/O. ICML 2017: 990-998

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2022
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-08325
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-08325
Maribeth Rauh, John Mellor, Jonathan Uesato, Po-Sen Huang, Johannes Welbl, Laura Weidinger, Sumanth Dathathri, Amelia Glaese, Geoffrey Irving, Iason Gabriel, William Isaac, Lisa Anne Hendricks:
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models. CoRR abs/2206.08325 (2022)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-14375
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-14375
Amelia Glaese, Nat McAleese, Maja Trebacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin J. Chadwick, Phoebe Thacker, Lucy Campbell-Gillingham, Jonathan Uesato, Po-Sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Sona Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving:
Improving alignment of dialogue agents via targeted human judgements. CoRR abs/2209.14375 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-01790
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-01790
Rohin Shah, Vikrant Varma, Ramana Kumar, Mary Phuong, Victoria Krakovna, Jonathan Uesato, Zac Kenton:
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals. CoRR abs/2210.01790 (2022)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-14275
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-14275
Jonathan Uesato, Nate Kushman, Ramana Kumar, H. Francis Song, Noah Y. Siegel, Lisa Wang, Antonia Creswell, Geoffrey Irving, Irina Higgins:
Solving math word problems with process- and outcome-based feedback. CoRR abs/2211.14275 (2022)
2021
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-09479
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-09479
Leonard Berrada, Sumanth Dathathri, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Jonathan Uesato, Sven Gowal, M. Pawan Kumar:
Verifying Probabilistic Specifications with Functional Lagrangians. CoRR abs/2102.09479 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-07445
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-07445
Johannes Welbl, Amelia Glaese, Jonathan Uesato, Sumanth Dathathri, John Mellor, Lisa Anne Hendricks, Kirsty Anderson, Pushmeet Kohli, Ben Coppin, Po-Sen Huang:
Challenges in Detoxifying Language Models. CoRR abs/2109.07445 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-01577
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-01577
Neel Nanda, Jonathan Uesato, Sven Gowal:
An Empirical Investigation of Learning from Biased Toxicity Labels. CoRR abs/2110.01577 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-04359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-04359
Laura Weidinger, John Mellor, Maribeth Rauh, Conor Griffin, Jonathan Uesato, Po-Sen Huang, Myra Cheng, Mia Glaese, Borja Balle, Atoosa Kasirzadeh, Zac Kenton, Sasha Brown, Will Hawkins, Tom Stepleton, Courtney Biles, Abeba Birhane, Julia Haas, Laura Rimell, Lisa Anne Hendricks, William Isaac, Sean Legassick, Geoffrey Irving, Iason Gabriel:
Ethical and social risks of harm from Language Models. CoRR abs/2112.04359 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-11446
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-11446
Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, H. Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat McAleese, Amy Wu, Erich Elsen, Siddhant M. Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-Baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien de Masson d'Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew J. Johnson, Blake A. Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Edward Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving:
Scaling Language Models: Methods, Analysis & Insights from Training Gopher. CoRR abs/2112.11446 (2021)
2020
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-03593
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03593
Sven Gowal, Chongli Qin, Jonathan Uesato, Timothy A. Mann, Pushmeet Kohli:
Uncovering the Limits of Adversarial Training against Norm-Bounded Adversarial Examples. CoRR abs/2010.03593 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11645
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11645
Sumanth Dathathri, Krishnamurthy Dvijotham, Alexey Kurakin, Aditi Raghunathan, Jonathan Uesato, Rudy Bunel, Shreya Shankar, Jacob Steinhardt, Ian J. Goodfellow, Percy Liang, Pushmeet Kohli:
Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming. CoRR abs/2010.11645 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08820
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08820
Ramana Kumar, Jonathan Uesato, Richard Ngo, Tom Everitt, Victoria Krakovna, Shane Legg:
REALab: An Embedded Perspective on Tampering. CoRR abs/2011.08820 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08827
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08827
Jonathan Uesato, Ramana Kumar, Victoria Krakovna, Tom Everitt, Richard Ngo, Shane Legg:
Avoiding Tampering Incentives in Deep RL via Decoupled Approval. CoRR abs/2011.08827 (2020)
2019
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-09592
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-09592
Chongli Qin, Krishnamurthy (Dj) Dvijotham, Brendan O'Donoghue, Rudy Bunel, Robert Stanforth, Sven Gowal, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli:
Verification of Non-Linear Specifications for Neural Networks. CoRR abs/1902.09592 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-13725
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-13725
Jonathan Uesato, Jean-Baptiste Alayrac, Po-Sen Huang, Robert Stanforth, Alhussein Fawzi, Pushmeet Kohli:
Are Labels Required for Improving Adversarial Robustness? CoRR abs/1905.13725 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-09338
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-09338
Sven Gowal, Jonathan Uesato, Chongli Qin, Po-Sen Huang, Timothy A. Mann, Pushmeet Kohli:
An Alternative Surrogate Loss for PGD-based Adversarial Testing. CoRR abs/1910.09338 (2019)
2018
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-05666
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-05666
Jonathan Uesato, Brendan O'Donoghue, Aäron van den Oord, Pushmeet Kohli:
Adversarial Risk and the Dangers of Evaluating Against Weak Attacks. CoRR abs/1802.05666 (2018)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-10265
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-10265
Krishnamurthy Dvijotham, Sven Gowal, Robert Stanforth, Relja Arandjelovic, Brendan O'Donoghue, Jonathan Uesato, Pushmeet Kohli:
Training verified learners with learned verifiers. CoRR abs/1805.10265 (2018)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1810-12715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-12715
Sven Gowal, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Chongli Qin, Jonathan Uesato, Relja Arandjelovic, Timothy A. Mann, Pushmeet Kohli:
On the Effectiveness of Interval Bound Propagation for Training Verifiably Robust Models. CoRR abs/1810.12715 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-09300
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-09300
Edward Grefenstette, Robert Stanforth, Brendan O'Donoghue, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli:
Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles. CoRR abs/1811.09300 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-09716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-09716
Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Jonathan Uesato, Pascal Frossard:
Robustness via curvature regularization, and vice versa. CoRR abs/1811.09716 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1812-01647
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-01647
Jonathan Uesato, Ananya Kumar, Csaba Szepesvári, Tom Erez, Avraham Ruderman, Keith Anderson, Krishnamurthy Dvijotham, Nicolas Heess, Pushmeet Kohli:
Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures. CoRR abs/1812.01647 (2018)
2017
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/DevlinUBSMK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/DevlinUBSMK17
Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
RobustFill: Neural Program Learning under Noisy I/O. CoRR abs/1703.07469 (2017)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-11054
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-11054
Jacob Devlin, Jonathan Uesato, Rishabh Singh, Pushmeet Kohli:
Semantic Code Repair using Neuro-Symbolic Transformation Networks. CoRR abs/1710.11054 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.