Ofir Nachum

2020 – today

2023
[c52]
Bogdan Mazoure, Benjamin Eysenbach, Ofir Nachum, Jonathan Tompson:
Contrastive Value Learning: Implicit Models for Simple Offline RL. CoRL 2023: 1257-1267
[c51]
Yevgen Chebotar, Quan Vuong, Karol Hausman, Fei Xia, Yao Lu, Alex Irpan, Aviral Kumar, Tianhe Yu, Alexander Herzog, Karl Pertsch, Keerthana Gopalakrishnan, Julian Ibarz, Ofir Nachum, Sumedh Anand Sontakke, Grecia Salazar, Huong T. Tran, Jodilyn Peralta, Clayton Tan, Deeksha Manjunath, Jaspiar Singh, Brianna Zitkovich, Tomas Jackson, Kanishka Rao, Chelsea Finn, Sergey Levine:
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions. CoRL 2023: 3909-3928
[c50]
Izzeddin Gur, Ofir Nachum, Yingjie Miao, Mustafa Safdari, Austin Huang, Aakanksha Chowdhery, Sharan Narang, Noah Fiedel, Aleksandra Faust:
Understanding HTML with Large Language Models. EMNLP (Findings) 2023: 2803-2821
[c49]
Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, Dhawal Gupta, Moonkyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier:
A Mixture-of-Expert Approach to RL-based Dialogue Management. ICLR 2023
[c48]
Sherry Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Dichotomy of Control: Separating What You Can Control from What You Cannot. ICLR 2023
[c47]
David Venuto, Sherry Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum:
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets. ICML 2023: 35024-35036
[c46]
Jonathan Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill:
Supervised Pretraining Can Learn In-Context Reinforcement Learning. NeurIPS 2023
[c45]
David Brandfonbrener, Ofir Nachum, Joan Bruna:
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation. NeurIPS 2023
[c44]
Yilun Du, Sherry Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Josh Tenenbaum, Dale Schuurmans, Pieter Abbeel:
Learning Universal Policies via Text-Guided Video Generation. NeurIPS 2023
[c43]
Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alexander Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Sally Jesmonth, Nikhil J. Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Kuang-Huei Lee, Sergey Levine, Yao Lu, Utsav Malla, Deeksha Manjunath, Igor Mordatch, Ofir Nachum, Carolina Parada, Jodilyn Peralta, Emily Perez, Karl Pertsch, Jornell Quiambao, Kanishka Rao, Michael S. Ryoo, Grecia Salazar, Pannag R. Sanketi, Kevin Sayed, Jaspiar Singh, Sumedh Sontakke, Austin Stone, Clayton Tan, Huong T. Tran, Vincent Vanhoucke, Steve Vega, Quan Vuong, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich:
RT-1: Robotics Transformer for Real-World Control at Scale. Robotics: Science and Systems 2023
[i63]
Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel:
Learning Universal Policies via Text-Guided Video Generation. CoRR abs/2302.00111 (2023)
[i62]
Sherry Yang, Ofir Nachum, Yilun Du, Jason Wei, Pieter Abbeel, Dale Schuurmans:
Foundation Models for Decision Making: Problems, Methods, and Opportunities. CoRR abs/2303.04129 (2023)
[i61]
Hiroki Furuta, Ofir Nachum, Kuang-Huei Lee, Yutaka Matsuo, Shixiang Shane Gu, Izzeddin Gur:
Multimodal Web Navigation with Instruction-Finetuned Foundation Models. CoRR abs/2305.11854 (2023)
[i60]
Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Tsang-Wei Edward Lee, Linda Luu, Ofir Nachum, Ken Oslund, Jason Powell, Diego Reyes, Francesco Romano, Fereshteh Sadeghi, Ron Sloat, Baruch Tabanpour, Daniel Zheng, Michael Neunert, Raia Hadsell, Nicolas Heess, Francesco Nori, Jeff Seto, Carolina Parada, Vikas Sindhwani, Vincent Vanhoucke, Jie Tan:
Barkour: Benchmarking Animal-level Agility with Quadruped Robots. CoRR abs/2305.14654 (2023)
[i59]
David Brandfonbrener, Ofir Nachum, Joan Bruna:
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation. CoRR abs/2305.16985 (2023)
[i58]
Jonathan N. Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill:
Supervised Pretraining Can Learn In-Context Reinforcement Learning. CoRR abs/2306.14892 (2023)
[i57]
Yevgen Chebotar, Quan Vuong, Alex Irpan, Karol Hausman, Fei Xia, Yao Lu, Aviral Kumar, Tianhe Yu, Alexander Herzog, Karl Pertsch, Keerthana Gopalakrishnan, Julian Ibarz, Ofir Nachum, Sumedh Sontakke, Grecia Salazar, Huong T. Tran, Jodilyn Peralta, Clayton Tan, Deeksha Manjunath, Jaspiar Singh, Brianna Zitkovich, Tomas Jackson, Kanishka Rao, Chelsea Finn, Sergey Levine:
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions. CoRR abs/2309.10150 (2023)
2022
[c42]
Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans:
Offline Policy Selection under Uncertainty. AISTATS 2022: 4376-4396
[c41]
David Venuto, Elaine Lau, Doina Precup, Ofir Nachum:
Policy Gradients Incorporating the Future. ICLR 2022
[c40]
Mengjiao Yang, Sergey Levine, Ofir Nachum:
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data. ICLR 2022
[c39]
Scott Fujimoto, David Meger, Doina Precup, Ofir Nachum, Shixiang Shane Gu:
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error. ICML 2022: 6918-6943
[c38]
Jonathan Lee, George Tucker, Ofir Nachum, Bo Dai:
Model Selection in Batch Policy Optimization. ICML 2022: 12542-12569
[c37]
Kuang-Huei Lee, Ofir Nachum, Tingnan Zhang, Sergio Guadarrama, Jie Tan, Wenhao Yu:
PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive Information Representations. IROS 2022: 1447-1454
[c36]
Seyed Kamyar Seyed Ghasemipour, Shixiang Shane Gu, Ofir Nachum:
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters. NeurIPS 2022
[c35]
Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Sergio Guadarrama, Ian Fischer, Winnie Xu, Eric Jang, Henryk Michalewski, Igor Mordatch:
Multi-Game Decision Transformers. NeurIPS 2022
[c34]
Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai, Emma Brunskill:
Oracle Inequalities for Model Selection in Offline Reinforcement Learning. NeurIPS 2022
[c33]
Bogdan Mazoure, Ilya Kostrikov, Ofir Nachum, Jonathan Tompson:
Improving Zero-Shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions. NeurIPS 2022
[c32]
Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Chain of Thought Imitation with Procedure Cloning. NeurIPS 2022
[i56]
Scott Fujimoto, David Meger, Doina Precup, Ofir Nachum, Shixiang Shane Gu:
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error. CoRR abs/2201.12417 (2022)
[i55]
Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Chain of Thought Imitation with Procedure Cloning. CoRR abs/2205.10816 (2022)
[i54]
Seyed Kamyar Seyed Ghasemipour, Shixiang Shane Gu, Ofir Nachum:
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters. CoRR abs/2205.13703 (2022)
[i53]
Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch:
Multi-Game Decision Transformers. CoRR abs/2205.15241 (2022)
[i52]
Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, Moonkyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier:
A Mixture-of-Expert Approach to RL-based Dialogue Management. CoRR abs/2206.00059 (2022)
[i51]
Aldo Pacchiano, Ofir Nachum, Nilesh Tripuraneni, Peter L. Bartlett:
Joint Representation Training in Sequential Tasks with Shared Structure. CoRR abs/2206.12441 (2022)
[i50]
Kuang-Huei Lee, Ofir Nachum, Tingnan Zhang, Sergio Guadarrama, Jie Tan, Wenhao Yu:
PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive Information Representations. CoRR abs/2207.13224 (2022)
[i49]
Izzeddin Gur, Ofir Nachum, Yingjie Miao, Mustafa Safdari, Austin Huang, Aakanksha Chowdhery, Sharan Narang, Noah Fiedel, Aleksandra Faust:
Understanding HTML with Large Language Models. CoRR abs/2210.03945 (2022)
[i48]
Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Dichotomy of Control: Separating What You Can Control from What You Cannot. CoRR abs/2210.13435 (2022)
[i47]
Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai, Emma Brunskill:
Oracle Inequalities for Model Selection in Offline Reinforcement Learning. CoRR abs/2211.02016 (2022)
[i46]
Bogdan Mazoure, Benjamin Eysenbach, Ofir Nachum, Jonathan Tompson:
Contrastive Value Learning: Implicit Models for Simple Offline RL. CoRR abs/2211.02100 (2022)
[i45]
David Venuto, Sherry Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum:
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets. CoRR abs/2211.13337 (2022)
[i44]
Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alexander Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Sally Jesmonth, Nikhil J. Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Kuang-Huei Lee, Sergey Levine, Yao Lu, Utsav Malla, Deeksha Manjunath, Igor Mordatch, Ofir Nachum, Carolina Parada, Jodilyn Peralta, Emily Perez, Karl Pertsch, Jornell Quiambao, Kanishka Rao, Michael S. Ryoo, Grecia Salazar, Pannag Sanketi, Kevin Sayed, Jaspiar Singh, Sumedh Sontakke, Austin Stone, Clayton Tan, Huong T. Tran, Vincent Vanhoucke, Steve Vega, Quan Vuong, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich:
RT-1: Robotics Transformer for Real-World Control at Scale. CoRR abs/2212.06817 (2022)
2021
[c31]
Anurag Ajay, Aviral Kumar, Pulkit Agrawal, Sergey Levine, Ofir Nachum:
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning. ICLR 2021
[c30]
Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine:
Benchmarks for Deep Off-Policy Evaluation. ICLR 2021
[c29]
Tatsuya Matsushima, Hiroki Furuta, Yutaka Matsuo, Ofir Nachum, Shixiang Gu:
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization. ICLR 2021
[c28]
Michael R. Zhang, Thomas Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi:
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization. ICLR 2021
[c27]
Hiroki Furuta, Tatsuya Matsushima, Tadashi Kozuno, Yutaka Matsuo, Sergey Levine, Ofir Nachum, Shixiang Shane Gu:
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning. ICML 2021: 3541-3552
[c26]
Ilya Kostrikov, Rob Fergus, Jonathan Tompson, Ofir Nachum:
Offline Reinforcement Learning with Fisher Divergence Critic Regularization. ICML 2021: 5774-5783
[c25]
Mengjiao Yang, Ofir Nachum:
Representation Matters: Offline Pretraining for Sequential Decision Making. ICML 2021: 11784-11794
[c24]
Aldo Pacchiano, Jonathan N. Lee, Peter L. Bartlett, Ofir Nachum:
Near Optimal Policy Optimization via REPS. NeurIPS 2021: 1100-1110
[c23]
Ofir Nachum, Mengjiao Yang:
Provable Representation Learning for Imitation with Contrastive Fourier Features. NeurIPS 2021: 30100-30112
[i43]
Mengjiao Yang, Ofir Nachum:
Representation Matters: Offline Pretraining for Sequential Decision Making. CoRR abs/2102.05815 (2021)
[i42]
Ilya Kostrikov, Jonathan Tompson, Rob Fergus, Ofir Nachum:
Offline Reinforcement Learning with Fisher Divergence Critic Regularization. CoRR abs/2103.08050 (2021)
[i41]
Aldo Pacchiano, Jonathan N. Lee, Peter L. Bartlett, Ofir Nachum:
Near Optimal Policy Optimization via REPS. CoRR abs/2103.09756 (2021)
[i40]
Hiroki Furuta, Tatsuya Matsushima, Tadashi Kozuno, Yutaka Matsuo, Sergey Levine, Ofir Nachum, Shixiang Shane Gu:
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning. CoRR abs/2103.12726 (2021)
[i39]
Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine:
Benchmarks for Deep Off-Policy Evaluation. CoRR abs/2103.16596 (2021)
[i38]
Michael R. Zhang, Tom Le Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi:
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization. CoRR abs/2104.13877 (2021)
[i37]
Ofir Nachum, Mengjiao Yang:
Provable Representation Learning for Imitation with Contrastive Fourier Features. CoRR abs/2105.12272 (2021)
[i36]
David Venuto, Elaine Lau, Doina Precup, Ofir Nachum:
Policy Gradients Incorporating the Future. CoRR abs/2108.02096 (2021)
[i35]
Mengjiao Yang, Sergey Levine, Ofir Nachum:
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data. CoRR abs/2110.14770 (2021)
[i34]
Bogdan Mazoure, Ilya Kostrikov, Ofir Nachum, Jonathan Tompson:
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions. CoRR abs/2111.14629 (2021)
[i33]
Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai:
Model Selection in Batch Policy Optimization. CoRR abs/2112.12320 (2021)
2020
[c22]
Heinrich Jiang, Ofir Nachum:
Identifying and Correcting Label Bias in Machine Learning. AISTATS 2020: 702-712
[c21]
Yinlam Chow, Ofir Nachum, Aleksandra Faust, Edgar A. Duéñez-Guzmán, Mohammad Ghavamzadeh:
Safe Policy Learning for Continuous Control. CoRL 2020: 801-821
[c20]
Ilya Kostrikov, Ofir Nachum, Jonathan Tompson:
Imitation Learning via Off-Policy Distribution Matching. ICLR 2020
[c19]
Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed H. Chi, Craig Boutilier:
BRPO: Batch Residual Policy Optimization. IJCAI 2020: 2824-2830
[c18]
Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
CoinDICE: Off-Policy Confidence Interval Estimation. NeurIPS 2020
[c17]
Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans:
Off-Policy Evaluation via the Regularized Lagrangian. NeurIPS 2020
[i32]
Ofir Nachum, Bo Dai:
Reinforcement Learning via Fenchel-Rockafellar Duality. CoRR abs/2001.01866 (2020)
[i31]
Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed H. Chi, Craig Boutilier:
BRPO: Batch Residual Policy Optimization. CoRR abs/2002.05522 (2020)
[i30]
  - https://dblp.org/rec/journals/corr/abs-2004-07219
Justin Fu, Aviral Kumar, Ofir Nachum, George Tucker, Sergey Levine:
D4RL: Datasets for Deep Data-Driven Reinforcement Learning. CoRR abs/2004.07219 (2020)
[i29]
  - https://dblp.org/rec/journals/corr/abs-2006-03647
Tatsuya Matsushima, Hiroki Furuta, Yutaka Matsuo, Ofir Nachum, Shixiang Gu:
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization. CoRR abs/2006.03647 (2020)
[i28]
  - https://dblp.org/rec/journals/corr/abs-2006-13888
Çaglar Gülçehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gómez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel J. Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas:
RL Unplugged: Benchmarks for Offline Reinforcement Learning. CoRR abs/2006.13888 (2020)
[i27]
  - https://dblp.org/rec/journals/corr/abs-2007-03438
Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans:
Off-Policy Evaluation via the Regularized Lagrangian. CoRR abs/2007.03438 (2020)
[i26]
  - https://dblp.org/rec/journals/corr/abs-2007-13609
Ilya Kostrikov, Ofir Nachum:
Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation. CoRR abs/2007.13609 (2020)
[i25]
  - https://dblp.org/rec/journals/corr/abs-2010-11652
Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
CoinDICE: Off-Policy Confidence Interval Estimation. CoRR abs/2010.11652 (2020)
[i24]
  - https://dblp.org/rec/journals/corr/abs-2010-13611
Anurag Ajay, Aviral Kumar, Pulkit Agrawal, Sergey Levine, Ofir Nachum:
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning. CoRR abs/2010.13611 (2020)
[i23]
  - https://dblp.org/rec/journals/corr/abs-2012-06919
Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans:
Offline Policy Selection under Uncertainty. CoRR abs/2012.06919 (2020)

2010 – 2019

2019
[c16]
  - https://dblp.org/rec/conf/aistats/JiangJN19
Heinrich Jiang, Jennifer Jang, Ofir Nachum:
Robustness Guarantees for Density Clustering. AISTATS 2019: 3342-3351
[c15]
  - https://dblp.org/rec/conf/corl/NachumAPGK19
Ofir Nachum, Michael Ahn, Hugo Ponte, Shixiang Shane Gu, Vikash Kumar:
Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real. CoRL 2019: 110-121
[c14]
  - https://dblp.org/rec/conf/iclr/NachumGLL19
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine:
Near-Optimal Representation Learning for Hierarchical Reinforcement Learning. ICLR (Poster) 2019
[c13]
  - https://dblp.org/rec/conf/iclr/WuTN19
Yifan Wu, George Tucker, Ofir Nachum:
The Laplacian in RL: Learning Representations with Efficient Approximations. ICLR (Poster) 2019
[c12]
  - https://dblp.org/rec/conf/icml/GeladaKBNB19
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare:
DeepMDP: Learning Continuous Latent Space Models for Representation Learning. ICML 2019: 2170-2179
[c11]
  - https://dblp.org/rec/conf/nips/NachumCD019
Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li:
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections. NeurIPS 2019: 2315-2325
[i22]
  - https://dblp.org/rec/journals/corr/abs-1901-04966
Heinrich Jiang, Ofir Nachum:
Identifying and Correcting Label Bias in Machine Learning. CoRR abs/1901.04966 (2019)
[i21]
  - https://dblp.org/rec/journals/corr/abs-1901-10031
Yinlam Chow, Ofir Nachum, Aleksandra Faust, Mohammad Ghavamzadeh, Edgar A. Duéñez-Guzmán:
Lyapunov-based Safe Policy Optimization for Continuous Control. CoRR abs/1901.10031 (2019)
[i20]
  - https://dblp.org/rec/journals/corr/abs-1906-02736
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare:
DeepMDP: Learning Continuous Latent Space Models for Representation Learning. CoRR abs/1906.02736 (2019)
[i19]
  - https://dblp.org/rec/journals/corr/abs-1906-04733
Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li:
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections. CoRR abs/1906.04733 (2019)
[i18]
  - https://dblp.org/rec/journals/corr/abs-1908-05224
Ofir Nachum, Michael Ahn, Hugo Ponte, Shixiang Gu, Vikash Kumar:
Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real. CoRR abs/1908.05224 (2019)
[i17]
  - https://dblp.org/rec/journals/corr/abs-1909-10618
Ofir Nachum, Haoran Tang, Xingyu Lu, Shixiang Gu, Honglak Lee, Sergey Levine:
Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning? CoRR abs/1909.10618 (2019)
[i16]
  - https://dblp.org/rec/journals/corr/abs-1910-02097
Ofir Nachum, Heinrich Jiang:
Group-based Fair Learning Leads to Counter-intuitive Predictions. CoRR abs/1910.02097 (2019)
[i15]
  - https://dblp.org/rec/journals/corr/abs-1911-11361
Yifan Wu, George Tucker, Ofir Nachum:
Behavior Regularized Offline Reinforcement Learning. CoRR abs/1911.11361 (2019)
[i14]
  - https://dblp.org/rec/journals/corr/abs-1912-02074
Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, Dale Schuurmans:
AlgaeDICE: Policy Gradient from Arbitrary Experience. CoRR abs/1912.02074 (2019)
[i13]
  - https://dblp.org/rec/journals/corr/abs-1912-05032
Ilya Kostrikov, Ofir Nachum, Jonathan Tompson:
Imitation Learning via Off-Policy Distribution Matching. CoRR abs/1912.05032 (2019)
2018
[c10]
  - https://dblp.org/rec/conf/cvpr/GordonENCWYC18
Ariel Gordon, Elad Eban, Ofir Nachum, Bo Chen, Hao Wu, Tien-Ju Yang, Edward Choi:
MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks. CVPR 2018: 1586-1595
[c9]
  - https://dblp.org/rec/conf/iclr/Nachum0XS18
Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. ICLR (Poster) 2018
[c8]
  - https://dblp.org/rec/conf/icml/ChowNG18
Yinlam Chow, Ofir Nachum, Mohammad Ghavamzadeh:
Path Consistency Learning in Tsallis Entropy Regularized MDPs. ICML 2018: 978-987
[c7]
  - https://dblp.org/rec/conf/icml/Nachum0TS18
Ofir Nachum, Mohammad Norouzi, George Tucker, Dale Schuurmans:
Smoothed Action Value Functions for Learning Gaussian Policies. ICML 2018: 3689-3697
[c6]
  - https://dblp.org/rec/conf/icra/QuillenJNFIL18
Deirdre Quillen, Eric Jang, Ofir Nachum, Chelsea Finn, Julian Ibarz, Sergey Levine:
Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods. ICRA 2018: 6284-6291
[c5]
  - https://dblp.org/rec/conf/nips/NachumGLL18
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine:
Data-Efficient Hierarchical Reinforcement Learning. NeurIPS 2018: 3307-3317
[c4]
  - https://dblp.org/rec/conf/nips/ChowNDG18
Yinlam Chow, Ofir Nachum, Edgar A. Duéñez-Guzmán, Mohammad Ghavamzadeh:
A Lyapunov-based Approach to Safe Reinforcement Learning. NeurIPS 2018: 8103-8112
[i12]
  - https://dblp.org/rec/journals/corr/abs-1802-03501
Ofir Nachum, Yinlam Chow, Mohammad Ghavamzadeh:
Path Consistency Learning in Tsallis Entropy Regularized MDPs. CoRR abs/1802.03501 (2018)
[i11]
  - https://dblp.org/rec/journals/corr/abs-1802-10264
Deirdre Quillen, Eric Jang, Ofir Nachum, Chelsea Finn, Julian Ibarz, Sergey Levine:
Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods. CoRR abs/1802.10264 (2018)
[i10]
  - https://dblp.org/rec/journals/corr/abs-1803-02348
Ofir Nachum, Mohammad Norouzi, George Tucker, Dale Schuurmans:
Smoothed Action Value Functions for Learning Gaussian Policies. CoRR abs/1803.02348 (2018)
[i9]
  - https://dblp.org/rec/journals/corr/abs-1805-07708
Yinlam Chow, Ofir Nachum, Edgar A. Duéñez-Guzmán, Mohammad Ghavamzadeh:
A Lyapunov-based Approach to Safe Reinforcement Learning. CoRR abs/1805.07708 (2018)
[i8]
  - https://dblp.org/rec/journals/corr/abs-1805-08296
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine:
Data-Efficient Hierarchical Reinforcement Learning. CoRR abs/1805.08296 (2018)
[i7]
  - https://dblp.org/rec/journals/corr/abs-1810-01257
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine:
Near-Optimal Representation Learning for Hierarchical Reinforcement Learning. CoRR abs/1810.01257 (2018)
[i6]
  - https://dblp.org/rec/journals/corr/abs-1810-04586
Yifan Wu, George Tucker, Ofir Nachum:
The Laplacian in RL: Learning Representations with Efficient Approximations. CoRR abs/1810.04586 (2018)
2017
[j1]
  - https://dblp.org/rec/journals/ipl/DemaineGKLLMNSW17
Erik D. Demaine, Varun Ganesan, Vladislav Kontsevoi, Qipeng Liu, Quanquan C. Liu, Fermi Ma, Ofir Nachum, Aaron Sidford, Erik Waingarten, Daniel Ziegler:
Arboral satisfaction: Recognition and LP approximation. Inf. Process. Lett. 127: 1-5 (2017)
[c3]
  - https://dblp.org/rec/conf/iclr/KaiserNRB17
Lukasz Kaiser, Ofir Nachum, Aurko Roy, Samy Bengio:
Learning to Remember Rare Events. ICLR (Poster) 2017
[c2]
  - https://dblp.org/rec/conf/iclr/Nachum0S17
Ofir Nachum, Mohammad Norouzi, Dale Schuurmans:
Improving Policy Gradient by Exploring Under-appreciated Rewards. ICLR (Poster) 2017
[c1]
  - https://dblp.org/rec/conf/nips/NachumNXS17
Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Bridging the Gap Between Value and Policy Based Reinforcement Learning. NIPS 2017: 2775-2785
[i5]
  - https://dblp.org/rec/journals/corr/NachumNXS17
Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Bridging the Gap Between Value and Policy Based Reinforcement Learning. CoRR abs/1702.08892 (2017)
[i4]
  - https://dblp.org/rec/journals/corr/KaiserNRB17
Lukasz Kaiser, Ofir Nachum, Aurko Roy, Samy Bengio:
Learning to Remember Rare Events. CoRR abs/1703.03129 (2017)
[i3]
  - https://dblp.org/rec/journals/corr/NachumNXS17aa
Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. CoRR abs/1707.01891 (2017)
[i2]
  - https://dblp.org/rec/journals/corr/abs-1711-06798
Ariel Gordon, Elad Eban, Ofir Nachum, Bo Chen, Tien-Ju Yang, Edward Choi:
MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks. CoRR abs/1711.06798 (2017)
2016
[i1]
  - https://dblp.org/rec/journals/corr/NachumNS16
Ofir Nachum, Mohammad Norouzi, Dale Schuurmans:
Improving Policy Gradient by Exploring Under-appreciated Rewards. CoRR abs/1611.09321 (2016)
