


Pieter Abbeel
Person information

- affiliation: University of California, Berkeley, USA
- affiliation: Stanford University, USA
- award (2021): ACM Prize in Computing
- award (2013): Presidential Early Career Award for Scientists and Engineers
2020 – today
- 2023
- [j27]Kourosh Hakhamaneshi, Marcel Nassar, Mariano Phielipp, Pieter Abbeel, Vladimir Stojanovic:
Pretraining Graph Neural Networks for Few-Shot Analog Circuit Modeling and Design. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(7): 2163-2173 (2023) - [c309]Joey Hejna, Pieter Abbeel, Lerrel Pinto:
Improving Long-Horizon Imitation through Instruction Prediction. AAAI 2023: 7857-7865 - [c308]Ajay Jain, Amber Xie, Pieter Abbeel:
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models. CVPR 2023: 1911-1920 - [c307]Changyeon Kim, Jongjin Park, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
Preference Transformer: Modeling Human Preferences using Transformers for RL. ICLR 2023 - [c306]Sherry Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Dichotomy of Control: Separating What You Can Control from What You Cannot. ICLR 2023 - [c305]Weirui Ye, Yunsheng Zhang, Pieter Abbeel, Yang Gao:
Become a Proficient Player with Limited Data through Watching Pure Videos. ICLR 2023 - [c304]Abdus Salam Azad, Izzeddin Gur, Jasper Emhoff, Nathaniel Alexis, Aleksandra Faust, Pieter Abbeel, Ion Stoica:
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning. ICML 2023: 1361-1395 - [c303]Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas:
Guiding Pretraining in Reinforcement Learning with Large Language Models. ICML 2023: 8657-8677 - [c302]Hao Liu, Pieter Abbeel:
Emergent Agentic Transformer from Chain of Hindsight Experience. ICML 2023: 21362-21374 - [c301]Seohong Park, Kimin Lee, Youngwoon Lee, Pieter Abbeel:
Controllability-Aware Unsupervised Skill Discovery. ICML 2023: 27225-27245 - [c300]Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel:
Multi-View Masked World Models for Visual Robotic Manipulation. ICML 2023: 30613-30632 - [c299]David Venuto, Sherry Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum:
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets. ICML 2023: 35024-35036 - [c298]Philipp Wu, Arjun Majumdar, Kevin Stone, Yixin Lin, Igor Mordatch, Pieter Abbeel, Aravind Rajeswaran:
Masked Trajectory Models for Prediction, Representation, and Control. ICML 2023: 37607-37623 - [c297]Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel:
Temporally Consistent Transformers for Video Generation. ICML 2023: 39062-39098 - [c296]Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E. Gonzalez:
The Wisdom of Hindsight Makes Language Models Better Instruction Followers. ICML 2023: 41414-41428 - [c295]Kai Chen, Stephen James, Congying Sui, Yun-Hui Liu, Pieter Abbeel, Qi Dou:
StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS. ICRA 2023: 2855-2861 - [c294]Yuxuan Liu, Nikhil Mishra, Pieter Abbeel, Xi Chen:
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN. ICRA 2023: 7069-7075 - [c293]Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta:
Train Offline, Test Online: A Real Robot Learning Benchmark. ICRA 2023: 9197-9203 - [c292]Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath:
Robust and Versatile Bipedal Jumping Control through Reinforcement Learning. Robotics: Science and Systems 2023 - [i301]Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel:
Learning Universal Policies via Text-Guided Video Generation. CoRR abs/2302.00111 (2023) - [i300]Hao Liu, Wilson Yan, Pieter Abbeel:
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment. CoRR abs/2302.00902 (2023) - [i299]Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel:
Multi-View Masked World Models for Visual Robotic Manipulation. CoRR abs/2302.02408 (2023) - [i298]Hao Liu, Carmelo Sferrazza, Pieter Abbeel:
Chain of Hindsight Aligns Language Models with Feedback. CoRR abs/2302.02676 (2023) - [i297]Seohong Park, Kimin Lee, Youngwoon Lee, Pieter Abbeel:
Controllability-Aware Unsupervised Skill Discovery. CoRR abs/2302.05103 (2023) - [i296]Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E. Gonzalez:
The Wisdom of Hindsight Makes Language Models Better Instruction Followers. CoRR abs/2302.05206 (2023) - [i295]Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas:
Guiding Pretraining in Reinforcement Learning with Large Language Models. CoRR abs/2302.06692 (2023) - [i294]Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath:
Robust and Versatile Bipedal Jumping Control through Multi-Task Reinforcement Learning. CoRR abs/2302.09450 (2023) - [i293]Kimin Lee, Hao Liu, Moonkyung Ryu, Olivia Watkins, Yuqing Du, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Shixiang Shane Gu:
Aligning Text-to-Image Models using Human Feedback. CoRR abs/2302.12192 (2023) - [i292]Changyeon Kim, Jongjin Park, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
Preference Transformer: Modeling Human Preferences using Transformers for RL. CoRR abs/2303.00957 (2023) - [i291]Sherry Yang, Ofir Nachum, Yilun Du, Jason Wei, Pieter Abbeel, Dale Schuurmans:
Foundation Models for Decision Making: Problems, Methods, and Opportunities. CoRR abs/2303.04129 (2023) - [i290]Arjun Majumdar, Karmesh Yadav, Sergio Arnaud, Yecheng Jason Ma, Claire Chen, Sneha Silwal, Aryan Jain, Vincent-Pierre Berges, Pieter Abbeel, Jitendra Malik, Dhruv Batra, Yixin Lin, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier:
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? CoRR abs/2303.18240 (2023) - [i289]Kevin Zakka, Laura M. Smith, Nimrod Gileadi, Taylor A. Howell, Xue Bin Peng, Sumeet Singh, Yuval Tassa, Pete Florence, Andy Zeng, Pieter Abbeel:
RoboPianist: A Benchmark for High-Dimensional Robot Control. CoRR abs/2304.04150 (2023) - [i288]Yuxuan Liu, Nikhil Mishra, Pieter Abbeel, Xi Chen:
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN. CoRR abs/2305.01910 (2023) - [i287]Philipp Wu, Arjun Majumdar, Kevin Stone, Yixin Lin, Igor Mordatch, Pieter Abbeel, Aravind Rajeswaran:
Masked Trajectory Models for Prediction, Representation, and Control. CoRR abs/2305.02968 (2023) - [i286]Yuxuan Liu, Xi Chen, Pieter Abbeel:
Self-Supervised Instance Segmentation by Grasping. CoRR abs/2305.06305 (2023) - [i285]Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel:
Video Prediction Models as Rewards for Reinforcement Learning. CoRR abs/2305.14343 (2023) - [i284]Arnav Gudibande, Eric Wallace, Charlie Snell, Xinyang Geng, Hao Liu, Pieter Abbeel, Sergey Levine, Dawn Song:
The False Promise of Imitating Proprietary LLMs. CoRR abs/2305.15717 (2023) - [i283]Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee:
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models. CoRR abs/2305.16381 (2023) - [i282]Hao Liu, Pieter Abbeel:
Emergent Agentic Transformer from Chain of Hindsight Experience. CoRR abs/2305.16554 (2023) - [i281]Hao Liu, Pieter Abbeel:
Blockwise Parallel Transformer for Long Context Large Models. CoRR abs/2305.19370 (2023) - [i280]Dongyoung Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo:
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration. CoRR abs/2305.19476 (2023) - [i279]Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta:
Train Offline, Test Online: A Real Robot Learning Benchmark. CoRR abs/2306.00942 (2023) - [i278]Mengjiao Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel:
Probabilistic Adaptation of Text-to-Video Models. CoRR abs/2306.01872 (2023) - [i277]Xinran Liang, Anthony Han, Wilson Yan, Aditi Raghunathan, Pieter Abbeel:
ALP: Action-Aware Embodied Learning for Perception. CoRR abs/2306.10190 (2023) - [i276]Joey Hejna, Pieter Abbeel, Lerrel Pinto:
Improving Long-Horizon Imitation Through Instruction Prediction. CoRR abs/2306.12554 (2023) - [i275]Xingyu Lin, John So, Sashwat Mahalingam, Fangchen Liu, Pieter Abbeel:
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks. CoRR abs/2307.03567 (2023) - [i274]Nikhil Mishra, Pieter Abbeel, Xi Chen, Maximilian Sieb:
Convolutional Occupancy Models for Dense Packing of Complex, Novel Objects. CoRR abs/2308.00091 (2023) - [i273]Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca D. Dragan:
Learning to Model the World with Language. CoRR abs/2308.01399 (2023) - [i272]Hiroshi Yoshitake, Pieter Abbeel:
The Impact of Overall Optimization on Warehouse Automation. CoRR abs/2308.06036 (2023) - [i271]Ademi Adeniji, Amber Xie, Carmelo Sferrazza, Younggyo Seo, Stephen James, Pieter Abbeel:
Language Reward Modulation for Pretraining Reinforcement Learning. CoRR abs/2308.12270 (2023) - [i270]Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James:
Language-Conditioned Path Planning. CoRR abs/2308.16893 (2023)
- 2022
- [j26]Freek Stulp, Michael Spranger, Kim Listmann, Stéphane Doncieux, Moritz Tenorth, George Konidaris, Pieter Abbeel:
Innovation Paths for Machine Learning in Robotics [Industry Activities]. IEEE Robotics Autom. Mag. 29(4): 141-144 (2022) - [c291]Abdus Salam Azad, Edward Kim, Qiancheng Wu, Kimin Lee, Ion Stoica, Pieter Abbeel, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia:
Programmatic Modeling and Generation of Real-Time Strategic Soccer Environments for Reinforcement Learning. AAAI 2022: 6028-6036 - [c290]Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch:
Frozen Pretrained Transformers as Universal Computation Engines. AAAI 2022: 7628-7636 - [c289]Ryan Hoque, Lawrence Yunliang Chen, Satvik Sharma, Karthik Dharmarajan, Brijen Thananjeyan, Pieter Abbeel, Ken Goldberg:
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision. CoRL 2022: 368-380 - [c288]Ilija Radosavovic, Tete Xiao, Stephen James, Pieter Abbeel, Jitendra Malik, Trevor Darrell:
Real-World Robot Learning with Masked Visual Pre-training. CoRL 2022: 416-426 - [c287]Younggyo Seo, Danijar Hafner, Hao Liu, Fangchen Liu, Stephen James, Kimin Lee, Pieter Abbeel:
Masked World Models for Visual Control. CoRL 2022: 1332-1344 - [c286]John So, Amber Xie, Sunggoo Jung, Jeffrey A. Edlund, Rohan Thakker, Ali-akbar Agha-mohammadi, Pieter Abbeel, Stephen James:
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data. CoRL 2022: 1871-1881 - [c285]Philipp Wu, Alejandro Escontrela, Danijar Hafner, Pieter Abbeel, Ken Goldberg:
DayDreamer: World Models for Physical Robot Learning. CoRL 2022: 2226-2240 - [c284]Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole:
Zero-Shot Text-Guided Object Generation with Dream Fields. CVPR 2022: 857-866 - [c283]Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-Hui Liu, Pieter Abbeel, Qi Dou:
Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking. ECCV (39) 2022: 533-550 - [c282]Yuxuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen:
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction. ECCV (10) 2022: 673-694 - [c281]Younggyo Seo, Kimin Lee, Fangchen Liu, Stephen James, Pieter Abbeel:
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator. ICIP 2022: 3943-3947 - [c280]Yuqing Du, Pieter Abbeel, Aditya Grover:
It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation. ICLR 2022 - [c279]Kourosh Hakhamaneshi, Ruihan Zhao, Albert Zhan, Pieter Abbeel, Michael Laskin:
Hierarchical Few-Shot Imitation with Skill Transition Models. ICLR 2022 - [c278]Xinran Liang, Katherine Shu, Kimin Lee, Pieter Abbeel:
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning. ICLR 2022 - [c277]Jongjin Park, Younggyo Seo, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning. ICLR 2022 - [c276]Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch:
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents. ICML 2022: 9118-9147 - [c275]Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox:
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks. ICML 2022: 13285-13301 - [c274]Younggyo Seo, Kimin Lee, Stephen L. James, Pieter Abbeel:
Reinforcement Learning with Action-Free Pre-Training from Videos. ICML 2022: 19561-19579 - [c273]Mandi Zhao, Fangchen Liu, Kimin Lee, Pieter Abbeel:
Towards More Generalizable One-shot Visual Imitation Learning. ICRA 2022: 2434-2444 - [c272]Alejandro Escontrela, Xue Bin Peng, Wenhao Yu, Tingnan Zhang, Atil Iscen, Ken Goldberg, Pieter Abbeel:
Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions. IROS 2022: 25-32 - [c271]Sarah Young, Jyothish Pari, Pieter Abbeel, Lerrel Pinto:
Playful Interactions for Representation Learning. IROS 2022: 992-999 - [c270]Albert Zhan, Ruihan Zhao, Lerrel Pinto, Pieter Abbeel, Michael Laskin:
Learning Visual Robotic Control Efficiently with Contrastive Pre-training and Data Augmentation. IROS 2022: 4040-4047 - [c269]Kyle Hollins Wray, Stas Tiomkin, Mykel J. Kochenderfer, Pieter Abbeel:
Multi-Objective Policy Gradients with Topological Constraints. IROS 2022: 9034-9039 - [c268]Danijar Hafner, Kuang-Huei Lee, Ian Fischer, Pieter Abbeel:
Deep Hierarchical Planning from Pixels. NeurIPS 2022 - [c267]Michael Laskin, Hao Liu, Xue Bin Peng, Denis Yarats, Aravind Rajeswaran, Pieter Abbeel:
Unsupervised Reinforcement Learning with Contrastive Intrinsic Control. NeurIPS 2022 - [c266]Fangchen Liu, Hao Liu, Aditya Grover, Pieter Abbeel:
Masked Autoencoding for Scalable and Generalizable Decision Making. NeurIPS 2022 - [c265]Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Chain of Thought Imitation with Procedure Cloning. NeurIPS 2022 - [c264]Weirui Ye, Pieter Abbeel, Yang Gao:
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions. NeurIPS 2022 - [c263]Mandi Zhao, Pieter Abbeel, Stephen James:
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning. NeurIPS 2022 - [c262]Qiyang Li, Ajay Jain, Pieter Abbeel:
AdaCat: Adaptive categorical discretization for autoregressive models. UAI 2022: 1188-1198 - [i269]Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch:
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents. CoRR abs/2201.07207 (2022) - [i268]Julius Frost, Olivia Watkins, Eric Weiner, Pieter Abbeel, Trevor Darrell, Bryan A. Plummer, Kate Saenko:
Explaining Reinforcement Learning Policies through Counterfactual Trajectories. CoRR abs/2201.12462 (2022) - [i267]Denis Yarats, David Brandfonbrener, Hao Liu, Michael Laskin, Pieter Abbeel, Alessandro Lazaric, Lerrel Pinto:
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning. CoRR abs/2201.13425 (2022) - [i266]Michael Laskin, Hao Liu, Xue Bin Peng, Denis Yarats, Aravind Rajeswaran, Pieter Abbeel:
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery. CoRR abs/2202.00161 (2022) - [i265]Stephen James, Pieter Abbeel:
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning. CoRR abs/2202.03957 (2022) - [i264]Yuqing Du, Pieter Abbeel, Aditya Grover:
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation. CoRR abs/2202.10608 (2022) - [i263]Jongjin Park, Younggyo Seo, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning. CoRR abs/2203.10050 (2022) - [i262]Olivia Watkins, Trevor Darrell, Pieter Abbeel, Jacob Andreas, Abhishek Gupta:
Teachable Reinforcement Learning via Advice Distillation. CoRR abs/2203.11197 (2022) - [i261]Younggyo Seo, Kimin Lee, Stephen James, Pieter Abbeel:
Reinforcement Learning with Action-Free Pre-Training from Videos. CoRR abs/2203.13880 (2022) - [i260]Alejandro Escontrela, Xue Bin Peng, Wenhao Yu, Tingnan Zhang, Atil Iscen, Ken Goldberg, Pieter Abbeel:
Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions. CoRR abs/2203.15103 (2022) - [i259]Kourosh Hakhamaneshi, Marcel Nassar, Mariano Phielipp, Pieter Abbeel, Vladimir Stojanovic:
Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design. CoRR abs/2203.15913 (2022) - [i258]Stephen James, Pieter Abbeel:
Coarse-to-Fine Q-attention with Learned Path Ranking. CoRR abs/2204.01571 (2022) - [i257]Carl Qi, Pieter Abbeel, Aditya Grover:
Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning. CoRR abs/2204.03597 (2022) - [i256]Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-Hui Liu, Pieter Abbeel, Qi Dou:
Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin-picking. CoRR abs/2204.07049 (2022) - [i255]Stephen James, Pieter Abbeel:
Coarse-to-fine Q-attention with Tree Expansion. CoRR abs/2204.12471 (2022) - [i254]Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah:
An Empirical Investigation of Representation Learning for Imitation. CoRR abs/2205.07886 (2022) - [i253]Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Chain of Thought Imitation with Procedure Cloning. CoRR abs/2205.10816 (2022) - [i252]Xinran Liang, Katherine Shu, Kimin Lee, Pieter Abbeel:
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning. CoRR abs/2205.12401 (2022) - [i251]Xinyang Geng, Hao Liu, Lisa Lee, Dale Schuurmans, Sergey Levine, Pieter Abbeel:
Multimodal Masked Autoencoders Learn Transferable Representations. CoRR abs/2205.14204 (2022) - [i250]Mandi Zhao, Pieter Abbeel, Stephen James:
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning. CoRR abs/2206.03271 (2022) - [i249]Wilson Yan, Ryo Okumura, Stephen James, Pieter Abbeel:
Patch-based Object-centric Transformers for Efficient Video Generation. CoRR abs/2206.04003 (2022) - [i248]Danijar Hafner, Kuang-Huei Lee, Ian Fischer, Pieter Abbeel:
Deep Hierarchical Planning from Pixels. CoRR abs/2206.04114 (2022) - [i247]Philipp Wu, Alejandro Escontrela, Danijar Hafner, Ken Goldberg, Pieter Abbeel:
DayDreamer: World Models for Physical Robot Learning. CoRR abs/2206.14176 (2022) - [i246]Younggyo Seo, Danijar Hafner, Hao Liu, Fangchen Liu, Stephen James, Kimin Lee, Pieter Abbeel:
Masked World Models for Visual Control. CoRR abs/2206.14244 (2022) - [i245]Ryan Hoque, Lawrence Yunliang Chen, Satvik Sharma, Karthik Dharmarajan, Brijen Thananjeyan, Pieter Abbeel, Ken Goldberg:
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision. CoRR abs/2206.14349 (2022) - [i244]Qiyang Li, Ajay Jain, Pieter Abbeel:
AdaCat: Adaptive Categorical Discretization for Autoregressive Models. CoRR abs/2208.02246 (2022) - [i243]Kyle Hollins Wray, Stas Tiomkin, Mykel J. Kochenderfer, Pieter Abbeel:
Multi-Objective Policy Gradients with Topological Constraints. CoRR abs/2209.07096 (2022) - [i242]Younggyo Seo, Kimin Lee, Fangchen Liu, Stephen James, Pieter Abbeel:
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator. CoRR abs/2209.07143 (2022) - [i241]Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox:
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks. CoRR abs/2209.07670 (2022) - [i240]Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel:
Temporally Consistent Video Transformer for Long-Term Video Prediction. CoRR abs/2210.02396 (2022) - [i239]Ilija Radosavovic, Tete Xiao, Stephen James, Pieter Abbeel, Jitendra Malik, Trevor Darrell:
Real-World Robot Learning with Masked Visual Pre-training. CoRR abs/2210.03109 (2022) - [i238]Yuxuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen:
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction. CoRR abs/2210.07424 (2022) - [i237]Ademi Adeniji, Amber Xie, Pieter Abbeel:
Skill-Based Reinforcement Learning with Intrinsic Reward Matching. CoRR abs/2210.07426 (2022) - [i236]Abdus Salam Azad, Izzeddin Gur, Aleksandra Faust, Pieter Abbeel, Ion Stoica:
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning. CoRR abs/2210.10243 (2022) - [i235]