default search action
Archit Sharma
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2019
- [j1]Archit Sharma, Siddhartha Saxena, Piyush Rai:
A flexible probabilistic framework for large-margin mixture of experts. Mach. Learn. 108(8-9): 1369-1393 (2019)
Conference and Workshop Papers
- 2024
- [c21]Archit Sharma, Sandeep Gupta, Peeyush Thakur, Narendra Kumar Dhar, Laxmidhar Behera:
Evolutionary Search of Optimal Hyperparameters for Learning Various Robot Manipulation Tasks. CEC 2024: 1-8 - [c20]Eric Mitchell, Rafael Rafailov, Archit Sharma, Chelsea Finn, Christopher D. Manning:
An Emulator for Fine-tuning Large Language Models using Small Language Models. ICLR 2024 - [c19]Charlotte Nicks, Eric Mitchell, Rafael Rafailov, Archit Sharma, Christopher D. Manning, Chelsea Finn, Stefano Ermon:
Language Model Detectors Are Easily Optimized Against. ICLR 2024 - [c18]Moritz Stephan, Alexander Khazatsky, Eric Mitchell, Annie S. Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn:
RLVF: Learning from Verbal Feedback without Overgeneralization. ICML 2024 - [c17]Fahim Tajwar, Anikait Singh, Archit Sharma, Rafael Rafailov, Jeff Schneider, Tengyang Xie, Stefano Ermon, Chelsea Finn, Aviral Kumar:
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data. ICML 2024 - [c16]Jingyun Yang, Max Sobol Mark, Brandon Vu, Archit Sharma, Jeannette Bohg, Chelsea Finn:
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning. ICRA 2024: 4804-4811 - [c15]Abby O'Neill, Abdul Rehman, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alexander Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, Anthony Brohan, Antonin Raffin, Archit Sharma, Arefeh Yavary, Arhan Jain, Ashwin Balakrishna, Ayzaan Wahid, Ben Burgess-Limerick, Beomjoon Kim, Bernhard Schölkopf, Blake Wulfe, Brian Ichter, Cewu Lu, Charles Xu, Charlotte Le, Chelsea Finn, Chen Wang, Chenfeng Xu, Cheng Chi, Chenguang Huang, Christine Chan, Christopher Agia, Chuer Pan, Chuyuan Fu, Coline Devin, Danfei Xu, Daniel Morton, Danny Driess, Daphne Chen, Deepak Pathak, Dhruv Shah, Dieter Büchler, Dinesh Jayaraman, Dmitry Kalashnikov, Dorsa Sadigh, Edward Johns, Ethan Paul Foster, Fangchen Liu, Federico Ceola, Fei Xia, Feiyu Zhao, Freek Stulp, Gaoyue Zhou, Gaurav S. Sukhatme, Gautam Salhotra, Ge Yan, Gilbert Feng, Giulio Schiavi, Glen Berseth, Gregory Kahn, Guanzhi Wang, Hao Su, Haoshu Fang, Haochen Shi, Henghui Bao, Heni Ben Amor, Henrik I. Christensen, Hiroki Furuta, Homer Walke, Hongjie Fang, Huy Ha, Igor Mordatch, Ilija Radosavovic, Isabel Leal, Jacky Liang, Jad Abou-Chakra, Jaehyung Kim, Jaimyn Drake, Jan Peters, Jan Schneider, Jasmine Hsu, Jeannette Bohg, Jeffrey Bingham, Jeffrey Wu, Jensen Gao, Jiaheng Hu, Jiajun Wu, Jialin Wu, Jiankai Sun, Jianlan Luo, Jiayuan Gu, Jie Tan, Jihoon Oh, Jimmy Wu, Jingpei Lu, Jingyun Yang, Jitendra Malik, João Silvério, Joey Hejna, Jonathan Booher, Jonathan Tompson, Jonathan Yang, Jordi Salvador, Joseph J. Lim, Junhyek Han, Kaiyuan Wang, Kanishka Rao, Karl Pertsch, Karol Hausman, Keegan Go, Keerthana Gopalakrishnan, Ken Goldberg, Kendra Byrne, Kenneth Oslund, Kento Kawaharazuka, Kevin Black, Kevin Lin, Kevin Zhang, Kiana Ehsani, Kiran Lekkala, Kirsty Ellis, Krishan Rana, Krishnan Srinivasan, Kuan Fang, Kunal Pratap Singh, Kuo-Hao Zeng, Kyle Hatch, Kyle Hsu, Laurent Itti, Lawrence Yunliang Chen, Lerrel Pinto, Li Fei-Fei, Liam Tan, Linxi Jim Fan, Lionel Ott, Lisa Lee, Luca Weihs, Magnum Chen, Marion Lepert, Marius Memmel, Masayoshi Tomizuka, Masha Itkina, Mateo Guaman Castro, Max Spero, Maximilian Du, Michael Ahn, Michael C. Yip, Mingtong Zhang, Mingyu Ding, Minho Heo, Mohan Kumar Srirama, Mohit Sharma, Moo Jin Kim, Naoaki Kanazawa, Nicklas Hansen, Nicolas Heess, Nikhil J. Joshi, Niko Sünderhauf, Ning Liu, Norman Di Palo, Nur Muhammad (Mahi) Shafiullah, Oier Mees, Oliver Kroemer, Osbert Bastani, Pannag R. Sanketi, Patrick Tree Miller, Patrick Yin, Paul Wohlhart, Peng Xu, Peter David Fagan, Peter Mitrano, Pierre Sermanet, Pieter Abbeel, Priya Sundaresan, Qiuyu Chen, Quan Vuong, Rafael Rafailov, Ran Tian, Ria Doshi, Roberto Martín-Martín, Rohan Baijal, Rosario Scalise, Rose Hendrix, Roy Lin, Runjia Qian, Ruohan Zhang, Russell Mendonca, Rutav Shah, Ryan Hoque, Ryan Julian, Samuel Bustamante, Sean Kirmani, Sergey Levine, Shan Lin, Sherry Moore, Shikhar Bahl, Shivin Dass, Shubham D. Sonawani, Shuran Song, Sichun Xu, Siddhant Haldar, Siddharth Karamcheti, Simeon Adebola, Simon Guist, Soroush Nasiriany, Stefan Schaal, Stefan Welker, Stephen Tian, Subramanian Ramamoorthy, Sudeep Dasari, Suneel Belkhale, Sungjae Park, Suraj Nair, Suvir Mirchandani, Takayuki Osa, Tanmay Gupta, Tatsuya Harada, Tatsuya Matsushima, Ted Xiao, Thomas Kollar, Tianhe Yu, Tianli Ding, Todor Davchev, Tony Z. Zhao, Travis Armstrong, Trevor Darrell, Trinity Chung, Vidhi Jain, Vincent Vanhoucke, Wei Zhan, Wenxuan Zhou, Wolfram Burgard, Xi Chen, Xiaolong Wang, Xinghao Zhu, Xinyang Geng, Xiyuan Liu, Liangwei Xu, Xuanlin Li, Yao Lu, Yecheng Jason Ma, Yejin Kim, Yevgen Chebotar, Yifan Zhou, Yifeng Zhu, Yilin Wu, Ying Xu, Yixuan Wang, Yonatan Bisk, Yoonyoung Cho, Youngwoon Lee, Yuchen Cui, Yue Cao, Yueh-Hua Wu, Yujin Tang, Yuke Zhu, Yunchu Zhang, Yunfan Jiang, Yunshuang Li, Yunzhu Li, Yusuke Iwasawa, Yutaka Matsuo, Zehan Ma, Zhuo Xu, Zichen Jeff Cui, Zichen Zhang, Zipeng Lin:
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration. ICRA 2024: 6892-6903 - [c14]Jianlan Luo, Zheyuan Hu, Charles Xu, You Liang Tan, Jacob Berg, Archit Sharma, Stefan Schaal, Chelsea Finn, Abhishek Gupta, Sergey Levine:
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning. ICRA 2024: 16961-16969 - 2023
- [c13]A. S. Poornash, Atharva Deshmukh, Archit Sharma, Sriparna Saha:
APTSumm at BioLaySumm Task 1: Biomedical Breakdown, Improving Readability by Relevancy Based Selection. BioNLP@ACL 2023: 579-585 - [c12]Lucy Xiaoyang Shi, Archit Sharma, Tony Z. Zhao, Chelsea Finn:
Waypoint-Based Imitation Learning for Robotic Manipulation. CoRL 2023: 2195-2209 - [c11]Archit Sharma, Ahmed M. Ahmed, Rehaan Ahmad, Chelsea Finn:
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning. CoRL 2023: 3292-3308 - [c10]Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher D. Manning:
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback. EMNLP 2023: 5433-5442 - [c9]Rafael Rafailov, Archit Sharma, Eric Mitchell, Christopher D. Manning, Stefano Ermon, Chelsea Finn:
Direct Preference Optimization: Your Language Model is Secretly a Reward Model. NeurIPS 2023 - 2022
- [c8]Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn:
Autonomous Reinforcement Learning: Formalism and Benchmarking. ICLR 2022 - [c7]Archit Sharma, Rehaan Ahmad, Chelsea Finn:
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning. ICML 2022: 19645-19657 - [c6]Annie S. Chen, Archit Sharma, Sergey Levine, Chelsea Finn:
You Only Live Once: Single-Life Reinforcement Learning. NeurIPS 2022 - [c5]Annie Xie, Fahim Tajwar, Archit Sharma, Chelsea Finn:
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning. NeurIPS 2022 - 2021
- [c4]Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Shane Gu:
Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning. ICML 2021: 1953-1963 - [c3]Archit Sharma, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn:
Autonomous Reinforcement Learning via Subgoal Curricula. NeurIPS 2021: 18474-18486 - 2020
- [c2]Archit Sharma, Shixiang Gu, Sergey Levine, Vikash Kumar, Karol Hausman:
Dynamics-Aware Unsupervised Discovery of Skills. ICLR 2020 - [c1]Archit Sharma, Michael Ahn, Sergey Levine, Vikash Kumar, Karol Hausman, Shixiang Gu:
Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning. Robotics: Science and Systems 2020
Informal and Other Publications
- 2024
- [i26]Jianlan Luo, Zheyuan Hu, Charles Xu, You Liang Tan, Jacob Berg, Archit Sharma, Stefan Schaal, Chelsea Finn, Abhishek Gupta, Sergey Levine:
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning. CoRR abs/2401.16013 (2024) - [i25]Moritz Stephan, Alexander Khazatsky, Eric Mitchell, Annie S. Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn:
RLVF: Learning from Verbal Feedback without Overgeneralization. CoRR abs/2402.10893 (2024) - [i24]Archit Sharma, Sedrick Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar:
A Critical Evaluation of AI Feedback for Aligning Large Language Models. CoRR abs/2402.12366 (2024) - [i23]Lucy Xiaoyang Shi, Zheyuan Hu, Tony Z. Zhao, Archit Sharma, Karl Pertsch, Jianlan Luo, Sergey Levine, Chelsea Finn:
Yell At Your Robot: Improving On-the-Fly from Language Corrections. CoRR abs/2403.12910 (2024) - [i22]Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, Ilija Radosavovic, Kaiyuan Wang, Albert Zhan, Kevin Black, Cheng Chi, Kyle Beltran Hatch, Shan Lin, Jingpei Lu, Jean Mercat, Abdul Rehman, Pannag R. Sanketi, Archit Sharma, Cody Simpson, Quan Vuong, Homer Rich Walke, Blake Wulfe, Ted Xiao, Jonathan Heewon Yang, Arefeh Yavary, Tony Z. Zhao, Christopher Agia, Rohan Baijal, Mateo Guaman Castro, Daphne Chen, Qiuyu Chen, Trinity Chung, Jaimyn Drake, Ethan Paul Foster, et al.:
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset. CoRR abs/2403.12945 (2024) - [i21]Kanishk Gandhi, Denise Lee, Gabriel Grand, Muxin Liu, Winson Cheng, Archit Sharma, Noah D. Goodman:
Stream of Search (SoS): Learning to Search in Language. CoRR abs/2404.03683 (2024) - [i20]Fahim Tajwar, Anikait Singh, Archit Sharma, Rafael Rafailov, Jeff Schneider, Tengyang Xie, Stefano Ermon, Chelsea Finn, Aviral Kumar:
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data. CoRR abs/2404.14367 (2024) - [i19]Judy Hanwen Shen, Archit Sharma, Jun Qin:
Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison. CoRR abs/2409.09603 (2024) - 2023
- [i18]Archit Sharma, Ahmed M. Ahmed, Rehaan Ahmad, Chelsea Finn:
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning. CoRR abs/2303.01488 (2023) - [i17]Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher D. Manning:
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback. CoRR abs/2305.14975 (2023) - [i16]Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn:
Direct Preference Optimization: Your Language Model is Secretly a Reward Model. CoRR abs/2305.18290 (2023) - [i15]Lucy Xiaoyang Shi, Archit Sharma, Tony Z. Zhao, Chelsea Finn:
Waypoint-Based Imitation Learning for Robotic Manipulation. CoRR abs/2307.14326 (2023) - [i14]Max Sobol Mark, Archit Sharma, Fahim Tajwar, Rafael Rafailov, Sergey Levine, Chelsea Finn:
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias. CoRR abs/2310.08558 (2023) - [i13]Eric Mitchell, Rafael Rafailov, Archit Sharma, Chelsea Finn, Christopher D. Manning:
An Emulator for Fine-Tuning Large Language Models using Small Language Models. CoRR abs/2310.12962 (2023) - [i12]Jingyun Yang, Max Sobol Mark, Brandon Vu, Archit Sharma, Jeannette Bohg, Chelsea Finn:
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning. CoRR abs/2310.15145 (2023) - [i11]Annie S. Chen, Govind Chada, Laura M. Smith, Archit Sharma, Zipeng Fu, Sergey Levine, Chelsea Finn:
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment. CoRR abs/2311.01059 (2023) - 2022
- [i10]Archit Sharma, Rehaan Ahmad, Chelsea Finn:
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning. CoRR abs/2205.05212 (2022) - [i9]Annie S. Chen, Archit Sharma, Sergey Levine, Chelsea Finn:
You Only Live Once: Single-Life Reinforcement Learning. CoRR abs/2210.08863 (2022) - [i8]Annie Xie, Fahim Tajwar, Archit Sharma, Chelsea Finn:
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning. CoRR abs/2210.10765 (2022) - 2021
- [i7]Behzad Haghgoo, Allan Zhou, Archit Sharma, Chelsea Finn:
Discriminator Augmented Model-Based Reinforcement Learning. CoRR abs/2103.12999 (2021) - [i6]Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Shane Gu:
Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning. CoRR abs/2106.01404 (2021) - [i5]Archit Sharma, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn:
Persistent Reinforcement Learning via Subgoal Curricula. CoRR abs/2107.12931 (2021) - [i4]Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn:
Autonomous Reinforcement Learning: Formalism and Benchmarking. CoRR abs/2112.09605 (2021) - 2020
- [i3]Archit Sharma, Michael Ahn, Sergey Levine, Vikash Kumar, Karol Hausman, Shixiang Gu:
Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning. CoRR abs/2004.12974 (2020) - 2019
- [i2]Archit Sharma, Shixiang Gu, Sergey Levine, Vikash Kumar, Karol Hausman:
Dynamics-Aware Unsupervised Discovery of Skills. CoRR abs/1907.01657 (2019) - 2018
- [i1]Archit Sharma, Jasper L, Eric Zhang:
TrueChain: Highly Performant Decentralized Public Ledger. CoRR abs/1805.01457 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-15 21:35 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint