default search action
Silvio Savarese
Person information
- affiliation: Stanford University, Department of Computer Science, Stanford, CA, USA
- affiliation (2008 - 2013): University of Michigan, Department of Electrical and Computer Engineering, Ann Arbor, MI, USA
- affiliation (2005 - 2008): University of Illinois, Urbana-Champaign, IL, USA
- affiliation (PhD 2005): California Institute of Technology, Pasadena, CA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j31]Bokui Shen, Zhenyu Jiang, Christopher Bongsoo Choy, Silvio Savarese, Leonidas J. Guibas, Anima Anandkumar, Yuke Zhu:
Action-conditional implicit visual dynamics for deformable object manipulation. Int. J. Robotics Res. 43(4): 437-455 (2024) - [j30]Rachel Luo, Shengjia Zhao, Jonathan Kuck, Boris Ivanovic, Silvio Savarese, Edward Schmerling, Marco Pavone:
Sample-efficient safety assurances using conformal prediction. Int. J. Robotics Res. 43(9): 1409-1424 (2024) - [c217]Itai Feigenbaum, Devansh Arpit, Shelby Heinecke, Juan Carlos Niebles, Weiran Yao, Caiming Xiong, Silvio Savarese, Huan Wang:
Causal Layering via Conditional Entropy. CLeaR 2024: 1176-1191 - [c216]Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu:
HIVE: Harnessing Human Feedback for Instructional Visual Editing. CVPR 2024: 9026-9036 - [c215]Le Xue, Ning Yu, Shu Zhang, Artemis Panagopoulou, Junnan Li, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese:
ULIP-2: Towards Scalable Multimodal Pre-Training for 3D Understanding. CVPR 2024: 27081-27091 - [c214]Jianguo Zhang, Kun Qian, Zhiwei Liu, Shelby Heinecke, Rui Meng, Ye Liu, Zhou Yu, Huan Wang, Silvio Savarese, Caiming Xiong:
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI. EACL (Findings) 2024: 2299-2315 - [c213]Artemis Panagopoulou, Le Xue, Ning Yu, Junnan Li, Dongxu Li, Shafiq Joty, Ran Xu, Silvio Savarese, Caiming Xiong, Juan Carlos Niebles:
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning. ECCV (45) 2024: 177-197 - [c212]Tianyu Guo, Wei Hu, Song Mei, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai:
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations. ICLR 2024 - [c211]Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh R. N., Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization. ICLR 2024 - [c210]Gerald Woo, Chenghao Liu, Akshat Kumar, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
Unified Training of Universal Time Series Forecasting Transformers. ICML 2024 - [c209]Rachel Luo, Rohan Sinha, Yixiao Sun, Ali Hindy, Shengjia Zhao, Silvio Savarese, Edward Schmerling, Marco Pavone:
Online Distribution Shift Detection via Recency Prediction. ICRA 2024: 16251-16263 - [i186]Itai Feigenbaum, Devansh Arpit, Huan Wang, Shelby Heinecke, Juan Carlos Niebles, Weiran Yao, Caiming Xiong, Silvio Savarese:
Editing Arbitrary Propositions in LLMs without Subject Labels. CoRR abs/2401.07526 (2024) - [i185]Itai Feigenbaum, Devansh Arpit, Huan Wang, Shelby Heinecke, Juan Carlos Niebles, Weiran Yao, Caiming Xiong, Silvio Savarese:
Causal Layering via Conditional Entropy. CoRR abs/2401.10495 (2024) - [i184]Gerald Woo, Chenghao Liu, Akshat Kumar, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
Unified Training of Universal Time Series Forecasting Transformers. CoRR abs/2402.02592 (2024) - [i183]Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese:
Text2Data: Low-Resource Data Generation with Textual Control. CoRR abs/2402.10941 (2024) - [i182]Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Manoj Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong:
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning. CoRR abs/2402.15506 (2024) - [i181]Zhiwei Liu, Weiran Yao, Jianguo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla Kumar Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese:
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System. CoRR abs/2402.15538 (2024) - [i180]Chengshu Li, Ruohan Zhang, Josiah Wong, Cem Gokmen, Sanjana Srivastava, Roberto Martín-Martín, Chen Wang, Gabrael Levine, Wensi Ai, Benjamin Jose Martinez, Hang Yin, Michael Lingelbach, Minjune Hwang, Ayano Hiranaka, Sujay Garlanka, Arman Aydin, Sharon Lee, Jiankai Sun, Mona Anvari, Manasi Sharma, Dhruva Bansal, Samuel Hunter, Kyu-Young Kim, Alan Lou, Caleb R. Matthews, Ivan Villa-Renteria, Jerry Huayang Tang, Claire Tang, Fei Xia, Yunzhu Li, Silvio Savarese, Hyowon Gweon, C. Karen Liu, Jiajun Wu, Li Fei-Fei:
BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1, 000 Everyday Activities and Realistic Simulation. CoRR abs/2403.09227 (2024) - [i179]Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu:
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. CoRR abs/2404.07972 (2024) - [i178]Rithesh Murthy, Liangwei Yang, Juntao Tan, Tulika Manoj Awalgaonkar, Yilun Zhou, Shelby Heinecke, Sachin Desai, Jason Wu, Ran Xu, Sarah Tan, Jianguo Zhang, Zhiwei Liu, Shirley Kokane, Zuxin Liu, Ming Zhu, Huan Wang, Caiming Xiong, Silvio Savarese:
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases. CoRR abs/2406.10290 (2024) - [i177]Anas Awadalla, Le Xue, Oscar Lo, Manli Shu, Hannah Lee, Etash Kumar Guha, Matt Jordan, Sheng Shen, Mohamed Awadalla, Silvio Savarese, Caiming Xiong, Ran Xu, Yejin Choi, Ludwig Schmidt:
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens. CoRR abs/2406.11271 (2024) - [i176]Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong:
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets. CoRR abs/2406.18518 (2024) - [i175]Hung Le, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness. CoRR abs/2407.02518 (2024) - [i174]Yilun Zhou, Caiming Xiong, Silvio Savarese, Chien-Sheng Wu:
Shared Imagination: LLMs Hallucinate Alike. CoRR abs/2407.16604 (2024) - [i173]Tian Lan, Huan Wang, Caiming Xiong, Silvio Savarese:
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research. CoRR abs/2408.00930 (2024) - [i172]Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, Rithesh Murthy, Tian Lan, Lei Li, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong:
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents. CoRR abs/2408.07060 (2024) - [i171]Le Xue, Manli Shu, Anas Awadalla, Jun Wang, An Yan, Senthil Purushwalkam, Honglu Zhou, Viraj Prabhu, Yutong Dai, Michael S. Ryoo, Shrikant Kendre, Jieyu Zhang, Can Qin, Shu Zhang, Chia-Chih Chen, Ning Yu, Juntao Tan, Tulika Manoj Awalgaonkar, Shelby Heinecke, Huan Wang, Yejin Choi, Ludwig Schmidt, Zeyuan Chen, Silvio Savarese, Juan Carlos Niebles, Caiming Xiong, Ran Xu:
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models. CoRR abs/2408.08872 (2024) - [i170]Can Qin, Congying Xia, Krithika Ramakrishnan, Michael S. Ryoo, Lifu Tu, Yihao Feng, Manli Shu, Honglu Zhou, Anas Awadalla, Jun Wang, Senthil Purushwalkam, Le Xue, Yingbo Zhou, Huan Wang, Silvio Savarese, Juan Carlos Niebles, Zeyuan Chen, Ran Xu, Caiming Xiong:
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations. CoRR abs/2408.12590 (2024) - [i169]Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, Tulika Manoj Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, Ran Xu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong:
xLAM: A Family of Large Action Models to Empower AI Agent Systems. CoRR abs/2409.03215 (2024) - [i168]Xuan-Phi Nguyen, Shrey Pandit, Senthil Purushwalkam, Austin Xu, Hailin Chen, Yifei Ming, Zixuan Ke, Silvio Savarese, Caiming Xong, Shafiq Joty:
SFR-RAG: Towards Contextually Faithful LLMs. CoRR abs/2409.09916 (2024) - [i167]Taha Aksu, Gerald Woo, Juncheng Liu, Xu Liu, Chenghao Liu, Silvio Savarese, Caiming Xiong, Doyen Sahoo:
GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation. CoRR abs/2410.10393 (2024) - [i166]Xu Liu, Juncheng Liu, Gerald Woo, Taha Aksu, Yuxuan Liang, Roger Zimmermann, Chenghao Liu, Silvio Savarese, Caiming Xiong, Doyen Sahoo:
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts. CoRR abs/2410.10469 (2024) - [i165]Michael S. Ryoo, Honglu Zhou, Shrikant Kendre, Can Qin, Le Xue, Manli Shu, Silvio Savarese, Ran Xu, Caiming Xiong, Juan Carlos Niebles:
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs. CoRR abs/2410.16267 (2024) - [i164]Zhiwei Liu, Weiran Yao, Jianguo Zhang, Rithesh Murthy, Liangwei Yang, Zuxin Liu, Tian Lan, Ming Zhu, Juntao Tan, Shirley Kokane, Thai Hoang, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong:
PRACT: Optimizing Principled Reasoning and Acting of LLM Agent. CoRR abs/2410.18528 (2024) - [i163]Antonio A. Ginart, Naveen Kodali, Jason Lee, Caiming Xiong, Silvio Savarese, John Emmons:
Asynchronous Tool Usage for Real-Time Agents. CoRR abs/2410.21620 (2024) - [i162]Kung-Hsiang Huang, Akshara Prabhakar, Sidharth Dhawan, Yixin Mao, Huan Wang, Silvio Savarese, Caiming Xiong, Philippe Laban, Chien-Sheng Wu:
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments. CoRR abs/2411.02305 (2024) - [i161]Haolin Chen, Yihao Feng, Zuxin Liu, Weiran Yao, Akshara Prabhakar, Shelby Heinecke, Ricky Ho, Phil Mui, Silvio Savarese, Caiming Xiong, Huan Wang:
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding. CoRR abs/2411.04282 (2024) - [i160]Jierui Li, Hung Le, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models. CoRR abs/2411.04329 (2024) - [i159]Anas Awadalla, Le Xue, Manli Shu, An Yan, Jun Wang, Senthil Purushwalkam, Sheng Shen, Hannah Lee, Oscar Lo, Jae Sung Park, Etash Guha, Silvio Savarese, Ludwig Schmidt, Yejin Choi, Caiming Xiong, Ran Xu:
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions. CoRR abs/2411.07461 (2024) - [i158]Ye Liu, Rui Meng, Shafiq Joty, Silvio Savarese, Caiming Xiong, Yingbo Zhou, Semih Yavuz:
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval. CoRR abs/2411.12644 (2024) - [i157]Yun Peng, Akhilesh Deepak Gotmare, Michael Lyu, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
PerfCodeGen: Improving Performance of LLM Generated Code with Execution Feedback. CoRR abs/2412.03578 (2024) - [i156]Zixian Ma, Jianguo Zhang, Zhiwei Liu, Jieyu Zhang, Juntao Tan, Manli Shu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Caiming Xiong, Ranjay Krishna, Silvio Savarese:
TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action. CoRR abs/2412.05479 (2024) - [i155]Jieyu Zhang, Le Xue, Linxin Song, Jun Wang, Weikai Huang, Manli Shu, An Yan, Zixian Ma, Juan Carlos Niebles, Silvio Savarese, Caiming Xiong, Zeyuan Chen, Ranjay Krishna, Ran Xu:
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models. CoRR abs/2412.07012 (2024) - [i154]Artemis Panagopoulou, Honglu Zhou, Silvio Savarese, Caiming Xiong, Chris Callison-Burch, Mark Yatskar, Juan Carlos Niebles:
ViUniT: Visual Unit Tests for More Robust Visual Programming. CoRR abs/2412.08859 (2024) - 2023
- [j29]Aadyot Bhatnagar, Paul Kassianik, Chenghao Liu, Tian Lan, Wenzhuo Yang, Rowan Cassius, Doyen Sahoo, Devansh Arpit, Sri Subramanian, Gerald Woo, Amrita Saha, Arun Kumar Jagota, Gokulakrishnan Gopalakrishnan, Manpreet Singh, K. C. Krithika, Sukumar Maddineni, Dae-ki Cho, Bo Zong, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Steven C. H. Hoi, Huan Wang:
Merlion: End-to-End Machine Learning for Time Series. J. Mach. Learn. Res. 24: 226:1-226:6 (2023) - [j28]Roberto Martín-Martín, Mihir Patel, Hamid Rezatofighi, Abhijeet Shenoi, JunYoung Gwak, Eric Frankel, Amir Sadeghian, Silvio Savarese:
JRDB: A Dataset and Benchmark of Egocentric Robot Visual Perception of Humans in Built Environments. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 6748-6765 (2023) - [j27]Tran Thien Dat Nguyen, Hamid Rezatofighi, Ba-Ngu Vo, Ba-Tuong Vo, Silvio Savarese, Ian D. Reid:
How Trustworthy are Performance Evaluations for Basic Vision Tasks? IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8538-8552 (2023) - [c208]Dongxu Li, Junnan Li, Hung Le, Guangsen Wang, Silvio Savarese, Steven C. H. Hoi:
LAVIS: A One-stop Library for Language-Vision Intelligence. ACL (demo) 2023: 31-41 - [c207]Jiacheng Xu, Caiming Xiong, Silvio Savarese, Yingbo Zhou:
Best-k Search Algorithm for Neural Text Generation. ACL (1) 2023: 12385-12401 - [c206]Andrey Kurenkov, Michael Lingelbach, Tanmay Agarwal, Chengshu Li, Emily Jin, Ruohan Zhang, Li Fei-Fei, Jiajun Wu, Silvio Savarese, Roberto Martín-Martín:
Modeling Dynamic Environments with Scene Graph Memory. AAMAS 2023: 2851-2853 - [c205]Le Xue, Mingfei Gao, Chen Xing, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese:
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding. CVPR 2023: 1179-1189 - [c204]Honglu Zhou, Roberto Martín-Martín, Mubbasir Kapadia, Silvio Savarese, Juan Carlos Niebles:
Procedure-Aware Pretraining for Instructional Video Understanding. CVPR 2023: 10727-10738 - [c203]Bo Pang, Erik Nijkamp, Wojciech Kryscinski, Silvio Savarese, Yingbo Zhou, Caiming Xiong:
Long Document Summarization with Top-down and Bottom-up Inference. EACL (Findings) 2023: 1237-1254 - [c202]Junnan Li, Silvio Savarese, Steven C. H. Hoi:
Masked Unsupervised Self-training for Label-free Image Classification. ICLR 2023 - [c201]Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong:
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis. ICLR 2023 - [c200]Trevor Scott Standley, Ruohan Gao, Dawn Chen, Jiajun Wu, Silvio Savarese:
An Extensible Multi-modal Multi-task Object Dataset with Materials. ICLR 2023 - [c199]Andrey Kurenkov, Michael Lingelbach, Tanmay Agarwal, Emily Jin, Chengshu Li, Ruohan Zhang, Li Fei-Fei, Jiajun Wu, Silvio Savarese, Roberto Martín-Martín:
Modeling Dynamic Environments with Scene Graph Memory. ICML 2023: 17976-17993 - [c198]Junnan Li, Dongxu Li, Silvio Savarese, Steven C. H. Hoi:
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models. ICML 2023: 19730-19742 - [c197]Ruohan Gao, Hao Li, Gokul Dharan, Zhuzhu Wang, Chengshu Li, Fei Xia, Silvio Savarese, Li Fei-Fei, Jiajun Wu:
Sonicverse: A Multisensory Simulation Platform for Embodied Household Agents that See and Hear. ICRA 2023: 704-711 - [c196]Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu:
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild. NeurIPS 2023 - [c195]Jianguo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong:
Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System. SIGDIAL 2023: 509-518 - [i153]Junnan Li, Dongxu Li, Silvio Savarese, Steven C. H. Hoi:
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models. CoRR abs/2301.12597 (2023) - [i152]Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu:
HIVE: Harnessing Human Feedback for Instructional Visual Editing. CoRR abs/2303.09618 (2023) - [i151]Honglu Zhou, Roberto Martín-Martín, Mubbasir Kapadia, Silvio Savarese, Juan Carlos Niebles:
Procedure-Aware Pretraining for Instructional Video Understanding. CoRR abs/2303.18230 (2023) - [i150]Qian Cheng, Doyen Sahoo, Amrita Saha, Wenzhuo Yang, Chenghao Liu, Gerald Woo, Manpreet Singh, Silvio Savarese, Steven C. H. Hoi:
AI for IT Operations (AIOps) on Cloud Platforms: Reviews, Opportunities and Challenges. CoRR abs/2304.04661 (2023) - [i149]Erik Nijkamp, Hiroaki Hayashi, Caiming Xiong, Silvio Savarese, Yingbo Zhou:
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages. CoRR abs/2305.02309 (2023) - [i148]Le Xue, Ning Yu, Shu Zhang, Junnan Li, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese:
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding. CoRR abs/2305.08275 (2023) - [i147]Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu:
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild. CoRR abs/2305.11147 (2023) - [i146]Trevor Standley, Ruohan Gao, Dawn Chen, Jiajun Wu, Silvio Savarese:
An Extensible Multimodal Multi-task Object Dataset with Materials. CoRR abs/2305.14352 (2023) - [i145]Andrey Kurenkov, Michael Lingelbach, Tanmay Agarwal, Chengshu Li, Emily Jin, Ruohan Zhang, Fei-Fei Li, Jiajun Wu, Silvio Savarese, Roberto Martín-Martín:
Modeling Dynamic Environments with Scene Graph Memory. CoRR abs/2305.17537 (2023) - [i144]Ruohan Gao, Hao Li, Gokul Dharan, Zhuzhu Wang, Chengshu Li, Fei Xia, Silvio Savarese, Li Fei-Fei, Jiajun Wu:
Sonicverse: A Multisensory Simulation Platform for Embodied Household Agents that See and Hear. CoRR abs/2306.00923 (2023) - [i143]Rithesh Murthy, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Le Xue, Weiran Yao, Yihao Feng, Zeyuan Chen, Akash Gokul, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
REX: Rapid Exploration and eXploitation for AI Agents. CoRR abs/2307.08962 (2023) - [i142]Jianguo Zhang, Kun Qian, Zhiwei Liu, Shelby Heinecke, Rui Meng, Ye Liu, Zhou Yu, Huan Wang, Silvio Savarese, Caiming Xiong:
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI. CoRR abs/2307.10172 (2023) - [i141]Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh Murthy, Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization. CoRR abs/2308.02151 (2023) - [i140]Zhiwei Liu, Weiran Yao, Jianguo Zhang, Le Xue, Shelby Heinecke, Rithesh Murthy, Yihao Feng, Zeyuan Chen, Juan Carlos Niebles, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents. CoRR abs/2308.05960 (2023) - [i139]Jianguo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong:
Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System. CoRR abs/2308.08169 (2023) - [i138]Erik Nijkamp, Tian Xie, Hiroaki Hayashi, Bo Pang, Congying Xia, Chen Xing, Jesse Vig, Semih Yavuz, Philippe Laban, Ben Krause, Senthil Purushwalkam, Tong Niu, Wojciech Kryscinski, Lidiya Murakhovs'ka, Prafulla Kumar Choubey, Alexander R. Fabbri, Ye Liu, Rui Meng, Lifu Tu, Meghana Bhat, Chien-Sheng Wu, Silvio Savarese, Yingbo Zhou, Shafiq Joty, Caiming Xiong:
XGen-7B Technical Report. CoRR abs/2309.03450 (2023) - [i137]Tianyu Guo, Wei Hu, Song Mei, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai:
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations. CoRR abs/2310.10616 (2023) - [i136]Tao Sun, Yan Hao, Shengyu Huang, Silvio Savarese, Konrad Schindler, Marc Pollefeys, Iro Armeni:
Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change. CoRR abs/2311.09346 (2023) - [i135]Artemis Panagopoulou, Le Xue, Ning Yu, Junnan Li, Dongxu Li, Shafiq Joty, Ran Xu, Silvio Savarese, Caiming Xiong, Juan Carlos Niebles:
X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning. CoRR abs/2311.18799 (2023) - 2022
- [c194]Chengshu Li, Ruohan Zhang, Josiah Wong, Cem Gokmen, Sanjana Srivastava, Roberto Martín-Martín, Chen Wang, Gabrael Levine, Michael Lingelbach, Jiankai Sun, Mona Anvari, Minjune Hwang, Manasi Sharma, Arman Aydin, Dhruva Bansal, Samuel Hunter, Kyu-Young Kim, Alan Lou, Caleb R. Matthews, Ivan Villa-Renteria, Jerry Huayang Tang, Claire Tang, Fei Xia, Silvio Savarese, Hyowon Gweon, C. Karen Liu, Jiajun Wu, Li Fei-Fei:
BEHAVIOR-1K: A Benchmark for Embodied AI with 1, 000 Everyday Activities and Realistic Simulation. CoRL 2022: 80-93 - [c193]Mahsa Ehsanpour, Fatemeh Sadat Saleh, Silvio Savarese, Ian D. Reid, Hamid Rezatofighi:
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection. CVPR 2022: 20951-20960 - [c192]Anthony Meng Huat Tiong, Junnan Li, Boyang Li, Silvio Savarese, Steven C. H. Hoi:
Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training. EMNLP (Findings) 2022: 951-967 - [c191]Lyne P. Tchapmi, Trishiet Ray, Micael Tchapmi, Bokui Shen, Roberto Martin Martin, Silvio Savarese:
Generating Procedural 3D materials from Images using Neural Networks. IVSP 2022: 32-40 - [c190]Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Silvio Savarese, Steven Chu-Hong Hoi:
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning. NeurIPS 2022 - [c189]Bokui Shen, Zhenyu Jiang, Christopher B. Choy, Silvio Savarese, Leonidas J. Guibas, Anima Anandkumar, Yuke Zhu:
ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation. Robotics: Science and Systems 2022 - [c188]Rachel Luo, Aadyot Bhatnagar, Yu Bai, Shengjia Zhao, Huan Wang, Caiming Xiong, Silvio Savarese, Stefano Ermon, Edward Schmerling, Marco Pavone:
Local calibration: metrics and recalibration. UAI 2022: 1286-1295 - [c187]Rachel Luo, Shengjia Zhao, Jonathan Kuck, Boris Ivanovic, Silvio Savarese, Edward Schmerling, Marco Pavone:
Sample-Efficient Safety Assurances Using Conformal Prediction. WAFR 2022: 149-169 - [i134]Bokui Shen, Zhenyu Jiang, Christopher B. Choy, Leonidas J. Guibas, Silvio Savarese, Anima Anandkumar, Yuke Zhu:
ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation. CoRR abs/2203.06856 (2022) - [i133]Bo Pang, Erik Nijkamp, Wojciech Kryscinski, Silvio Savarese, Yingbo Zhou, Caiming Xiong:
Long Document Summarization with Top-down and Bottom-up Inference. CoRR abs/2203.07586 (2022) - [i132]Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong:
A Conversational Paradigm for Program Synthesis. CoRR abs/2203.13474 (2022) - [i131]Wenzhuo Yang, Hung Le, Silvio Savarese, Steven C. H. Hoi:
OmniXAI: A Library for Explainable AI. CoRR abs/2206.01612 (2022) - [i130]Junnan Li, Silvio Savarese, Steven C. H. Hoi:
Masked Unsupervised Self-training for Zero-shot Image Classification. CoRR abs/2206.02967 (2022) - [i129]Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Silvio Savarese, Steven C. H. Hoi:
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning. CoRR abs/2207.01780 (2022) - [i128]JunYoung Gwak, Silvio Savarese, Jeannette Bohg:
Minkowski Tracker: A Sparse Spatio-Temporal R-CNN for Joint Object Detection and Tracking. CoRR abs/2208.10056 (2022) - [i127]Dongxu Li, Junnan Li, Hung Le, Guangsen Wang, Silvio Savarese, Steven C. H. Hoi:
LAVIS: A Library for Language-Vision Intelligence. CoRR abs/2209.09019 (2022) - [i126]Matt Deitke, Dhruv Batra, Yonatan Bisk, Tommaso Campari, Angel X. Chang, Devendra Singh Chaplot, Changan Chen, Claudia Pérez-D'Arpino, Kiana Ehsani, Ali Farhadi, Li Fei-Fei, Anthony G. Francis, Chuang Gan, Kristen Grauman, David Hall, Winson Han, Unnat Jain, Aniruddha Kembhavi, Jacob Krantz, Stefan Lee, Chengshu Li, Sagnik Majumder, Oleksandr Maksymets, Roberto Martín-Martín, Roozbeh Mottaghi, Sonia Raychaudhuri, Mike Roberts, Silvio Savarese, Manolis Savva, Mohit Shridhar, Niko Sünderhauf, Andrew Szot, Ben Talbot, Joshua B. Tenenbaum, Jesse Thomason, Alexander Toshev, Joanne Truong, Luca Weihs, Jiajun Wu:
Retrospectives on the Embodied AI Workshop. CoRR abs/2210.06849 (2022) - [i125]Anthony Meng Huat Tiong, Junnan Li, Boyang Li, Silvio Savarese, Steven C. H. Hoi:
Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training. CoRR abs/2210.08773 (2022) - [i124]Rachel Luo, Rohan Sinha, Ali Hindy, Shengjia Zhao, Silvio Savarese, Edward Schmerling, Marco Pavone:
Online Distribution Shift Detection via Recency Prediction. CoRR abs/2211.09916 (2022) - [i123]Jiacheng Xu, Caiming Xiong, Silvio Savarese, Yingbo Zhou:
Best-k Search Algorithm for Neural Text Generation. CoRR abs/2211.11924 (2022) - [i122]Le Xue, Mingfei Gao, Chen Xing, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese:
ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding. CoRR abs/2212.05171 (