default search action
Shaohui Peng
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c18]Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li:
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning. AAAI 2024: 14599-14607 - [c17]Fan Wu, Rui Zhang, Qi Yi, Yunkai Gao, Jiaming Guo, Shaohui Peng, Siming Lan, Husheng Han, Yansong Pan, Kaizhao Yuan, Pengwei Jin, Ruizhi Chen, Yunji Chen, Ling Li:
OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning. AAAI 2024: 15897-15905 - [c16]Enshuai Zhou, Yifan Hao, Rui Zhang, Yuxuan Guo, Zidong Du, Xishan Zhang, Xinkai Song, Chao Wang, Xuehai Zhou, Jiaming Guo, Qi Yi, Shaohui Peng, Di Huang, Ruizhi Chen, Qi Guo, Yunji Chen:
Emergent Communication for Numerical Concepts Generalization. AAAI 2024: 17609-17617 - [c15]Huang Lei, Jiaming Guo, Guanhua He, Xishan Zhang, Rui Zhang, Shaohui Peng, Shaoli Liu, Tianshi Chen:
Ex3: Automatic Novel Writing by Extracting, Excelsior and Expanding. ACL (1) 2024: 9125-9146 - [c14]Haihan Gao, Rui Zhang, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, QiCheng Wang, Xing Hu, Yuanbo Wen, Zihao Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Prompt-based Visual Alignment for Zero-shot Policy Transfer. ICML 2024 - [i16]Yunpu Zhao, Rui Zhang, Wenyi Li, Di Huang, Jiaming Guo, Shaohui Peng, Yifan Hao, Yuanbo Wen, Xing Hu, Zidong Du, Qi Guo, Ling Li, Yunji Chen:
Assessing and Understanding Creativity in Large Language Models. CoRR abs/2401.12491 (2024) - [i15]Yuxuan Guo, Shaohui Peng, Jiaming Guo, Di Huang, Xishan Zhang, Rui Zhang, Yifan Hao, Ling Li, Zikang Tian, Mingju Gao, Yutai Li, Yiming Gan, Shuai Liang, Zihao Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen:
Luban: Building Open-Ended Creative Agents via Autonomous Embodied Verification. CoRR abs/2405.15414 (2024) - [i14]Haihan Gao, Rui Zhang, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, QiCheng Wang, Xing Hu, Yuanbo Wen, Zihao Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Prompt-based Visual Alignment for Zero-shot Policy Transfer. CoRR abs/2406.03250 (2024) - [i13]Huang Lei, Jiaming Guo, Guanhua He, Xishan Zhang, Rui Zhang, Shaohui Peng, Shaoli Liu, Tianshi Chen:
Ex3: Automatic Novel Writing by Extracting, Excelsior and Expanding. CoRR abs/2408.08506 (2024) - 2023
- [j2]Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Xing Hu, Zidong Du, Qi Guo, Ruizhi Chen, Ling Li, Yunji Chen:
Learning controllable elements oriented representations for reinforcement learning. Neurocomputing 549: 126455 (2023) - [c13]Shaohui Peng, Xing Hu, Rui Zhang, Jiaming Guo, Qi Yi, Ruizhi Chen, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Conceptual Reinforcement Learning for Language-Conditioned Tasks. AAAI 2023: 9426-9434 - [c12]Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Yunkai Gao, Kaizhao Yuan, Ruizhi Chen, Siming Lan, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen:
Online Prototype Alignment for Few-shot Policy Transfer. ICML 2023: 39968-39983 - [c11]Yunkai Gao, Rui Zhang, Jiaming Guo, Fan Wu, Qi Yi, Shaohui Peng, Siming Lan, Ruizhi Chen, Zidong Du, Xing Hu, Qi Guo, Ling Li, Yunji Chen:
Context Shift Reduction for Offline Meta-Reinforcement Learning. NeurIPS 2023 - [c10]Jiaming Guo, Rui Zhang, Shaohui Peng, Qi Yi, Xing Hu, Ruizhi Chen, Zidong Du, Xishan Zhang, Ling Li, Qi Guo, Yunji Chen:
Efficient Symbolic Policy Learning with Differentiable Symbolic Expression. NeurIPS 2023 - [c9]Yuxuan Guo, Yifan Hao, Rui Zhang, Enshuai Zhou, Zidong Du, Xishan Zhang, Xinkai Song, Yuanbo Wen, Yongwei Zhao, Xuehai Zhou, Jiaming Guo, Qi Yi, Shaohui Peng, Di Huang, Ruizhi Chen, Qi Guo, Yunji Chen:
Emergent Communication for Rules Reasoning. NeurIPS 2023 - [c8]Di Huang, Ziyuan Nan, Xing Hu, Pengwei Jin, Shaohui Peng, Yuanbo Wen, Rui Zhang, Zidong Du, Qi Guo, Yewen Pu, Yunji Chen:
ANPL: Towards Natural Programming with Interactive Decomposition. NeurIPS 2023 - [c7]Siming Lan, Rui Zhang, Qi Yi, Jiaming Guo, Shaohui Peng, Yunkai Gao, Fan Wu, Ruizhi Chen, Zidong Du, Xing Hu, Xishan Zhang, Ling Li, Yunji Chen:
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning. NeurIPS 2023 - [c6]Zikang Tian, Ruizhi Chen, Xing Hu, Ling Li, Rui Zhang, Fan Wu, Shaohui Peng, Jiaming Guo, Zidong Du, Qi Guo, Yunji Chen:
Decompose a Task into Generalizable Subtasks in Multi-Agent Reinforcement Learning. NeurIPS 2023 - [i12]Shaohui Peng, Xing Hu, Rui Zhang, Jiaming Guo, Qi Yi, Ruizhi Chen, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Conceptual Reinforcement Learning for Language-Conditioned Tasks. CoRR abs/2303.05069 (2023) - [i11]Di Huang, Ziyuan Nan, Xing Hu, Pengwei Jin, Shaohui Peng, Yuanbo Wen, Rui Zhang, Zidong Du, Qi Guo, Yewen Pu, Yunji Chen:
ANPL: Compiling Natural Programs with Interactive Decomposition. CoRR abs/2305.18498 (2023) - [i10]Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Yunkai Gao, Kaizhao Yuan, Ruizhi Chen, Siming Lan, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen:
Online Prototype Alignment for Few-shot Policy Transfer. CoRR abs/2306.07307 (2023) - [i9]Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li:
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning. CoRR abs/2309.01352 (2023) - [i8]Siming Lan, Rui Zhang, Qi Yi, Jiaming Guo, Shaohui Peng, Yunkai Gao, Fan Wu, Ruizhi Chen, Zidong Du, Xing Hu, Xishan Zhang, Ling Li, Yunji Chen:
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning. CoRR abs/2311.01075 (2023) - [i7]Jiaming Guo, Rui Zhang, Shaohui Peng, Qi Yi, Xing Hu, Ruizhi Chen, Zidong Du, Xishan Zhang, Ling Li, Qi Guo, Yunji Chen:
Efficient Symbolic Policy Learning with Differentiable Symbolic Expression. CoRR abs/2311.02104 (2023) - [i6]Yunkai Gao, Rui Zhang, Jiaming Guo, Fan Wu, Qi Yi, Shaohui Peng, Siming Lan, Ruizhi Chen, Zidong Du, Xing Hu, Qi Guo, Ling Li, Yunji Chen:
Context Shift Reduction for Offline Meta-Reinforcement Learning. CoRR abs/2311.03695 (2023) - [i5]Yuxuan Guo, Yifan Hao, Rui Zhang, Enshuai Zhou, Zidong Du, Xishan Zhang, Xinkai Song, Yuanbo Wen, Yongwei Zhao, Xuehai Zhou, Jiaming Guo, Qi Yi, Shaohui Peng, Di Huang, Ruizhi Chen, Qi Guo, Yunji Chen:
Emergent Communication for Rules Reasoning. CoRR abs/2311.04474 (2023) - 2022
- [j1]Xiaobing Chen, Hao Qi, Shaohui Peng, Yimin Zhuang, Tian Zhi, Yunji Chen:
Tetris: A Heuristic Static Memory Management Framework for Uniform Memory Multicore Neural Network Accelerators. J. Comput. Sci. Technol. 37(6): 1255-1270 (2022) - [c5]Shaohui Peng, Xing Hu, Rui Zhang, Ke Tang, Jiaming Guo, Qi Yi, Ruizhi Chen, Xishan Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Causality-driven Hierarchical Structure Discovery for Reinforcement Learning. NeurIPS 2022 - [c4]Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen:
Object-Category Aware Reinforcement Learning. NeurIPS 2022 - [i4]Shaohui Peng, Xing Hu, Rui Zhang, Ke Tang, Jiaming Guo, Qi Yi, Ruizhi Chen, Xishan Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Causality-driven Hierarchical Structure Discovery for Reinforcement Learning. CoRR abs/2210.06964 (2022) - [i3]Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen:
Object-Category Aware Reinforcement Learning. CoRR abs/2210.07802 (2022) - 2021
- [c3]Jiaming Guo, Rui Zhang, Xishan Zhang, Shaohui Peng, Qi Yi, Zidong Du, Xing Hu, Qi Guo, Yunji Chen:
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment. IJCAI 2021: 2476-2482 - [i2]Jiaming Guo, Rui Zhang, Xishan Zhang, Shaohui Peng, Qi Yi, Zidong Du, Xing Hu, Qi Guo, Yunji Chen:
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment. CoRR abs/2107.12216 (2021) - [i1]Ruizhi Chen, Xiaoyu Wu, Yansong Pan, Kaizhao Yuan, Ling Li, TianYun Ma, JiYuan Liang, Rui Zhang, Kai Wang, Chen Zhang, Shaohui Peng, Xishan Zhang, Zidong Du, Qi Guo, Yunji Chen:
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms. CoRR abs/2109.01768 (2021)
2010 – 2019
- 2019
- [c2]Xiaobing Chen, Shaohui Peng, Luyang Jin, Yimin Zhuang, Jin Song, Weijian Du, Shaoli Liu, Tian Zhi:
Partition and Scheduling Algorithms for Neural Network Accelerators. APPT 2019: 55-67 - [c1]Yimin Zhuang, Shaohui Peng, Xiaobing Chen, Shengyuan Zhou, Tian Zhi, Wei Li, Shaoli Liu:
Deep Fusion: A Software Scheduling Method for Memory Access Optimization. NPC 2019: 277-288
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-01 21:40 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint