


Chenliang Li 0003
Person information
- affiliation: Alibaba DAMO Academy, Hangzhou, China
Other persons with the same name
- Chenliang Li — disambiguation page
- Chenliang Li 0001: Guilin University of Electronic Technology, Guilin, Guangxi, China
- Chenliang Li 0002: University of Science and Technology of China, Hefei, China
- Chenliang Li 0004: Beijing Jiaotong University, Beijing, China
- Chenliang Li 0005: Wuhan University, Wuhan, China
2020 – today
- 2025
- [i33] Xuenan Xu, Jiahao Mei, Chenliang Li, Yuning Wu, Ming Yan, Shaopeng Lai, Ji Zhang, Mengyue Wu:
MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio. CoRR abs/2503.05242 (2025)
- [i32] Yuning Wu, Jiahao Mei, Ming Yan, Chenliang Li, Shaopeng Lai, Yuran Ren, Zijia Wang, Ji Zhang, Mengyue Wu, Qin Jin, Fei Huang:
WritingBench: A Comprehensive Benchmark for Generative Writing. CoRR abs/2503.05244 (2025)
- [i31] Fanqi Wan, Weizhou Shen, Shengyi Liao, Yingcheng Shi, Chenliang Li, Ziyi Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan:
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning. CoRR abs/2505.17667 (2025)
- [i30] Weizhou Shen, Chenliang Li, Fanqi Wan, Shengyi Liao, Shaopeng Lai, Bo Zhang, Yingcheng Shi, Yuning Wu, Gang Fu, Zhansheng Li, Bin Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan:
QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization. CoRR abs/2505.18092 (2025)
- [i29] Fuwen Luo, Shengfeng Lou, Chi Chen, Ziyue Wang, Chenliang Li, Weizhou Shen, Jiyue Guo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding. CoRR abs/2505.20715 (2025)
- [i28] Xuanyu Lei, Chenliang Li, Yuning Wu, Kaiming Liu, Weizhou Shen, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning. CoRR abs/2506.05760 (2025)
- 2024
- [c24] Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Gao Xing, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang:
SocialBench: Sociality Evaluation of Role-Playing Conversational Agents. ACL (Findings) 2024: 2108-2126
- [c23] Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. LREC/COLING 2024: 14664-14675
- [c22] Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. LREC/COLING 2024: 17031-17041
- [c21] Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang:
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent. EMNLP 2024: 16658-16680
- [c20] Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. ACM Multimedia 2024: 6929-6938
- [i27] Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang:
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent. CoRR abs/2401.07324 (2024)
- [i26] Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. CoRR abs/2402.16769 (2024)
- [i25] Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. CoRR abs/2403.00249 (2024)
- [i24] Wei Ye, Chaoya Jiang, Haiyang Xu, Chenhao Ye, Chenliang Li, Ming Yan, Shikun Zhang, Songhang Huang, Fei Huang:
Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection. CoRR abs/2403.07883 (2024)
- [i23] Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Xing Gao, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang, Jingren Zhou:
RoleInteract: Evaluating the Social Interaction of Role-Playing Agents. CoRR abs/2403.13679 (2024)
- [i22] Tianyuan Shi, Fanqi Wan, Canbin Huang, Xiaojun Quan, Chenliang Li, Ming Yan, Ji Zhang:
ProFuser: Progressive Fusion of Large Language Models. CoRR abs/2408.04998 (2024)
- 2023
- [j1] Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Xianzhe Xu, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin:
Achieving Human Parity on Visual Question Answering. ACM Trans. Inf. Syst. 41(3): 79:1-79:40 (2023)
- [c19] Xu Yang, Jiawei Peng, Zihua Wang, Haiyang Xu, Qinghao Ye, Chenliang Li, Songfang Huang, Fei Huang, Zhangzikang Li, Yu Zhang:
Transforming Visual Scene Graphs to Image Captions. ACL (1) 2023: 12427-12440
- [c18] Chenliang Li, He Chen, Ming Yan, Weizhou Shen, Haiyang Xu, Zhikai Wu, Zhicheng Zhang, Wenmeng Zhou, Yingda Chen, Chen Cheng, Hongzhu Shi, Ji Zhang, Fei Huang, Jingren Zhou:
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models. EMNLP (Demos) 2023: 566-578
- [c17] Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. EMNLP (Findings) 2023: 2841-2858
- [c16] Xu Yang, Zhangzikang Li, Haiyang Xu, Hanwang Zhang, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang:
Learning Trajectory-Word Alignments for Video-Language Tasks. ICCV 2023: 2504-2514
- [c15] Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Songfang Huang:
BUS : Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization. ICCV 2023: 2888-2898
- [c14] Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. ICML 2023: 38728-38748
- [c13] Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Ji Zhang:
COPA : Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment. ACM Multimedia 2023: 4480-4491
- [i21] Xu Yang, Zhangzikang Li, Haiyang Xu, Hanwang Zhang, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang:
Learning Trajectory-Word Alignments for Video-Language Tasks. CoRR abs/2301.01953 (2023)
- [i20] Zihua Wang, Xu Yang, Haiyang Xu, Hanwang Zhang, Chenliang Li, Songfang Huang, Fei Huang, Yu Zhang:
Adaptively Clustering Neighbor Elements for Image Captioning. CoRR abs/2301.01955 (2023)
- [i19] Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. CoRR abs/2302.00402 (2023)
- [i18] Junfeng Tian, Hehong Chen, Guohai Xu, Ming Yan, Xing Gao, Jianhai Zhang, Chenliang Li, Jiayi Liu, Wenshen Xu, Haiyang Xu, Qi Qian, Wei Wang, Qinghao Ye, Jiejing Zhang, Ji Zhang, Fei Huang, Jingren Zhou:
ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human. CoRR abs/2304.07849 (2023)
- [i17] Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality. CoRR abs/2304.14178 (2023)
- [i16] Xu Yang, Jiawei Peng, Zihua Wang, Haiyang Xu, Qinghao Ye, Chenliang Li, Ming Yan, Fei Huang, Zhangzikang Li, Yu Zhang:
Transforming Visual Scene Graphs to Image Captions. CoRR abs/2305.02177 (2023)
- [i15] Haiyang Xu, Qinghao Ye, Xuan Wu, Ming Yan, Yuan Miao, Jiabo Ye, Guohai Xu, Anwen Hu, Yaya Shi, Guangwei Xu, Chenliang Li, Qi Qian, Maofei Que, Ji Zhang, Xiao Zeng, Fei Huang:
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks. CoRR abs/2306.04362 (2023)
- [i14] Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Yuhao Dan, Chenlin Zhao, Guohai Xu, Chenliang Li, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding. CoRR abs/2307.02499 (2023)
- [i13] Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Songfang Huang:
BUS: Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization. CoRR abs/2307.08504 (2023)
- [i12] Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Ji Zhang, Fei Huang:
COPA: Efficient Vision-Language Pre-training Through Collaborative Object- and Patch-Text Alignment. CoRR abs/2308.03475 (2023)
- [i11] Chenliang Li, Hehong Chen, Ming Yan, Weizhou Shen, Haiyang Xu, Zhikai Wu, Zhicheng Zhang, Wenmeng Zhou, Yingda Chen, Chen Cheng, Hongzhu Shi, Ji Zhang, Fei Huang, Jingren Zhou:
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models. CoRR abs/2309.00986 (2023)
- [i10] Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Alex Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. CoRR abs/2310.05126 (2023)
- [i9] Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. CoRR abs/2311.18248 (2023)
- 2022
- [c12] Chaoya Jiang, Haiyang Xu, Chenliang Li, Ming Yan, Wei Ye, Shikun Zhang, Bin Bi, Songfang Huang:
TRIPS: Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection. EMNLP 2022: 4084-4096
- [c11] Chenliang Li, Haiyang Xu, Junfeng Tian, Wei Wang, Ming Yan, Bin Bi, Jiabo Ye, He Chen, Guohai Xu, Zheng Cao, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou, Luo Si:
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. EMNLP 2022: 7241-7259
- [i8] Chenliang Li, Haiyang Xu, Junfeng Tian, Wei Wang, Ming Yan, Bin Bi, Jiabo Ye, Hehong Chen, Guohai Xu, Zheng Cao, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou, Luo Si:
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. CoRR abs/2205.12005 (2022)
- 2021
- [c10] Ming Yan, Chenliang Li, Bin Bi, Wei Wang, Songfang Huang:
A Unified Pretraining Framework for Passage Ranking and Expansion. AAAI 2021: 4555-4563
- [c9] Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao, Fei Huang:
E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning. ACL/IJCNLP (1) 2021: 503-513
- [c8] Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang:
Addressing Semantic Drift in Generative Question Answering with Auxiliary Extraction. ACL/IJCNLP (2) 2021: 942-947
- [c7] Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si:
StructuralLM: Structural Pre-training for Form Understanding. ACL/IJCNLP (1) 2021: 6309-6318
- [c6] Junfeng Tian, Min Gui, Chenliang Li, Ming Yan, Wenming Xiao:
MinD at SemEval-2021 Task 6: Propaganda Detection using Transfer Learning and Multimodal Fusion. SemEval@ACL/IJCNLP 2021: 1082-1087
- [c5] Guohai Xu, Yan Shao, Chenliang Li, Feng-Lin Li, Bin Bi, Ji Zhang, Haiqing Chen:
AliMe DA: A Data Augmentation Framework for Question Answering in Cold-start Scenarios. SIGIR 2021: 2637-2638
- [i7] Chenliang Li, Ming Yan, Haiyang Xu, Fuli Luo, Wei Wang, Bin Bi, Songfang Huang:
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels. CoRR abs/2103.07829 (2021)
- [i6] Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si:
StructuralLM: Structural Pre-training for Form Understanding. CoRR abs/2105.11210 (2021)
- [i5] Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao, Fei Huang:
E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning. CoRR abs/2106.01804 (2021)
- [i4] Ming Yan, Haiyang Xu, Chenliang Li, Bin Bi, Junfeng Tian, Min Gui, Wei Wang:
Grid-VLP: Revisiting Grid Features for Vision-Language Pre-training. CoRR abs/2108.09479 (2021)
- [i3] Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Weihua Chen, Xianzhe Xu, Fan Wang, Zheng Cao, Zhicheng Zhang, Qiyu Zhang, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin:
Achieving Human Parity on Visual Question Answering. CoRR abs/2111.08896 (2021)
- 2020
- [c4] Bin Bi, Chen Wu, Ming Yan, Wei Wang, Jiangnan Xia, Chenliang Li:
Generating Well-Formed Answers by Machine Reading with Stochastic Selector Networks. AAAI 2020: 7424-7431
- [c3] Bin Bi, Chenliang Li, Chen Wu, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si:
PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation. EMNLP (1) 2020: 8681-8691
- [i2] Bin Bi, Chenliang Li, Chen Wu, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si:
PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation. CoRR abs/2004.07159 (2020)
2010 – 2019
- 2019
- [c2] Bin Bi, Chen Wu, Ming Yan, Wei Wang, Jiangnan Xia, Chenliang Li:
Incorporating External Knowledge into Machine Reading for Generative Question Answering. EMNLP/IJCNLP (1) 2019: 2521-2530
- [c1] Ming Yan, Chenliang Li, Chen Wu, Bin Bi, Wei Wang, Jiangnan Xia, Luo Si:
IDST at TREC 2019 Deep Learning Track: Deep Cascade Ranking with Generation-based Document Expansion and Pre-trained Language Modeling. TREC 2019
- [i1] Bin Bi, Chen Wu, Ming Yan, Wei Wang, Jiangnan Xia, Chenliang Li:
Incorporating External Knowledge into Machine Reading for Generative Question Answering. CoRR abs/1909.02745 (2019)
last updated on 2025-10-08 23:41 CEST by the dblp team
all metadata released as open data under CC0 1.0 license