Anwen Hu

2020 – today

2024
[c19]
persistent URL: https://dblp.org/rec/conf/cvpr/YeXYYHL0Z024
Qinghao Ye, Haiyang Xu, Jiabo Ye, Ming Yan, Anwen Hu, Haowei Liu, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration. CVPR 2024: 13040-13051
[c18]
persistent URL: https://dblp.org/rec/conf/emnlp/ZhangHXYXJZ024
Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang:
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning. EMNLP 2024: 1882-1898
[c17]
persistent URL: https://dblp.org/rec/conf/emnlp/HuXYYZZZJHZ24
Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou:
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding. EMNLP (Findings) 2024: 3096-3120
[c16]
persistent URL: https://dblp.org/rec/conf/icmcs/YeTYZHYZHL24
Jiabo Ye, Junfeng Tian, Xiaoshan Yang, Zhenru Zhang, Anwen Hu, Ming Yan, Ji Zhang, Liang He, Xin Lin:
VG-Annotator: Vision-Language Models as Query Annotators for Unsupervised Visual Grounding. ICME 2024: 1-6
[c15]
persistent URL: https://dblp.org/rec/conf/mm/HuSXYYYL00024
Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. ACM Multimedia 2024: 6929-6938
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-12895
Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Chen Li, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou:
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding. CoRR abs/2403.12895 (2024)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-14705
Qingrong He, Kejun Lin, Shizhe Chen, Anwen Hu, Qin Jin:
Think-Program-reCtify: 3D Situated Reasoning with Large Language Models. CoRR abs/2404.14705 (2024)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-16635
Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang:
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning. CoRR abs/2404.16635 (2024)
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-04840
Jiabo Ye, Haiyang Xu, Haowei Liu, Anwen Hu, Ming Yan, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou:
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models. CoRR abs/2408.04840 (2024)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-03420
Anwen Hu, Haiyang Xu, Liang Zhang, Jiabo Ye, Ming Yan, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou:
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding. CoRR abs/2409.03420 (2024)
2023
[j1]
persistent URL: https://dblp.org/rec/journals/ijautcomp/ZhangRHJ23
Liang Zhang, Ludan Ruan, Anwen Hu, Qin Jin:
Multimodal Pretraining from Monolingual to Multilingual. Mach. Intell. Res. 20(2): 220-232 (2023)
[c14]
persistent URL: https://dblp.org/rec/conf/aaai/RuanH0ZZJ23
Ludan Ruan, Anwen Hu, Yuqing Song, Liang Zhang, Sipeng Zheng, Qin Jin:
Accommodating Audio Modality in CLIP for Multimodal Processing. AAAI 2023: 9641-9649
[c13]
persistent URL: https://dblp.org/rec/conf/aaai/ZhangHZHJ23
Liang Zhang, Anwen Hu, Jing Zhang, Shuo Hu, Qin Jin:
MPMQA: Multimodal Question Answering on Product Manuals. AAAI 2023: 13958-13966
[c12]
persistent URL: https://dblp.org/rec/conf/acl/HuCZJ23
Anwen Hu, Shizhe Chen, Liang Zhang, Qin Jin:
InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation. ACL (1) 2023: 3171-3185
[c11]
persistent URL: https://dblp.org/rec/conf/acl/YueZHZWJ23
Zihao Yue, Qi Zhang, Anwen Hu, Liang Zhang, Ziheng Wang, Qin Jin:
Movie101: A New Movie Understanding Benchmark. ACL (1) 2023: 4669-4684
[c10]
persistent URL: https://dblp.org/rec/conf/emnlp/YeHXYYXLT0ZJHLH23
Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. EMNLP (Findings) 2023: 2841-2858
[c9]
persistent URL: https://dblp.org/rec/conf/iccv/HuCZJ23
Anwen Hu, Shizhe Chen, Liang Zhang, Qin Jin:
Explore and Tell: Embodied Visual Captioning in 3D Environments. ICCV 2023: 2482-2491
[c8]
persistent URL: https://dblp.org/rec/conf/mm/ShiLXMYHY00YLHZ23
Yaya Shi, Haowei Liu, Haiyang Xu, Zongyang Ma, Qinghao Ye, Anwen Hu, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval. ACM Multimedia 2023: 4460-4470
[c7]
persistent URL: https://dblp.org/rec/conf/nips/YueHZJ23
Zihao Yue, Anwen Hu, Liang Zhang, Qin Jin:
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation. NeurIPS 2023
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-06591
Ludan Ruan, Anwen Hu, Yuqing Song, Liang Zhang, Sipeng Zheng, Qin Jin:
Accommodating Audio Modality in CLIP for Multimodal Processing. CoRR abs/2303.06591 (2023)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-09660
Liang Zhang, Anwen Hu, Jing Zhang, Shuo Hu, Qin Jin:
MPMQA: Multimodal Question Answering on Product Manuals. CoRR abs/2304.09660 (2023)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-14178
Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality. CoRR abs/2304.14178 (2023)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-06002
Anwen Hu, Shizhe Chen, Liang Zhang, Qin Jin:
InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation. CoRR abs/2305.06002 (2023)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-12140
Zihao Yue, Qi Zhang, Anwen Hu, Liang Zhang, Ziheng Wang, Qin Jin:
Movie101: A New Movie Understanding Benchmark. CoRR abs/2305.12140 (2023)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-04362
Haiyang Xu, Qinghao Ye, Xuan Wu, Ming Yan, Yuan Miao, Jiabo Ye, Guohai Xu, Anwen Hu, Yaya Shi, Guangwei Xu, Chenliang Li, Qi Qian, Maofei Que, Ji Zhang, Xiao Zeng, Fei Huang:
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks. CoRR abs/2306.04362 (2023)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-13460
Zihao Yue, Anwen Hu, Liang Zhang, Qin Jin:
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation. CoRR abs/2306.13460 (2023)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2307-02499
Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Yuhao Dan, Chenlin Zhao, Guohai Xu, Chenliang Li, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding. CoRR abs/2307.02499 (2023)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-10447
Anwen Hu, Shizhe Chen, Liang Zhang, Qin Jin:
Explore and Tell: Embodied Visual Captioning in 3D Environments. CoRR abs/2308.10447 (2023)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-05126
Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Alex Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. CoRR abs/2310.05126 (2023)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-04257
Qinghao Ye, Haiyang Xu, Jiabo Ye, Ming Yan, Anwen Hu, Haowei Liu, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou:
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration. CoRR abs/2311.04257 (2023)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-18248
Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. CoRR abs/2311.18248 (2023)
2022
[c6]
persistent URL: https://dblp.org/rec/conf/emnlp/ZhangYHWJ22
Qi Zhang, Zihao Yue, Anwen Hu, Ziheng Wang, Qin Jin:
MovieUN: A Dataset for Movie Understanding and Narrating. EMNLP (Findings) 2022: 1873-1885
[c5]
persistent URL: https://dblp.org/rec/conf/nips/ZhangHJ22
Liang Zhang, Anwen Hu, Qin Jin:
Multi-Lingual Acquisition on Multimodal Pre-training for Cross-modal Retrieval. NeurIPS 2022
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-11091
Liang Zhang, Anwen Hu, Qin Jin:
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition. CoRR abs/2206.11091 (2022)
2021
[c4]
persistent URL: https://dblp.org/rec/conf/mm/HuCJ21
Anwen Hu, Shizhe Chen, Qin Jin:
Question-controlled Text-aware Image Captioning. ACM Multimedia 2021: 3097-3105
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2103-06561
Yuqi Huo, Manli Zhang, Guangzhen Liu, Haoyu Lu, Yizhao Gao, Guoxing Yang, Jingyuan Wen, Heng Zhang, Baogui Xu, Weihao Zheng, Zongzheng Xi, Yueqian Yang, Anwen Hu, Jinming Zhao, Ruichen Li, Yida Zhao, Liang Zhang, Yuqing Song, Xin Hong, Wanqing Cui, Dan Yang Hou, Yingyan Li, Junyi Li, Peiyu Liu, Zheng Gong, Chuhao Jin, Yuchong Sun, Shizhe Chen, Zhiwu Lu, Zhicheng Dou, Qin Jin, Yanyan Lan, Wayne Xin Zhao, Ruihua Song, Ji-Rong Wen:
WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training. CoRR abs/2103.06561 (2021)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2108-02050
Anwen Hu, Shizhe Chen, Qin Jin:
ICECAP: Information Concentrated Entity-aware Image Captioning. CoRR abs/2108.02050 (2021)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2108-02059
Anwen Hu, Shizhe Chen, Qin Jin:
Question-controlled Text-aware Image Captioning. CoRR abs/2108.02059 (2021)
2020
[c3]
persistent URL: https://dblp.org/rec/conf/aaai/HuDNW20
Anwen Hu, Zhicheng Dou, Jian-Yun Nie, Ji-Rong Wen:
Leveraging Multi-Token Entities in Document-Level Named Entity Recognition. AAAI 2020: 7961-7968
[c2]
persistent URL: https://dblp.org/rec/conf/mm/HuCJ20
Anwen Hu, Shizhe Chen, Qin Jin:
ICECAP: Information Concentrated Entity-aware Image Captioning. ACM Multimedia 2020: 4217-4225

2010 – 2019

2019
[c1]
persistent URL: https://dblp.org/rec/conf/ccir/HuDW19
Anwen Hu, Zhicheng Dou, Ji-Rong Wen:
Document-Level Named Entity Recognition by Incorporating Global and Neighbor Features. CCIR 2019: 79-91
