default search action

combined dblp search
author search
venue search
publication search

ask others

Yuhang Zang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ijcv/ZangLHZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijcv/ZangLHZL25
Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy:
Contextual Object Detection with Multimodal Large Language Models. Int. J. Comput. Vis. 133(2): 825-843 (2025)
[c28]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZangD0CLDWMDZCL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZangD0CLDWMDZCL25
Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model. ACL (Findings) 2025: 6547-6563
[c27]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/Ma0ZWD0CDW0S25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Ma0ZWD0CDW0S25
Yubo Ma, Jinsong Li, Yuhang Zang, Xiaobao Wu, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Jiaqi Wang, Yixin Cao, Aixin Sun:
Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings. ACL (Findings) 2025: 19568-19580
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/BuL0WDZCL025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/BuL0WDZCL025
Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way. CVPR 2025: 12999-13008
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/XingHDL0ZCHWWL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/XingHDL0ZCHWWL25
Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, Dahua Lin:
Conical Visual Concentration for Efficient Large Vision-Language Models. CVPR 2025: 14593-14603
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/0001HW0Z0L025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/0001HW0Z0L025
Zihao Huang, Shoukang Hu, Guangcong Wang, Tianqi Liu, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
WildAvatar: Learning In-the-wild 3D Avatars from the Web. CVPR 2025: 15963-15975
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/NiuLMGZHDDD0ZZC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/NiuLMGZHDDD0ZZC25
Junbo Niu, Yifei Li, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang:
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? CVPR 2025: 18902-18913
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/0001DD0ZCL025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/0001DD0ZCL025
Rui Qian, Shuangrui Ding, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction. CVPR 2025: 24045-24055
[c21]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LingB0DZWC0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LingB0DZWC0025
Pengyang Ling, Jiazi Bu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Tong Wu, Huaian Chen, Jiaqi Wang, Yi Jin:
MotionClone: Training-Free Motion Cloning for Controllable Video Generation. ICLR 2025
[c20]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LiuZD0CDHXL025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiuZD0CDHXL025
Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. ICLR 2025
[c19]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LiuDZD0ZCL025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiuDZD0ZCL025
Zihan Liu, Shuangrui Ding, Zhixiong Zhang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation. ICML 2025
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WeiLZD0CTDG0QL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WeiLZD0CTDG0QL25
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin:
VideoRoPE: What Makes for Good Video Rotary Position Embedding? ICML 2025
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-03218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-03218
Rui Qian, Shuangrui Ding, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction. CoRR abs/2501.03218 (2025)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-03226
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-03226
Beichen Zhang, Yuhong Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Haodong Duan, Yuhang Cao, Dahua Lin, Jiaqi Wang:
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning. CoRR abs/2501.03226 (2025)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-05510
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-05510
Yifei Li, Junbo Niu, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang:
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? CoRR abs/2501.05510 (2025)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-12368
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-12368
Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model. CoRR abs/2501.12368 (2025)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-05173
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-05173
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin:
VideoRoPE: What Makes for Good Video Rotary Position Embedding? CoRR abs/2502.05173 (2025)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-08590
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-08590
Yujie Zhou, Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Qidong Huang, Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Anyi Rao, Jiaqi Wang, Li Niu:
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion. CoRR abs/2502.08590 (2025)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-13128
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-13128
Zihan Liu, Shuangrui Ding, Zhixiong Zhang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation. CoRR abs/2502.13128 (2025)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-01785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-01785
Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang:
Visual-RFT: Visual Reinforcement Fine-Tuning. CoRR abs/2503.01785 (2025)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-05236
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-05236
Yibin Wang, Yuhang Zang, Hao Li, Cheng Jin, Jiaqi Wang:
Unified Reward Model for Multimodal Understanding and Generation. CoRR abs/2503.05236 (2025)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-06232
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-06232
Jiazi Bu, Pengyang Ling, Yujie Zhou, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance. CoRR abs/2504.06232 (2025)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-07957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-07957
Shengyuan Ding, Shenxi Wu, Xiangyu Zhao, Yuhang Zang, Haodong Duan, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
MM-IFEngine: Towards Multimodal Instruction Following. CoRR abs/2504.07957 (2025)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-03318
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-03318
Yibin Wang, Zhimin Li, Yuhang Zang, Chunyu Wang, Qinglin Lu, Cheng Jin, Jiaqi Wang:
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning. CoRR abs/2505.03318 (2025)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-14246
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-14246
Ziyu Liu, Yuhang Zang, Yushan Zou, Zijian Liang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang:
Visual Agentic Reinforcement Fine-Tuning. CoRR abs/2505.14246 (2025)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-14677
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-14677
Jiaer Xia, Yuhang Zang, Peng Gao, Yixuan Li, Kaiyang Zhou:
Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning. CoRR abs/2505.14677 (2025)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-04997
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-04997
Yubo Ma, Jinsong Li, Yuhang Zang, Xiaobao Wu, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Jiaqi Wang, Yixin Cao, Aixin Sun:
Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings. CoRR abs/2506.04997 (2025)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-19848
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-19848
Long Xing, Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jinsong Li, Shuangrui Ding, Weiming Zhang, Nenghai Yu, Jiaqi Wang, Feng Wu, Dahua Lin:
ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing. CoRR abs/2506.19848 (2025)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-02859
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-02859
Jiaer Xia, Bingkui Tong, Yuhang Zang, Rui Shao, Kaiyang Zhou:
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation. CoRR abs/2507.02859 (2025)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-15852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-15852
Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Songxin He, Jianfan Lin, Junsong Tang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction. CoRR abs/2507.15852 (2025)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-00819
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-00819
Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Dahua Lin:
Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models. CoRR abs/2508.00819 (2025)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-04700
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-04700
Zeyi Sun, Ziyu Liu, Yuhang Zang, Yuhang Cao, Xiaoyi Dong, Tong Wu, Dahua Lin, Jiaqi Wang:
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience. CoRR abs/2508.04700 (2025)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-17356
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-17356
Jiazi Bu, Pengyang Ling, Yujie Zhou, Yibin Wang, Yuhang Zang, Tong Wu, Dahua Lin, Jiaqi Wang:
DiCache: Let Diffusion Model Determine Its Own Cache. CoRR abs/2508.17356 (2025)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-20096
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-20096
Zeyi Sun, Yuhang Cao, Jianze Liang, Qiushi Sun, Ziyu Liu, Zhixiong Zhang, Yuhang Zang, Xiaoyi Dong, Kai Chen, Dahua Lin, Jiaqi Wang:
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning. CoRR abs/2508.20096 (2025)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-20751
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-20751
Yibin Wang, Zhimin Li, Yuhang Zang, Yujie Zhou, Jiazi Bu, Chunyu Wang, Qinglin Lu, Cheng Jin, Jiaqi Wang:
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning. CoRR abs/2508.20751 (2025)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-20317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-20317
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Jiaqi Wang, Xipeng Qiu, Dahua Lin:
SIM-CoT: Supervised Implicit Chain-of-Thought. CoRR abs/2509.20317 (2025)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-22186
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-22186
Junbo Niu, Zheng Liu, Zhuangcheng Gu, Bin Wang, Linke Ouyang, Zhiyuan Zhao, Tao Chu, Tianyao He, Fan Wu, Qintong Zhang, Zhenjiang Jin, Guang Liang, Rui Zhang, Wenzheng Zhang, Yuan Qu, Zhifei Ren, Yuefeng Sun, Yuanhong Zheng, Dongsheng Ma, Zirui Tang, Boyu Niu, Ziyang Miao, Hejun Dong, Siyi Qian, Junyuan Zhang, Jingzhou Chen, Fangdong Wang, Xiaomeng Zhao, Liqun Wei, Wei Li, Shasha Wang, Ruiliang Xu, Yuanyuan Cao, Lu Chen, Qianqian Wu, Huaiyu Gu, Lindong Lu, Keming Wang, Dechen Lin, Guanlin Shen, Xuanhe Zhou, Linfeng Zhang, Yuhang Zang, Xiaoyi Dong, Jiaqi Wang, Bo Zhang, Lei Bai, Pei Chu, Weijia Li, Jiang Wu, Lijun Wu, Zhenxiang Li, Guangyu Wang, Zhongying Tu, Chao Xu, Kai Chen, Yu Qiao, Bowen Zhou, Dahua Lin, Wentao Zhang, Conghui He:
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing. CoRR abs/2509.22186 (2025)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-22624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-22624
Ziyu Liu, Yuhang Zang, Shengyuan Ding, Yuhang Cao, Xiaoyi Dong, Haodong Duan, Dahua Lin, Jiaqi Wang:
SPARK: Synergistic Policy And Reward Co-Evolving Framework. CoRR abs/2509.22624 (2025)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-22647
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-22647
Long Xing, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jianze Liang, Qidong Huang, Jiaqi Wang, Feng Wu, Dahua Lin:
CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning. CoRR abs/2509.22647 (2025)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-23838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-23838
Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang:
2nd Place Report of MOSEv2 Challenge 2025: Concept Guided Video Object Segmentation via SeC. CoRR abs/2509.23838 (2025)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-01982
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-01982
Yujie Zhou, Pengyang Ling, Jiazi Bu, Yibin Wang, Yuhang Zang, Jiaqi Wang, Li Niu, Guangtao Zhai:
G²RPO: Granular GRPO for Precise Reward in Flow Models. CoRR abs/2510.01982 (2025)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-11063
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-11063
Chang Liu, Henghui Ding, Kaining Ying, Lingyi Hong, Ning Xu, Linjie Yang, Yuchen Fan, Mingqi Gao, Jingkun Chen, Yunqi Miao, Gengshen Wu, Zhijin Qin, Jungong Han, Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Chang Soo Lim, Joonyoung Moon, Donghyeon Cho, Tingmin Li, Yixuan Li, Yang Yang, An Yan, Leilei Cao, Feng Lu, Ran Hong, Youhai Jiang, Fengjie Zhu, Yujie Xie, Hongyang Zhang, Zhihui Liu, Shihai Ruan, Quanzhu Niu, Dengxian Gong, Shihao Chen, Tao Zhang, Yikang Zhou, Haobo Yuan, Lu Qi, Xiangtai Li, Shunping Ji, Alexey Nekrasov, Ali Athar, Daan de Geus, Alexander Hermans, Bastian Leibe:
LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation. CoRR abs/2510.11063 (2025)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-18701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-18701
Yibin Wang, Zhimin Li, Yuhang Zang, Jiazi Bu, Yujie Zhou, Yi Xin, Junjun He, Chunyu Wang, Qinglin Lu, Cheng Jin, Jiaqi Wang:
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation. CoRR abs/2510.18701 (2025)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-24693
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-24693
Zihan Liu, Zhikang Niu, Qiuyang Xiao, Zhisheng Zheng, Ruoqi Yuan, Yuhang Zang, Yuhang Cao, Xiaoyi Dong, Jianze Liang, Xie Chen, Leilei Sun, Dahua Lin, Jiaqi Wang:
STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence. CoRR abs/2510.24693 (2025)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-27606
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-27606
Yuhong Liu, Beichen Zhang, Yuhang Zang, Yuhang Cao, Long Xing, Xiaoyi Dong, Haodong Duan, Dahua Lin, Jiaqi Wang:
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning. CoRR abs/2510.27606 (2025)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-12921
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-12921
Huiqiang Sun, Liao Shen, Zhan Peng, Kun Wang, Size Wu, Yuhang Zang, Tianqi Liu, Zihao Huang, Xingyu Zeng, Zhiguo Cao, Wei Li, Chen Change Loy:
Generative Photographic Control for Scene-Consistent Video Cinematic Editing. CoRR abs/2511.12921 (2025)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-15703
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-15703
Beichen Zhang, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang:
Think Visually, Reason Textually: Vision-Language Synergy in ARC. CoRR abs/2511.15703 (2025)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-01248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-01248
Junyuan Zhang, Bin Wang, Qintong Zhang, Fan Wu, Zichen Wen, Jialin Lu, Junjie Shan, Ziqi Zhao, Shuya Yang, Ziling Wang, Ziyang Miao, Huaping Zhong, Yuhang Zang, Xiaoyi Dong, Ka-Ho Chow, Conghui He:
TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition. CoRR abs/2512.01248 (2025)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-05111
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-05111
Shengyuan Ding, Xinyu Fang, Ziyu Liu, Yuhang Zang, Yuhang Cao, Xiangyu Zhao, Haodong Duan, Xiaoyi Dong, Jianze Liang, Bin Wang, Conghui He, Dahua Lin, Jiaqi Wang:
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning. CoRR abs/2512.05111 (2025)
2024
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/0002FWZZKXL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/0002FWZZKXL024
Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Alpha-CLIP: A CLIP Model Focusing on Wherever you Want. CVPR 2024: 13019-13029
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/LiuWHSYZCLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LiuWHSYZCLL24
Tianqi Liu, Guangcong Wang, Shoukang Hu, Liao Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo. ECCV (18) 2024: 37-53
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/ZhangZDZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/ZhangZDZW24
Beichen Zhang, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Jiaqi Wang:
Long-CLIP: Unlocking the Long-Text Capability of CLIP. ECCV (51) 2024: 310-325
[c14]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ZangGS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZangGS024
Yuhang Zang, Hanlin Goh, Joshua M. Susskind, Chen Huang:
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization. ICLR 2024
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DuanYQFCLDZZWL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DuanYQFCLDZZWL024
Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models. ACM Multimedia 2024: 11198-11201
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0016WLD0ZCDB00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0016WLD0ZCDB00024
Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Lin Bin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. NeurIPS 2024
[c11]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChenLDZZCDWQLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenLDZZCDWQLZ24
Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? NeurIPS 2024
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DongZZCWOZDZLYG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DongZZCWOZDZLYG24
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. NeurIPS 2024
[c9]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuCZWDZLX0L024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuCZWDZLX0L024
Ziyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang, Zijian Liang, Yuanjun Xiong, Yu Qiao, Dahua Lin, Jiaqi Wang:
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs. NeurIPS 2024
[c8]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MaZC0JLLLMDZP0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MaZC0JLLLMDZP0W24
Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun:
MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations. NeurIPS 2024
[c7]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/QianDZZDLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/QianDZZDLW24
Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang:
Streaming Long Video Understanding with Large Language Models. NeurIPS 2024
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-15914
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-15914
Yuhang Zang, Hanlin Goh, Josh M. Susskind, Chen Huang:
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization. CoRR abs/2401.15914 (2024)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-16420
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-16420
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-13805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-13805
Ziyu Liu, Zeyi Sun, Yuhang Zang, Wei Li, Pan Zhang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition. CoRR abs/2403.13805 (2024)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-15378
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-15378
Beichen Zhang, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Jiaqi Wang:
Long-CLIP: Unlocking the Long-Text Capability of CLIP. CoRR abs/2403.15378 (2024)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-20330
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-20330
Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? CoRR abs/2403.20330 (2024)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-06512
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06512
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-13044
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-13044
Tao Chu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Qiong Liu, Jiaqi Wang:
Unified Scene Representation and Reconstruction for 3D Large Language Models. CoRR abs/2404.13044 (2024)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-12218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-12218
Tianqi Liu, Guangcong Wang, Shoukang Hu, Li Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo. CoRR abs/2405.12218 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-16009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16009
Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang:
Streaming Long Video Understanding with Large Language Models. CoRR abs/2405.16009 (2024)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00093
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00093
Zeyi Sun, Tong Wu, Pan Zhang, Yuhang Zang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Bootstrap3D: Improving 3D Content Creation with Synthetic Data. CoRR abs/2406.00093 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04325
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04325
Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. CoRR abs/2406.04325 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05338
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05338
Pengyang Ling, Jiazi Bu, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Tong Wu, Huaian Chen, Jiaqi Wang, Yi Jin:
MotionClone: Training-Free Motion Cloning for Controllable Video Generation. CoRR abs/2406.05338 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-11739
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-11739
Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Jiaqi Huang, Zunnan Xu, Xiu Li, Kehong Yuan, Yanyan Zu, Jiayao Ha, Qiong Gao, Licheng Jiao:
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results. CoRR abs/2406.11739 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-11833
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-11833
Ziyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang, Zijian Liang, Yuanjun Xiong, Yu Qiao, Dahua Lin, Jiaqi Wang:
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs. CoRR abs/2406.11833 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-01523
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-01523
Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun:
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations. CoRR abs/2407.01523 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02165
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02165
Zihao Huang, Shoukang Hu, Guangcong Wang, Tianqi Liu, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation. CoRR abs/2407.02165 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-03320
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03320
Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-11691
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-11691
Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models. CoRR abs/2407.11691 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-06241
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-06241
Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way. CoRR abs/2410.06241 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-07167
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-07167
Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate. CoRR abs/2410.07167 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-16268
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-16268
Shuangrui Ding, Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Yuwei Guo, Dahua Lin, Jiaqi Wang:
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree. CoRR abs/2410.16268 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-17247
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-17247
Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, Dahua Lin:
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction. CoRR abs/2410.17247 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-17637
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-17637
Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. CoRR abs/2410.17637 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-01824
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-01824
Zeyi Sun, Ziyang Chu, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models. CoRR abs/2412.01824 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-09596
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-09596
Pan Zhang, Xiaoyi Dong, Yuhang Cao, Yuhang Zang, Rui Qian, Xilin Wei, Lin Chen, Yifei Li, Junbo Niu, Shuangrui Ding, Qipeng Guo, Haodong Duan, Xin Chen, Han Lv, Zheng Nie, Min Zhang, Bin Wang, Wenwei Zhang, Xinyue Zhang, Jiaye Ge, Wei Li, Jingwen Li, Zhongying Tu, Conghui He, Xingcheng Zhang, Kai Chen, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions. CoRR abs/2412.09596 (2024)
2023
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijcv/ZangZHL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijcv/ZangZHL23
Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch. Int. J. Comput. Vis. 131(4): 987-1001 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-14813
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14813
Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch. CoRR abs/2305.14813 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18279
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18279
Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy:
Contextual Object Detection with Multimodal Large Language Models. CoRR abs/2305.18279 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-03818
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-03818
Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want. CoRR abs/2312.03818 (2023)
2022
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/ZangLZHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/ZangLZHL22
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Open-Vocabulary DETR with Conditional Matching. ECCV (9) 2022: 106-122
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-11876
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11876
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Open-Vocabulary DETR with Conditional Matching. CoRR abs/2203.11876 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-07521
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-07521
Kaiyang Zhou, Yuanhan Zhang, Yuhang Zang, Jingkang Yang, Chen Change Loy, Ziwei Liu:
On-Device Domain Generalization. CoRR abs/2209.07521 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-07225
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-07225
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy:
Unified Vision and Language Prompt Learning. CoRR abs/2210.07225 (2022)
2021
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/WangZZCPGCLLL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangZZCPGCLLL21
Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CVPR 2021: 9695-9704
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ZangHL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ZangHL21
Yuhang Zang, Chen Huang, Chen Change Loy:
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation. ICCV 2021: 3437-3446
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-12867
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-12867
Yuhang Zang, Chen Huang, Chen Change Loy:
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation. CoRR abs/2102.12867 (2021)
2020
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/SongLZ0LY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/SongLZ0LY20
Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng Yuan:
KPNet: Towards Minimal Face Detector. AAAI 2020: 12015-12022
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-07543
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07543
Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng Yuan:
KPNet: Towards Minimal Face Detector. CoRR abs/2003.07543 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-07557
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07557
Yu Liu, Guanglu Song, Yuhang Zang, Yan Gao, Enze Xie, Junjie Yan, Chen Change Loy, Xiaogang Wang:
1st Place Solutions for OpenImage2019 - Object Detection and Instance Segmentation. CoRR abs/2003.07557 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-10032
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-10032
Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CoRR abs/2008.10032 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XieZSYYL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XieZSYYL19
Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li:
Scene Text Detection with Supervised Pyramid Context Network. AAAI 2019: 9038-9045
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/WangXSZWLYS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/WangXSZWLYS19
Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen:
Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network. ICCV 2019: 8439-8448
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-05900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-05900
Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen:
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network. CoRR abs/1908.05900 (2019)
2018
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-08605
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-08605
Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li:
Scene Text Detection with Supervised Pyramid Context Network. CoRR abs/1811.08605 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.