


default search action
Yuhang Cao
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[j4]Weibin Wu
, Yuhang Cao
, Ning Yi
, Rongyi Ou
, Zibin Zheng
:
Detecting and Reducing the Factual Hallucinations of Large Language Models with Metamorphic Testing. Proc. ACM Softw. Eng. 2(FSE): 1432-1453 (2025)
[j3]Yuan Dai
, Xuchen Gao
, Yunhui Qiu
, Jingyuan Li
, Yuhang Cao
, Yiqing Mao
, Sichao Chen
, Wenbo Yin
, Wai-Shing Luk
, Lingli Wang
:
COFFA: A Co-Design Framework for Fused-Grained Reconfigurable Architecture Towards Efficient Irregular Loop Handling. IEEE Trans. Computers 74(9): 3099-3113 (2025)
[j2]Feng Wang
, Yuhang Cao, Li Liu
, Qi Kang
, Jun Chen:
Federated Graph Neural Networks With Equivalent Hypergraph Construction for Traffic Flow Prediction. IEEE Trans. Knowl. Data Eng. 37(11): 6420-6435 (2025)
[c27]Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model. ACL (Findings) 2025: 6547-6563
[c26]Yubo Ma, Jinsong Li, Yuhang Zang, Xiaobao Wu, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Jiaqi Wang, Yixin Cao, Aixin Sun:
Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings. ACL (Findings) 2025: 19568-19580
[c25]Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way. CVPR 2025: 12999-13008
[c24]Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang
, Feng Wu, Dahua Lin:
Conical Visual Concentration for Efficient Large Vision-Language Models. CVPR 2025: 14593-14603
[c23]Junbo Niu, Yifei Li, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang:
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? CVPR 2025: 18902-18913
[c22]Rui Qian, Shuangrui Ding, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang
:
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction. CVPR 2025: 24045-24055
[c21]Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. ICLR 2025
[c20]Zihan Liu, Shuangrui Ding, Zhixiong Zhang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation. ICML 2025
[c19]Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin:
VideoRoPE: What Makes for Good Video Rotary Position Embedding? ICML 2025
[i52]Rui Qian, Shuangrui Ding, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang
:
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction. CoRR abs/2501.03218 (2025)
[i51]Beichen Zhang, Yuhong Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Haodong Duan, Yuhang Cao, Dahua Lin, Jiaqi Wang:
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning. CoRR abs/2501.03226 (2025)
[i50]Yifei Li, Junbo Niu, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang
:
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? CoRR abs/2501.05510 (2025)
[i49]Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang
:
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model. CoRR abs/2501.12368 (2025)
[i48]Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang
, Xipeng Qiu, Dahua Lin:
VideoRoPE: What Makes for Good Video Rotary Position Embedding? CoRR abs/2502.05173 (2025)
[i47]Yujie Zhou, Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Qidong Huang, Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Anyi Rao, Jiaqi Wang
, Li Niu:
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion. CoRR abs/2502.08590 (2025)
[i46]Zihan Liu, Shuangrui Ding, Zhixiong Zhang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang
:
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation. CoRR abs/2502.13128 (2025)
[i45]Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang
:
Visual-RFT: Visual Reinforcement Fine-Tuning. CoRR abs/2503.01785 (2025)
[i44]Jiazi Bu, Pengyang Ling, Yujie Zhou, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang
:
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance. CoRR abs/2504.06232 (2025)
[i43]Shengyuan Ding, Shenxi Wu, Xiangyu Zhao, Yuhang Zang, Haodong Duan, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Dahua Lin, Jiaqi Wang
:
MM-IFEngine: Towards Multimodal Instruction Following. CoRR abs/2504.07957 (2025)
[i42]Ziyu Liu, Yuhang Zang, Yushan Zou, Zijian Liang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang:
Visual Agentic Reinforcement Fine-Tuning. CoRR abs/2505.14246 (2025)
[i41]Yubo Ma, Jinsong Li, Yuhang Zang, Xiaobao Wu, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Jiaqi Wang
, Yixin Cao, Aixin Sun:
Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings. CoRR abs/2506.04997 (2025)
[i40]Long Xing, Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jinsong Li, Shuangrui Ding, Weiming Zhang, Nenghai Yu, Jiaqi Wang, Feng Wu, Dahua Lin:
ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing. CoRR abs/2506.19848 (2025)
[i39]Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Songxin He, Jianfan Lin, Junsong Tang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang:
SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction. CoRR abs/2507.15852 (2025)
[i38]Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Dahua Lin:
Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models. CoRR abs/2508.00819 (2025)
[i37]Zeyi Sun, Ziyu Liu, Yuhang Zang, Yuhang Cao, Xiaoyi Dong, Tong Wu, Dahua Lin, Jiaqi Wang:
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience. CoRR abs/2508.04700 (2025)
[i36]Lei Bai, Zhongrui Cai, Yuhang Cao, Maosong Cao, Weihan Cao, Chiyu Chen, Haojiong Chen, Kai Chen, Pengcheng Chen, Ying Chen, Yongkang Chen, Yu Cheng, Pei Chu, Tao Chu, Erfei Cui, Ganqu Cui, Long Cui, Ziyun Cui, Nianchen Deng, Ning Ding, Nanqing Dong, Peijie Dong, Shihan Dou, Sinan Du, Haodong Duan, Caihua Fan, Ben Gao, Changjiang Gao, Jianfei Gao, Songyang Gao, Yang Gao, Zhangwei Gao, Jiaye Ge, Qiming Ge, Lixin Gu, Yuzhe Gu, Aijia Guo, Qipeng Guo, Xu Guo, Conghui He, Junjun He, Yili Hong, Siyuan Hou, Caiyu Hu, Hanglei Hu, Jucheng Hu, Ming Hu, Zhouqi Hua, Haian Huang, Junhao Huang, Xu Huang, Zixian Huang, Zhe Jiang, Lingkai Kong, Linyang Li, Peiji Li, Pengze Li, Shuaibin Li, Tianbin Li, Wei Li, Yuqiang Li, Dahua Lin, Junyao Lin, Tianyi Lin, Zhishan Lin, Hongwei Liu, Jiangning Liu, Jiyao Liu, Junnan Liu, Kai Liu, Kaiwen Liu, Kuikun Liu, Shichun Liu, Shudong Liu, Wei Liu, Xinyao Liu, Yuhong Liu, Zhan Liu, Yinquan Lu, Haijun Lv, Hongxia Lv, Huijie Lv, Qitan Lv, Ying Lv, Chengqi Lyu, Chenglong Ma, Jianpeng Ma, Ren Ma, Runmin Ma, Runyuan Ma, Xinzhu Ma, Yichuan Ma, Zihan Ma, Sixuan Mi, Junzhi Ning, Wenchang Ning, Xinle Pang, Jiahui Peng, Runyu Peng
, Yu Qiao:
Intern-S1: A Scientific Multimodal Foundation Model. CoRR abs/2508.15763 (2025)
[i35]Zeyi Sun, Yuhang Cao, Jianze Liang, Qiushi Sun, Ziyu Liu, Zhixiong Zhang, Yuhang Zang, Xiaoyi Dong, Kai Chen, Dahua Lin, Jiaqi Wang:
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning. CoRR abs/2508.20096 (2025)
[i34]Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Jiaqi Wang, Xipeng Qiu, Dahua Lin:
SIM-CoT: Supervised Implicit Chain-of-Thought. CoRR abs/2509.20317 (2025)
[i33]Ziyu Liu, Yuhang Zang, Shengyuan Ding, Yuhang Cao, Xiaoyi Dong, Haodong Duan, Dahua Lin, Jiaqi Wang:
SPARK: Synergistic Policy And Reward Co-Evolving Framework. CoRR abs/2509.22624 (2025)
[i32]Long Xing, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jianze Liang, Qidong Huang, Jiaqi Wang, Feng Wu, Dahua Lin:
CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning. CoRR abs/2509.22647 (2025)
[i31]Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang:
2nd Place Report of MOSEv2 Challenge 2025: Concept Guided Video Object Segmentation via SeC. CoRR abs/2509.23838 (2025)
[i30]Yuhang Cao, Haojun Yan, Danya Yao:
OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction. CoRR abs/2509.24308 (2025)
[i29]Chang Liu, Henghui Ding, Kaining Ying, Lingyi Hong, Ning Xu, Linjie Yang, Yuchen Fan, Mingqi Gao, Jingkun Chen, Yunqi Miao, Gengshen Wu, Zhijin Qin, Jungong Han, Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Chang Soo Lim, Joonyoung Moon, Donghyeon Cho, Tingmin Li, Yixuan Li, Yang Yang, An Yan, Leilei Cao, Feng Lu, Ran Hong, Youhai Jiang, Fengjie Zhu, Yujie Xie, Hongyang Zhang, Zhihui Liu, Shihai Ruan, Quanzhu Niu, Dengxian Gong, Shihao Chen, Tao Zhang, Yikang Zhou, Haobo Yuan, Lu Qi, Xiangtai Li, Shunping Ji, Ran Hong, Feng Lu, Leilei Cao, An Yan, Alexey Nekrasov, Ali Athar, Daan de Geus, Alexander Hermans, Bastian Leibe:
LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation. CoRR abs/2510.11063 (2025)
[i28]Zihan Liu, Zhikang Niu, Qiuyang Xiao, Zhisheng Zheng, Ruoqi Yuan, Yuhang Zang, Yuhang Cao, Xiaoyi Dong, Jianze Liang, Xie Chen, Leilei Sun, Dahua Lin, Jiaqi Wang:
STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence. CoRR abs/2510.24693 (2025)
[i27]Yuhong Liu, Beichen Zhang, Yuhang Zang, Yuhang Cao, Long Xing, Xiaoyi Dong, Haodong Duan, Dahua Lin, Jiaqi Wang:
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning. CoRR abs/2510.27606 (2025)- 2024
[c18]Shaoyang Sun, Boyin Jin, Jiahang Lou, Jiangnan Li
, Yuhang Cao, Jingyuan Li, Chen Shen, Yuan Dai
, Wenbo Yin, Wai-Shing Luk, Lingli Wang:
MDCRA: A Reconfigurable Accelerator Framework for Multiple Dataflow Lanes. ASAP 2024: 133-134
[c17]Xiang Lyu, Yuhang Cao, Pengpeng Zou, Weilin Zhou:
Ximalaya ASDR System for ICASSP 2024 in-Car Multi-Channel (ICMC) ASR Challenge. ICASSP Workshops 2024: 29-30
[c16]Jiangyu Han, Federico Landini, Johan Rohdin, Mireia Díez, Lukás Burget, Yuhang Cao, Heng Lu, Jan Cernocký:
Diacorrect: Error Correction Back-End for Speaker Diarization. ICASSP 2024: 11181-11185
[c15]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. NeurIPS 2024
[i26]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang
, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024)
[i25]Yuhang Cao, Pan Zhang, Xiaoyi Dong, Dahua Lin, Jiaqi Wang:
DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models. CoRR abs/2402.14767 (2024)
[i24]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang
, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang
:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024)
[i23]Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Jiaqi Huang, Zunnan Xu, Xiu Li, Kehong Yuan, Yanyan Zu, Jiayao Ha, Qiong Gao, Licheng Jiao:
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results. CoRR abs/2406.11739 (2024)
[i22]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang
, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024)
[i21]Jiajun Xu, Qun Wang, Yuhang Cao, Baitao Zeng, Sicheng Liu:
A General-Purpose Device for Interaction with LLMs. CoRR abs/2408.10230 (2024)
[i20]Zihao Pan, Weibin Wu, Yuhang Cao, Zibin Zheng:
SCA: Highly Efficient Semantic-Consistent Unrestricted Adversarial Attack. CoRR abs/2410.02240 (2024)
[i19]Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang
:
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way. CoRR abs/2410.06241 (2024)
[i18]Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jiaqi Wang
, Dahua Lin, Weiming Zhang, Nenghai Yu:
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate. CoRR abs/2410.07167 (2024)
[i17]Shuangrui Ding, Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Yuwei Guo, Dahua Lin, Jiaqi Wang
:
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree. CoRR abs/2410.16268 (2024)
[i16]Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang
, Feng Wu, Dahua Lin:
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction. CoRR abs/2410.17247 (2024)
[i15]Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang
:
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models. CoRR abs/2410.17637 (2024)
[i14]Pan Zhang, Xiaoyi Dong, Yuhang Cao, Yuhang Zang, Rui Qian, Xilin Wei, Lin Chen, Yifei Li, Junbo Niu, Shuangrui Ding, Qipeng Guo, Haodong Duan, Xin Chen, Han Lv, Zheng Nie, Min Zhang, Bin Wang, Wenwei Zhang, Xinyue Zhang, Jiaye Ge, Wei Li, Jingwen Li, Zhongying Tu, Conghui He, Xingcheng Zhang, Kai Chen, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions. CoRR abs/2412.09596 (2024)- 2023
[c14]Xiang Lyu, Yuhang Cao, Qing Wang, Jingjing Yin, Yuguang Yang, Pengpeng Zou, Yanni Hu, Heng Lu:
PP-MET: A Real-World Personalized Prompt Based Meeting Transcription System. ASRU 2023: 1-8
[c13]Yuhang Cao, Yunhui Qiu, Xuchen Gao
, Qilong Zhu, Wenbo Yin, Lingli Wang:
E2-ACE: An Energy-Efficient Reconfigurable Crypto-Accelerator with Agile End-to-End Toolchain. ICFPT 2023: 296-297
[c12]Qilong Zhu, Yuhang Cao, Yunhui Qiu, Xuchen Gao
, Wenbo Yin, Lingli Wang:
A Dynamic Partial Reconfigurable CGRA Framework for Multi-Kernel Applications. ICFPT 2023: 298-299
[c11]Jiaqi Wang
, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang
, Conghui He, Dahua Lin:
V3Det: Vast Vocabulary Visual Detection Dataset. ICCV 2023: 19787-19797
[c10]Guofeng Yi
, Yuguang Yang
, Yu Pan
, Yuhang Cao
, Jixun Yao
, Xiang Lv
, Cunhang Fan
, Zhao Lv, Jianhua Tao, Shan Liang
, Heng Lu
:
Exploring the Power of Cross-Contextual Large Language Model in Mimic Emotion Prediction. MuSe@ACM Multimedia 2023: 19-26
[c9]Heng Xie
, Jizhou Cui
, Yuhang Cao
, Junjie Chen
, Jianhua Tao
, Cunhang Fan
, Xuefei Liu
, Zhengqi Wen
, Heng Lu
, Yuguang Yang
, Zhao Lv
, Yongwei Li
:
Multimodal Cross-Lingual Features and Weight Fusion for Cross-Cultural Humor Detection. MuSe@ACM Multimedia 2023: 51-57
[i13]Jiaqi Wang
, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin:
V3Det: Vast Vocabulary Visual Detection Dataset. CoRR abs/2304.03752 (2023)
[i12]Jiangyu Han, Federico Landini, Johan Rohdin, Mireia Díez, Lukás Burget, Yuhang Cao, Heng Lu, Jan Cernocký:
DiaCorrect: Error Correction Back-end For Speaker Diarization. CoRR abs/2309.08377 (2023)
[i11]Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang
, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023)
[i10]Xiang Lyu, Yuhang Cao, Qing Wang, Jingjing Yin, Yuguang Yang, Pengpeng Zou, Yanni Hu, Heng Lu:
PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System. CoRR abs/2309.16247 (2023)- 2022
[c8]Yunhui Qiu, Yuhang Cao, Yuan Dai
, Wenbo Yin, Lingli Wang:
TRAM: An Open-Source Template-based Reconfigurable Architecture Modeling Framework. FPL 2022: 61-69
[c7]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge. ICASSP 2022: 9166-9170
[i9]Maokui He, Xiang Lv, Weilin Zhou, Jingjing Yin, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Yuhang Cao, Heng Lu, Jun Du, Chin-Hui Lee:
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge. CoRR abs/2202.04855 (2022)
[i8]Yuhang Cao, Jiaqi Wang, Yiqi Lin, Dahua Lin:
MINI: Mining Implicit Novel Instances for Few-Shot Object Detection. CoRR abs/2205.03381 (2022)- 2021
[c6]Jiaqi Wang
, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CVPR 2021: 9695-9704
[c5]Yuhang Cao, Jiaqi Wang, Ying Jin, Tong Wu, Kai Chen, Ziwei Liu, Dahua Lin:
Few-Shot Object Detection via Association and DIscrimination. NeurIPS 2021: 16570-16581
[i7]Shijie Fang, Yuhang Cao, Xinjiang Wang, Kai Chen, Dahua Lin, Wayne Zhang
:
WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection. CoRR abs/2105.11293 (2021)
[i6]Yuhang Cao, Jiaqi Wang, Ying Jin, Tong Wu, Kai Chen, Ziwei Liu, Dahua Lin:
Few-Shot Object Detection via Association and DIscrimination. CoRR abs/2111.11656 (2021)- 2020
[c4]Yuhang Cao, Kai Chen, Chen Change Loy, Dahua Lin:
Prime Sample Attention in Object Detection. CVPR 2020: 11580-11588
[c3]Jiaqi Wang
, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin:
Side-Aware Boundary Localization for More Precise Object Detection. ECCV (4) 2020: 403-419
[i5]Kai Chen, Yuhang Cao, Chen Change Loy, Dahua Lin, Christoph Feichtenhofer:
Feature Pyramid Grids. CoRR abs/2004.03580 (2020)
[i4]Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CoRR abs/2008.10032 (2020)
2010 – 2019
- 2019
[j1]Feng Guo
, Yuhang Cao, Zhaoqiong Huang, Xing You, Haixing Guan, Jiaen Liang, Baoqing Li:
Speaker Direction-of-Arrival Estimation Based on Orthogonal Dipoles. Circuits Syst. Signal Process. 38(5): 2320-2334 (2019)
[c2]Yun Liu, Hui Zhang, Xueliang Zhang, Yuhang Cao:
Investigation of Cost Function for Supervised Monaural Speech Separation. INTERSPEECH 2019: 3178-3182
[i3]Yuhang Cao, Kai Chen, Chen Change Loy, Dahua Lin:
Prime Sample Attention in Object Detection. CoRR abs/1904.04821 (2019)
[i2]Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin:
MMDetection: Open MMLab Detection Toolbox and Benchmark. CoRR abs/1906.07155 (2019)
[i1]Jiaqi Wang, Wenwei Zhang
, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin:
Side-Aware Boundary Localization for More Precise Object Detection. CoRR abs/1912.04260 (2019)- 2017
[c1]Feng Guo, Yuhang Cao, Zheng Liu
, Jiaen Liang, Baoqing Li, Xiaobing Yuan:
Speaker Direction-of-Arrival Estimation Based on Frequency-Independent Beampattern. INTERSPEECH 2017: 1899-1903
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-12-09 01:43 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







