


Xihan Wei
2020 – today
- 2025
[j1]Shenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng:
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection. IEEE Trans. Multim. 27: 8835-8846 (2025)
[c7]Shenghao Fu, Qize Yang, Qijie Mo, Junkai Yan, Xihan Wei, Jingke Meng, Xiaohua Xie, Wei-Shi Zheng:
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models. CVPR 2025: 14987-14997
[c6]Yi-Xing Peng, Yu-Ming Tang, Kun-Yu Lin, Qize Yang, Jingke Meng, Xihan Wei, Wei-Shi Zheng:
Person De-reidentification: A Variation-guided Identity Shift Modeling. CVPR 2025: 29331-29341
[i20]Jiaxing Zhao, Boyuan Sun, Xiang Chen, Xihan Wei, Qibin Hou:
LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding. CoRR abs/2501.05067 (2025)
[i19]Jiaxing Zhao, Boyuan Sun, Xiang Chen, Xihan Wei:
Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness. CoRR abs/2501.07978 (2025)
[i18]Qize Yang, Detao Bai, Yi-Xing Peng, Xihan Wei:
Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis. CoRR abs/2501.09502 (2025)
[i17]Jiaxing Zhao, Qize Yang, Yixing Peng, Detao Bai, Shimin Yao, Boyuan Sun, Xiang Chen, Shenghao Fu, Weixuan Chen, Xihan Wei, Liefeng Bo:
HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding. CoRR abs/2501.15111 (2025)
[i16]Shenghao Fu, Qize Yang, Qijie Mo, Junkai Yan, Xihan Wei, Jingke Meng, Xiaohua Xie, Wei-Shi Zheng:
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models. CoRR abs/2501.18954 (2025)
[i15]Jiaxing Zhao, Xihan Wei, Liefeng Bo:
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning. CoRR abs/2503.05379 (2025)
[i14]Shenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng:
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection. CoRR abs/2503.10152 (2025)
[i13]Shenghao Fu, Qize Yang, Yuan-Ming Li, Yi-Xing Peng, Kun-Yu Lin, Xihan Wei, Jian-Fang Hu, Xiaohua Xie, Wei-Shi Zheng:
ViSpeak: Visual Instruction Feedback in Streaming Videos. CoRR abs/2503.12769 (2025)
[i12]Yi-Xing Peng, Qize Yang, Yu-Ming Tang, Shenghao Fu, Kun-Yu Lin, Xihan Wei, Wei-Shi Zheng:
ActionArt: Advancing Multimodal Large Models for Fine-Grained Human-Centric Video Understanding. CoRR abs/2504.18152 (2025)
[i11]Detao Bai, Zhiheng Ma, Xihan Wei, Liefeng Bo:
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization. CoRR abs/2505.03186 (2025)
[i10]Qize Yang, Shimin Yao, Weixuan Chen, Shenghao Fu, Detao Bai, Jiaxing Zhao, Boyuan Sun, Bowen Yin, Xihan Wei, Jingren Zhou:
HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context. CoRR abs/2506.21277 (2025)
[i9]Boyuan Sun, Jiaxing Zhao, Xihan Wei, Qibin Hou:
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs. CoRR abs/2506.21862 (2025)
[i8]Shenghao Fu, Qize Yang, Yuan-Ming Li, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng:
LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning. CoRR abs/2509.24786 (2025)
[i7]Yuan-Ming Li, Qize Yang, Nan Lei, Shenghao Fu, Ling-An Zeng, Jian-Fang Hu, Xihan Wei, Wei-Shi Zheng:
IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation. CoRR abs/2512.10730 (2025)
- 2024
[c5]Junkai Yan, Yipeng Gao, Qize Yang, Xihan Wei, Xuansong Xie, Ancong Wu, Wei-Shi Zheng:
DreamView: Injecting View-Specific Text Guidance Into Text-to-3D Generation. ECCV (25) 2024: 358-374
[c4]Shenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng:
Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models. NeurIPS 2024
[i6]Junkai Yan, Yipeng Gao, Qize Yang, Xihan Wei, Xuansong Xie, Ancong Wu, Wei-Shi Zheng:
DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation. CoRR abs/2404.06119 (2024)
[i5]Shenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng:
Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models. CoRR abs/2410.19635 (2024)
- 2022
[c3]Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xian-Sheng Hua:
SP-ViT: Learning 2D Spatial Priors for Vision Transformers. BMVC 2022: 564
[c2]Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Xian-Sheng Hua, Lei Zhang:
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition. ECCV (3) 2022: 627-644
[i4]Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xiansheng Hua:
SP-ViT: Learning 2D Spatial Priors for Vision Transformers. CoRR abs/2206.07662 (2022)
[i3]Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Xian-Sheng Hua, Lei Zhang:
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition. CoRR abs/2207.13259 (2022)
- 2021
[c1]Qize Yang, Xihan Wei, Biao Wang, Xian-Sheng Hua, Lei Zhang:
Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection. CVPR 2021: 5941-5950
- 2020
[i2]Canyu Le, Zhonggui Chen, Xihan Wei, Biao Wang, Lei Zhang:
Continual Local Replacement for Few-shot Image Recognition. CoRR abs/2001.08366 (2020)
2010 – 2019
- 2019
[i1]Canyu Le, Xihan Wei, Biao Wang, Lei Zhang:
Learning Continually from Low-shot Data Stream. CoRR abs/1908.10223 (2019)
last updated on 2026-01-25 03:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license