


Fangxun Shu
Journal Articles
- 2024
- [j1]Fangxun Shu, Biaolong Chen, Yue Liao, Jinqiao Wang, Si Liu:
MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval. IEEE Trans. Multim. 26: 9962-9972 (2024)
Conference and Workshop Papers
- 2025
- [c3]Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. AAAI 2025: 17123-17131
- [c2]Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Fangxun Shu, Hao Jiang, Linchao Zhu:
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback. AAAI 2025: 25543-25551
- [c1]Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan L. Yuille, Cihang Xie:
Autoregressive Pretraining with Mamba in Vision. ICLR 2025
Informal and Other Publications
- 2025
- [i13]Shangzhe Di, Zhelun Yu, Guanghao Zhang, Haoyuan Li, Tao Zhong, Hao Cheng, Bolin Li, Wanggui He, Fangxun Shu, Hao Jiang:
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval. CoRR abs/2503.00540 (2025)
- [i12]Yi Wang, Mushui Liu, Wanggui He, Longxiang Zhang, Ziwei Huang, Guanghao Zhang, Fangxun Shu, Tao Zhong, Dong She, Zhelun Yu, Haoyuan Li, Weilong Dai, Mingli Song, Jie Song, Hao Jiang:
MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation. CoRR abs/2503.01298 (2025)
- [i11]Guanghao Zhang, Tao Zhong, Yan Xia, Zhelun Yu, Haoyuan Li, Wanggui He, Fangxun Shu, Mushui Liu, Dong She, Yi Wang, Hao Jiang:
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation. CoRR abs/2503.05255 (2025)
- [i10]Wenyi Xiao, Leilei Gan, Weilong Dai, Wanggui He, Ziwei Huang, Haoyuan Li, Fangxun Shu, Zhelun Yu, Peng Zhang, Hao Jiang, Fei Wu:
Fast-Slow Thinking for Large Vision-Language Model Reasoning. CoRR abs/2504.18458 (2025)
- 2024
- [i9]Wenqiao Zhang, Tianwei Lin, Jiang Liu, Fangxun Shu, Haoyuan Li, Lei Zhang, Wanggui He, Hao Zhou, Zheqi Lv, Hao Jiang, Juncheng Li, Siliang Tang, Yueting Zhuang:
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models. CoRR abs/2403.13447 (2024)
- [i8]Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan L. Yuille, Cihang Xie:
Autoregressive Pretraining with Mamba in Vision. CoRR abs/2406.07537 (2024)
- [i7]Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. CoRR abs/2407.07614 (2024)
- [i6]Fangxun Shu, Yue Liao, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation. CoRR abs/2408.15881 (2024)
- [i5]Chenning Xu, Fangxun Shu, Dian Jin, Jinghao Wei, Hao Jiang:
SAG: Style-Aligned Article Generation via Model Collaboration. CoRR abs/2410.03137 (2024)
- [i4]Ziwei Huang, Wanggui He, Quanyu Long, Yandi Wang, Haoyuan Li, Zhelun Yu, Fangxun Shu, Long Chan, Hao Jiang, Leilei Gan, Fei Wu:
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts. CoRR abs/2412.04300 (2024)
- 2023
- [i3]Fangxun Shu, Lei Zhang, Hao Jiang, Cihang Xie:
Audio-Visual LLM for Video Understanding. CoRR abs/2312.06720 (2023)
- [i2]Lei Zhang, Fangxun Shu, Sucheng Ren, Bingchen Zhao, Hao Jiang, Cihang Xie:
Compress & Align: Curating Image-Text Data with Human Knowledge. CoRR abs/2312.06726 (2023)
- 2022
- [i1]Fangxun Shu, Biaolong Chen, Yue Liao, Shuwen Xiao, Wenyu Sun, Xiaobo Li, Yousong Zhu, Jinqiao Wang, Si Liu:
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval. CoRR abs/2212.00986 (2022)
last updated on 2025-06-16 23:55 CEST by the dblp team
all metadata released as open data under CC0 1.0 license