default search action

combined dblp search
author search
venue search
publication search

ask others

Fangxun Shu

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HeFLWXSWZYLHGJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HeFLWXSWZYLHGJ25
Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. AAAI 2025: 17123-17131
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XiaoHGHLYSJZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XiaoHGHLYSJZ25
Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Fangxun Shu, Hao Jiang, Linchao Zhu:
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback. AAAI 2025: 25543-25551
[c5]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/HuangHLWLYSDJ0G25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangHLWLYSDJ0G25
Ziwei Huang, Wanggui He, Quanyu Long, Yandi Wang, Haoyuan Li, Zhelun Yu, Fangxun Shu, Weilong Dai, Hao Jiang, Fei Wu, Leilei Gan:
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts. ACL (1) 2025: 27501-27524
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ShuZJX25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ShuZJX25
Fangxun Shu, Lei Zhang, Hao Jiang, Cihang Xie:
Audio-Visual LLM for Video Understanding. ICCVW 2025: 4305-4314
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DiYZLZCLHSJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DiYZLZCLHSJ25
Shangzhe Di, Zhelun Yu, Guanghao Zhang, Haoyuan Li, Tao Zhong, Hao Cheng, Bolin Li, Wanggui He, Fangxun Shu, Hao Jiang:
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval. ICLR 2025
[c2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/RenLTWSZMYWWYX25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/RenLTWSZMYWWYX25
Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan L. Yuille, Cihang Xie:
Autoregressive Pretraining with Mamba in Vision. ICLR 2025
[c1]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ShuLZZXZSCZYHFL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ShuLZZXZSCZYHFL25
Fangxun Shu, Yue Liao, Lei Zhang, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chan, Tao Zhong, Zhelun Yu, Wanggui He, Siming Fu, Haoyuan Li, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation. ICLR 2025
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-00540
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-00540
Shangzhe Di, Zhelun Yu, Guanghao Zhang, Haoyuan Li, Tao Zhong, Hao Cheng, Bolin Li, Wanggui He, Fangxun Shu, Hao Jiang:
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval. CoRR abs/2503.00540 (2025)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-01298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-01298
Yi Wang, Mushui Liu, Wanggui He, Longxiang Zhang, Ziwei Huang, Guanghao Zhang, Fangxun Shu, Tao Zhong, Dong She, Zhelun Yu, Haoyuan Li, Weilong Dai, Mingli Song, Jie Song, Hao Jiang:
MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation. CoRR abs/2503.01298 (2025)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-05255
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-05255
Guanghao Zhang, Tao Zhong, Yan Xia, Zhelun Yu, Haoyuan Li, Wanggui He, Fangxun Shu, Mushui Liu, Dong She, Yi Wang, Hao Jiang:
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation. CoRR abs/2503.05255 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-18458
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-18458
Wenyi Xiao, Leilei Gan, Weilong Dai, Wanggui He, Ziwei Huang, Haoyuan Li, Fangxun Shu, Zhelun Yu, Peng Zhang, Hao Jiang, Fei Wu:
Fast-Slow Thinking for Large Vision-Language Model Reasoning. CoRR abs/2504.18458 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-14033
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-14033
Weijie Yin, Yongjie Ye, Fangxun Shu, Yue Liao, Zijian Kang, Hongyuan Dong, Haiyang Yu, Dingkang Yang, Jiacong Wang, Han Wang, Wenzhuo Liu, Xiao Liang, Shuicheng Yan, Chao Feng:
SAIL-VL2 Technical Report. CoRR abs/2509.14033 (2025)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-02280
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-02280
Fangxun Shu, Yongjie Ye, Yue Liao, Zijian Kang, Weijie Yin, Jiacong Wang, Xiao Liang, Shuicheng Yan, Chao Feng:
SAIL-RL: Guiding MLLMs in When and How to Think via Dual-Reward RL Tuning. CoRR abs/2511.02280 (2025)
2024
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ShuCLWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ShuCLWL24
Fangxun Shu, Biaolong Chen, Yue Liao, Jinqiao Wang, Si Liu:
MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval. IEEE Trans. Multim. 26: 9962-9972 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-13447
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-13447
Wenqiao Zhang, Tianwei Lin, Jiang Liu, Fangxun Shu, Haoyuan Li, Lei Zhang, Wanggui He, Hao Zhou, Zheqi Lv, Hao Jiang, Juncheng Li, Siliang Tang, Yueting Zhuang:
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models. CoRR abs/2403.13447 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07537
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07537
Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan L. Yuille, Cihang Xie:
Autoregressive Pretraining with Mamba in Vision. CoRR abs/2406.07537 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-07614
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-07614
Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. CoRR abs/2407.07614 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-15881
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-15881
Fangxun Shu, Yue Liao, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation. CoRR abs/2408.15881 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-03137
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-03137
Chenning Xu, Fangxun Shu, Dian Jin, Jinghao Wei, Hao Jiang:
SAG: Style-Aligned Article Generation via Model Collaboration. CoRR abs/2410.03137 (2024)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-04300
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-04300
Ziwei Huang, Wanggui He, Quanyu Long, Yandi Wang, Haoyuan Li, Zhelun Yu, Fangxun Shu, Long Chan, Hao Jiang, Leilei Gan, Fei Wu:
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts. CoRR abs/2412.04300 (2024)
2023
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-06720
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-06720
Fangxun Shu, Lei Zhang, Hao Jiang, Cihang Xie:
Audio-Visual LLM for Video Understanding. CoRR abs/2312.06720 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-06726
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-06726
Lei Zhang, Fangxun Shu, Sucheng Ren, Bingchen Zhao, Hao Jiang, Cihang Xie:
Compress & Align: Curating Image-Text Data with Human Knowledge. CoRR abs/2312.06726 (2023)
2022
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-00986
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-00986
Fangxun Shu, Biaolong Chen, Yue Liao, Shuwen Xiao, Wenyu Sun, Xiaobo Li, Yousong Zhu, Jinqiao Wang, Si Liu:
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval. CoRR abs/2212.00986 (2022)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.