default search action
Zuxuan Wu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j16]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. ACM Comput. Surv. 57(2): 41:1-41:42 (2025) - 2024
- [j15]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Adaptive Cross-Modal Transferable Adversarial Attacks From Images to Videos. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 3772-3783 (2024) - [j14]Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang:
Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data. IEEE Trans. Pattern Anal. Mach. Intell. 46(7): 4747-4762 (2024) - [j13]Zejia Weng, Zuxuan Wu, Hengduo Li, Jingjing Chen, Yu-Gang Jiang:
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition. ACM Trans. Multim. Comput. Commun. Appl. 20(2): 35:1-35:18 (2024) - [c85]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CVPR 2024: 7827-7839 - [c84]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CVPR 2024: 7882-7891 - [c83]Wujian Peng, Sicheng Xie, Zuyao You, Shiyi Lan, Zuxuan Wu:
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding. CVPR 2024: 13279-13288 - [c82]Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
OmniViD: A Generative Framework for Universal Video Understanding. CVPR 2024: 18209-18220 - [c81]Zhenxin Li, Shiyi Lan, José M. Álvarez, Zuxuan Wu:
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection. CVPR 2024: 20113-20123 - [c80]Yang Luo, Zhineng Chen, Peng Zhou, Zuxuan Wu, Xieping Gao, Yu-Gang Jiang:
Learning to Rank Patches for Unbiased Image Redundancy Reduction. CVPR 2024: 22831-22840 - [c79]Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Tao Mei:
DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation. ECCV (59) 2024: 162-178 - [c78]Haoran Chen, Zuxuan Wu, Xintong Han, Menglin Jia, Yu-Gang Jiang:
PromptFusion: Decoupling Stability and Plasticity for Continual Learning. ECCV (87) 2024: 196-212 - [c77]Lingchen Meng, Shiyi Lan, Hengduo Li, José M. Álvarez, Zuxuan Wu, Yu-Gang Jiang:
SegIC: Unleashing the Emergent Correspondence for In-Context Segmentation. ECCV (38) 2024: 203-220 - [c76]Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
MagDiff: Multi-alignment Diffusion for High-Fidelity Video Generation and Editing. ECCV (18) 2024: 205-221 - [c75]Bingwen Zhu, Fanyi Wang, Tianyi Lu, Peng Liu, Jingwen Su, Jinxiu Liu, Yanhao Zhang, Zuxuan Wu, Guo-Jun Qi, Yu-Gang Jiang:
Zero-shot High-fidelity and Pose-controllable Character Animation. IJCAI 2024: 1788-1797 - [c74]Tianyi Lu, Xing Zhang, Jiaxi Gu, Renjing Pei, Songcen Xu, Xingjun Ma, Hang Xu, Zuxuan Wu:
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models. ACM Multimedia 2024: 6745-6754 - [c73]Yifeng Gao, Yuhua Sun, Xingjun Ma, Zuxuan Wu, Yu-Gang Jiang:
ModelLock: Locking Your Model With a Spell. ACM Multimedia 2024: 11156-11165 - [i118]Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
Secrets of RLHF in Large Language Models Part II: Reward Modeling. CoRR abs/2401.06080 (2024) - [i117]Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
MouSi: Poly-Visual-Expert Vision-Language Models. CoRR abs/2401.17221 (2024) - [i116]Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang:
FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model. CoRR abs/2403.10242 (2024) - [i115]Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
OmniVid: A Generative Framework for Universal Video Understanding. CoRR abs/2403.17935 (2024) - [i114]Yang Luo, Zhineng Chen, Peng Zhou, Zuxuan Wu, Xieping Gao, Yu-Gang Jiang:
Learning to Rank Patches for Unbiased Image Redundancy Reduction. CoRR abs/2404.00680 (2024) - [i113]Bingwen Zhu, Fanyi Wang, Tianyi Lu, Peng Liu, Jingwen Su, Jinxiu Liu, Yanhao Zhang, Zuxuan Wu, Yu-Gang Jiang, Guo-Jun Qi:
PoseAnimate: Zero-shot high fidelity pose controllable character animation. CoRR abs/2404.13680 (2024) - [i112]Haoran Chen, Micah Goldblum, Zuxuan Wu, Yu-Gang Jiang:
Adaptive Rentention & Correction for Continual Learning. CoRR abs/2405.14318 (2024) - [i111]Yifeng Gao, Yuhua Sun, Xingjun Ma, Zuxuan Wu, Yu-Gang Jiang:
ModelLock: Locking Your Model With a Spell. CoRR abs/2405.16285 (2024) - [i110]Shuyuan Tu, Qi Dai, Zihao Zhang, Sicheng Xie, Zhi-Qi Cheng, Chong Luo, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion. CoRR abs/2405.20325 (2024) - [i109]Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments. CoRR abs/2406.04151 (2024) - [i108]Lingchen Meng, Jianwei Yang, Rui Tian, Xiyang Dai, Zuxuan Wu, Jianfeng Gao, Yu-Gang Jiang:
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs. CoRR abs/2406.04334 (2024) - [i107]Zhen Xing, Qi Dai, Zejia Weng, Zuxuan Wu, Yu-Gang Jiang:
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction. CoRR abs/2406.06465 (2024) - [i106]Zhenxin Li, Kailin Li, Shihao Wang, Shiyi Lan, Zhiding Yu, Yishen Ji, Zhiqi Li, Ziyue Zhu, Jan Kautz, Zuxuan Wu, Yu-Gang Jiang, José M. Álvarez:
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation. CoRR abs/2406.06978 (2024) - [i105]Xing Zhang, Jiaxi Gu, Haoyu Zhao, Shicong Wang, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu, Yu-Gang Jiang:
AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding. CoRR abs/2406.07091 (2024) - [i104]Miaosen Zhang, Yixuan Wei, Zhen Xing, Yifei Ma, Zuxuan Wu, Ji Li, Zheng Zhang, Qi Dai, Chong Luo, Xin Geng, Baining Guo:
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms. CoRR abs/2406.09397 (2024) - [i103]Junke Wang, Yi Jiang, Zehuan Yuan, Bingyue Peng, Zuxuan Wu, Yu-Gang Jiang:
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation. CoRR abs/2406.09399 (2024) - [i102]Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Jiaqi Huang, Zunnan Xu, Xiu Li, Kehong Yuan, Yanyan Zu, Jiayao Ha, Qiong Gao, Licheng Jiao:
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results. CoRR abs/2406.11739 (2024) - [i101]Weijie Zheng, Xingjun Ma, Hanxun Huang, Zuxuan Wu, Yu-Gang Jiang:
Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers. CoRR abs/2408.01705 (2024) - [i100]Zejia Weng, Xitong Yang, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang:
GenRec: Unifying Video Generation and Recognition with Diffusion Models. CoRR abs/2408.15241 (2024) - [i99]Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Tao Mei:
DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation. CoRR abs/2409.07454 (2024) - [i98]Zhengfu He, Wentao Shu, Xuyang Ge, Lingjie Chen, Junxuan Wang, Yunhua Zhou, Frances Liu, Qipeng Guo, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang, Xipeng Qiu:
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders. CoRR abs/2410.20526 (2024) - [i97]Rui Tian, Qi Dai, Jianmin Bao, Kai Qiu, Yifan Yang, Chong Luo, Zuxuan Wu, Yu-Gang Jiang:
REDUCIO! Generating 1024⨉1024 Video within 16 Seconds using Extremely Compressed Motion Latents. CoRR abs/2411.13552 (2024) - [i96]Zhiheng Xi, Dingwen Yang, Jixuan Huang, Jiafu Tang, Guanyu Li, Yiwen Ding, Wei He, Boyang Hong, Shihan Dou, Wenyu Zhan, Xiao Wang, Rui Zheng, Tao Ji, Xiaowei Shi, Yitao Zhai, Rongxiang Weng, Jingang Wang, Xunliang Cai, Tao Gui, Zuxuan Wu, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Yu-Gang Jiang:
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision. CoRR abs/2411.16579 (2024) - [i95]Shuyuan Tu, Zhen Xing, Xintong Han, Zhi-Qi Cheng, Qi Dai, Chong Luo, Zuxuan Wu:
StableAnimator: High-Quality Identity-Preserving Human Image Animation. CoRR abs/2411.17697 (2024) - [i94]Zhihao Sun, Haoran Jiang, Haoran Chen, Yixin Cao, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang:
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection. CoRR abs/2411.19466 (2024) - [i93]Wujian Peng, Lingchen Meng, Yitong Chen, Yiweng Xie, Yang Liu, Tao Gui, Hang Xu, Xipeng Qiu, Zuxuan Wu, Yu-Gang Jiang:
Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning. CoRR abs/2412.03565 (2024) - [i92]Hui Zhang, Dexiang Hong, Tingwei Gao, Yitong Wang, Jie Shao, Xinglong Wu, Zuxuan Wu, Yu-Gang Jiang:
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation. CoRR abs/2412.03859 (2024) - 2023
- [j12]Tianyi Liu, Zuxuan Wu, Jingjing Chen, Yu-Gang Jiang:
Multimodal Pre-training Method for Vision-language Understanding and Generation. Int. J. Softw. Informatics 13(2): 143-155 (2023) - [j11]Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang, Larry S. Davis:
Towards Transferable Adversarial Attacks on Image and Video Transformers. IEEE Trans. Image Process. 32: 6346-6358 (2023) - [j10]Rui Wang, Zuxuan Wu, Zejia Weng, Jingjing Chen, Guo-Jun Qi, Yu-Gang Jiang:
Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation. IEEE Trans. Multim. 25: 1665-1673 (2023) - [j9]Junke Wang, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang:
FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting. IEEE Trans. Multim. 25: 2382-2392 (2023) - [j8]Fan Luo, Shaoxiang Chen, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Self-Supervised Learning for Semi-Supervised Temporal Language Grounding. IEEE Trans. Multim. 25: 7747-7757 (2023) - [c72]Bingchen Huang, Zhineng Chen, Peng Zhou, Jiayin Chen, Zuxuan Wu:
Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning. AAAI 2023: 908-916 - [c71]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang:
Look Before You Match: Instance Understanding Matters in Video Object Segmentation. CVPR 2023: 2268-2278 - [c70]Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava:
Towards Scalable Neural Representation for Diverse Videos. CVPR 2023: 6132-6142 - [c69]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang:
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. CVPR 2023: 6312-6322 - [c68]Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding. CVPR 2023: 11402-11411 - [c67]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Enhancing the Self-Universality for Transferable Targeted Attacks. CVPR 2023: 12281-12290 - [c66]Hui Zhang, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang:
Prototypical Residual Networks for Anomaly Detection and Localization. CVPR 2023: 16281-16291 - [c65]Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
SVFormer: Semi-supervised Video Transformer for Action Recognition. CVPR 2023: 18816-18826 - [c64]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CVPR 2023: 22721-22731 - [c63]Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, Anima Anandkumar:
Vision Transformers are Good Mask Auto-Labelers. CVPR 2023: 23745-23755 - [c62]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. ICCV 2023: 19879-19890 - [c61]Yiqiang Lv, Jingjing Chen, Zhipeng Wei, Kai Chen, Zuxuan Wu, Yu-Gang Jiang:
Downstream Task-agnostic Transferable Attacks on Language-Image Pre-training Models. ICME 2023: 2831-2836 - [c60]Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang:
Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization. ICML 2023: 36978-36989 - [c59]Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos. ACM Multimedia 2023: 698-708 - [c58]Yilun Zhang, Yuqian Fu, Xingjun Ma, Lizhe Qi, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
On the Importance of Spatial Relations for Few-shot Action Recognition. ACM Multimedia 2023: 2243-2251 - [c57]Haoran Chen, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
Multi-Prompt Alignment for Multi-Source Unsupervised Domain Adaptation. NeurIPS 2023 - [c56]Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection. NeurIPS 2023 - [i91]Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, Anima Anandkumar:
Vision Transformers Are Good Mask Auto-Labelers. CoRR abs/2301.03992 (2023) - [i90]Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang:
Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization. CoRR abs/2302.00624 (2023) - [i89]Haoran Chen, Zuxuan Wu, Xintong Han, Menglin Jia, Yu-Gang Jiang:
PromptFusion: Decoupling Stability and Plasticity for Continual Learning. CoRR abs/2303.07223 (2023) - [i88]Hui Zhang, Zheng Wang, Zuxuan Wu, Yu-Gang Jiang:
DiffusionAD: Denoising Diffusion for Anomaly Detection. CoRR abs/2303.08730 (2023) - [i87]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang:
OmniTracker: Unifying Object Tracking by Tracking-with-Detection. CoRR abs/2303.12079 (2023) - [i86]Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava:
Towards Scalable Neural Representation for Diverse Videos. CoRR abs/2303.14124 (2023) - [i85]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. CoRR abs/2304.10465 (2023) - [i84]Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System. CoRR abs/2304.14407 (2023) - [i83]Wujian Peng, Zejia Weng, Hengduo Li, Zuxuan Wu:
BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning. CoRR abs/2305.12912 (2023) - [i82]Wenfeng Yan, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang:
Prompting Large Language Models to Reformulate Queries for Moment Localization. CoRR abs/2306.03422 (2023) - [i81]Yilun Zhang, Yuqian Fu, Xingjun Ma, Lizhe Qi, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
On the Importance of Spatial Relations for Few-shot Action Recognition. CoRR abs/2308.07119 (2023) - [i80]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CoRR abs/2308.09710 (2023) - [i79]Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu:
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation. CoRR abs/2309.03549 (2023) - [i78]Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang:
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data. CoRR abs/2310.05010 (2023) - [i77]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. CoRR abs/2310.10647 (2023) - [i76]Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection. CoRR abs/2310.12152 (2023) - [i75]Tianyi Lu, Xing Zhang, Jiaxi Gu, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu:
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models. CoRR abs/2310.16400 (2023) - [i74]Junke Wang, Lingchen Meng, Zejia Weng, Bo He, Zuxuan Wu, Yu-Gang Jiang:
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning. CoRR abs/2311.07574 (2023) - [i73]Lingchen Meng, Shiyi Lan, Hengduo Li, José M. Álvarez, Zuxuan Wu, Yu-Gang Jiang:
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation. CoRR abs/2311.14671 (2023) - [i72]Hui Zhang, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang:
AdaDiff: Adaptive Step Selection for Fast Diffusion. CoRR abs/2311.14768 (2023) - [i71]Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model. CoRR abs/2311.17338 (2023) - [i70]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CoRR abs/2311.18830 (2023) - [i69]Zhen Xing, Qi Dai, Zihao Zhang, Hui Zhang, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models. CoRR abs/2311.18837 (2023) - [i68]Wujian Peng, Sicheng Xie, Zuyao You, Shiyi Lan, Zuxuan Wu:
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding. CoRR abs/2312.00081 (2023) - [i67]Zhenxin Li, Shiyi Lan, José M. Álvarez, Zuxuan Wu:
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection. CoRR abs/2312.01696 (2023) - 2022
- [j7]Zuxuan Wu, Hengduo Li, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis:
A Dynamic Frame Selection Framework for Fast Video Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 44(4): 1699-1711 (2022) - [j6]Xing Zhang, Zuxuan Wu, Yu-Gang Jiang:
SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition. IEEE Trans. Multim. 24: 313-322 (2022) - [j5]Xue Song, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval. IEEE Trans. Multim. 24: 2914-2923 (2022) - [c55]Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Attacking Video Recognition Models with Bullet-Screen Comments. AAAI 2022: 312-320 - [c54]Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis:
Rethinking Pseudo Labels for Semi-supervised Object Detection. AAAI 2022: 1314-1322 - [c53]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Boosting the Transferability of Video Adversarial Examples via Temporal Translation. AAAI 2022: 2659-2667 - [c52]Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang:
Towards Transferable Adversarial Attacks on Vision Transformers. AAAI 2022: 2668-2676 - [c51]Kezhi Kong, Guohao Li, Mucong Ding, Zuxuan Wu, Chen Zhu, Bernard Ghanem, Gavin Taylor, Tom Goldstein:
Robust Optimization as Data Augmentation for Large-scale Graphs. CVPR 2022: 60-69 - [c50]Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang:
ObjectFormer for Image Manipulation Detection and Localization. CVPR 2022: 2354-2363 - [c49]Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim:
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition. CVPR 2022: 12299-12308 - [c48]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan:
BEVT: BERT Pretraining of Video Transformers. CVPR 2022: 14713-14723 - [c47]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Cross-Modal Transferable Adversarial Attacks from Images to Videos. CVPR 2022: 15044-15053 - [c46]Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang:
Efficient Video Transformers with Spatial-Temporal Token Selection. ECCV (35) 2022: 69-86 - [c45]Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang:
Semi-supervised Single-View 3D Reconstruction via Prototype Shape Priors. ECCV (1) 2022: 535-551 - [c44]Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang:
Semi-supervised Vision Transformers. ECCV (30) 2022: 605-620 - [c43]