default search action
Xiaoshuai Sun
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j58]Yinan Li, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yunpeng Luo, Rongrong Ji:
M3ixup: A multi-modal data augmentation approach for image captioning. Pattern Recognit. 158: 110941 (2025) - 2024
- [j57]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yongjian Wu, Yue Gao, Rongrong Ji:
Towards Language-Guided Visual Recognition via Dynamic Convolutions. Int. J. Comput. Vis. 132(1): 1-19 (2024) - [j56]Gen Luo, Yiyi Zhou, Jiamu Sun, Xiaoshuai Sun, Rongrong Ji:
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension. IEEE Trans. Multim. 26: 3689-3700 (2024) - [c124]Tianyu Guo, Haowei Wang, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation. AAAI 2024: 1985-1993 - [c123]Zhipeng Qian, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks. AAAI 2024: 4551-4559 - [c122]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. AAAI 2024: 5940-5948 - [c121]Mingrui Wu, Yuqi Liu, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Toward Open-Set Human Object Interaction Detection. AAAI 2024: 6066-6073 - [c120]Siyu Zou, Jiji Tang, Yiyi Zhou, Jing He, Chaoyi Zhao, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun:
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks. AAAI 2024: 7864-7872 - [c119]Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation. CVPR 2024: 26648-26658 - [c118]Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Multi-branch Collaborative Learning Network for 3D Visual Grounding. ECCV (46) 2024: 381-398 - [c117]Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, Rongrong Ji:
AnyTrans: Translate AnyText in the Image with Large Scale Models. EMNLP (Findings) 2024: 2432-2444 - [c116]Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:
Towards Omni-supervised Referring Expression Segmentation. ICME 2024: 1-6 - [c115]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. ICML 2024 - [c114]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. ICML 2024 - [c113]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models. ICML 2024 - [c112]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. ICML 2024 - [c111]Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Deep Instruction Tuning for Segment Anything Model. ACM Multimedia 2024: 905-914 - [c110]Ziyin Zhou, Ke Sun, Zhongxi Chen, Huafeng Kuang, Xiaoshuai Sun, Rongrong Ji:
StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model. ACM Multimedia 2024: 3627-3636 - [c109]Shengxin Chen, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Rongrong Ji:
QueryMatch: A Query-based Contrastive Learning Framework for Weakly Supervised Visual Grounding. ACM Multimedia 2024: 4177-4186 - [c108]Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. ACM Multimedia 2024: 7852-7861 - [i74]Siyu Zou, Jiji Tang, Yiyi Zhou, Jing He, Chaoyi Zhao, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun:
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks. CoRR abs/2401.07709 (2024) - [i73]Gen Luo, Yiyi Zhou, Yuxin Zhang, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models. CoRR abs/2403.03003 (2024) - [i72]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. CoRR abs/2403.06702 (2024) - [i71]Qiong Wu, Weihao Ye, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models. CoRR abs/2403.15226 (2024) - [i70]Zhongxi Chen, Ke Sun, Ziyin Zhou, Xianming Lin, Xiaoshuai Sun, Liujuan Cao, Rongrong Ji:
DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis. CoRR abs/2403.18471 (2024) - [i69]Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Deep Instruction Tuning for Segment Anything Model. CoRR abs/2404.00650 (2024) - [i68]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. CoRR abs/2405.00954 (2024) - [i67]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Xiaopeng Hong, Yongjian Wu, Rongrong Ji:
Image Captioning via Dynamic Path Customization. CoRR abs/2406.00334 (2024) - [i66]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. CoRR abs/2406.01451 (2024) - [i65]Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. CoRR abs/2406.05620 (2024) - [i64]Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, Rongrong Ji:
AnyTrans: Translate AnyText in the Image with Large Scale Models. CoRR abs/2406.11432 (2024) - [i63]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in LVLMs. CoRR abs/2406.16449 (2024) - [i62]Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model. CoRR abs/2407.05352 (2024) - [i61]Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Multi-branch Collaborative Learning Network for 3D Visual Grounding. CoRR abs/2407.05363 (2024) - [i60]Qiong Wu, Zhaoxi Ke, Yiyi Zhou, Gen Luo, Xiaoshuai Sun, Rongrong Ji:
Routing Experts: Learning to Route Dynamic Experts in Multi-modal Large Language Models. CoRR abs/2407.14093 (2024) - [i59]Yiwei Ma, Zhibin Wang, Xiaoshuai Sun, Weihuang Lin, Qiang Zhou, Jiayi Ji, Rongrong Ji:
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model. CoRR abs/2407.16198 (2024) - [i58]Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. CoRR abs/2407.20664 (2024) - [i57]Mingrui Wu, Xinyue Cai, Jiayi Ji, Jiale Li, Oucheng Huang, Gen Luo, Hao Fei, Xiaoshuai Sun, Rongrong Ji:
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models. CoRR abs/2407.21534 (2024) - [i56]Ziyin Zhou, Ke Sun, Zhongxi Chen, Huafeng Kuang, Xiaoshuai Sun, Rongrong Ji:
StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model. CoRR abs/2408.05669 (2024) - [i55]Mingrui Wu, Oucheng Huang, Jiayi Ji, Jiale Li, Xinyue Cai, Huafeng Kuang, Jianzhuang Liu, Xiaoshuai Sun, Rongrong Ji:
TraDiffusion: Trajectory-Based Training-Free Image Generation. CoRR abs/2408.09739 (2024) - [i54]Yiwei Ma, Jiayi Ji, Ke Ye, Weihuang Lin, Zhibin Wang, Yonghan Zheng, Qiang Zhou, Xiaoshuai Sun, Rongrong Ji:
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing. CoRR abs/2408.14180 (2024) - [i53]Ke Sun, Shen Chen, Taiping Yao, Hong Liu, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion. CoRR abs/2410.04372 (2024) - [i52]Yaxin Luo, Gen Luo, Jiayi Ji, Yiyi Zhou, Xiaoshuai Sun, Zhiqiang Shen, Rongrong Ji:
γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models. CoRR abs/2410.13859 (2024) - 2023
- [j55]Jinlu Zhang, Jing He, Yiyi Zhou, Xiaoshuai Sun, Xiao Yu:
HSM-QA: Question Answering System Based on Hierarchical Semantic Matching. IEEE Access 11: 77826-77839 (2023) - [j54]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji:
Towards local visual modeling for image captioning. Pattern Recognit. 138: 109420 (2023) - [j53]Jipeng Wu, Rongrong Ji, Qiang Wang, Shengchuan Zhang, Xiaoshuai Sun, Yan Wang, Mingliang Xu, Feiyue Huang:
Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement. IEEE Trans. Multim. 25: 1204-1216 (2023) - [j52]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Knowing What it is: Semantic-Enhanced Dual Attention Transformer. IEEE Trans. Multim. 25: 3723-3736 (2023) - [j51]Jiayi Ji, Xiaoyang Huang, Xiaoshuai Sun, Yiyi Zhou, Gen Luo, Liujuan Cao, Jianzhuang Liu, Ling Shao, Rongrong Ji:
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning. IEEE Trans. Multim. 25: 3962-3974 (2023) - [j50]Yiyi Zhou, Rongrong Ji, Gen Luo, Xiaoshuai Sun, Jinsong Su, Xinghao Ding, Chia-Wen Lin, Qi Tian:
A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension. IEEE Trans. Neural Networks Learn. Syst. 34(1): 134-143 (2023) - [c107]Haowei Wang, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Xiaoshuai Sun:
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network. AAAI 2023: 2528-2536 - [c106]Mingrui Wu, Jiaxin Gu, Yunhang Shen, Mingbao Lin, Chao Chen, Xiaoshuai Sun:
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation. AAAI 2023: 2839-2846 - [c105]Lei Jin, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji:
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension. CVPR 2023: 1-10 - [c104]Jingjia Huang, Yinan Li, Jiashi Feng, Xinglong Wu, Xiaoshuai Sun, Rongrong Ji:
Clover: Towards A Unified Video-Language Alignment and Fusion Model. CVPR 2023: 14856-14866 - [c103]Jiamu Sun, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Zhiyu Wang, Rongrong Ji:
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension. CVPR 2023: 19144-19154 - [c102]Yiwei Ma, Haowei Wang, Xiaoqing Zhang, Guannan Jiang, Xiaoshuai Sun, Weilin Zhuang, Jiayi Ji, Rongrong Ji:
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance. ICCV 2023: 2737-2748 - [c101]Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji:
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation. ACM Multimedia 2023: 3403-3414 - [c100]Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. ACM Multimedia 2023: 4157-4168 - [c99]Xiaoxiong Du, Jun Peng, Yiyi Zhou, Jinlu Zhang, Siting Chen, Guannan Jiang, Xiaoshuai Sun, Rongrong Ji:
PixelFace+: Towards Controllable Face Generation and Manipulation with Text Descriptions and Segmentation Masks. ACM Multimedia 2023: 4666-4677 - [c98]Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji:
Semi-Supervised Panoptic Narrative Grounding. ACM Multimedia 2023: 7164-7174 - [c97]Gen Luo, Yiyi Zhou, Tianhe Ren, Shengxin Chen, Xiaoshuai Sun, Rongrong Ji:
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models. NeurIPS 2023 - [c96]Qiong Wu, Wei Yu, Yiyi Zhou, Shubin Huang, Xiaoshuai Sun, Rongrong Ji:
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models. NeurIPS 2023 - [i51]Haowei Wang, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Xiaoshuai Sun:
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network. CoRR abs/2301.03160 (2023) - [i50]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji:
Towards Local Visual Modeling for Image Captioning. CoRR abs/2302.06098 (2023) - [i49]Gen Luo, Minglang Huang, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Zhiyu Wang, Rongrong Ji:
Towards Efficient Visual Adaption via Structural Re-parameterization. CoRR abs/2302.08106 (2023) - [i48]Gen Luo, Yiyi Zhou, Lei Jin, Xiaoshuai Sun, Rongrong Ji:
Towards End-to-end Semi-supervised Learning for One-stage Object Detection. CoRR abs/2302.11299 (2023) - [i47]Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji:
Active Teacher for Semi-Supervised Object Detection. CoRR abs/2303.08348 (2023) - [i46]Yiwei Ma, Xiaoqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance. CoRR abs/2303.15764 (2023) - [i45]Gen Luo, Yiyi Zhou, Tianhe Ren, Shengxin Chen, Xiaoshuai Sun, Rongrong Ji:
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models. CoRR abs/2305.15023 (2023) - [i44]Shubin Huang, Qiong Wu, Yiyi Zhou, Weijie Chen, Rongsheng Zhang, Xiaoshuai Sun, Rongrong Ji:
Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting. CoRR abs/2306.00409 (2023) - [i43]Peng Mi, Li Shen, Tianhe Ren, Yiyi Zhou, Tianshuo Xu, Xiaoshuai Sun, Tongliang Liu, Rongrong Ji, Dacheng Tao:
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer. CoRR abs/2306.17504 (2023) - [i42]Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
Towards General Visual-Linguistic Face Forgery Detection. CoRR abs/2307.16545 (2023) - [i41]Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji:
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation. CoRR abs/2308.02982 (2023) - [i40]Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
Continual Face Forgery Detection via Historical Distribution Preserving. CoRR abs/2308.06217 (2023) - [i39]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. CoRR abs/2308.16632 (2023) - [i38]Qiong Wu, Wei Yu, Yiyi Zhou, Shubin Huang, Xiaoshuai Sun, Rongrong Ji:
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models. CoRR abs/2309.01479 (2023) - [i37]Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji:
JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues. CoRR abs/2310.09503 (2023) - [i36]Haowei Wang, Jiayi Ji, Tianyu Guo, Yilong Yang, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning. CoRR abs/2310.10975 (2023) - [i35]Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji:
Semi-Supervised Panoptic Narrative Grounding. CoRR abs/2310.18142 (2023) - [i34]Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:
Towards Omni-supervised Referring Expression Segmentation. CoRR abs/2311.00397 (2023) - [i33]Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji:
X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation. CoRR abs/2312.00085 (2023) - [i32]Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation. CoRR abs/2312.12470 (2023) - 2022
- [j49]Tingting Han, Sicheng Zhao, Xiaoshuai Sun, Jun Yu:
Modeling long-term video semantic distribution for temporal action proposal generation. Neurocomputing 490: 217-225 (2022) - [j48]Yiyi Zhou, Rongrong Ji, Xiaoshuai Sun, Jinsong Su, Deyu Meng, Yue Gao, Chunhua Shen:
Plenty is Plague: Fine-Grained Learning for Visual Question Answering. IEEE Trans. Pattern Anal. Mach. Intell. 44(2): 697-709 (2022) - [j47]Mingbao Lin, Rongrong Ji, Xiaoshuai Sun, Baochang Zhang, Feiyue Huang, Yonghong Tian, Dacheng Tao:
Fast Class-Wise Updating for Online Hashing. IEEE Trans. Pattern Anal. Mach. Intell. 44(5): 2453-2467 (2022) - [j46]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks. IEEE Trans. Image Process. 31: 3386-3398 (2022) - [j45]Jiayi Ji, Yiwei Ma, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Rongrong Ji:
Knowing What to Learn: A Metric-Oriented Focal Mechanism for Image Captioning. IEEE Trans. Image Process. 31: 4321-4335 (2022) - [j44]Jun Peng, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis. IEEE Trans. Multim. 24: 4356-4366 (2022) - [c95]Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji:
Active Teacher for Semi-Supervised Object Detection. CVPR 2022: 14462-14471 - [c94]Mingrui Wu, Xuying Zhang, Xiaoshuai Sun, Yiyi Zhou, Chao Chen, Jiaxin Gu, Xing Sun, Rongrong Ji:
DIFNet: Boosting Visual Information Flow for Image Captioning. CVPR 2022: 17999-18008 - [c93]Ke Sun, Hong Liu, Taiping Yao, Xiaoshuai Sun, Shen Chen, Shouhong Ding, Rongrong Ji:
An Information Theoretic Approach for Attention-Driven Face Forgery Detection. ECCV (14) 2022: 111-127 - [c92]Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji:
SeqTR: A Simple Yet Universal Network for Visual Grounding. ECCV (35) 2022: 598-615 - [c91]Jing He, Yiyi Zhou, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji:
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation. ECCV (14) 2022: 643-660 - [c90]Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji:
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval. ACM Multimedia 2022: 638-647 - [c89]Jun Peng, Han Pan, Yiyi Zhou, Jing He, Xiaoshuai Sun, Yan Wang, Yongjian Wu, Rongrong Ji:
Towards Open-Ended Text-to-Face Generation, Combination and Manipulation. ACM Multimedia 2022: 5045-5054 - [c88]Jun Peng, Xiaoxiong Du, Yiyi Zhou, Jing He, Yunhang Shen, Xiaoshuai Sun, Rongrong Ji:
Learning Dynamic Prior Knowledge for Text-to-Face Pixel Synthesis. ACM Multimedia 2022: 5132-5141 - [c87]Peng Mi, Li Shen, Tianhe Ren, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji, Dacheng Tao:
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach. NeurIPS 2022 - [i31]Fuhai Chen, Xiaoshuai Sun, Xuri Ge, Jianzhuang Liu, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Differentiated Relevances Embedding for Group-based Referring Expression Comprehension. CoRR abs/2203.06382 (2022) - [i30]Chengpeng Dai, Fuhai Chen, Xiaoshuai Sun, Rongrong Ji, Qixiang Ye, Yongjian Wu:
Global2Local: A Joint-Hierarchical Attention for Video Captioning. CoRR abs/2203.06663 (2022) - [i29]Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji:
SeqTR: A Simple yet Universal Network for Visual Grounding. CoRR abs/2203.16265 (2022) - [i28]Jing He, Yiyi Zhou, Qi Zhang, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji:
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation. CoRR abs/2204.00833 (2022) - [i27]Mingrui Wu, Jiaxin Gu, Yunhang Shen, Mingbao Lin, Chao Chen, Xiaoshuai Sun, Rongrong Ji:
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation. CoRR abs/2204.03541 (2022) - [i26]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks. CoRR abs/2204.07780 (2022) - [i25]Gen Luo, Yiyi Zhou, Jiamu Sun, Shubin Huang, Xiaoshuai Sun, Qixiang Ye, Yongjian Wu, Rongrong Ji:
What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study. CoRR abs/2204.07913 (2022) - [i24]Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji:
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval. CoRR abs/2207.07285 (2022) - [i23]Jingjia Huang, Yinan Li, Jiashi Feng, Xiaoshuai Sun, Rongrong Ji:
Clover: Towards A Unified Video-Language Alignment and Fusion Model. CoRR abs/2207.07885 (2022) - [i22]Peng Mi, Li Shen, Tianhe Ren, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji, Dacheng Tao:
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach. CoRR abs/2210.05177 (2022) - 2021
- [j43]