Xiaoshuai Sun
Journal Articles
- 2025
- [j58]Yinan Li, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yunpeng Luo, Rongrong Ji:
M3ixup: A multi-modal data augmentation approach for image captioning. Pattern Recognit. 158: 110941 (2025) - 2024
- [j57]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yongjian Wu, Yue Gao, Rongrong Ji:
Towards Language-Guided Visual Recognition via Dynamic Convolutions. Int. J. Comput. Vis. 132(1): 1-19 (2024) - [j56]Gen Luo, Yiyi Zhou, Jiamu Sun, Xiaoshuai Sun, Rongrong Ji:
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension. IEEE Trans. Multim. 26: 3689-3700 (2024) - 2023
- [j55]Jinlu Zhang, Jing He, Yiyi Zhou, Xiaoshuai Sun, Xiao Yu:
HSM-QA: Question Answering System Based on Hierarchical Semantic Matching. IEEE Access 11: 77826-77839 (2023) - [j54]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji:
Towards local visual modeling for image captioning. Pattern Recognit. 138: 109420 (2023) - [j53]Jipeng Wu, Rongrong Ji, Qiang Wang, Shengchuan Zhang, Xiaoshuai Sun, Yan Wang, Mingliang Xu, Feiyue Huang:
Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement. IEEE Trans. Multim. 25: 1204-1216 (2023) - [j52]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Knowing What it is: Semantic-Enhanced Dual Attention Transformer. IEEE Trans. Multim. 25: 3723-3736 (2023) - [j51]Jiayi Ji, Xiaoyang Huang, Xiaoshuai Sun, Yiyi Zhou, Gen Luo, Liujuan Cao, Jianzhuang Liu, Ling Shao, Rongrong Ji:
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning. IEEE Trans. Multim. 25: 3962-3974 (2023) - [j50]Yiyi Zhou, Rongrong Ji, Gen Luo, Xiaoshuai Sun, Jinsong Su, Xinghao Ding, Chia-Wen Lin, Qi Tian:
A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension. IEEE Trans. Neural Networks Learn. Syst. 34(1): 134-143 (2023) - 2022
- [j49]Tingting Han, Sicheng Zhao, Xiaoshuai Sun, Jun Yu:
Modeling long-term video semantic distribution for temporal action proposal generation. Neurocomputing 490: 217-225 (2022) - [j48]Yiyi Zhou, Rongrong Ji, Xiaoshuai Sun, Jinsong Su, Deyu Meng, Yue Gao, Chunhua Shen:
Plenty is Plague: Fine-Grained Learning for Visual Question Answering. IEEE Trans. Pattern Anal. Mach. Intell. 44(2): 697-709 (2022) - [j47]Mingbao Lin, Rongrong Ji, Xiaoshuai Sun, Baochang Zhang, Feiyue Huang, Yonghong Tian, Dacheng Tao:
Fast Class-Wise Updating for Online Hashing. IEEE Trans. Pattern Anal. Mach. Intell. 44(5): 2453-2467 (2022) - [j46]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks. IEEE Trans. Image Process. 31: 3386-3398 (2022) - [j45]Jiayi Ji, Yiwei Ma, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Rongrong Ji:
Knowing What to Learn: A Metric-Oriented Focal Mechanism for Image Captioning. IEEE Trans. Image Process. 31: 4321-4335 (2022) - [j44]Jun Peng, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis. IEEE Trans. Multim. 24: 4356-4366 (2022) - 2021
- [j43]Ying Zheng, Hongxun Yao, Xiaoshuai Sun, Shengping Zhang, Sicheng Zhao, Fatih Porikli:
Sketch-specific data augmentation for freehand sketch recognition. Neurocomputing 456: 528-539 (2021) - [j42]Xiawu Zheng, Yang Zhang, Sirui Hong, Huixia Li, Lang Tang, Youcheng Xiong, Jin Zhou, Yan Wang, Xiaoshuai Sun, Pengfei Zhu, Chenglin Wu, Rongrong Ji:
Evolving Fully Automated Machine Learning via Life-Long Knowledge Anchors. IEEE Trans. Pattern Anal. Mach. Intell. 43(9): 3091-3107 (2021) - [j41]Ying Zheng, Hongxun Yao, Xiaoshuai Sun:
Deep Semantic Parsing of Freehand Sketches With Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning. IEEE Trans. Multim. 23: 3590-3602 (2021) - 2020
- [j40]Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Shen Chen, Qi Tian:
Hadamard Matrix Guided Online Hashing. Int. J. Comput. Vis. 128(8): 2279-2306 (2020) - [j39]Tingting Han, Hongxun Yao, Xiaoshuai Sun, Wenlong Xie, Sicheng Zhao, Wei Yu:
Actionness-pooled Deep-convolutional Descriptor for fine-grained action recognition. Neurocomputing 398: 442-452 (2020) - [j38]Chen Wang, Shifan Zhu, Desheng Lyu, Xiaoshuai Sun:
What is damaged: a benchmark dataset for abnormal traffic object classification. Multim. Tools Appl. 79(25-26): 18481-18494 (2020) - [j37]Rongrong Ji, Ke Li, Yan Wang, Xiaoshuai Sun, Feng Guo, Xiaowei Guo, Yongjian Wu, Feiyue Huang, Jiebo Luo:
Semi-Supervised Adversarial Monocular Depth Estimation. IEEE Trans. Pattern Anal. Mach. Intell. 42(10): 2410-2422 (2020) - [j36]Tingting Han, Hongxun Yao, Wenlong Xie, Xiaoshuai Sun, Sicheng Zhao, Jun Yu:
TVENet: Temporal variance embedding network for fine-grained action representation. Pattern Recognit. 103: 107267 (2020) - [j35]Mingbao Lin, Rongrong Ji, Shen Chen, Xiaoshuai Sun, Chia-Wen Lin:
Similarity-Preserving Linkage Hashing for Online Image Retrieval. IEEE Trans. Image Process. 29: 5289-5300 (2020) - [j34]Sheng Jin, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Lei Zhang, Xian-Sheng Hua:
Deep Saliency Hashing for Fine-Grained Retrieval. IEEE Trans. Image Process. 29: 5336-5351 (2020) - 2019
- [j33]Jun Peng, Yiyi Zhou, Xiaoshuai Sun, Jinsong Su, Rongrong Ji:
Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method. IEEE Access 7: 7555-7564 (2019) - [j32]Sheng Jin, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou:
Unsupervised semantic deep hashing. Neurocomputing 351: 19-25 (2019) - [j31]Taisong Jin, Zhengtao Yu, Yue Gao, Shengxiang Gao, Xiaoshuai Sun, Cuihua Li:
Robust ℓ2-Hypergraph and its applications. Inf. Sci. 501: 708-723 (2019) - [j30]Xiusheng Lu, Hongxun Yao, Sicheng Zhao, Xiaoshuai Sun, Shengping Zhang:
Action recognition with multi-scale trajectory-pooled 3D convolutional descriptors. Multim. Tools Appl. 78(1): 507-523 (2019) - [j29]Yasi Wang, Hongxun Yao, Wei Yu, Dong Wang, Shangchen Zhou, Xiaoshuai Sun:
Gradual recovery based occluded digit images recognition. Multim. Tools Appl. 78(2): 2571-2586 (2019) - [j28]Taisong Jin, Rongrong Ji, Yue Gao, Xiaoshuai Sun, Xibin Zhao, Dacheng Tao:
Correntropy-Induced Robust Low-Rank Hypergraph. IEEE Trans. Image Process. 28(6): 2755-2769 (2019) - [j27]Wenlong Xie, Hongxun Yao, Xiaoshuai Sun, Tingting Han, Sicheng Zhao, Tat-Seng Chua:
Discovering Latent Discriminative Patterns for Multi-Mode Event Representation. IEEE Trans. Multim. 21(6): 1425-1436 (2019) - 2018
- [j26]Wei Yu, Xiaoshuai Sun, Kuiyuan Yang, Yong Rui, Hongxun Yao:
Hierarchical semantic image matching using CNN feature pyramid. Comput. Vis. Image Underst. 169: 40-51 (2018) - [j25]Cheng Pang, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao, Wei Yu:
Rediscover flowers structurally. Multim. Tools Appl. 77(7): 7851-7863 (2018) - [j24]Cheng Pang, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao, Yanhao Zhang:
Exploring part-aware segmentation for fine-grained visual categorization. Multim. Tools Appl. 77(23): 30291-30310 (2018) - [j23]Ying Zheng, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao, Fatih Porikli:
Distinctive action sketch for human action recognition. Signal Process. 144: 323-332 (2018) - [j22]Wenlong Xie, Hongxun Yao, Sicheng Zhao, Xiaoshuai Sun, Tingting Han:
Event patches: Mining effective parts for event detection and understanding. Signal Process. 149: 82-87 (2018) - [j21]Xuanhan Wang, Lianli Gao, Peng Wang, Xiaoshuai Sun, Xianglong Liu:
Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length. IEEE Trans. Multim. 20(3): 634-644 (2018) - 2017
- [j20]Wei Yu, Kuiyuan Yang, Hongxun Yao, Xiaoshuai Sun, Pengfei Xu:
Exploiting the complementary strengths of multi-layer CNN features for image retrieval. Neurocomputing 237: 235-241 (2017) - [j19]Wenlong Xie, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao, Wei Yu, Shengping Zhang:
Actor identification via mining representative actions. Neurocomputing 244: 1-9 (2017) - [j18]Chen Wang, Hongxun Yao, Xiaoshuai Sun:
Anomaly detection based on spatio-temporal sparse representation and visual attention analysis. Multim. Tools Appl. 76(5): 6263-6279 (2017) - [j17]Ying Zheng, Hongxun Yao, Xiaoshuai Sun, Xuesong Jiang, Fatih Porikli:
Breaking video into pieces for action recognition. Multim. Tools Appl. 76(21): 22195-22212 (2017) - [j16]Chao Li, Zi Huang, Yang Yang, Jiewei Cao, Xiaoshuai Sun, Heng Tao Shen:
Hierarchical Latent Concept Discovery for Video Event Detection. IEEE Trans. Image Process. 26(5): 2149-2162 (2017) - [j15]Tingting Han, Hongxun Yao, Chenliang Xu, Xiaoshuai Sun, Yanhao Zhang, Jason J. Corso:
Dancelets Mining for Video Recommendation Based on Dance Styles. IEEE Trans. Multim. 19(4): 712-724 (2017) - 2016
- [j14]Tingting Han, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao, Yanhao Zhang:
Unsupervised discovery of crowd activities by saliency-based clustering. Neurocomputing 171: 347-361 (2016) - [j13]Litao Yu, Xiaoshuai Sun, Zi Huang:
Robust spatial-temporal deep model for multimedia event detection. Neurocomputing 213: 48-53 (2016) - 2015
- [j12]Yanhao Zhang, Qingming Huang, Lei Qin, Sicheng Zhao, Xiusheng Lu, Xiaoshuai Sun, Hongxun Yao:
Strategy for aesthetic photography recommendation via collaborative composition model. IET Comput. Vis. 9(5): 691-698 (2015) - [j11]Sicheng Zhao, Lujun Chen, Hongxun Yao, Yanhao Zhang, Xiaoshuai Sun:
Strategy for dynamic 3D depth data matching towards robust action retrieval. Neurocomputing 151: 533-543 (2015) - [j10]Yasi Wang, Hongxun Yao, Xiaoshuai Sun, Pengfei Xu, Sicheng Zhao:
深度学习中的自编码器的表达能力研究 (Representation Ability Research of Auto-encoders in Deep Learning). 计算机科学 42(9): 56-60 (2015) - 2014
- [j9]Pengfei Xu, Hongxun Yao, Rongrong Ji, Xianming Liu, Xiaoshuai Sun:
Where should I stand? Learning based human position recommendation for mobile photographing. Multim. Tools Appl. 69(1): 3-29 (2014) - [j8]Xiaoshuai Sun, Hongxun Yao, Rongrong Ji, Xianming Liu:
Toward Statistical Modeling of Saccadic Eye-Movement and Visual Saliency. IEEE Trans. Image Process. 23(11): 4649-4662 (2014) - 2013
- [j7]Sicheng Zhao, Hongxun Yao, Xiaoshuai Sun:
Video classification and recommendation based on affective analysis of viewers. Neurocomputing 119: 101-110 (2013) - [j6]Xiaoshuai Sun, Hongxun Yao, Rongrong Ji:
Visual attention modeling based on short-term environmental adaption. J. Vis. Commun. Image Represent. 24(2): 171-180 (2013) - [j5]Xianming Liu, Hongxun Yao, Rongrong Ji, Pengfei Xu, Xiaoshuai Sun:
Bidirectional-isomorphic manifold learning at image semantic understanding & representation. Multim. Tools Appl. 64(1): 53-76 (2013) - 2012
- [j4]Rongrong Ji, Hongxun Yao, Wei Liu, Xiaoshuai Sun, Qi Tian:
Task-Dependent Visual-Codebook Compression. IEEE Trans. Image Process. 21(4): 2282-2293 (2012) - [j3]Rongrong Ji, Hongxun Yao, Qi Tian, Pengfei Xu, Xiaoshuai Sun, Xianming Liu:
Context-Aware Semi-Local Feature Detector. ACM Trans. Intell. Syst. Technol. 3(3): 44:1-44:27 (2012) - 2011
- [j2]Rongrong Ji, Hongxun Yao, Xiaoshuai Sun:
Actor-independent action search using spatiotemporal vocabulary with appearance hashing. Pattern Recognit. 44(3): 624-638 (2011) - 2009
- [j1]Rongrong Ji, Hongxun Yao, Pengfei Xu, Xiaoshuai Sun:
Visual and textual fusion for semantically supervised region-based retrieval. Multim. Syst. 15(4): 201-219 (2009)
Conference and Workshop Papers
- 2024
- [c119]Tianyu Guo, Haowei Wang, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation. AAAI 2024: 1985-1993 - [c118]Zhipeng Qian, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks. AAAI 2024: 4551-4559 - [c117]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. AAAI 2024: 5940-5948 - [c116]Mingrui Wu, Yuqi Liu, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Toward Open-Set Human Object Interaction Detection. AAAI 2024: 6066-6073 - [c115]Siyu Zou, Jiji Tang, Yiyi Zhou, Jing He, Chaoyi Zhao, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun:
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks. AAAI 2024: 7864-7872 - [c114]Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation. CVPR 2024: 26648-26658 - [c113]Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Multi-branch Collaborative Learning Network for 3D Visual Grounding. ECCV (46) 2024: 381-398 - [c112]Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:
Towards Omni-supervised Referring Expression Segmentation. ICME 2024: 1-6 - [c111]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. ICML 2024 - [c110]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. ICML 2024 - [c109]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models. ICML 2024 - [c108]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. ICML 2024 - 2023
- [c107]Haowei Wang, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Xiaoshuai Sun:
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network. AAAI 2023: 2528-2536 - [c106]Mingrui Wu, Jiaxin Gu, Yunhang Shen, Mingbao Lin, Chao Chen, Xiaoshuai Sun:
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation. AAAI 2023: 2839-2846 - [c105]Lei Jin, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji:
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension. CVPR 2023: 1-10 - [c104]Jingjia Huang, Yinan Li, Jiashi Feng, Xinglong Wu, Xiaoshuai Sun, Rongrong Ji:
Clover: Towards A Unified Video-Language Alignment and Fusion Model. CVPR 2023: 14856-14866 - [c103]Jiamu Sun, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Zhiyu Wang, Rongrong Ji:
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension. CVPR 2023: 19144-19154 - [c102]Yiwei Ma, Haowei Wang, Xiaoqing Zhang, Guannan Jiang, Xiaoshuai Sun, Weilin Zhuang, Jiayi Ji, Rongrong Ji:
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance. ICCV 2023: 2737-2748 - [c101]Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji:
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation. ACM Multimedia 2023: 3403-3414 - [c100]Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. ACM Multimedia 2023: 4157-4168 - [c99]Xiaoxiong Du, Jun Peng, Yiyi Zhou, Jinlu Zhang, Siting Chen, Guannan Jiang, Xiaoshuai Sun, Rongrong Ji:
PixelFace+: Towards Controllable Face Generation and Manipulation with Text Descriptions and Segmentation Masks. ACM Multimedia 2023: 4666-4677 - [c98]Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji:
Semi-Supervised Panoptic Narrative Grounding. ACM Multimedia 2023: 7164-7174 - [c97]Gen Luo, Yiyi Zhou, Tianhe Ren, Shengxin Chen, Xiaoshuai Sun, Rongrong Ji:
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models. NeurIPS 2023 - [c96]Qiong Wu, Wei Yu, Yiyi Zhou, Shubin Huang, Xiaoshuai Sun, Rongrong Ji:
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models. NeurIPS 2023 - 2022
- [c95]Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji:
Active Teacher for Semi-Supervised Object Detection. CVPR 2022: 14462-14471 - [c94]Mingrui Wu, Xuying Zhang, Xiaoshuai Sun, Yiyi Zhou, Chao Chen, Jiaxin Gu, Xing Sun, Rongrong Ji:
DIFNet: Boosting Visual Information Flow for Image Captioning. CVPR 2022: 17999-18008 - [c93]Ke Sun, Hong Liu, Taiping Yao, Xiaoshuai Sun, Shen Chen, Shouhong Ding, Rongrong Ji:
An Information Theoretic Approach for Attention-Driven Face Forgery Detection. ECCV (14) 2022: 111-127 - [c92]Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji:
SeqTR: A Simple Yet Universal Network for Visual Grounding. ECCV (35) 2022: 598-615 - [c91]Jing He, Yiyi Zhou, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji:
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation. ECCV (14) 2022: 643-660 - [c90]Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji:
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval. ACM Multimedia 2022: 638-647 - [c89]Jun Peng, Han Pan, Yiyi Zhou, Jing He, Xiaoshuai Sun, Yan Wang, Yongjian Wu, Rongrong Ji:
Towards Open-Ended Text-to-Face Generation, Combination and Manipulation. ACM Multimedia 2022: 5045-5054 - [c88]Jun Peng, Xiaoxiong Du, Yiyi Zhou, Jing He, Yunhang Shen, Xiaoshuai Sun, Rongrong Ji:
Learning Dynamic Prior Knowledge for Text-to-Face Pixel Synthesis. ACM Multimedia 2022: 5132-5141 - [c87]Peng Mi, Li Shen, Tianhe Ren, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji, Dacheng Tao:
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach. NeurIPS 2022 - 2021
- [c86]Jiayi Ji, Yunpeng Luo, Xiaoshuai Sun, Fuhai Chen, Gen Luo, Yongjian Wu, Yue Gao, Rongrong Ji:
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network. AAAI 2021: 1655-1663 - [c85]Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji:
Dual-level Collaborative Transformer for Image Captioning. AAAI 2021: 2286-2293 - [c84]Xuying Zhang, Xiaoshuai Sun, Yunpeng Luo, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji:
RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words. CVPR 2021: 15465-15474 - [c83]Yiyi Zhou, Tianhe Ren, Chaoyang Zhu, Xiaoshuai Sun, Jianzhuang Liu, Xinghao Ding, Mingliang Xu, Rongrong Ji:
TRAR: Routing the Attention Spans in Transformer for Visual Question Answering. ICCV 2021: 2054-2064 - 2020
- [c82]Sheng Jin, Shangchen Zhou, Yao Liu, Chao Chen, Xiaoshuai Sun, Hongxun Yao, Xian-Sheng Hua:
SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation. AAAI 2020: 11157-11164 - [c81]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji:
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation. CVPR 2020: 10031-10040 - [c80]Yiyi Zhou, Rongrong Ji, Xiaoshuai Sun, Gen Luo, Xiaopeng Hong, Jinsong Su, Xinghao Ding, Ling Shao:
K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering. ACM Multimedia 2020: 1245-1254 - [c79]Gen Luo, Yiyi Zhou, Rongrong Ji, Xiaoshuai Sun, Jinsong Su, Chia-Wen Lin, Qi Tian:
Cascade Grouped Attention Network for Referring Expression Segmentation. ACM Multimedia 2020: 1274-1282 - [c78]Xiaoshuai Sun, Xuying Zhang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Exploring Language Prior for Mode-Sensitive Visual Attention Modeling. ACM Multimedia 2020: 4199-4207 - [c77]Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji, Fuhai Chen, Jianzhuang Liu, Qi Tian:
Attacking Image Captioning Towards Accuracy-Preserving Target Words Removal. ACM Multimedia 2020: 4226-4234 - 2019
- [c76]Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Yongjian Wu, Yunsheng Wu:
Towards Optimal Discrete Online Hashing with Balanced Similarity. AAAI 2019: 8722-8729 - [c75]Xiawu Zheng, Rongrong Ji, Xiaoshuai Sun, Baochang Zhang, Yongjian Wu, Feiyue Huang:
Towards Optimal Fine Grained Retrieval via Decorrelated Centralized Loss with Normalize-Scale Layer. AAAI 2019: 9291-9298 - [c74]Yiyi Zhou, Rongrong Ji, Jinsong Su, Xiangming Li, Xiaoshuai Sun:
Free VQA Models from Knowledge Inertia by Pairwise Inconformity Learning. AAAI 2019: 9316-9323 - [c73]Yiyi Zhou, Rongrong Ji, Jinsong Su, Xiaoshuai Sun, Weiqiu Chen:
Dynamic Capsule Attention for Visual Question Answering. AAAI 2019: 9324-9331 - [c72]Jun Peng, Yiyi Zhou, Liujuan Cao, Xiaoshuai Sun, Jinsong Su, Rongrong Ji:
Towards Cross-modality Topic Modelling via Deep Topical Correlation Analysis. ICASSP 2019: 4115-4119 - [c71]Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang:
Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images. ICCV 2019: 2690-2698 - [c70]Jianyu Wang, Shaohui Liu, Feng Jiang, Xiaoshuai Sun, Yongliang Liu:
A Video Post-Filter Deblocking Method Based on Temporal Boosting Residual Networks. ICME 2019: 1174-1179 - [c69]Taisong Jin, Liujuan Cao, Baochang Zhang, Xiaoshuai Sun, Cheng Deng, Rongrong Ji:
Hypergraph Induced Convolutional Manifold Networks. IJCAI 2019: 2670-2676 - [c68]Huafeng Kuang, Rongrong Ji, Hong Liu, Shengchuan Zhang, Xiaoshuai Sun, Feiyue Huang, Baochang Zhang:
Multi-modal Multi-layer Fusion Network with Average Binary Center Loss for Face Anti-spoofing. ACM Multimedia 2019: 48-56 - [c67]Fuhai Chen, Rongrong Ji, Jiayi Ji, Xiaoshuai Sun, Baochang Zhang, Xuri Ge, Yongjian Wu, Feiyue Huang, Yan Wang:
Variational Structured Semantic Inference for Diverse Image Captioning. NeurIPS 2019: 1929-1939 - [c66]Jie Hu, Rongrong Ji, Shengchuan Zhang, Xiaoshuai Sun, Qixiang Ye, Chia-Wen Lin, Qi Tian:
Information Competing Process for Learning Diversified Representations. NeurIPS 2019: 2175-2186 - 2018
- [c65]Zheng Xu, Xitong Yang, Xue Li, Xiaoshuai Sun:
Strong Baseline for Single Image Dehazing with Deep Features and Instance Normalization. BMVC 2018: 243 - [c64]Fuhai Chen, Rongrong Ji, Xiaoshuai Sun, Yongjian Wu, Jinsong Su:
GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints. CVPR 2018: 1345-1353 - [c63]Weiqing Wang, Hongzhi Yin, Zi Huang, Xiaoshuai Sun, Nguyen Quoc Viet Hung:
Restricted Boltzmann Machine Based Active Learning for Sparse Recommendation. DASFAA (1) 2018: 100-115 - [c62]Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Xiaojun Tong:
Weighted voxel: a novel voxel representation for 3D reconstruction. ICIMCS 2018: 33:1-33:4 - [c61]JunLei Zhang, Hongxun Yao, JiaLing He, Xiaoshuai Sun:
Illustrate your travel notes: web-based story visualization. ICIMCS 2018: 44:1-44:5 - [c60]Chuang Lin, Hongxun Yao, Wei Yu, Xiaoshuai Sun:
Cycle-Consistency Based Hierarchical Dense Semantic Correspondence. ICIP 2018: 818-822 - [c59]Tingting Han, Hongxun Yao, Xiaoshuai Sun, Wenlong Xie, Yanhao Zhang:
Add: Actionness-Pooled Deep-Convolutional Descriptor. ICME 2018: 1-6 - [c58]Xiawu Zheng, Rongrong Ji, Xiaoshuai Sun, Yongjian Wu, Feiyue Huang, Yanhua Yang:
Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval. IJCAI 2018: 1226-1233 - 2017
- [c57]Xiaoshuai Sun, Jiewei Cao, Chao Li, Lei Zhu, Heng Tao Shen:
Web-Based Semantic Fragment Discovery for On-Line Lingual-Visual Similarity. AAAI 2017: 182-188 - [c56]Xiaoshuai Sun, Zi Huang, Hongzhi Yin, Heng Tao Shen:
An Integrated Model for Effective Saliency Prediction. AAAI 2017: 274-281 - [c55]Hongzhi Yin, Hongxu Chen, Xiaoshuai Sun, Hao Wang, Yang Wang, Quoc Viet Hung Nguyen:
SPTF: A Scalable Probabilistic Tensor Factorization Model for Semantic-Aware Behavior Prediction. ICDM 2017: 585-594 - [c54]Haoran Li, Hongxun Yao, Yuxin Hou, Xiaoshuai Sun:
Gated additive skip context connection for object detection. ICIP 2017: 680-684 - [c53]Yuxin Hou, Hongxun Yao, Haoran Li, Xiaoshuai Sun:
Dancing like a superstar: Action guidance based on pose estimation and conditional pose alignment. ICIP 2017: 1312-1316 - [c52]Yasi Wang, Hongxun Yao, Wei Yu, Xiaoshuai Sun:
Object Discovery and Cosegmentation Based on Dense Correspondences. PCM (2) 2017: 119-128 - [c51]Xiusheng Lu, Hongxun Yao, Xiaoshuai Sun, Shengping Zhang, Yanhao Zhang:
Trajectory-Pooled 3D Convolutional Descriptors for Action Recognition. PCM (1) 2017: 247-257 - [c50]Yu Xia, Hongxun Yao, Xiaoshuai Sun, Yanhao Zhang:
Shallow and Deep Model Investigation for Distinguishing Corn and Weeds. PCM (1) 2017: 693-702 - [c49]Wenbo Tang, Hongxun Yao, Xiaoshuai Sun, Wei Yu:
Multi-scale Discriminative Patches for Fined-Grained Visual Categorization. PCM (1) 2017: 712-721 - 2016
- [c48]Wenlong Xie, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao, Tingting Han, Cheng Pang:
Mining representative actions for actor identification. ICASSP 2016: 1253-1257 - [c47]Jiewei Cao, Zi Huang, Peng Wang, Chao Li, Xiaoshuai Sun, Heng Tao Shen:
Quartet-net Learning for Visual Instance Retrieval. ACM Multimedia 2016: 456-460 - 2015
- [c46]Dong Wang, Bin Wang, Sicheng Zhao, Xiaoshuai Sun, Hongxun Yao, Hong Liu:
Dual-mode video stabilization based on adaptive motion clustering. ICIMCS 2015: 6:1-6:4 - [c45]Yinghao Huang, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao:
Boost sparse coding based abnormal event detection via explicitly applying temporal continuity constraint. ICIMCS 2015: 29:1-29:4 - [c44]Ying Zheng, Hongxun Yao, Xiaoshuai Sun, Sicheng Zhao:
Distinctive action sketch. ICIP 2015: 576-580 - [c43]Sicheng Zhao, Hongxun Yao, Xiaolei Jiang, Xiaoshuai Sun:
Predicting discrete probability distribution of image emotions. ICIP 2015: 2459-2463 - [c42]Tingting Han, Hongxun Yao, Xiaoshuai Sun, Yanhao Zhang, Sicheng Zhao, Xiusheng Lu, Yinghao Huang, Wenlong Xie:
"Clustering of Dancelets": Towards Video Recommendation Based on Dance Styles. ACM Multimedia 2015: 915-918 - [c41]Cheng Pang, Hongxun Yao, Zhiyuan Yang, Xiaoshuai Sun, Sicheng Zhao, Yanhao Zhang:
Part-Aware Segmentation for Fine-Grained Categorization. PCM (1) 2015: 538-548 - 2014
- [c40]Cheng Pang, Hongxun Yao, Xiaoshuai Sun:
Discriminative Features for Bird Species Classification. ICIMCS 2014: 256 - [c39]Yuankai Qi, Hongxun Yao, Xiaoshuai Sun, Xin Sun, Yanhao Zhang, Qingming Huang:
Structure-aware multi-object discovery for weakly supervised tracking. ICIP 2014: 466-470 - [c38]Xiaoshuai Sun, Hongxun Yao:
Exploring covert attention for generic boosting of saliency models. ICIP 2014: 1179-1183 - [c37]Tingting Han, Hongxun Yao, Xiaoshuai Sun, Yanhao Zhang:
"Clustering by saliency" - Unsupervised discovery of crowd activities. ICIP 2014: 2388-2392 - [c36]Sicheng Zhao, Yue Gao, Xiaolei Jiang, Hongxun Yao, Tat-Seng Chua, Xiaoshuai Sun:
Exploring Principles-of-Art Features For Image Emotion Recognition. ACM Multimedia 2014: 47-56 - [c35]Haoran Li, Hongxun Yao, Xiaoshuai Sun:
Using Label Propagation to Get Confidence Map for Segmentation. PCM 2014: 84-92 - 2013
- [c34]Xiaoshuai Sun, Xin-Jing Wang, Hongxun Yao, Lei Zhang:
Exploring Implicit Image Statistics for Visual Representativeness Modeling. CVPR 2013: 516-523 - [c33]Xue Li, Hongxun Yao, Xiaoshuai Sun, Yanhao Zhang:
On dense sampling size. ICIP 2013: 290-294 - [c32]Sicheng Zhao, Hongxun Yao, Xiaoshuai Sun, Xiaolei Jiang, Pengfei Xu:
Flexible Presentation of Videos Based on Affective Content Analysis. MMM (1) 2013: 368-379 - 2012
- [c31]Xiaoshuai Sun, Hongxun Yao, Rongrong Ji:
What are we looking for: Towards statistical modeling of saccadic eye movements and visual saliency. CVPR 2012: 1552-1559 - [c30]Yanhao Zhang, Xiaoshuai Sun, Hongxun Yao, Lei Qin, Qingming Huang:
Aesthetic composition represetation for portrait photographing recommendation. ICIP 2012: 2753-2756 - [c29]Xiaoshuai Sun, Hongxun Yao:
Memorable basis: towards human-centralized sparse representation. ACM Multimedia 2012: 761-764 - [c28]Lujun Chen, Hongxun Yao, Xiaoshuai Sun, Hongming Zhang:
Real-Time Viewfinder Composition Assessment and Recommendation to Mobile Photographing. PCM 2012: 707-714 - [c27]Tingting Han, Hongxun Yao, Xiaoshuai Sun, Guoyi Liu:
Action Segmentation in Dance Videos. PCM 2012: 832-840 - [c26]Lujun Chen, Hongxun Yao, Xiaoshuai Sun:
Action retrieval based on generalized dynamic depth data matching. VCIP 2012: 1 - 2011
- [c25]Gaoxiang Zhang, Feng Jiang, Debin Zhao, Xiaoshuai Sun, Shaohui Liu:
Saliency Detection: A Self-Adaption Sparse Representation Approach. ICIG 2011: 461-465 - [c24]Sicheng Zhao, Hongxun Yao, Xiaoshuai Sun:
Affective Video Classification Based on Spatio-temporal Feature Fusion. ICIG 2011: 795-800 - [c23]Yanhao Zhang, Hongxun Yao, Pengfei Xu, Rongrong Ji, Xiaoshuai Sun, Xianming Liu:
Video stabilization based on saliency driven SIFT matching and discriminative RANSAC. ICIMCS 2011: 65-69 - [c22]Wei Yu, Hongxun Yao, Xianming Liu, Rongrong Ji, Xiaoshuai Sun, Pengfei Xu:
Contextual dictionaries for image super resolution. ICIMCS 2011: 150-153 - [c21]Pengfei Xu, Hongxun Yao, Rongrong Ji, Xiaoshuai Sun, Xianming Liu:
A spatiotemporal context phrase description for general dynamic texture. ICIMCS 2011: 154-157 - [c20]Xue Li, Hongxun Yao, Xiaoshuai Sun, Rongrong Ji, Xianming Liu, Pengfei Xu:
Sparse representation based visual element analysis. ICIP 2011: 657-660 - [c19]Xianming Liu, Hongxun Yao, Rongrong Ji, Pengfei Xu, Xiaoshuai Sun, Qi Tian:
Learning heterogeneous data for hierarchical web video classification. ACM Multimedia 2011: 433-442 - [c18]Xiaoshuai Sun, Hongxun Yao, Rongrong Ji, Xianming Liu, Pengfei Xu:
Unsupervised fast anomaly detection in crowds. ACM Multimedia 2011: 1469-1472 - [c17]Sicheng Zhao, Hongxun Yao, Xiaoshuai Sun, Pengfei Xu, Xianming Liu, Rongrong Ji:
Video indexing and recommendation based on affective analysis of viewers. ACM Multimedia 2011: 1473-1476 - 2010
- [c16]Rongrong Ji, Hongxun Yao, Xiaoshuai Sun, Bineng Zhong, Wen Gao:
Towards semantic embedding in visual vocabulary. CVPR 2010: 918-925 - [c15]Kun Yuan, Hongxun Yao, Rongrong Ji, Xiaoshuai Sun:
Mining actor correlations with hierarchical concurrence parsing. ICASSP 2010: 798-801 - [c14]Xianming Liu, Hongxun Yao, Rongrong Ji, Pengfei Xu, Xiaoshuai Sun, Qi Tian:
Visual topic model for web image annotation. ICIMCS 2010: 126-130 - [c13]Pengfei Xu, Hongxun Yao, Rongrong Ji, Xiaoshuai Sun, Xianming Liu:
A robust texture descriptor using multifractal analysis with Gabor filter. ICIMCS 2010: 147-150 - [c12]Xiaoshuai Sun, Hongxun Yao, Rongrong Ji, Pengfei Xu, Xianming Liu, Shaohui Liu:
Visual saliency as sequential eye fixation probability. ICIP 2010: 1093-1096 - [c11]Xiaoshuai Sun, Hongxun Yao, Rongrong Ji, Pengfei Xu, Xianming Liu, Shaohui Liu:
Saliency detection based on short-term sparse representation. ICIP 2010: 1101-1104 - [c10]Pengfei Xu, Hongxun Yao, Rongrong Ji, Xiaoshuai Sun, Xianming Liu:
A rotation and scale invariant texture description approach. VCIP 2010: 77442T - 2009
- [c9]Kun Yuan, Rongrong Ji, Hongxun Yao, Xiaoshuai Sun, Pengfei Xu, Xianming Liu:
VisualCor system: search actor correlations in TV series. ICIMCS 2009: 213-218 - [c8]Xiaoshuai Sun, Hongxun Yao, Rongrong Ji, Shaohui Liu:
Photo assessment based on computational visual attention model. ACM Multimedia 2009: 541-544 - [c7]Xianming Liu, Hongxun Yao, Rongrong Ji, Pengfei Xu, Xiaoshuai Sun:
What is a complete set of keywords for image description & annotation on the web. ACM Multimedia 2009: 613-616 - 2008
- [c6]Pengfei Xu, Rongrong Ji, Hongxun Yao, Xiaoshuai Sun, Tianqiang Liu, Xianming Liu:
Text Particles Multi-band Fusion for Robust Text Detection. ICIAR 2008: 587-596 - [c5]Rongrong Ji, Pengfei Xu, Hongxun Yao, Zhen Zhang, Xiaoshuai Sun, Tianqiang Liu:
Directional correlation analysis of local Haar binary pattern for text detection. ICME 2008: 885-888 - [c4]Xianming Liu, Rongrong Ji, Hongxun Yao, Pengfei Xu, Xiaoshuai Sun, Tianqiang Liu:
Cross-media manifold learning for image retrieval & annotation. Multimedia Information Retrieval 2008: 141-148 - [c3]Xiaoshuai Sun, Rongrong Ji, Hongxun Yao, Pengfei Xu, Tianqiang Liu, Xianming Liu:
Place retrieval with graph-based place-view model. Multimedia Information Retrieval 2008: 268-275 - [c2]Rongrong Ji, Xiaoshuai Sun, Hongxun Yao, Pengfei Xu, Tianqiang Liu, Xianming Liu:
Attention-driven action retrieval with DTW-based 3d descriptor matching. ACM Multimedia 2008: 619-622 - [c1]Tianqiang Liu, Hongxun Yao, Rongrong Ji, Yan Liu, Xianming Liu, Xiaoshuai Sun, Pengfei Xu, Zhen Zhang:
Vision-Based Semi-supervised Homecare with Spatial Constraint. PCM 2008: 416-425
Informal and Other Publications
- 2024
- [i72]Siyu Zou, Jiji Tang, Yiyi Zhou, Jing He, Chaoyi Zhao, Rongsheng Zhang, Zhipeng Hu, Xiaoshuai Sun:
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks. CoRR abs/2401.07709 (2024) - [i71]Gen Luo, Yiyi Zhou, Yuxin Zhang, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models. CoRR abs/2403.03003 (2024) - [i70]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. CoRR abs/2403.06702 (2024) - [i69]Qiong Wu, Weihao Ye, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models. CoRR abs/2403.15226 (2024) - [i68]Zhongxi Chen, Ke Sun, Ziyin Zhou, Xianming Lin, Xiaoshuai Sun, Liujuan Cao, Rongrong Ji:
DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis. CoRR abs/2403.18471 (2024) - [i67]Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Deep Instruction Tuning for Segment Anything Model. CoRR abs/2404.00650 (2024) - [i66]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. CoRR abs/2405.00954 (2024) - [i65]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Xiaopeng Hong, Yongjian Wu, Rongrong Ji:
Image Captioning via Dynamic Path Customization. CoRR abs/2406.00334 (2024) - [i64]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. CoRR abs/2406.01451 (2024) - [i63]Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. CoRR abs/2406.05620 (2024) - [i62]Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, Rongrong Ji:
AnyTrans: Translate AnyText in the Image with Large Scale Models. CoRR abs/2406.11432 (2024) - [i61]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in LVLMs. CoRR abs/2406.16449 (2024) - [i60]Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model. CoRR abs/2407.05352 (2024) - [i59]Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Multi-branch Collaborative Learning Network for 3D Visual Grounding. CoRR abs/2407.05363 (2024) - [i58]Qiong Wu, Zhaoxi Ke, Yiyi Zhou, Gen Luo, Xiaoshuai Sun, Rongrong Ji:
Routing Experts: Learning to Route Dynamic Experts in Multi-modal Large Language Models. CoRR abs/2407.14093 (2024) - [i57]Yiwei Ma, Zhibin Wang, Xiaoshuai Sun, Weihuang Lin, Qiang Zhou, Jiayi Ji, Rongrong Ji:
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model. CoRR abs/2407.16198 (2024) - [i56]Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. CoRR abs/2407.20664 (2024) - [i55]Mingrui Wu, Xinyue Cai, Jiayi Ji, Jiale Li, Oucheng Huang, Gen Luo, Hao Fei, Xiaoshuai Sun, Rongrong Ji:
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models. CoRR abs/2407.21534 (2024) - [i54]Ziyin Zhou, Ke Sun, Zhongxi Chen, Huafeng Kuang, Xiaoshuai Sun, Rongrong Ji:
StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model. CoRR abs/2408.05669 (2024) - [i53]Mingrui Wu, Oucheng Huang, Jiayi Ji, Jiale Li, Xinyue Cai, Huafeng Kuang, Jianzhuang Liu, Xiaoshuai Sun, Rongrong Ji:
TraDiffusion: Trajectory-Based Training-Free Image Generation. CoRR abs/2408.09739 (2024) - [i52]Yiwei Ma, Jiayi Ji, Ke Ye, Weihuang Lin, Zhibin Wang, Yonghan Zheng, Qiang Zhou, Xiaoshuai Sun, Rongrong Ji:
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing. CoRR abs/2408.14180 (2024) - 2023
- [i51]Haowei Wang, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Xiaoshuai Sun:
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network. CoRR abs/2301.03160 (2023) - [i50]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji:
Towards Local Visual Modeling for Image Captioning. CoRR abs/2302.06098 (2023) - [i49]Gen Luo, Minglang Huang, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Zhiyu Wang, Rongrong Ji:
Towards Efficient Visual Adaption via Structural Re-parameterization. CoRR abs/2302.08106 (2023) - [i48]Gen Luo, Yiyi Zhou, Lei Jin, Xiaoshuai Sun, Rongrong Ji:
Towards End-to-end Semi-supervised Learning for One-stage Object Detection. CoRR abs/2302.11299 (2023) - [i47]Peng Mi, Jianghang Lin, Yiyi Zhou, Yunhang Shen, Gen Luo, Xiaoshuai Sun, Liujuan Cao, Rongrong Fu, Qiang Xu, Rongrong Ji:
Active Teacher for Semi-Supervised Object Detection. CoRR abs/2303.08348 (2023) - [i46]Yiwei Ma, Xiaoqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance. CoRR abs/2303.15764 (2023) - [i45]Gen Luo, Yiyi Zhou, Tianhe Ren, Shengxin Chen, Xiaoshuai Sun, Rongrong Ji:
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models. CoRR abs/2305.15023 (2023) - [i44]Shubin Huang, Qiong Wu, Yiyi Zhou, Weijie Chen, Rongsheng Zhang, Xiaoshuai Sun, Rongrong Ji:
Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting. CoRR abs/2306.00409 (2023) - [i43]Peng Mi, Li Shen, Tianhe Ren, Yiyi Zhou, Tianshuo Xu, Xiaoshuai Sun, Tongliang Liu, Rongrong Ji, Dacheng Tao:
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer. CoRR abs/2306.17504 (2023) - [i42]Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
Towards General Visual-Linguistic Face Forgery Detection. CoRR abs/2307.16545 (2023) - [i41]Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji:
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation. CoRR abs/2308.02982 (2023) - [i40]Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
Continual Face Forgery Detection via Historical Distribution Preserving. CoRR abs/2308.06217 (2023) - [i39]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. CoRR abs/2308.16632 (2023) - [i38]Qiong Wu, Wei Yu, Yiyi Zhou, Shubin Huang, Xiaoshuai Sun, Rongrong Ji:
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models. CoRR abs/2309.01479 (2023) - [i37]Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji:
JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues. CoRR abs/2310.09503 (2023) - [i36]Haowei Wang, Jiayi Ji, Tianyu Guo, Yilong Yang, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning. CoRR abs/2310.10975 (2023) - [i35]Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji:
Semi-Supervised Panoptic Narrative Grounding. CoRR abs/2310.18142 (2023) - [i34]Minglang Huang, Yiyi Zhou, Gen Luo, Guannan Jiang, Weilin Zhuang, Xiaoshuai Sun:
Towards Omni-supervised Referring Expression Segmentation. CoRR abs/2311.00397 (2023) - [i33]Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji:
X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation. CoRR abs/2312.00085 (2023) - [i32]Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation. CoRR abs/2312.12470 (2023) - 2022
- [i31]Fuhai Chen, Xiaoshuai Sun, Xuri Ge, Jianzhuang Liu, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Differentiated Relevances Embedding for Group-based Referring Expression Comprehension. CoRR abs/2203.06382 (2022) - [i30]Chengpeng Dai, Fuhai Chen, Xiaoshuai Sun, Rongrong Ji, Qixiang Ye, Yongjian Wu:
Global2Local: A Joint-Hierarchical Attention for Video Captioning. CoRR abs/2203.06663 (2022) - [i29]Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji:
SeqTR: A Simple yet Universal Network for Visual Grounding. CoRR abs/2203.16265 (2022) - [i28]Jing He, Yiyi Zhou, Qi Zhang, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji:
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation. CoRR abs/2204.00833 (2022) - [i27]Mingrui Wu, Jiaxin Gu, Yunhang Shen, Mingbao Lin, Chao Chen, Xiaoshuai Sun, Rongrong Ji:
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation. CoRR abs/2204.03541 (2022) - [i26]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yan Wang, Liujuan Cao, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks. CoRR abs/2204.07780 (2022) - [i25]Gen Luo, Yiyi Zhou, Jiamu Sun, Shubin Huang, Xiaoshuai Sun, Qixiang Ye, Yongjian Wu, Rongrong Ji:
What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study. CoRR abs/2204.07913 (2022) - [i24]Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji:
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval. CoRR abs/2207.07285 (2022) - [i23]Jingjia Huang, Yinan Li, Jiashi Feng, Xiaoshuai Sun, Rongrong Ji:
Clover: Towards A Unified Video-Language Alignment and Fusion Model. CoRR abs/2207.07885 (2022) - [i22]Peng Mi, Li Shen, Tianhe Ren, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji, Dacheng Tao:
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach. CoRR abs/2210.05177 (2022) - 2021
- [i21]Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji:
Dual-Level Collaborative Transformer for Image Captioning. CoRR abs/2101.06462 (2021) - [i20]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Xinghao Ding, Yongjian Wu, Feiyue Huang, Yue Gao, Rongrong Ji:
Towards Language-guided Visual Recognition via Dynamic Convolutions. CoRR abs/2110.08797 (2021) - 2020
- [i19]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Liujuan Cao, Chenglin Wu, Cheng Deng, Rongrong Ji:
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation. CoRR abs/2003.08813 (2020) - [i18]Mingbao Lin, Rongrong Ji, Xiaoshuai Sun, Baochang Zhang, Feiyue Huang, Yonghong Tian, Dacheng Tao:
Fast Class-wise Updating for Online Hashing. CoRR abs/2012.00318 (2020) - [i17]Jiayi Ji, Yunpeng Luo, Xiaoshuai Sun, Fuhai Chen, Gen Luo, Yongjian Wu, Yue Gao, Rongrong Ji:
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network. CoRR abs/2012.07061 (2020) - 2019
- [i16]Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Yongjian Wu, Yunsheng Wu:
Towards Optimal Discrete Online Hashing with Balanced Similarity. CoRR abs/1901.10185 (2019) - [i15]Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang, Xiaojun Tong:
Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images. CoRR abs/1901.11153 (2019) - [i14]Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Shen Chen, Qi Tian:
Hadamard Matrix Guided Online Hashing. CoRR abs/1905.04454 (2019) - [i13]Mingbao Lin, Rongrong Ji, Shen Chen, Feng Zheng, Xiaoshuai Sun, Baochang Zhang, Liujuan Cao, Guodong Guo, Feiyue Huang:
Supervised Online Hashing via Similarity Distribution Learning. CoRR abs/1905.13382 (2019) - [i12]Jie Hu, Rongrong Ji, Shengchuan Zhang, Xiaoshuai Sun, Qixiang Ye, Chia-Wen Lin, Qi Tian:
Information Competing Process for Learning Diversified Representations. CoRR abs/1906.01288 (2019) - [i11]Rongrong Ji, Ke Li, Yan Wang, Xiaoshuai Sun, Feng Guo, Xiaowei Guo, Yongjian Wu, Feiyue Huang, Jiebo Luo:
Semi-Supervised Adversarial Monocular Depth Estimation. CoRR abs/1908.02126 (2019) - [i10]Chen Shen, Rongrong Ji, Fuhai Chen, Xiaoshuai Sun, Xiangming Li:
Scene-based Factored Attention for Image Captioning. CoRR abs/1908.02632 (2019) - [i9]Fuhai Chen, Rongrong Ji, Chengpeng Dai, Xiaoshuai Sun, Chia-Wen Lin, Jiayi Ji, Baochang Zhang, Feiyue Huang, Liujuan Cao:
Semantic-aware Image Deblurring. CoRR abs/1910.03853 (2019) - [i8]Ying Zheng, Hongxun Yao, Xiaoshuai Sun:
Deep Semantic Parsing of Freehand Sketches with Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning. CoRR abs/1910.06023 (2019) - [i7]Ying Zheng, Hongxun Yao, Xiaoshuai Sun, Shengping Zhang, Sicheng Zhao, Fatih Porikli:
Sketch-Specific Data Augmentation for Freehand Sketch Recognition. CoRR abs/1910.06038 (2019) - [i6]Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Xiaoshuai Sun, Wenxiu Sun:
Toward 3D Object Reconstruction from Stereo Images. CoRR abs/1910.08223 (2019) - [i5]Shen Chen, Liujuan Cao, Mingbao Lin, Yan Wang, Xiaoshuai Sun, Chenglin Wu, Jingfei Qiu, Rongrong Ji:
Hadamard Codebook Based Deep Hashing. CoRR abs/1910.09182 (2019) - [i4]Sheng Jin, Shangchen Zhou, Yao Liu, Chao Chen, Xiaoshuai Sun, Hongxun Yao, Xiansheng Hua:
SSAH: Semi-supervised Adversarial Deep Hashing with Self-paced Hard Sample Generation. CoRR abs/1911.08688 (2019) - [i3]Yiyi Zhou, Rongrong Ji, Gen Luo, Xiaoshuai Sun, Jinsong Su, Xinghao Ding, Chia-Wen Lin, Qi Tian:
A Real-time Global Inference Network for One-stage Referring Expression Comprehension. CoRR abs/1912.03478 (2019) - 2018
- [i2]Zheng Xu, Xitong Yang, Xue Li, Xiaoshuai Sun:
The Effectiveness of Instance Normalization: a Strong Baseline for Single Image Dehazing. CoRR abs/1805.03305 (2018) - [i1]Xiaoshuai Sun:
Semantic and Contrast-Aware Saliency. CoRR abs/1811.03736 (2018)
last updated on 2024-10-11 18:20 CEST by the dblp team
all metadata released as open data under CC0 1.0 license