Rongrong Ji
Person information
- affiliation: Xiamen University, Xiamen, China
- affiliation: Columbia University, New York, NY, USA
2020 – today
- 2025
- [j189]Yinan Li, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yunpeng Luo, Rongrong Ji:
M3ixup: A multi-modal data augmentation approach for image captioning. Pattern Recognit. 158: 110941 (2025)
- 2024
- [j188]Shaohui Lin, Bo Ji, Rongrong Ji, Angela Yao:
A closer look at branch classifiers of multi-exit architectures. Comput. Vis. Image Underst. 239: 103900 (2024) - [j187]Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yongjian Wu, Yue Gao, Rongrong Ji:
Towards Language-Guided Visual Recognition via Dynamic Convolutions. Int. J. Comput. Vis. 132(1): 1-19 (2024) - [j186]Bingjie Liu, Qiancheng Zheng, Heng Wei, Jinxian Zhao, Haoyuan Yu, Yiyi Zhou, Fei Chao, Rongrong Ji:
Deep hybrid transformer network for robust modulation classification in wireless communications. Knowl. Based Syst. 300: 112191 (2024) - [j185]Tianshuo Xu, Lijiang Li, Peng Mi, Xiawu Zheng, Fei Chao, Rongrong Ji, Yonghong Tian, Qiang Shen:
Uncovering the Over-Smoothing Challenge in Image Super-Resolution: Entropy-Based Quantification and Contrastive Optimization. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 6199-6215 (2024) - [j184]Qinqin Zhou, Kekai Sheng, Xiawu Zheng, Ke Li, Yonghong Tian, Jie Chen, Rongrong Ji:
Training-Free Transformer Architecture Search With Zero-Cost Proxy Guided Evolution. IEEE Trans. Pattern Anal. Mach. Intell. 46(10): 6525-6541 (2024) - [j183]Yimin Xu, Mingbao Lin, Hong Yang, Fei Chao, Rongrong Ji:
Shadow-aware dynamic convolution for shadow removal. Pattern Recognit. 146: 109969 (2024) - [j182]Yimin Xu, Nanxi Gao, Fei Chao, Rongrong Ji:
An efficient blur kernel estimation method for blind image Super-Resolution. Pattern Recognit. 154: 110590 (2024) - [j181]Xiantao Hu, Bineng Zhong, Qihua Liang, Shengping Zhang, Ning Li, Xianxian Li, Rongrong Ji:
Transformer Tracking via Frequency Fusion. IEEE Trans. Circuits Syst. Video Technol. 34(2): 1020-1031 (2024) - [j180]Jiaxin Ye, Bineng Zhong, Qihua Liang, Shengping Zhang, Xianxian Li, Rongrong Ji:
Positive-Sample-Free Object Tracking via a Soft Constraint. IEEE Trans. Circuits Syst. Video Technol. 34(3): 1364-1375 (2024) - [j179]Yaozong Zheng, Bineng Zhong, Qihua Liang, Guorong Li, Rongrong Ji, Xianxian Li:
Toward Unified Token Learning for Vision-Language Tracking. IEEE Trans. Circuits Syst. Video Technol. 34(4): 2125-2135 (2024) - [j178]Huafeng Kuang, Hong Liu, Xianming Lin, Rongrong Ji:
Defense Against Adversarial Attacks Using Topology Aligning Adversarial Training. IEEE Trans. Inf. Forensics Secur. 19: 3659-3673 (2024) - [j177]Jinyu Yang, Mingqi Gao, Feng Zheng, Xiantong Zhen, Rongrong Ji, Ling Shao, Ales Leonardis:
Weakly-Supervised RGBD Video Object Segmentation. IEEE Trans. Image Process. 33: 2158-2170 (2024) - [j176]Haixin Ding, Shengchuan Zhang, Qiong Wu, Songlin Yu, Jie Hu, Liujuan Cao, Rongrong Ji:
Bilateral Knowledge Interaction Network for Referring Image Segmentation. IEEE Trans. Multim. 26: 2966-2977 (2024) - [j175]Shuman Fang, Zhiwen Lin, Ke Yan, Jie Li, Xianming Lin, Rongrong Ji:
HODN: Disentangling Human-Object Feature for HOI Detection. IEEE Trans. Multim. 26: 3125-3136 (2024) - [j174]Gen Luo, Yiyi Zhou, Jiamu Sun, Xiaoshuai Sun, Rongrong Ji:
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension. IEEE Trans. Multim. 26: 3689-3700 (2024) - [c326]Mingrui Wu, Yuqi Liu, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Toward Open-Set Human Object Interaction Detection. AAAI 2024: 6066-6073 - [c325]Yunshan Zhong, Yuyao Zhou, Yuxin Zhang, Fei Chao, Rongrong Ji:
Learning Image Demoiréing from Unpaired Real Data. AAAI 2024: 7623-7631 - [c324]Tao Chen, Ze Lin, Hui Li, Jiayi Ji, Yiyi Zhou, Guanbin Li, Rongrong Ji:
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization. LREC/COLING 2024: 11429-11439 - [c323]You Huang, Zongyu Lan, Liujuan Cao, Xianming Lin, Shengchuan Zhang, Guannan Jiang, Rongrong Ji:
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything. CVPR 2024: 3120-3130 - [c322]Yian Zhao, Kehan Li, Zesen Cheng, Pengchong Qiao, Xiawu Zheng, Rongrong Ji, Chang Liu, Li Yuan, Jie Chen:
GraCo: Granularity-Controllable Interactive Segmentation. CVPR 2024: 3501-3510 - [c321]Jingjing Xie, Yuxin Zhang, Mingbao Lin, Zhihang Lin, Liujuan Cao, Rongrong Ji:
UniPTS: A Unified Framework for Proficient Post-Training Sparsity. CVPR 2024: 5746-5755 - [c320]Lirui Zhao, Yue Yang, Kaipeng Zhang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Rongrong Ji:
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model. CVPR 2024: 6390-6399 - [c319]Yunhang Shen, Chaoyou Fu, Peixian Chen, Mengdan Zhang, Ke Li, Xing Sun, Yunsheng Wu, Shaohui Lin, Rongrong Ji:
Aligning and Prompting Everything All at Once for Universal Visual Perception. CVPR 2024: 13193-13203 - [c318]Jinxia Xie, Bineng Zhong, Zhiyi Mo, Shengping Zhang, Liangtao Shi, Shuxiang Song, Rongrong Ji:
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers. CVPR 2024: 19300-19309 - [c317]Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation. CVPR 2024: 26648-26658 - [c316]Xu Peng, Junwei Zhu, Boyuan Jiang, Ying Tai, Donghao Luo, Jiangning Zhang, Wei Lin, Taisong Jin, Chengjie Wang, Rongrong Ji:
PortraitBooth: A Versatile Portrait Model for Fast Identity-Preserved Personalization. CVPR 2024: 27070-27080 - [c315]Binghan Chen, Jianlong Hu, Xiawu Zheng, Wei Lin, Fei Chao, Rongrong Ji:
Functionally Similar Multi-Label Knowledge Distillation. ICASSP 2024: 7210-7214 - [c314]Jinyu He, Xiaowei Song, Xiaohan Yan, Nan Wang, Yuqi Miao, Zijian Jiang, Fei Chao, Yan Zhang, Shengchuan Zhang, Rongrong Ji:
GreedyAgent: Crafting Efficient Agents for Meta-learning from Learning Curves via Greedy Algorithm Selection. ICIC (LNAI 1) 2024: 488-499 - [c313]Yuxin Zhang, Lirui Zhao, Mingbao Lin, Yunyun Sun, Yiwu Yao, Xingjia Han, Jared Tanner, Shiwei Liu, Rongrong Ji:
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs. ICLR 2024 - [c312]Xingbin Liu, Jinghao Zhou, Tao Kong, Xianming Lin, Rongrong Ji:
Exploring Target Representations for Masked Autoencoders. ICLR 2024 - [c311]Yuexiao Ma, Huixia Li, Xiawu Zheng, Feng Ling, Xuefeng Xiao, Rui Wang, Shilei Wen, Fei Chao, Rongrong Ji:
AffineQuant: Affine Transformation Quantization for Large Language Models. ICLR 2024 - [c310]Yuxin Zhang, Yuxuan Du, Gen Luo, Yunshan Zhong, Zhenyu Zhang, Shiwei Liu, Rongrong Ji:
CaM: Cache Merging for Memory-efficient LLMs Inference. ICML 2024 - [c309]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. ICML 2024 - [c308]Xudong Li, Timin Gao, Runze Hu, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Jingyuan Zheng, Yunhang Shen, Ke Li, Yutao Liu, Pingyang Dai, Rongrong Ji:
Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity. ICML 2024 - [c307]Xudong Li, Runze Hu, Jingyuan Zheng, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Ke Li, Yunhang Shen, Yutao Liu, Pingyang Dai, Rongrong Ji:
Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment. ICML 2024 - [c306]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. ICML 2024 - [c305]Yuexiao Ma, Huixia Li, Xiawu Zheng, Feng Ling, Xuefeng Xiao, Rui Wang, Shilei Wen, Fei Chao, Rongrong Ji:
Outlier-aware Slicing for Post-Training Quantization in Vision Transformer. ICML 2024 - [c304]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models. ICML 2024 - [c303]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. ICML 2024 - [c302]Yunshan Zhong, Jiawei Hu, You Huang, Yuxin Zhang, Rongrong Ji:
ERQ: Error Reduction for Post-Training Quantization of Vision Transformers. ICML 2024 - [e13]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part I. Lecture Notes in Computer Science 14425, Springer 2024, ISBN 978-981-99-8428-2 [contents] - [e12]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part II. Lecture Notes in Computer Science 14426, Springer 2024, ISBN 978-981-99-8431-2 [contents] - [e11]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part III. Lecture Notes in Computer Science 14427, Springer 2024, ISBN 978-981-99-8434-3 [contents] - [e10]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part IV. Lecture Notes in Computer Science 14428, Springer 2024, ISBN 978-981-99-8461-9 [contents] - [e9]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part V. Lecture Notes in Computer Science 14429, Springer 2024, ISBN 978-981-99-8468-8 [contents] - [e8]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part VI. Lecture Notes in Computer Science 14430, Springer 2024, ISBN 978-981-99-8536-4 [contents] - [e7]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part VII. Lecture Notes in Computer Science 14431, Springer 2024, ISBN 978-981-99-8539-5 [contents] - [e6]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part VIII. Lecture Notes in Computer Science 14432, Springer 2024, ISBN 978-981-99-8542-5 [contents] - [e5]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part IX. Lecture Notes in Computer Science 14433, Springer 2024, ISBN 978-981-99-8545-6 [contents] - [e4]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part X. Lecture Notes in Computer Science 14434, Springer 2024, ISBN 978-981-99-8548-7 [contents] - [e3]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part XI. Lecture Notes in Computer Science 14435, Springer 2024, ISBN 978-981-99-8551-7 [contents] - [e2]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part XII. Lecture Notes in Computer Science 14436, Springer 2024, ISBN 978-981-99-8554-8 [contents] - [e1]Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part XIII. Lecture Notes in Computer Science 14437, Springer 2024, ISBN 978-981-99-8557-9 [contents] - [i260]Yunshan Zhong, Yuyao Zhou, Yuxin Zhang, Fei Chao, Rongrong Ji:
Learning Image Demoireing from Unpaired Real Data. CoRR abs/2401.02719 (2024) - [i259]Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Chang Liu, Xiawu Zheng, Rongrong Ji, Jie Chen:
Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation. CoRR abs/2401.09732 (2024) - [i258]Yunpeng Gong, Zhun Zhong, Zhiming Luo, Yansong Qu, Rongrong Ji, Min Jiang:
Cross-Modality Perturbation Synergy Attack for Person Re-identification. CoRR abs/2401.10090 (2024) - [i257]Xudong Li, Jingyuan Zheng, Runze Hu, Yan Zhang, Ke Li, Yunhang Shen, Xiawu Zheng, Yutao Liu, Shengchuan Zhang, Pingyang Dai, Rongrong Ji:
Feature Denoising Diffusion Model for Blind Image Quality Assessment. CoRR abs/2401.11949 (2024) - [i256]Yimin Xu, Nanxi Gao, Zhongyun Shan, Fei Chao, Rongrong Ji:
Unified-Width Adaptive Dynamic Network for All-In-One Image Restoration. CoRR abs/2401.13221 (2024) - [i255]Song Guo, Fan Wu, Lei Zhang, Xiawu Zheng, Shengchuan Zhang, Fei Chao, Yiyu Shi, Rongrong Ji:
EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs. CoRR abs/2402.12419 (2024) - [i254]Hui Lin, Zhiheng Ma, Rongrong Ji, Yaowei Wang, Zhou Su, Xiaopeng Hong, Deyu Meng:
Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling. CoRR abs/2402.15297 (2024) - [i253]Gen Luo, Yiyi Zhou, Yuxin Zhang, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models. CoRR abs/2403.03003 (2024) - [i252]Xiaobin Hu, Xu Peng, Donghao Luo, Xiaozhong Ji, Jinlong Peng, Zhengkai Jiang, Jiangning Zhang, Taisong Jin, Chengjie Wang, Rongrong Ji:
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation. CoRR abs/2403.06168 (2024) - [i251]Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji:
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization. CoRR abs/2403.06702 (2024) - [i250]Jinxia Xie, Bineng Zhong, Zhiyi Mo, Shengping Zhang, Liangtao Shi, Shuxiang Song, Rongrong Ji:
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers. CoRR abs/2403.10574 (2024) - [i249]Jianlong Hu, Xu Chen, Zhenye Gan, Jinlong Peng, Shengchuan Zhang, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Liujuan Cao, Rongrong Ji:
DMAD: Dual Memory Bank for Real-World Anomaly Detection. CoRR abs/2403.12362 (2024) - [i248]Yuexiao Ma, Huixia Li, Xiawu Zheng, Feng Ling, Xuefeng Xiao, Rui Wang, Shilei Wen, Fei Chao, Rongrong Ji:
AffineQuant: Affine Transformation Quantization for Large Language Models. CoRR abs/2403.12544 (2024) - [i247]Qiong Wu, Weihao Ye, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models. CoRR abs/2403.15226 (2024) - [i246]Zhongxi Chen, Ke Sun, Ziyin Zhou, Xianming Lin, Xiaoshuai Sun, Liujuan Cao, Rongrong Ji:
DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis. CoRR abs/2403.18471 (2024) - [i245]Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
Deep Instruction Tuning for Segment Anything Model. CoRR abs/2404.00650 (2024) - [i244]Lirui Zhao, Yue Yang, Kaipeng Zhang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Rongrong Ji:
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model. CoRR abs/2404.01342 (2024) - [i243]Yongdong Luo, Haojia Lin, Xiawu Zheng, Yigeng Jiang, Fei Chao, Jie Hu, Guannan Jiang, Songan Zhang, Rongrong Ji:
Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization. CoRR abs/2404.11064 (2024) - [i242]Dingming Liu, Shaowei Li, Ruoyan Zhou, Lili Liang, Yongguan Hong, Fei Chao, Rongrong Ji:
ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model. CoRR abs/2404.12903 (2024) - [i241]Chi Huang, Xinyang Li, Shengchuan Zhang, Liujuan Cao, Rongrong Ji:
NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation. CoRR abs/2404.13921 (2024) - [i240]Wensheng Pan, Timin Gao, Yan Zhang, Runze Hu, Xiawu Zheng, Enwei Zhang, Yuting Gao, Yutao Liu, Yunhang Shen, Ke Li, Shengchuan Zhang, Liujuan Cao, Rongrong Ji:
Multi-Modal Prompt Learning on Blind Image Quality Assessment. CoRR abs/2404.14949 (2024) - [i239]Mingbao Lin, Zhihang Lin, Wengyi Zhan, Liujuan Cao, Rongrong Ji:
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method. CoRR abs/2404.15141 (2024) - [i238]Timin Gao, Peixian Chen, Mengdan Zhang, Chaoyou Fu, Yunhang Shen, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Xing Sun, Liujuan Cao, Rongrong Ji:
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM. CoRR abs/2404.16033 (2024) - [i237]Ziyue Zhang, Mingbao Lin, Rongrong Ji:
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion. CoRR abs/2404.17230 (2024) - [i236]Yian Zhao, Kehan Li, Zesen Cheng, Pengchong Qiao, Xiawu Zheng, Rongrong Ji, Chang Liu, Li Yuan, Jie Chen:
GraCo: Granularity-Controllable Interactive Segmentation. CoRR abs/2405.00587 (2024) - [i235]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. CoRR abs/2405.00954 (2024) - [i234]Zhihang Lin, Mingbao Lin, Luxi Lin, Rongrong Ji:
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference. CoRR abs/2405.05803 (2024) - [i233]Xinyang Li, Zhangyu Lai, Linning Xu, Jianfei Guo, Liujuan Cao, Shengchuan Zhang, Bo Dai, Rongrong Ji:
Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion. CoRR abs/2405.09874 (2024) - [i232]Yansong Qu, Shaohui Dai, Xinyang Li, Jianghang Lin, Liujuan Cao, Shengchuan Zhang, Rongrong Ji:
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane. CoRR abs/2405.17596 (2024) - [i231]You Huang, Zongyu Lan, Liujuan Cao, Xianming Lin, Shengchuan Zhang, Guannan Jiang, Rongrong Ji:
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything. CoRR abs/2405.18706 (2024) - [i230]Jingjing Xie, Yuxin Zhang, Mingbao Lin, Zhihang Lin, Liujuan Cao, Rongrong Ji:
UniPTS: A Unified Framework for Proficient Post-Training Sparsity. CoRR abs/2405.18810 (2024) - [i229]Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun:
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis. CoRR abs/2405.21075 (2024) - [i228]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Xiaopeng Hong, Yongjian Wu, Rongrong Ji:
Image Captioning via Dynamic Path Customization. CoRR abs/2406.00334 (2024) - [i227]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. CoRR abs/2406.01451 (2024) - [i226]Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. CoRR abs/2406.05620 (2024) - [i225]Chenyu Zhou, Mengdan Zhang, Peixian Chen, Chaoyou Fu, Yunhang Shen, Xiawu Zheng, Xing Sun, Rongrong Ji:
VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models. CoRR abs/2406.10228 (2024) - [i224]Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, Rongrong Ji:
AnyTrans: Translate AnyText in the Image with Large Scale Models. CoRR abs/2406.11432 (2024) - [i223]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in LVLMs. CoRR abs/2406.16449 (2024) - [i222]Xin Chen, Jie Hu, Xiawu Zheng, Jianghang Lin, Liujuan Cao, Rongrong Ji:
Depth-Guided Semi-Supervised Instance Segmentation. CoRR abs/2406.17413 (2024) - [i221]Xinyang Li, Zhangyu Lai, Linning Xu, Yansong Qu, Liujuan Cao, Shengchuan Zhang, Bo Dai, Rongrong Ji:
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text. CoRR abs/2406.17601 (2024) - [i220]Wenhao Li, Mingbao Lin, Yunshan Zhong, Shuicheng Yan, Rongrong Ji:
UIO-LLMs: Unbiased Incremental Optimization for Long-Context LLMs. CoRR abs/2406.18173 (2024) - [i219]Timin Gao, Wensheng Pan, Yan Zhang, Sicheng Zhao, Shengchuan Zhang, Xiawu Zheng, Ke Li, Liujuan Cao, Rongrong Ji:
Local Manifold Learning for No-Reference Image Quality Assessment. CoRR abs/2406.19247 (2024) - [i218]Liujuan Cao, Jianghang Lin, Zebo Hong, Yunhang Shen, Shaohui Lin, Chao Chen, Rongrong Ji:
HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection. CoRR abs/2406.19394 (2024) - [i217]You Huang, Wenbin Lai, Jiayi Ji, Liujuan Cao, Shengchuan Zhang, Rongrong Ji:
HRSAM: Efficiently Segment Anything in High-Resolution Images. CoRR abs/2407.02109 (2024) - [i216]Bang Li, Donghao Luo, Yujie Liang, Jing Yang, Zengmao Ding, Xu Peng, Boyuan Jiang, Shengwei Han, Dan Sui, Peichao Qin, Pian Wu, Chaoyang Wang, Yun Qi, Taisong Jin, Chengjie Wang, Xiaoming Huang, Zhan Shu, Rongrong Ji, Yongge Liu, Yunsheng Wu:
Oracle Bone Inscriptions Multi-modal Dataset. CoRR abs/2407.03900 (2024) - [i215]Wengyi Zhan, Mingbao Lin, Chia-Wen Lin, Rongrong Ji:
AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource. CoRR abs/2407.04241 (2024) - [i214]Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model. CoRR abs/2407.05352 (2024) - [i213]Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Multi-branch Collaborative Learning Network for 3D Visual Grounding. CoRR abs/2407.05363 (2024) - [i212]Yunshan Zhong, Jiawei Hu, You Huang, Yuxin Zhang, Rongrong Ji:
ERQ: Error Reduction for Post-Training Quantization of Vision Transformers. CoRR abs/2407.06794 (2024) - [i211]Zhihang Lin, Mingbao Lin, Meng Zhao, Rongrong Ji:
AccDiffusion: An Accurate Method for Higher-Resolution Image Generation. CoRR abs/2407.10738 (2024) - [i210]