


International Journal of Multimedia Information Retrieval, Volume 14
Volume 14, Number 1, March 2025
- Qiuhong Tian, Weilun Miao, Lizao Zhang, Ziyu Yang, Yang Yu, Yanying Zhao, Lan Yao:
STCA: an action recognition network with spatio-temporal convolution and attention. 1
- Fan Yang, Nor Azman Ismail, Pang Yee Yong, Alhuseen Omar Alsayed:
CAMIR: fine-tuning CLIP and multi-head cross-attention mechanism for multimodal image retrieval with sketch and text features. 2
- Hao Wen, Ziqian Lu, Fengli Shen, Zheming Lu, Jia-Lin Cui:
Improving skeleton-based action recognition with interactive object information. 3
- Ziyong Lin, Xiaolong Jiang, Jie Zhang, Mingyong Li:
Dual-matrix guided reconstruction hashing for unsupervised cross-modal retrieval. 4
- Hao Chen, Wu Huang, Tao Zhang:
Optimized RT-DETR for accurate and efficient video object detection via decoupled feature aggregation. 5
- Zhong Ji, Yuanheng Liu, Xuan Wang, Jingren Liu, Jiale Cao, YunLong Yu:
Multi-task classification network for few-shot learning. 6
- Changqin Huang, Zhenheng Lin, Zhongmei Han, Qionghao Huang, Fan Jiang, Xiaodi Huang:
PAMoE-MSA: polarity-aware mixture of experts network for multimodal sentiment analysis. 7
- Digambar Pawar, Raghavendra Gowda, Krishna Chandra:
Image forgery classification and localization through vision transformers. 8
- Lixia Xue, Jiang Dong, Ronggui Wang, Juan Yang:
MFAFD: a few-shot learning method for cascading models with parameter free attention and finite discrete space. 9
- Qiang Zhang, Qin Shi, Teng Cheng, Junning Zhang, Jiong Chen:
VPC-VoxelNet: multi-modal fusion 3D object detection networks based on virtual point clouds. 10
Volume 14, Number 2, June 2025
- Weichen Zhao, Yuxing Lu, Zhiyuan Liu, Yuan Yang, Ge Jiao:
Cross-modal alignment with synthetic caption for text-based person search. 11
- Hemraj Singh, Mridula Verma, Ramalingaswamy Cheruku:
DMFNet: geometric multi-scale pixel-level contrastive learning for video salient object detection. 12
- Manh-Duy Nguyen, Binh T. Nguyen, Cathal Gurrin:
Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance. 13
