


default search action
Multimedia Systems, Volume 31
Volume 31, Number 1, February 2025
- Yanyan Jiang, Yongping Huang, Haipeng Chen, Yingda Lyu:

Multi-modality boundary-guided network for generalizable image manipulation localization. 1 - Kanglin Wang, Qingxuan Shi, Xiaoyang Li

, Enyi Wu, Zifan Li:
Optimizing codebook training through control chart analysis. 2 - Liang Zhang, Shifeng Li, Yan Cheng, Xi Luo, Xiaoru Liu:

Learning dual updatable memory modules for video anomaly detection. 3 - Quy Hoang Nguyen, Minh-Van Truong Nguyen, Kiet Van Nguyen:

New benchmark dataset and fine-grained cross-modal fusion framework for Vietnamese multimodal aspect-category sentiment analysis. 4 - Jinghua Li

, Zhuowei Bai, Dehui Kong, Dongpan Chen, Qianxing Li, Baocai Yin:
3d human pose estimation based on conditional dual-branch diffusion. 5 - Hongyan Li, Ziyang Zhang, Zhaoming Hao, Baoqing Xu, Weifeng Wang, Jing Sun:

PAR-mono: monocular video depth estimation network based on channel separation and dynamic attention. 6 - Liyan Xiong, Zhida Li, Xiaohui Huang, Heng Wang:

CSFNet: A novel counting network based on context features and multi-scale information. 7 - Sifeng Zhu, Duan Haowei, Yao Yaxing, Chen Hao, Hai Zhu:

Improved NSGA-II algorithm-based task offloading decision in the internet of vehicles edge computing scenario. 8 - Weina Zhou

, Xianglin Gao:
Sat-DehazeGAN: an efficient dehazing model in water-sky background for river-sea transport. 9 - Jiachang Sun

, Fuxian Zhu
:
Multilayer interactive attention bottleneck transformer for aspect-based multimodal sentiment analysis. 10 - Keyang Cheng, Liutao Wei, Jingfeng Tang, Yongzhao Zhan:

Constraint embedding for prompt tuning in vision-language pre-trained model. 11 - Qian Zhang, Shasha Li, Mingwen Shao:

DS-Diff: a dual-stage network with degradation-aware and semantic-aware for adverse weather removal based on diffusion models. 12 - Junxian Wu, Yujia Zhang, Michael Kampffmeyer, Yi Pan, Chenyu Zhang, Shiying Sun, Hui Chang, Xiaoguang Zhao:

HierGAT: hierarchical spatial-temporal network with graph and transformer for video HOI detection. 13 - Fuqun Zhao, He Huang, Wenxiang Hu:

An optimized hierarchical point cloud registration algorithm. 14 - Wei Liu, Yurong Zheng, Zhihui Xiang, Yingmeng Wang, Zhao Tian

, Wei She:
An efficient federated learning method based on enhanced classification-GAN for medical image classification. 15 - Junyi Wang, Zexin Guo, Dewei Yi

, Yining Hua, Qinggang Meng:
Enhanced multi-branch learning for long-tailed image recognition. 16 - Wenhui Lian, Xinwu Liu

, Yue Chen:
Non-convex fractional-order TV model for image inpainting. 17 - Lingyao Jia, Bingbing Zhang, Peihua Li:

Stochastic stylization transformer with self-supervision for iris recognition. 18 - Yunbo Gu, Qianyu Wu, Junting Zou, Baosheng Li, Xiaoli Mai, Yudong Zhang, Yang Chen:

Multi-modal clear cell renal cell carcinoma grading with the segment anything model. 19 - Haibo Zhang, Xizhi Wang, Haoran Sun, Yiwei Sun, Yanan Jin, Ruoxue Li

, Guohua Geng:
CR-DM: A novel craniofacial reconstruction framework based on diffusion model. 20 - Yu Cao, Ran Ma, KaiFan Zhao, Ping An:

WFIL-NET: image inpainting based on wavelet downsampling and frequency integrated learning module. 21 - Anqi Shi, Xin Shu, Dan Xu, Fang Wang:

GCMR-Net: A Global Context-Enhanced Multi-scale Residual Network for medical image segmentation. 22 - Yuantao Wang, Yuanyao Lu, Yongsheng Qiu:

Gated image-adaptive network for driving-scene object detection under nighttime conditions. 23 - Huilin Wang, Huaming Qian:

SR-DAYOLOv8: cross-domain adaptive object detection based on super-resolution domain classifier. 24 - Zhongxu Li, Qihan He, Lingfei Ren, Wenyong Yao, Wenyuan Yang:

PCAF: UAV scenarios detector via pyramid converge-and-assign fusion network. 25 - Zhouwang Zheng, Weiwei Yu:

RG-YOLO: multi-scale feature learning for underwater target detection. 26 - Qin Guo, Xiangchao Feng, Peng Xue, Shoujun Sun, Xiangrong Li:

Dual-domain multi-scale feature extraction for image dehazing. 27 - Xiaonuo Dongye, Dongdong Weng, Haiyan Jiang, Zeyu Tian, Yihua Bao, Pukun Chen:

Personalized decision-making for agents in face-to-face interaction in virtual reality. 28 - Weijian Hu, Yinyin Xu, Ke Han, Lingfang Li, Jiang Wang:

TSPLNet: a three-stage progressive lightweight network for shadow removal. 29 - Yumeng Zhang, Kaixing Fan, Ying Yu:

Research on passengers behavior recognition method in public transport vehicles based on efficient 3D CNN. 30 - Chen He, Shenshen Li, Zheng Wang, Hua Chen, Fumin Shen, Xing Xu:

Chatting with interactive memory for text-based person retrieval. 31 - Guanxiao Li, Ke Zhang, Yu Su, Jingyu Wang:

Aggregating multi-scale flow-enhanced information in transformer for video inpainting. 32 - Md Shamim Hossain

, Shamima Aktar, Weiyong Liu, Naijie Gu, Zhangjin Huang:
IGINet: integrating geometric information to enhance inter-modal interaction for fine-grained image captioning. 33 - Jixin Liu, Sufang Yao, Haigen Yang, Ning Sun:

Detection of typical abnormal behavior in home-based elderly care based on ViT-iECGAN significant information migration compensation. 34 - Rui Liu, Sicong Zhang, Yang Xu, Weida Xu

, Xinlong He:
High-resolution network-based multi-feature fusion for generalized forgery detection. 35 - Penghao Li, Huanjie Tao, Hui Zhou, Ping Zhou, Yishi Deng:

Enhanced Multiview attention network with random interpolation resize for few-shot surface defect detection. 36 - Jindong Ma, Haitao Zhang

:
HDR-DANet: single HDR image reconstruction via dual attention. 37 - Wu Zeng, Zhengying Xiao:

Enhancing long-tailed classification via multi-strategy weighted experts with hybrid distillation. 38 - Jian Zheng, Shumiao Ren, Jingyue Zhang, Shiyan Wang, Lin Li:

Binary classification for imbalanced data using data conformity mechanism. 39 - Feng Hou, Yao Zhang, Yang Liu, Jin Yuan, Cheng Zhong, Yang Zhang, Zhongchao Shi, Jianping Fan, Zhiqiang He:

Gradient-aware domain-invariant learning for domain generalization. 40 - Xiangchun Yu, Huofa Liu, Dingwen Zhang

, Miaomiao Liang, Lingjuan Yu, Jian Zheng:
Ground truth is the best teacher: supervised semantic segmentation inspired by knowledge transfer mechanisms. 41 - Jinlong Qu, Qi Li, Jie Pan, Mingzheng Sun, Xingzheng Lu, Ying Zhou, Hongliang Zhu:

SS-YOLOv8: small-size object detection algorithm based on improved YOLOv8 for UAV imagery. 42 - Xue Li, Chunhua Zhu, Fei Zhou, Huawei Tao:

Facial expression recognition via joint loss constraining attention-modulated contextual spatial information network. 43 - Jiawei Ding, Zhiyi Tan, Guanming Lu, Jinsheng Wei:

Adaptive discriminant feature learning for GNN-based session recommendation. 44 - Modafar Al-Shouha, Gábor Szücs:

ReDiT: re-evaluating large visual question answering model confidence by defining input scenario difficulty and applying temperature mapping. 45 - Cunjuan Zhu, Yanyi Zhang, Qi Jia, Weimin Wang, Yu Liu:

Temporal refinement and multi-grained matching for moment retrieval and highlight detection. 46 - Haisheng Li, Rongrong Yuan, Qiuyi Li, Cong Hu:

Research on image captioning using dilated convolution ResNet and attention mechanism. 47 - Yufan Hu, Yi Zhang, Lixin Zhang:

Long-tailed video recognition via majority-guided diffusion model. 48 - Zongmin Li, Yachuan Li, Xavier Soria P., Chaozhi Yang, Qian Xiao, Yun Bai, Hua Li, Xiangdong Wang:

Compact twice fusion network for edge detection. 49 - Jun Tang, Enxue Ma, Yang Qu, Wenbo Gao, Yuchen Zhang, Lin Gan:

UAPT: an underwater acoustic target recognition method based on pre-trained Transformer. 50 - Chenglong Shao, Tongzhen Si, Xiaohui Yang:

Exploring granularity-associated invariance features for text-to-image person re-identification. 51 - Maojin Sun:

A method for solving the multiple degradation video quality enhancement problem: a processing framework for AI-based coding damage repair in concert with video super-resolution. 52 - Dahong Xu, Siyu Jiang, Yihan Zhang, Xi Li:

Psychological analysis of house-tree-person drawings based on multimodal large models. 53 - Tingyu Wang, Rui Zhai, Longge Wang, Junyang Yu, Han Li, Zhicheng Wang, Jinhu Wu:

Multi-scale attention and loss penalty mechanism for multi-view clustering. 54 - Ronqi Wang, Ronguo Zhang, Jing Hu, Rui Zhang, Lifang Wang, Xiaojun Liu:

Position-aware feature matching algorithm based non-rigid point cloud registration. 55 - Miaohui Zhang, Chenxing Shen, Yangyang Deng, Li Wang:

Camouflaged object detection via boundary refinement. 56 - Ziyi Miao, Lan Yao, Feng Zeng, Yi Wang, ZhiGuo Hong:

An effective retrieval model for home textile images based on deep feature extraction. 57 - Weihua Ou, Yingjie Chen, Linqing Liang, Jianping Gou, Jiahao Xiong, Jiacheng Zhang, Lingge Lai, Lei Zhang:

Cross-modal retrieval of chest X-ray images and diagnostic reports based on report entity graph and dual attention. 58 - Dingyu Lu, Zihou Liu, Dongming Zhang, Jing Zhang, Guoqing Jin:

Spatial-temporal transformer network for protecting person-of-interest from deepfaking. 59 - Zhenguang Wang, Huanjie Tao, Hui Zhou, Yishi Deng, Ping Zhou:

A content-style control network with style contrastive learning for underwater image enhancement. 60 - Danyang Cao, Hongbo Zhou, Yongfu Wang:

Improve the image caption generation on out-domain dataset by external knowledge augmented. 61 - Xiaowen Shi, Chao Zhou

, Yuan-Gen Wang:
Generative adversarial defense via conditional diffusion model. 62 - Jieyu An, Binfen Ding, Wan Mohd Nazmee Wan Zainon:

Improving multimodal sentiment prediction through vision-language feature interaction. 63 - Kangkang Xu, Wen Han, Yixiang Fang, Yi Zhao, Jun Li, Junxiang Wang:

A robust image watermarking framework based on U2-net encoder and loss function weight assignment. 64 - Zhixue Liang, Wenyong Dong, Bo Zhang:

CLIP-TSA: CLIP-guided open-vocabulary semantic segmentation with two-level semantic awareness. 65 - Liangtai Zhou, Weiwei Zhang, Banghui Zhang, Xiaobin Li, Jianqing Zhu:

A strong benchmark for yoga action recognition based on lightweight pose estimation model. 66 - Xinmin Cheng, Maoke Ran, Benyao Chen, Hongwei Yin

:
Image channel and spatial information integrated method for fall detection. 67 - Mingqi Liu, Zhixin Li:

A dissimilarity feature-driven decomposition network for multimodal sentiment analysis. 68 - Xianghua Kong, Ning Xu, Zefang Sun, Zhewen Shen, Bolun Zheng, Chenggang Yan, Jinbo Cao, Rongbao Kang, An-An Liu:

Counterfactual GAN for debiased text-to-image synthesis. 69 - Carlos Marín-Lora, Miguel Chover:

GameScript: a simplified scripting language for video game development. 70 - Pengqi Yin:

Visual-textual adversarial learning for person re-identification. 71 - Guangsheng Luo, ZhiJun Fang, JianLing Liu, YiFanBai Bai:

CLIP guided image caption decoding based on monte carlo tree search. 72 - Wei Zhang, Hongjie Li, Wei Ke:

LF-GIANet: cascaded global-view information adaptation-guided network for light field image super-resolution. 73 - Jiwu Sun, Cheng Xu

, Cheng Zhang, Yujia Zheng, Pengfei Wang, Hongzhe Liu:
Flood scenarios vehicle detection algorithm based on improved YOLOv9. 74 - Lingtao Wang, Yong Hu:

Topic-guided multi-domain fake news detection. 75 - Tongchi Zhou, Hongyu He, Yanzhao Wang, Yuan Liao:

Improved gated recurrent units together with fusion for semantic segmentation of remote sensing images based on parallel hybrid network. 76 - Sujuan Li, Gengsheng Xie

:
Relation-aware non-local attention network for person re-identification. 77 - Haomou Bai:

Inception-like Large Kernel network for lightweight image super-resolution. 78 - Zerui Xu, Dechao Chen, Wenyan Gong:

UMSSNet: a unified multi-scale segmentation network for heterogeneous medical images. 79 - Zhanyang Liang, Yan Wo:

From coarse to fine: a two-stage common semantic space construction for unpaired cross modal retrieval. 80 - Junyin Peng, Hong Tang, Wenbin Zheng:

Hierarchical heterogeneous graph network based multimodal emotion recognition in conversation. 81 - Jiliang Wang, Cancan Jin, Siwang Zhou:

Segmentation-aware image super-resolution with generative adversarial networks. 82 - Xiangchun Yu, Huofa Liu, Dingwen Zhang

, Jianqing Wu, Jian Zheng:
Hierarchical Region-level Decoupling Knowledge Distillation for semantic segmentation. 83 - Feng Xue, Peng Li, Yu Li, Shujie Li:

WPELip: enhance lip reading with word-prior information. 84 - Zhiyong Xiao, Yang Li, Zhaohong Deng:

Food image segmentation based on deep and shallow dual-branch network. 85 - Jue Tian, Lele Guan, Yang Liu, Le Zhang, Yanping Chen:

Deepphysio: detecting deepFake with non-personalized feature of physiological signal. 86 - Andrea Morales-Garzón, Karel Gutiérrez-Batista, María J. Martín-Bautista:

Adaptafood: an intelligent system to adapt recipes to specialised diets and healthy lifestyles. 87 - Saif Ur Rehman Khan

, Sohaib Asif
, Ming Zhao, Wei Zou, Yangfan Li:
Optimize brain tumor multiclass classification with manta ray foraging and improved residual block techniques. 88 - Yubo Zhang, Liying Zheng, Qingming Huang:

Multi-object tracking based on graph neural networks. 89 - Libo Cheng

, Wenlin Du
, Zhe Li
, Xiaoning Jia
:
AFEV-INet: adaptive feature extraction variational interactive network for remote sensing image denoising. 90 - Juan Yang, Yuhang Wei, Ronggui Wang, Lixia Xue:

VTIENet: visual-text information enhancement network for image captioning. 91 - Xiangwei Chen, Chenghai Yu:

SCG-DETR: a high-precision railway turnout defect detection method based on attention feature fusion and SMP-CGLU approach. 92 - Penglei Wang, Xin Fan

, Qimeng Yang, Shengwei Tian, Long Yu:
Object detection of mural images based on improved YOLOv8. 93 - Gaili Li, Yongna Yuan, Ruisheng Zhang:

A spatial-temporal graph attention network for protein-ligand binding affinity prediction based on molecular geometry. 94 - Dehua Ma, Xiaoliang Zhu, Yanxiang Li, Wenzhe Meng, Siping Xu:

RANet: A receptive aggregation network for polyp segmentation. 95 - Guangyong Gao, Xiaoan Chen, Li Li:

SSRH: screen-shooting robust hyperlink based on deep learning. 96 - Jihua Ye, Youcai Zou, Zhixiong Wang, Tiantian Wang, Chao Wang, Wentao Wan:

ADMF-ER: a novel approach for wild expression recognition integrating adaptive dropout and multi-level features. 97
Volume 31, Number 2, April 2025
- Liang Yang, Qi Yang, Jingjie Zeng, Tao Peng, Zhihao Yang, Hongfei Lin:

Dialogue sentiment analysis based on dialogue structure pre-training. 98 - Haozhe Tang, Lei Yu, Yu Shao:

MARCFusion: adaptive residual cross-domain fusion network for medical image fusion. 99 - Aizhong Mi, Xianru Huang, Zhanqiang Huo, Luyao Liu:

Context-aware learning and background activation suppression for weakly supervised semantic segmentation. 100 - Ang Li, Xinghao Yang, Baodi Liu, Honglong Chen, Dapeng Tao, Weifeng Liu:

Parentheses insertion based sentence-level text adversarial attack. 101 - Ruinian Shi, Qiang He, Hengyou Wang, Changlun Zhang:

FDC-Net: foreground dynamic capture with deep feature enhancement for video anomaly detection. 102 - Liman Jiang, Canlong Zhang, Lei Wu, Zhixin Li, Zhiwen Wang, Chunrong Wei:

Joint feature augmentation and posture label for cloth-changing person re-identification. 103 - Di Wu, Yuying Zheng, Peng Cheng:

Co-interaction for intent recognition and slot filling with global-local-spatial-temporal feature fusion. 104 - Zijie Song

, Zhenzhen Hu
, Richang Hong
:
Grid Jigsaw Representation with CLIP: a new perspective on image clustering. 105 - Huy Quang Pham, Thang Kien-Bao Nguyen, Quan Van Nguyen, Dan Quang Tran, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen:

ViOCRVQA: novel benchmark dataset and VisionReader for visual question answering by understanding Vietnamese text in images. 106 - Huan Pan, Ruiya Ji, Wenming Cao, Zhao Huang

, Jianqi Zhong:
Optimizing human motion prediction through decoupled motion spatio-temporal trends. 107 - Xiaowen Ruan, Zhaobo Qi, Yuanrong Xu, Weigang Zhang:

Dual-guided multi-modal bias removal strategy for temporal sentence grounding in video. 108 - Jian Gao, Yuhe Zhang, Jinghao Hu, Tong Yang, Pengbo Zhou, Wen Tang, Wuyang Shui, Guohua Geng:

IOPCNet: inner and outer point classification based low overlap rate local-to-global point cloud registration. 109 - Yang Xuan, Xiao-Yu Zhang

, Chen Li, Hui Wang, Chaoxu Mu:
LAM-YOLOv11 for UAV transmission line inspection: overcoming environmental challenges with enhanced detection efficiency. 110 - Zhengyang Lu, Ying Chen:

Self-supervised monocular depth estimation via multiple bilateral consistency. 111 - Jinxia Yu, Fabao Xue, Zhanqiang Huo, Yingxu Qiao:

Combining implicit and explicit priors for zero-reference low-light image enhancement and denoising. 112 - Fatemeh Shafizadegan, Ahmad Reza Naghsh-Nilchi, Elham Shabaninia:

Hybrid embedding for multimodal few-frame action recognition. 113 - Jingcheng Zhang, Yu Zhu, Shengjun Peng, Axi Niu, Qingsen Yan, Jinqiu Sun, Yanning Zhang:

A multi-scale feature cross-dimensional interaction network for stereo image super-resolution. 114 - Tianyu Hong, Guowei Teng, Ping An, Liquan Shen:

Spherical rotation for high efficiency ERP 360-degree video coding. 115 - Zixu Hu, Zhengtao Yu, Junjun Guo:

Multi-level sentiment-aware clustering for denoising in multimodal sentiment analysis with ASR errors. 116 - Shihui Zhang, Xueqiang Han, Zhiguo Cui, Sheng Zhan, Qing Tian:

Fast-colorfool: faster and more transferable semantic adversarial attack with complementary colors and cumulative perturbation. 117 - Wei Song, Dong Li:

Region attention and label embedding for facial action unit detection. 118 - ShaoDong Cui, Kaibo Duan, Wen Ma, Hiroyuki Shinnou:

CCGN: consistency contrastive-learning graph network for multi-modal fake news detection. 119 - Yifei Yang, Zhengyong Feng, Wei Jin, Pengcheng Miao:

ADD-YOLO: a new model for object detection in aerial images. 120 - Muhammad Anwar, Zhiyue Yan, Wenming Cao, Naeem Hussain:

STHRA: selective transformer hierarchical reciprocal attention-based deformable medical image registration. 121 - Xuezhi Xiang, Xiankun Zhou, Xinyao Wang, Mingliang Zhai, Abdulmotaleb El-Saddik:

Multi-object tracking with scale-aware transformer and enhanced association strategy. 122 - Qiong Hu, Masrah Azrifah Azmi Murad

, Qi Li:
Advancing music emotion recognition: large-scale dataset construction and evaluator impact analysis. 123 - Fudong Nian, Yanhong Gu, Wentao Wang, Aoyu Liu, Dong Zhang, Fanding Li:

Rwkv-vg: visual grounding with RWKV-driven encoder-decoder framework. 124 - Zhengwei Jin, Yun Wei:

UMPA: Unified multi-modal prompt with adapter for vision-language models. 125 - Chun Zhang, Jin Wang, Yunhui Shi, Baocai Yin, Nam Ling:

A CNN-transformer hybrid network with selective fusion and dual attention for image super-resolution. 126 - Jing Lv, Zhi Liu, Gongyang Li:

Few-shot fine-tuning with auxiliary tasks for video anomaly detection. 127 - Xu Liu, Chenhua Liu, Xianye Zhou, Guodong Fan:

EATNet: edge-aware and transformer-based network for RGB-D salient object detection. 128 - Xin Chao, Xiaosha Qi, Ruiqi Ding, Genlin Ji:

Vehicle lane change behavior recognition based on multi-scale three-stream 3D ResNets. 129 - Ang Li, Xinghao Yang, Baodi Liu, Honglong Chen, Dapeng Tao, Weifeng Liu:

Correction: Parentheses insertion based sentence-level text adversarial attack. 130 - Zhiwei Tang, ShuWei Xu, Haozhe Jin, Shichong Liu, Rui Zhai, Ke Lu:

Personalized federated learning via decoupling self-knowledge distillation and global adaptive aggregation. 131 - Mingyang Lei, Hong Song, Tianyu Fu, Deqiang Xiao, Danni Ai, Jingfan Fan, Yifei Yang, Ying Gu, Jian Yang:

SEMNet: a simple and efficient MLP-based network for 3D Face point clouds landmarks localization. 132 - Lehao Rong, Liqing Huang:

Image deblurring algorithm based on unsupervised network and alternating optimization iterations. 133 - Caifeng Liu, Lianyu Hu

:
Rethinking the temporal downsampling paradigm for continuous sign language recognition. 134 - Zhiheng Gong, Huan Rong, Zhongfeng Chen, Yixiang Tang, Victor S. Sheng:

EDCM-EA: event prediction based on event development context mining considering event arguments. 135 - Dongliang Cao, Wang Ren, Changhong Yu, Bin Wu:

IFMOT: interactive perception and feature optimization network for multi-object tracking. 136 - Hongfei Liu, Ning He, Xunrui Huang, Runjie Li:

A video anomaly detection framework based on feature-strengthened and memory feature-ernhanced reconstruction. 137 - Yuxin Li, Hu Lu, Tingting Qin, Juanjuan Tu, Shengli Wu

:
CM-DASN: visible-infrared cross-modality person re-identification via dynamic attention selection network. 138 - Hui Chen, Rong Chen, Yushi Li, Haoran Li, Nannan Li:

Unsupervised single-image dehazing via self-guided inverse-retinex GAN. 139 - Boyuan Ma, Donglin Zhang, Xiao-Jun Wu:

Food nutrition estimation with RGB-D fusion module and bidirectional feature pyramid network. 140 - Kang Tong, Yiquan Wu:

Small object detection using hybrid evaluation metric with context decoupling. 141 - Qingsong Tang, Yalei Ren, Zhanghui Shan, Chenyang Bao, Yang Liu:

Dual-branch aggregation and edge refinement network for few shot semantic segmentation. 142 - Younghoon Lee:

Enhancing plant health classification via diffusion model-based data augmentation. 143 - Hanqi Jiang, Jinlong Shi, Yongjie Gao, Xin Shu, Suqin Bai, Qiang Qian, Dan Xu:

Psg-6d: prior-free implicit category-level 6D pose estimation with SO(3)-equivariant network and point cloud global enhancement. 144 - Yuzhen Niu, Siling Chen, Shanshan Chen, Fusheng Li

:
Progressive fusion of local and global image features for cross-modal image aesthetic assessment. 145 - Kuo Tan, Zhaobo Qi, Jianping Zhong, Yuanrong Xu, Weigang Zhang:

KN-VLM: KNowledge-guided Vision-and-Language Model for visual abductive reasoning. 146 - Jiacheng Zhao, Haojie Che, Yongxi Li

:
Spatial enhanced multi-level alignment learning for text-image person re-identification with coupled noisy labels. 147 - Haojie Che, Jiacheng Zhao, Yongxi Li

:
Multi-level fine-grained center calibration network for unsupervised person re-identification. 148 - Jiacheng Lu

, Kaiwen Wang, Hui Ding, Zhuhong Shao
, Rongyin Qin, Guoping Huo:
MSCA-Sp R-CNN: a segmentation algorithm for pneumonia small lesions integrating multi-scale channel attention and sub-pixel upsampling. 149 - Yang Liu

, Wenyi Zhu, Linyu Dong, Yuzhong Zhang, Xiang Guo:
Enhancing interpretability in video-based personality trait recognition using SHAP analysis. 150 - Yalin Song, Peng Qian, Kexin Zhang, Shichong Liu, Rui Zhai, Ran Song:

An improved Multi-Scale Fusion and Small Object Enhancement method for efficient pedestrian detection in dense scenes. 151 - Jing Wang, Xiaohong Li, Xuesong Dai, Shuo Zhuang

, Meibin Qi:
Contrastive learning-based joint pre-training for unsupervised domain adaptive person re-identification. 152 - Teerath Kumar, Alessandra Mileo, Malika Bendechache

:
Saliency-based metric and FaceKeepOriginalAugment: a novel approach for enhancing fairness and Diversity. 153 - Xinwang Chen, Fengrui Ji, Renxin Chu, Baolin Liu:

Data-free pruning of CNN using kernel similarity. 154 - Xuanrui Xiong, Haihong Huang, Tianyu Li, Xiaolin Fan, Yuan Zhang:

DSFAT: a dual-stream framework assisted by textual information for person re-identification in real scenes. 155 - Tianming Zhan, Chenyang Lu, Huapeng Wu, Chenyun Wang:

A novel gradient and semantic-aware transformer network for low-light image enhancement. 156 - Jian Shi, Rui Xu, Baoli Sun, Tiantian Yan, Zhi-Hui Wang, Haojie Li:

Structure-preserving dental plaque segmentation via dynamically complementary information interaction. 157 - Abeer Ayoub, Walid El-Shafai, Fathi E. Abd El-Samie, Ehab K. I. Hamad, S. El-Rabaie:

Video and image quality enhancement using an enhanced lower bound on transmission map dehazing technique. 158 - Haochen Zhang, Shuai Zhang:

Federated semi-supervised polyp image detection based on client feature alignment. 159 - Xuehua Song, Junxing Zhou, Hua Jin, Xin Yuan, Chang-da Wang:

Enhancing cross-modality person re-identification through attention-guided asymmetric feature learning. 160 - Yiming Xing, Jindong Zhang:

Residual channel prior-guided multi-scale progressive dehazing network with hybrid attention. 161 - Jianming Zhang

, Zhijian Feng, Jia Jiang, Xiangnan Shi, Jin Zhang:
RGB-Net: transformer-based lightweight low-light image enhancement network via RGB channel separation. 162 - Xianhui Nie, Yong Fang

, Xin Liu, Hao Li, Zi Wang, Longzhen Qiu:
Incorporating human attention shifting features for enhanced local dimming performance. 163 - Min Dai, Wenshan Zhang, Wenguang Zheng:

Digital stabilization method for old movies based on mobilesam and optical flow. 164 - Zhixiong Liu, Fang Liu, Mohan Zhang, Shenglan Cui:

ACIH-VQT: aesthetic constraints incorporated hierarchical VQ-transformer for text logo synthesis. 165 - Lanhui Liu

, Menglin Kong, Cong Cao
, Zhanjie Shu, Kecheng Liu, Xingquan Li
, Muzhou Hou:
Personalized music recommendation algorithm based on machine learning. 166 - Yang Liu, Xinyu Liu, Ling Zhao, Bo Mi:

Automatic extraction method for humming-to-Guzheng melody based on improved YIN algorithm. 167 - Dalang Liu, Yunbo Rao, Jialong Zhu, Yanjin Ma, Jie Li:

FSformer: fusing frequency and spatial domain transformer network for underwater image enhancement. 168 - Chiqin Li, Lun Xie, Xinheng Wang, Hang Pan

, Zhiliang Wang:
A disentanglement mamba network with a temporally slack reconstruction mechanism for multimodal continuous emotion recognition. 169 - Hichem Metmer

, Xiaoshan Yang:
FedMRG: federated medical report generation via text-aware learning rate adjustment and multi-level prototype collaboration. 170 - Hong Liang, Yu Li, Qian Zhang, Mingwen Shao:

Do-DETR: enhancing DETR training convergence with integrated denoising and RoI mechanism. 171 - Jianxin Wang, Haijian Shao, Xing Deng, Shuheng Lian:

Robust novel view synthesis from multi-view feature stereo matching priors. 172 - Wencong Zhang, Zhiyang Guo, Wengang Zhou, Houqiang Li:

AAGS: Appearance-Aware 3D Gaussian Splatting with Unconstrained Photo Collections. 173 - Linyu Huang

, Zijie Xue
, Qian Ning, Yong Guo, Yongsheng Li:
A guidance and alignment transformer model for visible-infrared person re-identification. 174 - Wen Li, Xiaoning Song, Wenjie Zhang, Yang Hua, Xiaojun Wu:

Link prediction via adversarial knowledge distillation and feature aggregation. 175 - Ming Fang, Qi Liu, Jianping Ren, Jie Li, Xinning Du, Shuhua Liu:

A three-stream fusion network for 3D skeleton-based action recognition. 176 - Ouafa Talha, Wenju Zhou, Naitong Yuan, Yuan Xu:

Improved YOLOv8-C2fCA for embryonic cell detection and counting. 177 - Chunman Yan, Huiling Li:

CAPNet: tomato leaf disease detection network based on adaptive feature fusion and convolutional enhancement. 178 - Eunsam Kim, Jinsung Kim, Choonhwa Lee:

Efficient time-extended TV viewing through hybrid data redundancy in networked appliances. 179 - Memoona Aziz, Umair Rehman, Muhammad Umair Danish

, Syed Ali, Amir Zaib Abbasi
:
Towards a unified evaluation framework: integrating human perception and metrics for AI-generated images. 180 - Zhenping Mou, Tianqi Song, Hong Luo:

Dual-visual collaborative enhanced transformer for image captioning. 181 - Dangguo Shao, Rui Xu, Lei Ma, Sanli Yi:

Tubular-aware mamba for accurate retinal vessel segmentation: preserving fine details and topological connectivity. 182 - Yongsheng Ye, Guoguang Tan, Qiang Liu, Liu Liu, Jiawei Chu, Bin Wen, Lili Li:

TSSSKD-YOLO: an intelligent classification and defect detection method of insulators on transmission lines by fusing knowledge distillation in multiple scenarios. 183
Volume 31, Number 3, June 2025
- Yu Zheng, Yuze Gao, Jingren Liu, Ning Yao, Zhong Ji:

Radlora: a smart low-rank adaptive approach for radiological image classification. 184 - Yuqi Wang, Cunhe Li:

Inter-class distance enhanced prototypical network for few-shot text classification. 185 - Chao Li, Caie Xu, Wujie Zhou:

FDFNet-S*: frequency domain fusion networks for RGB-D mirror segmentation by contrastive knowledge refinement. 186 - Sitong Chen, Yucheng Shu, Lihong Qiao, Zhengyang Wu

, Jing Ling, Jiang Wu, Weisheng Li:
3D point cloud semantic segmentation based on visual guidance and feature enhancement. 187 - Yufan Hu, Fang Zhang, Ran Wei, Junling Gao:

Learning semantic-unified cross-modal representations for open-vocabulary video scene graph generation. 188 - Zhenhua Li, Lei Zhang, Songlin Yin, Ge Zhang:

MSCFF-Net: multi-scale context feature fusion network for polyp segmentation. 189 - Mengxiang Wang, Guiyu Xia, Zhedong Jin, Paike Yang, Yubao Sun:

Geometric transformation supervised disentanglement of pose and expression for talking face generation. 190 - Xincai Lu, Zhanquan Sun, Chenjie Zou, Chun He, Xinping Hu:

Class-adaptive attention transfer and multilevel entropy decoupled knowledge distillation. 191 - Minh-Trieu Tran

, Guee-Sang Lee:
Occluded scene text detection via context-awareness from sketch-level image representations. 192 - Guillem Rodríguez Corominas, Maria J. Blesa, Christian Blum

:
SoftBinReduce: data reduction for color quantization through soft binning. 193 - Liyun Zhang, Ming Zhang

, Fei Fan, Yang Liu:
Mixed multi-scale residual attention networks for single image super-resolution reconstruction. 194 - Wenzao Li, Linsong Xiao

, Sai Yao, Chengyu Hou, Zhan Wen, Dehao Ren:
ED-YOLO: an object detection algorithm for drone imagery focusing on edge information and small object features. 195 - Ning Zhu, Shaofan Wang, Yanfeng Sun, Baocai Yin:

Uncertainty-guided recurrent prototype distillation for graph few-shot class-incremental learning. 196 - Xiaolong Zhu, Borui Cao, Weihang Zhang, Huiqi Li:

Adaptive multi-scale feature extraction and fusion network with deep supervision for retinal vessel segmentation. 197 - Pengfei Li, Huihuang Zhao, Mugang Lin, Qingyun Liu, Peng Tang, Yangfan Zhou:

Expressive talking face generation via audio visual control. 198 - Chenyu Yuan, Jing Zhang, Wensheng Li, Li Zhuo:

DR-YOLO: dual reconstructed YOLO for logo detection in livestreaming. 199 - Shuqin Chen, Zhixin Sun, Li Yang, Yikang Hu, Shifeng Wu, Yi Zhang:

Refined linguistic deliberation for video captioning via cascade transformer and LSTM. 200 - Rui Yang, Hui Zhang, Mulan Qiu, Min Wang:

MLSTIF: multi-level spatio-temporal and human-object interaction feature fusion network for spatio-temporal action detection. 201 - Chengming Han, Zikai Wu, Hongjuan Zhang, Anxue Dong:

Large-scale multi-view subspace clustering with latent centroid anchor guidance. 202 - Jingwen Cai

, Fen Xiao, Kehan Zhang, Xieping Gao:
Adaptive region assisted GAN for image steganography. 203 - Xiaoyan Jiang, Si-Yuan Lu, Yu-Dong Zhang:

SAM-LCA: a computationally efficient SAM-based model for tuberculosis detection in chest X-rays. 204 - Hongjie Jia, Tengteng Wang, Heping Song

:
Neighbor-relation aware low-rank multi-view subspace clustering. 205 - Yunze Liang, Halidanmu Abudukelimu, Jishang Chen, Abudoukelimu Abulizi, Wenqiang Guo:

MAML-XL: a symbolic music generation method based on meta-learning and Transformer-XL. 206 - Yongchao Liu:

Application of fusing Bézier curves and 3D models in VR stereo vision. 207 - Zhan Wang, Shucheng Huang, Qi Fan, Yifan Jiao, Mingxing Li:

A multi-scale network with multi-view correlation for vehicle re-identification. 208 - Chaocan Xue

, Bin Lin, Jinlei Zheng, Jiaqing Li, Quanxi Feng:
Robust correlation tracking with closed-loop feedback control. 209 - Shiyuan Guo, Jian Wang, Zhangquan Wang, Guiming Yu, Songyang Wu

:
BotICC: enhancing social bot detection through implicit connection computation. 210 - Jiexia Lin

, Xiaodong Zhu
:
Multi-granular dynamic interaction network for multimodal sarcasm detection. 211 - Ruoxuan Zhang, Dantong Ouyang, Ximing Li, Hongtao Bai, Chenming Zhang

, Lili He:
Learning multi-scale features automatically from food and ingredients. 212 - Huan Hu, Fengwen Liu, Nan Su

, Wenqiang Hu:
ECF-Net: lumber defect segmentation network with enhanced feature and content-aware fusion. 213 - Sergey Lavrushkin

, Maksim Khrebtov, Anastasia Antsiferova
, Georgii Bychkov
, Alexey Soloviev
, Dmitriy S. Vatolin
:
Stable VMAF: investigating VMAF's vulnerabilities to adversarial attacks. 214 - Lixia Xue, ZiQian Jin, Ronggui Wang, Juan Yang:

BMFNet: Bidirectional Multimodal Fusion Network for image captioning. 215 - Junjie Ye, Wenxiao Zhang, Xun Yang:

Advancing crowd counting accuracy in diverse environments via comprehensive domain alignment strategies. 216 - Zhong Guan, Yongli Hu, Huajie Jiang, Yanfeng Sun, Baocai Yin:

Multi-view Isolated sign language recognition based on cross-view and multi-level transformer. 217 - Yu-Guang Yang, Wen Cheng, Guang-Bao Xu, Dong-Huan Jiang, Yi-Hua Zhou, Wei-Min Shi, Dong-Hua Jiang:

A verifiable variable threshold visual image secret sharing scheme. 218 - Xinwang Xiao, Huihuang Zhao, Yuhang Li, Peng Tang, Yue Deng:

TSGFormer: temporal-aware network and spatial encoding GCN for three-dimensional human pose estimation. 219 - Zhengjin Zhang

, Nannan Li, Wenmin Wang, Huiwen Guo:
Causalseg: investigating causality modeling for semi-supervised video object segmentation. 220 - Yukun Xiao, Long Yu, Shengwei Tian, Shirong Yu, Dezhi Zhang, Xiaojing Kang, Weidong Wu, Rui Lu:

Msfusenet: a multi-stage information fusion network for multi-modal skin lesion diagnosis. 221 - Xuecun Yang, Jiayu Li, Qingyun Zhang, Yixiang Wang, Zhonghua Dong, Gaoting Zhu:

TMU-GAN: a compliance detection algorithm for protective equipment in power operations. 222 - Jianwu Long, Qi Luo, Chen Zhang:

Image smoothing algorithm based on texture intensity adaptation and edge consistency. 223 - Donghui Wang, Jinhua Wang, Ning He, Jingzun Zhang, Sen Zhang, Shuai Liu:

Image shadow removal algorithm based on MaskGuideAttention and mask-aware enhancement module. 224 - Chunyu Lu, Tianran Chen, Duo Shang, Jun Luo, Xin Hui, Ruhui Shi:

Encoder-decoder with bilateral gated fusion for multimodal relation extraction. 225 - Lei Wang, Changming Zhu:

Global semantic space feature fusion for multi-view clustering. 226 - Yongxi Li, Wenzhong Tang, Lvhong Xiong, Shuai Wang, Haoming Wang, Xi Zhu:

Cross-modality geometry-guided historical momentum learning for coupled noisy visible-infrared re-identification. 227 - Duokui He, Zhongjun Tang, Qianqian Chen, Yiran Wang, Yingtong Lu:

Two-stage dynamic topic modeling approach for identifying consumer demands of animated series. 228 - Jianqiu Li, Wenzhu Yan

, Yanmeng Li:
Multiple kernel subspace representation and graph construction learning for multi-view clustering. 229 - Jinqiang Yan, Yinghao Zhang, Jiamin Hu, Haiyuan Cui, Jieru Chi, Guowei Yang, Chenglizhao Chen, Teng Yu:

Prior-based bi-encoder transformer for underwater image enhancement. 230 - Liju Han, Changming Zhu:

DRMFE: optimizing incomplete multi-view clustering through dual recovery and multi-scale feature enhancement. 231 - Yi Chen, Chong Wang, Zhehao Li, Sunqi Lin, Jinhui Xiang, Yuqi Li, Jiangbo Qian:

Enhancing open-vocabulary object detection through region-word and region-vision matching. 232 - Jie Zhu, Jingjing Fan, Jianguang Zhao, Shufang Wu, Jianan Liu:

Text semantic structure-guided correlation learning for cross-modal retrieval. 233 - Weilong Li

, Qiang Zhang, Lei Zhang, Xiao-Yuan Jing:
GLGRAN: next POI recommendation on global-local graph representation with attention network. 234 - Zhengfang Jiang, Haipeng Chen, Yongping Yang, Xianzhu Liu, Yingda Lyu:

Rethinking Polyp Segmentation from the Perspectives of Matching Views and Seeking Camouflage. 235 - Zhihan Wang, Huiqian Du, Min Xie:

HCDet: hidden X-ray contraband detection based on HyAtt-CNN and local implicit feature pyramid network. 236 - Liyun Dou, Jiaqing Qiu, Meng Chen, Jin Wang:

MLA-net: Multi-layer Attention Network for Arbitrary Style Transfer. 237 - Sachin Sakthi Kuppusami Sakthivel

, Young Hoon Joo, Jae Hoon Jeong:
Learning disruptor-aware channel selection and reliability with target regularization for robust visual tracking. 238 - Shaowu Zhang, Pengyuan Du, Xijun Cui, Hongfei Lin, Liang Yang:

Fine-grained sentiment analysis based on cross-modal information translation. 239 - Youzhi Zhang, Lifei Wan, Yiren Chen, Xiaofei Zhou, Xiaolin Zhang, Deyang Liu:

Deep hierarchical network for full-reference omnidirectional image quality assessment. 240 - Jing Dong, Junzhuo Zhang, Ben Xie, Jie Zhang, Chang Liu, Wei Cheng

:
ST-GRU: spatiotemporal gated recurrent unit for video prediction. 241 - Son T. Luu

, Trung Vo
, Le-Minh Nguyen:
MCVE: multimodal claim verification and explanation framework for fact-checking system. 242 - Sijia Wang, Yun Ge, Qiyang Liu, Yan Zeng:

Automatic Weight Allocation: optimizing remote sensing image retrieval from contrastive learning perspective. 243 - Ruhan He, Ruixue Liu, Tao Peng, Xinrong Hu:

CST: a melody generation method based on ChatGPT and Structure Transformer. 244 - Yong Meng, Suting Chen, Xinyu Lu, Wenliang Xu, Zhenxing Shi, Xuefen Zhou:

SEGMTM: a spectrum prediction method based on enhanced graph convolution and multi-scale time decomposition. 245 - Yuguang Shi, Sifan Zhou

, Wei Wang, Xiaobo Lu:
Depth-free view synthesis from diffusion models for monocular 3D detector in autonomous driving. 246 - Wei Chen, Xiaogang Wei:

Spatial interpolation of head-related transfer functions using a physics-informed autoencoder. 247 - Ezequiel Perez-Zarate, Oscar Ramos-Soto

, Chunxiao Liu, Diego Oliva, Marco Pérez-Cisneros:
ALEN: a dual-approach for uniform and non-uniform low-light image enhancement. 248 - Xu Liu, Chenhua Liu, Xianye Zhou, Guodong Fan:

Enhancing low-light object detection with En-YOLO: leveraging dual attention and implicit feature learning. 249 - Benxue Sun, Mingxuan Chen, Liming Hu, Anjie Wang, Zhijun Fang:

MSCC-RetNet: a multi-scale color corrected retinex network for underwater image enhancement. 250 - Liyue Ge, Congxuan Zhang

, Zhen Chen, Ke Lu, Cheng Feng:
SRSA-Depth: shape and region similarity awareness for outdoor monocular depth estimation. 251 - Chenyuan Zhao

, Yu Zhu, Qingsen Yan, Jinqiu Sun, Axi Niu, Yanning Zhang:
Modeling optical imaging pipeline and learning contrastive-based representation for hybrid-corrupted image restoration. 252 - Rizwan Abbas

, Björn W. Schuller, Xuewei Li, Chi Lin, Xi Li:
Emotion recognition in live broadcasting: a multimodal deep learning framework. 253 - Carlos Marín-Lora, Miguel Chover:

Correction: GameScript: a simplified scripting language for video game development. 254 - Dan Wang, Jin Wang, Yunhui Shi, Baocai Yin, Nam Ling:

Collaborative point cloud geometry compression for both human vision and machine vision. 255 - Jingjing Wang, Junyong Ye, Xinyuan Liu, Youwei Li, Guangyi Xu, Chaoming Zheng:

MLKD-CLIP: Multi-layer Feature Knowledge Distillation of CLIP for Open-vocabulary Action Recognition. 256 - Zhengyang Lu, Qian Xia, Weifan Wang, Feng Wang:

CLIP-aware domain-adaptive super-resolution. 257 - Xueyu Yu, Yong Liu, Hao Hu, Xinzhi Li, Mingdi Bo, Dong Zhang, Zijun Zhou:

DMFI-YOLO: dynamic multi-scale feature interaction for enhanced underwater object detection based on YOLO. 258 - Zhongwei Lin, Yanmin Luo, Wanyuan Gong, Huabiao Zhou, Liuge Li:

A distribution-aware 2D multi-person pose estimation method with attention mechanisms. 259 - Tsung-Han Tsai, Chin-Wei Hsu:

An SoC-based CNN accelerator for face recognition using HWCK data scheduling. 260 - Zhigang Zhou, Long-Zheng Dai, Zeng-Liang Bai, Yiyou Dong:

Prometheus: an efficient federated collaborative learning framework for coevolution of edge-cloud heterogeneous models. 261 - Haoran Liu, Zijian Sun, Haibin Li, Yaqian Li, Wenming Zhang, Tao Song:

EAVFormer: an end-to-end audio and visual emotion recognition network based on transformers. 262 - Jie Wang, Yue Yu, Jietao Cheng, Jun Li, Jun Tang:

PillarBAPI: enhancing pillar-based 3D object detection through attentive pseudo-image feature extraction. 263 - Stephen Ekaputra Limantoro

, Jhe-Hao Lin, Chih-Yu Wang, Yi-Lung Tsai, Hong-Han Shuai, Ching-Chun Huang, Wen-Huang Cheng:
Swapped logit distillation via bi-level teacher alignment. 264 - Chunyu Du, Baodi Liu, Yanjiang Wang:

Target data guided few-shot remote sensing scene classification in reproducing Hilbert kernel space. 265 - Yvon Apedo

, Huanjie Tao:
A weakly supervised pavement crack segmentation based on adversarial learning and transformers. 266 - Dangguo Shao, Gaoan Huang, Lei Ma, Yuxin Wu, Kaiqiang Tang, Sanli Yi, Jingtao Li, Nuoyun Duan, Chunyun Pu:

LMR-IPGN: An Effective Model for automatic summarization of Chinese long text. 267 - Thitirat Siriborvornratanakul, Songpol Bunyang:

Optimizing low-resource language encoders for text-to-image generation: a case study on Thai. 268 - Qing Pan, Zuqing Huang, Nili Tian:

Hierarchical triple-branch network for camouflaged object detection via progressive feature refinement. 269
Volume 31, Number 4, August 2025
- Main Uddin, Zhangjie Fu, Xiang Zhang, Abu Bakor Hayat Arnob:

Spatial and frequency feature fusion using multi-scale cross attention for enhancing deepfake face detection. 270 - Hong Xia, Yifan Zhang, Hui Jia, Yanping Chen, Jing Xu, Shiyong Li:

A medical visual question-answering model based on multi-scale feature fusion and question Feature enhancement. 271 - Mingwen Shao, Yuanyuan Liu, Lingzhuang Meng, Xun Shao:

Meta-prompt tuning for low-resource visual question answering. 272 - Dongbo Huang, Hui Wang, Yuqian Zhao, Feifei Guo, Fan Zhang, Pei Chen, Chunhua Yang, Weihua Gui:

Weakly supervised free-space segmentation by fusing spatial priors and region features for auto-driving. 273 - Dangguo Shao, Gaoan Huang, Lei Ma, Yuxin Wu, Kaiqiang Tang, Sanli Yi, Jingtao Li, Nuoyun Duan, Chunyun Pu:

Correction: LMR-IPGN: An Effective Model for automatic summarization of Chinese text. 274 - Muhammad Usman

, Xiaodi Hou, Yi Guo, Zonglin Liang, Yijia Zhang:
IMGEF: integrated multimodal graph-enhanced framework for radiology report generation. 275 - Feng Wang, Liju Yin, Yiming Qin, Xiaoning Gao, Hui Zhou, Yulin Deng:

OA-iTNet: object attention inverted transformer network for low-light-level image denoising. 276 - Lei Liu, Li Guo:

Enhancing robustness through domain-generalized semi-supervised learning under limited sample label scarcity. 277 - Yixin Wang, Xujian Zhao, Chuanpeng Deng, Yao Xiao, Haoxin Ruan, Peiquan Jin, Xuebo Cai:

A survey on music emotion recognition using learning models. 278 - Payal Kadam, Deepali Vora:

Systematic frame selection and quality assessment for efficient video summarization. 279 - Houqin Bian, Qifei Chen, Haolin Zhang, Lunming Qin, Liang Xue, Haoyang Cui, Xi Wang:

MambaLF: an efficient local feature extraction and matching with state space model. 280 - Shuze Geng, Yifan Liu, Gang Yan, Haowei Wang, Pengfei Zhao, Wenjie Xia:

Token recombination based shallow-deep feature fusion for occluded person re-identification. 281 - Debin Wang, Turdi Tohti, Dongfang Han, Zicheng Zuo

, Yi Liang, Yuanyuan Liao, Qingwen Yang:
Vef-BART: an effective method to mitigate hallucinations through vision enhancement and fusion in BART-based multimodal abstractive summarization. 282 - Dan Xu, Wenqian Xu, Yang Zhou, Xin Shu, Qiang Qian:

Adaptive color-corrected multicolor space enhancement network for underwater image enhancement. 283 - Bin Ge, Xiaolong Peng, Chenxing Xia, Hailong Chen:

Camouflaged object detection with integrated feature fusion and boundary optimization. 284 - Tao Gao, Jiangshan Feng, Xiaoqun Wu, Haisheng Li, Xiaochuan Wang:

Unsupervised arbitrary-scale point cloud upsampling by learning neural gradient function. 285 - Daoping Du, Lanlan Pan, Ye Liang, Honghao Yang, Xinyu Sui, Xiang Li:

Underwater image restoration based on a dual-branch super-resolution residual network. 286 - Muwei Jian, Yanjie Zhong, Haoran Zhang, Xiaoguang Li, Hui Yu:

GLMF-NET: global and local multi-scale fusion network for polyp segmentation. 287 - Xiaofei Zhang, Xiaoguang Di, Maozheng Liu:

DBLDNet: dual branch low light object detector based on feature localization and multi-scale feature enhancement. 288 - Fei Pan, Lianyu Zhao, Chenglin Wang

, Xianfeng Wang, Guangxin Ren:
Efficient local-global feature fusion transformer for siamese object tracking. 289 - Juheon Hwang, Taewan Kim, Heeseok Oh, Jiwoo Kang:

Convolutional neural shading for high-quality 3D reconstruction from multi-view images. 290 - Ming Yuan, Hao Meng, Tianhao Yan, Junbao Wu:

Fa YOLO: fog-aware instance segmentation of ships with feature refinement in foggy scenes. 291 - Huan Lei, Ze Wu, Lei Shang, Hong Zhao, Wenyuan Yang:

TCF-DETR: multi-scale token-channel fusion transformer for enhanced small object detection. 292 - Bo Sun, Turdi Tohti, Dongfang Han, Yi Liang

, Zicheng Zuo
, Yuanyuan Liao, Qingwen Yang:
Advancing Chinese travel sentiment analysis: a novel dataset and DFRAN approach for missing modalities. 293 - Meng Lyu, Yifei Li, Xiong Chen:

KL-CLIP: a K-means learning model for zero-shot anomaly segmentation. 294 - Zhigang Shi

, Zhongyi Huang, Zhiming Fang, Feng Tang:
Adjacent memory segmentation networks for robust visual tracking. 295 - Taewan Kim, Jiwoo Kang:

Face and voice cross-modal association with learning convex feature embedding. 296 - Zhihua Gan, Weihong Han, Zhongxiang Xie, Bo Zhang, Xiuli Chai:

CCM-Net: image splicing localization network based on context-aware and cross-domain multi-scale fusion. 297 - Yongfeng Qi, Hongli Xie, Yajuan Gao, Yuanzhe Lin, Heng Zhang, Haixi Han:

Generalizable face forgery detection based on adaptive spatial-frequency information mining. 298 - Ai Jian, Yun Wei:

Parameter-efficient transfer learning of prompts and adapters on vision-language models. 299 - Yajie Gu, Mingjie Wang, Jianhou Gan, Yiming Zhao, Jiatian Mei, Chuanzhi Zhang:

Text semantic-guided adaptive feature aggregation for image-text retrieval. 300 - Xuanchi Gong, Ziyang Xue, Tengjun Liu, Yubao Sun, Yifan Zhang:

Enhancing semantics consistency via hybrid attention fusion in multimodal sentiment analysis of short videos. 301 - Guoqiang Dang, Li Liu, Dongmei Liu, Fucheng Cao:

Integrating global signals with fine-grained consistency for conditional image generation. 302 - Weiran Chen

, Guiqian Zhu, Ying Li, Yi Ji, Chunping Liu:
SiamHCC: a novel siamese network for quality evaluation of handwritten Chinese characters. 303 - Yingmei Zhang

, Sungchan Kim, Hyo Jong Lee:
ASCNet: attention-scale sparse cascade network for multimodal infrared and visible image fusion. 304 - Fudong Nian, Weijie Lu, Jun Wang, Chengqian Li, Yun Fu, Zhize Wu:

Visual-language collaborative multimodal transformer network for group activity detection in surveillance videos. 305 - Abeer Ayoub, Walid El-Shafai

, Fathi E. Abd El-Samie
, Ehab K. I. Hamad
, El-Sayed M. El-Rabaie:
Quality enhancement of near-infrared and visible videos using an optimized dehazing technique. 306 - Zhongyuan Chen, Chong Lu, Yihan Wang:

Self-attention mechanism prior to modality fusion for multimodal sentiment analysis. 307 - Dinesh Elayaperumal, Sachin Sakthi Kuppusami Sakthivel, Young Hoon Joo, Jae Hoon Jeong:

Learning sparse spatial attribute-aware correlation filter tracking via rank-based surrounding strategy. 308 - Padmaja Vudayagiri, Rajeswari Sana, Hanaa A. Abdallah, Neha Agarwal, Saurabh Agarwal:

Synthetic shadows: the interplay of forensic detection and anti-forensic techniques in GAN-generated images. 309 - Zhengkai Yang, Shuyan Xiao, Weige Tao, Lingjiao Pan, Yizhuang Miao, Wenli Yu:

Mpv-pcqa: multimodal no-reference point cloud quality assessment via point cloud and captured dynamic video. 310 - Qiyu Deng, Yu Chen, Chuwei Cheng, Junhong Xiao, Ming Tao, Xiaozhao Fang:

Asymmetric semantic preserving hashing for cross-modal retrieval. 311 - Sharifah Mousli, Sona Taheri, Estrid He:

ConASD: Contrastive Few Shot Learning for Detecting Autism Spectrum Disorder via Eye Tracking Scanpath. 312 - Xueqiang Lyu, Zihe Tian, Xingqiang Zhao, Jing Han, Zangtai Cai, Yuzhong Chen:

Multi-modal semi-supervised semantic segmentation for indoor scenes via adaptive CutMix and contrastive learning. 313 - Ahmad Naeem, Hassaan Malik, Mui-Zzud-Din, Abolghasem Sadeghi-Niaraki, Daesik Jeong, Rizwan Ali Naqvi:

SkinDWNet: a novel deep learning model for multiclass classification of skin cancers using dermoscopic images. 314 - Lazarus Kwao, Jing Ma, Sophyani Banaamwini Yussif, Wisdom Xornam Ativi, Ben Beklisi Kwame Ayawli:

Tb-mmrd: transformer-based multi-modal election rumor detection with agreement-aware gating and semantic fusion. 315 - Beike Yu, Dafang Wang, Jiang Cao, Pengyu Zhu, Yifei Zhao:

Vehiclesim: realistic and 3D-aware video editing with one image for autonomous driving. 316
Volume 31, Number 5, October 2025
- Muhammad Faisal Abrar

, Ali Alferaidi, Tariq S. Almurayziq, Muhammad Saqib
, Raza Uddin, Wilayat Khan, Jawad Khan, Mohammad Alsaffar
:
A dual-modal analysis of credibility in integrating interpretive structural modeling (ISM) and BERT for enhanced fake news detection. 317 - Ziyang Wu, Yin Lin, Qidong Huang, Wengang Zhou, Houqiang Li:

Multi-scale count-task guided feature enhancement face detection. 318 - Wenwen He, Yi Zhang, Zhiyuan Liu, Yalan Ye, Qinghua Ren, Yongzhao Zhan:

Unsupervised subdomain adaptation framework guided by pseudo label for cross-subject and cross-session EEG emotion recognition. 319 - Zhengan Lu, Zhuang Zhou, Shuobin Wei, Zizhao Yuan, Binghua Su:

Spcformer: spatial perception correction transformer for semantic segmentation of scene parsing. 320 - Chunting Wang, Xini Ding, Xuan Zhao, Huiliang Shang, Lin Gu

, Miao Wang:
Adaptive frequency-aware network for action quality assessment. 321 - Heng Zhang, Jiasong Ding, Qin Hang, Yonghong Huang, Nongsen Huang:

Enhancing object detection robustness through adversarial noise filtering with luma non-local means. 322 - Rui Jin, Yong Liao:

CAPTCHA farm detection and user authentication via mouse-trajectory similarity measurement. 323 - Shuaifang Wei, Xiaomin Yang, Gwanggil Jeon:

PDSRN: a progressive distillation network for generalizable single image super-resolution. 324 - Xinheng Wang, Lun Xie, Chiqin Li, Mengsheng Wang, Ziyang Liu, Xiaolan Peng, Zhiliang Wang:

A cross-modal fusion network based on dual attention mechanism for emotion recognition in conversation. 325 - Qing Li, Hao Zhai

, You Yang, Xiaoning Sun, Long Wang:
Multi-focus image fusion based on re-parameterized large kernel convolution and edge information fusion. 326 - Qiaoyun Zhang, Chih-Yung Chang, Christopher Chuang, Wen-Hwa Liao, Diptendu Sinha Roy:

Teaching authentic sign language through multiple representation learning. 327 - Jijun Wang, Yan Wu, Yujian Mo:

DSFusion: a dynamic dual-scale multimodal fusion framework for robust 3D object detection. 328 - Xiaoming Yang, Liming Yuan, Xianbin Wen:

Hfffap-net: unsupervised fundus image enhancement with high-frequency feature fusion and artifact processing. 329 - Md. Sajjatul Islam

, Yongsheng Sang, Adam A. Q. Mohammed
, Jiancheng Lv:
Facial micro-expression recognition from videos through domain adaptation and multi-modal spatio-temporal feature ensemble. 330 - Hua Huang, Qiaoli Qin:

Implicit and explicit knowledge enhanced cross-modal representation for image-text retrieval. 331 - Haitao Xiong, Junhong Ding, Yuchen Zhou, Yuanyuan Cai:

STAGVid2C: enhancing video-based commonsense captioning with spatio-temporal action graph. 332 - Yanbin Liu, Qin Shi, Ziming Zhu, Xiaofeng Ling, Yu Zhu

:
Dual attention transformer with adaptive frequency enhancement for real-world Chinese-English scene text image super-resolution. 333 - Jiarui Zhu, Jun Hou, Penghang Yu

, Zhiyi Tan, Bing-Kun Bao:
LD4MRec: simplifying and powering diffusion model for multimedia recommendation. 334 - Huanyu Zhu, Zhihao Shen

, Chengxiao Dai, Zhitao Yu:
Multimodal large language model enhancement network for multimodal sentiment analysis. 335 - Jiangtao Huang, Dong He, Wenming Cao, Jianqi Zhong:

Progressively deeper attention networks for 3D human motion prediction. 336 - Yinjie Chen, Wenyi Tang, Yunbo Rao, Hui Ding, Shuzhen Zhu, Yuanyuan Wang:

Big-LITTLE-Net: a dual-branch network for small UAV detection. 337 - Wei Yang, Shuai Wang, Jiaqi Wu, Wei Chen, Zijian Tian:

A self-supervised enhancement method for real world low-light images using Retinex and camera response function. 338 - Huimin Guo

, Yin Gu, Wu Du, Boyang Chen, Taiwei Jiao, Wei Qian, He Ma:
DilateMobileU-Net: an efficient hybrid segmentation model for polyp diagnoses. 339 - Shaoxin Qiu, Junhai Zhai, Jiankai Chen:

MPLR: a long-tailed recognition method based on visual language prompts. 340 - Juheon Hwang, Taewan Kim, Jiwoo Kang:

Collaborative feature aggregation for face super-resolution and robust re-identification. 341 - Xiebing Chen, Yue Wang, Bilian Chen:

Field-enhancing factorization machine for click-through rate prediction. 342 - Shijia Liu

, Yong Wang, Sen Li, Yuming Liu:
MADRL-based bitrate allocation for QoE fairness in 360° video streaming with viewport prediction. 343 - Lianghu Jing, Bo Wang:

Dual-domain aware network for salient object detection in low-light images. 344 - Yaguan Qian, Jiaqiang Sha, Bin Wang, Zhaoquan Gu, Yanchun Zhang:

Enhancing transferability of targeted adversarial examples through amplitude spectrum alignment. 345 - Hongying Zhang, Jiatian Tang:

SFFN-YOLO for small object detection in aerial images. 346 - Wenjie Li, Changming Zhu:

Cross-view attention with adversarial learning for incomplete multi-view clustering. 347 - Zhongmin Liu, Zhenhua Li, Wenjin Hu:

VAFTrack: asynchronous feature fusion via visual receptive weighted key-value perceptual for visual tracking. 348 - Qian Liu, Mengting Liu:

Semi-supervised dictionary learning based deep network for person search. 349 - Cheng Zeng, Mingying Zhu, Jing Liu, Chongri Liu, Ruolin Liang, Junxin Chen, Hang Lin:

A combine bit-wise and vector-wise interactive features network for CTR prediction. 350 - ZunWang Ke, Guosheng Wang, Yugui Zhang, YunLong Shi, Fengyu Guo, Yuelin Zou, Zhaofan Li, Run Guo, Ji-Sheng Zhou:

Semantic Segmentation Network combining Gaussian Perception and Iterative Multi-Scale Attention. 351 - Ruofan Feng, Jiwei Qin, Dezhi Sun, Weilin Tang, Xizhong Qin:

MTSMNet: a multi-scale trend-seasonal mixing network for long-term time series forecasting. 352 - Shuhuan Zhao

, Peijing Zhao, Zixin Hao, Shuaiqi Liu:
A dual-branch approach with multi-stage semantic integration and dual optical flow for micro-expression recognition. 353 - ZeYuan Niu, Ping Zhang, Chen Zhang, ZeLong Huang, Xin Zhang:

DHRA-UNet: a lightweight SLM powder-spreading defect image segmentation algorithm. 354 - Yanchao Li, Zhuowen Ouyang, Guanxiao Li:

Unseen-aware semi-supervised model for robust human activity recognition. 355 - Lizhi Zheng, Yao Fan, Zhiwei Zhao:

Thangka image segmentation based on detail enhancement and multi-scale edge guidance. 356 - Xingran Guo, Juanli Li

, Bo Li
, Rui Xia
, Tianyu Zhang:
On-line monitoring of structural performance of scraper conveyor driven by digital twin. 357 - Aiping Yang, Chenhui Yu, Jinbin Wang

, Zihao Wei, Jiale Cao, Liping Liu:
Cross-Scale Atomic Feature Enhanced Network for high-fidelity Single Image Super-Resolution. 358 - Chenglong Sun, Wenjie Li:

A two-stage interaction approach for enhancing generalization of deepfake detection. 359 - Yahui Deng, Guangrui Bai, Erbao Dong:

Low-light image enhancement based on adaptive enhancement matrix. 360 - Kai Jiang, Baoju Zhang, Bo Zhang, Cuiping Zhang, BoHua Chu, Yameng Zhang, Mengqi Xue:

An adaptive weight fusion low-light image enhancement based on HSV space. 361 - Guanjun Sheng, Yongzhen Ke, Shuai Yang, Kai Wang:

Aesblip2: generating image aesthetic caption via prompting. 362 - Xuzhong Hu, Zaipeng Duan, Pei An, Jun Zhang, Jie Ma:

Lidar-camera range-view fusion for 3D object detection in autonomous driving. 363 - Maryam Karimi

, Meysam Ghalyani, Seyede Fatemeh Noorani:
QUIQ: quality-sensitive features for no-reference underwater image quality assessment. 364 - José Manuel Alcalde-Llergo, Andrea Zingoni, Pilar Aparicio-Martínez, Sara Pinzi

, Enrique Yeguas-Bolivar:
Design and evaluation of a serious game in virtual reality to increase empathy towards students with phonological dyslexia. 365 - Xie Wei

, Haorui Wu, Liang Haoming, Langwen Zhang, Xiaoyuan Yu:
Simplifying complexity: a double-phase detection algorithm for defects of injection molded parts within the limited computer source. 366 - Xinyu Deng, Xianghai Hui:

Enhancing AIGC-driven creativity: a CreaNet-GAN approach for digital art colorization and animation. 367 - Xiaoyan Zhang, Ling Luo, Jingbo Xia, Xiangfei Dai:

DA-MVSNet:depth-aware multi-view stereo network for 3D reconstruction. 368 - Chao Li, Xin Li, Xiangkai Zhu, Qingtian Zeng, Hua Duan, Nengfu Xie:

Heterogeneous graph structure learning based on feature and topology information extraction. 369 - Shougang Ren, Yuchen Zhou, Xingjian Gu, Xin Shu, Xiangbo Shu:

Maximum dissimilarity channel complementary reconstruction for convolutional efficiency. 370 - Ying Zhou, Lei Chen, Tianhuan Huang, Ju Liu, Xianye Ben:

Dual complementarity transformer for micro-expression recognition. 371 - Xin Li

, Bingxin Xu, Hongzhe Liu, Weiguo Pan, Cheng Xu
:
Generalization-oriented face forgery detection via discriminative feature analysis and normalization. 372 - Junwei Zhou, Benyi Zhang, Shengping Wu, Lei Zhou

, Yanchao Yang, Jianwen Xiang:
Jpeg stereo image lossy recompression with mutual information enhancement. 373 - Hong Xia, Siyu Feng, Hui Jia, Yanping Chen:

EM-OFRP: enhanced memory-based optical flow reconstruction and variational prediction for video anomaly detection. 374 - Mingyi Sun, Zhuyuan He, Renyong Huang:

Enhancing graphic design through deep graph convolution and skill optimization: the DGC-RSA approach. 375 - Changpeng Ji, TianYu Tan, Wei Dai:

Multimodal sentiment analysis based on temporal perception and cross-modal interaction. 376 - Rui-Xiang Kan

, Mei Wang, Hongbing Qiu:
Kin-LeapK: an enhanced human-computer interaction system with improved AdaBoost visual and audio information recognition methods. 377 - Fucheng Cao, Dongmei Liu, Guoqiang Dang:

Feature loop consistency optimization for enhanced control precision in text-to-image generation. 378 - Miao Xiaorui:

Classification of Chinese Guzheng genres based on CNN with attention mechanism. 379 - Hengbo Ma, Longge Wang, Junyang Yu, Rui Zhai, Yalin Song, Han Li:

DSGAC: deep self-supervised global attention for attributed graph clustering. 380 - Wenjie Mei, Jiefu Mei:

Adp-clf: adaptive dual-perception contrastive learning for gastrointestinal endoscopic image classification. 381 - Juan Cai, Min Long, Le-Bing Zhang, Quantao Yao, Xiangling Ding:

A styleGAN-based face de-morphing network for restoring accomplice's facial image. 382 - Mingwen Shao, Wenjie Liu, Lingzhuang Meng, Huan Liu, Xiaodong Tan:

DiffRA: universal restorative adversarial attack based on diffusion model. 383 - Junjun Guo, Bo Xu, Hui Li:

A prompt-based dual-layer cross-modal distillation learning method for aspect-based sentiment analysis. 384 - Runbang Liu, Zhiyu Zhu, Huilin Ge

, Xingyue Du, Yongdong Shu, Qingshan Ji:
Infrared ship target detector based on forward and backward propagated polarization feature extraction module. 385 - Weixuan Gao, Nengbin Lv, Fuzhou Du:

AWBN-YOLO: a surface defect detection method for aero-engine blades in sample-limited scenarios. 386 - Shunjie Wang, Guoyong Cai, Guangrui Lv:

Adaptive graph interaction guided correlation and discriminant learning for aspect-based multimodal sentiment analysis. 387 - Akpedje Ingrid Hermilda C. F. Tossou, Gengkun Wu, Mingchen Wei, Letian Wang:

SEFC-Net: enhanced crack segmentation using attention mechanisms and channel prior convolutional attention in mining area. 388 - Xuan-Nam Cao

, Nhat-Tan Vo
, Minh-Triet Tran
:
Pmfs: Progressive mouth-to-face synthesis for realistic talking face generation. 389 - Yuqiang Li, Yiyi Ma, Xinyi Shen, Chun Liu:

CL-FGAN: curriculum learning-guided emotion recognition in conversation model based on frequency graph attention network. 390 - Di Wu, Mingyue Yan, Yao Chen:

CHCoT-MSLU: a coupled hierarchical chain-of-thought prompt learning model for multi-intent spoken language understanding. 391 - Yi Zhao, Jin Zhang, Jibing Gong, Jiquan Peng, Xindong Wu, Shishan Gong, Shuying Du:

Balancing global and local interests in cross-domain recommendation systems. 392 - Chunjian Su, Luhui Li, Hongen Wei, Hening Sun, Yongxu Chen, Daolong Zhang, Tinyi Din, Chenming Li:

Enhanced target recognition and localization using binocular vision and infrared thermal imaging. 393 - Zhangjian Ji, Donglin Cheng, Kai Feng:

Exploring stronger transformer representation learning for occluded person re-identification. 394 - Qinglong Xu, Haixing Zhu, Yuan Wang, Zhongjie Shi, Weipeng Liu:

Cross-teaching with dual uncertainty awareness for semi-supervised medical image segmentation. 395 - Xiang Lu, Yue Feng, Xudong Jia, Tao Chen:

Multi-label classification of tongue images using label semantic embedding and dual-branch network. 396 - Senbao Zhang, Shucheng Huang, Li Pengyi, Mingxing Li:

A Multimodal framework for 3D few-shot class-incremental learning. 397 - Zuhe Li, Hongyang Chen, Fengqin Wang, Gang Xu, Qidong Liu, Yushan Pan:

Efficient 3D human pose estimation via spatio-temporal graph transformer with token pruning. 398 - Yongheng Zhang, Liwei Chen, Shigang Wang, Yan Zhao, Jian Wei:

PGAF-Net: an adaptive fusion network with polarization-guided hybrid attention for dual-polarized SAR ship classification. 399 - Danyang Cao, Cheng Cheng, Guanmin Zhang:

Compact and efficient language modeling for classical poetry: generation and interpretation. 400 - Homa Omarzadeh

, Monireh Hosseini:
Detecting offensive language on instagram with a combined approach of the Gray Wolf algorithm and deep learning networks. 401 - Emre Akdemir

, Necaattin Barisçi, M. Ali Akcayol, Nurettin Dogan
:
Selecting generated synthetic features using clustering algorithm for generalized zero-shot learning. 402 - Xinrong Wu, Zhiming Shi:

A comprehensive features representation for no-reference image quality assessment. 403
Volume 31, Number 6, December 2025
- Lidong Wang, Tao Huang, Yin Zhang, Kang An, Jie Yuan:

Predicting retweets using social trust-aware graph neural network approach. 404 - Jingjing Bi, Zonghao Tang:

Application of superstar learning platform in the teaching of medical english listening. 405 - Guanzheng Jiang, Changming Zhu:

Similarity-guided contrastive learning for deep multi-view clustering. 406 - Nuoya Li, Weiguo Pan, Bingxin Xu, Hongzhe Liu, Songyin Dai, Cheng Xu

:
Ihenet: an illumination invariant hierarchical feature enhancement network for low-light object detection. 407 - Jianming Liu, Huihua Wang, Zecen He:

Research on underwater image enhancement algorithm based on classification adaptive color correction and dual parallel branch optimization network. 408 - Fujiao Ju, Shuhan Zhao, Shaotao Zhu:

Enhanced pneumonia lesion segmentation using a hybrid CNN-BiFormer network with residual haar wavelet downsampling and shared attention. 409 - Liwei Deng

, Boda Wu, Jiandong Wang:
A multi-label classification method combined with texture enhancement for deepfake face detection. 410 - Yanxia Liang, Huanhuan Zhang, Xin Liu, Xiaopeng Yang, Fuping Wang, Jing Jiang:

Enhancing cross-modal voice-face association with heterogeneous hashing network. 411 - Junho Kim, Sangjin Lee, Jungheum Park:

An in-depth forensic examination of video files edited by Apple Photos. 412 - Haifeng Zhao, Qinghua Ling, Wenhai Qin, Leilei Ma, Dengdi Sun:

Dual-level semantic alignment for video moment retrieval and highlight detection. 413 - Jialin Wu, Tao Yang, Jintao Meng:

Research and application of sign language recognition and target tracking model based on YOLO-Mamba. 414 - Haoxuan Wu

, Lai-Man Po, Yuyang Liu
, Wing-Yin Yu, Tianqi Zhang, Zeyu Jiang
, Kun Li
:
Multi-SBoRA: regional and non-overlapping weight updates for multi-concept customization of diffusion models. 415 - Aiying Guo, Zijun Deng, Jingjing Liu:

Single-image super-resolution via lightweight shuffle feature fusion network. 416 - Qiao Ma, Yingbo Jia, Haixin Gong, Ruize Guo, Yu Cao, Zhengtang Li, Xie Han, Liqun Kuang, Fengguang Xiong:

A novel iterative deformable joint attention network for remote sensing image change detection. 417 - Yiqing He, Zefeng Zheng

, Zhuowei Wang
, Hanwei Wu, Yunyun Zhang, Lianglun Cheng:
Wavelet guided real time detection transformer with sparse attention. 418 - Jing Wang, Mingyu Shi, Junyan Fan, Yanzhu Zhang, Ruiping Wang:

KECAN: knowledge-enhanced cross-modal alignment network for ophthalmic report generation. 419 - Abed Heshmati

, Mohsen Afsharchi, Sajad Ahmadian, Majid Meghdadi:
SiGR: a novel sign-aware graph neural network for recommender systems. 420 - Chuwei Cheng, Yu Chen, Tianle Hu, Qiyu Deng, Sixian Chan, Xiaozhao Fang:

Label enhancement hashing induced by class prototypes for domain adaptive retrieval. 421 - Yong Li, Zhenguo Yang, Lap-Kei Lee, Fu Lee Wang, Yingying Qu, Tianyong Hao:

Inference enhanced model with answer refinement for medical visual question answering. 422 - Ying Wang, Zhao Yang, Yanxiang Zhao, Ablameyko Sergey, Fang Zuo:

Fa-yolo: multi-scale feature fusion for spectral image object detection in complex scenes. 423 - Guanqun Guo, Li Zhang, Boqiang Jia, Jiayu Zhang, Wenjie Wang:

Tool-YOLO: a target detection network based on feature extraction and feature fusion. 424 - Shuzhi Su, Yang Xu, Yanmin Zhu, Chao Wang:

Detecting unknown objects in open world via open world objectness score and distance-sensitive NMS. 425 - Lichao Su, Liming Huang, Shiyan Tu:

Dual-stream progressive neural network based on cross fusion in image manipulation localization. 426 - Zhenyao Li, Jie Jin

, Daobing Zhang, Chaoyang Chen:
Design and realization of pulse-controlled multi-memristor Hopfield neural networks and their applications in information encryption. 427 - Imad Tbaileh

, Selami Bagriyanik
:
Visual quality assessment of E-commerce product images using convolutional neural networks. 428 - Hongyuan Jing

, Hui Zhang, Mengmeng Zhang, Qiyu Rong
:
GL-MambaNet: Mamba-based global and local feature fusion for image dehazing. 429 - Juan Yang, Anbo Liu, Ronggui Wang, Lixia Xue:

Dual-stage pixel transformer with enhanced visual context for image captioning. 430 - Xianchen Wang, Can Pei, Jianbiao He, Zhiwei Lu:

Dynamic scale-aware vehicle re-identification via optimized YOLO-BFP and RIoU metric learning. 431 - TianYi Yu, Shayan Nejadshamsi:

A new method for attributed graph clustering with dual-manifold orthogonal matrix learning. 432 - Kai Xu, Lichun Wang, Shuang Li, Jianjia Xin, Baocai Yin:

Bcgn: BLIP-based cross-modal grasping network for language-conditioned robotic grasping. 433 - Xiuchuan Cheng, Kangning Yin, Zhen Ding

, Guisong Liu, Zhiguo Wang:
Continual adaptation Person re-identification via vision-language fusion with enhanced annotation robustness. 434 - Yankui Xu, Lina Cheng:

Face emotion recognition based on Gabor wavelet and particle swarm optimization algorithm. 435 - Zhixin Li, Yunfeng Dong, Xiaoming Wu, Xiangzhi Liu:

SD-HRNet: a lightweight high-resolution network for human pose estimation based on spatial decoupling. 436 - Jiaxiang Zheng

, Moxi Cao, Chongbin Zhang:
AI-driven generation of guzheng music from classical Chinese poetry: toward a new paradigm of creative practice in Chinese traditional Music. 437 - Ye Zhao, Yong Zhu, Yuxi Gong, Xueliang Liu, Liangfeng Xu:

CBLC-SOOD: contrastive background and label correction for semi-supervised oriented object detection. 438 - Kun Qu, Man Zhang, Yang Yang, Hao Xue, Xiang-Jun Shen:

An extremely fast deep spectral clustering method in Fourier domain for large-scale data. 439 - Zichen Wang, Kaixi Wang, Xiaozhu Jia, Xinchun Cui:

A multi-scale blending steganalysis model based on interactive feature extraction. 440 - Jie Wang, Dianlong Fang, Wenjun Hu:

Ppca: precise perturbation and feature approximation for enhanced black-box attacks in remote sensing image classification. 441 - Wei Zhao, Zhuoran Tang, Qihan Yang:

Speech-driven talking face video generation. 442 - Yang Su

, Shunquan Tan
, Yunqiao Zhang, Jiwu Huang
:
Universal forged image detection and localization via self-supervised data generation and large-scale model adaptation. 443 - Jingmin Yang, Hongbin Zhang, Wenjie Zhang, Jinghui Ren:

Awcf-yolo11: hierarchical attention fusion and adaptive channel refinement for object detection in remote sensing imagery. 444 - Congying Wu, Xiaofan Song, Zan Li, Xiaoxu Wang, Chen Chen, Jihua Huang:

Scheduling periodic queries in multi-channel on-demand broadcasting environments. 445 - Xingpeng Zhang

, Peng Guo, Qiuli Wang
, Kaixin Wang, Sijing Wu, Jing Xu, Yang Yu:
Hybrid attention multi-scale feature aggregation for efficient nuclei segmentation and classification in H&E-stained images. 446 - Hongkun Zhao, Siyuan Liu, Yang Chen, Fanmin Kong, Qingtian Zeng, Kang Li:

GCIF: graph based cross-modal information fusion for conversational emotion recognition. 447 - Ke Han, Long Jin, Junpeng Yang, Zongwang Lv:

Multi-branch attention feature fusion network for person re-identification. 448 - Donghyun Han, Byoung-Dai Lee:

Shadow feature refinement network: progressive feature refinement based on knowledge distillation for effective shadow removal. 449 - Tao Liu, JianLong Hu, MengYu Zhao:

Ecca-unet: edge-aware and channel-enhanced cross-attention network for medical image segmentation. 450 - Jun Wang, Youzhou Wu, Baodi Liu, Wenzheng Wang, Haoran Xu, Keding Wang:

Fourier aids CNN and transformer for semantic segmentation of remote sensing images. 451 - Xiaoyu Liu, Yue Zhang, Qin Wang, Zhenglin Li, Cor Ke Xu:

Attention-guided few-shot learning for metal surface defect classification. 452 - Yijun Cao, Hongjiao Li, Botao Zhang, Ning Xue, Hongliang Yin, Pu Chen:

Personalized federated learning via multifaceted feature matching and element-wise classifier fusion. 453 - Mingjun Xi, Pingshan Liu, Seshu Yu:

Temporal information-aware multimodal learning network for user-generated video popularity prediction. 454 - Amirhossein Tahmouresi, Sogand Basirian, Amirhossein Javanshir, YoungJin Cha:

Enhanced prediction of persistent earthquake-induced groundwater level changes with advanced feature engineering and machine learning. 455 - Hongyuan Lu, Huiqian Du, Min Xie:

MCLSC-Fusion: a multi-scale cross-modality long-short connection fusion network for infrared and visible images. 456 - Mengjun Miao, Heming Huang, Feipeng Da:

Wavelet-guided spatial-frequency transformer with physics-based refinement for remote sensing image dehazing. 457 - Cheng Xian, Xiuyuan Li, Mingyong Li:

Fine-tuning CLIP for difference-guided composed image retrieval. 458 - Wuyuan Ye, Zhengdong Luo, Mengcheng Chen:

Calibrating feature representations for few-shot image recognition via vicinal mixup. 459 - Jiangpeng Li, Yan Niu:

Parallax-robust correlation volume for optical flow computation neural networks. 460 - Yongchao Qiao, Ya'nan Guan, Qihan He, Zhongxu Li, Jingmin Yang, Wenyuan Yang:

Slfmamba:a state space based vision foundation models fine-tuning for domain generalized semantic segmentations. 461 - Muhammad Usman, Ziwei Ma, Yijia Zhang:

Mwcl: Memory-driven and mapping alignment with weighted contrastive learning for radiology reports. 462 - Jian Jiang, Yan Tian, Yongchuan Xu, Zhaocheng Xu, Xun Wang:

Sdreplay: diffusion model for continual semantic segmentation in traffic scenarios. 463 - Jiajun Wu, Zhiwei Liang, Songhao Zhu:

Unsupervised cross-domain pedestrian re-identification via squeeze-excitation attention and latent feature mining. 464 - Feng Chen, Wentao Chen, Xiang Liu, Jin Hu, Yinlong Yuan, Yun Cheng, Liang Hua:

A human-machine hybrid intelligence method based on causal representation for solving non-independent and identically distributed problems. 465 - Jing Dong, Jinxiong Fan, Junzhuo Zhang, Chang Liu, Wei Cheng

:
TA-LSTM: Temporal Attention LSTM for spatiotemporal weather prediction. 466 - Rongrong Jia, Shiqiang Du, Wei Dang, Huaikun Zhang, Jizhao Liu, Jing Lian:

St-diffnet: Diffusion-based inpainting of dunhuang murals with structural and textural guidance. 467 - Zhuoyang Xia, Meng Jian, Zihan Liu, Yulong Bai, Lifang Wu, Shaona Wang:

Mitigating long-tail bias in recommendations via graph diffusion. 468 - Muhammad Faisal Abrar, Ali Alferaidi, Tariq S. Almurayziq, Muhammad Saqib, Raza Uddin, Wilayat Khan, Jawad Khan, Mohammad Salih Alsaffar:

Correction: A dual-modal analysis of credibility in integrating interpretive structural modeling (ISM) and BERT for enhanced fake news detection. 469 - Xin Deng, Zheng Xu, Wenzhu Yang:

Unidirectional guidance network for enhanced small object detection in UAV imagery. 470 - Xiaobin Li, Weiwei Zhang, Maohai Pang, Jianqing Zhu:

MDIKD: multi-dimensional integration knowledge distillation. 471 - Tianqi Liu, Gaoyun An, Zhaoqilin Yang, Xingyu Ren, Qiuqi Ruan:

EIRA: an explicit-implicit representation alignment for multimodal relation extraction. 472 - Baiting Zhao, Yingying Shang, Xiaofen Jia, Zhenhuan Liang, Rui Hu:

DAFMixerSR: a lightweight fusion-enhanced adaptive perception network for image super-resolution. 473 - Hongwei Chen, Jianpeng Wang, Yuan Zhu, Liya Xi, Chang Ma:

Lista-net: a lightweight spatiotemporal adaptive network for skeleton-based action recognition. 474 - Zejiang Xu, Yu Chen, Yuanyuan Liu, Xiaozhao Fang, Han Na, Weijun Sun, Yonghui Huang:

Structure center fusion and guidance learning for domain adaptive retrieval. 475

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














