


default search action
The Visual Computer, Volume 41
Volume 41, Number 1, January 2025
- Nadia Magnenat-Thalmann:
Welcome to the Year 2025. 1-2 - Acknowledgement to reviewers 2024. 3-10
- Wenji Yang
, Liping Xie, Wenbin Qian, Canghai Wu, Hongyun Yang:
Coarse-to-fine cascaded 3D hand reconstruction based on SSGC and MHSA. 11-24 - Gusu Song, Shaoyan Gai
, Feipeng Da:
Memory-based gradient-guided progressive propagation network for video deblurring. 25-40 - Rohit Pratap Singh
, Dolendro Singh Laiphrakpam:
Dyhand: dynamic hand gesture recognition using BiLSTM and soft attention methods. 41-51 - Zhe Li
, Hui Lv, Libo Cheng, Xiaoning Jia:
Image deblocking algorithm based on GC and SSR. 53-66 - I-Chao Shen
, Li-Wen Su, Yu-Ting Wu, Bing-Yu Chen:
StylePart: image-based shape part manipulation. 67-78 - Youssef Ait Khouya
, Mohammed Ait Oussous, Abdeslam Jakimi
, Faouzi Ghorbel
:
Stable and invertible invariants description for gray-level images based on Radon transform. 79-97 - Mahmoud A. Eldosoky
, Jianping Li, Amin Ul Haq, Fanyu Zeng, Mao Xu, Shakir Khan
, Inayat Khan:
WallNet: Hierarchical Visual Attention-Based Model for Putty Bulge Terminal Points Detection. 99-114 - Rajendra Nagar
:
Robust extrinsic symmetry estimation in 3D point clouds. 115-128 - Chen Zhao, Weiling Cai
, Zheng Yuan:
Spectral normalization and dual contrastive regularization for image-to-image translation. 129-140 - Ziliang Feng, Ju Zhang, Xusong Ran, Donglu Li, Chengfang Zhang:
Ghost-Unet: multi-stage network for image deblurring via lightweight subnet learning. 141-155 - Chunlu Li
, Feipeng Da:
Refined dense face alignment through image matching. 157-171 - Xiongbo Lu, Feng Liu, Yi Rong, Yaxiong Chen, Shengwu Xiong:
MakeupDiffuse: a double image-controlled diffusion model for exquisite makeup transfer. 173-189 - Junjie Liu, Junlong Liu, Rongxin Jiang, Boxuan Gu, Yaowu Chen, Chen Shen:
Boosted verification using siamese neural network with DiffBlock. 191-208 - Xujia Qin
, Xinyu Li, Mengjia Li, Hongbo Zheng, Xiaogang Xu:
Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement. 209-227 - Xiaochun Lei
, Zeyu Chen
, Zhaoxin Yu
, Zetao Jiang
:
BENet: boundary-enhanced network for real-time semantic segmentation. 229-241 - Feihu Bian, Suya Xiong, Ran Yi, Lizhuang Ma:
Multi-view stereo-regulated NeRF for urban scene novel view synthesis. 243-255 - Hengrui Zhang
, Yongfeng Qi, Huili Chen, Panpan Cao, Anye Liang, Shengcong Wen:
LSDNet: lightweight stochastic depth network for human pose estimation. 257-270 - Zubair Ahmad Lone
, Alwyn Roshan Pais:
Salient object detection in HSI using MEV-SFS and saliency optimization. 271-280 - Clement Mailhe
, Amine Ammar, Francisco Chinesta, Dominique Baillargeat:
Towards improving synthetic-to-real image correlation for instance recognition in structure monitoring. 281-301 - Yue Yu, Yue Yang, Jingshuo Xing:
PMGAN: pretrained model-based generative adversarial network for text-to-image generation. 303-314 - Haoyu Xiong, Yu Xiang:
Robust gradient aware and reliable entropy minimization for stable test-time adaptation in dynamic scenarios. 315-330 - Zhixuan Tang, Haiyun Shen, Peng Yu, Kaisong Zhang, Jianyu Chen:
Infrared tracking for accurate localization by capturing global context information. 331-343 - Yixiu Liu, Long Zhan
, Yu Feng, Pengju Si, Shaowei Jiang, Qiang Zhao, Chenggang Yan:
Loose-tight cluster regularization for unsupervised person re-identification. 345-358 - Le-Anh Tran
, Dong-Chul Park:
Encoder-decoder networks with guided transmission map for effective image dehazing. 359-382 - Yixiu Liu, Tao Jiang, Pengju Si, Shangdong Zhu
, Chenggang Yan, Shuai Wang, Haibing Yin:
Unpaired semantic neural person image synthesis. 383-397 - Yan Huang, Xinchang Lu, Jia Fu:
Single image reflection removal via self-attention and local discrimination. 399-408 - Ziyang Chen
, Yang Zhao
, Junling He, Yujie Lu, Zhongwei Cui, Wenting Li
, Yongjun Zhang
:
Feature distribution normalization network for multi-view stereo. 409-421 - Dayu Jia, Yanwei Pang, Jiale Cao, Jing Pan:
SSNet: a joint learning network for semantic segmentation and disparity estimation. 423-435 - Ye Li, Wu Zhang, Meiling Wu, Di Zhang, Zhiguo Wang, Changjiang You:
Multi-keypoints matching network for clothing detection. 437-449 - Zhentao Zhang
, Wenhao Li, Yuxi Cheng, Qingnan Huang, Taorong Qiu:
An improved residual learning model and its application to hardware image classification. 451-464 - Ping Ma
, Xinyi He, Yiyang Chen, Yuan Liu:
ISOD: improved small object detection based on extended scale feature pyramid network. 465-479 - Jian Xiong, Jie Wu, Ming Tang, Pengwen Xiong, Yushui Huang, Hang Guo:
Combining YOLO and background subtraction for small dynamic target detection. 481-490 - Henry Senior, Gregory G. Slabaugh, Shanxin Yuan, Luca Rossi:
Graph neural networks in vision-language image understanding: a survey. 491-516 - Yuanhao Chai, Jingyu Gong, Xin Tan, Jiachen Xu, Yuan Xie, Lizhuang Ma:
Learnable scene prior for point cloud semantic segmentation. 517-534 - Kunhong Xiong, Linbo Qing
, Lindong Li, Li Guo, Yonghong Peng:
Facial expression recognition based on local-global information reasoning and spatial distribution of landmark features. 535-548 - Lixia Xue, Wenhao Wang, Ronggui Wang, Juan Yang:
Modular dual-stream visual fusion network for visual question answering. 549-562 - Jinguang Chen
, Xin Zhang, Lili Ma
, Bo Yang, Kaibing Zhang:
CS-VITON: a realistic virtual try-on network based on clothing region alignment and SPM. 563-577 - Huihui Li, Junhao Zhu, Guihua Wen, Haoyang Zhong:
Structural self-contrast learning based on adaptive weighted negative samples for facial expression recognition. 579-590 - Lihuan Zheng
, Wanru Xu, Zhenjiang Miao, Xinxiu Qiu, Shanshan Gong:
RESTHT: relation-enhanced spatial-temporal hierarchical transformer for video captioning. 591-604 - Yanxiang Hu
, Panpan Wu
, Bo Zhang
, Wenhao Sun
, Yaru Gao
, Caixia Hao
, Xinran Chen
:
A new multi-focus image fusion quality assessment method with convolutional sparse representation. 605-624 - Shuyu Xiao, Yongfang Wang, Yihan Wang:
SISIM: statistical information similarity-based point cloud quality assessment. 625-638 - Jing Wu, Hao Wu
, Guowu Yuan
:
Detail-aware image denoising via structure preserved network and residual diffusion model. 639-658 - Luhan Wang
, Jun Li, Shangwei Guo, Shaokun Han:
A cascaded graph convolutional network for point cloud completion. 659-674 - Zhongxu Li, Qihan He, Wenyuan Yang
:
E-FPN: an enhanced feature pyramid network for UAV scenarios detection. 675-693 - Jiakun Zhao, Yige Cai
:
SCAKD: a knowledge distillation framework based on spatial-corner attention for infrared and visible image fusion. 695-708 - Hao Zhou, Junjie Yin, Yilun Yang, Meie Fang
, Ping Li:
Topology-guided accelerated vector field streamline visualization. 709-722 - Kun Wu, Lei Zhu
, Weihang Shi, Wenwu Wang:
Automated fabric defect detection using multi-scale fusion MemAE. 723-737 - A. Lubna
, Saidalavi Kalady, A. Lijiya:
Visual question answering on blood smear images using convolutional block attention module powered object detection. 739-757 - Xiyu Wei, Yanmei Dong, Qin Liu, Lei Wang, Liantang Lou:
Robust corner detection in continuous space. 759-772 - Jing Zhao, Yongjun He, Zheng Shi, Jian Qin, Yining Xie
:
A style-aware network based on multi-task learning for multi-domain image normalization. 773-783
Volume 41, Number 2, January 2025
- Jianliang Li, Jinming Zhang
, Xiaohai Zhang, Ming Chen:
Edge-guided generative network with attention for point cloud completion. 785-798 - Haowei Zhu
, Suqin Bai, Jinlong Shi, Chenggen Wang, Yunhan Sun, Jiawen Lu, Xin Shu, Shucheng Huang:
IOFusion: instance segmentation and optical-flow guided 3D reconstruction in dynamic scenes. 799-813 - Chao Yang, Meng Yang
, Hongyu Li
, Linlu Jiang, Xiang Suo
, Lijuan Mao, Weiliang Meng, Zhen Li:
A survey on soccer player detection and tracking with videos. 815-829 - Sameer Bhimrao Patil
, Suresh Shirgave:
Instructor emotion recognition system using manta ray foraging algorithm for improving the content delivery in video lecture. 831-851 - Ting Yu, Weiliang Meng, Zhongqi Wu, Jianwei Guo, Xiaopeng Zhang:
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow. 853-867 - Yasmeen Cheema
, Muhammad Nadeem Cheema, Anam Nazir, Fahad Ahmed KhoKhar, Ping Li, Ayaz Ahmed:
A novel approach for improving open scene text translation with modified GAN. 869-881 - Pengbin Fu, Ganyun Xiao, Huirong Yang:
SATD: syntax-aware handwritten mathematical expression recognition based on tree-structured transformer decoder. 883-900 - Roberto Alcover-Couso
, Juan C. SanMiguel, Marcos Escudero-Viñolo
, Pablo Carballeira:
Per-class curriculum for Unsupervised Domain Adaptation in semantic segmentation. 901-919 - Supriya Agrawal
, Prachi Natu:
OBB detector: occluded object detection based on geometric modeling of video frames. 921-943 - Xin Wang, Jin Feng, Jiajia Ding, Jun Gao:
Light field salient object detection based on discrete viewpoint selection and multi-feature fusion. 945-960 - Zhizhen Zhou, Yejing Huo, Guoheng Huang, An Zeng, Xuhang Chen
, Lian Huang, Zinuo Li:
QEAN: quaternion-enhanced attention network for visual dance generation. 961-973 - Shunsuke Takao
:
Underwater image sharpening and color correction via dataset based on revised underwater image formation model. 975-990 - Junqing Yuan, Mengting Fan, Zhenyang Liu, Tongxuan Han, Zhenzhong Kuang, Chihao Pan, Jiajun Ding:
Collaborative neural radiance fields for novel view synthesis. 991-1006 - Can Zhang, Feipeng Da, Shaoyan Gai:
Point clouds feature frequency domain analysis based on multilayer perceptron. 1007-1020 - Lei Wang, Xue-Song Tang
, Kuangrong Hao:
GFPE-ViT: vision transformer with geometric-fractal-based position encoding. 1021-1036 - Fahad Ahmed KhoKhar, Jamal Hussain Shah, Rabia Saleem, Anum Masood:
Harnessing deep learning for faster water quality assessment: identifying bacterial contaminants in real time. 1037-1048 - Yixiao Jin, Fu Gui, Minghao Chen, Xiang Chen, Haoxuan Li, Jingfa Zhang
:
Deep learning-driven automated quality assessment of ultra-widefield optical coherence tomography angiography images for diabetic retinopathy. 1049-1059 - Bo Qian, Xiangning Wang, Zhouyu Guan
, Dawei Yang, An-ran Ran, Tingyao Li, Zheyuan Wang, Yang Wen, Xinming Shu, Jinyang Xie, Shichang Liu, Guanyu Xing, Julio Silva-Rodríguez, Riadh Kobbi, Ping Li, Tingli Chen, Lei Bi, Jinman Kim, Weiping Jia
, Huating Li, Jing Qin, Ping Zhang, Ching-Yu Cheng, Pheng-Ann Heng, Tien Yin Wong, Carol Y. Cheung, Yih-Chung Tham, Nadia Magnenat-Thalmann, Bin Sheng:
HRDC challenge: a public benchmark for hypertension and hypertensive retinopathy classification from fundus images. 1061-1077 - Dapeng Yan, Gangyi Ding, Kexiang Huang
, Tianyu Huang:
Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN. 1079-1096 - Yan Zhou, Xiang Chen, Tingyao Li, Shiqun Lin, Bin Sheng, Ruhan Liu, Rongping Dai:
GAMNet: a gated attention mechanism network for grading myopic traction maculopathy in OCT images. 1097-1108 - Gang Liu
, Jiebang Wang, Yao Qian, Yonghua Li:
Infrared and visible image fusion method based on visual saliency objects and fuzzy region attributes. 1109-1125 - Shweta Saboo
, Joyeeta Singha:
Semantic hand gesture integration system using self-co-articulation and movement epenthesis detection. 1127-1140 - Lars Zawallich
:
Unfolding polyhedra via tabu search. 1141-1154 - Bo Qian, Hao Chen, Yupeng Xu, Yang Wen, Huating Li, Yuan Xie, David Dagan Feng, Jinman Kim, Lei Bi, Xun Xu, Xiangui He, Bin Sheng
:
Deep contour attention learning for scleral deformation from OCT images. 1155-1170 - Lan Wei, Nikolaos M. Freris:
Multi-scale graph neural network for physics-informed fluid simulation. 1171-1181 - Mengsi Guo, Mingfu Xiong
, Jin Huang, Xinrong Hu, Tao Peng:
Face photo-sketch portraits transformation via generation pipeline. 1183-1196 - Mengsi Wang
, Yuan Mei, Lichun Yang, Bin Tian, Kaijun Wu:
SDR: stepwise deep rectangling model for stitched images. 1197-1211 - Qingkuo Meng
, Yongjian Huai, Fei Ma, Wentao Ye, Haifeng Xu, Siyu Yang:
Visualization of the occurrence and spread of wildfires in three-dimensional natural scenes. 1213-1226 - Xuan Miao, Shijie Li, Zheng Li, Wenzheng Xu, Ning Yang:
Multi-scale gated network for efficient image super-resolution. 1227-1239 - Václav Skala:
A new fully projective O(lg N) line convex polygon intersection algorithm. 1241-1249 - Gaoming Yang, Yifeng Ding
, Xianjin Fang, Ji Zhang, Yan Chu:
Fast face swapping with high-fidelity lightweight generator assisted by online knowledge distillation. 1251-1271 - Wensheng Li, Jing Zhang
, Jiafeng Li, Li Zhuo:
Unpaved road segmentation of UAV imagery via a global vision transformer with dilated cross window self-attention for dynamic map. 1273-1291 - Xiangning Wang, Zhouyu Guan
, Bo Qian, Tingli Chen, Qiang Wu:
A deep learning system for the detection of optic disc neovascularization in diabetic retinopathy using optical coherence tomography angiography images. 1293-1302 - Mei Zhang, Lingling Liu, Yongtao Pei, Guojing Xie, Jinghua Wen:
Semantic segmentation of multi-scale remote sensing images with contextual feature enhancement. 1303-1317 - Ya'nan Guan, Shujiao Liao, Wenyuan Yang:
AParC-DETR: Accelerate DETR training by introducing Adaptive Position-aware Circular Convolution. 1319-1333 - Yong Liu, Xingyuan Li, Yong Liu, Wei Zhong:
SimpliFusion: a simplified infrared and visible image fusion network. 1335-1350 - Liping Zhu, Silin Wu, Xianxiang Chang, Yixuan Yang, Xuan Li:
Rethinking group activity recognition under the open set condition. 1351-1366 - Yuanqi Hu, Jianqi Zhang, Ling Bai, Jing Li, Bing Li, Ying Zang, Wenjun Hu:
From sketch to reality: precision-friendly 3D generation technology. 1367-1378 - Wenxuan Liu, Xuemei Jia, Yihao Ju, Yakun Ju, Kui Jiang, Shifeng Wu, Luo Zhong, Xian Zhong:
Fragrant: frequency-auxiliary guided relational attention network for low-light action recognition. 1379-1394 - Wuzhen Shi, Fei Tao, Yang Wen:
Joint super-resolution-based fast face image coding for human and machine vision. 1395-1408 - Shengzhou Luo
, Jingxing Xu, John Dingliana, Mingqiang Wei, Lu Han, Lewei He, Jiahui Pan
:
Publisher Correction: Twinenet: coupling features for synthesizing volume rendered images via convolutional encoder-decoders and multilayer perceptrons. 1409-1411 - Liwen Huang, Shujiao Liao, Wenyuan Yang
:
Correction: DC-PSENet: a novel scene text detection method integrating double ResNet-based and changed channels recursive feature pyramid. 1413-1414
Volume 41, Number 3, February 2025
- Yanfeng Zhao, Zhenjian Yang, Yunjie Zhang, Yadong Chen:
BGFNet: boundary information-aided graph structure fusion network for semantic segmentation of remote sensing images. 1415-1433 - Shuo Tong, Han Liu, Runyuan Guo
, Wenqing Wang, Ding Liu:
Context-Aware Enhanced Virtual Try-On Network with fabric adaptive registration. 1435-1451 - Pengshu Du, Xiao Wang, Qi Zheng, Xi Wang, WeiGang Li, Xin Xu:
Glare countering and exploiting via dual stream network for nighttime vehicle detection. 1453-1466 - Yongli Liu, Degang Yang
, Tingting Song, Yichen Ye, Xin Zhang
:
YOLO-SSP: an object detection model based on pyramid spatial attention and improved downsampling strategy for remote sensing images. 1467-1484 - Robin G. C. Maack, Felix Raith
, Juan F. Pérez
, Gerik Scheuermann, Christina Gillmann:
A workflow to systematically design uncertainty-aware visual analytics applications. 1485-1498 - QiGuang Zhu, Qiang Cen, YuXin Wang, Weidong Chen, Shuo Liu
:
An underwater target recognition algorithm incorporating improved attention mechanism and downsampling. 1499-1509 - Wenyue Sun, Jindong Zhang, Yitong Liu:
Adversarial-based refinement dual-branch network for semi-supervised salient object detection of strip steel surface defects. 1511-1525 - Jun Yang, Zilu Wu, Renbiao Wu:
Micro-expression recognition based on contextual transformer networks. 1527-1541 - Ya Li, Ziming Li, Huiwang Liu, Qing Wang:
ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation. 1543-1554 - Jindrich Adolf
, Peter Kán
, Tiare Feuchtner
, Barbora Adolfová
, Jaromír Dolezal
, Lenka Lhotská
:
Offistretch: camera-based real-time feedback for daily stretching exercises. 1555-1571 - Qunpo Liu
, Zhiwei Lu, Ruxin Gao, Xuhui Bu, Naohiko Hanajima:
SimpleMask: parameter link and efficient instance segmentation. 1573-1589 - Xiao Fang, Xin Gao, Baofeng Li, Feng Zhai, Yu Qin, Zhihang Meng, Jiansheng Lu, Chun Xiao:
A non-uniform low-light image enhancement method with multi-scale attention transformer and luminance consistency loss. 1591-1608 - Haibin Li, Aodi Guo, Yaqian Li
:
CCMA: CapsNet for audio-video sentiment analysis using cross-modal attention. 1609-1620 - Xun Zhao, Feiyun Xu
, Zheng Liu:
TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze. 1621-1635 - Qi Zhao, Congxuan Zhang
, Zhibo Rao, Zhen Chen, Zige Wang, Ke Lu:
GPDF-Net: geometric prior-guided stereo matching with disparity fusion refinement. 1637-1654 - Haihua Ding, Chuan Lin, Fuzhang Li, Yongcai Pan:
A feature aggregation network for contour detection inspired by complex cells properties. 1655-1671 - Zhengwu Yuan, Peixian Tang, Xinguang Sang, Fan Zhang, Zheqi Zhang:
Visionary: vision-aware enhancement with reminding scenes generated by captions via multimodal transformer for embodied referring expression. 1673-1688 - Munish Bhardwaj, Nafis Uddin Khan
, Vikas Baghel
:
Road crack detection using pixel classification and intensity-based distinctive fuzzy C-means clustering. 1689-1704 - Houfu Peng, Xing Lu, Daoxun Xia, Xiaoyao Xie:
A novel image restoration solution for cross-resolution person re-identification. 1705-1717 - Caifeng Liu, Fangjie Gu:
Differential motion attention network for efficient action recognition. 1719-1731 - Gang Zhang, Yang Geng, Zhao G. Gong:
A comprehensive review of deep learning approaches for group activity analysis. 1733-1755 - Huijuan Wang, Xinyue Chen, Quanbo Yuan, Peng Liu:
A review of 3D object detection based on autonomous driving. 1757-1775 - Libo Sun, Yifan Li, Wenhu Qin:
PEPillar: a point-enhanced pillar network for efficient 3D object detection in autonomous driving. 1777-1788 - Mohamed Charfeddine Mzoughi, Najib Ben Aoun, Sami Naouali:
A review on kinship verification from facial information. 1789-1809 - Jiawei Chen, Wen Su
, Mengjiao Ge, Ye He, Jun Yu:
To-Former: semantic segmentation of transparent object with edge-enhanced transformer. 1811-1825 - Ying Ma, Meng Wang, Guangyun Lu, Yajun Sun:
Multi-label semantic sharing based on graph convolutional network for image-to-text retrieval. 1827-1840 - Xiafan Li, Hongyan Quan:
MVPCL: multi-view prototype consistency learning for semi-supervised medical image segmentation. 1841-1854 - Yihe Nie, Xingbo Zhao, Yongxiang Li
, Qianwen Lu, Qingchuan Tao, Yanmei Yu:
DEAR: a novel deep-level semantics feature reinforce framework for Infrared Small Object Segmentation. 1855-1872 - Aokun Mei, Hua Huo, Jiaxin Xu, Ningya Xu:
Multistage attention region supplement transformer for fine-grained visual categorization. 1873-1889 - Tong Li, Zhaoxuan Zhang, Yuxin Wang, Yan Cui, Yuqi Li, Dongsheng Zhou, Baocai Yin, Xin Yang:
Self-supervised indoor scene point cloud completion from a single panorama. 1891-1905 - Xuyuan Zhang
, Chen Xu
, Yu Han
, George Baciu
:
Fabric image recolorization by fuzzy pretrained neural network. 1907-1920 - Shilong Wang
, Qianwen Hou, Jiaang Li, Jianlei Liu:
TSID-Net: a two-stage single image dehazing framework with style transfer and contrastive knowledge transfer. 1921-1938 - Xiaohong Zhang, Shengwu Xiong, Zhaoyang Sun, Jianwen Xiang:
Semi-hard constraint augmentation of triplet learning to improve image corruption classification. 1939-1956 - Huijuan Wang
, Boyan Cui, Quanbo Yuan, Gangqiang Pu, Xueli Liu, Jie Zhu:
Mini-3DCvT: a lightweight lip-reading method based on 3D convolution visual transformer. 1957-1969 - Zhigang Huang, Wanli Xue
, Yuxi Zhou
, Jinlu Sun, Yazhou Wu, Tiantian Yuan, Shengyong Chen:
Dual-stage temporal perception network for continuous sign language recognition. 1971-1986 - Zixuan Yu, Zhenjun Tang
, Xiaoping Liang, Hanyun Zhang, Ronghai Sun, Xianquan Zhang:
A novel image hashing with low-rank sparse matrix decomposition and feature distance. 1987-1998 - Shiyu Li, Zehao Liu, Meijing Gao, Yang Bai, Haozheng Yin:
MDSCN: multiscale depthwise separable convolutional network for underwater graphics restoration. 1999-2010 - Suyi Liu, Fang Xu, Chengdong Wu
, Jianning Chi, Xiaosheng Yu, Longxing Wei, Chuanjiang Leng:
CMT-6D: a lightweight iterative 6DoF pose estimation network based on cross-modal Transformer. 2011-2027 - Jun Wu
, Wanyu Nie
, Yu Zheng, Gan Zuo, Jiaming Dong, Siwei Wei:
Malleable pruning meets more scaled wide-area of attention model for real-time crack detection. 2029-2046 - Qiwang Li, Mingwen Shao, Fukang Liu, Yuanjian Qiao, Zhiyong Hu:
Contrastive local constraint for irregular image reconstruction and editability. 2047-2060 - Xiang Suo, Weidi Tang, Lijuan Mao, Zhen Li:
Correction: Digital human and embodied intelligence for sports science: advancements, opportunities and prospects. 2061 - Dhruv Meduri, Mohit Sharma, Vijay Natarajan:
Correction to: Jacobi set simplification for tracking topological features in time-varying scalar fields. 2063
Volume 41, Number 4, March 2025
- Hanqin Wang, Alexei Sourin:
Visual signatures for music mood and timbre. 2065-2077 - Khawla Ben Salah, Mohamed Othmani, Jihen Fourati, Monji Kherallah:
Advancing spatial mapping for satellite image road segmentation with multi-head attention. 2079-2089 - Mikolaj Maik
, Jakub Flotynski
, Krzysztof Walczak
:
Knowledge-based approach to adaptive XR interface design for non-programmers. 2091-2105 - Max Reimann
, Martin Büßemeyer, Benito Buchheim, Amir Semmo, Jürgen Döllner, Matthias Trapp:
Artistic style decomposition for texture and shape editing. 2107-2122 - Hiba Mzoughi
, Ines Njeh, Mohamed Ben Slima, Nouha Farhat, Chokri Mhiri:
Vision transformers (ViT) and deep convolutional neural network (D-CNN)-based models for MRI brain primary tumors images multi-classification supported by explainable artificial intelligence (XAI). 2123-2142 - Dingning Long
, Rongrong Chen
:
Cognitive capacity and aesthetics: the influence of visual working memory on landscape ink painting preference. 2143-2156 - Liangwei Wang
, Zhan Wang
, Xi Zhao
, Fugee Tsung
, Wei Zeng
:
Antarctica storytelling: creating interactive story maps for polar regions with graphic-based approach. 2157-2169 - Chuang Wu, Tingqin He:
Efficient minor defects detection on steel surface via res-attention and position encoding. 2171-2185 - Junjie Zhang, Yi Lin, Xin Zhou, Pangrong Shi, Xiaoqiang Zhu, Dan Zeng:
Precision in pursuit: a multi-consistency joint approach for infrared anti-UAV tracking. 2187-2202 - Jiayi Xu, Xuan Tan, Yixuan Ju, Xiaoyang Mao, Shanqing Zhang:
High similarity controllable face anonymization based on dynamic identity perception. 2203-2217 - Mohamed Elsayed, Mohamed Reda, Ahmed S. Mashaly, Ahmed Saleh:
LERFNet: an enlarged effective receptive field backbone network for enhancing visual drone detection. 2219-2232 - Jialin Zhu
, He Wang, David Hogg, Tom Kelly:
Learning to sculpt neural cityscapes. 2233-2249 - Suresh Cheekaty, G. Muneeswari
:
Advancing autism prediction through visual-based AI approaches: integrating advanced eye movement analysis and shape recognition with Kalman filtering. 2251-2270 - Huaping Zhou, Bin Deng, Kelei Sun, Shunxiang Zhang, Yongqi Zhang:
UTE-CrackNet: transformer-guided and edge feature extraction U-shaped road crack image segmentation. 2271-2283 - Xiaoyang Zhao, Zhuo Wang, Zhongchao Deng, Hongde Qin, Zhongben Zhu:
Transmission-guided multi-feature fusion Dehaze network. 2285-2297 - Randa I. Elanwar, Margrit Betke
:
Generative adversarial networks for handwriting image generation: a review. 2299-2322 - Yixi Li, Yanzhe Liu, Rong Chen
, Hui Li, Na Zhao:
Point cloud upsampling via a coarse-to-fine network with transformer-encoder. 2323-2337 - Neil Patrick Del Gallego
, Joel Ilao
, Macario O. Cordel II, Conrado R. Ruiz Jr.
:
Training a shadow removal network using only 3D primitive occluders. 2339-2376 - Qunpo Liu
, Qi Tang, Bo Su, Xuhui Bu, Naohiko Hanajima, Manli Wang:
Wire rope damage detection based on a uniform-complementary binary pattern with exponentially weighted guide image filtering. 2377-2390 - Jianjian Jiang, Ziwei Chen, Fangyuan Lei, Long Xu, Jiahao Huang
, Xiaochen Yuan:
Multi-granularity hypergraph-guided transformer learning framework for visual classification. 2391-2408 - Yueqian Pan, Qiaohong Chen, Xian Fang:
DAMAF: dual attention network with multi-level adaptive complementary fusion for medical image segmentation. 2409-2424 - Wei Li
, Bowen Li, Jingqi Wang
, Weiliang Meng, Jiguang Zhang, Xiaopeng Zhang:
ROMOT: Referring-expression-comprehension open-set multi-object tracking. 2425-2437 - Longfeng Shen
, Bin Hou, Yulei Jian, Xisong Tu, Yingjie Zhang, Lingying Shuai
, Fangzhen Ge, Debao Chen:
TransFGVC: transformer-based fine-grained visual classification. 2439-2459 - Avantika Saklani, Shailendra Tiwari
, H. S. Pannu:
Deep attentive multimodal learning for food information enhancement via early-stage heterogeneous fusion. 2461-2476 - Xiang Suo
, Weidi Tang, Lijuan Mao, Zhen Li:
Digital human and embodied intelligence for sports science: advancements, opportunities and prospects. 2477-2493 - Jiaxuan Zhu, Ming Shao, Libo Sun, Siyu Xia:
ACL-SAR: model agnostic adversarial contrastive learning for robust skeleton-based action recognition. 2495-2510 - JiaYan Wen, YuanSheng Zhuang, JunYi Deng:
EDM: a enhanced diffusion models for image restoration in complex scenes. 2511-2527 - Canlin Li, Xinyue Wang, Ran Yi, Wenjiao Zhang, Lihua Bi, Lizhuang Ma:
MCLGAN: a multi-style cartoonization method based on style condition information. 2529-2544 - Haobo Dong, Tianyu Song, Xuanyu Qi, Jiyu Jin, Guiyue Jin, Lei Fan:
Exploring high-quality image deraining Transformer via effective large kernel attention. 2545-2561 - Surendrabikram Thapa, Abhijit Sarkar
:
A deep dive into enhancing sharing of naturalistic driving data through face deidentification. 2563-2594 - Runtao Xi, Jiahao Lyu
, Kang Sun, Tian Ma:
Learning kernel parameter lookup tables to implement adaptive bilateral filtering. 2595-2605 - Yi-lun Wang, Yi-zheng Lang, Yunsheng Qian
:
Effective multi-scale enhancement fusion method for low-light images based on interest-area perception OCTM and "pixel healthiness" evaluation. 2607-2627 - Alireza Dehghanpour, Zahra Sharifi, Masoud Dehyadegari
:
Point cloud downsampling based on the transformer features. 2629-2638 - Yabo Wu, Wenting Li
, Ziyang Chen
, Hui Wen, Zhongwei Cui, Yongjun Zhang:
Distribution-decouple learning network: an innovative approach for single image dehazing with spatial and frequency decoupling. 2639-2654 - Yumei Tan, Haiying Xia, Shuxiang Song:
Robust consistency learning for facial expression recognition under label noise. 2655-2667 - Wen-Kai Tsai
, Hsin-Chih Wang:
Real-time salient object detection based on accuracy background and salient path source selection. 2669-2690 - Nauman Ullah Gilal, Marwa K. Qaraqe, Jens Schneider, Marco Agus:
Autocleandeepfood: auto-cleaning and data balancing transfer learning for regional gastronomy food computing. 2691-2708 - Ying Ni, Xiaoli Wang, Hanghang Peng, Yonzhi Li, Jinyang Wang, Haoxuan Li, Jin Huang:
Dual-branch dilated context convolutional for table detection transformer in the document images. 2709-2720 - Yubo Zhang, Lei Xu, Haibin Xiang, Haihua Kong, Junhao Bi, Chao Han:
LKSMN: Large Kernel Spatial Modulation Network for Lightweight Image Super-Resolution. 2721-2736 - Xiaoyu Song, Dezhi Han, Chongqing Chen, Xiang Shen, Huafeng Wu:
Vman: visual-modified attention network for multimodal paradigms. 2737-2754 - Zekang Liu, Wei Feng, Liqing Gao, Lianyu Hu:
DBL-SC: background-independent sign language recognition based on spatial channel separation computation. 2755-2766 - Ze Ouyang, Huihuang Zhao, Yudong Zhang, Long Chen:
STVDNet: spatio-temporal interactive video de-raining network. 2767-2782 - R. Raja Sekar, T. Dhiliphan Rajkumar, Koteswara Rao Anne:
Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme. 2783-2800 - Lirong Li, Jiang Ding, Hao Cui, Zhiqiang Chen, Guisheng Liao:
LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes. 2801-2815 - Saba Ghazanfar Ali, Xiaoxia Wang, Ping Li, Huating Li, Po Yang, Younhyun Jung, Jing Qin, Jinman Kim, Bin Sheng:
EGDNet: an efficient glomerular detection network for multiple anomalous pathological feature in glomerulonephritis. 2817-2834 - Pan Wu, Jin Tang:
FHFN: content and context feature hierarchical fusion networks for multi-focus image fusion. 2835-2856 - Ling-Xiao Qin, Hong-Mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-Sheng Jia:
Adaptive learning-enhanced lightweight network for real-time vehicle density estimation. 2857-2873 - Jit Chatterjee
, Maria Torres Vega
:
3D-Scene-Former: 3D scene generation from a single RGB image using Transformers. 2875-2889 - Xinyi Liu, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Guo Zhong, Xuhang Chen
, Chi-Man Pun:
Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression. 2891-2906 - Jiazhe Miao, Tao Peng, Fei Fang, Xinrong Hu, Li Li:
TDGar-Ani: temporal motion fusion model and deformation correction network for enhancing garment animation details. 2907-2921 - Wei Song, Kaili Yang:
Dual adaptive local semantic alignment for few-shot fine-grained classification. 2923-2937 - Changhong Shi, Weirong Liu, Jiahao Meng, Xiongfei Jia, Jie Liu:
Self-prior guided generative adversarial network for image inpainting. 2939-2951 - Chunyu Liu, Yixiao Jin, Zhouyu Guan
, Tingyao Li, Yiming Qin, Bo Qian, Zehua Jiang, Yilan Wu, Xiangning Wang, Ying Feng Zheng, Dian Zeng:
Visual-language foundation models in medicine. 2953-2972 - Xin Zhao, Yinhuang Chen, Chengzhuan Yang, Lincong Fang:
FuseNet: a multi-modal feature fusion network for 3D shape classification. 2973-2985 - Hao Li, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Xuhang Chen
, Guo Zhong, Chi-Man Pun:
Psanet: prototype-guided salient attention for few-shot segmentation. 2987-3001
Volume 41, Number 5, March 2025
- Liang Zhang, Shifeng Li, Xi Luo, Xiaoru Liu, Ruixuan Zhang:
Video anomaly detection with both normal and anomaly memory modules. 3003-3015 - Hong Zhao, Wengai Li
, Dailin Huang, Jinhai Huang, Lijun Zhang:
M-GAN: multiattribute learning and multimodal feature fusion-based generative adversarial network for text-to-image synthesis. 3017-3035 - Xunan Tan, Xiang Suo, Wenjun Li, Lei Bi, Fangshu Yao:
Data visualization in healthcare and medicine: a survey. 3037-3058 - Junding Sun, Chenxu Wang, Haifeng Sima, Xiaosheng Wu, Shuihua Wang, Yudong Zhang:
Mfpenet: multistage foreground-perception enhancement network for remote-sensing scene classification. 3059-3076 - R. Varun Prakash, V. Karthikeyan, S. Vishali, M. Karthika
:
Multi-level LSTM framework with hybrid sonic features for human-animal conflict evasion. 3077-3093 - Xintao Liu, Yan Gao, Changqing Zhan, Qiao Wang, Yu Zhang, Yi He, Hongyan Quan:
Directional latent space representation for medical image segmentation. 3095-3107 - Yan Zhou, Haibin Zhou, Yin Yang, Jianxun Li, Richard Irampaye, Dongli Wang, Zhengpeng Zhang:
Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation. 3109-3128 - Fengling Li, Zheng Yang, Yan Gui:
SES-yolov5: small object graphics detection and visualization applications. 3129-3142 - Xiaoying Chen, Weijie Ye:
Dual representations network for few-shot learning based on local descriptor importance: integrating global and local features. 3143-3154 - Zezheng Tang, Yihua Wu, Xinming Xu:
The study of recognizing ripe strawberries based on the improved YOLOv7-Tiny model. 3155-3171 - Daipeng Yang, Bo Peng, Xi Wu:
A bio-inspired edge and segment detection method by modeling multiple visual regions. 3173-3188 - Jianjun Zhu, Huihuang Zhao, Yudong Zhang:
Filter-deform attention GAN: constructing human motion videos from few images. 3189-3204 - Mingjian Li
, Younhyun Jung, Shaoli Song, Jinman Kim:
Attention-driven visual emphasis for medical volumetric image visualization. 3205-3219 - Jun Wang, Honghui Cao
, Chenhao Sun, Ziqing Huang, Yonghua Zhang:
Motion perception-driven multimodal self-supervised video object segmentation. 3221-3238 - Gang Chen, Wenju Wang, Haoran Zhou
, Xiaolin Wang:
EGCT: enhanced graph convolutional transformer for 3D point cloud representation learning. 3239-3261 - Haojie Gao, Peishun Liu, Xiaolong Ma, Zikang Yan, Ningning Ma, Wenqiang Liu, Xuefang Wang, Ruichun Tang:
TP-LSM: visual temporal pyramidal time modeling network to multi-label action detection in image-based AI. 3263-3281 - Guowei Zhang
, Wuzhi Li, Yutong Tang, Shuixuan Chen, Li Wang:
Lightweight CNN-ViT with cross-module representational constraint for express parcel detection. 3283-3295 - Jianglei Ye, Yigang Wang, Fengmao Xie, Qin Wang, Xiaoling Gu, Zizhao Wu
:
Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention. 3297-3308 - Xingquan Cai, Haoyu Zhang, LiZhe Chen, YiJie Wu, Haiyan Sun:
3D human pose estimation using spatiotemporal hypergraphs and its public benchmark on opera videos. 3309-3327 - Zhiyuan Li, Xin Jin, Qian Jiang, Puming Wang, Shin-Jye Lee, Shaowen Yao, Wei Zhou:
Crafting imperceptible and transferable adversarial examples: leveraging conditional residual generator and wavelet transforms to deceive deepfake detection. 3329-3344 - Wan-He Kai, Kai-Xin Xing:
Video-driven musical composition using large language model with memory-augmented state space. 3345-3357 - Wenzhe Shi, Ziqi Hu, Hao Chen, Hengjia Zhang, Jiale Yang, Li Li:
Orhlr-net: one-stage residual learning network for joint single-image specular highlight detection and removal. 3359-3370 - Xu Liu, Tong Zhou, Chong Wang, Yuping Wang, Yuanxin Wang, Qinjingwen Cao, Weizhi Du, Yonghuan Yang, Junjun He, Yu Qiao, Yiqing Shen:
Toward the unification of generative and discriminative visual foundation model: a survey. 3371-3412 - Yaping Deng, Yingjiang Li, Zibo Wei, Keying Li:
GLDC: combining global and local consistency of multibranch depth completion. 3413-3422 - Weifeng Cao, Xiaoyan Lei
, Jun Shi, Wanyong Liang, Jie Liu, Zongfei Bai:
HASN: hybrid attention separable network for efficient image super-resolution. 3423-3435 - Sunhan Xu, Jinhua Wang, Ning He, Guangmei Xu, Geng Zhang:
Optimizing underwater image enhancement: integrating semi-supervised learning and multi-scale aggregated attention. 3437-3455 - Yazhuo Fan, Jianhua Song, Lei Yuan, Yunlin Jia:
HCT-Unet: multi-target medical image segmentation via a hybrid CNN-transformer Unet incorporating multi-axis gated multi-layer perceptron. 3457-3472 - Muhammad Fahad, Tao Zhang, Yasir Iqbal, Azaz Ikram, Fazeela Siddiqui
, Bin Younas Abdullah, Malik Muhammad Nauman, Xin Zhao, Yanzhang Geng:
Advanced deepfake detection with enhanced Resnet-18 and multilayer CNN max pooling. 3473-3486 - Jiajun Yang, Xuesong Zhang, Cunli Song:
Research on a small target object detection method for aerial photography based on improved YOLOv7. 3487-3501 - Pengbo Bo, Qingxiang Liu, Caiming Zhang:
Topological structure extraction for computing surface-surface intersection curves. 3503-3518 - Wenji Yang, Hang An, Wenchao Hu, Xinxin Ma, Liping Xie:
Text-guided floral image generation based on lightweight deep attention feature fusion GAN. 3519-3535 - Ali Salar, Ali Ahmadi:
Enhancing high-vocabulary image annotation with a novel attention-based pooling. 3537-3551 - Yiting Wu, Pinqi Fang, Xiangning Wang, Jie Shen:
Predicting pancreatic diseases from fundus images using deep learning. 3553-3564 - Shunzhou Wang, Yao Lu, Wang Xia, Peiqi Xia, Ziqi Wang, Wei Gao:
Light field angular super-resolution by view-specific queries. 3565-3580 - Xiaohu Wang, Xin Yang, Hengrui Li, Tao Li:
FDDCC-VSR: a lightweight video super-resolution network based on deformable 3D convolution and cheap convolution. 3581-3593 - Minsoo Choi, Christos Mousas
, Nicoletta Adamo, Sanjeevani Patankar, Klay Hauser, Fangzheng Zhao, Richard E. Mayer:
ASAP: animation system for agent-based presentations. 3595-3610 - Dinghao Guo
, Dali Chen, Xin Lin, Zheng Xue, Wei Zheng, Xianling Li:
Semi-supervised image semantic segmentation method with semantic regions patching and uncertainty-guided loss. 3611-3626 - YaTing Liu, ChengDong Lan, Wanjian Feng:
DLKN: enhanced lightweight image super-resolution with dynamic large kernel network. 3627-3644 - Andrea Bodonyi, István Csoba, Roland Kunkli:
Real-time ray transfer for lens flare rendering using sparse polynomials. 3645-3662 - Shijie Li, Shanhua Yao, Zhonggen Wang, Juan Wu:
FFCANet: a frequency channel fusion coordinate attention mechanism network for lane detection. 3663-3678
Volume 41, Number 6, April 2025
- Zhaijuan Ding, Yanyu Liu, Sen Liu, Kangjian He, Dongming Zhou:
$\hbox {KD}^{3}$mt: knowledge distillation-driven dynamic mixer transformer for medical image fusion. 3679-3693 - Lin Wang, Jie Li, Chun Qi, Fengping Wang, Pan Wang:
Progressive Crowd Enhancement De-Background Network for crowd counting. 3695-3717 - Baoan Li, Long Zhang, Shangzhi Teng, Xueqiang Lyu:
Attribute correlation mask fusion network for pedestrian attribute recognition. 3719-3734 - Yasmin M. Alsakar, Nehal A. Sakr, Shaker H. Ali El-Sappagh, Tamer Abuhmed, Mohammed Elmogy:
Underwater image restoration and enhancement: a comprehensive review of recent trends, challenges, and applications. 3735-3783 - Xiaopan Li, Shiqian Wu, Xin Yuan, Shoulie Xie, Sos S. Agaian:
Hierarchical wavelet-guided diffusion model for single image deblurring. 3785-3800 - Yawen Xiang, Heng Zhou, Chengyang Li, Fangwei Sun, Zhongbo Li, Yongqiang Xie:
Deep learning in motion deblurring: current status, benchmarks and future prospects. 3801-3827 - Yunxi Chen, Yuanjie Cao, Fei Fang, Jin Huang, Xinrong Hu, Ruhan He, Junjie Zhang:
SACANet: end-to-end self-attention-based network for 3D clothing animation. 3829-3842 - Yuanjie Dang, Jiangyun Chen, Peng Chen, Nan Gao, Ruohong Huan, Dongdong Zhao:
Generate anomalies from normal: a partial pseudo-anomaly augmented approach for video anomaly detection. 3843-3852 - Qian Wan, Bin Zhou, Yanjiang Wang:
BSCGAN: structured minority class image generation under class-balanced pretraining. 3853-3865 - Shize Wang, Gang Wu, Jin Wang, Qing Zhu, Yunhui Shi, Baocai Yin:
SBC-Net: semantic-guided brightness curve estimation network for low-light image enhancement. 3867-3882 - Xinzhe Xie, Buyu Guo, Peiliang Li, Shuangyan He, Sangjun Zhou:
SwinMFF: toward high-fidelity end-to-end multi-focus image fusion via swin transformer-based network. 3883-3906 - Zitao Gao, Xiangjian Liu, Anna K. Wang, Liyu Lin:
A simulated two-stream network via multilevel distillation of reviewed features and decoupled logits for video action recognition. 3907-3923 - Ronghui Feng, Yuefei Wang, Jiajing Xue, Yuquan Xu, Yutong Zhang, Xi Yu:
CLAC-Net: a composite medical image segmentation framework using self-attention and cross-layer asymmetric connections. 3925-3955 - Guowen Yue, Ge Jiao, Chen Li, Jiahao Xiang:
When CNN meet with ViT: decision-level feature fusion for camouflaged object detection. 3957-3972 - Shuo Yang, Xiaoling Gu, Zhenzhong Kuang, Feiwei Qin, Zizhao Wu:
Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey. 3973-4000 - Chen Li, Weiqi Yan, Hongwei Zhao, Shihua Zhou, Yueping Wang:
TFFD-Net: an effective two-stage mixed feature fusion and detail recovery dehazing network. 4001-4016 - Kailin Liu, Yonghong Hou, Zihui Guo, Wenjie Yin, Yi Ren:
Visual context learning based on cross-modal knowledge for continuous sign language recognition. 4017-4031 - Qiang Cen, QiGuang Zhu, YuXin Wang, Weidong Chen, Shuo Liu:
YOLOv9-YX: lightweight algorithm for underwater target detection. 4033-4045 - Le-Anh Tran, Dong-Chul Park:
Lightweight image dehazing networks based on soft knowledge distillation. 4047-4066 - Haiyuan Cao, Deng Chen, Yanduo Zhang, Huabing Zhou, Dawei Wen, Congcong Cao:
MFINet: a multi-scale feature interaction network for point cloud registration. 4067-4079 - Libo Sun, Jiahui Yan, Yongchun Qiu, Wenhu Qin:
The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learning. 4081-4095 - Guowei An, Yaonan Wang, Kai Zeng, Qing Zhu, Xiaofang Yuan:
Deep spatial and discriminative feature enhancement network for stereo matching. 4097-4110 - Qiyang Liu, Yun Ge, Sijia Wang, Ting Wang, Jinlong Xu:
Dynamic manifold-based sample selection in contrastive learning for remote sensing image retrieval. 4111-4127 - Ziwei Zeng, Lihong Li, Zoufei Zhao, Qingqing Liu:
Improved fine-grained image classification in few-shot learning based on channel-spatial attention and grouped bilinear convolution. 4129-4141 - Yiqian Huang, Shuqi Liu, Fei Dong, Xu Li, Xin Yang, Ya Zhou, Jinxiang Huang, Yong Song:
PL-MCT: pseudo-labeling and multi-frame consistency training for semi-supervised visual tracking. 4143-4156 - Yong Zhang, Qingguo Shan, Wenyun Chen, Wenzhe Liu:
EEG emotion recognition approach using multi-scale convolution and feature fusion. 4157-4169 - Guowei Zhang, Weidong Zhang, Wuzhi Li, Li Wang, Huankang Cui:
A dynamic attention mechanism for object detection in road or strip environments. 4171-4181 - Youjie Zhou, Runyu Jiao, Zhonghan Tao, Xichang Liang, Yi Wan:
Spatial-frequency attention-based optical and scene flow with cross-modal knowledge distillation. 4183-4198 - Pham Thanh Huu, Nguyen Thai An, Nguyen Ngoc Trung, Huynh Ngoc Thien, Nguyen Sy Duc, Nguyen Thi Ty:
Judicial decision prediction using an integrated attention based bidirectional long-short term memory and dilated skip residual convolution neural network. 4199-4220 - Xinbiao Lu, Gaofan Zhan, Wen Wu, Wentao Zhang, Xiaolong Wu, Changjiang Han:
Van-DETR: enhanced real-time object detection with vanillanet and advanced feature fusion. 4221-4238 - Chenchen Xu, Kaixin Han, Weiwei Xu:
Image-aware layout generation with user constraints for poster design. 4239-4252 - Zhen Huang, Yongjian Zhu, Qiao Zhang, Hongyan Zang, Tengfei Lei:
Exploration, fusion, and refinement: a multivariate features interaction network for visual camouflaged detection. 4253-4267 - Yongbo Yu, Weidong Li, Linyan Bai, Jinlong Duan, Xuehai Zhang:
UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration. 4269-4285 - Liping Zhu, Haibo Zhou, Silin Wu, Tianrong Cheng, Hongjun Sun:
Polynomial for real-time rendering of neural radiance fields. 4287-4300 - Yong Zhang, Da Liu, Li Jiang, Huibing Wang, Wenzhe Liu:
Feature decomposition and structural learning for multi-diverse and multi-view data clustering. 4301-4320 - Pengjie Liu, Yanzhan Chen, Fan Yu, Qian Zhang:
Mastering adverse weather: a two-stage approach for robust semantic segmentation in autonomous driving. 4321-4346 - Yuqi Xiao, Yongjun Wu:
A dual-channel correlation filtering tracker for real-time tracking based on deep features of improved CaffeNet and integrated manual features. 4347-4361 - Dejin Zhao, Yunjie Ma, Xiaolong Yuan, Tong Tong, Dechao Wang, Rui Sun, Lili Cheng, Jianhai Zhang:
SME: Spatial multi-scale enhanced attention for automated detection of micro-defect on automobile complex paint surfaces. 4363-4376 - Yuanhong Zhong, Ting Chen, Daidi Zhong, Xiaoming Liu:
Wavelet-guided network with fine-grained feature extraction for vessel segmentation. 4377-4392 - Ling-Xiao Qin, Hong-Mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-Sheng Jia:
Correction: Adaptive learning-enhanced lightweight network for real-time vehicle density estimation. 4393-4394
Volume 41, Number 7, May 2025
- Long Zhang, QingHua Zhou, Shuai Tang, Yunxiang Chen:
High-definition multi-scale voice-driven facial animation: enhancing lip-sync clarity and image detail. 4395-4403 - Qiaohong Chen, Shufan Xie, Xian Fang, Qi Sun:
CTHFNet: contrastive translation and hierarchical fusion network for text-video-audio sentiment analysis. 4405-4418 - Xuanpeng Li, Hengshuo Cao, Jinming Li, Guangyu Li, Lin Zhao:
A shoreline extraction method based on dual-loop network framework. 4419-4430 - Viktor Leonhardt, Alexander Wiebel, Christoph Garth:
A framework for visual comparison of scalar fields with uncertainty. 4431-4448 - Ye Liu, Lei Zhu, Liang Wan, Xing Wang:
Masked frequency-color fusion network for video instance-level hazy lane detection. 4449-4461 - Jibing Peng, Yaohua Yi, Ying Zhou:
DPDTRN: a dynamic pixel-level difficulty-aware texture reconstruction network for document super-resolution. 4463-4480 - Huangyuan Wu, Bin Li, Lianfang Tian, Chao Dong:
DDFA: a displacement and diffusion-based feature augmentation method for imbalanced image recognition. 4481-4495 - Yunfei Qiu, Shuai Jiao, Qingtang Su:
Enhancing color image watermarking via fast quaternion Schur decomposition: a high-quality blind approach. 4497-4515 - Rui Sun, Xiaolu Yu, Huidong Feng, Fei Wang, Xudong Zhang:
Motion-robust mask face presentation attack detection via dual-stream texture-rPPG network. 4517-4532 - Zhiwen Shao, Yifan Cheng, Yong Zhou, Xiang Xiang, Jian Li, Bing Liu, Dit-Yan Yeung:
High-level LoRA and hierarchical fusion for enhanced micro-expression recognition. 4533-4546 - Kesai Wang, Xifan Yao, Nanfeng Ma, Guangjun Ran:
PLMOT-SLAM: a point-line features fusion SLAM system with moving object tracking. 4547-4565 - Ping Lu, Youcheng Cai, Jiale Yang, Dong Wang, Tingting Wu:
Uanet: uncertainty-aware cost volume aggregation-based multi-view stereo for 3D reconstruction. 4567-4580 - Zhengyan Liu, Huiwen Wang, Lihong Wang, Shanshan Wang:
Locality-constrained double-layer structure scaled simplex multi-view subspace clustering. 4581-4601 - Tianxiang Huo, Zhenqi Liu, Shichao Zhang, Jiening Wu, Rui Yuan, Shukai Duan, Lidan Wang:
CDNet: object detection based on cross-level aggregation and deformable attention for UAV aerial images. 4603-4621 - Krishnendu Maity, Susanta Mukhopadhyay:
LPSIS: a lossless secret image sharing scheme based on Legendre polynomials with low-cost reconstruction. 4623-4637 - Yuesong Tian, Li Shen, Xiang Tian, Dacheng Tao, Zhifeng Li, Wei Liu, Yaowu Chen:
DGL-GAN: discriminator-guided GAN compression. 4639-4660 - Javed Aymat Husen Shaikh, Shailendrakumar M. Mukane, Santosh Nagnath Randive:
Lightweight progressive recurrent network for video de-hazing in adverse weather conditions. 4661-4672 - Jinchang Zhu, Dayang Sun, Yu Cheng, Hailong Wang, Yujing Chen, Yaowei Chen:
GaitHF: enhancing appearance-based gait recognition through height fused images. 4673-4686 - Wanjun Zhong, Haohao Hu, Yuerong Wang, Li Li, Tianyu Han, Chunyong Li, Peng Zan:
Hierarchical evidence aggregation in two dimensions for active water surface object detection. 4687-4702 - Julien Thomas, Boyu Kuang, Yizhong Wang, Stuart Barnes, Karl Jenkins:
Advanced semantic segmentation of aircraft main components based on transfer learning and data-driven approach. 4703-4722 - Hongfei Li, Xueyang Li:
Dim and small objects detection in aerial images with stacked attention mechanism and improved loss function. 4723-4739 - Yanliang Ge, Junchao Ren, Cong Zhang, Min He, Hongbo Bi, Qiao Zhang:
Feature-aware and iterative refinement network for camouflaged object detection. 4741-4758 - Mohamad Haniff Junos, Anis Salwa Mohd Khairuddin:
YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction. 4759-4778 - Sardor Mamarasulov, Lianggangxu Chen, Changgu Chen, Yang Li, Changbo Wang:
Data augmentation with attention framework for robust deepfake detection. 4779-4798 - Jian Ni, Zheng Wang, Yixiao Wang, Wenjian Tao, Ao Shen:
DRCL: rethinking jigsaw puzzles for unsupervised medical image segmentation. 4799-4813 - Huanshuo Zhang, Guobiao Ren:
Intelligent leaf disease diagnosis: image algorithms using Swin Transformer and federated learning. 4815-4838 - Václav Skala:
A new fully projective O(log N) point-in-convex polygon algorithm: a new strategy. 4839-4850 - Jianuo Wang, Huawei Li, Yumin Chen:
Seg-invRender: fusing semantic segmentation based on NeRF for inverse rendering considering shadows. 4851-4864 - Wuzhen Shi, Aixue Yin, Yingxiang Li, Bo Qian:
Cross-view Transformer for enhanced multi-view 3D reconstruction. 4865-4877 - Jiaxing Yu, Zheng Chen, Jingkai Wang, Linghe Kong, Jiajie Yan, Wei Gu:
Enhancing Image Super-Resolution with Dual Compression Transformer. 4879-4892 - Saleha Masood, Mousa Ahmad Al Bashrawi, Muhammad Attique Khan, Anam Nazir:
Exploring ChatGPT applications in healthcare: a comprehensive overview. 4893-4914 - Yaqi Sun, Xiaolan Xie, Zhi Li, Huihuang Zhao:
Image style transfer with saliency constrained and SIFT feature fusion. 4915-4930 - Zean Jin, Yulong Bai, Wei Song, Qinghe Yu, Xiaoxin Yue:
EduCodeVR: VR for programming teaching through simulated farm and traffic. 4931-4955 - Zeyu Cai, Ziyu Zhang, Chengqian Jin, Feipeng Da:
DMDC: a cross-attention network for dynamic mask-based dual-camera snapshot hyperspectral Photography. 4957-4974 - Baokai Zu, Tong Cao, Yafang Li, Jianqiang Li, Hongyuan Wang, Quanzeng Wang:
RESwinT: enhanced pollen image classification with parallel window transformer and coordinate attention. 4975-4990 - Yaqian Li, Xin Zhan, Haibin Li, Wenming Zhang:
Selection and guidance: high-dimensional identity consistency preservation for face inpainting. 4991-5003 - Yang Yang, Changming Zhu:
Deep multi-view clustering based on global hybrid alignment with cross-contrastive learning. 5005-5017 - Tiago Madeira, Miguel Oliveira, Paulo Dias:
Reflection-aware 3D mirror segmentation and pose estimation. 5019-5028 - Tao Shi, Yao Ding, Kui-feng Zhu, Yan-jie Su:
DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision. 5029-5041 - Congying An, Jingjing Wu, Huanlong Zhang:
Occlusion-aware segmentation via RCF-Pix2Pix generative network. 5043-5057 - Zidi Cao, Jiayi Han, Sipeng Yang, Xiaogang Jin:
Fast best viewpoint selection with geometry-enhanced multiple views and cross-modal distillation. 5075-5086 - Hongru Wang, Hu Cheng, Jingtao Zhang:
Faster-PGYOLO: an efficient framework for floating debris detection in inland waters. 5087-5104 - Yanchen Liu, Changming Zhu:
DMVMLC-VT: Deep incomplete multi-view multi-label image classification with view translation and pseudo-label enhancement. 5105-5121 - Miao Yang, Meng Yang, Weiliang Meng, Ping Li, Zhen Li:
Msc-Net: multi-stage colorization network for real-world images with specular highlights. 5123-5134 - KeXuan Wang, ChenHua Liu, RongFu Zhang:
CMA-SOD: cross-modal attention fusion network for RGB-D salient object detection. 5135-5151 - Yanliang Ge, Taichuan Liang, Junchao Ren, Jiaxue Chen, Hongbo Bi:
Enhanced salient object detection in remote sensing images via dual-stream semantic interactive network. 5153-5169 - Jianguo Ning, Lei Zhang, Xiangzhao Xu:
Virtual simulation for the dynamic response of concrete blocks under blast loading. 5171-5187 - Shue Liu, Siwei Zhao, Yiying Wang, Jiaming Xin, Dashe Li:
An enhanced underwater fish segmentation method in complex scenes using Swin transformer with cross-scale feature fusion. 5189-5203 - Zewei Zhao, Xiaotie Ma, Yingjie Shi, Xiaotong Yang:
Multi-scale defect detection for plaid fabrics using scale sequence feature fusion and triple encoding. 5205-5221

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.