


default search action
31st ICONIP 2024: Auckland, New Zealand - Part VII
- Mufti Mahmud
, Maryam Doborjeh
, Kevin Wong
, Andrew Chi Sing Leung
, Zohreh Doborjeh
, M. Tanveer
:
Neural Information Processing - 31st International Conference, ICONIP 2024, Auckland, New Zealand, December 2-6, 2024, Proceedings, Part VII. Lecture Notes in Computer Science 15292, Springer 2026, ISBN 978-981-96-6593-8 - Fuyuan Cheng
, Yuxi Li
, Xiangtao Lu, Guibo Luo
, Yuesheng Zhu
:
LoTraNet: Locality-Guided Transformer Network for Image Manipulation Localization. 1-14 - Bingyan Nie, Wulin Xie, Jiang Long, Xiaohuan Lu
:
Dual-Level Contrastive Learning Framework. 15-30 - Yuxin Wu, Xin Ruan, Wenguang Zheng
:
DLAFormer: A Novel Approach to Image Super-Resolution with Comprehensive Attention Mechanisms. 31-45 - Pengcheng Zhao, Yanxiang Chen, Yang Zhao, Zhao Zhang:
Audio-Infused Automatic Image Colorization by Exploiting Audio Scene Semantics. 46-60 - Yassin Terraf, Youssef Iraqi
:
CoMISI: Multimodal Speaker Identification in Diverse Audio-Visual Conditions Through Cross-Modal Interaction. 61-77 - XinChao Wang, Hongxiang Li, Xinzhong Sun, Lihong Zhao, Liqiang Wang, Kai Zhang, Xuzhen Hu, Yong Wang, Yuexian Zou:
Multi-scale Spatial Feature Aggregation For Efficient Super Resolution. 78-92 - Chunshi Wang, Bin Zhao, Shuxue Ding:
SCANet: Split Coordinate Attention Network for Building Footprint Extraction. 93-105 - Shouxi Zhao
, Tianren Zhang, Qin Zou
, Chi Chen
, Zhongyuan Wang
:
XFusion: Cross-Attention Transformer for Multi-focus Image Fusion. 106-119 - Zhiyu Zhang, Zhiqiang Tian, Hao Luo, Gang Zhou:
Guided DiffusionDet: Guided Diffusion Model for Object Detection with Resample Mechanism. 120-134 - Guangxiong Gao, Qilong Zheng, Chengcheng Li:
Mutual Information-Based Mixed Precision Quantization. 135-149 - Junwei Tian, Canlong Zhang
, Zhixin Li, Zhiwen Wang, Chunrong Wei:
MLLM-Driven Semantic Enhancement and Alignment for Text-Based Person Search. 150-164 - Qin Guo, Guibo Luo
, Zhiqiang Bai
, Yuesheng Zhu
:
TFCM: Tuning-Free Facial Concept-Erasure in Text-to-Image Models Through Attention and Sample Modulation. 165-180 - Jingyun Yang, Jingge Wang, Guoqing Zhang, Yang Li:
Selecting the Best Sequential Transfer Path for Medical Image Segmentation with Limited Labeled Data. 181-193 - Mengyao Li, Yanbin Liu, Ling Chen:
Knowledge Distillation with Differentiable Optimal Transport on Graph Neural Networks. 194-209 - Leyi Zhu
, Weihuang Liu
, Xinyi Chen, Zimeng Li
, Xuhang Chen
, Zhen Wang
, Chi-Man Pun
:
Test-Time Intensity Consistency Adaptation for Shadow Detection. 210-224 - Mengting Li, Chuang Zhu
:
Learning from Noisy Labels for Long-Tailed Data via Optimal Transport. 225-240 - Bin Ma
, Haocheng Wang, Ruihe Ma, Yongjin Xian
, Chunpeng Wang
:
LCRPS: Large-Capacity Residual Plane Steganography Based on Multiple Adversarial Networks. 241-254 - Ruofan Zhang, Xuezhong Qian, Wei Song:
Aesthetics-Guided Multi-scale Feature Fusion for Style Transfer. 255-269 - Xiaohai Li, Jieyao Zhang, Jiaming Gu, Xiaoyuan Lu, Liang Zhang:
BEVRoad: A Cross-Modal and Temporary-Recurrent 3D Object Detector for Infrastructure Perception. 270-284 - Kangyu Tang
, Penglei Liu
, Jun Cheng
:
Dilated Pyramid Attention in Hierarchical Vision Transformer for Texture Recognition. 285-298 - Zhiyuan Wang, Jun Li, Jianhua Xu:
Attention-Based Domain Adaptive YOLO for Cross-Domain Object Detection. 299-314 - Yihuan Zhu
, Simiao Wang, Zhengxing Sun
:
In-WSOD: Integrality Weakly Supervised Object Detection with Classification and Localization Consistency. 315-330 - Guohua Lv
, Wenkuo Song, Zhonghe Wei, Aimei Dong, Jinyong Cheng, Guangxiao Ma:
GLEGNet: Infrared and Visible Image Fusion via Global-Local Feature Extraction and Edge-Gradient Preservation. 331-345 - Osama Ahmad
, Omer Abdul Jalil
, Usman Nazir
, Murtaza Taj
:
Mending of Spatio-Temporal Dependencies in Block Adjacency Matrix. 346-360 - Babita
, Kadali Sri Akash
, M. Sajid
, Deepak Ranjan Nayak
, Muhammad Tanveer
:
CaDT-Net: A Cascaded Deformable Transformer Network for Multiclass Breast Cancer Histopathological Image Classification. 361-372 - Yongtong Gu, Jinlai Zhang, Kefu Yi, Du Xu:
DIFA: Deformable Implicit Feature Alignment for Roadside Cooperative Perception. 373-387 - Tamotsu Kurioka, Teppei Suzuki
, Rei Kawakami
, Ikuro Sato
:
Transferring Teacher's Invariance to Student Through Data Augmentation Optimization. 388-401 - Haoning Wu, Kaiyan Zhao, Shaowu Wu, Xiaoping Wu, Xiaoguang Niu:
AARR-Net: An Attention Assistance Feature Fusion and Model Recursive Recovery Network for Category-Level 6D Object Pose Estimation. 402-416 - Jinyu Shi, Chenyang Zhao, Ruofei Zheng:
BRS-YOLO: A Balanced Optical Remote Sensing Object Detection Method. 417-429 - Yuxuan He, Haibin Xie, Junheng Liu, Wei Jiang, Xinglong Zhang, Xin Xu:
HDKI: A Hierarchical Deep Koopman Framework for Spatio-Temporal Prediction with Image Observations. 430-445

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.