default search action
33rd ICANN 2024: Lugano, Switzerland - Part III
- Michael Wand, Kristína Malinovská, Jürgen Schmidhuber, Igor V. Tetko:
Artificial Neural Networks and Machine Learning - ICANN 2024 - 33rd International Conference on Artificial Neural Networks, Lugano, Switzerland, September 17-20, 2024, Proceedings, Part III. Lecture Notes in Computer Science 15018, Springer 2024, ISBN 978-3-031-72337-7
Computer Vision: Anomaly Detection
- Junwei Wang, Yunpeng Wang, Jinquan Zeng:
Hybrid Encoder for Anomaly Detection Based on Latent Feature Regularization. 3-13
Computer Vision: Segmentation
- Haoran Yang, Longyi Tang, Tingting Wu, Binyu Yan:
DGFormer: A Dynamic Kernel with Gaussian Fusion Transformer for Semantic Image Segmentation. 17-30 - Qingwei Geng, Xiaodong Gu:
Integrating Audio-Visual Contexts with Refinement for Segmentation. 31-44 - Manuel Traub, Frederic Becker, Adrian Sauter, Sebsastian Otte, Martin V. Butz:
Loci-Segmented: Improving Scene Segmentation Learning. 45-61 - Yao Shen, Chunmeng Liu, Hanlin Chen, Kaiyang Zeng, Guangyao Li:
Measuring Affinity: Similarity-Based Auxiliary Unlabeled Guidance for Few-Shot Segmentation. 62-75 - Guoan Xu, Wenjing Jia, Tao Wu, Ligeng Chen, Guangwei Gao:
MFPNet: A Multi-scale Feature Propagation Network for Lightweight Semantic Segmentation. 76-86 - Chen Wang, Di Zhang, Xiaolong Li, Huifang Ma, Zhixin Li:
Weakly-Supervised Semantic Segmentation via Label Re-assignment in Dual-View Framework. 87-99
Computer Vision: Pose Estimation and Tracking
- Zheyan Gao, Jinyan Chen, Yuxin Liu, Yucheng Jin:
DT2S-Pose: A Deeper Temporal-Spatial Skeleton Refine Model for Pedestrian Pose Estimation. 103-117 - Yangliu He, Haoge Deng, Qiwei Shen, Jianxin Liao:
DTG: Learning A Dynamic Token Graph for 3D Pose Forecasting. 118-129 - Yingqi He, Jinghua Li, Dehui Kong, Baocai Yin:
Dual-Branch Network with Online Knowledge Distillation for 3D Hand Pose Estimation. 130-143 - Dongyang Yu, Haoyue Zhang, Ruisheng Zhao, Guoqi Chen, Wangpeng An, Yanhong Yang:
MovePose: A High-Performance Human Pose Estimation Algorithm on Mobile and Edge Devices. 144-158 - Rui Li, Jinlong Li:
Siamese Visual Tracking with Correlation and Awareness. 159-173
Computer Vision: Video Processing
- Hong Yu, Yu Zhang, Yuanqiu Liu, Hui Li, Han Liu:
Alignment-Enhanced Network for Temporal Language Grounding in Videos. 177-192 - Fengzhen Yu, Xiaodong Gu:
Boundary-Aware Noise-Resistant Video Moment Retrieval. 193-206 - Wei Li, Dezhao Luo, Dongbao Yang, Weiping Wang:
Large Language Model for Action Anticipation. 207-222 - Manuel Traub, Frederic Becker, Sebsastian Otte, Martin V. Butz:
Learning Object Permanence from Videos via Latent Imaginations. 223-240 - Jingze Chen, Simiao Zhuang, Qiqin Lin, Junfeng Yao, Lei Li:
SSFlowNet: Semi-supervised Scene Flow Estimation on Point Clouds With Pseudo Label. 241-255 - Yaxin Hu, Erhardt Barth:
Video Understanding Using 2D-CNNs on Salient Spatio-Temporal Slices. 256-270
Computer Vision: Generative Methods
- Xinlai Guo, Yanyun Tao, Yuzhen Zhang, Biao Xu, Jianyin Zheng, Guang Ji:
A Robust Image Dehazing Model Using Cycle Generative Adversarial Network with an Improved Atmospheric Scatter Model. 273-286 - Yuankun Chen, Dazhong Rong, Yi Li:
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Ground Image Synthesis. 287-302 - Ziteng Zhang, Peng Qiao, Dou Yong, Sidun Liu, Wenyu Li, Li Cao, Luo Chen:
Dual Dreamer: Extending Single-View Dreamer with Few Shot of Complementary Views. 303-317 - Yuansheng Ma, Dong Zhang, Suyang Zhu, Shoushan Li:
Hair Transfer with Efficient Heuristic Chain of Editing. 318-332 - Jialiang Xu, Weiran Chen, Lingbing Xu, Weitao Song, Yi Ji, Ying Li, Chunping Liu:
MAGIC: Multi-prompt Any Length Video Generation Model with Controllable Inter-frame Correlation and Low Barrier. 333-348 - Xing Bai, Jun Zhou, Pengyuan Zhang, Ruipeng Hao:
Make Audio Solely Drive Lip in Talking Face Video Synthesis. 349-360 - Mohua Chen, Hanchao Liu, Lanfang Dong:
P2H-GAN: An Effective Method For Generating Handwritten Expressions. 361-376 - Yan Zhang, Yefei Wang, Jialu Xiong, Jie Zhou, Jinshan Zeng:
SCI-Font: Enhancing Content-Style Representation for Chinese Calligraphy Generation with Skeleton, Contour and Inexact Paired Data. 377-391
Topics in Computer Vision
- Gokul Sudheesh Kumar, Aparna Raj, Sujala D. Shetty:
Driver Safety System: A Real-Time Sleep Detection and Lane Detection Model Using IoT and Deep Learning. 395-414 - Ting Huang, Jian Huang:
Gaze Target Detection with Visual Prompt Tuning Based on Attention. 415-429 - Dekun Lin, Tailai Peng, Rui Chen, Xinran Xie, Zhe Cui:
Let Multi-classification Help Deep Imbalanced Regression. 430-447 - Jingqi Hu, Chen Mao, Chong Tan, Hui Li, Hong Liu, Min Zheng:
ProGEO: Generating Prompts Through Image-Text Contrastive Learning for Visual Geo-Localization. 448-462
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.