


default search action
MMM 2024, Amsterdam, The Netherlands - Part II
- Stevan Rudinac

, Alan Hanjalic
, Cynthia C. S. Liem
, Marcel Worring
, Björn Þór Jónsson
, Bei Liu
, Yoko Yamakata
:
MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part II. Lecture Notes in Computer Science 14555, Springer 2024, ISBN 978-3-031-53307-5 - Yuxuan Zhang, Huibin Tan, Long Lan, Xiao Teng, Jing Ren, Yongjun Zhang:

Self-distillation Enhanced Vertical Wavelet Spatial Attention for Person Re-identification. 1-13 - Tao Zhang, Ju Zhang, Yicheng Zou, Yu Zhang:

High Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Value Preprocessing and Block Classification. 14-27 - Xin Dong

, Rui Wang
, Sanyi Zhang
, Lihua Jing
:
HPattack: An Effective Adversarial Attack for Human Parsing. 28-41 - Fahong Wang, Zhao Liu, Jie Lei, Zeyu Zou, Wentao Han, Juan Xu, Xuan Li, Zunlei Feng, Ronghua Liang:

Dynamic-Static Graph Convolutional Network for Video-Based Facial Expression Recognition. 42-55 - Kezhou Chen, Shuo Wang, Yanbin Hao:

Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis. 56-69 - Xi Gu, Yuanyuan Xu

, Kun Zhu:
Semantic Importance-Based Deep Image Compression Using a Generative Approach. 70-81 - Wenbin Gan

, Minh-Son Dao, Koji Zettsu:
Drive-CLIP: Cross-Modal Contrastive Safety-Critical Driving Scenario Representation Learning and Zero-Shot Driving Risk Analysis. 82-97 - Peide Zhu, Zhen Wang, Manabu Okumura, Jie Yang

:
MRHF: Multi-stage Retrieval and Hierarchical Fusion for Textbook Question Answering. 98-111 - Tongwei Ma

, Lilian Zhang
, Bo Sun
, Chen Fan
:
Multi-scale Decomposition Dehazing with Polarimetric Vision. 112-126 - Qianqian Jin, Fazhi He

, Wei Tang:
CLF-Net: A Few-Shot Cross-Language Font Generation Method. 127-140 - Yixing Lu, Zhaoxin Fan, Min Xu:

Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation. 141-155 - Sze An Peter Tan, Guangyu Gao

, Jia Zhao
:
Audio-Visual Segmentation by Leveraging Multi-scaled Features Learning. 156-169 - Wei Liu, Jun Li, Zhijian Wu, Jianhua Xu, Bo Yang:

Multi-head Hashing with Orthogonal Decomposition for Cross-modal Retrieval. 170-183 - Guangrui Liu

, Wei Wu
:
Fusion Boundary and Gradient Enhancement Network for Camouflage Object Detection. 184-198 - Carlo Bretti

, Pascal Mettes
, Hendrik Vincent Koops
, Daan Odijk
, Nanne van Noord
:
Find the Cliffhanger: Multi-modal Trailerness in Soap Operas. 199-212 - Ruichen Li, Lei Wu, Pei Dong, Minggang He:

SM-GAN: Single-Stage and Multi-object Text Guided Image Editing. 213-226 - Shilong Yu, Chenhui Yang:

MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-End Speaker Extraction. 227-238 - Gia-Bao Le

, Van-Tien Nguyen
, Trung-Nghia Le
, Minh-Triet Tran
:
NearbyPatchCL: Leveraging Nearby Patches for Self-supervised Patch-Level Multi-class Classification in Whole-Slide Images. 239-252 - Songkang Dai

, Song-Lu Chen
, Qi Liu
, Chao Zhu
, Yan Liu
, Feng Chen, Xu-Cheng Yin
:
Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation. 253-266 - Linyi Qian

, Qian Huang
, Yulin Chen
, Junzhou Chen
:
A Purified Stacking Ensemble Framework for Cytology Classification. 267-280 - Jing Zhang, Wei Wu:

SEAS-Net: Segment Exchange Augmentation for Semi-supervised Brain Tumor Segmentation. 281-295 - Lihua Du

, Wei Wu
, Chen Li
:
Super-Resolution-Assisted Feature Refined Extraction for Small Objects in Remote Sensing Images. 296-309 - Zhenlei Cui, Zhenhua Tang, Jianze Li, Kai Chen:

Lightweight Image Captioning Model Based on Knowledge Distillation. 310-324 - Yuan-Yuan Liu

, Qi Liu
, Song-Lu Chen
, Feng Chen, Xu-Cheng Yin:
Irregular License Plate Recognition via Global Information Integration. 325-339 - Xiaohai Zhang, Jinming Zhang, Jianliang Li, Ming Chen:

TNT-Net: Point Cloud Completion by Transformer in Transformer. 340-352 - Jiacheng Chen

, Fei Wu
, Wanliang Wang
, Haoxin Sheng
:
Fourier Transformer for Joint Super-Resolution and Reconstruction of MR Image. 353-364 - Yangjie Cao, Bo Wang, Zhenqiang Li, Jie Li:

MVD-NeRF: Resolving Shape-Radiance Ambiguity via Mitigating View Dependency. 365-378 - Jingzhi Zhang, Xudong Li, Linghui Sun, Chengjie Bai:

DPM-Det: Diffusion Model Object Detection Based on DPM-Solver++ Guided Sampling. 379-393 - Sicheng Wang, Hao Jiang, Lei Xiang:

CT-MVSNet: Efficient Multi-view Stereo with Cross-Scale Transformer. 394-408 - Feifei Xu, Wang Zhou, Tao Sun, Jiahao Lu, Ziheng Yu, Guangzhen Li:

A Coarse and Fine Grained Masking Approach for Video-Grounded Dialogue. 409-422 - Xiaotong Bu, Jiwen Dong, Mengjiao Zhang, Guang Feng, Xizhan Gao, Sijie Niu:

Deep Self-supervised Subspace Clustering with Triple Loss. 423-436 - Baotong Su, Wenguang Zheng:

LigCDnet:Remote Sensing Image Cloud Detection Based on Lightweight Framework. 437-450 - Qizhen Chen, Xin Chen, Xiaoling Deng, Yubin Lan

:
Gait Recognition Based on Temporal Gait Information Enhancing. 451-463 - Yanyan Jiao, Wenzhu Yang, Wenjie Xing:

Learning Complementary Instance Representation with Parallel Adaptive Graph-Based Network for Action Detection. 464-478 - Xu Chen, Zhibin Zhang:

CESegNet:Context-Enhancement Semantic Segmentation Network Based on Transformer. 479-493 - Lu Zhang, Jingliang Peng, Na Lv:

MoCap-Video Data Retrieval with Deep Cross-Modal Learning. 494-506 - Guangjie Yang, Dajian Zhong, Yu-Jie Xiong, Hongjian Zhan:

LRATNet: Local-Relationship-Aware Transformer Network for Table Structure Recognition. 507-520

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














