26th MMM 2020: Daejeon, South Korea
- Yong Man Ro, Wen-Huang Cheng, Junmo Kim, Wei-Ta Chu, Peng Cui, Jung-Woo Choi, Min-Chun Hu, Wesley De Neve:
MultiMedia Modeling - 26th International Conference, MMM 2020, Daejeon, South Korea, January 5-8, 2020, Proceedings, Part I. Lecture Notes in Computer Science 11961, Springer 2020, ISBN 978-3-030-37730-4
Oral Session 1A: Audio and Signal Processing
- Xiuxiu Jing, Yike Ma, Qiang Zhao, Ke Lyu, Feng Dai:
Light Field Reconstruction Using Dynamically Generated Filters. 3-13
- Lili Guo, Longbiao Wang, Jianwu Dang, Zhilei Liu, Haotian Guan:
Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information. 14-25
- Congzhou Tian, Hangyu Li, Deshun Yang, Xiaoou Chen:
Gen-Res-Net: A Novel Generative Model for Singing Voice Separation. 26-36
- Congzhou Tian, Deshun Yang, Xiaoou Chen:
A Distinct Synthesizer Convolutional TasNet for Singing Voice Separation. 37-48
- Daniel Mélo, Nazareno Andrade:
Exploiting the Importance of Personalization When Selecting Music for Relaxation. 49-61
Oral Session 2A: Coding and HVS
- Yunchang Li, Zhijie Huang, Jun Sun:
An Efficient Encoding Method for Video Compositing in HEVC. 65-76
- Hongming Luo, Guangsen Liao, Xianxu Hou, Bozhi Liu, Fei Zhou, Guoping Qiu:
VHS to HDTV Video Translation Using Multi-task Adversarial Learning. 77-86
- Haibing Yin, Yafen Xing, Guangjing Xia, Xiaofeng Huang, Chenggang Yan:
Improving Just Noticeable Difference Model by Leveraging Temporal HVS Perception Characteristics. 87-98
- Minh-Man Ho, Gang He, Zheng Wang, Jinjia Zhou:
Down-Sampling Based Video Coding with Degradation-Aware Restoration-Reconstruction Deep Neural Network. 99-110
- Chengpeng Fu, Jinqiang Wang, Jitao Sang, Jian Yu, Changsheng Xu:
Beyond Literal Visual Modeling: Understanding Image Metaphor Based on Literal-Implied Concept Mapping. 111-123
Oral Session 3A: Color Processing and Art
- Zhengqing Li, Zhengjun Zha, Yang Cao:
Deep Palette-Based Color Decomposition for Image Recoloring with Aesthetic Suggestion. 127-138
- Carlos Castellanos, Bello Bello, Hyeryeong Lee, Mungyu Lee, Yoo Seok Lee, In Seop Chang:
On Creating Multimedia Interfaces for Hybrid Biological-Digital Art Installations. 139-150
- Haiyang Wei, Zhixin Li, Canlong Zhang:
Image Captioning Based on Visual and Semantic Attention. 151-162
- Wengang Cheng, Pengli Dou, Dengwen Zhou:
An Illumination Insensitive and Structure-Aware Image Color Layer Decomposition Method. 163-175
- Yugang Chen, Muchun Chen, Chaoyue Song, Bingbing Ni:
CartoonRenderer: An Instance-Based Multi-style Cartoon Image Translator. 176-187
Oral Session 4A: Detection and Classification
- Yiting Cheng, Yankai Wang, Lizhe Qi, Wenqiang Zhang:
Multi-condition Place Generator for Robust Place Recognition. 191-202
- Lingyun Zeng, You Song, Wenhai Wang:
Guided Refine-Head for Object Detection. 203-214
- Yafeng Zhou, Yongtao Wang, Zheqi He, Zhi Tang, Ching Y. Suen:
Towards Accurate Panel Detection in Manga: A Combined Effort of CNN and Heuristics. 215-226
- Nikolaos Gkalelis, Vasileios Mezaris:
Subclass Deep Neural Networks: Re-enabling Neglected Classes in Deep Network Training for Multimedia Classification. 227-238
- Jacob Gately, Ying Liang, Matthew Kolessar Wright, Natasha Kholgade Banerjee, Sean Banerjee, Soumyabrata Dey:
Automatic Material Classification Using Thermal Finger Impression. 239-250
Oral Session 5A: Face
- Hongkong Ge, Jiayuan Dong, Liyan Zhang:
Face Attributes Recognition Based on One-Way Inferential Correlation Between Attributes. 253-265
- Yahui Wang, Huimin Ma, Xinpeng Xing, Zeyu Pan:
Eulerian Motion Based 3DCNN Architecture for Facial Micro-Expression Recognition. 266-277
- Siyi Mo, Wenming Yang, Guijin Wang, Qingmin Liao:
Emotion Recognition with Facial Landmark Heatmaps. 278-289
- Jianli Zhou, Jun Chen, Chao Liang, Jin Chen:
One-Shot Face Recognition with Feature Rectification via Adversarial Learning. 290-302
- Ruolin Zheng, Weixin Li, Yunhong Wang:
Visual Sentiment Analysis by Leveraging Local Regions and Human Faces. 303-314
Oral Session 6A: Image Processing
- Tong Zhang, Xiaolong Li, Wenfa Qi, Zongming Guo:
Prediction-Error Value Ordering for High-Fidelity Reversible Data Hiding. 317-328
- Xin Xu, Xin Teng:
Classroom Attention Analysis Based on Multiple Euler Angles Constraint and Head Pose Estimation. 329-340
- Han Fang, Jun Chen, Qi Tian:
Multi-branch Body Region Alignment Network for Person Re-identification. 341-352
- Wenguang Wang, Zhouhui Lian, Yingmin Tang, Jianguo Xiao:
DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search. 353-364
- Abdullah Alfarrarjeh, Zeyu Ma, Seon Ho Kim, Cyrus Shahabi:
3D Spatial Coverage Measurement of Aerial Images. 365-377
Oral Session 7A: Learning and Knowledge Representation
- Hongkai Li, Cong Bai, Ling Huang, Yu-Gang Jiang, Shengyong Chen:
Instance Image Retrieval with Generative Adversarial Training. 381-392
- Xinjie Feng, Hongxun Yao, Wenbin Che, Shengping Zhang:
An Effective Way to Boost Black-Box Adversarial Attack. 393-404
- Lizi Liao, Lyndon Kennedy, Lynn Wilcox, Tat-Seng Chua:
Crowd Knowledge Enhanced Multimodal Conversational Assistant in Travel Domain. 405-418
- Haoran Chen, Minghua Zhu, Xuesong Cai, Jufeng Luo, Yunzhou Qiu:
Improved Model Structure with Cosine Margin OIM Loss for End-to-End Person Search. 419-430
- Feng Ni, Xixin Cao:
Effective Barcode Hunter via Semantic Segmentation in the Wild. 431-442
Oral Session 7B: Video Processing
- Qinyu Li, Lijun Chen, Hanli Wang, Xianhui Liu:
Wonderful Clips of Playing Basketball: A Database for Localizing Wonderful Actions. 445-454
- Zefeng Sun, Hanli Wang, Yun Yi, Qinyu Li:
Structural Pyramid Network for Cascaded Optical Flow Estimation. 455-467
- Muchun Chen, Yugang Chen, Truong Tan Loc, Bingbing Ni:
Real-Time Multiple Pedestrians Tracking in Multi-camera System. 468-479
- Ying She, Yang Yi:
Learning Multi-feature Based Spatially Regularized and Scale Adaptive Correlation Filters for Visual Tracking. 480-491
- Evlampios Apostolidis, Eleni Adamantidou, Alexandros I. Metsai, Vasileios Mezaris, Ioannis Patras:
Unsupervised Video Summarization via Attention-Driven Adversarial Learning. 492-504
Poster Session
- Zhijie Huang, Yunchang Li, Jun Sun:
Efficient HEVC Downscale Transcoding Based on Coding Unit Information Mapping. 507-518
- Zikai Song, Junqing Yu, Hengyou Cai, Yangliu Hu, Yi-Ping Phoebe Chen:
Fine-Grain Level Sports Video Search Engine. 519-531
- Seunghan Yang, Seungjun Jung, Heekwang Kang, Changick Kim:
The Korean Sign Language Dataset for Action Recognition. 532-542
- Dongqi Tang, Hao Kong, Xi Meng, Ruo-Ze Liu, Tong Lu:
SEE-LPR: A Semantic Segmentation Based End-to-End System for Unconstrained License Plate Detection and Recognition. 543-554
- Changbo Zhai, Le Wang, Qilin Zhang, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua:
Action Co-localization in an Untrimmed Video by Graph Neural Networks. 555-567
- Zhonghan Niu, Yang-Hao Zhou, Yu-Bin Yang, Jiancong Fan:
A Novel Attention Enhanced Dense Network for Image Super-Resolution. 568-580
- Ping Liu, Hongbo Yang, Jingnan Fu:
Marine Biometric Recognition Algorithm Based on YOLOv3-GAN Network. 581-592
- Qiuyuan Han, Jin Zheng:
Multi-scale Spatial Location Preference for Semantic Segmentation. 593-604
- Wei Chen, Ruimin Hu, Xiaochen Wang, Dengshi Li:
HRTF Representation with Convolutional Auto-encoder. 605-616
- Xuan Zhang, Guangxing Han, Wenduo He:
Unsupervised Feature Propagation for Fast Video Object Detection Using Generative Adversarial Networks. 617-627
- Gjorgji Strezoski, Rogier Knoester, Nanne van Noord, Marcel Worring:
OmniEyes: Analysis and Synthesis of Artistically Painted Eyes. 628-641
- Xiyue Gao, Jun Chen, Jing Yao, Wenqian Zhu:
LDSNE: Learning Structural Network Embeddings by Encoding Local Distances. 642-652
- Liwen Zhang, Ziqiang Shi, Jiqing Han, Anyan Shi, Ding Ma:
FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks. 653-665
- Chenhao Hu, Ruimin Hu, Xiaochen Wang, Tingzhao Wu, Dengshi Li:
Multi-step Coding Structure of Spatial Audio Object Coding. 666-678
- Soumya Chatterjee, Wei-Ta Chu:
Thermal Face Recognition Based on Transformation by Residual U-Net and Pixel Shuffle Upsampling. 679-689
- Shyi-Chyi Cheng, Ting-Lan Lin, Ping-Yuan Tseng:
K-SVD Based Point Cloud Coding for RGB-D Video Compression Using 3D Super-Point Clustering. 690-701
- Siying Zhai, Xiwei Hu, Xuanhong Chen, Bingbing Ni, Wenjun Zhang:
Resolution Booster: Global Structure Preserving Stitching Method for Ultra-High Resolution Image Translation. 702-713
- Haiyu Jiang, Yan Song, Jiang He, Xiangbo Shu:
Cross Fusion for Egocentric Interactive Action Recognition. 714-726
- Sun'ao Liu, Hai Xu, Yizhi Liu, Hongtao Xie:
Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion. 727-738
- Jian Cao, Na Tang, Jun Wang, Fan Liang:
Texture-Based Fast CU Size Decision and Intra Mode Decision Algorithm for VVC. 739-751
- Siying Liang, Ping Wang:
An Efficient Hierarchical Near-Duplicate Video Detection Algorithm Based on Deep Semantic Features. 752-763
- Wenfeng Song, Shuai Li, Yuting Guo, Shaoqi Li, Aimin Hao, Hong Qin, Qinping Zhao:
Meta Transfer Learning for Adaptive Vehicle Tracking in UAV Videos. 764-777
- Ruicong Xu, Li Niu, Liqing Zhang:
Adversarial Query-by-Image Video Retrieval Based on Attention Mechanism. 778-789
- Binxin Yang, Xuejin Chen, Richang Hong, Zihan Chen, Yuhang Li, Zheng-Jun Zha:
Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis. 790-801
- Lv Chen, Dengpan Ye, Shunzhi Jiang:
High Accuracy Perceptual Video Hashing via Low-Rank Decomposition and DWT. 802-812
- Dongyang Li, Ruimin Hu, Wenxin Huang, Xiaochen Wang, Dengshi Li, Fei Zheng:
HMM-Based Person Re-identification in Large-Scale Open Scenario. 813-825
- Junchen Deng, Ci Wang, Shiqi Liu:
No Reference Image Quality Assessment by Information Decomposition. 826-838