default search action
26th ACM Multimedia 2018: Seoul, Republic of Korea
- Susanne Boll, Kyoung Mu Lee, Jiebo Luo, Wenwu Zhu, Hyeran Byun, Chang Wen Chen, Rainer Lienhart, Tao Mei:
2018 ACM Multimedia Conference on Multimedia Conference, MM 2018, Seoul, Republic of Korea, October 22-26, 2018. ACM 2018, ISBN 978-1-4503-5665-7
FF-1
- Max Mühlhäuser:
Session details: FF-1. - Chuan-Xiang Li, Zhen-Duo Chen, Peng-Fei Zhang, Xin Luo, Liqiang Nie, Wei Zhang, Xin-Shun Xu:
SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval. 1-9 - Ana Garcia del Molino, Joo-Hwee Lim, Ah-Hwee Tan:
Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams. 10-17 - Xingxing Wei, Jun Zhu, Sitong Feng, Hang Su:
Video-to-Video Translation with Global Temporal Consistency. 18-25 - Jinxing Li, Bob Zhang, Guangming Lu, David Zhang:
Shared Linear Encoder-based Gaussian Process Latent Variable Model for Visual Classification. 26-34 - Jia-Xing Zhong, Nannan Li, Weijie Kong, Tao Zhang, Thomas H. Li, Ge Li:
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector. 35-44 - Jianshu Li, Jian Zhao, Yunpeng Chen, Sujoy Roy, Shuicheng Yan, Jiashi Feng, Terence Sim:
Multi-Human Parsing Machines. 45-53 - Xuanyi Dong, Linchao Zhu, De Zhang, Yi Yang, Fei Wu:
Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering. 54-62 - Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan:
Hierarchical Memory Modelling for Video Captioning. 63-71 - Zheng Wang, Xiang Bai, Mang Ye, Shin'ichi Satoh:
Incremental Deep Hidden Attribute Learning. 72-80 - Huarong Chen, Bin Wang, Tianxiang Pan, Liwang Zhou, Hua Zeng:
CropNet: Real-Time Thumbnailing. 81-89 - Zhi-Qi Cheng, Xiao Wu, Siyu Huang, Jun-Xiu Li, Alexander G. Hauptmann, Qiang Peng:
Learning to Transfer: Generalizable Attribute Learning with Multitask Neural Model Search. 90-98 - Yingying Zhu, Jiong Wang, Lingxi Xie, Liang Zheng:
Attention-based Pyramid Aggregation Network for Visual Place Recognition. 99-107 - Changde Du, Changying Du, Hao Wang, Jinpeng Li, Wei-Long Zheng, Bao-Liang Lu, Huiguang He:
Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data. 108-116 - Yuxiao Chen, Jianbo Yuan, Quanzeng You, Jiebo Luo:
Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM. 117-125 - Feifei Zhang, Tianzhu Zhang, Qirong Mao, Lingyu Duan, Changsheng Xu:
Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach. 126-135 - Runnan Li, Zhiyong Wu, Jia Jia, Jingbei Li, Wei Chen, Helen Meng:
Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs. 136-144 - Zhengzhe Liu, Xiaojuan Qi, Lei Pang:
Self-boosted Gesture Interactive System with ST-Net. 145-153 - Felix Kosmalla, Christian Murlowski, Florian Daiber, Antonio Krüger:
Slackliner - An Interactive Slackline Training Assistant. 154-162 - Yaoyu Li, Tianzhu Zhang, Lingyu Duan, Changsheng Xu:
A Unified Generative Adversarial Framework for Image Generation and Person Re-identification. 163-172 - Anahita Mahzari, Afshin Taghavi Nasrabadi, Aliehsan Samiei, Ravi Prakash:
FoV-Aware Edge Caching for Adaptive 360° Video Streaming. 173-181
Keynote 1
- Susanne Boll:
Session details: Keynote 1. - Marianna Obrist:
Don't just Look - Smell, Taste, and Feel the Interaction. 182
FF-2
- Peng Cui:
Session details: FF-2. - Rui Zhang, Sheng Tang, Yu Li, Junbo Guo, Yongdong Zhang, Jintao Li, Shuicheng Yan:
Style Separation and Synthesis via Generative Adversarial Networks. 183-191 - Hao Xiao, Weiyao Lin, Bin Sheng, Ke Lu, Junchi Yan, Jingdong Wang, Errui Ding, Yihao Zhang, Hongkai Xiong:
Group Re-Identification: Leveraging and Integrating Multi-Grain Information. 192-200 - Xu Gao, Tingting Jiang:
OSMO: Online Specific Models for Occlusion in Multiple Object Tracking under Surveillance Scene. 201-210 - Yuke Li:
Video Forecasting with Forward-Backward-Net: Delving Deeper into Spatiotemporal Consistency. 211-219 - Rui Shao, Xiangyuan Lan, Pong C. Yuen:
Feature Constrained by Pixel: Hierarchical Adversarial Deep Domain Adaptation. 220-228 - Zhixing Chen, Di Huang, Yunhong Wang, Liming Chen:
Fast and Light Manifold CNN based 3D Facial Expression Recognition across Pose Variations. 229-238 - Xiaomeng Song, Yucheng Shi, Xin Chen, Yahong Han:
Explore Multi-Step Reasoning in Video Question Answering. 239-247 - Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Nannan Sun, Jianlong Tan, Yongdong Zhang:
Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling. 248-256 - Zhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei Zhang:
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos. 257-264 - Zhihang Fu, Zhongming Jin, Guo-Jun Qi, Chen Shen, Rongxin Jiang, Yaowu Chen, Xian-Sheng Hua:
Previewer for Multi-Scale Object Detector. 265-273 - Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou:
Learning Discriminative Features with Multiple Granularities for Person Re-Identification. 274-282 - Guoxiang Qu, Wenwei Zhang, Zhe Wang, Xing Dai, Jianping Shi, Junjun He, Fei Li, Xiulan Zhang, Yu Qiao:
StripNet: Towards Topology Consistent Strip Structure Segmentation. 283-291 - Samuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild. 292-301 - Can Wang, Shangfei Wang:
Personalized Multiple Facial Action Unit Recognition through Generative Adversarial Recognition Network. 302-310 - Cigdem Beyan, Muhammad Shahid, Vittorio Murino:
Investigation of Small Group Social Interactions Using Deep Visual Activity-Based Nonverbal Features. 311-319 - Eugene Yujun Fu, Michael Xuelin Huang, Hong Va Leong, Grace Ngai:
Cross-Species Learning: A Low-Cost Approach to Learning Human Fight from Animal Fight. 320-327 - Qianli Xu, Vigneshwaran Subbaraju, Chee How Cheong, Aijing Wang, Kathleen Kang, Munirah Bashir, Yanhong Dong, Liyuan Li, Joo-Hwee Lim:
Personalized Serious Games for Cognitive Intervention with Lifelog Visual Analytics. 328-336 - Wendy Bolier, Wolfgang Hürst, Guido van Bommel, Joost Bosman, Harriët Bosman:
Drawing in a Virtual 3D Space - Introducing VR Drawing in Elementary School Art Education. 337-345 - Luca Lovagnini, Wenxiao Zhang, Farshid Hassani Bijarbooneh, Pan Hui:
CIRCE: Real-Time Caching for Instance Recognition on Cloud Environments and Multi-Core Architectures. 346-354 - Wenxiao Zhang, Bo Han, Pan Hui:
Jaguar: Low Latency Mobile Augmented Reality with Flexible Tracking. 355-363
Keynote 2
- Tao Mei:
Session details: Keynote 2. - Xian-Sheng Hua:
Challenges and Practices of Large Scale Visual Intelligence in the Real-World. 364
Deep-1 (Image Translation)
- Nicu Sebe:
Session details: Deep-1 (Image Translation). - Yuheng Zhi, Huawei Wei, Bingbing Ni:
Structure Guided Photorealistic Style Transfer. 365-373 - Xuewen Yang, Dongliang Xie, Xin Wang:
Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation. 374-382 - Bo Zhao, Xiao Wu, Zhi-Qi Cheng, Hao Liu, Zequn Jie, Jiashi Feng:
Multi-View Image Generation from a Single-View. 383-391 - Jichao Zhang, Yezhi Shu, Songhua Xu, Gongze Cao, Fan Zhong, Meng Liu, Xueying Qin:
Sparsely Grouped Multi-Task Generative Adversarial Networks for Facial Attribute Manipulation. 392-401
Vision-1 (Machine Learning)
- Jingkuan Song:
Session details: Vision-1 (Machine Learning). - Jindong Wang, Wenjie Feng, Yiqiang Chen, Han Yu, Meiyu Huang, Philip S. Yu:
Visual Domain Adaptation with Manifold Embedded Distribution Alignment. 402-410 - Zheyan Shen, Peng Cui, Kun Kuang, Bo Li, Peixuan Chen:
Causally Regularized Learning with Agnostic Data Selection Bias. 411-419 - Yanjie Liang, Qiangqiang Wu, Yi Liu, Yan Yan, Hanzi Wang:
Robust Correlation Filter Tracking with Shepherded Instance-Aware Proposals. 420-428 - Fan Qi, Xiaoshan Yang, Changsheng Xu:
A Unified Framework for Multimodal Domain Adaptation. 429-437
Multimedia-1 (Multimedia Recommendation & Discovery)
- Mark Liao:
Session details: Multimedia-1 (Multimedia Recommendation & Discovery). - Shintami Chusnul Hidayati, Cheng-Chun Hsu, Yu-Ting Chang, Kai-Lung Hua, Jianlong Fu, Wen-Huang Cheng:
What Dress Fits Me Best?: Fashion Recommendation on the Clothing Style for Personal Body Shape. 438-446 - Xiaowen Huang, Shengsheng Qian, Quan Fang, Jitao Sang, Changsheng Xu:
CSAN: Contextual Self-Attention Network for User Sequential Recommendation. 447-455 - Jun Hu, Shengsheng Qian, Quan Fang, Changsheng Xu:
Attentive Interactive Convolutional Matching for Community Question Answering in Social Multimedia. 456-464 - Francesco Gelli, Tiberio Uricchio, Xiangnan He, Alberto Del Bimbo, Tat-Seng Chua:
Beyond the Product: Discovering Image Posts for Brands in Social Media. 465-473
Vision-2 (Object & Scene Understanding)
- Zheng-Jun Zha:
Session details: Vision-2 (Object & Scene Understanding). - Lishi Zhang, Chenghan Fu, Jia Li:
Collaborative Annotation of Semantic Objects in Images with Multi-granularity Supervisions. 474-482 - Mengyang Pu, Yaping Huang, Qingji Guan, Qi Zou:
GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic Segmentation. 483-491 - Hengcan Shi, Hongliang Li, Qingbo Wu, Fanman Meng, King N. Ngan:
Boosting Scene Parsing Performance via Reliable Scale Prediction. 492-500 - Fan Zhu, Li Liu, Jin Xie, Fumin Shen, Ling Shao, Yi Fang:
Learning to Synthesize 3D Indoor Scenes from Monocular Images. 501-509
Multimodal-1 (Multimodal Reasoning)
- Xian-Sheng Hua:
Session details: Multimodal-1 (Multimodal Reasoning). - Chaojun Han, Fumin Shen, Li Liu, Yang Yang, Heng Tao Shen:
Visual Spatial Attention Network for Relationship Detection. 510-518 - Chenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong:
Object-Difference Attention: A Simple Relational Attention for Visual Question Answering. 519-527 - Jinwei Qi, Yuxin Peng, Yunkan Zhuo:
Life-long Cross-media Correlation Learning. 528-536 - Yue Gu, Xinyu Li, Kaixiang Huang, Shiyu Fu, Kangning Yang, Shuhong Chen, Moliang Zhou, Ivan Marsic:
Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder. 537-545
System-1 (Video Analysis & Streaming)
- Xin Yang:
Session details: System-1 (Video Analysis & Streaming). - Wentao Liu, Zhengfang Duanmu, Zhou Wang:
End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks. 546-554 - Ibrahim Ben Mustafa, Tamer Nadeem, Emir Halepovic:
FlexStream: Towards Flexible Adaptive Video Streaming on End Devices using Extreme SDN. 555-563 - Lan Xie, Xinggong Zhang, Zongming Guo:
CLS: A Cross-user Learning based System for Improving QoE in 360-degree Video Adaptive Streaming. 564-572 - Abdelhak Bentaleb, Ali C. Begen, Saad Harous, Roger Zimmermann:
A Distributed Approach for Bitrate Selection in HTTP Adaptive Streaming. 573-581
FF-3
- Zhu Li:
Session details: FF-3. - Qing Zhang, Ganzhao Yuan, Chunxia Xiao, Lei Zhu, Wei-Shi Zheng:
High-Quality Exposure Correction of Underexposed Photos. 582-590 - Qianqian Xu, Jiechao Xiong, Xinwei Sun, Zhiyong Yang, Xiaochun Cao, Qingming Huang, Yuan Yao:
A Margin-based MLE for Crowdsourced Partial Ranking. 591-599 - Ana Garcia del Molino, Michael Gygli:
PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation. 600-608 - Lu Pang, Yaowei Wang, Yi-Zhe Song, Tiejun Huang, Yonghong Tian:
Cross-Domain Adversarial Feature Learning for Sketch Re-identification. 609-617 - Quan Chen, Tiezheng Ge, Yanyu Xu, Zhiqiang Zhang, Xinxin Yang, Kun Gai:
Semantic Human Matting. 618-626 - Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan:
Geometry Guided Adversarial Facial Expression Synthesis. 627-635 - Siqi Wang, Yijie Zeng, Qiang Liu, Chengzhang Zhu, En Zhu, Jianping Yin:
Detecting Abnormality without Knowing Normality: A Two-stage Approach for Unsupervised Video Abnormal Event Detection. 636-644 - Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin:
BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network. 645-653 - Xianghui Luo, Zhuo Su, Jiaming Guo, Gengwei Zhang, Xiangjian He:
Trusted Guidance Pyramid Network for Human Parsing. 654-662 - Jingjing Li, Lei Zhu, Zi Huang, Ke Lu, Jidong Zhao:
I read, I saw, I tell: Texts Assisted Fine-Grained Visual Classification. 663-671 - Ziwei Wang, Yadan Luo, Yang Li, Zi Huang, Hongzhi Yin:
Look Deeper See Richer: Depth-aware Image Paragraph Captioning. 672-680 - Huaiwen Zhang, Quan Fang, Shengsheng Qian, Changsheng Xu:
Learning Multimodal Taxonomy via Variational Deep Graph Embedding and Clustering. 681-689 - Junyu Gao, Tianzhu Zhang, Changsheng Xu:
Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling. 690-699 - Yongcheng Liu, Lu Sheng, Jing Shao, Junjie Yan, Shiming Xiang, Chunhong Pan:
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection. 700-708 - Jiayu Wang, Wengang Zhou, Jinhui Tang, Zhongqian Fu, Qi Tian, Houqiang Li:
Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation. 709-717 - Yangbangyan Jiang, Zhiyong Yang, Qianqian Xu, Xiaochun Cao, Qingming Huang:
When to Learn What: Deep Cognitive Subspace Clustering. 718-726 - Wendong Zhang, Feng Gao, Bingbing Ni, Lingyu Duan, Yichao Yan, Jingwei Xu, Xiaokang Yang:
Depth Structure Preserving Scene Image Generation. 727-736 - Jiawei Liu, Zheng-Jun Zha, Hongtao Xie, Zhiwei Xiong, Yongdong Zhang:
CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification. 737-745 - Gusi Te, Wei Hu, Amin Zheng, Zongming Guo:
RGCNN: Regularized Graph CNN for Point Cloud Segmentation. 746-754 - Bin Liu, Yue Cao, Mingsheng Long, Jianmin Wang, Jingdong Wang:
Deep Triplet Quantization. 755-763
Keynote 3
- Jiebo Luo:
Session details: Keynote 3. - Ernest A. Edmonds:
What has Art Got to do With It? 773
Best Paper Session
- Rainer Lienhart, Tao Mei:
Session details: Best Paper Session. - Hao Tang, Wei Wang, Dan Xu, Yan Yan, Nicu Sebe:
GestureGAN for Hand Gesture-to-Gesture Translation in the Wild. 774-782 - Bei Liu, Jianlong Fu, Makoto P. Kato, Masatoshi Yoshikawa:
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training. 783-791 - Jian Zhao, Jianshu Li, Yu Cheng, Terence Sim, Shuicheng Yan, Jiashi Feng:
Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing. 792-800 - Lizi Liao, Yunshan Ma, Xiangnan He, Richang Hong, Tat-Seng Chua:
Knowledge-aware Multimodal Dialogue Systems. 801-809
Doctoral Symposium
- Meng Wang:
Session details: Doctoral Symposium. - Na Zhao:
End2End Semantic Segmentation for 3D Indoor Scenes. 810-814 - Sabrina Kletz:
On Reducing Effort in Evaluating Laparoscopic Skills. 815-819 - Tianran Hu:
Decode Human Life from Social Media. 820-824
FF-4
- Wen-Huang Cheng:
Session details: FF-4. - Yiling Wu, Shuhui Wang, Qingming Huang:
Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. 825-833 - Zhendong Mao, Quan Wang, Yongdong Zhang, Bin Wang:
Post Tuned Hashing: A New Approach to Indexing High-dimensional Data. 834-842 - Meng Liu, Xiang Wang, Liqiang Nie, Qi Tian, Baoquan Chen, Tat-Seng Chua:
Cross-modal Moment Localization in Videos. 843-851 - Zhaoda Ye, Yuxin Peng:
Multi-Scale Correlation for Sequential Cross-modal Hashing Learning. 852-860 - Litao Yu, Yongsheng Gao, Jun Zhou:
Generative Adversarial Product Quantisation. 861-869 - Yubin Deng, Chen Change Loy, Xiaoou Tang:
Aesthetic-Driven Image Enhancement by Adversarial Learning. 870-878 - Kekai Sheng, Weiming Dong, Chongyang Ma, Xing Mei, Feiyue Huang, Bao-Gang Hu:
Attention-based Multi-Patch Aggregation for Image Aesthetic Assessment. 879-886 - Zheqi He, Yafeng Zhou, Yongtao Wang, Siwei Wang, Xiaoqing Lu, Zhi Tang, Ling Cai:
An End-to-End Quadrilateral Regression Network for Comic Panel Extraction. 887-895 - Xin Yang, Jinyu Chen, Zhiwei Wang, Qiaozhe Zhang, Wenyu Liu, Chunyuan Liao, Kwang-Ting Cheng:
Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network. 896-904 - Xiaojing Ma, Changming Liu, Sixing Cao, Bin Zhu:
JPEG Decompression in the Homomorphic Encryption Domain. 905-913