


default search action
25th ACM Multimedia Thematic Workshops 2017: Mountain View, CA, USA
- Wanmin Wu, Jianchao Yang, Qi Tian, Roger Zimmermann:

Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23 - 27, 2017. ACM 2017, ISBN 978-1-4503-5416-5
Session 1
- Matthijs Douze, Hervé Jégou, Jeff Johnson:

An Evaluation of Large-scale Methods for Image Instance and Class Discovery. 1-9 - Torben Wallbaum, Swamy Ananthanarayan, Shadan Sadeghian Borojeni, Wilko Heuten

, Susanne Boll:
Towards a Tangible Storytelling Kit for Exploring Emotions with Children. 10-16 - Farshid Farhat, Mohammad Mahdi Kamani, Sahil Mishra, James Z. Wang

:
Intelligent Portrait Composition Assistance: Integrating Deep-learned Models and Photography Idea Retrieval. 17-25 - Baris Kandemir

, Zihan Zhou, Jia Li, James Z. Wang
:
Beyond Saliency: Assessing Visual Balance with High-level Cues. 26-34 - Takumi Karasawa, Kohei Watanabe, Qishen Ha, Antonio Tejero-de-Pablos

, Yoshitaka Ushiku
, Tatsuya Harada:
Multispectral Object Detection for Autonomous Vehicles. 35-43 - Zhenfang Chen, Zhanghui Kuang, Kwan-Yee K. Wong

, Wei Zhang
:
Aggregated Deep Feature from Activation Clusters for Particular Object Retrieval. 44-51 - Jiewei Cao

, Zi Huang
, Heng Tao Shen:
Local Deep Descriptors in Bag-of-Words for Image Retrieval. 52-58 - Shanmin Pang, Wei Zhang, Li Zhu, Jihua Zhu, Jianru Xue:

Beyond Sum and Weighted Aggregation: An Efficient Mixed Aggregation Method with Multiple Weights for Image Search. 59-67 - Chun-Yu Tsai, John R. Kender:

Detecting Culture-specific Tags for News Videos through Multimodal Embedding. 68-74 - Abdullah Alfarrarjeh, Cyrus Shahabi, Seon Ho Kim:

Hybrid Indexes for Spatial-Visual Search. 75-83 - Shanshan Huang, Yichao Xiong, Ya Zhang

, Jia Wang:
Unsupervised Triplet Hashing for Fast Image Retrieval. 84-92 - Xiaopeng Li

, James She:
Relational Variational Autoencoder for Link Prediction with Multimedia Data. 93-100 - Haitian Pang, Zhi Wang, Chen Yan, Qinghua Ding, Lifeng Sun:

First Mile in Crowdsourced Live Streaming: A Content Harvest Network Approach. 101-109 - Yiyang Zhao, Linnan Wang, Wei Wu

, George Bosilca, Richard W. Vuduc, Jinmian Ye, Wenqi Tang, Zenglin Xu:
Efficient Communications in Training Large Scale Neural Networks. 110-116 - Sven Seele, Tobias Haubrich, Jonas Schild

, Rainer Herpers
, Marcin Grzegorzek
:
Augmenting Cognitive Processes and Behavior of Intelligent Virtual Agents by Modeling Synthetic Perception. 117-125 - Linsen Chen, Cc Dong, Du Chen, Han Li, Yuanyuan Zhao, Xun Cao, Zhan Ma:

Mobile Multispectral Video Streaming. 126-134 - Jun-Ho Choi, Manri Cheon, Min-Su Choi, Jong-Seok Lee:

Impact of Three-Dimensional Video Scalability on Multi-View Activity Recognition using Deep Learning. 135-143 - Bhojan Anand, Pan Wenren:

CloudHide: Towards Latency Hiding Techniques for Thin-client Cloud Gaming. 144-152 - Jan Willem Kleinrouweler, Fabijan Bajo, Britta Meixner, Sergio Cabrero, Pablo César

:
Mobile Instant Video Sharing: Does More Information Help? 153-160 - Ahmed Hamza, Hamed Ahmadi, Saleh Almowuena, Mohamed Hefeeda

:
QoE-fair Adaptive Streaming of Free-viewpoint Videos over LTE Networks. 161-169 - Hamed Ahmadi, Omar Eltobgy, Mohamed Hefeeda

:
Adaptive Multicast Streaming of Virtual Reality Content to Mobile Users. 170-178 - Toan H. Vu, An Dang, Le Dung, Jia-Ching Wang:

Self-Gated Recurrent Neural Networks for Human Activity Recognition on Wearable Devices. 179-185 - Marius Noreikis, Yu Xiao

, Antti Ylä-Jääski
:
SeeNav: Seamless and Energy-Efficient Indoor Navigation using Augmented Reality. 186-193 - Wenxiao Zhang, Sikun Lin

, Farshid Hassani Bijarbooneh, Hao Fei Cheng, Pan Hui:
CloudAR: A Cloud-based Framework for Mobile Augmented Reality. 194-200 - Tamay Aykut

, Stefan Lochbrunner, Mojtaba Karimi, Burak Cizmeci, Eckehard G. Steinbach
:
A Stereoscopic Vision System with Delay Compensation for 360° Remote Reality. 201-209 - Liming Xu, Andrew P. French, Dave Towey

, Steve Benford
:
Recognizing the Presence of Hidden Visual Markers in Digital Images. 210-218 - Sahil Narang, Andrew Best, Ari Shapiro, Dinesh Manocha:

Generating Virtual Avatars with Personalized Walking Gaits using Commodity Hardware. 219-227 - Xianglong Feng, Mengmei Ye, Viswanathan Swaminathan, Sheng Wei:

Towards the Security of Motion Detection-based Video Surveillance on IoT Devices. 228-235 - Masoud Mazloom, Bouke Hendriks, Marcel Worring

:
Multimodal Context-Aware Recommender for Post Popularity Prediction in Social Media. 236-244 - Stevan Rudinac, Iva Gornishka

, Marcel Worring
:
Multimodal Classification of Violent Online Political Extremism Content with Graph Convolutional Networks. 245-252 - Wenjuan Liao, Zhigang Tu

, Shizheng Wang, Yongzhou Li, Rui Zhong, Hui Zhong:
Compressed-domain Video Synopsis via 3D Graph Cut and Blank Frame Deletion. 253-261 - Fotis P. Kalaganis, Elisavet Chatzilari

, Spiros Nikolopoulos
, Nikos Laskaris
, Yiannis Kompatsiaris:
A Collaborative Representation Approach to Detecting Error-Related Potentials in SSVEP-BCIs. 262-270
Session 2
- Hanqi Wang, Siliang Tang

, Yin Zhang, Tao Mei
, Yueting Zhuang, Fei Wu:
Learning Deep Contextual Attention Network for Narrative Photo Stream Captioning. 271-279 - Jung Uk Kim

, Hak Gu Kim, Yong Man Ro
:
Robust and Real-Time Visual Tracking with Triplet Convolutional Neural Network. 280-286 - Yao Liu, Jianqiang Huang, Chang Zhou, Deng Cai, Xian-Sheng Hua:

Spatiotemporal Multi-Task Network for Human Activity Understanding. 287-295 - Andreas Leibetseder, Manfred Jürgen Primus, Stefan Petscharnig, Klaus Schoeffmann:

Real-Time Image-based Smoke Detection in Endoscopic Videos. 296-304 - Luowei Zhou, Chenliang Xu, Parker A. Koch, Jason J. Corso

:
Watch What You Just Said: Image Captioning with Text-Conditional Attention. 305-313 - Chen Yan, Peng Wang, Lifeng Sun:

Sensing Urban with Wi-Fi and Satellite: Functional Region Discovery across Cities. 314-322 - Genta Yoshimura, Atsunori Kanemura, Hideki Asoh

:
Reconstructable and Interpretable Representations for Time Series with Time-Skip Sparse Dictionary Learning. 323-331 - Jie Shao, Zhicheng Zhao, Fei Su, Ting Yue:

Towards Improving Canonical Correlation Analysis for Cross-modal Retrieval. 332-339 - Jing Huo, Yang Gao, Yinghuan Shi, Hujun Yin:

Variation Robust Cross-Modal Metric Learning for Caricature Recognition. 340-348 - Lele Chen, Sudhanshu Srivastava, Zhiyao Duan, Chenliang Xu:

Deep Cross-Modal Audio-Visual Generation. 349-357 - Baoyang Chen, Wenmin Wang, Jinzhuo Wang

:
Video Imagination from a Single Image with Transformation Generation. 358-366 - Takumi Ege, Keiji Yanai

:
Image-Based Food Calorie Estimation Using Knowledge on Food Categories, Ingredients and Cooking Directions. 367-375 - Kofi Boakye, Sachin Farfade, Hamid Izadinia, Yannis Kalantidis, Pierre Garrigues:

Tag Prediction at Flickr: A View from the Darkroom. 376-384 - Suibing Tong, Hefei Ling, Yuzhuo Fu, Dan Wang:

Cross-View Gait Identification with Embedded Learning. 385-392 - Mengxi Lin, Nakamasa Inoue, Koichi Shinoda

:
CTC Network with Statistical Language Modeling for Action Sequence Recognition in Videos. 393-401 - Neelay Pandit, Sherine Abdelhak:

Evolution of Trajectories: A Novel Representation for Deep Action Recognition. 402-407 - Yue Wu, Hongfu Liu, Jun Li, Yun Fu:

Deep Face Recognition with Center Invariant Loss. 408-414 - Ilija Ilievski, Jiashi Feng:

Generative Attention Model with Adversarial Self-learning for Visual Question Answering. 415-423 - Chen Shen, Chang Zhou, Zhongming Jin, Wenqing Chu, Rongxin Jiang, Yaowu Chen, Xian-Sheng Hua:

Learning Feature Embedding with Strong Neural Activations for Fine-Grained Retrieval. 424-432 - Yue Wang, Jinlai Liu, Xiaojie Wang:

Image Caption with Synchronous Cross-Attention. 433-441 - Shao-Ping Lu, Ruxandra-Marina Florea, Pablo César

, Peter Schelkens
, Adrian Munteanu:
Efficient Depth-aware Image Deformation Adaptation for Curved Screen Displays. 442-450 - Yunke Zhang, Kangkang Hu, Peiran Ren, Changyuan Yang, Weiwei Xu

, Xian-Sheng Hua:
Layout Style Modeling for Automating Banner Design. 451-459 - Feiran Huang, Xiaoming Zhang, Zhoujun Li

, Tao Mei
, Yueying He, Zhonghua Zhao:
Learning Social Image Embedding with Deep Multimodal Attention Networks. 460-468 - Audrey Ziwei Hu, Ryan E. Janzen, Max Hao Lu

, Steve Mann:
Liquid Jets as Logic-Computing Fluid-User-Interfaces. 469-476 - Brandon Mechtley, Julian Stein, Christopher Roberts, Sha Xin Wei:

Rich State Transitions in a Media Choreography Framework Using an Idealized Model of Cloud Dynamics. 477-484 - Conor Keighrey

, Ronan Flynn
, Siobhan Murray, Sean Brennan
, Niall Murray:
Comparing User QoE via Physiological and Interaction Measurements of Immersive AR and VR Speech and Language Therapy Applications. 485-492 - Biao Ma, Amy R. Reibman

:
Measuring and Improving the Viewing Experience of First-person Videos. 493-501 - Neetika Gupta, Mukesh Kumar Rohil:

An Experimental Study of Markerless Image Registration Methods on Varying Quality of Images for Augmented Reality Applications. 502-510 - Ashutosh Singla, Stephan Fremerey, Werner Robitza, Pierre R. Lebreton, Alexander Raake

:
Comparison of Subjective Quality Evaluation for HEVC Encoded Omnidirectional Videos at Different Bit-rates for UHD and FHD Resolution. 511-519 - Wen Heng, Tingting Jiang

:
Surveillance Video Quality Assessment Based on Face Recognition. 520-528 - Alison Marczewski, Adriano Veloso, Nivio Ziviani:

Learning Transferable Features for Speech Emotion Recognition. 529-536 - Sih-Huei Chen, Shao Hui Wu, Yuan-Shan Lee, Rocky Lo, Jia-Ching Wang:

Hierarchical Representation Based on Bayesian Nonparametric Tree-Structured Mixture Model for Playing Technique Classification. 537-543 - Andrea Salgian, David Vickerman, David Vassallo:

A Smart Mirror for Music Conducting Exercises. 544-549

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














