


22nd ACM Multimedia 2014: Orlando, FL, USA
- Kien A. Hua, Yong Rui, Ralf Steinmetz, Alan Hanjalic, Apostol Natsev, Wenwu Zhu:
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03 - 07, 2014. ACM 2014, ISBN 978-1-4503-3063-3
Keynote 1
- Harry Shum:
Bing, the fastest growing image search engine. 1
Keynote 2
- Rosalind W. Picard:
Affective media and wearables: surprising findings. 3-4
Keynote 3
- Klara Nahrstedt:
Back and to the future: quality provisioning for multimedia content delivery. 5
Best Paper Session
- Fangxiang Feng, Xiaojie Wang, Ruifan Li:
Cross-modal Retrieval with Correspondence Autoencoder. 7-16
- AmirHossein Habibian, Thomas Mensink, Cees G. M. Snoek:
VideoStory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events. 17-26
- Yelin Kim, Emily Mower Provost:
Say Cheese vs. Smile: Reducing Speech-Related Variability for Facial Emotion Recognition. 27-36
Multimedia Art and Entertainment
- Javier Villegas, Angus Graeme Forbes:
Analysis/synthesis approaches for creatively processing video signals. 37-46
- Sicheng Zhao, Yue Gao, Xiaolei Jiang, Hongxun Yao, Tat-Seng Chua, Xiaoshuai Sun:
Exploring Principles-of-Art Features For Image Emotion Recognition. 47-56
- Jiajia Li, Grace Ngai, Stephen Chi-fai Chan, Kien A. Hua, Hong Va Leong, Alvin T. S. Chan:
From Writing to Painting: A Kinect-Based Cross-Modal Chinese Painting Generation System. 57-66
- Charles Roberts, Matthew Wright, JoAnn Kuchera-Morin, Tobias Höllerer:
Gibber: Abstractions for Creative Multimedia Programming. 67-76
Action, Activity, and Event Recognition
- Zhigang Ma, Yi Yang, Nicu Sebe, Alexander G. Hauptmann:
Multiple Features But Few Labels?: A Symbiotic Solution Exemplified for Video Analysis. 77-86
- Chengcheng Jia, Yu Kong, Zhengming Ding, Yun Raymond Fu:
Latent Tensor Transfer Learning for RGB-D Action Recognition. 87-96
- Keze Wang, Xiaolong Wang, Liang Lin, Meng Wang, Wangmeng Zuo:
3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks. 97-106
- Pei Xu, Mao Ye, Xue Li, Qihe Liu, Yi Yang, Jian Ding:
Dynamic Background Learning through Deep Auto-encoder Networks. 107-116
Music, Speech and Audio
- Bin Wu, Erheng Zhong, Andrew Horner, Qiang Yang:
Music Emotion Recognition by Multi-label Multi-layer Multi-instance Multi-view Learning. 117-126
- Kuang Mao, Ju Fan, Lidan Shou, Gang Chen, Mohan S. Kankanhalli:
Song Recommendation for Social Singing Community. 127-136
- Hervé Bredin, Anindya Roy, Nicolas Pécheux, Alexandre Allauzen:
"Sheldon speaking, Bonjour!": Leveraging Multilingual Tracks for (Weakly) Supervised Speaker Identification. 137-146
- Kai Li, Jun Ye, Kien A. Hua:
What's Making that Sound? 147-156
Deep Learning for Multimedia
- Ji Wan, Dayong Wang, Steven Chu-Hong Hoi, Pengcheng Wu, Jianke Zhu, Yongdong Zhang, Jintao Li:
Deep Learning for Content-Based Image Retrieval: A Comprehensive Study. 157-166
- Zuxuan Wu, Yu-Gang Jiang, Jun Wang, Jian Pu, Xiangyang Xue:
Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification. 167-176
- Tianjun Xiao, Jiaxing Zhang, Kuiyuan Yang, Yuxin Peng, Zheng Zhang:
Error-Driven Incremental Learning in Deep Convolutional Neural Network for Large-Scale Image Classification. 177-186
- Hanwang Zhang, Yang Yang, Huan-Bo Luan, Shuicheng Yan, Tat-Seng Chua:
Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes. 187-196
Multimedia Grand Challenge
- Shintami Chusnul Hidayati, Kai-Lung Hua, Wen-Huang Cheng, Shih-Wei Sun:
What are the Fashion Trends in New York? 197-200
- Yin-Hsi Kuo, Yan-Ying Chen, Bor-Chun Chen, Wen-Yu Lee, Chun-Che Wu, Chia-Hung Lin, Yu-Lin Hou, Wen-Feng Cheng, Yi-Chih Tsai, Chung-Yen Hung, Liang-Chi Hsieh, Winston H. Hsu:
Discovering the City by Mining Diverse and Multimodal Data Streams. 201-204
- Jan Zahálka, Stevan Rudinac, Marcel Worring:
New Yorker Melange: Interactive Brew of Personalized Venue Recommendations. 205-208
- Rajiv Ratn Shah, Yi Yu, Anwar Dilawar Shaikh, Suhua Tang, Roger Zimmermann:
ATLAS: Automatic Temporal Segmentation and Annotation of Lecture Videos Based on Modelling Transition Time. 209-212
- Brendan Jou, Subhabrata Bhattacharya, Shih-Fu Chang:
Predicting Viewer Perceived Emotions in Animated GIFs. 213-216
- Yogesh Singh Rawat, Mohan S. Kankanhalli:
Context-Based Photography Learning using Crowdsourced Images and Social Media. 217-220
- Mei-Chen Yeh, Hsiao-Wei Lin:
Virtual Portraitist: Aesthetic Evaluation of Selfies Based on Angle. 221-224
- Jian Wang, Cuicui Kang, Yonghao He, Shiming Xiang, Chunhong Pan:
Cross Modal Deep Model and Gaussian Process Based Model for MSR-Bing Challenge. 225-228
- Yalong Bai, Wei Yu, Tianjun Xiao, Chang Xu, Kuiyuan Yang, Wei-Ying Ma, Tiejun Zhao:
Bag-of-Words Based Deep Neural Network for Image Retrieval. 229-232
- Yingwei Pan, Ting Yao, Xinmei Tian, Houqiang Li, Chong-Wah Ngo:
Click-through-based Subspace Learning for Image Search. 233-236
Multimedia HCI and QoE
- Luming Zhang, Yue Gao, Chao Zhang, Hanwang Zhang, Qi Tian, Roger Zimmermann:
Perception-Guided Multimodal Feature Fusion for Photo Aesthetics Assessment. 237-246
- Hiromi Nemoto, Philippe Hanhart, Pavel Korshunov, Touradj Ebrahimi:
Impact of Ultra High Definition on Visual Attention. 247-256
- Jiangyang Zhang, C.-C. Jay Kuo:
An Objective Quality of Experience (QoE) Assessment Index for Retargeted Images. 257-266
- Wei Song, Dian Tjondronegoro, Ivan Himawan:
Acceptability-based QoE Management for User-centric Mobile Video Delivery: A Field Study Evaluation. 267-276
Multimedia Analysis and Mining
- Wenxuan Xie, Yuxin Peng, Jianguo Xiao:
Weakly-Supervised Image Parsing via Constructing Semantic Graphs and Hypergraphs. 277-286
- Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Qi Tian:
Fused one-vs-all mid-level features for fine-grained visual categorization. 287-296
- Wei Zhang, Hongzhi Li, Chong-Wah Ngo, Shih-Fu Chang:
Scalable Visual Instance Mining with Threads of Features. 297-306
- Yanfei Wang, Fei Wu, Jun Song, Xi Li, Yueting Zhuang:
Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval. 307-316
Multimedia Systems
- Vengatanathan Krishnamoorthi, Niklas Carlsson, Derek L. Eager, Anirban Mahanti, Nahid Shahmehri:
Quality-adaptive Prefetching for Interactive Branched Video using HTTP-based Adaptive Streaming. 317-326
- Benjamin Rainer, Christian Timmerer:
Self-Organized Inter-Destination Multimedia Synchronization For Adaptive Media Streaming. 327-336
- Kiana Calagari, Krzysztof Templin, Tarek Elgamal, Khaled M. Diab, Piotr Didyk, Wojciech Matusik, Mohamed Hefeeda:
Anahita: A System for 3D Video Streaming with Depth Customization. 337-346
- Li Lin, Xiaofei Liao, Guang Tan, Hai Jin, Xiaobin Yang, Wei Zhang, Bo Li:
LiveRender: A Cloud Gaming System Based on Compressed Graphics Streaming. 347-356
Emotional and Social Signals in Multimedia
- Enver Sangineto, Gloria Zen, Elisa Ricci, Nicu Sebe:
We are not All Equal: Personalizing Models for Facial Expression Analysis with Transductive Parameter Transfer. 357-366
- Tao Chen, Felix X. Yu, Jiawei Chen, Yin Cui, Yan-Ying Chen, Shih-Fu Chang:
Object-Based Visual Sentiment Concept Analysis and Application. 367-376
- Florian Lingenfelser, Johannes Wagner, Elisabeth André, Gary McKeown, William Curran:
An Event Driven Fusion Approach for Enjoyment Recognition in Real-time. 377-386
- John R. Zhang, Jason Sherwin, Jacek Dmochowski, Paul Sajda, John R. Kender:
Correlating Speaker Gestures in Political Debates with Audience Engagement Measured via EEG. 387-396
High Risks High Rewards
- Michael Riegler, Martha A. Larson, Mathias Lux, Christoph Kofler:
How 'How' Reflects What's What: Content-based Exploitation of How Users Frame Social Images. 397-406
- Miaojing Shi, Teddy Furon, Hervé Jégou:
A Group Testing Framework for Similarity Search in High-dimensional Spaces. 407-416
- Eva Mohedano, Graham Healy, Kevin McGuinness, Xavier Giró-i-Nieto, Noel E. O'Connor, Alan F. Smeaton:
Object Segmentation in Images using EEG Signals. 417-426
- Oche Ejembi, Saleem N. Bhatti:
Help Save The Planet: Please Do Adjust Your Picture. 427-436
Multimedia Applications
- Kenta Kusumoto, Teemu Kinnunen, Jari Kätsyri, Heikki Lindroos, Pirkko Oittinen:
Media Experience of Complementary Information and Tweets on a Second Screen. 437-446
- Pradeep Kumar Jayaraman, Chi-Wing Fu:
Interactive Line Drawing Recognition and Vectorization with Commodity Camera. 447-456
- Xin Lu, Zhe Lin, Hailin Jin, Jianchao Yang, James Z. Wang:
RAPID: Rating Pictorial Aesthetics using Deep Learning. 457-466
- Si Liu, Xiaodan Liang, Luoqi Liu, Ke Lu, Liang Lin, Shuicheng Yan:
Fashion Parsing with Video Context. 467-476
Privacy, Health and Well-being
- Andrey Bogomolov, Bruno Lepri, Michela Ferron, Fabio Pianesi, Alex Pentland:
Daily Stress Recognition from Mobile Phone Data, Weather Conditions and Individual Traits. 477-486
- Shenggao Zhu, Robert J. Ellis, Gottfried Schlaug, Yee Sien Ng, Ye Wang:
Validating an iOS-based Rhythmic Auditory Cueing Evaluation (iRACE) for Parkinson's Disease. 487-496
- Zhan Qin, Jingbo Yan, Kui Ren, Chang Wen Chen, Cong Wang:
Towards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing. 497-506
- Huijie Lin, Jia Jia, Quan Guo, Yuanyuan Xue, Qi Li, Jie Huang, Lianhong Cai, Ling Feng:
User-level psychological stress detection from social media using deep neural network. 507-516
Multimedia Search and Indexing
- Jianfeng Wang, Heng Tao Shen, Shuicheng Yan, Nenghai Yu, Shipeng Li, Jingdong Wang:
Optimized Distances for Binary Code Ranking. 517-526
- Yao Hu, Zhongming Jin, Hongyi Ren, Deng Cai, Xiaofei He:
Iterative Multi-View Hashing for Cross Media Indexing. 527-536
- Xiaopeng Yang, Tao Mei, Yongdong Zhang:
Rescue Tail Queries: Learning to Image Search Re-rank via Click-wise Multimodal Fusion. 537-546
- Lu Jiang, Deyu Meng, Teruko Mitamura, Alexander G. Hauptmann:
Easy Samples First: Self-paced Reranking for Zero-Example Multimedia Search. 547-556
Social Media and Crowd
- Ming Yan, Jitao Sang, Changsheng Xu:
Mining Cross-network Association for YouTube Video Promotion. 557-566
- Xue Geng, Hanwang Zhang, Zheng Song, Yang Yang, Huan-Bo Luan, Tat-Seng Chua:
One of a Kind: User Profiling by Social Curation. 567-576
- Axel Carlier, Lilian Calvet, Duong-Trung-Dung Nguyen, Wei Tsang Ooi, Pierre Gurdjos, Vincent Charvillat:
3D Interest Maps From Simultaneous Video Recordings. 577-586
- Prem Seetharaman, Bryan Pardo:
Crowdsourcing a Reverberation Descriptor Map. 587-596
Multimedia Recommendations
- Peng Cui, Zhiyu Wang, Zhou Su:
What Videos Are Similar with You?: Learning a Common Attributed Representation for Video Recommendation. 597-606
- Rajiv Ratn Shah, Yi Yu, Roger Zimmermann:
ADVISOR: Personalized Video Soundtrack Recommendation by Late Fusion with Heuristic Rankings. 607-616
- Shaowei Liu, Peng Cui, Wenwu Zhu, Shiqiang Yang, Qi Tian:
Social Embedding Image Distance Learning. 617-626
- Xinxi Wang, Ye Wang:
Improving Content-based and Hybrid Music Recommendation using Deep Learning. 627-636
Doctoral Symposium 1
- Mario Taschwer:
Medical case retrieval. 639-642
- Stefan Wilk, Wolfgang Effelsberg:
Mobile Video Broadcasting Services: Combining Video Composition and Network Efficient Transmission. 643-646
- David Grunberg:
Music-information retrieval in environments containing acoustic noise. 647-650
- Jeffrey J. Scott:
Automated Multi-Track Mixing and Analysis of Instrument Mixtures. 651-654
Doctoral Symposium 2
- Jichao Sun:
Local Selection of Features for Image Search and Annotation. 655-658
- Manfred Jürgen Primus:
Segmentation and Indexing of Endoscopic Videos. 659-662
- Desara Xhura:
Learning recognition of semantically relevant video segments from endoscopy videos contributed and edited in a private social network. 663-666
- Mario Guggenberger:
Multimodal Alignment of Videos. 667-670
Open Source Software Competition 1
- Xin Yang, Chong Huang, Kwang-Ting (Tim) Cheng:
libLDB: a library for extracting ultrafast and distinctive binary feature description. 671-674
- Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross B. Girshick, Sergio Guadarrama, Trevor Darrell:
Caffe: Convolutional Architecture for Fast Feature Embedding. 675-678
- Joan Alabort-i-Medina, Epameinondas Antonakos, James Booth, Patrick Snape, Stefanos Zafeiriou:
Menpo: A Comprehensive Platform for Parametric Image Alignment and Visual Deformable Models. 679-682
Open Source Software Competition 2
- Jack Jansen:
VideoLat: An Extensible Tool for Multimedia Delay Measurements. 683-686
- Matthijs Douze, Hervé Jégou:
The Yael Library. 687-690
- Giuseppe Becchi, Marco Bertini, Lorenzo Cioni, Alberto Del Bimbo, Andrea Ferracani, Daniele Pezzatini, Mathias Lux:
Loki+Lire: a framework to create web-based multimedia search engines. 691-694
Art Exhibit
- Parag Kumar Mital:
Audiovisual Resynthesis in an Augmented Reality. 695-698
- Charles Roberts:
Sound-Light Giblet. 699-700
- Michael Riegler, Mathias Lux, Christian Zellot, Lukas Knoch, Horst Schnattler, Sabrina Napetschnig, Julian Kogler, Claus Degendorfer, Norbert Spot, Manuel Zoderer:
Gone: an interactive experience for two people. 701-704
- Sarah Linebaugh:
Circles and Sounds. 705-708
- Yuan-Yi Fan:
Qi Visualizer: An Interactive Pulse Spectrogram Visualization using Mobile Participatory Biometrics. 709-712
- F. Myles Sciotto, Jean-Michel Crettaz:
Stoicheia: Architecture, Sound and Tesla's Apotheosis. 713-716
- Lonce Wyse:
States of Diffusion for n+1 devices. 717-719
Demos 1: Searching and Finding
- Julien Champ, Alexis Joly, Pierre Bonnet:
Fine-grained Visual Faceted Search. 721-722
- André F. Araújo, David M. Chen, Peter Vajda, Bernd Girod:
Real-time query-by-image video search system. 723-724
- Vamsidhar Reddy Gaddam, Ragnar Langseth, Håkon Kvale Stensland, Carsten Griwodz, Pål Halvorsen, Øystein Landsverk:
Automatic Real-Time Zooming and Panning on Salient Objects from a Panoramic Video. 725-726
- Hao-Kai Wen, Wei-Che Chang, Chia-Hu Chang, Yin-Tzu Lin, Ja-Ling Wu:
Event Detection in Broadcasting Video for Halfpipe Sports. 727-728
- Jianquan Liu, Shoji Nishimura, Takuya Araki:
Wally: A Scalable Distributed Automated Video Surveillance System with Rich Search Functionalities. 729-730
- Junshi Huang, Wei Xia, Shuicheng Yan:
Deep Search with Attribute-aware Deep Network. 731-732
- Rene Kaiser, Wolfgang Weiss, Manolis Falelakis, Marian Florin Ursu:
Virtual Director Adapting Visual Presentation to Conversation Context in Group Videoconferencing: An Interactive Demo. 733-734
- Jie Wu, Changhu Wang, Liqing Zhang, Yong Rui:
SmartVisio: Interactive Sketch Recognition with Natural Correction and Editing. 735-736
Demos 2: Senses and Sensors
- Nimesha Ranasinghe, Kuan-Yi Lee, Gajan Suthokumar, Ellen Yi-Luen Do:
Taste+: Digitally Enhancing Taste Sensations of Food and Beverages. 737-738
- Prem Seetharaman, Bryan Pardo:
Reverbalize: A Crowdsourced Reverberation Controller. 739-740
- Mark Cartwright, Bryan Pardo:
SynthAssist: an audio synthesizer programmed with vocal imitation. 741-742
- Yong-Xiang Wang, Li-Yun Lo, Min-Chun Hu:
Eat as much as you can: a kinect-based facial rehabilitation game based on mouth and tongue movements. 743-744
- Ahmad M. Qamar, Imad Afyouni, Delwar Hossain, Faizan Ur Rehman, Asad H. Toonsi, Mohamed Abdur Rahman, Saleh M. Basalamah:
A Multimedia E-Health Framework Towards An Interactive And Non-Invasive Therapy Monitoring Environment. 745-746
- Hongyun Cai, Zhongxian Tang, Yang Yang, Zi Huang:
EventEye: Monitoring Evolving Events from Tweet Streams. 747-748
- Yuan Tian, Suraj Raghuraman, Yin Yang, Xiaohu Guo, Balakrishnan Prabhakaran:
3D Immersive Cardiopulmonary Resuscitation (CPR) Trainer. 749-750
- Mei-Chen Yeh, Hsiao-Wei Lin:
Taking good selfies on your phone. 751-752
Demos 3: Systems
- Peng Wang, Yang Yang, Zi Huang, Jiewei Cao, Heng Tao Shen:
WeMash: An Online System for Web Video Mashup. 753-754
- Zhenhuan Gao, Chien-Nan (Shannon) Chen, Klara Nahrstedt:
FreeViewer: An Intelligent Director for 3D Tele-Immersion System. 755-756
- Jun Chen, Chaokun Wang, Lei Yang, Qingfu Wen, Xu Wang:
MiSCon: a hot plugging tool for real-time motion-based system control. 757-758
- Zhineng Chen, Jinfeng Bai, Chong-Wah Ngo, Bailan Feng, Bo Xu:
CeleLabel: an interactive system for annotating celebrities in web videos. 759-760
- Yoshiyuki Kawano, Keiji Yanai:
FoodCam-256: A Large-scale Real-time Mobile Food Recognition System employing High-Dimensional Features and Compression of Classifier Weights. 761-762
- Daisuke Ochi, Yutaka Kunita, Kensaku Fujii, Akira Kojima, Shinnosuke Iwaki, Junichi Hirose:
HMD Viewing Spherical Video Streaming System. 763-764
- Duong-Trung-Dung Nguyen, Axel Carlier, Wei Tsang Ooi, Vincent Charvillat:
Jiku director 2.0: a mobile video mashup system with zoom and pan using motion maps. 765-766
- Mario Guggenberger, Mathias Lux, László Böszörményi:
ClockDrift: a mobile application for measuring drift in multimedia devices. 767-768
Posters 1
- Guangxin Ren, Junjie Cai, Shipeng Li, Nenghai Yu, Qi Tian:
Salable Image Search with Reliable Binary Code. 769-772