default search action
18th ACM Multimedia 2010: Firenze, Italy
- Alberto Del Bimbo, Shih-Fu Chang, Arnold W. M. Smeulders:
Proceedings of the 18th International Conference on Multimedia 2010, Firenze, Italy, October 25-29, 2010. ACM 2010, ISBN 978-1-60558-933-6
Plenary -- P1
- Duncan J. Watts:
Using the web to do social science. 1-2 - Mubarak Shah:
Visual crowd surveillance is like hydrodynamics. 3-4
Full - F1/content track/automatic image tagging
- Yi Shen, Jianping Fan:
Leveraging loosely-tagged images and inter-object correlations for tag recommendation. 5-14 - Fei Wu, Yahong Han, Qi Tian, Yueting Zhuang:
Multi-label boosting for image annotation by structural grouping sparsity. 15-24 - Dong Liu, Shuicheng Yan, Yong Rui, Hong-Jiang Zhang:
Unified tag analysis with multi-edge graph. 25-34 - Xiangyu Chen, Yadong Mu, Shuicheng Yan, Tat-Seng Chua:
Efficient large-scale image annotation by probabilistic collaborative multi-label propagation. 35-44
Full - F2/systems track/improving media delivery
- Hassan Shojania, Baochun Li:
Tenor: making coding practical from servers to smartphones. 45-54 - Cong Ly, Cheng-Hsin Hsu, Mohamed Hefeeda:
Improving online gaming quality using detour paths. 55-64 - Jong-Seok Lee, Francesca De Simone, Naeem Ramzan, Zhijie Zhao, Engin Kurutepe, Thomas Sikora, Jörn Ostermann, Ebroul Izquierdo, Touradj Ebrahimi:
Subjective evaluation of scalable video coding for content distribution. 65-72 - Di Niu, Baochun Li, Shuqiao Zhao:
Self-diagnostic peer-assisted video streaming through a learning framework. 73-82
Full - F3/content track/classification of content elements
- Jana Machajdik, Allan Hanbury:
Affective image classification using features inspired by psychology and art theory. 83-92 - Yibiao Zhao, Song Chun Zhu, Siwei Luo:
CO3 for ultra-fast and accurate interactive segmentation. 93-102 - Tianzhu Zhang, Changsheng Xu, Guangyu Zhu, Si Liu, Hanqing Lu:
A generic framework for event detection in various video domains. 103-112 - Xiaobai Liu, Jiashi Feng, Shuicheng Yan, Hai Jin:
Image segmentation with patch-pair density priors. 113-122
Full - F4/applications track/applications of geo-tagging
- Yue Gao, Jinhui Tang, Richang Hong, Qionghai Dai, Tat-Seng Chua, Ramesh C. Jain:
W2Go: a travel guidance system by automatic landmark ranking. 123-132 - Yuki Arase, Xing Xie, Takahiro Hara, Shojiro Nishio:
Mining people's trips from large scale geo-tagged photos. 133-142 - Xin Lu, Changhu Wang, Jiang-Ming Yang, Yanwei Pang, Lei Zhang:
Photo2Trip: generating travel routes from geo-tagged photos for trip planning. 143-152 - Yannis Avrithis, Yannis Kalantidis, Giorgos Tolias, Evaggelos Spyrou:
Retrieving landmark and non-landmark images from community photo collections. 153-162
Full - F5/content track/learning concepts in images
- Shuhui Wang, Shuqiang Jiang, Qingming Huang, Qi Tian:
S3MKL: scalable semi-supervised multiple kernel learning for image data mining. 163-172 - Lijun Zhang, Chun Chen, Jiajun Bu, Zhengguang Chen, Shulong Tan, Xiaofei He:
Discriminative codeword selection for image representation. 173-182 - Linjun Yang, Alan Hanjalic:
Supervised reranking for web image search. 183-192
Full - F6/applications/human-centered multimedia track/user-adapted media access
- Sang Min Yoon, Maximilian Scherer, Tobias Schreck, Arjan Kuijper:
Sketch-based 3D model retrieval using diffusion tensor fields of suggestive contours. 193-200 - Axel Carlier, Vincent Charvillat, Wei Tsang Ooi, Romulus Grigoras, Géraldine Morin:
Crowdsourced automatic zoom and scroll for video retargeting. 201-210 - Che-Hua Yeh, Yuan-Chen Ho, Brian A. Barsky, Ming Ouhyoung:
Personalized photograph ranking and selection system. 211-220
Full - F7/applications/content track/multimodal image and video search
- Yuxin Chen, Nenghai Yu, Bo Luo, Xue-wen Chen:
iLike: integrating visual and textual features for vertical search. 221-230 - Yannis Avrithis, Giorgos Tolias, Yannis Kalantidis:
Feature map hashing: sub-linear indexing of appearance and global geometry. 231-240 - John Adcock, Matthew Cooper, Laurent Denoue, Hamed Pirsiavash, Lawrence A. Rowe:
TalkMiner: a lecture webcast search engine. 241-250 - Nikhil Rasiwasia, José Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert R. G. Lanckriet, Roger Levy, Nuno Vasconcelos:
A new approach to cross-modal multimedia retrieval. 251-260
Full - F8/applications track/assisted authoring of media content
- Yingen Xiong, Kari Pulli:
Color and luminance compensation for mobile panorama construction. 261-270 - Subhabrata Bhattacharya, Rahul Sukthankar, Mubarak Shah:
A framework for photo-quality assessment and enhancement based on visual aesthetics. 271-280 - Yang Yang Xiang, Mohan S. Kankanhalli:
Automated aesthetic enhancement of videos. 281-290 - Bin Cheng, Bingbing Ni, Shuicheng Yan, Qi Tian:
Learning to photograph. 291-300
Full - F9/human-centered multimedia/systems track/enriched and extended media presentation
- Sayumi Sugimoto, Daisuke Noguchi, Yuichi Bannai, Ken-ichi Okada:
Ink jet olfactory display enabling instantaneous switches of scents. 301-310 - Kuan-Wen Chen, Chih-Wei Lin, Mike Y. Chen, Yi-Ping Hung:
e-Fovea: a multi-resolution approach with steerable focus to large-scale and high-resolution monitoring. 311-320 - Wei Song, Dian Tjondronegoro, Tony Shu-Hsien Wang, Michael J. Docherty:
Impact of zooming and enhancing region of interests for optimizing user experience on mobile sports video. 321-330 - Austin Abrams, Robert Pless:
Webcams in context: web interfaces to create live 3D environments. 331-340
Full - F10/human-centered multimedia track/improved interactivity
- Jochen Huber, Jürgen Steimle, Max Mühlhäuser:
Toward more efficient user interfaces for mobile video browsing: an in-depth exploration of the design space. 341-350 - Fernanda Brandi, Julius Kammerl, Eckehard G. Steinbach:
Error-resilient perceptual coding for networked haptic interaction. 351-360 - Chunyuan Liao, Hao Tang, Qiong Liu, Patrick Chiu, Francine Chen:
FACT: fine-grained cross-media interaction with documents via a portable hybrid paper-laptop interface. 361-370 - Philip DeCamp, George Shaw, Rony Kubat, Deb Roy:
An immersive system for browsing and visualizing surveillance video. 371-380
Full - F11/applications/content track/novel aids for music retrieval
- Yi Yu, Michel Crucianu, Vincent Oria, Ernesto Damiani:
Combining multi-probe histogram and order-statistics based LSH for scalable audio content retrieval. 381-390 - Jiajun Bu, Shulong Tan, Chun Chen, Can Wang, Hao Wu, Lijun Zhang, Xiaofei He:
Music recommendation by unified hypergraph: combining social media information and music content. 391-400 - Zhendong Zhao, Xinxi Wang, Qiaoliang Xiang, Andy M. Sarroff, Zhonghua Li, Ye Wang:
Large-scale music tag recommendation with explicit multiple attributes. 401-410 - Michael Kuhn, Roger Wattenhofer, Samuel Welten:
Social audio features for advanced music retrieval interfaces. 411-420
Full - F12/applications/human-centered multimedia track/narrowing the experience gap
- Richang Hong, Meng Wang, Mengdi Xu, Shuicheng Yan, Tat-Seng Chua:
Dynamic captioning: video accessibility enhancement for hearing impairment. 421-430 - Chunxi Liu, Qingming Huang, Shuqiang Jiang, Changsheng Xu:
The third eye: mining the visual cognition across multi-language communities. 431-440 - Aiden R. Doherty, Zhengwei Qiu, Colum Foley, Hyowon Lee, Cathal Gurrin, Alan F. Smeaton:
Green multimedia: informing people of their carbon footprint through two simple sensors. 441-450 - Xintao Hu, Fan Deng, Kaiming Li, Tuo Zhang, Hanbo Chen, Xi Jiang, Jinglei Lv, Dajiang Zhu, Carlos Faraco, Degang Zhang, Arsham Mesbah, Junwei Han, Xian-Sheng Hua, Li Xie, L. Stephen Miller, Lei Guo, Tianming Liu:
Bridging low-level features and high-level semantics via fMRI brain imaging for video classification. 451-460
Full - F13/applications/content/human-centered multimedia track/processing of social media
- Guangyu Zhu, Shuicheng Yan, Yi Ma:
Image tag refinement towards low-rank, content-tag prior and error sparsity. 461-470 - Aixin Sun, Sourav S. Bhowmick:
Quantifying tag representativeness of visual content of social images. 471-480 - Vivek K. Singh, Mingyan Gao, Ramesh C. Jain:
Social pixels: genesis and evaluation. 481-490 - Dong Liu, Xian-Sheng Hua, Meng Wang, Hong-Jiang Zhang:
Image retagging. 491-500
Full - F14/applications/content track/detection of near-duplicate content
- Shiliang Zhang, Qingming Huang, Gang Hua, Shuqiang Jiang, Wen Gao, Qi Tian:
Building contextual visual vocabulary for large-scale image applications. 501-510 - Wengang Zhou, Yijuan Lu, Houqiang Li, Yibing Song, Qi Tian:
Spatial coding for large scale partial-duplicate web image search. 511-520 - Xiangmin Zhou, Lei Chen:
Monitoring near duplicates over video streams. 521-530 - Lifeng Shang, Linjun Yang, Fei Wang, Kwok-Ping Chan, Xian-Sheng Hua:
Real-time large scale near-duplicate web video retrieval. 531-540
Full - F15/applications/human-centered multimedia track/automatic generation of media content
- Prarthana Shrestha, Peter H. N. de With, Hans Weda, Mauro Barbieri, Emile H. L. Aarts:
Automatic mashup generation from multiple-camera concert recordings. 541-550 - Marco Cristani, Anna Pesarin, Carlo Drioli, Vittorio Murino, Antonio Rodà, Michele Grapulin, Nicu Sebe:
Toward an automatically generated soundtrack from low-level cross-modal correlations for automotive scenarios. 551-560 - Pere Obrador, Rodrigo de Oliveira, Nuria Oliver:
Supporting personal photo storytelling for social albums. 561-570 - Mathew Laibowitz, Nan-Wei Gong, Joseph A. Paradiso:
Multimedia content creation using societal-scale ubiquitous camera networks and human-centric wearable sensing. 571-580
Full - F16/systems track/3D video
- Simone Milani, Giancarlo Calvagno:
A cognitive approach for effective coding and transmission of 3D video. 581-590 - Jiazhi Xia, Ying He, Dao Thi Phuong Quynh, Xiaoming Chen, Steven C. H. Hoi:
Modeling 3D facial expressions using geometry videos. 591-600 - Shu Shi, Mahsa Kamali, Klara Nahrstedt, John C. Hart, Roy H. Campbell:
A high-quality low-delay remote rendering system for 3D video. 601-610
Short - S1/applications/human-centered multimedia track
- Richang Hong, Xiaotong Yuan, Mengdi Xu, Meng Wang, Shuicheng Yan, Tat-Seng Chua:
Movie2Comics: a feast of multimedia artwork. 611-614 - Scott A. Carter, John Adcock, John Doherty, Stacy M. Branham:
NudgeCam: toward targeted, higher quality media capture. 615-618 - Kuiyuan Yang, Xian-Sheng Hua, Meng Wang, Hong-Jiang Zhang:
Tagging tags. 619-622 - Ju-Chun Ko, Wei-Han Chen, Meng-Chieh Yu, Han-Hung Lin, Jin-Yao Lin, Szu-Wei Wu, Yi-Yu Chung, I-Ling Hu, Wei-Ting Peng, Shih-Yao Lin, Chia-Han Chang, Pei-Hsuan Chou, King-Jen Chang, Mei-Lan Chang, Sue-Huei Chen, Jin-Shing Chen, Ming-Sui Lee, Mike Y. Chen, Yi-Ping Hung:
i-m-Space: interactive multimedia-enhanced space for rehabilitation of breast cancer patients. 623-626 - Zhonghua Li, Qiaoliang Xiang, Jason Hockman, Jianqing Yang, Yu Yi, Ichiro Fujinaga, Ye Wang:
A music search engine for therapeutic gait training. 627-630 - Minwoo Park, Jiebo Luo, Robert T. Collins, Yanxi Liu:
Beyond GPS: determining the camera viewing direction of a geotagged image. 631-634 - Zhenxing Niu, Qi Tian, Xinbo Gao:
Real-world trajectory extraction for attack pattern analysis in soccer video. 635-638 - Yicheng Song, Juan Cao, Zhineng Chen, Yongdong Zhang, Jintao Li:
Tag transformer. 639-642 - Kar-Han Tan, Dan Gelb, Ramin Samadani, Ian N. Robinson, W. Bruce Culbertson, John G. Apostolopoulos:
Gaze awareness and interaction support in presentations. 643-646 - Hongyuan Cai, Jiang Yu Zheng:
Digesting omni-video along routes for navigation. 647-650 - David M. Chen, Sam S. Tsai, Bernd Girod, Cheng-Hsin Hsu, Kyu-Han Kim, Jatinder Pal Singh:
Building book inventories using smartphones. 651-654 - Clayton Brian Atkins, Nicholas P. Lyons, Xuemei Zhang, Daniel Tretter:
Templated recursive image composition. 655-658 - Subramanian Ramanathan, Jacopo Staiano, Kyriaki Kalimeri, Nicu Sebe, Fabio Pianesi:
Putting the pieces together: multimodal analysis of social attention in meetings. 659-662 - Kori Inkpen, Rajesh Hegde, Sasa Junuzovic, Christopher Brooks, John C. Tang, Zhengyou Zhang:
AIR conferencing: accelerated instant replay for in-meeting multimodal review. 663-666 - Harish Katti, Subramanian Ramanathan, Mohan S. Kankanhalli, Nicu Sebe, Tat-Seng Chua, Kalpathi R. Ramakrishnan:
Making computers look the way we look: exploiting visual attention for image understanding. 667-670 - Yinsheng Zhou, Graham Percival, Xinxi Wang, Ye Wang, Shengdong Zhao:
MOGCLASS: a collaborative system of mobile devices forclassroom music education. 671-674 - Phan Nhat Hai, Van Duc Thong Hoang, Hyoseop Shin:
Adaptive combination of tag and link-based user similarity in flickr. 675-678 - Marco Piovesana, Ying-Jui Chen, Neng-Hao Yu, Hsiang-Tao Wu, Li-Wei Chan, Yi-Ping Hung:
Multi-display map touring with tangible widget. 679-682 - Ryan E. Janzen, Steve Mann:
"Stray": a new multimedia music composition using the andantephone. 683-686 - Yaohua Yu, Zhengjie Liu:
A user study of visual versus sonically-enhanced interfaces for use while walking. 687-690 - Jiayao Hu, Shifeng Chen, Jianzhuang Liu, Xiaoou Tang:
Fast image rearrangement via multi-scale patch copying. 691-694 - Xiong Li, Liwei Wang, Huanxi Liu, Yuncai Liu:
Learning parts-based representation for face transition. 695-698 - Shelley Buchinger, Ewald Hotop, Helmut Hlavacs, Francesca De Simone, Touradj Ebrahimi:
Gesture and touch controlled video player interface for mobile devices. 699-702 - Hamdi Dibeklioglu, Roberto Valenti, Albert Ali Salah, Theo Gevers:
Eyes do not lie: spontaneous versus posed smiles. 703-706
Short - S2/content/systems track
- Chen Liu, Bing Cui, Anthony K. H. Tung:
Integrating web 2.0 resources by wikipedia. 707-710 - Zhipeng Wu, Shuqiang Jiang, Liang Li, Peng Cui, Qingming Huang, Wen Gao:
Vicept: link visual features to concepts for large-scale image understanding. 711-714 - Stefan Siersdorfer, Enrico Minack, Fan Deng, Jonathon S. Hare:
Analyzing and predicting sentiment of images on the social web. 715-718 - Xian Xiao, Changsheng Xu, Jinqiao Wang:
Landmark image classification using 3D point clouds. 719-722 - Xiangyu Wang, Mohan S. Kankanhalli:
Portfolio theory of multimedia fusion. 723-726 - Stevan Rudinac, Martha A. Larson, Alan Hanjalic:
Exploiting noisy visual concept detection to improve spoken content based video retrieval. 727-730 - Nesrine Changuel, Nicholas Mastronarde, Mihaela van der Schaar, Bessem Sayadi, Michel Kieffer:
End-to-end stochastic scheduling of scalable video overtime-varying channels. 731-734 - Hichem Sahbi, Xi Li:
Context dependent SVMs for interconnected image network annotation. 735-738 - Li Weng, Bart Preneel:
A novel video hash algorithm. 739-742 - Wei-Ta Chu, Wen-Long Liu, Jen-Yu Yu:
Age classification for pose variant and occluded faces. 743-746 - Howard Zhou, Tucker Hermans, Asmita V. Karandikar, James M. Rehg:
Movie genre classification via scene categorization. 747-750 - Yang Liu, Feng Zhou, Wei Liu, Fernando De la Torre, Yan Liu:
Unsupervised summarization of rushes videos. 751-754 - Yue Zhang, Nadeem Jamali:
Negotiating multimedia advertising with attention owners. 755-758 - Wen Sun, Yan Lu, Shipeng Li:
ReDi: an interactive virtual display system for ubiquitous devices. 759-762 - Huifeng Shen, Zhaotai Pan, Haicheng Sun, Yan Lu, Shipeng Li:
A proxy-based mobile web browser. 763-766 - Hui Feng, Hefei Ling, Fuhao Zou, WeiQi Yan, Zhengding Lu:
Optimal collusion attack for digital fingerprinting. 767-770 - Toshie Misu, Yasutaka Matsuo, Shinichi Sakaida, Yoshiaki Shishikui:
Novel framework for single/multi-frame super-resolution using sequential Monte Carlo method. 771-774 - Petros Daras, Theodoros Semertzidis, Lambros Makris, Michael G. Strintzis:
Similarity content search in content centric networks. 775-778 - Zhi Li, Ali C. Begen, Xiaoqing Zhu, Bernd Girod:
Accelerated IPTV channel change with transcoded unicast bursting. 779-782 - Srisakul Thakolsri, Wolfgang Kellerer, Eckehard G. Steinbach:
Qoe-based rate adaptation scheme selection for resource-constrained wireless video transmission. 783-786 - Eladio Martin, Oriol Vinyals, Gerald Friedland, Ruzena Bajcsy:
Precise indoor localization using smart phones. 787-790 - Dongbo Huang, Jin Zhao, Xin Wang:
Trading bandwidth for playback lag: can active peers help? 791-794 - Shujie Liu, Chang Wen Chen:
3D video transcoding for virtual views. 795-798 - Espen Jacobsen, Carsten Griwodz, Pål Halvorsen:
Pull-patching: a combination of multicast and adaptive segmented HTTP streaming. 799-802
Short - S3/applications/content track
- Feng Xie, Yi Shen, Xiaofei He:
K-way min-max cut for image clustering and junk images filtering from Google images. 803-806 - Amirali Jazayeri, Hongyuan Cai, Mihran Tuceryan, Jiang Yu Zheng:
Smart video systems in police cars. 807-810 - Po-Nung Tseng, Yen-Liang Lin, Winston H. Hsu:
Interactive inquiry for object of interest in video playback by motion-augmented graph cut. 811-814 - An-Jung Cheng, Fang-Erh Lin, Yin-Hsi Kuo, Winston H. Hsu:
GPS, compass, or camera?: investigating effective mobile sensors for automatic search-based image annotation. 815-818 - Markus Buzeck, Jörg Müller:
TwitterSigns: microblogging on the walls. 819-822 - Natasha Gelfand, Andrew Adams, Sung Hee Park, Kari Pulli:
Multi-exposure imaging on mobile devices. 823-826 - Congcong Li, Alexander C. Loui, Tsuhan Chen:
Towards aesthetics: a photo quality assessment and photo selection system. 827-830