ICME 2004: Taipei, Taiwan
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, ICME 2004, 27-30 June 2004, Taipei, Taiwan. IEEE 2004
Volume 1
Jingliang Peng, C. C. Jay Kuo: Progressive geometry encoder using octree-based space partitioning. 1-4
Qi Tian, Jie Yu, Ting Rui, Thomas S. Huang: Parameterized discriminant analysis for image classification. 5-8
Marco Grangetto, Enrico Magli, Gabriella Olmo: Reliable JPEG 2000 wireless imaging by means of error-correcting MQ coder. 9-12
Shin-Hyoung Kim, Jae-Ho Choi, Hyun-Bin Kim, Jong-Whan Jang: A new snake algorithm for object segmentation in stereo images. 13-16
Hanghang Tong, Mingjing Li, HongJiang Zhang, Changshui Zhang: Blur detection for digital images using wavelet transform. 17-20
Timothy K. Shih, Rong-Chi Chang, Liang-Chen Lu, Huan-Chi Huang: Multi-layer inpainting on Chinese artwork. 21-24
A. C. Yu, Guobin Shen, Bing Zeng, Oscar C. Au: Arbitrarily-shaped video coding: smart padding versus MPEG-4 LPE/zero padding. 25-28
Hung-Chang Chang, Shang-Hong Lai, Kuang-Rong Lu: A robust and efficient video stabilization algorithm. 29-32
Chitra L. Madhwacharyula, Mohan S. Kankanhalli, Philippe Mulhem: Content based editing of semantic video metadata. 33-36
Wook-Hyun Jeong, Young-Suk Yoon, Yo-Sung Ho: Design of robust reversible variable-length codes using the property of free distance. 37-40
Dong Xu, Jianzhuang Liu, Zhengkai Liu, Xiaoou Tang: Indoor shadow detection for video segmentation. 41-44
Maja Pantic, Ioannis Patras: Temporal modeling of facial actions from face profile image sequences. 49-52
Ze-Jing Chuang, Chung-Hsien Wu: Emotion recognition using acoustic features and textual content. 53-56
Chang Hong Lin, Tiehan Lv, Wayne Wolf, Burak Ozer: A peer-to-peer architecture for distributed real-time gesture recognition. 57-60
Kei Igarashi, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura, Hüseyin Abut: Biometric identification using driving behavioral signals. 65-68
Rong Yan, Alexander G. Hauptmann: Multi-class active learning for video semantic feature extraction. 69-72
Milind R. Naphade, John R. Smith: Active learning for simultaneous annotation of multiple binary semantic concepts. 77-80
Feng Jing, Mingjing Li, HongJiang Zhang, Bo Zhang: Entropy-based active learning with support vector machines for content-based image retrieval. 85-88
Qiang Wang, Debin Zhao, Siwei Ma, Yan Lu, Qingming Huang, Wen Gao: Context-based 2D-VLC for video coding. 89-92
Ronggang Wang, Chao Huang, Jintao Li, Yanfei Shen: Sub-pixel motion compensation interpolation filter in AVS. 93-96
Xiangyang Ji, Debin Zhao, Wen Gao, Qingming Huang, Siwei Ma, Yan Lu: New bi-prediction techniques for B pictures coding. 101-104
Zhengguo Li, Nam Ling, Susanto Rahardja, Xiao Lin, Ping Li: An iterative method for hypothetical reference decoder. 105-108
Milind R. Naphade, Apostol Natsev, Ching-Yung Lin, John R. Smith: Multi-granular detection of regional semantic concepts. 109-112
Deng Cai, Xiaofei He, Wei-Ying Ma, Ji-Rong Wen, HongJiang Zhang: Organizing WWW images based on the analysis of page layout and Web link structure. 113-116
DongQing Zhang, Ching-Yung Lin, Shih-Fu Chang, John R. Smith: Semantic video clustering across sources using bipartite spectral clustering. 117-120
Ming-Sui Lee, Mei-Yin Shen, C. C. Jay Kuo: Color matching techniques for video mosaic applications. 121-124
Charalampos Laftsidis, Costas Kotropoulos, Ioannis Pitas: MPEG-4 compliant reproduction of face animation created in Maya. 125-128
Jinhui Chao, Jongdae Kim, N. Atsushi: A new surface model based on a fibre bundle of 1-parameter groups. 129-132
Ishtiaq Rasool Khan, Masahiro Okuda, Shinichi Takahashi: Regular 3D mesh reconstruction based on cylindrical mapping. 133-136
Xiao Yang, Ruby B. Lee: PLX FP: an efficient floating-point instruction set for 3D graphics. 137-140
Masahiro Okuda, Kyoko Nagatomo, Masaaki Ikehara, Shinichi Takahashi: Similarity detection of 3D meshes using 2D hierarchical regular grids. 145-148
Sung Kyu Choi, Jong-Gu Jeon, Woo-Sung Shim, Won-Kap Jang, Victor H. S. Ha: Design and implementation of H.264-based video decoder for digital multimedia broadcasting. 149-152
Zhuo Li, Kin-Man Lam, Lansun Shen: Rate control for MPEG-4 FGS coded video using piecewise rate distortion model. 153-156
A. S. Abraham, Ju Wang, Jonathan C. L. Liu: Bandwidth-aware video encoding with adaptive image scaling. 157-160
Jin-Soo Kim, Jae-Gon Kim, Kyeongok Kang, Jinwoong Kim: A distortion control scheme for allocating constant distortion in FD-CD video transcoder. 161-164
Seonki Kim, Yo-Sung Ho: Rate control algorithm for H.264/AVC video coding standard based on rate-quantization model. 165-168
Ligang Dong, Bharadwaj Veeravalli: Design and analysis of variable bit rate caching strategies for continuous media data. 169-172
Longin Jan Latecki, Tao Jin, Jaiwant Mulik: A two-stream approach for adaptive rate control in multimedia applications. 173-176
Hideaki Kimata, Masaki Kitahara, Kazuto Kamikura, Yoshiyuki Yashima: Hierarchical reference picture selection method for temporal scalability beyond H.264. 181-184
Sammo Cho, Byungjun Bae, Geon Kim, Jinhwan Lee, Young Kwon Hahm, Hyuckjae Lee: Development of a remultiplexer for digital multimedia broadcasting. 185-188
Morihiko Tamai, Tao Sun, Keiichi Yasumoto, Naoki Shibata, Minoru Ito: Energy-aware QoS adaptation for streaming video based on MPEG-7. 189-192
Hye-Joo Lee, Bum Suk Choi, Jong Won Seok, Jin Woo Hong: Design of protection and distribution service model for digital broadcasting content. 193-196
Stefan Elf, Jeremiah Scholl, Peter Parnes: Schemes for user-interest controlled video bandwidth adaptation in a collaborative workspace environment. 197-200
Tomoya Enokido, Makoto Takizawa: QoS-based hybrid concurrency control on distributed multimedia objects. 201-204
Tomoki Yoshihisa, Masahiko Tsukamoto, Shojiro Nishio: A scheduling scheme to enable fast-forward for continuous media data broadcasting. 209-212
Man-Ching Yuen, Weijia Jia, Chi-Chung Cheung: Simple mathematical modeling of efficient path selection for QoS routing in load balancing. 217-220

Satoshi Itaya, Tomoya Enokido, Makoto Takizawa: Atomicity and causality of multimedia messages in group communication. 229-232
Mugen Peng, Wenbo Wang: TDD-CDMA uplink capacity investigation in the background noise floor. 233-236
Dai Boong Lee, Hwangjun Song: Dynamic class selecting mechanism for guaranteed service with minimum cost over relative differentiated-services networks. 237-240
Shu-Ching Chen, Mei-Ling Shyu, Chengjun Zhan, Srinivas Peeta: A novel rate-based hop by hop congestion control algorithm. 245-248
Athina Markopoulou, Eric Setton, M. Kalman, John G. Apostolopoulos: WiSE video: using in-band wireless loss notification to improve rate-controlled video streaming. 249-252
Nualsawat Hiransakolwong, Khanh Vu, Kien A. Hua, Sheau-Dong Lang: Shape recognition based on the medial axis approach. 257-260
Yimin Wu, Aidong Zhang: PatternQuest: learning patterns of interest using relevance feedback in multimedia information retrieval. 261-264
Shu-Ching Chen, Mei-Ling Shyu, Min Chen, Chengcui Zhang: A decision tree-based multimodal data mining framework for soccer goal detection. 265-268
Anthony J. T. Lee, Ruey-Wen Hong, Meng-Fang Chang: An approach to content-based video retrieval. 273-276
Masahito Kumano, Yasuo Ariki, Kiyoshi Tsukada, S. Hamaguchi, Hajime Kiyose: Automatic extraction of PC scenes based on feature mining for a real time delivery system of baseball highlight scenes. 277-280
Timothy K. Shih, Ching-Sheng Wang, Yuan-Kai Chiu, Yi-Tsou Hsin, Chun-Hong Huang: On automatic actions retrieval of martial arts. 281-284
Andy Chang, Peter H. W. Wong, Yick Ming Yeung, Oscar C. Au: Fast integer motion estimation for H.264 video coding standard. 289-292
Yu-lung Lo, Wen-lin Li: Linear time for discovering non-trivial repeating patterns in music databases. 293-296
Marcel Worring, Giang P. Nguyen, Laura Hollink, Jan van Gemert, Dennis Koelma: Accessing video archives using interactive search. 297-300

Akihiro Kuwabara, Kazutoshi Sumiya, Katsumi Tanaka: Query relaxation and answer integration for cross-media meta-searches. 309-312
Ming-Yang Wu, Yao-Cyuan Wu, Chih-Yi Chiu, Shih-Pin Chao, Shi-Nine Yang: HUMOR: a HUman MOtion Retrieval system with multi-modal queries. 315-316
Yeong-Yuh Xu, Y. H. Chen, C. L. Tseng, Por-Shen Lai, Hsin-Chia Fu: Multimedia TV news browsing system. 317-318
Yi-Chin Huang, Meng-Jyi Shieh, Chien-Feng Huang, Ching-Che Kao, Shu-Min Yang, Wen-Chin Chen: A visual MPEG-4 scene editor. 319-320
A. Raman, M. Jain, T. C. Rajendra, S. Satheesh, S. Sethuraman, V. K. Jain, V. P. Das: Low-cost wireless projector interface device using TI TMS320DM270. 321-322
Flora Chia-I Chang, Wen-Chih Chang, Hsuan-Che Yang, Timothy K. Shih, Huan-Chao Keh: Courseware development using influence diagram with SCORM compatibility. 323-324
Timothy K. Shih, Nigel H. Lin, Hsuan-Pu Chang, Kuan-Hao Huang: Adaptive pocket SCORM reader. 325-326
Xu Huang, Allan C. Madoc, M. Wagner: Noises removal for images by wavelet-based Bayesian estimator via Levy process analysis. 327-330
Tiehan Lv, Burak Ozer, Wayne Wolf: A real-time background subtraction method with camera motion compensation. 331-334
Kei Kawamura, Hiroshi Watanabe, Hideyoshi Tominaga: Vector representation of binary images containing halftone dots. 335-338
Dirk Farin, Peter H. N. de With, Wolfgang Effelsberg: Video-object segmentation using multi-sprite background subtraction. 343-346
Ming Qu, Frank Y. Shih, Ju Jing, Haimin Wang: Solar flare tracking using image processing techniques. 347-350
Xing Yi, Changshui Zhang, Jingdong Wang: Multi-view EM algorithm and its application to color image segmentation. 351-354
Abdolah Chalechale, Golshah Naghdy, Prashan Premaratne, H. Moghaddasi: Chain-based extraction of line segments to describe images. 355-358
Rajas A. Sambhare, Yu Hen Hu: Content-based image post-processing for blurring artifact reduction. 359-362
Lanlan Chang, Yap-Peng Tan: Combined use of spatial and spectral correlations for enhanced color filter array demosaicking. 363-366
Yik-Hing Fung, Yuk-Hee Chan: Restoring halftoned color-quantized images with simulated annealing. 367-370
Ahmed Swilem, Kousuke Imamura, Hideo Hashimoto: A high-speed codebook design algorithm for ECVQ using angular constraint with search space partitioning. 371-374
Hang Nguyen, Pierre Duhamel, Jérôme Brouet, Denis Rouffet: Robust VLC sequence decoding exploiting additional video stream properties with reduced complexity. 375-378
Feng Pan, Zhengguo Li, Keng Pang Lim, Xiao Lin, Susanto Rahardja, Dajun Wu, Si Wu: Complexity adaptive quantization for intra-frames in very low bit rate video coding. 379-382
Seong Hwan Jang, Nikil Jayant: An efficient bit allocation algorithm in dependent coding framework and one-way video applications. 383-386
Junqiang Lan, Xinhua Zhuang, Wenjun Zeng: Single-pass frame-level constant distortion bit allocation for smooth video quality. 387-390
Maja Pantic, Léon J. M. Rothkrantz: Case-based reasoning for user-profiled recognition of emotions from face images. 391-394
Yu Zhang, Terence Sim, Chew Lim Tan: Reanimating real humans: automatic reconstruction of animated faces from range data. 395-398
Jingrong Jia, Lijun Yin, Joseph P. Morrissey: On the importance of skin color for "other-race" effect. 399-402
Mohamed Hammami, Dzmitry V. Tsishkou, Liming Chen: Adult content Web filtering and face detection using data-mining based kin-color model. 403-406
Takahiro Ueoka, Tatsuyuki Kawamura, Yasuyuki Kono, Masatsugu Kidode: Functional evaluation of a vision-based object remembrance support system. 407-410
Bongsoo Jung, Byeungwoo Jeon, Myung Don Kim, BongSue Suh, Song In Choi: Selective temporal error concealment algorithm for H.264/AVC. 411-414
Peng Zhang, Debin Zhao, Siwei Ma, Yan Lu, Wen Gao: Multiple modes intra-prediction in intra coding. 419-422
Yanfei Shen, Dongming Zhang, Chao Huang, Jintao Li: Adaptive weighted prediction in video coding. 427-430
Mahalingam Ramkumar, Nasir D. Memon: A system for digital rights management using key predistribution. 431-434
Ching-Yung Lin, Belle L. Tseng: Semantic multimedia authentication with model vector signature. 435-438
Hong Heather Yu: On content protection for mobile consumer multimedia applications. 439-442
Bin B. Zhu, Min Feng, Shipeng Li: An efficient key scheme for layered access control of MPEG-4 FGS video. 443-446
Kwang Yong Kim, Jin Woo Hong, Chul Min Park, Hoe Kyung Jung: The architecture of MPEG-4 based IPMP authoring system. 447-450
Mei-Ling Shyu, Shu-Ching Chen, C. Ranasingha: Router active queue management for both multimedia and best-effort traffic flows. 451-454
Michael Harville, Ramin Samadani, Daniel Tretter, Debargha Mukherjee, Ullas Gargi, N. Chang: Mediabeads: an architecture for path-enhanced media applications. 455-458
Chun-Ming Huang, Kai-Chao Yang, Jia-Shung Wang: Support fast scan operations with video streaming technology. 463-466
Tai-Lun Chang, Ying-Ming Tsai, Chih-Da Chien, Chien-Chang Lin, Jiun-In Guo: A high-performance MPEG4 bitstream processing core. 467-470
Qi Zhang, Qing Li, Yunyang Dai, C. C. Jay Kuo: Reducing memory bank conflict for embedded multimedia systems. 471-474
Chu-Chuan Lee, Pao-Chi Chang: Deterministic traffic regulation with decoder buffer constraints for streaming videos. 475-478
Min-You Wu, Yan Zhu, Wei Shu: Optimal multicast overlay placement for realtime streaming media. 479-482
Kitae Nahm, C. C. Jay Kuo: Design and performance evaluation of TCP-friendly thin-layered video multicast scheme. 487-490
Xiaorong Li, Bharadwaj Veeravalli: Performance evaluation of a destination-based video distribution strategy for reservation-based multimedia systems. 491-494
Chao-Tung Yang, Ko-Tzu Wang: A VOD system on high-availability and load balancing Linux servers. 499-502
Shoaib Khan, Rüdiger Schollmeier, Eckehard G. Steinbach: A performance comparison of multiple description video streaming in peer-to-peer and content delivery networks. 503-506
Xin Yan, Kongwah Wan, Qi Tian, Mun-Kew Leong, Ping Xiao: Two dimensional timeline and its application to Conversant Media system. 507-510
Hong-Kwai Lam, Oscar C. Au, Chi-Wah Wong: Fast motion vector re-estimation for arbitrary video downsizing using spatial-variant filter. 515-518
Xiao-Dong Yu, Ping Xue, Qi Tian: A statistical approach for object motion estimation with MPEG motion vectors. 519-522
Chieh-Ling Huang, E-Liang Chen, Pau-Choo Chung, Yuh-Ren Choo: Optical flow back-projection for genuine motion vector estimation. 523-526
Chi-Geun Lee, Ho Geun Lee, Hyun-Jin Shim, Sung Tae Jung, Sang-Seol Lee: A 4-way pipelined processing architecture for three step search block-matching motion estimation. 527-530
Peng Yang, Yuwen He, Shiqiang Yang: An unsymmetrical-cross multi-resolution motion search algorithm for MPEG4-AVC/H.264 coding. 531-534
ShouWen Lai, Li Fen: Optimized DCT domain motion vector estimation in frame skipped transcoding. 535-538
G. Martinez: Improving the speed of convergence of a maximum-likelihood motion estimation algorithm of a human face. 539-542
Yongfang Liang, Ishfaq Ahmad, V. Swaminathan: Fast priority search algorithm for block motion estimation. 543-546
Eun-Young Elaine Kang, Isaac Cohen, Gérard G. Medioni: A layer extraction system based on dominant motion estimation and global registration. 551-554
Si Wu, Yong-Dong Zhang, Shouxun Lin: An automatic segmentation algorithm for moving objects in video sequences under multi-constraints. 555-558
Jacob Augustine, Shivarama Rao, Norman P. Jouppi, Subu Iyer: Region of interest editing of MPEG-2 video streams in the compressed domain. 559-562
Edmundo Saez, José I. Benavides, Nicolas Guil: Reliable real time scene change detection in MPEG compressed video. 567-570
Wing-San Chau, Oscar C. Au, Tak-Song Chong: Key frame selection by macroblock type and motion vector analysis. 575-578
Dian Tjondronegoro, Yi-Ping Phoebe Chen, Binh Pham: Classification of self-consumable highlights for soccer video summaries. 579-582
Kuniaki Uehara, Miki Amano, Yasuo Ariki, Masahito Kumano: Video shooting navigation system by real-time useful shot discrimination based on video grammar. 583-586
Andrew Vassiliou, Andrew Salway, D. Pitt: Formalising stories: sequences of events and state changes. 587-590
Kongwah Wan, Changsheng Xu: Robust soccer highlight generation with a novel dominant-speech feature extractor. 591-594
Jinjun Wang, Changsheng Xu, Chng Eng Siong, Qi Tian: Sports highlight detection from keyword sequences using HMM. 599-602
Dawei Ding, Jun Yang, Qing Li, Liping Wang, Wenyin Liu: Automatic detection of Flash movie genre using Bayesian approach. 603-606
Shun-Chuan Chen: Active learning for story segmentation of spoken documents. 607-610
Regunathan Radhakrishnan, Ziyou Xiong, Ajay Divakaran, T. Kan: Time series analysis and segmentation using eigenvectors for mining semantic audio label sequences. 611-614
Chuan-Wang Chang, Hewijin Christine Jiau: An improved music representation method by using harmonic-based chord decision algorithm. 615-618
Soo-Chang Pei, Yu-Ting Chuang: Automatic text detection using multi-layer color quantization in complex color images. 619-622
Hongli Luo, Mei-Ling Shyu, Shu-Ching Chen: An end-to-end video transmission framework with efficient bandwidth utilization. 623-626
Hai Gao, Xiao-Dong Yu, Lei Wang, Ping Xue, Qi Tian: Robust multi-level video representation using mean shift analysis. 627-630
Peter Quax, Tom Jehaes, Chris Flerackers, Wim Lamotte: Scalable transmission of avatar video streams in virtual environments. 631-634
Jhing-Fa Wang, Han-Jen Hsu, Hong-Ming Wang: Constrained texture synthesis by scalable sub-patch algorithm. 635-638
Yao-Chang Huang, Shyh-Kang Jenor: An audio recommendation system based on audio signature description scheme in MPEG-7 Audio. 639-642
Ying Cai, Zhan Chen, Wallapak Tavanapong, Johnny Wong: Providing scalable on-demand video services for heterogeneous receivers. 643-646
Shigeyuki Sakazawa, Yasuhiro Takishima, Yasuyuki Nakajima, Masahiro Wada, Kazuo Hashimoto: Multimedia contents management and transmission system "VAST-web" and its effective transport protocol "SVFTP". 647-650
Wen-Nung Lie, Cheng-Hsiung Tseng, Tom C.-I. Lin: Constant-quality rate allocation for spectral fine granular scalable (SFGS) video coding by using dynamic programming approach. 655-658
Chih-Hung Kuo, Mei-Yin Shen, C. C. Jay Kuo: Fast inter-prediction mode decision and motion search for H.264. 663-666
Chun-Jen Tsai, Chih-Wei Tang, Ching-Ho Chen, Ya-Hui Yu: Adaptive rate-distortion optimization using perceptual hints. 667-670
Shu-Chuan Chu, John F. Roddick, Zhe-Ming Lu, Jeng-Shyang Pan: Hadamard transform based equal-average equal-variance equal-norm nearest neighbor codeword search algorithm. 671-674

Zhiming Zhang, JeongHoon Park, Yongje Kim: A novel deblocking algorithm using edge flow-directed filter and curvelet transform. 683-686
Susu Yao, Weisi Lin, Zhongkang Lu, Ee Ping Ong, Xiaokang Yang: Adaptive nonlinear diffusion processes for ringing artifacts removal on JPEG 2000 images. 691-694
Yongfang Liang, Ishfaq Ahmad, Jiancong Luo, Yu Sun: Fast motion estimation using hierarchical motion intensity structure. 699-702
Yeping Su, Ming-Ting Sun: A non-iterative motion vector based global motion estimation algorithm. 703-706
Shengqi Yang, Wayne Wolf, Narayanan Vijaykrishnan: Search speed and power driven integrated software and hardware optimizations for motion estimation algorithms. 707-710
Hong-guang Zhang, Shi-bao Zheng: An extensible digital television middleware architecture based on hardware abstraction layer. 711-714
Yi-Chin Huang, Tu-Chun Yin, Kou-Shin Yang, Yan-Jun Chang, Meng-Jyi Shieh, Wen-Chin Chen: Design and implementation of an efficient MPEG-4 interactive terminal on embedded devices. 715-718
Wei Niu, Jiao Long, Dan Han, Yuan-Fang Wang: Human activity detection and recognition for video surveillance. 719-722
Qiang Zhu, Kwang-Ting Cheng, HongJiang Zhang: SSD tracking using dynamic template and log-polar transformation. 723-726
Eiji Kasutani, Ryoma Oami, Akio Yamada, I. Sato, K. Hirata: Video material archive system for efficient video editing based on media identification. 727-730
Chun-Shien Lu, Chao-Yong Hsu, Shih-Wei Sun, Pao-Chi Chang: Robust mesh-based hashing for copy detection and tracing of images. 731-734
Eloi Batlle, Jaume Masip, Enric Guaus, Pedro Cano: Scalability issues in an HMM-based audio fingerprinting. 735-738
Rosa Lancini, Francesco Mapelli, R. Pezzano: Audio content identification by using perceptual hashing. 739-742
Zixiang Yang, Wei Tsang Ooi, Qibin Sun: Hierarchical, non-uniform locality sensitive hashing and its application to video identification. 743-746
Yoshinari Kameda, Takayoshi Koyama, Yasuhiro Mukaigawa, Fumito Yoshikawa, Yuichi Ohta: Free viewpoint browsing of live soccer games. 747-750
Volume 2
Mika Rautiainen, Timo Ojala, Tapio Seppänen: Cluster-temporal browsing of large news video databases. 751-754
Björn Schuller, Gerhard Rigoll, Manfred K. Lang: Multimodal music retrieval for large databases. 755-758
Giang P. Nguyen, Marcel Worring: Optimizing similarity based visualization in content based image retrieval. 759-762
Roberto Lopez-Gulliver, Norihiro Hagita, Masami Suzuki, T. Satoh, Hiroko Tochigi: SenseWeb: a multi-user environment for browsing images from the Internet. 763-766
M. C. S. Paterno, Fun Siong Lim, Wee Kheng Leow: Fuzzy semantic labeling for image retrieval. 767-770
Jürgen Assfalg, Gianpaolo D'Amico, Alberto Del Bimbo, Pietro Pala: 3D content-based retrieval with spin images. 771-774
Chengcui Zhang, Shu-Ching Chen, Mei-Ling Shyu: Multiple object retrieval for image databases using multiple instance learning and relevance feedback. 775-778
Limin Ma, Qiang Zhou, David M. Chelberg, Mehmet Celenk: Shape-based image retrieval with relevance feedback. 779-782

Stefano Berretti, Gianpaolo D'Amico, Alberto Del Bimbo: Shape representation by spatial partitioning for content based retrieval applications. 791-794
Ramazan Savas Aygün, Aidong Zhang: Sprite pyramid for videos and images having finite-depth scenes. 795-798
Frederik De Keukelaere, Saar De Zutter, Rik Van de Walle: Adding functionality to multimedia content in an MPEG-21 scenario. 799-802
David Atienza, Marc Leeman, Francky Catthoor, Geert Deconinck, Jose Manuel Mendias, Vincenzo De Florio, Rudy Lauwereins: Fast prototyping and refinement of complex dynamic data types in multimedia applications for consumer embedded devices. 803-806
Hui Shen, Mohan S. Kankanhalli, S. H. Srinivasan, Wei-Qi Yan: Mosaic based view enlargement for moving objects in moving pictures. 807-810
Jinguk Jeong, Jongho Nang: An efficient bitmap indexing method for similarity search in high dimensional multimedia databases. 815-818
Marc Rovira, Jordi Gonzàlez, Alejandro López, Jordi Mas, Albert Puig, Jordi Fabregat, Gabriel Fernandez: IndexTV: a MPEG-7 based personalized recommendation system for digital TV. 823-826
Cheng-Yu Wei, Nevenka Dimitrova, Shih-Fu Chang: Color-mood analysis of films based on syntactic and psychological models. 831-834
Liang-Chen Lu, Jun Ohya: Computer vision based analysis of the botanical tree's dynamical behaviors for the reproduction in virtual space. 839-842
Yisong Chen, Horace Ho-Shing Ip: Simulating vivid 3D solid textures from 2D growable patterns. 843-846
Tsai-Yen Li, Mao-Yung Liao, Chun-Feng Liao: An extensible scripting language for interactive animation in a speech-enabled virtual environment. 851-854
Chien-Chang Ho, Yan-Hong Lu, Hung-Te Lin, Shuen-Huei Guan, Sheng-Yao Cho, Rung-Huei Liang, Bing-Yu Chen, Ming Ouhyoung: Feature refinement strategy for extended marching cubes: Handling on dynamic nature of real-time sculpting application. 855-858
Chin-Chen Chang: Novel hierarchical approach for radiosity. 863-866
G. Moschos, Nikos Nikolaidis, Ioannis Pitas: Anatomically-based 3D face and oral cavity model for creating virtual medical patients. 867-870
Ki-Ryong Kwon, Bong-Ju Jang, Eung-Joo Lee, Young Huh: Copyright protection of architectural CAD drawing using the multiple watermarking scheme. 871-874
Yu Hu, Qing Li, C. C. Jay Kuo: Efficient implementation of elliptic curve cryptography (ECC) on VLIW-micro-architecture media processor. 879-882
Ming Jiang, Xiaolin Wu, Edward K. Wong, Nasir D. Memon: Steganalysis of boundary-based steganography using autoregressive model of digital boundaries. 883-886
Abdolah Chalechale, Golshah Naghdy, Prashan Premaratne, Alfred Mertins: Document image analysis and verification using cursive signature. 887-890
Takeyuki Uehara, Reihaneh Safavi-Naini, Philip Ogunbona: An MPEG tolerant authentication system for video data. 891-894

Pei-Ming Huang, Da-Chun Wu, Wen-Hsiang Tsai: A novel block-based authentication technique for binary images by block pixel rearrangements. 903-906
Guo-Shiang Lin, Chia H. Yeh, C. C. Jay Kuo: Data hiding domain classification for blind image steganalysis. 907-910
Ju Wang, A. R. Steele, Jonathan C. L. Liu: Efficient integration of watermarking with MPEG compression. 911-914
Zhishou Zhang, Gang Qiu, Qibin Sun, Xiao Lin, Zhicheng Ni, Yun Q. Shi: A unified authentication framework for JPEG2000. 915-918
Feng-Hsing Wang, Lakhmi C. Jain, Jeng-Shyang Pan: Design of hierarchical keys for a multi-user-based watermarking system. 919-922

Jinhai Wu, Bin B. Zhu, Shipeng Li, Fuzong Lin: Efficient oracle attacks on Yeung-Mintzer and variant authentication schemes. 931-934
Chu-Hsing Lin, Yi-Yi Lai: A fingerprint-based user authentication scheme for multimedia systems. 935-938

Chaur-Chin Chen: RSA scheme with MRF and ECC for data encryption. 947-950
Chang-Lung Tsai, Kuo-Chin Fan, Char-Dir Chung, Thomas C. Chuang: Data hiding of binary images using pair-wise logical computation mechanism. 951-954
Huijuan Yang, Alex C. Kot: Text document authentication by integrating inter character and word spaces watermarking. 955-958
Phen-Lan Lin, Chung-Kai Hsieh, Po-Whei Huang: Hierarchical watermarking scheme for image authentication and recovery. 963-966
Alessandra Lumini, Dario Maio: Adaptive positioning of a visible watermark in a digital image. 967-970
Pang-Chieh Wang, Ting-Wei Hou: An AV object oriented encryption algorithm for MPEG-4 streams. 971-974
Hong-Kwai Lam, Oscar C. Au, Chi-Wah Wong: Automatic white balancing using adjacent channels adjustment in RGB domain. 979-982
N. Tashev: Gain self-calibration procedure for microphone arrays. 983-986
Michael N. Wallick, Yong Rui, Li-wei He: A portable solution for automatic lecture room camera management. 987-990
Junwei Han, King N. Ngan, Mingjing Li, HongJiang Zhang: Learning semantic concepts from user feedback log for image retrieval. 995-998
Chia-Chen Kuo, Ming-Syan Chen: DDS: an efficient dynamic dimension selection algorithm for nearest neighbor search in high dimensions. 999-1002
Y. Wu, Belle L. Tseng, John R. Smith: Ontology-based multi-classification learning for video concept detection. 1003-1006
Chuan-Yu Chang: A contextual-based Hopfield neural network for medical image edge detection. 1011-1014
Koji Zettsu, Yutaka Kidawara, Katsumi Tanaka: Discovering aspect-based correlation of Web contents for cross-media information retrieval. 1015-1018
Yelizaveta Marchenko, Tat-Seng Chua, A. Irina, Ramesh Jain: Representation and retrieval of paintings based on art history concepts. 1023-1026
Chuan-Yu Cho, Ya-Ting Chuang, Pei-Chi Chu, Shih-Yu Huang, Jia-Shung Wang: Efficient motion-vector-based video search using query by clip. 1027-1030
Hong-Ru Lee, Jyh-Shing Roger Jang: i-Ring: a system for humming transcription and chord generation. 1031-1034
Xiaoping Hu, Ümit Y. Ogras, Nicholas H. Zamora, Radu Marculescu: Data partitioning techniques for pervasive multimedia platforms. 1035-1038
Aravindan Raghuveer, Ewa Kusmierek, David Hung-Chang Du: Network-aware rate adaptation for video streaming. 1039-1042
Chen-Lung Chan, Shih-Yu Huang, Mato Jan, Jia-Shung Wang: Peer-to-peer video delivery scheme for large scale video-on-demand applications. 1043-1046
Dmitri Jarnikov, Peter van der Stok, Clemens C. Wüst: Predictive control of video quality under fluctuating bandwidth conditions. 1051-1054
Minqiang Jiang, Xiaoquan Yi, Nam Ling: Frame layer bit allocation scheme for constant quality video. 1055-1058
Ewa Kusmierek, David Hung-Chang Du: Optimizing periodic broadcast resource requirements with proxy. 1059-1062
Vu-Thanh Nguyen, Ee-Chien Chang, Wei Tsang Ooi: Layered coding with good allocation outperforms multiple description coding over multiple paths. 1067-1070

Yusuo Hu, Xing Xie, Zonghai Chen, Wei-Ying Ma: Attention model based progressive image transmission. 1079-1082
Sin-Ming Cheung, Yuk-Hee Chan: A technique for lossy compression of error-diffused halftones. 1083-1086
Winston H. Hsu, Shih-Fu Chang: Generative, discriminative, and ensemble learning on multi-modal perceptual fusion toward news video story segmentation. 1091-1094
Lekha Chaisorn, Tat-Seng Chua, Chin-Hui Lee, Qi Tian: A hierarchical approach to story segmentation of large broadcast news video corpus. 1095-1098
Shih-Hung Lee, Chia H. Yeh, C. C. Jay Kuo: Video skimming based on story units via general tempo analysis. 1099-1102
Cees Snoek, Marcel Worring, Alexander G. Hauptmann: Detection of TV news monologues by style analysis. 1103-1106
Noboru Babaguchi, T. Ishida, Keisuke Morisawa: Scene retrieval with sign sequence matching based on video and audio features. 1107-1110
Xuan Jing, Lap-Pui Chau: An efficient inter mode decision approach for H.264 video coding. 1111-1114
Wenbin Jiang, Manli Zhou: A fast BMA based on combining search candidate subsampling and APDS. 1115-1118
Zhibin Pan, Koji Kotani, Tadahiro Ohmi: An improved fast encoding method for vector quantization based on memory-efficient data structure. 1119-1122
Tien-Ying Kuo, Yang Liang, Chin-Cheng Chu: Variable frame skipping scheme based on estimated quality of non-coded frames at decoder for real-time block-based video coding. 1127-1130
Hao-Song Kong, Yao Nie, Anthony Vetro, Huifang Sun, Kenneth E. Barner: Coding artifacts reduction using edge map guided adaptive and fuzzy filtering. 1135-1138
Yan-Chen Lu, Chun-Fu Shen, Chi-Kuang Chen: A novel hardware accelerator architecture for MPEG-2/4 AAC encoder. 1139-1142
Chih-Ming Chen, Yung-Chang Chen, Chien-Min Chen: Multiple description motion compensation video coding for MPEG-4 FGS over lossy packet networks. 1143-1146
Feng Pan, Xiao Lin, Susanto Rahardja, Keng Pang Lim, Zhengguo Li: A directional field based fast intra mode decision algorithm for H.264 video coding. 1147-1150
Yong-Dong Zhang, Feng Dai, Shouxun Lin: Fast 4*4 intra-prediction mode selection for H.264. 1151-1154
Lih-Jen Kau, Yuan-Pei Lin: Lossless image coding using a switching predictor with run-length encodings. 1155-1158
F. A. R. Nascimento, F. J. Fraga: New methods for improvement of sinusoidal transform vocoders. 1159-1162
Xueming Li, Fang Wei: An improved practical efficient implementation of ICT used in H.264. 1163-1166
Daw-Tung Lin, Chen-Ming Yang: Real-time eye detection using face-circle fitting and dark-pixel filtering. 1167-1170
Fatih Murat Porikli: Learning object trajectory patterns by spectral clustering. 1171-1174
Junxian Wang, How-Lung Eng, Alvin Harvey Kam, Wei-Yun Yau: Integrating color and motion to enhance human detection within aquatic environment. 1179-1182
Ben Yip, W. Y. Siu, Jesse S. Jin: Pose determination of human head using one feature point based on head movement. 1183-1186
Chin-Chen Chang, Chung-Mou Pengwu: Gesture recognition approach for sign language using curvature scale space and hidden Markov model. 1187-1190
Ming-yu Chen, Alexander G. Hauptmann: Towards robust face recognition from multiple views. 1191-1194
Jiatao Song, Jilin Liu, Zheru Chi, Wei Wang: Locatization of human eyes based on a series of binary images. 1199-1202
Mark Chan, Chia-Yen Chen, Gareth Barton, Patrice Delmas, Georgy L. Gimel'farb, Philippe Leclercq, T. Fischer: Evaluation of 3D face analysis and synthesis techniques. 1203-1206
Hwangjun Song, Sun Jae Chung, Kyoungwon Min, Hyeok Koo Jung: Online face recognition system through the Internet. 1207-1210
P. Jincahitra: Polyphonic instrument identification using independent subspace analysis. 1211-1214
Björn Schuller, Gerhard Rigoll, Manfred K. Lang: Emotion recognition in the manual interaction with graphical user interfaces. 1215-1218
Hadi Seyedarabi, Ali Aghagolzadeh, Sohrab Khanmohammadi: Recognition of six basic facial expressions by feature-points tracking using RBF neural network and fuzzy inference system. 1219-1222
Huicheng Zheng, Hongmei Liu, Mohamed Daoudi: Blocking objectionable images: adult images and harmful symbols. 1223-1226
S. H. Srinivasan: Local earth mover's distance and face warping. 1227-1230
YeSun Joung, Kyuheon Kim, Jinwong Kim: Interactive broadcasting contents authoring and searching system. 1231-1234
Chun-Chuan Yang, Yung-Chi Wang, Chen-Kuei Chu: Reuse of SMI 2.0 scripts in dividable dynamic timeline-based authoring. 1235-1238
Alexander P. Vazhenin, Ying-Hong Wang, Dmitry A. Vazhenin: Web-based platform for multimedia programming. 1239-1242
Chun-Chuan Yang, Chen-Kuei Chu, Yung-Chi Wang: Dividable dynamic timeline-based authoring for SMI 2.0 presentations. 1243-1246
Hsiau Wen Lin, Timothy K. Shih, Wen-Chih Chang, Chao-Hsun Yang, Chun-Chia Wang: A Petri nets-based approach to modeling SCORM sequence. 1247-1250
Chao-ming Teng, Hao-Hua Chu, Chon-in Wu: mProducer: authoring multimedia personal experiences on mobile phones. 1251-1254
Özcan Öksüz, Ugur Güdükbay, E. Cetin: Computer vision based text and equation editor for LATEX. 1255-1258
D.-Y. Huang, X. Lin, R. Yu: Sensitivity analysis of a cascade RLS-LMS algorithm for different resolution audio signals. 1263-1266
Pinar Duygulu, Ming-yu Chen, Alexander G. Hauptmann: Comparison and combination of two novel commercial detection methods. 1267-1270
Yu-Long Qiao, Jeng-Shyang Pan, Sheng-He Sun: Improved partial distance search for k nearest-neighbor classification. 1275-1278
Ju Cheng Yang, Dong-Sun Park: Detecting region-of-interest (ROI) in digital mammogram by using morphological bandpass filter. 1279-1282
H. Leung: Analysis of traditional Chinese seals and synthesis of personalized seals. 1283-1286


Anicet Kouomou Choupo, Laure Berti-Equille, Annie Morin: Multimedia indexing and retrieval with features association rules mining. 1299-1302
Benjamin Bustos, Daniel A. Keim, Dietmar Saupe, Tobias Schreck, Dejan V. Vranic: Using entropy impurity for improved 3D object similarity search. 1303-1306
Sheng Gao, Chin-Hui Lee, Yongwei Zhu: An unsupervised learning approach to musical event detection. 1307-1310
Wei Lai, Xiaodong Gu, Ren-Hua Wang, Wei-Ying Ma, HongJiang Zhang: A content-based bit allocation model for video streaming. 1315-1318
Eric Setton, A. Shionozaki, Bernd Girod: Real-time streaming of prestored multiple description video with restart. 1323-1326
Bo Xie, Wenjun Zeng: Rate-distortion optimized dynamic bitstream switching for scalable video streaming. 1327-1330
Yiu-Pong Lai, Man-Chun Hui, Chi-Wah Kok, Man-Hung Siu: Speech recognition enhancement by psychoacoustic modeled noise suppression. 1335-1338
S. H. Srinivasan: Characterizing music dynamics for improvisation. 1339-1342
Chang Huai You, Soo Ngee Koh, Susanto Rahardja: Kalman filtering speech enhancement incorporating masking properties for mobile communication in a car environment. 1343-1346
Namunu Chinthaka Maddage, Kongwah Wan, Changsheng Xu, Ye Wang: Singing voice detection using twice-iterated composite Fourier transform. 1347-1350
Hadi Harb, Liming Chen, J.-Y. Auloge: Mixture of experts for audio classification: an application to male female classification and musical genre recognition. 1351-1354
Li-Wei Kang, Jin-Jang Leou: A hybrid error concealment scheme for MPEG-2 video transmission based on best neighborhood matching algorithm. 1355-1358
Tak-Song Chong, Oscar C. Au, Wing-San Chau, Tai-Wai Chan: Temporal error concealment for video transmission. 1363-1366
Wei Tu, Eckehard G. Steinbach: Proxy-based error tracking for H.264 based real-time video transmission in mobile environments. 1367-1370
Bo Yan, Kam-Wing Ng: A burst-error concealment algorithm with selective spatial interpolation for visual communications over noisy channels. 1371-1374
Tao Wu, Sadhna Ahuja, Sudhir S. Dixit: Efficient mobile content delivery by exploiting user interest correlation. 1375-1378
Wei Wei, Avideh Zakhor: Robust multipath source routing protocol (RMPSR) for video communication over wireless ad hoc networks. 1379-1382
Bo Shen, Zhichen Xu, Susie J. Wee, John G. Apostolopoulos: Semantic-enhanced distribution & adaptation networks. 1383-1386
Jacob Chakareski, John G. Apostolopoulos, Susie J. Wee, Wai-tian Tan, Bernd Girod: R-D hint tracks for low-complexity R-D optimized video streaming. 1387-1390
Edward Y. Chang, Yuan-Fang Wang, I-Jeng Wang: Toward building a robust and intelligent video surveillance system: a case study. 1391-1394
Hideo Saito, Naho Inamoto, Sachiko Iwase: Sports scene analysis and visualization from multiple-view video. 1395-1398
Marco Bertini, Alberto Del Bimbo, Walter Nunziati: Common visual cues for sports highlights detection. 1399-1402
Jonathan H. Connell, Andrew W. Senior, Arun Hampapur, Ying-li Tian, Lisa M. G. Brown, Sharath Pankanti: Detection and tracking in the IBM PeopleVision system. 1403-1406

Shwu-Huey Yen, Mei-Fen Chen, Hwei-Jen Lin, Chia-Jen Wang, Chiu-Hsiang Liu: The extraction of characters on dated color postcards. 1415-1418
Xing Qin, Xiaolang Yan, Chong-Peng Yang, Yang Ye: Tiling artifact reduction for JPEG2000 image at low bit-rate. 1419-1422
Jiancong Luo, Ishfaq Ahmad, Yongfang Liang, Yu Sun: Motion estimation for content adaptive video compression. 1427-1430
Grzegorz Pastuszak: A high-performance architecture of arithmetic coder in JPEG2000. 1431-1434
Hiroaki Etou, Yoshihiro Okada, Koichi Niijima: Feature preserving motion compression based on hierarchical curve simplification. 1435-1438
Lena Chang, S. J. Chang, S. W. Leu, J. D. Chen: An efficient eigen-space approach for management of satellite image databases. 1439-1442
Fei Zuo, Peter H. N. de With: Real-time facial feature extraction using statistical shape model and Haar-wavelet based feature search. 1443-1446
Homayoon S. M. Beigi: Aggressive compression of the dynamics of handwriting and signature signals. 1447-1450
Sarah Lee, Tania Stathaki: Endomorphic modelling for two-dimensional time-varying autoregressive model signals. 1451-1454
Xueqin Zhao, Jianming Lu, Y. Nomura, Takashi Yahagi: A new method of second-order parallel adaptive Volterra filter. 1455-1458
T. Thongkamwitoon, Supavadee Aramvith, Thanarat H. Chalidabhongse: An adaptive real-time background subtraction and moving shadows detection. 1459-1462
Kimiaki Shirahama, Kazuhisa Iwamoto, Kuniaki Uehara: Video data mining: rhythms in a movie. 1463-1466
Jiashu Zhang, L. Zhang, Heng-Ming Tai: Efficient video object segmentation using adaptive background registration and edge-based change detection techniques. 1467-1470
Nicholas H. Zamora, Xiaoping Hu, Ümit Y. Ogras, Radu Marculescu: Resource-aware video processing techniques for ambient multimedia systems. 1471-1474
Ryoma Oami, Ana B. Benitez, Shih-Fu Chang, Nevenka Dimitrova: Understanding and modeling user interests in consumer videos. 1475-1478
Rongshan Yu, Xiao Lin, Susanto Rahardja, Chi Chung Ko: A statistics study of the MDCT coefficient distribution for audio. 1483-1486
Rong Ding, Qionghai Dai, Wenli Xu, Dongdong Zhu, Hao Yin: Background-frame based motion compensation for video compression. 1487-1490
Feng Pan, Xiao Lin, Susanto Rahardja, Ee Ping Ong, Weisi Lin: Measuring blocking artifacts using edge direction information. 1491-1494
Thomas Köckerbauer, M. Kumar, Andreas Uhl: Lightweight JPEG 2000 confidentiality for mobile environments. 1495-1498
Volume 3
Lan-Da Van, Hsin-Fu Luo, Chien-Ming Wu, Wen-Hsiang Hu, Chun-Ming Huang, Wei-Chang Tsai: A high-performance area-aware DSP processor architecture for video codecs. 1499-1502
Ramazan Savas Aygün, Aidong Zhang: Integrating virtual camera controls into digital video. 1503-1506
Yuan Li, Wei Tsang Ooi: Distributed construction of resource-efficient overlay tree by approximating MST. 1507-1510
M. Uehara: Change aware distributed file system for a distributed search engine. 1511-1514
Chiou-Yng Lee, Chung-Jyi Chang: Low-complexity linear array multiplier for normal basis of type-II. 1515-1518
Jianfeng Chen, Louis Shue, Koksoon Phua, Hanwu Sun: Experimental study of dual microphone systems. 1519-1522
Ihab Amer, Wael M. Badawy, Graham A. Jullien: A VLSI prototype for Hadamard transform with application to MPEG-4 part 10. 1523-1526
Huai-yu Zhuang, Chengke Wu, Jia-Xian Deng: A fast algorithm and hardware implementation for rate-distortion optimization in JPEG2000. 1527-1530
J. Marston, G. MacCarthy, Beth Logan, Pedro J. Moreno, Jean-Manuel Van Thong: News Tuner: a simple interface for searching and browsing radio archives. 1531-1534
Wei-Hao Lin, Alexander G. Hauptmann: Merging rank lists from multiple sources in video classification. 1535-1538
Mukesh A. Zaveri, Shabbir N. Merchant, Uday B. Desai: Small and fast moving object detection and tracking in sports video sequences. 1539-1542
Luca Marchesotti, Stefano Piva, Carlo S. Regazzoni: A dynamic model integrating colour and shape information for objects tracking in conditions of occlusion. 1547-1550
Xiao-Feng Tong, Han-Qing Lu, Qing-Shan Liu: A three-layer event detection framework and its application in soccer video. 1551-1554
Xinguo Yu, Hon Wai Leong, Changsheng Xu, Qi Tian: A robust Hough-based algorithm for partial ellipse detection in broadcast soccer video. 1555-1558

