


default search action
ICME 2003: Baltimore, MD, USA
- Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, ICME 2003, 6-9 July 2003, Baltimore, MD, USA. IEEE Computer Society 2003, ISBN 0-7803-7965-9

Volume 1
Networked Video I
- Thinh P. Q. Nguyen, Puneet Mehra, Avideh Zakhor:

Path diversity and bandwidth allocation for multimedia streaming. 1-4 - Susie J. Wee, John G. Apostolopoulos, Wai-tian Tan, Sumit Roy:

Research and design of a mobile streaming media content delivery network. 5-8 - Jacob Chakareski, Eric Setton, Yi J. Liang, Bernd Girod:

Video streaming with diversity. 9-12 - Marco Fumagalli, Phoom Sagetong, Antonio Ortega:

Estimation of erased data in a H.263 coded stream by using unbalanced multiple description coding. 13-16 - Amy R. Reibman

, Vinay A. Vaishampayan
:
Quality monitoring for compressed video subjected to packet loss. 17-20
Automatic Indexing
- Rémi Ronfard

, Tien Tran-Thuong:
A framework for aligning and indexing movies with their script. 21-24 - Xiaofei He, Wei-Ying Ma

, Hong-Jiang Zhang:
Imagerank: spectral techniques for structural analysis of image database. 25-28 - Adam Berenzweig, Daniel P. W. Ellis, Steve Lawrence:

Anchor space for classification and similarity measurement of music. 29-32 - Tong Zhang:

Automatic singer identification. 33-36 - Matthew R. Boutell, Jiebo Luo

, Robert T. Gray:
Sunset scene classification using simulated image recomposition. 37-40
Multimodal Interfaces
- Yeow Kee Tan, Nasser Sherkat

, Tony Allen:
Eye gaze and speech for data entry: a comparison of different data entry methods. 41-44 - Yasuhito Sawahata

, Kiyoharu Aizawa:
Wearable imaging system for summarizing personal experiences. 45-48 - Timothy T. H. Chen, Sidney S. Fels, Saehee Sarah Min:

FlowField and beyond: applying pressure-sensitive multi-point touchpad interaction. 49-52 - Xin Fan, Xing Xie, Wei-Ying Ma

, Hong-Jiang Zhang, He-Qin Zhou:
Visual attention based image browsing on mobile devices. 53-56 - Björn W. Schuller

, Martin Zobl, Gerhard Rigoll, Manfred K. Lang:
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge. 57-60
Speech and Audio Processing I
- Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo

:
A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems. 61-64 - Rongshan Yu, Xiao Lin, Susanto Rahardja

, Chi Chung Ko:
A fine granular scalable perceptually lossy and lossless audio coder. 65-68 - Simon Lucey

, Tsuhan Chen
:
An investigation into subspace rapid speaker adaptation for verification. 69-72 - Manuel J. Reyes Gomez, Daniel P. W. Ellis:

Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling. 73-76 - Chih-Kai Yang, Sou-Gee Chen:

New static and dynamic search algorithms for fast MP3 bit allocations. 77-80
Image Processing I
- Yongmin Li

, Li-Qun Xu, Geoff Morrison, Charles Nightingale, Jason Morphett:
Robust panorama from MPEG video. 81-84 - Jun-Wei Hsieh:

Fast stitching algorithm for moving object detection and mosaic construction. 85-88 - Zhang John Chen, Jagath Samarabandu:

Planar region depth filling using edge detection with embedded confidence technique and Hough transform. 89-92 - S. H. Srinivasan, Mohan S. Kankanhalli

:
Wide baseline spectral matching. 93-96 - Wei-Qi Yan, Mohan S. Kankanhalli

:
Colorizing infrared home videos. 97-100 - Hasan F. Ates

, Michael T. Orchard:
Image interpolation using wavelet-based contour estimation. 101-104 - Andy Chang, Oscar C. Au, Yick Ming Yeung:

A novel approach to fast multi-block motion estimation for H.264 video coding. 105-108 - Gulcin Caner, A. Murat Tekalp

, Wendi B. Heinzelman
:
Super resolution recovery for multi-camera surveillance imaging. 109-112 - Yu Hen Hu, Rajas A. Sambhare:

Constrained texture synthesis for image post processing. 113-116
Multimedia Architectures and Implementation
- Nikolaos Bellas

, Malcolm Dwyer:
A programmable, high performance vector array unit used for real-time motion estimation. 117-120 - Tay-Jyi Lin, Chin-Chi Chang, Tsung-Hsun Yang, Yu-Ming Chang, Chien-Hung Lin, Chen-Chia Lee, Hung-Yueh Lin, Chein-Wei Jen:

Performance evaluation of ring-structure register file in multimedia applications. 121-124 - Tay-Jyi Lin, Tsung-Hsun Yang, Chein-Wei Jen:

Coefficient optimization for area-effective multiplier-less FIR filters. 125-128 - Satoshi Nishiguchi, Kazuhide Higashi, Yoshinari Kameda, Michihiko Minoh:

A sensor-fusion method for detecting a speaking student. 129-132 - Tsung-Han Tsai, Wen-Cheng Chen, Chun-Nan Liu:

A low power VLSI implementation for variable length decoder in MPEG-1 layer III. 133-136 - Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Ya-Yun Shih, Liang-Gee Chen

:
Novel word-level algorithm of embedded block coding in JPEG 2000. 137-140 - Jongmyon Kim, D. Scott Wills:

Quantized color instruction set for media-on-demand applications. 141-144 - Michelle Yan, James Shaw, Vahid Khamsi, Shih-Ping Liou:

Tracking and presenting user attention for collaborative browsing using heterogeneous devices. 145-148 - Shinsuke Kobayashi, Kentaro Mita, Yoshinori Takeuchi, Masaharu Imai:

Rapid prototyping of JPEG encoder using the ASIP development system: PEAS-III. 149-152
Text, Graphics, Face, Scene, and Song Recognition
- Ioannis Andreou, Nikitas M. Sgouros:

Sketch creation utilizing shape matching techniques. 153-156 - Michael H. Lee, Surya Nepal, Uma Srinivasan:

Edge-based semantic classification of sports video sequences. 157-160 - Gees C. Stein, Jens Rittscher

, Anthony Hoogs:
Enabling video annotation using a semantic database extended with visual knowledge. 161-164 - Hidehisa Nagano, Kunio Kashino, Hiroshi Murase:

A fast search algorithm for background music signals based on the search for numerous small signal components. 165-168 - Ahmet Ekin, A. Murat Tekalp

:
Generic play-break event detection for summarization and hierarchical sports video analysis. 169-172 - Amit Chakraborty, Peiya Liu, Liang H. Hsu:

Extracting anchorable information units from PDF files. 173-176 - Lijun Yin, Sergey Royt, Matt T. Yourst, Anup Basu:

Recognizing facial expressions using active textures with wrinkles. 177-180 - Francis K. H. Quek, Yingen Xiong:

Oscillatory gestures and discourse. 181-184
Networked Video II
- Haitao Zheng:

Optimizing wireless multimedia transmissions through cross layer design. 185-188 - Jacco R. Taal, Ivaylo Haratcherev, Koen Langendoen

, Inald Lagendijk:
Quality of service controlled adaptive video-coding over IEEE 802.11 wireless links. 189-192 - Thomas Stockhammer

:
Is fine-granular scalable video coding beneficial for wireless video applications? 193-196 - Jie Chen, S. Hsia:

Joint cross-layer design for wireless QoS video delivery. 197-200 - Trista Pei-Chun Chen, Tsuhan Chen

:
Shaping for video with frame dependency. 201-204
Multimedia Security and Content Protection I
- H. Vicky Zhao, Min Wu, Z. Jane Wang, K. J. Ray Liu:

Performance of detection statistics under collusion attacks on independent multimedia fingerprints. 205-208 - Alexia Giannoula, Anastasios Tefas

, Nikos Nikolaidis
, Ioannis Pitas:
Improving the detection reliability of correlation-based watermarking techniques. 209-212 - Ming Sun Fu, Oscar C. Au:

A multi-bit robust watermark for halftone images. 213-216 - Nedeljko Cvejic, Djordje Tujkovic, Tapio Seppänen:

Increasing robustness of an audio watermark using turbo codes. 217-220 - Jonathan Foote, John Adcock, Andreas Girgensohn:

Time base modulation: a new approach to watermarking audio. 221-224
Virtual Reality and Imaging I
- Satya P. Mallick, Mohan M. Trivedi:

Parametric face modeling and affect synthesis. 225-228 - Inmaculada Rodríguez Santiago

, Manuel Peinado, Ronan Boulic, Daniel Meziat:
Bringing the human arm reachable space to a virtual environment for its analysis. 229-232 - Cha Zhang, Tsuhan Chen

:
A system for active image-based rendering. 233-236 - Yuzhong Shen, Kenneth E. Barner:

Surface denoising with directional fuzzy vector median filtering. 237-240 - Yong-In Yoon, Jang-Hwan Im, Dae-Hyun Kim, Jong-Soo Choi:

Reconstruction of linearly parameterized models using the vanishing points from a single image. 241-244
Authentication and Recognition
- Wende Zhang, Tsuhan Chen

:
Personal authentication based on generalized symmetric max minimal distance in subspace. 245-248 - Thang Viet Nguyen, Jagdish Chandra Patra

, Ee Luang Ang:
Blind image extraction from nonlinear mixtures using MLP-based ICA. 249-252 - Wei Wang, Aidong Zhang, Yuqing Song:

Identification of objects from image regions. 253-256 - S. Palanivel

, B. S. Venkatesh, B. Yegnanarayana:
Real time face authentication system using autoassociative neural network models. 257-260 - Dong-Wan Kang, Jun Ohya:

Postures of a human wearing a multiple-colored suit based on color information processing. 261-264
Wireless Multimedia Techniques
- Wei Wang, Michael R. Lyu:

Automatic generation of dubbing video slides for mobile wireless environment. 265-268 - Surya Nepal, Uma Srinivasan:

Adaptive video highlights for wired and wireless platforms. 269-272 - Dirk Trossen, Hemant H. Chaskar:

Enabling user-tailored MMS delivery in heterogeneous access scenarios. 273-276 - Shengjie Zhao, Zixiang Xiong, Xiaodong Wang:

Optimal resource allocation for wireless video over CDMA networks. 277-280 - Amol Bhatkar, Rajarathnam Chandramouli

, Narayanan Vijaykrishnan, Mary Jane Irwin:
Computation and transmission energy modeling through profiling for MPEG4 video transmission. 281-284 - Wen Xu, Sheila S. Hemami:

Delay-optimized robust transmission of images over multiple channels. 285-288 - Wanghong Yuan, Klara Nahrstedt:

Buffering approach for energy saving in video sensors. 289-292 - Jiancong Chen, S.-H. Gary Chan, Qian Zhang, Wenwu Zhu, Jin Chen:

A distributed power adaptation algorithm for multimedia delivery over ad hoc networks. 293-296
Content-based Retrieval
- Jieh Hsiang, Wen-Jun Liu, Bee-Chung Chen, Hsieh-Chang Tu:

Multidimensional interactive fine-grained image retrieval. 297-300 - Jürgen Assfalg, Alberto Del Bimbo

, Pietro Pala
:
Curvature maps for 3D CBR. 301-304 - Xiangdong Zhou, Qi Zhang, Lan Lin, Ailin Deng, Gang Wu:

Image retrieval by fuzzy clustering of relevance feedback records. 305-308 - Jun Gao, George Tzanetakis

, Peter Steenkiste
:
Content-based retrieval of music in scalable peer-to-peer networks. 309-312 - Lei Zhang, Fang Qian, Mingjing Li, Hong-Jiang Zhang:

An efficient memorization scheme for relevance feedback in image retrieval. 313-316 - Yuxin Peng, Chong-Wah Ngo

, Qing-Jie Dong, Zongming Guo, Jianguo Xiao:
Video clip retrieval by maximal matching and optimal matching in graph theory. 317-320 - Xin Huang, Shu-Ching Chen, Mei-Ling Shyu:

Incorporating real-valued multiple instance learning into relevance feedback for image retrieval. 321-324 - Ming Hong Pi, Mrinal Mandal, Anup Basu:

Image retrieval based on 2-D histogram of fractal parameters. 325-328 - Giridharan Iyengar, Harriet J. Nock, Chalapathy Neti:

Audio-visual synchrony for detection of monologues in video archives. 329-332 - Min Xu

, Ling-Yu Duan, Changsheng Xu, Qi Tian:
A fusion scheme of visual and auditory modalities for event detection in sports video. 333-336
Image Processing II
- Ching-Yeh Chen, Shao-Yi Chien

, Yi-Hau Chen, Yu-Wen Huang
, Liang-Gee Chen
:
Unsupervised object-based sprite coding system for tennis sport. 337-340 - Armando J. Pinho

, António J. R. Neves
:
Block-based histogram packing of color-quantized images. 341-344 - Nejat Kamaci, Yucel Altunbasak:

Performance comparison of the emerging H.264 video coding standard with the existing standards. 345-348 - Xiaodong Gu, Hong-Jiang Zhang:

Implementing dynamic GOP in video encoding. 349-352 - Yung-Gi Wu, Ming-Zhi Huang, Yu-Ling Wen:

Fractal image compression with variance and mean. 353-356 - Martin P. Boliek, Gene K. Wu:

JPEG 2000-like access using the JPM compound document file format. 357-360 - Shou-Yi Tseng:

Efficient motion estimation algorithm using run-time and distortion optimization approach. 361-364 - Liang Zhang:

Statistical model for intensity differences of corresponding points between stereo image pairs. 365-368 - Yuhua Ding, George J. Vachtsevanos, Anthony J. Yezzi Jr., Wayne Daley, Bonnie S. Heck-Ferri:

A real-time curve evolution-based image fusion algorithm for multisensory image segmentation. 369-372 - Bernd Girod, Chuo-Ling Chang, Prashant Ramanathan, Xiaoqing Zhu:

Light field compression using disparity-compensated lifting. 373-376
Speech Coding, Analysis, and Synthesis
- Christian H. Ritz

, Ian S. Burnett
, Jason Lukasiak:
Low bit rate wideband WI speech coding. 377-380 - Houman Zarrinkoub, Paul Mermelstein:

Joint optimization of short-term and long-term predictors in CELP speech coders. 381-384 - Om Deshmukh, Carol Y. Espy-Wilson:

A measure of aperiodicity and periodicity in speech. 385-388 - K. Sreenivasa Rao, B. Yegnanarayana:

Prosodic manipulation using instants of significant excitation. 389-392 - Arun Kumar, Ashish Verma:

Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts. 393-396 - Xiaodong He, Wu Chou:

minimum classification error linear regression for acoustic model adaptation of continuous density HMMS. 397-400 - Björn W. Schuller

, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition. 401-404 - Dong Wang, Lie Lu

, Hong-Jiang Zhang:
Speech segmentation without speech recognition. 405-408 - Julien Pinquier

, Jean-Luc Rouas
, Régine André-Obrecht:
A fusion study in speech / music classification. 409-412
Multimedia Technology for Gaming
- Mohammed Chalil, K. P. Sreekumar, Manoj Sankar:

MPEG-4 based framework for game engines to handle virtual advertisements in game. 413-416 - Amaryllis Raouzaiou, Kostas Karpouzis, Stefanos D. Kollias

:
Emotion representation for online gaming. 417-420 - Ghassan Al-Regib

, Yucel Altunbasak:
3TP: an application-layer protocol for streaming 3-D graphics. 421-424 - Magy Seif El-Nasr, Ian Horswill:

Expressive lighting for interactive entertainment. 425-428 - Son Minh Tran, Marius Preda

, Françoise J. Prêteux, Kalman Fazekas:
Exploring MPEG-4 BIFS features for creating multimedia games. 429-432
Multimedia Learning
- Raghavendra Singh, Ravi Kothari:

Relevance feedback algorithm based on learning from labeled and unlabeled data. 433-436 - Milind R. Naphade, Ching-Yung Lin, Apostol Natsev, Belle L. Tseng, John R. Smith:

A framework for moderate vocabulary semantic visual concept detection. 437-440 - Shinsuke Nakajima, Shinichi Kinoshita, Katsumi Tanaka:

Amplifying the differences between your positive samples and neighbors in image retrieval. 441-444 - Apostol Natsev, John R. Smith:

Active selection for multi-example querying by content. 445-448 - Tzvetanka I. Ianeva, Arjen P. de Vries

, Hein Röhrig:
Detecting cartoons: a case study in automatic video-genre classification. 449-452
QoS
- Wuttipong Kumwilaisak, Qian Zhang, Wenwu Zhu, C.-C. Jay Kuo

, Ya-Qin Zhang:
On the rate constraint of transmitting multiple priority classes with QoS. 453-456 - Bo Shen:

Meta-caching and meta-transcoding for server-side service proxy. 457-460 - Sheau-Ru Tong, Chun-Cheng Chang:

Harmonic DiffServ: provisioning scalable heterogeneous-QoS multicast in DiffServ networks. 461-464 - Rajeev Kumar:

A protocol with transcoding to support QoS over Internet for multimedia traffic. 465-468 - Nam Pham Ngoc, Gauthier Lafruit, Jean-Yves Mignolet, Serge Vernalde, Geert Deconinck

, Rudy Lauwereins:
A framework for mapping scalable networked applications on run-time reconfigurable platforms. 469-472
Image/Video Rendering/Synthesis
- Pun-Mo Ho, Tien-Tsin Wong, Kwok-Hung Choy, Chi-Sing Leung

:
PCA-based compression for image-based relighting. 473-476 - Amit A. Kale, Amit K. Roy-Chowdhury, Rama Chellappa:

Video based rendering of planar dynamic scenes. 477-480 - Sarah John, Mikhail A. Vorontsov:

Multiframe selective information fusion for 'looking through the woods'. 481-484 - Timothy K. Shih, Liang-Chen Lu, Ying-Hong Wang, Rong-Chi Chang

:
Multi-resolution image inpainting. 485-488 - Zhanfeng Yue, Liang Zhao, Rama Chellappa:

View synthesis of articulating humans using visual hull. 489-492
Layered, Scalable & Multiple Descriptions Transmission
- Xiao Su

, Rod Fatoohi:
Scalable coded image transmissions over peer-to-peer networks. 493-496 - Ji-An Zhao, Bo Li, Ishfaq Ahmad:

Traffic modeling for layered video. 497-500 - Lechang Cheng, Mabo Robert Ito:

Receiver-driven layered multicast using active networks. 501-504 - Chung-Ming Huang, Yuan-Tse Yu, Guo-Shiung Liau:

A statistical flow control mechanism for layered multimedia over the differentiated service network. 505-508 - Eric Setton, Yi J. Liang, Bernd Girod:

Adaptive multiple description video streaming over multiple channels with active probing. 509-512 - Ivan Lee

, Ling Guan:
Centralized peer-to-peer streaming with layered video. 513-516 - Ali C. Begen

, Yucel Altunbasak, Özlem Ergun:
Fast heuristics for multi-path selection for multiple description encoded video streaming. 517-520 - Bo Xie, Wenjun Zeng

:
Source characteristics based fast bitstream switching. 521-524 - Augustin Gavrilescu, Adrian Munteanu, Peter Schelkens

, Jan Cornelis
:
Embedded multiple description scalar quantizers for progressive image transmission. 525-528
Image Compression
- Mylène C. Q. Farias

, Sanjit K. Mitra, John M. Foley:
Perceptual contributions of blocky, blurry and noisy artifacts to overall annoyance. 529-532 - Jingdong Wang

, Jianguo Lee, Changshui Zhang:
Kernel GMM and its application to image binarization. 533-536 - Rastislav Lukac, Bogdan Smolka

, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos:
Generalized adaptive vector sigma filters. 537-540 - Yao Nie, Kenneth E. Barner:

Optimized fuzzy transformation for image deblocking. 541-544 - Ee Ping Ong

, Weisi Lin, Zhongkang Lu
, Susu Yao, Xiaokang Yang, Lijun Jiang:
No-reference JPEG-2000 image quality metric. 545-548 - Giuseppe Messina

, Alfio Castorina, Sebastiano Battiato
, Angelo Bosco:
Image quality improvement by adaptive exposure correction techniques. 549-552 - Giovanni Motta, Francesco Rizzo, James A. Storer:

Partitioned vector quantization: application to lossless compression of hyperspectral images. 553-556 - Daewon Kim, Daekyu Shin:

Energy-based adaptive DCT/IDCT for video coding. 557-560 - Lorenzo Granai, Fulvio Moschetti, Pierre Vandergheynst:

Ridgelet transform applied to motion compensated images. 561-564
Coding and Noise Removal
- Phil Spencer Whitehead, David V. Anderson, Mark A. Clements:

Adaptive, acoustic noise suppression for speech enhancement. 565-568 - Ashish Jagmohan, Anshul Sehgal, Narendra Ahuja:

WYZE-PMD based multiple description video codec. 569-572 - Nualsawat Hiransakolwong, Kien A. Hua, Khanh Vu, Piotr S. Windyga:

Segmentation of ultrasound liver images: an automatic approach. 573-576 - Nuwan D. Nanayakkara

, Jagath Samarabandu:
Unsupervised model based image segmentation using domain knowledge based fuzzy logic and edge enhancement. 577-580 - Zhengguo Li, Feng Pan, Keng Pang Lim, Genan Feng, Xiao Lin, Susanto Rahardja

, Dajun Wu:
Adaptive frame layer rate control for H.264. 581-584 - Bogdan Smolka

, Konstantinos N. Plataniotis, Rastislav Lukac, Anastasios N. Venetsanopoulos:
Similarity based impulsive noise removal in color images. 585-588 - Siva Somasundaram, Koduvayur P. Subbalakshmi

:
3-D multiple description video coding for packet switched networks. 589-592 - Xu Huang, Allan C. Madoc, Andrew D. Cheetham:

Wavelet-based Bayesian estimator for Poisson noise removal from images. 593-596 - Hideaki Kimata, Masaki Kitahara, Yoshiyuki Yashima:

3D motion vector coding with block base adaptive interpolation filter on H.264. 597-600 - Ligang Lu, Vadim Sheinin:

Real-time MPEG video coding with information look-ahead. 601-604
Watermarking and Fingerprinting
- Micheal Mullarkey, Neil J. Hurley

, Guenole C. M. Silvestre, Teddy Furon:
Application of side-informed embedding and polynomial detection to audio watermarking. 605-608 - Ming Sun Fu, Oscar C. Au:

A novel method to embed watermark in different halftone images: data hiding by conjugate error diffusion (DHCED). 609-612 - Hong Zhao, Min Wu, Z. Jane Wang, K. J. Ray Liu:

Nonlinear collusion attacks on independent fingerprints for multimedia. 613-616 - Z. Jane Wang, Min Wu, Hong Zhao, K. J. Ray Liu, Wade Trappe:

Resistance of orthogonal Gaussian fingerprints to collusion attacks. 617-620 - Jeffrey A. Bloom:

Security and rights management in digital cinema. 621-624 - Anandabrata Pal, Kulesh Shanmugasundaram, Nasir D. Memon

:
Automated reassembly of fragmented images. 625-628 - Kaliappan Gopalan:

Audio steganography using bit modification. 629-632 - Heather Yu:

Scalable encryption for multimedia content access control. 633-636 - Andreas Kalivas, Anastasios Tefas

, Ioannis Pitas:
Watermarking of 3D models using principal component analysis. 637-640
Video Processing for Multi-Camera Surveillance Systems
- Matteo Gandetto, Luca Marchesotti, S. Sciutto, D. Negroni, Carlo S. Regazzoni

:
From multi-sensor surveillance towards smart interactive spaces. 641-644 - Ser-Nam Lim, Ahmed M. Elgammal

, Larry S. Davis:
Image-based pan-tilt camera control in a multi-camera surveillance environment. 645-648 - Omar Javed, Zeeshan Rasheed, Orkun Alatas, Mubarak Shah

:
KNIGHT™: a real time surveillance system for multiple and non-overlapping cameras. 649-652 - Fatih Porikli

, Ajay Divakaran:
Multi-camera calibration, object tracking and query generation. 653-656 - Karsten Müller, Aljoscha Smolic, Michael Drose, Patrick Voigt, Thomas Wiegand:

Multi-texture modeling of 3D traffic scenes. 657-660
Wireless Multimedia I
- Jianping Hua, Zixiang Xiong:

Optimal rate allocation in progressive joint source-channel coding for image transmission over CDMA networks. 661-664 - Jie Chen:

Fast hopping OFDM and packet-awareness coder design for wireless multimedia delivery. 665-668 - Xiaofeng Xu, Mihaela van der Schaar, Santhana Krishnamachari, Sunghyun Choi

, Yao Wang
:
Adaptive error control for fine-granular-scalability video coding over IEEE 802.11 wireless LANs. 669-672 - Shirish Karande, Syed A. Khayam, Michael Krappel, Hayder Radha:

Analysis and modeling of errors at the 802.11b link layer. 673-676 - Yong Sun, Zixiang Xiong, Xiaodong Wang:

Iterative decoding of differentially space-time coded multiple descriptions of images. 677-680
Multimedia Hardware and Architectures
- Sebastiano Battiato

, Alfio Castorina, Mirko Guarnera, Filippo Vella
:
A light viewfinder pipeline for consumer devices application. 681-684 - Minseok Song, Heonshik Shin:

Minimization of buffer requirements using variable-size parity groups for fault-tolerant video servers. 685-688 - Chunjiang J. Duanmu, M. Omair Ahmad, M. N. S. Swamy:

8-bit partial sums of 16 luminance values for fast block motion estimation. 689-692 - Yu-Wen Huang

, To-Wei Chen, Bing-Yu Hsieh, Tu-Chih Wang, Te-Hao Chang, Liang-Gee Chen
:
Architecture design for deblocking filter in H.264/JVT/AVC. 693-696 - Xinjian Chen, Qionghai Dai:

A novel VLSI architecture for multidimensional discrete wavelet transform. 697-700
Novel Applications
- Juan Carlos Guerri

, Carlos Enrique Palau
, Ana Pajares, Angela Belda, Juan José Cermeño, Manuel Esteve:
A multimedia telemedicine system to assess musculoskeletal disorders. 701-704 - Shu-Ching Chen, Keqi Zhang, Min Chen:

A real-time 3D animation environment for storm surge. 705-708 - Panu Hämäläinen, Marko Hännikäinen, Timo D. Hämäläinen, Riku Soininen:

Offline architecture for real-time betting. 709-712 - Chung-Sheng Li, Charu C. Aggarwal, Murray Campbell, Yuan-Chi Chang, Gregory Glass, Vijay S. Iyengar, Mahesh Joshi, Ching-Yung Lin, Milind R. Naphade, John R. Smith, Belle L. Tseng, Min Wang, Kun-Lung Wu, Philip S. Yu:

Epi-SPIRE: a system for environmental and public health activity monitoring. 713-716 - John V. Harrison, Anna Andrusiewicz:

Enhancing digital advertising using dynamically configurable multimedia. 717-720
Speech and Audio Processing II
- Sunil Bharitkar

, Philip Hilmes, Chris Kyriakakis:
Sensitivity of multichannel room equalization to listener position. 721-724 - Sascha Spors, Achim Kuntz, Rudolf Rabenstein:

Listening room compensation for wave field synthesis. 725-728 - Kenzo Obata, Kentaro Noguchi, Yoshiaki Tadokoro:

A new sound source location algorithm based on formant frequency for sound image localization. 729-732 - Arvindh Krishnaswamy, Julius O. Smith III:

Inferring control inputs to an acoustic violin from audio spectra. 733-736 - Yong Rui, Dinei A. F. Florêncio:

New direct approaches to robust sound source localization. 737-740 - Parham Aarabi, Guangji Shi, Omid S. Jahromi:

Robust speech separation using time-frequency masking. 741-744 - Zhe Feng, Yaqian Zhou, Lide Wu, Zongge Li:

Audio classification based on maximum entropy model. 745-748 - Kuntal Sengupta, Prabir Burman:

Non-parametric approach to ICA using kernel density estimation. 749-752 - Jean-Luc Rouas

, Jérôme Farinas, François Pellegrino, Régine André-Obrecht:
Modeling prosody for language identification on read and spontaneous speech. 753-756
Multimedia Indexing
- Yimin Wu, Aidong Zhang:

An adaptive classification method for multimedia retrieval. 757-760 - Janghyun Yoon, Nikil Jayant:

Semantics-sensitive image retrieval: an information fusion approach. 761-764 - Anlei Dong, Bir Bhanu

:
Concept learning and transplantation for dynamic image databases. 765-768 - Paisarn Muneesawang, Ling Guan:

Image retrieval with embedded sub-class information using Gaussian mixture models. 769-772 - Jeroen Vendrig, Marcel Worring, Arnold W. M. Smeulders:

Components and systems for interactive video indexing. 773-776 - Andrea Kutics, Akihiko Nakagawa, Kiyotaka Tanaka, Minoru Yamada, Yasuo Sambe, Sakuichi Ohtsuka:

Linking images and keywords for semantics-based image retrieval. 777-780 - Alejandro Jaimes, John R. Smith:

Semi-automatic, data-driven construction of multimedia ontologies. 781-784 - Keiji Yanai

:
Image collector II: a system for gathering more than one thousand images from the Web for one keyword. 785-788
QoS and Broadcasts
- Corina Scheiter, Rainer Steffen, Markus Zeller, Rudi Knorr, Benno Stabernack, Kai-Immo Wels:

A system for QOS-enabled MPEG-4 video transmission over Bluetooth for mobile applications. 789-792 - Chin-Hei Chien, Wanjiun Liao

:
A self-configuring RED gateway for quality of service (QoS) networks. 793-796 - Jia Zhang, Jen-Yao Chung, Zhixing Zhang:

A router model for QoS-based multimedia Web services. 797-800 - Hong Kee Sul, Hyunchul Kim, Kilnam Chon:

A hybrid pagoda broadcasting protocol: fixed-delay pagoda broadcasting protocol with partial preloading. 801-804 - Nera W. C. Liu, Jack Y. B. Lee:

Constrained consonant broadcasting - a generalized periodic broadcasting scheme for large scale video streaming. 805-808 - Yeonjoon Chung, Ahmed H. Tewfik:

An efficient video broadcasting protocol with scalable preloading scheme. 809-812 - Virgilio Rodriguez:

Resource management for scalably encoded information: the case of image transmission over wireless networks. 813-816 - Chow-Sing Lin, Tzong-Yao Chang, Jin-Ru Hsieh:

On utilizing multi-channel to provide scheduled video delivery. 817-820 - Deepak S. Turaga, Mihaela van der Schaar:

Content-adaptive filtering in the UMCTF framework. 821-824 - Zhizhong Zhe, Hong Ren Wu

, Zhenghua Yu, Tim Ferguson, Damian M. Tan:
Performance evaluation of a perceptual ringing distortion metric for digital video. 825-828
Signal Processing Theory and Methods I
- Abdessamad Ben Hamza

, Hamid Krim, Bilge Karaçali:
Structural risk minimization using nearest neighbor rule. 829-832 - Hamid Reza Abutalebi

, Hamid Sheikhzadeh, Robert L. Brennan, George H. Freeman:
Affine projection algorithm for oversampled subband adaptive filters. 833-836 - Mohammad Bilal Malik

:
State-space RLS. 837-840 - Behrouz Nowrouzian, Arthur T. G. Fuller, M. N. S. Swamy:

A necessary and sufficient condition for the BIBO stability of general-order bode-type variable-amplitude wave-digital equalizers. 841-844 - Zhong Ji, Shuren Qi:

Detection of EEG basic rhythm feature by using band relative intensity ratio(BRIR). 845-848 - Kamyar Hazaveh, Kaamran Raahemifar:

Optimized local discriminant basis algorithm. 849-852 - Palghat P. Vaidyanathan, Byung-Jun Yoon:

Discrete probability density estimation using multirate DSP models. 853-856 - Andre Tkacenko, Palghat P. Vaidyanathan:

On the least squares signal approximation model for overdecimated rational nonuniform filter banks and applications. 857-860 - J. Michael Peterson, Shubha Kadambe:

A probabilistic approach for blind source separation of underdetermined convolutive mixtures. 861-864 - Jie Liang

, Lu Gan, Chengjie Tu, Trac D. Tran, Kai-Kuang Ma:
On efficient implementation of oversampled linear phase perfect reconstruction filter banks. 865-868
Volume 2
Smart Cameras
- Jörn Jachalsky, Martin Wahler, Peter Pirsch, S. Capperon, Winfried Gehrke, W. M. Kruijtzer, Antonio Núñez

:
A core for ambient and mobile intelligent imaging applications. 1-4 - Wayne H. Wolf, I. Burak Özer, Tiehan Lv:

Architectures for distributed smart cameras. 5-8 - Kohsia S. Huang, Mohan M. Trivedi:

Distributed video arrays for tracking, human identification, and activity analysis. 9-12 - John W. Fisher III, Trevor Darrell:

Learning cross-modal appearance models with application to tracking. 13-16 - Jacky Mallett, M. Michael Bove Jr.:

Eye Society. 17-20
Multimedia Retrieval
- Feng Jing, Mingjing Li, Hong-Jiang Zhang, Bo Zhang:

Support vector machines for region-based image retrieval. 21-24 - Charles Parker:

Towards intelligent string matching in query-by-humming systems. 25-28 - Wing Ho Leung, Tsuhan Chen

:
Hierarchical matching for retrieval of hand-drawn sketches. 29-32 - Joo-Hwee Lim, Philippe Mulhem, Qi Tian:

Event-based home photo retrieval. 33-36 - Bo Feng, Qing Li

, Jun Yang, Liu Wenyin, Jian Zhai:
Efficient database facilities for content-based Flash retrieval. 37-40
Network Adaptive Techniques
- Hui Cheng, Xi Min Zhang, Yun-Qing Shi, Anthony Vetro, Huifang Sun:

Rate allocation for FGS coded video using composite R-D analysis. 41-44 - Nicola Franchi, Marco Fumagalli, Rosa Lancini:

Optimised source and channel coding for video transmission over ADSL. 45-48 - Gene Cheung, Connie Chan:

Jointly optimal reference frame & quality of service selection for H.261 video coding over lossy networks. 49-52 - Ashwatha Matthur, Padmavathi Mundur:

Dynamic load balancing across mirrored multimedia servers. 53-56 - Hongliang Li, Guizhong Liu, Yongli Li, Zhongwei Zhang:

An effective burstiness estimation model for VBR video stream. 57-60
Multimedia Software and Architectures
- Oliver Schreer

, Nicole Atzpadin, Serap Askar, Peter Kauff:
Advanced 3D signal processing for Virtual Team User Environments. 61-64 - James C. Beyer, David H. C. Du:

Data storage and delivery protocols to support interactive high-resolution image browsing on a PC-cluster based image-wall. 65-68 - Wei Shu, Min-You Wu:

Scalability of closed-loop video delivery service. 69-72 - Marc Leeman, David Atienza Alonso:

Intermediate variable elimination in a global context for a 3D multimedia application. 73-76 - Andreas Girgensohn:

A fast layout algorithm for visual video summaries. 77-80
Virtual Reality and Imaging II
- Kostas Karpouzis, Amaryllis Raouzaiou, Paraskevi K. Tzouveli

, Spiros Ioannou, Stefanos D. Kollias
:
MPEG-4: one multimedia standard to unite all. 81-84 - Takahito Kawanishi, Masaru Tsuchida, Shigeru Takagi, Hiroshi Murase:

Small cylindrical display for anthropomorphic agents. 85-88 - Hitoshi Kanda, Jun Ohya:

Efficient, realistic method for animating dynamic behaviors of 3D botanical trees. 89-92 - Wang Hee Lee, Kuntal Sengupta, Rajeev Sharma:

Augmented reality with occlusion rendering using background-foreground segmentation and trifocal tensors. 93-96 - Lijun Jiang, Shiqian Wu, Dajun Wu, Ee Ping Ong

, Susanto Rahardja
:
3D shape modeling by color phase stepping light projection. 97-100 - Angus M. K. Siu, Rynson W. H. Lau

:
Relief occlusion-adaptive meshes for 3D imaging. 101-104 - Roberta L. Gomes, Guillermo de Jesús Hoyos-Rivera, Jean-Pierre Courtiat:

Collaborative virtual environments: going beyond virtual reality. 105-108 - Irene Cheng:

Efficient 3D object simplification and fragmented texture scaling for online visualization. 109-112
Robustness, Error Concealment and Loss Recovery
- Wenjun Zeng

:
Spatial-temporal error concealment with side information for standard video codecs. 113-116 - Hyunjoo Kim, Sooyong Kang, Heon Young Yeom:

Node selection for a fault-tolerant streaming service on a peer-to-peer network. 117-120 - Thenghong H. Yeo, Wai Choong Wong

, Dong-Yan Huang:
Soft decision unequal error protection scheme for MPEG advanced audio coding. 121-124 - Fan Zhai, Randall Berry, Thrasyvoulos N. Pappas

, Aggelos K. Katsaggelos
:
A rate-distortion optimized error control scheme for scalable video streaming over the Internet. 125-128 - Shirish S. Karande, Hayder Radha:

A new family of channel coding schemes for real-time visual communications. 129-132 - Gaurav Agarwal, Alwin Anbu, Aniruddha Sinha

:
A fast algorithm to find the region-of-interest in the compressed MPEG domain. 133-136 - Chui Sian Ong, Klara Nahrstedt, Wanghong Yuan:

Quality of protection for mobile multimedia applications. 137-140 - Timothy K. Shih, Louis H. Lin, Jen-Shiun Chiang:

Progressive image transmission by adaptive interpolation. 141-144 - Wei-Ying Kung, Chang-Su Kim, C.-C. Jay Kuo

:
A spatial-domain error concealment method with edge recovery and selective directional interpolation. 145-148 - Pascal Bourdon, Bertrand Augereau, Christian Olivier, Christian Chatellier:

A PDE-based method for ringing artifact removal on grayscale and color JPEG2000 images. 149-152
Networked Multimedia
- Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang:

Replication strategies for peer-to-peer based multimedia distribution service. 153-156 - Amir Asif

:
Multimedia learning objects for digital signal processing in communications. 157-160 - David S. Doermann

, Arvind Karunanidhi, Niketu Parekh, M. A. Khan, S. Chen, Hasan Timucin Ozdemir, M. Miwa, Kuo Chu Lee:
Issues in the transmission, analysis, storage and retrieval of surveillance video. 161-164 - Tayeb Lemlouma, Nabil Layaïda:

Encoding multimedia presentations for user preferences and limited environments. 165-168 - Keng Pang Lim, Dajun Wu, Si Wu, Susanto Rahardja, Xiao Lin, Lijun Jiang, Rongshan Yu, Feng Pan, Zhengguo Li, Susu Yao, Genan Feng, Chi Chung Ko:

Video streaming on embedded devices through GPRS network. 169-172 - Qiang Ma

, Katsumi Tanaka:
WebTelop: dynamic TV-content augmentation by using Web pages. 173-176 - Yasuhiko Watanabe, Kazuya Sono, Kazuya Yokomizo, Yoshihiro Okada:

Translation camera on mobile phone. 177-180 - Mihai M. Lazarescu

, Svetha Venkatesh
:
Using camera motion to identify types of American football plays. 181-184
Moving from Features to Semantics using Computational Media Aesthetics
- Marc Davis:

Active capture: integrating human-computer interaction and computer vision/audition to automate media capture. 185-188 - Hangzai Luo, Jianping Fan, Jing Xiao, Xingquan Zhu

:
Semantic principal video shot classification via mixture Gaussian. 189-192 - Simon Moncrieff, Svetha Venkatesh

, Chitra Dorai:
Horror film genre typing and scene labeling via audio analysis. 193-196 - Barbara Barry, Glorianna Davenport:

Documenting life: videography and common sense. 197-200 - John W. Mateer:

Developing effective test sets and metrics for evaluating automated media analysis systems. 201-204
Multimedia Security and Content Protection II
- Yan Sun, K. J. Ray Liu:

Multi-layer key management for secure multimedia multicast communications. 205-208 - Qibin Sun, Dajun He, Zhishou Zhang, Qi Tian:

A secure and robust approach to scalable video authentication. 209-212 - Dekun Zou, Chai Wah Wu

, Guorong Xuan, Yun-Qing Shi:
A content-based image authentication system with lossless data hiding. 213-216 - Z. Jane Wang, Min Wu, Wade Trappe, K. J. Ray Liu:

Anti-collusion of group-oriented fingerprinting. 217-220 - Ankur Datta, Niels da Vitoria Lobo, John J. Leeson:

Novel feature vector for image authentication. 221-224
Source and Channel Coding
- Shan Liu

, C.-C. Jay Kuo
:
Joint temporal-spatial rate control for adaptive video transcoding. 225-228 - Seong Hwan Jang, Nikil Jayant:

An adaptive non-linear motion vector resampling algorithm for down-scaling video transcoding. 229-232 - Hua Yang, Kenneth Rose:

Source-channel prediction in error resilient video coding. 233-236 - Tao Chen, Zhihai He:

Single-pass distortion-smoothing encoding for low bit-rate video streaming applications. 237-240 - Cheng-Yu Pai, William E. Lynch:

MPEG-4 rate control algorithm using Laplace parameter estimation. 241-244
Image Coding and Enhancement
- Haohong Wang, Guido M. Schuster, Aggelos K. Katsaggelos

:
Minmax optimal shape coding using skeleton decomposition. 245-248 - Min Shao, Kenneth E. Barner:

Soft-partition-based weighted sum filters for image enhancement. 249-252 - Yibin Yang, Lilla Böröczky:

Joint resolution enhancement and artifact reduction for MPEG-2 encoded digital video. 253-256 - Haohong Wang, Guido M. Schuster, Aggelos K. Katsaggelos

:
Operational rate-distortion optimal bit allocation between shape and texture for MPEG-4 video coding. 257-260 - Zhibin Pan, Koji Kotani, Tadahiro Ohmi:

A fast full search equivalent encoding method for vector quantization by using appropriate features. 261-264
Video Analysis
- Xinguo Yu, Qi Tian, Kongwah Wan:

A novel ball detection framework for real soccer video. 265-268 - Ying Li, Yufei Ma, Hong-Jiang Zhang:

Salient region detection and tracking in video. 269-272 - Xinguo Yu, Changsheng Xu, Qi Tian, Hon Wai Leong:

A ball tracking framework for broadcast soccer video. 273-276 - Rainer Lienhart, Luhong Liang, Alexander Kuranov:

A detector tree of boosted classifiers for real-time object detection and tracking. 277-280 - Min Xu

, Namunu C. Maddage, Changsheng Xu, Mohan S. Kankanhalli
, Qi Tian:
Creating audio keywords for event detection in soccer video. 281-284 - Shunsuke Kamijo, Masao Sakauchi:

Segmentation of vehicles and pedestrians in traffic scene by spatio-temporal Markov random field model. 285-288 - Alan Hanjalic:

Multimodal approach to measuring excitement in video. 289-292 - Rong Jin, Alexander G. Hauptmann:

Learning to identify video shots with people based on face detection. 293-296 - Yang Ran, Qinfen Zheng:

Multi moving people detection from binocular sequences. 297-300 - Zuzana Cernekova

, Constantine Kotropoulos, Ioannis Pitas:
Video shot segmentation using singular value decomposition. 301-304
Multimedia Streaming Architecture
- Hai Jin, Dafu Deng:

HHMSM: a hierarchical hybrid multicast stream merging scheme for large-scale video-on-demand systems. 305-308 - Zhen Li, Guobin Shen, Shipeng Li

, Edward J. Delp
:
L-TFRC: an end-to-end congestion control mechanism for video streaming over the Internet. 309-312 - Chen-Lung Chan, Shih-Yu Huang, Jia-Shung Wang:

Cooperative cache framework for video streaming applications. 313-316 - Toufik Ahmed

, Ahmed Mehaoua, Vincent Lecuire:
Streaming MPEG-4 audio visual objects using TCP-friendly rate control and unequal error protection. 317-320 - Longin Jan Latecki

, Kishore Kulkarni, Jaiwant Mulik:
Better audio performance when video stream is monitored by TCP congestion control. 321-324 - Xuxian Jiang, Yu Dong, Dongyan Xu, Bharat K. Bhargava:

GnuStream: a P2P media streaming system prototype. 325-328 - Jun Guo

, Peter G. Taylor
, Moshe Zukerman
, Sammy Chan
, Kit-Sang Tang
, Eric W. M. Wong
:
On the efficient use of video-on-demand storage facility. 329-332 - Michael Harville, Michele Covell, Susie J. Wee:

An architecture for componentized, network-based media services. 333-336
Image Classification and Detection
- Sungju Youm, Woosaeng Kim:

Dynamic threshold method for scene change detection. 337-340 - Woosaeng Kim, Ji Yoon Kim:

Image classification using spatial relationship matrix based on color spatio-histogram. 341-344 - Xavier Gibert, Huiping Li, David S. Doermann

:
Sports video classification using HMMS. 345-348 - Shaohua Kevin Zhou, Rama Chellappa, Baback Moghaddam:

Adaptive visual tracking and recognition using particle filters. 349-352 - Hwajeong Lee, Daehwan Kim

, Daijin Kim, Sung Yang Bang:
Real-time automatic vehicle management system using vehicle tracking and car plate number identification. 353-356 - Junqiang Lan, Xinhua Zhuang:

Embedded SLCCA for wavelet image coding. 357-360 - Jian Zhou, Huai-Rong Shao, Chia Shen, Ming-Ting Sun:

FGS enhancement layer truncation with minimized intra-frame quality variation. 361-364 - Aya Aner-Wolf:

Determining a scene's atmosphere by film grammar rules. 365-368 - Mukesh A. Zaveri, Uday B. Desai, S. N. Merchant:

Tracking multiple maneuvering point targets using multiple filter bank in infrared image sequence. 369-372
Indexing, Segmentation, and Retrieval
- Paisarn Muneesawang, Ling Guan:

Automatic relevance feedback for video retrieval. 373-376 - Miki Haseyama, Isao Kondo:

2-D functional AR model for image identification. 377-380 - Chi-Man Pun

:
Invariant content-based image retrieval by wavelet energy signatures. 381-384 - Jiqiang Song, Min Cai, Michael R. Lyu:

A robust statistic method for classifying color polarity of video text. 385-388 - Akisato Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:

Dynamic-segmentation-based feature dimension reduction for quick audio/video searching. 389-392 - Miki Haseyama, Atsushi Matsumura:

A trainable retrieval system for cartoon character images. 393-396 - Sangoh Jeong, Chee Sun Won

, Robert M. Gray:
Histogram-based image retrieval using Gauss mixture vector quantization. 397-400 - Qixiang Ye, Wen Gao, Wei Zeng:

Color image segmentation using density-based clustering. 401-404
Video Segmentation for Semantic Annotation and Transcoding
- Ba Tu Truong, Svetha Venkatesh

, Chitra Dorai:
Identifying film takes for cinematic analysis. 405-408 - Nathalie Peyrard, Patrick Bouthemy:

Motion-based selection of relevant video segments for video summarisation. 409-412 - Winston H. Hsu, Shih-Fu Chang:

A statistical framework for fusing mid-level perceptual features in news story segmentation. 413-416 - Anthony Vetro, Tetsuji Haga, Kazuhiko Sumi, Huifang Sun:

Object-based coding for long-term archive of surveillance video. 417-420 - Marco Bertini

, Rita Cucchiara
, Alberto Del Bimbo
, Andrea Prati
:
Object and event detection for semantic annotation and transcoding. 421-424
Wireless Multimedia II
- Syed A. Khayam, Shirish Karande, Michael Krappel, Hayder Radha:

Cross-layer protocol design for real-time multimedia applications over 802.11 b networks. 425-428 - Fan Yang, Qian Zhang, Wenwu Zhu, Ya-Qin Zhang:

An end-to-end TCP-friendly streaming protocol for multimedia over wireless Internet. 429-432 - Zhijun Lei, Nicolas D. Georganas:

Rate adaptation transcoding for video streaming over wireless channels. 433-436 - Yong Pei, James W. Modestino:

Interactive video coding and transmission over wired-to-wireless IP networks using an edge proxy. 437-440 - Allen Miu, John G. Apostolopoulos, Wai-tian Tan, Mitchell D. Trott:

Low-latency wireless video over 802.11 networks using path diversity. 441-444
Multimedia Semantics
- John R. Smith, Milind R. Naphade, Apostol Natsev:

Multimedia semantic indexing using model vectors. 445-448 - Dinh Quoc Phung

, Svetha Venkatesh
, Chitra Dorai:
On the extraction of thematic and dramatic functions of content in educational videos. 449-452 - Brett Adams, Chitra Dorai, Svetha Venkatesh

, Hung Hai Bui:
Indexing narrative structure and semantics in motion pictures with a probabilistic framework. 453-456 - Jiebo Luo

, Amit Singhal, Weiyu Zhu:
Natural object detection in outdoor scenes based on probabilistic spatial context models. 457-460 - Shinichi Takagi, Shinobu Hattori, Kazumasa Yokoyama, Akihisa Kodate, Hideyoshi Tominaga:

Sports video categorizing method using camera motion parameters. 461-464
Face, Body, and Audio-visual Analysis
- Yong Ma, Xiaoqing Ding:

Robust real-time face detection based on cost-sensitive AdaBoost method. 465-468 - Jonathan H. Connell, Norman Haas, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos, Senem Velipasalar:

A real-time prototype for small-vocabulary audio-visual ASR. 469-472 - Mingkun Li, Dongge Li, Nevenka Dimitrova, Ishwar K. Sethi

:
Audio-visual talking face detection. 473-476 - Chung-Lin Huang, Chia-Ying Chung:

A real-time model-based human motion analysis system. 477-480 - Petar S. Aleksic, Aggelos K. Katsaggelos

:
Product HMMs for audio-visual continuous speech recognition using facial animation parameters. 481-484
Multimedia Security and Content Protection III
- Peter Hon-Wah Wong, Yick Ming Yeung, Oscar C. Au:

Capacity for JPEG2000-to-JPEG2000 images watermarking. 485-488 - Chun-Shien Lu:

Dual security-based image steganography. 489-492 - Yongdong Wu, Feng Bao, Changsheng Xu:

The security flaws in some authentication watermarking schemes. 493-496 - Huiping Guo, Nicolas D. Georganas:

Digital image watermarking for joint ownership verification without a trusted dealer. 497-500 - Feilong Liu, Yangsheng Wang:

An improved block dependent fragile image watermark. 501-504 - Gwenaël J. Doërr

, Jean-Luc Dugelay
:
New intra-video collusion attack using mosaicing. 505-508 - Shaohui Liu, Hongxun Yao, Wen Gao:

Neural network based steganalysis in still images. 509-512 - Serhat Erküçük

, Sridhar Krishnan, Mehmet Zeytinoglu:
Robust audio watermarking using a chirp based technique. 513-516
Multimedia Distribution
- Min-You Wu, Wei Shu:

Video distribution with edge stations and Wi-Fi delivery networks. 521-524 - Si Woong Jang, Yong Woon Park:

A dynamic multicasting policy based on proxy caching. 525-528 - Bahjat Qazzaz, Javier Moreno, Xiaoyuan Yang, Porfidio Hernández

, Remo Suppi
, Emilio Luque:
Admission control policies for video on demand brokers. 529-532 - Qiang Liu, Jenq-Neng Hwang:

A new congestion control algorithm for layered multicast in heterogeneous multimedia dissemination. 533-536 - Hugh Melvin, Liam Murphy

:
An integrated NTP-RTCP solution to audio skew detection and compensation for VoIP applications. 537-540 - Hong Man, Yang Li:

Multi-stream video transport over DiffServ wireless LANS. 541-544 - Syed Irtiza Ali, Hayder Radha:

Hierarchical handoff schemes over wireless LAN/WAN networks for multimedia applications. 545-548
Image Compression and Modeling
- Takahiro Nakayama

, Masahiro Konda, Koji Takeuchi, Koji Kotani, Tadahiro Ohmi:
Adaptive resolution vector quantization technique and basic codebook design method for compound image compression. 549-552 - Xingsong Hou, Guizhong Liu:

A wavelet packet image coding algorithm based on quadtree classification and UTCQ. 553-556 - Xiaopeng Fan, Yan Lu, Wen Gao:

A novel coefficient scanning scheme for directional spatial prediction-based image compression. 557-560 - Deepak S. Turaga, Mihaela van der Schaar:

Reduced complexity spatio-temporal scalable motion compensated wavelet video encoding. 561-564 - Yuxin Liu, Zhen Li, Paul Salama

, Edward J. Delp
:
A discussion of leaky prediction based scalable coding. 565-568 - Jean Cardinal:

Compression of side information. 569-572 - Feng Pan, Zhengguo Li, Keng Pang Lim, Dajun Wu, Rongshan Yu, Genan Feng:

An adaptive rate control algorithm for video coding over personal digital assistants (PDA). 573-576 - Geovanni Martinez:

Maximum-likelihood motion estimation of a human face. 577-580 - Mihaela van der Schaar, Deepak S. Turaga:

Unconstrained motion compensated temporal filtering (UMCTF) framework for wavelet video coding. 581-584 - Shan Suthaharan

:
A perceptually significant block-edge impairment metric for digital video coding. 585-588
Signal Processing Theory and Methods II
- Zheng Fang, Yingbo Hua:

Maximum likelihood method for blind identification of multiple autoregressive channels. 589-592 - Khim Sia Tan, Woon-Seng Gan

, Jun Yang
, Meng Hwa Er:
Constant beamwidth beamformer for difference frequency in parametric array. 593-596 - Omid S. Jahromi, Parham Aarabi:

Time delay estimation and signal reconstruction using multi-rate measurements. 597-600 - Yunnan Wu, Sun-Yuan Kung:

Detection for MIMO systems with imprecise channel knowledge. 601-604 - Xinying Zhang, Sun-Yuan Kung:

Capacity analysis for parallel and sequential MIMO equalizers. 605-608 - Timo Roman, Mihai Enescu, Visa Koivunen:

Time-domain method for tracking dispersive channels in MIMO OFDM systems. 609-612 - Frank Papenfuß, Yuri Artyukh, Eugene S. Boole, Dirk Timmermann

:
Optimal sampling functions in nonuniform sampling driver designs to overcome the Nyquist limit. 613-616 - Pamornpol Jinachitra:

Constrained EM estimates for harmonic source separation. 617-620 - Khaled Amleh, Hongbin Li:

Blind code timing and carrier offset estimation for DS-CDMA systems. 621-624 - M. Mauricio Lara, Aldo G. Orozco-Lugo

, Desmond C. McLernon
, Hugo J. Muro-Lemus:
Blind recovery of multiple packets in ad hoc mobile networks using polynomial phase modulating sequences. 625-628
Multimedia Authoring and Presentation
- Jun Kong, Meikang Qiu, Kang Zhang:

Authoring multimedia documents through grammatical specifications. 629-632 - Zhaohui Sun, Jon Riek, Alexander C. Loui:

High resolution multimedia slide show composition for video CD and DVD rendering. 633-636 - Tero Jokela:

Authoring tools for mobile multimedia content. 637-640 - Heikki Keränen, Tapani Rantakokko, Jani Mäntyjärvi

:
Sharing and presenting multimedia and context information within online communities using mobile terminals. 641-644 - Ilpo Koskinen:

User-generated content in mobile multimedia: empirical evidence from user studies. 645-648
Multimedia Streaming
- Yang Guo, Kyoungwon Suh, Jim Kurose, Don Towsley

:
A peer-to-peer on-demand streaming service and its performance evaluation. 649-652 - Joohee Kim, Russell M. Mersereau, Yucel Altunbasak:

Network-adaptive video streaming using multiple description coding and path diversity. 653-656 - Giancarlo Fortino

, Wilma Russo
, Eugenio Zimeo:
Enhancing cooperative playback systems with efficient encrypted multimedia streaming. 657-660 - Matthias Ohlenroth, Hermann Hellwagner

:
A protocol for adaptation-aware multimedia streaming. 661-664 - Yufeng Shan, Shivkumar Kalyanaraman:

Hybrid video downloading/streaming over peer-to-peer networks. 665-668
Capturing and Indexing Multimedia Events and Content
- Werner Geyer, Heather Richter, Gregory D. Abowd:

Making multimedia meeting records more meaningful. 669-672 - Jiqiang Song, Michael R. Lyu, Jenq-Neng Hwang, Min Cai:

PVCAIS: a personal videoconference archive indexing system. 673-676 - Yoshinari Kameda, Satoshi Nishiguchi, Michihiko Minoh:

CARMUL: concurrent automatic recording for multimedia lecture. 677-680 - Nikolai Joukov, Tzi-cker Chiueh:

Lectern II: a multimedia lecture capturing and editing system. 681-684 - Avare Stewart, Patrick Wolf, Matthias L. Hemmje:

Media and metadata management for capture and access systems in electronic lecturing environments. 685-688
Image/Video Indexing and Retrieval
- Yanjun Qi, Alexander G. Hauptmann, Ting Liu:

Supervised classification for video shot segmentation. 689-692 - Zhu Li, Aggelos K. Katsaggelos

, Bhavan Gandhi:
Temporal rate-distortion based optimal video summary generation. 693-696 - Ankur Mani:

Video segmentation using stabilized inverse diffusion. 697-700 - Daekyu Shin, Daewon Kim, Hyunsool Kim, Sanghui Park:

An image retrieval technique using rotationally invariant Gabor features and a localization method. 701-704 - Alexander Haubold, John R. Kender:

Analysis and interface for instructional video. 705-708
Speech and Audio Processing III
- Manu Mathew, Vasudha Bhat, Shine M. Thomas, Changhoon Yim

:
Modified MP3 encoder using complex modified cosine transform. 709-712 - Björn W. Schuller

, Gerhard Rigoll, Manfred K. Lang:
HMM-based music retrieval using stereophonic feature information and framelength adaptation. 713-716 - Aaron S. Master, Yi-Wen Liu:

Robust chirp parameter estimation for Hann windowed signals. 717-720 - Ting-Yao Wu, Lie Lu

, Ke Chen
, Hong-Jiang Zhang:
UBM-based incremental speaker adaptation. 721-724 - Cheng-Yuan Lin, Jyh-Shing Roger Jang

:
New refinement schemes for voice conversion. 725-728 - Dong-Yan Huang, Ruihua Ma:

Integer fast modified cosine transform. 729-732 - Hadi Harb

, Liming Chen:
Gender identification using a general audio classifier. 733-736 - Jouni Paulus

, Anssi Klapuri:
Conventional and periodic N-grams in the transcription of drum sequences. 737-740 - Steven J. Rennie, Parham Aarabi, Trausti T. Kristjansson, Brendan J. Frey, Kannan Achan:

Robust variational speech separation using fewer microphones than speakers. 741-744
Video Processing for Multimedia Interaction
- Alexandre R. J. François

, Eun-Young Elaine Kang:
A handheld mirror simulation. 745-748 - Jamey Graham, Jonathan J. Hull:

A paper-based interface for video browsing and retrieval. 749-752 - Frank M. Shipman III, Andreas Girgensohn, Lynn Wilcox:

Creating navigable multi-level video summaries. 753-756 - Lalitha Agnihotri, Nevenka Dimitrova, John R. Kender, John Zimmerman:

Study on requirement specifications for personalized multimedia summarization. 757-760 - Chun-Chuan Yang, Chih-Wen Tien, Yung-Chi Wang:

Supporting VCR-like operations in SMIL2.0 players. 761-764 - Erkut Erdem, Aykut Erdem

, Volkan Atalay
, A. Enis Çetin
:
Computer vision based unistroke keyboard system and mouse for the handicapped. 765-768 - Ishwar Ramani, Rajiv P. Bharadwaja, P. Venkat Rangan:

Location tracking for media appliances in wireless home networks. 769-772 - Lujun Yuan, Wen Gao, Yan Lu:

Latest arrival time leaky bucket for HRD constrained video coding. 773-776
Motion Estimation
- Charay Lerdsudwichai, Mohamed Abdel-Mottaleb

:
Algorithm for multiple faces tracking. 777-780 - Patrick Lanvin, Jean-Charles Noyer

, Mohammed Benjelloun
:
Non-linear estimation of image motion and tracking. 781-784 - Mireya S. Garcia, Henri Nicolas:

Video object motion applications focusing on non-planar rotation. 785-788 - Yu-Kuang Tu, Jar-Ferr Yang, Yi-Nung Shen, Ming-Ting Sun:

Fast variable-size block motion estimation using merging procedure with an adaptive threshold. 789-792 - Hongbin Wang, Hua Lin:

A spectral clustering approach to motion segmentation based on motion trajectory. 793-796 - Korada Ramkishor, T. S. Raghu, K. Suman, Pallapothu S. S. B. K. Gupta:

Spatial correlation based fast field motion vector estimation algorithm for interlaced video encoding. 797-800 - Ye Lu, Cheng Lu, Ze-Nian Li:

A modified space frequency decomposition algorithm for visual motion. 801-804 - Sumeer Goel, Mohsen Shaaban, Tarek Darwish, Hanan A. Mahmoud, Magdy A. Bayoumi:

Memory accesses reduction for MIME algorithm. 805-808 - Yu-Wen Huang

, Bing-Yu Hsieh, Tu-Chih Wang, Shao-Yi Chien
, Shyh-Yih Ma, Chun-Fu Shen, Liang-Gee Chen
:
Analysis and reduction of reference frames for motion estimation in MPEG-4 AVC/JVT/H.264. 809-812 - Shunan Lin, Anthony Vetro, Yao Wang

:
Rate-distortion analysis of the multiple description motion compensation video coding scheme. 813-816
Design and Implementation of Signal Processing Systems
- Adel Baganne, Imed Bennour, Mehrez Elmarzougui, Eric Martin:

A simulation based approach for incorporating virtual components IP cores into multimedia systems design. 817-820 - Atsushi Hatabu, Takashi Miyazaki, Ichiro Kuroda:

Optimization of decision-timing for early termination of SSDA-based block matching. 821-824 - Xiaojuan Hu, Linda DeBrunner, Victor E. DeBrunner:

Design of space-efficient, wide- and narrow transition-band, FIR filters. 825-828 - Duy Cuong Nguyen, Parham Aarabi, Ali Sheikholeslami:

Real-time sound localization using field-programmable gate arrays. 829-832 - Sang Yoon Park, Nam Ik Cho

:
Fixed point error analysis of CORDIC processor based on the variance propagation. 833-836 - Justin J. Song, Jian Li, Yen-Kuang Chen

:
Quality-delay-and-computation trade-off analysis of acoustic echo cancellation on general-purpose CPU. 837-840 - Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan, Hamid Reza Abutalebi

, Edmund C. Y. Tam, Peter Iles, Kar Wai Wong:
ETSI AMR-2 VAD: evaluation and ultra low-resource implementation. 841-844 - Daisuke Takahashi

:
A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method. 845-848 - Sung-Won Lee, In-Cheol Park:

Low-power hybrid structure of digital matched filters for direct sequence spread spectrum systems. 849-852
Volume 3
Theoretical Insights and Improvements for Multimodal Biometrics
- Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux, Sébastien Marcel:

Speech & face based biometric authentication at IDIAP. 1-4 - Julian Fiérrez-Aguilar

, Javier Ortega-Garcia
, Joaquin Gonzalez-Rodriguez
:
Fusion strategies in multimodal biometric verification. 5-8 - Upendra V. Chaudhari, Ganesh N. Ramaswamy, Gerasimos Potamianos, Chalapathy Neti:

Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction. 9-12 - Xiaoguang Lu, Yunhong Wang, Anil K. Jain:

Combining classifiers for face recognition. 13-16 - Arslan Brömme:

A classification of biometric signatures. 17-20
Summarization
- Michael G. Christel, Chang Huang:

Enhanced access to digital video through visually rich interfaces. 21-24 - Berna Erol, Dar-Shyang Lee, Jonathan J. Hull:

Multimodal summarization of meeting recordings. 25-28 - Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun:

Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models. 29-32 - Stefano Berretti, Alberto Del Bimbo, Pietro Pala:

Merging results of distributed image libraries. 33-36 - Rui Cai, Lie Lu

, Hong-Jiang Zhang, Lian-Hong Cai:
Highlight sound effects detection in audio stream. 37-40
Multistream Audio and Video Processing for Telepresence
- Douglas L. Jones:

Four-dimensional sound source recovery from arbitrary acoustic arrays. 41-44 - Qiong Liu, Don Kimber, Jonathan Foote, Chunyuan Liao:

Multichannel video/audio acquisition for immersive conferencing. 45-48 - Wolfgang Herbordt, Herbert Buchner, Walter Kellermann, Rudolf Rabenstein, Sascha Spors, Heinz Teutsch:

Full-duplex multichannel communication: real-time implementations in a general framework. 49-52 - Parham Aarabi, Bob Mungamuru:

Scene reconstruction using distributed microphone arrays. 53-56 - Ankur Mohan, Ramani Duraiswami, Dmitry N. Zotkin, Daniel DeMenthon, Larry S. Davis:

Using computer vision to generate customized spatial audio. 57-60
Video/Image tracking
- Takashi Yamamoto, Rama Chellappa:

Shape and motion driven particle filtering for human body tracking. 61-64 - Karthik Hariharakrishnan, Dan Schonfeld, Philippe Raffy, Fathy Yassa:

Object tracking using adaptive block matching. 65-68 - Gabriel Tsechpenakis, Kostas Rapantzikos, Nicolas Tsapatsoulis, Stefanos D. Kollias:

Object tracking in clutter and partial occlusion through rule-driven utilization of Snakes. 69-72 - Ofer Miller, Ety Navon, Amir Averbuch:

Tracking of moving objects based on graph edges similarity. 73-76 - Hao Jiang, Mark S. Drew:

Shadow-resistant tracking in video. 77-80
Multimedia Security and Content Protection IV
- Adnan Abdul-Aziz Gutub

, Mohammad K. Ibrahim:
High performance elliptic curve GF(2k) cryptoprocessor architecture for multimedia. 81-84 - Wei-Qi Yan, Mohan S. Kankanhalli:

Scrambling of engineering drawings. 85-88 - Mitsuru Kondo, Daigo Muramatsu, Masahiro Sasaki, Takashi Matsumoto:

Nonlinear separation of signature trajectories for on-line personal authentication. 89-92 - José Gabriel Rodríguez Carneiro Gomes, Mylène Christine Queiroz de Farias, Sanjit K. Mitra, Marco Carli:

An accurate billing mechanism for multimedia communications. 93-96 - Dipti Prasad Mukherjee, Subhamoy Maitra:

Robust buyer authentication scheme for multimedia object. 97-100 - Haiping Lu, Alex C. Kot, Susanto Rahardja:

Binary image watermarking through biased binarization. 101-104 - Suk Hwan Lee

, Tae-Su Kim, Byung-Ju Kim, Seong Geun Kwon, Ki-Ryong Kwon, Kuhn-Il Lee:
3D polygonal meshes watermarking using normal vector distributions. 105-108 - Nut Taesombut, Vineet Kumar, Rishi Dubey, P. Venkat Rangan:

Secure registration protocol for media appliances in wireless home networks. 109-112
Human Movement and Face Analysis
- Naresh P. Cuntoor, Amit A. Kale, Rama Chellappa:

Combining multiple evidences for gait recognition. 113-116 - Richard D. Green, Ling Guan:

Tracking human movement patterns using particle filtering. 117-120 - Jian Li, Shaohua Kevin Zhou, Chandra Shekhar:

A comparison of subspace analysis for face recognition. 121-124 - Jianyu Wang, Wen Gao, Shiguang Shan, XiaoPeng Hu:

Facial feature tracking combining model-based and model-free method. 125-128 - Shaohua Kevin Zhou, Rama Chellappa:

Simultaneous tracking and recognition of human faces from video. 129-132 - Gang Pan, Zhaohui Wu, Yunhe Pan:

Automatic 3D face verification from range data. 133-136 - Heng Liu, Shengye Yan, Xilin Chen, Wen Gao:

Rotated face detection in color images using radial template (RT). 137-140 - Xiujuan Chai, Shiguang Shan, Wen Gao, Bo Cao:

Novel example-based shape learning for fast face alignment. 141-144 - Do-Hyung Kim, Jaeyeon Lee, Jung Soh, YunKoo Chung:

Real-time face verification using multiple feature combination and a support vector machine supervisor. 145-148 - Wen Gao, Shiguang Shan, Xiujuan Chai, Xiaowei Fu:

Virtual face image generation for illumination and pose insensitive face recognition. 149-152
Image and Video Coding and Analysis
- Chengjie Tu, Trac D. Tran, Jie Liang:

Error resilient pre-/post-filtering for DCT-based block coding systems. 153-156 - Aysegul Cuhadar, Sinan Tasdoken:

Multiple arbitrary shape ROI coding with zerotree based wavelet coders. 157-160 - Marie Babel, Olivier Déforges:

Lossless and lossy minimal redundancy pyramidal decomposition for scalable image compression technique. 161-164 - Jari Korhonen, Ye Wang:

Schemes for error resilient streaming of perceptually coded audio. 165-168 - Stefano Belfiore, Marco Grangetto, Enrico Magli, Gabriella Olmo:

Spatio-temporal video error concealment with perceptually optimized mode selection. 169-172 - Son Lam Phung

, Douglas Chai, Abdesselam Bouzerdoum:
Adaptive skin segmentation in color images. 173-176 - Takuma Ishida, Shogo Muramatsu, Hisakazu Kikuchi, Tetsuro Kuge:

Invertible deinterlacing with variable coefficients and its lifting implementation. 177-180 - Namrata Vaswani, Amit K. Roy-Chowdhury, Rama Chellappa:

Statistical shape theory for activity modeling. 181-184 - John N. Carter, Pelopidas Lappas, Robert I. Damper:

Evidence-based object tracking via global energy maximization. 185-188 - Manoranjan Paul, M. Manzur Murshed, Laurence Dooley:

A new real-time pattern selection algorithm for very low bit-rate video coding focusing on moving regions. 189-192
Speech and Audio Processing IV
- Ye Wang, Jian Tang, Ali Ahmaniemi, Markus Vaalgamaa:

Parametric vector quantization for coding percussive sounds in music. 193-196 - Mukund Devarajan, Fansheng Meng, Penny Hix, Stephen A. Zahorian:

HMM-neural network monophone models for computer based articulation training for the hearing impaired. 197-200 - Suryakanth V. Gangashetty, C. Chandra Sekhar, B. Yegnanarayana:

Constraint satisfaction model for enhancement of evidence in recognition of consonant-vowel utterances. 201-204 - Daniel Garcia-Romero, Julian Fiérrez-Aguilar, Joaquin Gonzalez-Rodriguez

, Javier Ortega-Garcia
:
Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech. 205-208 - Takanobu Nishiura, Masato Nakayama, Satoshi Nakamura:

An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition. 209-212 - Jianhua Tao, Xing Ni:

Auditive learning based Chinese F0 prediction. 213-216 - Justinian P. Rosca, Radu V. Balan, Christophe Beaugeant:

Multi-channel psychoacoustically motivated speech enhancement. 217-220 - Jin Li:

A progressive to lossless embedded audio coder (PLEAC) with reversible modulated lapped transform. 221-224
Signal Processing and Testing in Multimodal Biometrics
- Frank Zoebisch, Claus Vielhauer:

A test tool to support brute-force online and offline signature forgery tests on mobile devices. 225-228 - Marios Savvides, Krithika Venkataramani, B. V. K. Vijaya Kumar

:
Incremental updating of advanced correlation filters for biometric authentication systems. 229-232 - Ziyou Xiong, Yunqiang Chen, Roy Wang, Thomas S. Huang:

A real time automatic access control system based on face and eye corners detection, face recognition and speaker identification. 233-236 - Umut Uludag, Anil K. Jain:

Multimedia content protection via biometrics-based encryption. 237-240 - Enrico Grosso, Massimo Tistarelli:

On testing methods for biometric authentication. 241-244
Multimedia Coding and Transport
- Narasinha Kamat, Ju Wang, Jonathan C. L. Liu:

A delay-efficient rerouting scheme for VoIP traffic. 245-248 - Xiaofei Liao, Hai Jin:

A new cluster-based distributed video recorder server. 249-252 - Zhihua Chen, Bobby Bodenheimer, J. Fritz Barnes:

Extending progressive meshes for use over unreliable networks. 253-256 - Christian Bachmeir, Peter Tabery, Serdar Uzumcu, Eckehard G. Steinbach:

A scalable virtual programmable real-time testbed for rapid multimedia service creation and evaluation. 257-260 - Bulent Cavusoglu, Dan Schonfeld, Rashid Ansari:

Real-time adaptive forward error correction for MPEG-2 video communications over RTP networks. 261-264
Multimedia Standards
- Chun-Chuan Yang, Chih-Wen Tien, Yung-Chi Wang:

Modeling of the non-deterministic synchronization behaviors in SMIL2.0 documents. 265-268 - Zaher Aghbari, Akifumi Makinouchi:

Extending MPEG-7 description scheme of moving regions by the semantic visual-spatio-temporal relationships. 269-272 - Jason Lukasiak, David Stirling, Nick Harders, Shane Perrow:

Performance of MPEG-7 low level audio descriptors with compressed data. 273-276 - Yick Ming Yeung, Oscar C. Au, Andy Chang:

Efficient rate control technique for JPEG2000 image coding using priority scanning. 277-280 - Jae-Gon Kim, Yong Wang, Shih-Fu Chang:

Content-adaptive utility-based video adaptation. 281-284
Face Analysis and Modeling
- Haitao Wang, Hong Wei, Yangsheng Wang:

Face representation under different illumination conditions. 285-288 - A-Nasser Ansari, Mohamed Abdel-Mottaleb:

3D face modeling using two orthogonal views and a generic face model. 289-292 - Chong Luo, Tat-Seng Chua, Teck Khim Ng:

Face tracking in video with hybrid of Lucas-Kanade and condensation algorithm. 293-296 - Xin Fan, Qi Zhang, Dequn Liang, Ling Zhao:

Face image restoration based on statistical prior and image blur measure. 297-300 - Yao-Hong Tsai, Yea-Shuan Huang:

Fast hierarchical face detection. 301-304
Segmentation, Summarization, and Structuring
- Ichiro Ide, Hiroshi Mo, Norio Katayama, Shin'ichi Satoh:

Topic-based inter-video structuring of a large-scale news video corpus. 305-308 - Ewa Kijak, Guillaume Gravier, Patrick Gros, Lionel Oisel, Frédéric Bimbot:

HMM based structuring of tennis videos using visual and audio cues. 309-312 - Lionel Brunel, Pierre Mathieu:

Fast method of segmentation and indexing MPEG1-2 flow. 313-316 - Yue Zhang, Mario A. Nascimento, Osmar R. Zaïane:

Building image mosaics: an application of content-based image retrieval. 317-320 - Wenli Zhang, Xiaomeng Wu, Shunsuke Kamijo, Masao Sakauchi:

A proposal for a video content generation support system and its application. 321-324 - Yan Liu, John R. Kender:

Fast scene segmentation using multi-level feature selection. 325-328 - Jek Charlson So Yu, Mohan S. Kankanhalli, Philippe Mulhem:

Semantic video summarization in compressed domain MPEG video. 329-332 - Xingquan Zhu, Xindong Wu:

Sequential association mining for video summarization. 333-336 - Eliza Yingzi Du, Chein-I Chang, Paul D. Thouin:

An unsupervised approach to color video thresholding. 337-340 - Darren E. Butler, Sridha Sridharan, V. Michael Bove Jr.:

Real-time adaptive background segmentation. 341-344
Rate Control and Packet Classification for Transmission
- Enrico Masala

, Juan Carlos De Martin:
Analysis-by-synthesis distortion computation for rate-distortion optimized multimedia streaming. 345-348 - Yuh-Ching Wang, Jin-Jang Leou:

A rate control scheme for H.26L video transmission. 349-352 - Mei-Ling Shyu, Shu-Ching Chen, Hongli Luo:

Ensuring fairness in multimedia multicast streaming with optimal rate allocation and client buffer utilization. 353-356 - S. R. Subramanya, Jagannathan Sarangapani, Mingsheng Peng:

A scheme for fair, rate-based end-to-end congestion control of multimedia traffic in packet switched networks. 357-360 - Chi-Wah Wong, Oscar C. Au, Bojun Meng, Hong-Kwai Lam:

Perceptual rate control for low-delay video communications. 361-364 - Mei-Ling Shyu, Shu-Ching Chen, Hongli Luo:

Per-class queue management and adaptive packet drop mechanism for multimedia networking. 365-368 - Davide Quaglia, Juan Carlos De Martin:

Adaptive packet classification for constant perceptual quality of service delivery of video streams over time-varying networks. 369-372 - Qiang Liu, Jenq-Neng Hwang:

End-to-end available bandwidth estimation and time measurement adjustment for multimedia QOS. 373-376 - Lifeng Zhao, C.-C. Jay Kuo:

Buffer-constrained R-D optimized rate control for video coding. 377-380
Audio Signal Processing
- Dmitry N. Zotkin, Shihab A. Shamma, Powen Ru, Ramani Duraiswami, Larry S. Davis:

Pitch and timbre manipulations using cortical representation of sound. 381-384 - Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo:

Multidimensional humming transcription using a statistical approach for query by humming systems. 385-388 - Arvindh Krishnaswamy:

Application of pitch tracking to South Indian classical music. 389-392 - Mohammed Raad, Alfred Mertins, Ian S. Burnett:

Scalable to lossless audio compression based on perceptual set partitioning in hierarchical trees (PSPIHT). 393-396 - Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:

Comparing MFCC and MPEG-7 audio features for feature extraction, maximum likelihood HMM and entropic prior HMM for sports audio classification. 397-400 - Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:

Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework. 401-404 - Lie Lu, Yi Mao, Liu Wenyin, Hong-Jiang Zhang:

Audio restoration by constrained audio texture synthesis. 405-408 - Tetsuro Kitahara, Masataka Goto

, Hiroshi G. Okuno
:
Musical instrument identification based on F0-dependent multivariate normal distribution. 409-412 - Dreten De Koning, Werner Verhelst:

On psychoacoustic noise shaping for audio requantization. 413-416
Architecture, Implementation, and Design
- Nicolas Ventroux, Jean-François Nezan, Mickaël Raulet, Olivier Déforges:

Rapid prototyping for an optimized MPEG4 decoder implementation over a parallel heterogeneous architecture. 417-420 - Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Liang-Gee Chen:

Hardware oriented rate control algorithm and implementation for realtime video coding. 421-424 - Ho-Man Tang, Michael R. Lyu, Irwin King:

Face recognition committee machine. 425-428 - Shantanu Chakrabartty, Masakazu Yagi, Tadashi Shibata, Gert Cauwenberghs:

Robust cephalometric landmark identification using support vector machines. 429-432 - Richard Kuehnel, Yuke Wang:

A method of generating uniformly distributed sequences over [0, K], where K+1 is not a power of two. 433-436 - Anand Krishnamurthy, Yiyan Tang, Cathy Xu, Yuke Wang:

An efficient implementation of multi-prime RSA on DSP processor. 437-440 - Donglai Xu, Rui Gao, Hadj Batatia:

An improved parallel architecture fro MPEG-4 motion estimation in 3G mobile applications. 441-444 - Toshiyuki Yamane, Yasunao Katayama:

An ultra-fast Reed-Solomon decoder soft-IP with 8-error correcting capability. 445-448
Multimedia Technology in Bioinformatics
- Zuyi Wang, Sun-Yuan Kung, Junying Zhang, Javed I. Khan, Jianhua Xuan, Yue Joseph Wang:

Computational intelligence approach for gene expression data mining and classification. 449-452 - Harry Hochheiser, Eric H. Baehrecke, Stephen M. Mount, Ben Shneiderman:

Dynamic querying for pattern identification in microarray and genomic data. 453-456 - Sophia R. He, Edmond J. Breen, Sybille M. N. Hunt:

Proteomics: approaches and image analysis tools for drug discovery. 457-460 - Jinwook Seo, Marina Bakay, Po Zhao, Yi-Wen Chen, Priscilla Clarkson, Ben Shneiderman, Eric P. Hoffman:

Interactive color mosaic and dendrogram displays for signal/noise optimization in microarray data analysis. 461-464 - Per B. Hojte, Xiaoxing Wang:

Registering electrophoresis images for bioinformatics study of protein. 465-468
Video Analysis and Mining
- Dong-Jun Lan, Yufei Ma, Hong-Jiang Zhang:

A novel motion-based representation for video mining. 469-472 - Belle L. Tseng, Ching-Yung Lin, DongQing Zhang, John R. Smith:

Improved text overlay detection in videos using a fusion-based classifier. 473-476 - Chih-Yi Chiu, Shih-Pin Chao, Jui-Hsiang Chao, Wen-Yen Chang, Hsin-Chih Lin, Shi-Nine Yang:

Motion indexing and synthesis. 477-480 - Cees G. M. Snoek, Marcel Worring:

Time interval maximum entropy based event indexing in soccer video. 481-484 - Li-Qun Xu, Yongmin Li

:
Video classification using spatial-temporal features and PCA. 485-488
Multimedia Computing Systems and Appliances
- Ju Wang, Jonathan C. L. Liu, Yishu He:

Efficient buffering control for a software-only, high-level, high-profile, MPEG-2 decoder. 489-492 - Yan Zhu, Min-You Wu, Wei Shu:

Comparison study and evaluation of overlay multicast networks. 493-496 - Yoshitaka Nakamura, Hirozumi Yamaguchi, Akihito Hiromori, Keiichi Yasumoto, Teruo Higashino, Kenichi Taniguchi:

On designing end-user multicast for multiple video sources. 497-500 - Eugenio Costamagna, Lorenzo Favalli, Francesco Tarantola:

Characterization and modeling of campus-level IP network traffic. 501-504 - Stuart Goose, Rajanikanth Tanikella, Sreedhar Kodlahalli:

Attenuator: towards preserving the original appearance of large documents when rendered on small screen mobile devices. 505-508
Fast Algorithm for Video Processing
- Keman Yu, Jiangbo Lu, Jiang Li, Shipeng Li

:
Practical real-time video codec for mobile devices. 509-512 - Hyungjoon Kim, Yucel Altunbasak:

Low-complexity rate-distortion optimal macroblock mode selection for MPEG-like video coders. 513-516 - Hye-Yeon C. Tourapis, Alexis M. Tourapis:

Fast motion estimation within the H.264 codec. 517-520 - Bojun Meng, Oscar C. Au, Chi-Wah Wong, Hong-Kwai Lam:

Efficient intra-prediction mode selection for 4×4 blocks in H.264. 521-524 - Jun Xin, Ming-Ting Sun, Vincent Hsu:

Diversity-based fast block motion estimation. 525-528
Multimedia Human-Machine Interface and Interaction
- Yao-Jen Chang, Chao-Kuei Hsieh, Pei-Wei Hsu, Yung-Chang Chen:

Speech-assisted facial expression analysis and synthesis for virtual conferencing systems. 529-532 - Ashish Verma, Nitendra Rajput, L. Venkata Subramaniam:

Using viseme based acoustic models for speech driven lip synthesis. 533-536 - Atsuo Yoshitaka, Hirokazu Seki:

Detecting auditory information in concentration based on eye movement. 537-540 - Martin Zobl, Michael Geiger, Björn W. Schuller, Manfred K. Lang, Gerhard Rigoll:

A real-time system for hand gesture controlled operation of in-car devices. 541-544 - Olivier Pietquin

, Thierry Dutoit:
Aided design of finite-state dialogue management systems. 545-548 - Laurence Devillers, Lori Lamel, Ioana Vasilescu:

Emotion detection in task-oriented spoken dialogues. 549-552 - Nils Klarlund:

Editing by voice and the role of sequential symbol systems for improved human-to-computer information rates. 553-556 - Amarnag Subramanya, Raghunandan S. Kumaran, John N. Gowdy:

Real time eye tracking for human computer interfaces. 557-560 - Alper Kanak, Engin Erzin, Yucel Yemez, A. Murat Tekalp:

Joint audio-video processing for biometric speaker identification. 561-564 - Ying Li, Shrikanth S. Narayanan, C.-C. Jay Kuo:

Audiovisual-based adaptive speaker identification. 565-568
Algorithms and Architectures for Multimedia Communcations
- Sumit Roy, John Ankcorn, Susie J. Wee:

Architecture of a modular streaming media server for content delivery networks. 569-572 - Hideaki Ito, Teruo Fukumura:

A delivery method of videos with required minimum bandwidths. 573-576 - Shiang-Chun Liou, Hsuan-Chia Lu, Kuo-Hsien Yeh:

A capable location prediction and resource reservation scheme in wireless networks for multimedia. 577-580 - Yen-Chi Lee, Yucel Altunbasak, Russell M. Mersereau:

A drift-free motion-compensated predictive encoding technique for multiple description coding. 581-584 - Enrico Magli, Massimo Mancin, Luca Merello:

Low-complexity video compression for wireless sensor networks. 585-588 - Shuhua Peng, Xiaodong Liu, Qionghai Dai, Yu Cheng:

An improved RM algorithm for preventing streaming media tasks from starvation. 589-592 - Gaurav Harit, Santanu Chaudhury, Gaurav Garg, Pramod Kumar Sharma:

A framework for video representation and transcoding using appearance spaces. 593-596 - Andrea Cavallaro, Olivier Steiger, Touradj Ebrahimi:

Semantic segmentation and description for video transcoding. 597-600 - Tu-Chih Wang, Yu-Wen Huang, Hung-Chi Fang, Liang-Gee Chen:

Performance analysis of hardware oriented algorithm modification in H.264. 601-604
Speech Recognition and Enhancement
- Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang:

Frame-dependent multi-stream reliability indicators for audio-visual speech recognition. 605-608 - Hideki Banno, Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura:

In-car speech recognition using distributed microphones: adapting to automatically detected driving conditions. 609-612 - LiFeng Sang, Zhaohui Wu, Yingchun Yang, Wanfeng Zhang:

Automatic speaker recognition using dynamic Bayesian network. 613-616 - Phu Chien Nguyen, Masato Akagi, Tu Bao Ho:

Temporal decomposition: a promising approach to VQ-based speaker identification. 617-620 - Guillaume Lathoud, Iain McCowan:

Location based speaker segmentation. 621-624 - Shoichi Matsunaga, Atsunori Ogawa, Yoshikazu Yamaguchi, Akihiro Imamura:

Non-native English speech recognition using bilingual English lexicon and acoustic models. 625-628 - Guangji Shi, Parham Aarabi:

Robust digit recognition using phase-dependent time-frequency masking. 629-632 - Jounghoon Beh, Hanseok Ko

:
A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech. 633-636

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














