


ICASSP 2003: Hong Kong
- 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '03, Hong Kong, April 6-10, 2003. IEEE 2003, ISBN 0-7803-7663-3
Volume 1
Keynotes
- Nikil Jayant:
Pervasive broadband: opportunities for signal processing. 1 - Ya-Qin Zhang:
Advances in networked media - theory and practice. 2 - Georgios B. Giannakis:
Ultra-wideband communications: an idea whose time has come. 3
Acoustic Modeling for Robust ASR
- Bryan L. Pellom, Kadri Hacioglu:
Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task. 4-7 - Wei-Tyng Hong:
A discriminative and robust training algorithm for noisy speech recognition. 8-11 - Xiaodong Cui, Yifan Gong:
Variable parameter Gaussian mixture hidden Markov modeling for speech recognition. 12-15 - Takehito Utsuro, Yasuhiro Kodama, Tomohiro Watanabe, Hiromitsu Nishizaki, Seiichi Nakagawa:
Confidence of agreement among multiple LVCSR models and model combination by SVM. 16-19 - Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard:
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks. 20-23 - Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang:
Frame-dependent multi-stream reliability indicators for audio-visual speech recognition. 24-27
Language ID
- Sonia Parandekar, Katrin Kirchhoff:
Multi-stream language identification using data-driven dependency selection. 28-31 - A. K. V. Sai Jayram, V. Ramasubramanian, Thippur V. Sreenivas:
Language identification using parallel sub-word recognition. 32-35 - Qian-Rong Gu, Tadashi Shibata:
Speaker and text independent language identification using predictive error histogram vectors. 36-39 - Jean-Luc Rouas, Jérôme Farinas, François Pellegrino, Régine André-Obrecht:
Modeling prosody for language identification on read and spontaneous speech. 40-43 - Eddie Wong, Sridha Sridharan:
Three approaches to multilingual phone recognition. 44-47 - Jilei Tian, Janne Suontausta:
Scalable neural network based language identification from written text. 48-51
Novel Feature Extraction and Processing
- Panu Somervuo:
Experiments with linear and nonlinear feature transformations in HMM based phone recognition. 52-55 - Sunil Sivadas, Hynek Hermansky:
Generalized tandem feature extraction. 56-59 - Andrew C. Lindgren, Michael T. Johnson, Richard J. Povinelli:
Speech recognition using reconstructed phase space features. 60-63 - Bojana Gajic, Kuldip K. Paliwal:
Robust speech recognition using features based on zero crossings with peak amplitudes. 64-67 - Hema A. Murthy, Venkata Gadde:
The modified group delay function and its application to phoneme recognition. 68-71 - Jinfu Ni, Hisashi Kawai:
Tone feature extraction through parametric modeling and analysis-by-synthesis-based pattern matching. 72-75
Speech Enhancement I
- Jong Uk Kim, Sang-Gyun Kim, Chang D. Yoo:
The incorporation of masking threshold to subspace speech enhancement. 76-79 - Lee Lin, W. Harvey Holmes, Eliathamby Ambikairajah:
Subband noise estimation for speech enhancement using a perceptual Wiener filter. 80-83 - Justinian Rosca, Radu V. Balan, Christophe Beaugeant:
Multi-channel psychoacoustically motivated speech enhancement. 84-87 - Steven J. Rennie, Parham Aarabi, Trausti T. Kristjansson, Brendan J. Frey, Kannan Achan:
Robust variational speech separation using fewer microphones than speakers. 88-91 - Tomohiro Nakatani, Masato Miyoshi:
Blind dereverberation of single channel speech signal based on harmonic structure. 92-95 - Marcin Kuropatwinski, W. Bastiaan Kleijn:
Minimum mean square error estimation of speech short-term predictor parameters under noisy conditions. 96-99
Packet Loss and Channel Coding
- Jonas Lindblom, Per Hedelin:
Error protection and packet loss concealment based on a signal matched sinusoidal vocoder. 100-103 - Christoffer Asgaard Rødbro, Mads Græsbøll Christensen, Søren Vang Andersen, Søren Holdt Jensen:
Compressed domain packet loss concealment of sinusoidally coded speech. 104-107 - Philippe Gournay, François Rousseau, Roch Lefebvre:
Improved packet loss recovery using late frames for prediction-based speech coders. 108-111 - Costas S. Xydeas, Fotis Zafeiropoulos:
Model-based packet loss concealment for AMR coders. 112-115 - Moon-Keun Lee, Sung-Kyo Jung, Hong-Goo Kang, Young-Cheol Park, Dae Hee Youn:
A packet loss concealment algorithm based on time-scale modification for CELP-type speech coders. 116-119 - Anand D. Subramaniam, William R. Gardner, Bhaskar D. Rao:
Joint source-channel decoding of speech spectrum parameters over erasure channels using Gaussian mixture models. 120-123
Acoustic Modeling: Survey of New Techniques
- Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri:
Recognition method with parametric trajectory generated from mixture distribution HMMs. 124-127 - John W. McDonough, Alex Waibel:
Maximum mutual information speaker adapted training with semi-tied covariance matrices. 128-131 - Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Automatic complexity control for HLDA systems. 132-135 - Vlasios Doumpiotis, Stavros Tsakalidis, William Byrne:
Discriminative training for segmental minimum Bayes risk decoding. 136-139 - Tetsuji Ogawa, Tetsunori Kobayashi:
Hybrid modeling of PHMM and HMM for speech recognition. 140-143 - Sebastian Stüker, Tanja Schultz, Florian Metze, Alex Waibel:
Multilingual articulatory features. 144-147
Speech Modeling and Analysis
- Ashraf Alkhairy:
Mathematical models of vocal tract with distributed sources. 148-151 - Paavo Alku, Tom Bäckström:
All-pole modeling of wide-band speech with symmetric linear prediction. 152-155 - Karl Schnell, Arild Lacroix:
Generation of nasalized speech sounds based on branched tube models obtained from separate mouth and nose outputs. 156-159 - Mark Thomson, Simon Boland, Mike Wu, Julien Epps, Michael Smithers:
Decomposition of speech into voiced and unvoiced components based on a state-space signal model. 160-163 - Ramon Prieto, Sora Kim:
Time delay estimation and adaptive frame length iterations for noise robust pitch extraction. 164-167 - Yu Shi, Eric Chang:
Spectrogram-based formant tracking via particle filters. 168-171
New Methods for Speaker Recognition, Segmentation, and Implementation
- Masafumi Nishida, Tatsuya Kawahara:
Unsupervised speaker indexing using speaker model selection based on Bayesian information criterion. 172-175 - Guillaume Lathoud, Iain A. McCowan:
Location based speaker segmentation. 176-179 - Yassine Mami, Delphine Charlet:
Speaker identification by anchor models with PCA/LDA post-processing. 180-183 - Phu Chien Nguyen, Masato Akagi, Tu Bao Ho:
Temporal decomposition: a promising approach to VQ-based speaker identification. 184-187 - LiFeng Sang, Zhaohui Wu, Yingchun Yang, Wanfeng Zhang:
Automatic speaker recognition using dynamic Bayesian network. 188-191 - Chengyuan Ma, Eric Chang:
Comparison of discriminative training methods for speaker verification. 192-195
Large Vocabulary Speech Recognition
- Gustavo Hernández Ábrego, Xavier Menéndez-Pidal, Thomas Kemp, Katsuki Minamino, Helmut Lucke:
Automatic set-up for speech recognition engines based on merit optimization. 196-199 - Miroslav Novak, Radek Hampl, Pavel Krbec, Vladimír Bergl, Jan Sedivý:
Two-pass search strategy for large list recognition on embedded speech recognition platforms. 200-203 - Sabine Deligne, Lidia Mangu:
On the use of lattices for the automatic generation of pronunciations. 204-207 - Dimitra Vergyri, Andreas Stolcke, Venkata Ramana Rao Gadde, Luciana Ferrer, Elizabeth Shriberg:
Prosodic knowledge sources for automatic speech recognition. 208-211 - Jean-Luc Gauvain, Lori Lamel, Holger Schwenk, Gilles Adda, Langzhou Chen, Fabrice Lefèvre:
Conversational telephone speech recognition. 212-215 - Bhuvana Ramabhadran, Jing Huang, Michael Picheny:
Towards automatic transcription of large spoken archives - English ASR for the MALACH project. 216-219
Unsupervised Language Model Adaption
- Langzhou Chen, Jean-Luc Gauvain, Lori Lamel, Gilles Adda:
Unsupervised language model adaptation for broadcast news. 220-223 - Michiel Bacchiani, Brian Roark:
Unsupervised language model adaptation. 224-227 - Takaaki Hori, Daniel Willett, Yasuhiro Minami:
Language model adaptation using WFST-based speaking-style translation. 228-231 - Erwin Leeuwis, Marcello Federico, Mauro Cettolo:
Language modeling and transcription of the TED corpus lectures. 232-235 - Tadasuke Yokoyama, Takahiro Shinozaki, Koji Iwano, Sadaoki Furui:
Unsupervised class-based language model adaptation for spontaneous speech recognition. 236-239 - Wen Wang, Mary P. Harper, Andreas Stolcke:
The robustness of an almost-parsing language model given errorful training data. 240-243
Speech Synthesis Overview
- Jerome R. Bellegarda:
Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy. 244-247 - Matthias Eichner, Steffen Werner, Matthias Wolff, Rüdiger Hoffmann:
Towards spontaneous speech synthesis - LM based selection of pronunciation variants. 248-251 - Ki-Seung Lee, Jeongsu Kim:
Context-adaptive phone boundary refining for a TTS database. 252-255 - Hideki Kawahara, Hisami Matsui:
Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation. 256-259 - Matthew Lee, Mark J. T. Smith:
Spectral modification for digital singing voice synthesis using asymmetric generalized Gaussians. 260-263 - Min Chu, Hu Peng, Yong Zhao, Zhengyu Niu, Eric Chang:
Microsoft Mulan - a bilingual TTS system. 264-267
Spoken Language Understanding
- Yulan He, Steve J. Young:
Hidden vector state model for hierarchical semantic parsing. 268-271 - Anand Venkataraman, Luciana Ferrer, Andreas Stolcke, Elizabeth Shriberg:
Training a prosody-based dialog act tagger from unlabeled data. 272-275 - Gökhan Tür, Robert E. Schapire, Dilek Hakkani-Tür:
Active learning for spoken language understanding. 276-279 - Ciprian Chelba, Milind Mahajan, Alex Acero:
Speech utterance classification. 280-283 - Ye-Yi Wang, Alex Acero:
Concept acquisition in example-based grammar authoring. 284-287 - Juan M. Huerta, David M. Lubensky:
Graph-based representation and techniques for NLU application development. 288-291
Speaker Adaption
- Daniel Willett, Thomas Niesler, Erik McDermott, Yasuhiro Minami, Shigeru Katagiri:
Pervasive unsupervised adaptation for lecture speech transcription. 292-295 - Kyung-Tak Lee, Lynette Melnar, Jim Talley, Christian Wellekens:
Symbolic speaker adaptation with phone inventory expansion. 296-299 - Guo-Hong Ding, Bo Xu, Juha Iso-Sipilä, Yang Cao:
Fast speaker adaptation using triple diagonal and shared block diagonal transform matrices. 300-303 - Dong Kook Kim, Young Joon Kim, Woohyung Lim, Nam Soo Kim:
Online adaptation using transformation space model evolution. 304-307 - Bowen Zhou, John H. L. Hansen:
Discriminative acoustic model using eigenspace mapping for rapid speaker adaptation. 308-311 - Daniel Povey, Philip C. Woodland, Mark J. F. Gales:
Discriminative map for acoustic model adaptation. 312-315
Robust ASR in Mobile and Distributed Environments
- Richard C. Rose, Iker Arizmendi, Sarangarajan Parthasarathy:
An efficient framework for robust mobile speech recognition services. 316-319 - Luca Cristoforetti, Marco Matassoni, Maurizio Omologo, Piergiorgio Svaizer:
Use of parallel recognizers for robust in-car speech interaction. 320-323 - Hideki Banno, Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura:
In-car speech recognition using distributed microphones-adapting to automatically detected driving conditions. 324-327 - Kadri Hacioglu, Bryan L. Pellom:
A distributed architecture for robust automatic speech recognition. 328-331 - Jan Stadermann, Gerhard Rigoll:
Flexible feature extraction and HMM design for a hybrid distributed speech recognition system in noisy environments. 332-335 - Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
OOV-detection and channel error protection for distributed speech recognition over wireless networks. 336-339
Language Modelling and Large Vocabulary Recognition
- Shoichi Matsunaga, Atsunori Ogawa, Yoshikazu Yamaguchi, Akihiro Imamura:
Non-native English speech recognition using bilingual English lexicon and acoustic models. 340-343 - Katrin Kirchhoff, Jeff A. Bilmes, Sourin Das, Nicolae Duta, Melissa Egan, Gang Ji, Feng He, John Henderson, Daben Liu, Mohammed Noamany, Patrick Schone, Richard M. Schwartz, Dimitra Vergyri:
Novel approaches to Arabic speech recognition: report from the 2002 Johns-Hopkins Summer Workshop. 344-347 - Renato De Mori, Frédéric Béchet, Gérard Subsol, Dominique Massonié:
Dynamic scheduling of decoding processes for directory assistance. 348-351 - Cyril Allauzen, Mehryar Mohri:
Generalized optimization algorithm for speech recognition transducers. 352-355 - Diamantino Caseiro, Isabel Trancoso:
A tail-sharing WFST composition algorithm for large vocabulary speech recognition. 356-359 - Fabio Brugnara:
Context-dependent search in a context-independent network. 360-363 - Adam Janin, Don Baron, Jane Edwards, Dan Ellis, David Gelbart, Nelson Morgan, Barbara Peskin, Thilo Pfau, Elizabeth Shriberg, Andreas Stolcke, Chuck Wooters:
The ICSI Meeting Corpus. 364-367 - Máté Szarvas, Sadaoki Furui:
Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR. 368-371 - Ahmad Emami, Peng Xu, Frederick Jelinek:
Using a connectionist model in a syntactical based language model. 372-375 - Shaojun Wang, Dale Schuurmans, Fuchun Peng, Yunxin Zhao:
Semantic n-gram language modeling with the latent maximum entropy principle. 376-379 - Hong-Kwang Jeff Kuo, Chin-Hui Lee, Imed Zitouni, Eric Fosler-Lussier:
Minimum verification error training for topic verification. 380-383 - Tomonori Kikuchi, Sadaoki Furui, Chiori Hori:
Automatic speech summarization based on sentence extraction and compaction. 384-387 - Bhiksha Raj, Edward W. D. Whittaker:
Lossless compression of language model structure and word identifiers. 388-391
Feature Processing for Robust ASR
- Shingo Kuroiwa, Satoru Tsuge:
Blind equalization techniques for ETSI standard DSR front-end. 392-395 - Rita Singh, Bhiksha Raj:
Tracking noise via dynamical systems with a continuum of states. 396-399 - Ni-Chun Wang, Jeih-Weih Hung, Lin-Shan Lee:
Data-driven temporal filters based on multi-eigenvectors for robust features in speech recognition. 400-403 - Kam-keung Chu, Shu-hung Leung, Chun-Shing Yip:
Perceptually non-uniform spectral compression for noisy speech recognition. 404-407 - Michael L. Seltzer, Richard M. Stern:
Subband parameter optimization of microphone arrays for speech recognition in reverberant environments. 408-411 - Chuan Jia, Peng Ding, Bo Xu:
Sequential MAP estimation based speech feature enhancement for noise robust speech recognition. 412-415 - Peter Jancovic, Münevver Köküer, Fionn Murtagh:
Reliability-based estimation of the number of noisy features: application to model-order selection in the union models. 416-419 - Ji Ming, Francis Jack Smith:
A posterior union model for improved robust speech recognition in nonstationary noise. 420-423 - Françoise Beaufays, Daniel Boies, Mitch Weintraub, Qifeng Zhu:
Using speech/non-speech detection to bias recognition search on noisy data. 424-427 - Lingyun Gu, Jianbo Gao, A. G. Harris:
Endpoint detection in noisy environment using a Poincare recurrence metric. 428-431 - Izhak Shafran, Richard Rose:
Robust speech detection and segmentation for real-time ASR applications. 432-435 - Oh-Wook Kwon, Te-Won Lee:
Optimizing speech/non-speech classifier design using AdaBoost. 436-439
Speech Analysis
- Etan Fisher, Joseph Tabrikian, Shlomo Dubnov:
Generalized likelihood ratio test for voiced/unvoiced decision using the harmonic plus noise model. 440-443 - Ye Tian, Ji Wu, Zuoying Wang, Dajin Lu:
Fuzzy clustering and Bayesian information criterion based threshold estimation for robust voice activity detection. 444-447 - Om Deshmukh, Carol Y. Espy-Wilson:
A measure of aperiodicity and periodicity in speech. 448-451 - Pusadee Seresangtakul, Tomio Takara:
A generative model of fundamental frequency contours for polysyllabic words of Thai tones. 452-455 - Ching X. Xu, Yi Xu:
F0 perturbations by consonants and their implications on tone recognition. 456-459 - Wai C. Chu:
Gradient-descent based window optimization for linear prediction analysis. 460-463 - Issam Bazzi, Alex Acero, Li Deng:
An expectation maximization approach for formant tracking using a parameter-free non-linear predictor. 464-467 - Dong Wang, Lie Lu, Hong-Jiang Zhang:
Speech segmentation without speech recognition. 468-471 - Akemi Hoshino, Akio Yasuda:
The evaluation of Chinese aspiration sounds uttered by Japanese students using VOT and power. 472-475 - Dorel Picovici, Abdulhussain E. Mahdi:
Output-based objective speech quality measure using self-organizing map. 476-479 - Serdar Yildirim, Shrikanth S. Narayanan:
An information-theoretic analysis of developmental changes in speech. 480-483 - Patrick J. Clemins, Michael T. Johnson:
Application of speech recognition to African elephant (Loxodonta africana) vocalizations. 484-487
Speech Synthesis: Prosody
- Shaw-Hwa Hwang, Cheng-Yu Yeh:
An efficient text analyzer with prosody generator-driven approach for Mandarin text-to-speech. 488-491 - Sheng Zhao, Jianhua Tao, DanLing Jiang:
Chinese prosodic phrasing with extended features. 492-495 - Neng-Huang Pan, Ming-Shing Yu, Ming-Jer Wu:
A Mandarin intonation prediction model that can output real pitch patterns. 496-499 - Jianhua Tao, Xing Ni:
Auditive learning based Chinese F0 prediction. 500-503 - Tu Trong Do, Tomio Takara:
Precise tone generation for Vietnamese text-to-speech system. 504-507 - Haiping Li, Fangxin Chen, Li Qin Shen, Xijun Ma:
Trainable Cantonese/English dual language speech synthesis system. 508-511 - Wei-Chih Kuo, Xiang-Rui Zhong, Yih-Ru Wang, Sin-Horng Chen:
A high-performance Min-Nan/Taiwanese TTS system. 512-515 - Xijun Ma, Wei Zhang, Qin Shi, Weibin Zhu, Liqin Shen:
Automatic prosody labeling using both text and acoustic information. 516-519 - Pierluigi Salvo Rossi, Francesco Palmieri, Francesco Cutugno:
Inversion of F0 model for natural-sounding speech synthesis. 520-523 - Hans Kruschke, Andreas Koch:
Parameter extraction of a quantitative intonation model with wavelet analysis and evolutionary optimization. 524-527 - K. Sreenivasa Rao, B. Yegnanarayana:
Prosodic manipulation using instants of significant excitation. 528-531 - Rüdiger Hoffmann, Oliver Jokisch, Diane Hirschfeld, Guntram Strecha, Hans Kruschke, Ulrich Kordon, Uwe Koloska:
A multilingual TTS system with less than 1 Mbyte footprint for embedded applications. 532-535
Acoustic Adaption Techniques
- Mark J. F. Gales, Yuan Dong, Daniel Povey, Philip C. Woodland:
Porting: SwitchBoard to the VoiceMail task. 536-539 - Zhirong Wang, Tanja Schultz, Alex Waibel:
Comparison of acoustic model adaptation techniques on non-native speech. 540-543 - Denis Jouvet, Katarina Bartkova, Lionel Delphin-Poulat, Alexandre Ferrieux, Xavier Lamming, Jean Monné, Christophe Raix:
About improving recognition of spontaneously uttered French city-names. 544-547 - Gyucheol Jang, Sooyoung Woo, Minho Jin, Chang D. Yoo:
Improvements in speaker adaptation using weighted training. 548-551 - Tor André Myrvoll, Frank K. Soong:
Optimal clustering of multivariate normal distributions using divergence and its application to HMM adaptation. 552-555 - Xiaodong He, Wu Chou:
Minimum classification error linear regression for acoustic model adaptation of continuous density HMMs. 556-559 - Rohit Sinha, Srinivasan Umesh:
A method for compensation of Jacobian in speaker normalization. 560-563 - Eric H. C. Choi, Trym Holter, Julien Epps, Arun Gopalakrishnan:
Temporal structure constrained transformation for speaker adaptation. 564-567 - Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda:
Application of variational Bayesian estimation and clustering to acoustic model adaptation. 568-571 - Daben Liu, Francis Kubala:
Online speaker clustering. 572-575 - Yoshifumi Onishi, Ken-ichi Iso:
Speaker adaptation by hierarchical EigenVoice. 576-579 - Fabrice Lauri, Irina Illina, Dominique Fohr:
Combining EigenVoices and structural MLLR for speaker adaptation. 580-583
Spoken Language Systems and Confidence Measures
- Ananth Sankar, Su-Lin Wu:
Utterance verification based on statistics of phone-level confidence scores. 584-587 - Yassine Benayed, Dominique Fohr, Jean Paul Haton, Gérard Chollet:
Confidence measures for keyword spotting using support vector machines. 588-591 - Alberto Sanchís, Alfons Juan, Enrique Vidal:
Improving utterance verification using a smoothed naive Bayes model. 592-595 - Dilek Hakkani-Tür, Giuseppe Riccardi:
A general algorithm for word graph matrix decomposition. 596-599 - Ka-Yee Leung, Man-Hung Siu:
Phone level confidence measure using articulatory features. 600-603 - Ruhi Sarikaya, Yuqing Gao, Michael Picheny:
Word level confidence measurement using semantic features. 604-607 - Luciana Ferrer, Elizabeth Shriberg, Andreas Stolcke:
A prosody-based approach to end-of-utterance detection that does not require speech recognition. 608-611 - Mike Lincoln, Stephen Cox:
A comparison of language processing techniques for a constrained speech translation system. 612-615 - Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui:
Language model switching based on topic detection for dialog speech recognition. 616-619 - Stephen J. Cox:
Discriminative techniques in call routing. 620-623 - Chiori Hori, Takaaki Hori, Hideki Isozaki, Eisaku Maeda, Shigeru Katagiri, Sadaoki Furui:
Deriving disambiguous queries in a spoken interactive ODQA system. 624-627 - Corinna Cortes, Patrick Haffner, Mehryar Mohri:
Lattice kernels for spoken-dialog classification. 628-631 - Patrick Haffner, Gökhan Tür, Jerry H. Wright:
Optimizing SVMs for complex call classification. 632-635 - Fu-Hua Liu, Liang Gu, Yuqing Gao, Michael Picheny:
Use of statistical N-gram models in natural language generation for machine translation. 636-639
Speech Enhancement Including Applications to Robust ASR
- Florian Hilger, Hermann Ney, Olivier Siohan, Frank K. Soong:
Combining neighboring filter channels to improve quantile based histogram equalization. 640-643 - Umit H. Yapanel, Satya Dharanipragada:
Perceptual MVDR-based cepstral coefficients (PMCCs) for robust speech recognition. 644-647 - Jounghoon Beh, Hanseok Ko:
A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech. 648-651 - Vincent Barreaud, Irina Illina, Dominique Fohr:
On-line frame-synchronous compensation of non-stationary noise. 652-655 - Sirko Molau, Florian Hilger, Hermann Ney:
Feature space normalization in adverse acoustic conditions. 656-659 - Yifan Gong:
Model-space compensation of microphone and noise for speaker-independent speech recognition. 660-663 - Manuel J. Reyes Gomez, Bhiksha Raj, Dan Ellis:
Multi-channel source separation by factorial HMMs. 664-667 - Takanobu Nishiura, Masato Nakayama, Satoshi Nakamura:
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition. 668-671 - Li Deng, Jasha Droppo, Alex Acero:
Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data. 672-675 - Bradford W. Gillespie, Les E. Atlas:
Strategies for improving audible quality and speech recognition accuracy of reverberant speech. 676-679 - Peter Jax, Peter Vary:
Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model. 680-683 - Guangji Shi, Parham Aarabi:
Robust digit recognition using phase-dependent time-frequency masking. 684-687
Speech Synthesis: Segmental Modelling and Processing
- Fangxin Chen:
Syllable clustering and spectral discontinuity in syllable-based TTS systems. 688-691 - Christophe Blouin, Paul C. Bagshaw, Olivier Rosec:
A method of unit preselection for speech synthesis based on acoustic clustering and decision trees. 692-695 - Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano:
Segment selection considering local degradation of naturalness in concatenative speech synthesis. 696-699 - David Dorran, Robert Lawlor, Eugene Coyle:
High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA). 700-703 - Xu Shao, Ben Milner:
Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model. 704-707 - Ellen Eide, Andrew Aaron, Raimo Bakis, Paul S. Cohen, Robert E. Donovan, Wael Hamza, T. Mathes, Michael Picheny, M. Polkosky, M. Smith, Mahesh Viswanathan:
Recent improvements to the IBM trainable speech synthesis system. 708-711 - Qin Yan, Saeed Vaseghi:
Analysis, modelling and synthesis of formants of British, American and Australian accents. 712-715 - Junichi Yamagishi, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
A training method for average voice model based on shared decision tree context clustering and speaker adaptive training. 716-719 - Arun Kumar, Ashish Verma:
Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts. 720-723 - Emir Turajlic, Dimitrios Rentzos, Saeed Vaseghi, Ching-Hsiang Ho:
Evaluation of methods for parametric formant transformation in voice conversion. 724-727 - Yingying Xu, Hao Tang, Peiren Zhang:
An advanced text-to-speech server system based on SOAP protocol. 728-731 - Hao Tang, Bo Yin, Ren-Hua Wang:
Study on distributed speech synthesis system. 732-735
Acoustic Modeling of Coarticulation, Lexical and Task Information
- Georg Stemmer, Viktor Zeißler, Christian Hacker, Elmar Nöth, Heinrich Niemann:
A phone recognizer helps to recognize words better. 736-739 - Hiroyuki Suzuki, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Speech recognition using voice-characteristic-dependent acoustic models. 740-743 - Jian-Lai Zhou, Frank Seide, Li Deng:
Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM - model and training. 744-747 - Frank Seide, Jian-Lai Zhou, Li Deng:
Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM - MAP decoding and evaluation. 748-751 - Yanli Zheng, Mark Hasegawa-Johnson:
Acoustic segmentation using switching state Kalman filter. 752-755 - Chak-Fai Li, Man-Hung Siu:
An efficient incremental likelihood evaluation for polynomial trajectory model with application to model training and recognition. 756-759 - Pascale Fung, Yi Liu:
Triphone model reconstruction for Mandarin pronunciation variations. 760-763 - Supphanat Kanokphara, Virongrong Tesprasit, Rachod Thongprasirt:
Pronunciation variation speech recognition without dictionary modification on sparse database. 764-767 - Pieter Nel, Johan A. du Preez:
Automatic syllabification using hierarchical hidden Markov models. 768-771 - Abhinav Sethy, Shrikanth S. Narayanan:
Split-lexicon based hierarchical recognition of speech using syllable and word level acoustic units. 772-775 - Jinsong Zhang, Keikichi Hirose, Satoshi Nakamura:
A multilevel framework to model the inherently confounding nature of sentential F0 contours for recognizing Chinese lexical tones. 776-779 - Andrej Ljolje:
Multiple task-domain acoustic models. 780-783
Speech Coding and Speech Analysis
- Changchun Bao:
Harmonic excitation LPC (HE-LPC) speech coding at 2.3 kb/s. 784-787 - Mu-Liang Wang, Jar-Ferr Yang:
Complexity reduced shape VQ of spectral envelope with perception consideration. 788-791 - Geneviève Baudoin, Fadi El Chami:
Corpus based very low bit rate speech coding. 792-795 - Ranniery Maia, Ricardo J. da R. Cirigliano, Daniel Rojtenberg, Fernando Gil Vianna Resende Jr.:
Mixed-excited phonetic vocoding at 265 bps. 796-799 - Takahiro Hoshiya, Shinji Sako, Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Improving the performance of HMM-based very low bit rate speech coding. 800-803 - Christian H. Ritz, Ian S. Burnett, Jason Lukasiak:
Low bit rate wideband WI speech coding. 804-807 - Su Yang, Zongge Li, Yan-Qiu Chen:
A fractal based voice activity detector for Internet telephone. 808-811 - Dhany Arifianto, Takao Kobayashi:
IFAS-based voiced/unvoiced classification of speech signal. 812-815 - Sumit Basu:
A linked-HMM model for robust voicing and speech detection. 816-819 - Arthur P. Lobo, Philipos C. Loizou:
Voiced/unvoiced speech discrimination in noise using Gabor atomic decomposition. 820-823 - Peter Kabal:
Ill-conditioning and bandwidth expansion in linear prediction of speech. 824-827 - Davor Petrinovic:
Discrete weighted mean square all-pole modeling. 828-831
Feature-Oriented Acoustic Modeling
- Xiang Li, Richard M. Stern:
Training of stream weights for the decoding of speech using parallel feature streams. 832-835 - Yimin Zhang, Qian Diao, Shan Huang, Wei Hu, Chris D. Bartels, Jeff A. Bilmes:
DBN based multi-stream models for speech. 836-839 - Konstantin Markov, Satoshi Nakamura:
Hybrid HMM/BN LVCSR system integrating multiple acoustic features. 840-843 - S. S. Airey, Mark J. F. Gales:
Product of Gaussians and multiple stream systems. 844-847 - Karthik Visweswariah, Peder A. Olsen, Ramesh Gopinath, Scott Axelrod:
Maximum likelihood training of subspaces for inverse covariance modeling. 848-851 - Vincent Vanhoucke, Ananth Sankar:
Mixtures of inverse covariances. 852-855 - Satya Dharanipragada, Karthik Visweswariah:
Covariance and precision modeling in shared multiple subspaces. 856-859 - Peng Ding, Shuwu Zhang, Bo Xu:
Comparison and study of some variants of partially tied covariance modeling. 860-863 - Scott Axelrod, Ramesh Gopinath, Peder A. Olsen, Karthik Visweswariah:
Dimensional reduction, covariance modeling, and computational complexity in ASR systems. 864-867 - Alain Biem:
Optimizing features and models using the minimum classification error criterion. 868-871 - Leo J. Lee, Hagai Attias, Li Deng:
Variational inference and learning for segmental switching state space models of hidden speech dynamics. 872-875 - Rong Zhang, Alexander I. Rudnicky:
Improving the performance of an LVCSR system through ensembles of acoustic models. 876-879
Speech Enhancement II
- Thomas Lotter, Christian Benien, Peter Vary:
Multichannel speech enhancement using Bayesian spectral amplitude estimation. 880-883 - Erik M. Visser, Te-Won Lee:
Speech enhancement using blind source separation and two-channel energy based speaker detection. 884-887 - Masashi Unoki, Masashi Furukawa, Keigo Sakata, Masato Akagi:
A method based on the MTF concept for dereverberating the power envelope from the reverberant signal. 888-891 - Mingyang Wu, DeLiang Wang:
A one-microphone algorithm for reverberant speech enhancement. 892-895 - Colin Breithaupt, Rainer Martin:
MMSE estimation of magnitude-squared DFT coefficients with superGaussian priors. 896-899 - Chang Huai You, SooNgee Koh, Susanto Rahardja:
Adaptive β-order MMSE estimation for speech enhancement. 900-903 - Marcel Gabrea:
Double affine projection algorithm-based speech enhancement algorithm. 904-907 - Sharon Gannot, Israel Cohen:
Speech enhancement based on the general transfer function GSC and postfiltering. 908-911 - Hong Cai, Éric Grivel, Mohamed Najim:
A dual Kalman filter-based smoother for speech enhancement. 912-915 - Masanori Kato, Akihiko Sugiyama, Masahiro Serizawa:
A family of 3GPP-standard noise suppressors for the AMR codec and the evaluation results. 916-919 - Michael T. Johnson, Andrew C. Lindgren, Richard J. Povinelli, Xiaolong Yuan:
Performance of nonlinear speech enhancement using phase space reconstruction. 920-923 - John-Paul Hosom, Alexander Kain, Taniya Mishra, Jan P. H. van Santen, Melanie Fried-Oken, Janice Staehely:
Intelligibility of modifications to dysarthric speech. 924-928
Volume 2
Feature Extraction Techniques and Applications
- Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition. 1-4 - Hugo Meinedo, João Paulo Neto:
Audio segmentation, classification and clustering in a broadcast news task. 5-8 - Tin Lay Nwe, Say Wei Foo, Liyanage C. De Silva:
Classification of stress in speech using linear and nonlinear features. 9-12 - Aldebaro Klautau:
Mining speech: automatic selection of heterogeneous features using boosting. 13-16 - Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht:
A fusion study in speech/music classification. 17-20 - Tarek Abu-Amer, Julie Carson-Berndsen:
Multi-linear HMM based system for articulatory feature extraction. 21-24 - Takashi Fukuda, Wataru Yamamoto, Tsuneo Nitta:
Distinctive phonetic feature extraction for robust speech recognition. 25-28 - Kim Foong Chow, Shiang Chen Liew, Kim-Teng Lua:
Thin client front-end processor for distributed speech recognition. 29-32 - Mohamed Chetouani, Bruno Gas, Jean-Luc Zarader:
Modular neural predictive coding for discriminative feature extraction. 33-36 - Changxue Ma:
Novel robust feature extraction based on spectrally masked channel energy ratio (SMaChER) for speech recognition. 37-40 - Qin Li, Les E. Atlas:
Time-variant least squares harmonic modeling. 41-44 - Brian Mak, Yik-Cheung Tam, Roger Hsiao:
Discriminative training of auditory filters of different shapes for robust speech recognition. 45-48
Speaker Verification and Identification Systems
- Claude Barras, Jean-Luc Gauvain:
Feature and score normalization for speaker verification of cellular data. 49-52 - Douglas A. Reynolds:
Channel robust speaker verification via feature mapping. 53-56 - Tsuneo Kato, Tohru Shimizu:
Improved speaker verification over the cellular phone network using phoneme-balanced and digit-sequence-preserving connected digit patterns. 57-60 - Ganesh N. Ramaswamy, Jirí Navrátil, Upendra V. Chaudhari, Ran D. Zilca:
The IBM system for the NIST-2002 cellular speaker verification evaluation. 61-64 - Gurmeet Singh, Ashish Panda, Saurav Bhattacharyya, Thambipillai Srikanthan:
Vector quantization techniques for GMM based speaker verification. 65-68 - Mathieu Ben, Frédéric Bimbot:
D-MAP: a distance-normalized MAP estimation of speaker models for automatic speaker verification. 69-72 - Jiuqing Deng, Qixiu Hu:
Open set text-independent speaker recognition based on set-score pattern classification. 73-76 - Jean-François Bonastre, Sylvain Meignier, Téva Merlin:
Speaker detection using multi-speaker audio files for both enrollment and test. 77-80 - Ran D. Zilca, Jirí Navrátil, Ganesh N. Ramaswamy:
Depitch and the role of fundamental frequency in speaker recognition. 81-84 - Yvonne Moh, Patrick Nguyen, Jean-Claude Junqua:
Towards domain independent speaker clustering. 85-88 - Daniel Moraru, Sylvain Meignier, Laurent Besacier, Jean-François Bonastre, Ivan Magrin-Chagnolleau:
The ELISA consortium approaches in speaker segmentation during the NIST 2002 speaker recognition evaluation. 89-92 - Joaquín González-Rodríguez, Julian Fiérrez-Aguilar, Javier Ortega-Garcia:
Forensic identification reporting using automatic speaker recognition systems. 93-96
General Topics in Robust ASR
- Jian Wu, Qiang Huo:
Modelling uncertainty in stochastic vector mapping with minimum classification error training for robust speech recognition. 97-100 - Yuan-Fu Liao, Jeng-Shien Lin, Sin-Horng Chen:
A mismatch-aware stochastic matching algorithm for robust speech recognition. 101-104 - Febe de Wet, Johan de Veth, Bert Cranen, Louis Boves:
The impact of spectral and energy mismatch on the Aurora2 digit recognition task. 105-108 - Dusan Macho, Yan Ming Cheng:
On the use of wideband signal for noise robust ASR. 109-112 - Murat Akbacak, John H. L. Hansen:
Environmental sniffing: noise knowledge estimation for robust speech systems. 113-116 - Zhaobing Han, Shuwu Zhang, Huayun Zhang, Bo Xu:
A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR. 117-120 - Olivier Bellot, Driss Matrouf, Pascal Nocera, Georges Linarès, Jean-François Bonastre:
Structural speaker adaptation using maximum a posteriori approach and a Gaussian distributions merging technique. 121-124 - Xianxian Zhang, John H. L. Hansen:
CSA-BF: novel constrained switched adaptive beamforming for speech enhancement & recognition in real car environments. 125-128 - Ben Milner, Xu Shao:
Low bit-rate feature vector compression using transform coding and non-uniform bit allocation. 129-132 - Shajith Ikbal, Hemant Misra, Hervé Bourlard:
Phase autocorrelation (PAC) derived robust speech features. 133-136 - Diego Giuliani, Matteo Gerosa:
Investigating recognition of children's speech. 137-140 - Fernando Díaz-de-María, Jesús Vicente-Peña, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno:
Linear equalization of the modulation spectra: a novel approach for noisy speech recognition. 141-144
Speech Coding
- Fang-Chu Chen, I-Hsien Lee:
CELP based speech coding with fine granularity scalability. 145-148 - Gang Zhang, Keming Xie, Xueying Zhang, Liying Huangfu:
Optimizing gain codebook of LD-CELP. 149-152 - Jacek Stachurski, Alan McCree, Vishu Viswanathan, Ari Heikkinen, Anssi Rämö, Sakari Himanen, Peter Blöcher:
Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s. 153-156 - Houman Zarrinkoub, Paul Mermelstein:
Joint optimization of short-term and long-term predictors in CELP speech coders. 157-160 - Fredrik Nordén, Turaj Zakizadeh Shabestary, Per Hedelin:
Rate adjustable speech coding by lattice quantization. 161-164 - Christian Sturt, Stephane Villette, Ahmet M. Kondoz:
LSF quantisation for pitch synchronous speech coders. 165-168 - Miguel Arjona Ramírez:
A waveform extractor for scalable speech coding. 169-172 - Sung-Kyo Jung, Kyoung-Tae Kim, Hong-Goo Kang, Dae Hee Youn:
A cascaded algebraic codebook structure to improve the performance of speech coder. 173-176 - Sunil Lee, Seongho Seo, Dalwon Jang, Chang D. Yoo:
A novel transcoding algorithm for AMR and EVRC speech codecs via direct parameter transformation. 177-180 - Marcos Faúndez-Zanuy:
Wide band sub-band speech coding using nonlinear prediction. 181-184 - Kei Kikuiri, Nobuhiko Naka, Tomoyuki Ohya:
Super-frame based source controlled variable rate coding using approximated trellis diagram. 185-188 - José L. Pérez-Córdoba, Antonio M. Peinado, Victoria E. Sánchez, Antonio J. Rubio:
A study of joint source-channel coding of LSP parameters for wideband speech coding. 189-192
Speaker ID/Verification: Discriminative Methods and Multiple Speakers
- Ting-Yao Wu, Lie Lu, Ke Chen, Hong-Jiang Zhang:
UBM-based real-time speaker segmentation for broadcasting news. 193-196 - Takayuki Arai:
Estimating number of speakers by the modulation characteristics of speech. 197-200 - S. Krishnakumar, K. R. Prasanna Kumar, N. Balakrishnan:
Pitch maxima for robust speaker recognition. 201-204 - Yang Shao, DeLiang Wang:
Co-channel speaker identification using usable speech extraction based on multi-pitch tracking. 205-208 - William M. Campbell:
A SVM/HMM system for speaker recognition. 209-212 - Fabio Valente, Christian Wellekens:
Minimum classification error/eigenvoices training for speaker identification. 213-216 - Qi Li, Biing-Hwang Juang:
Fast discriminative training for sequential observations with application to speaker identification. 217-220 - Vincent Wan, Steve Renals:
SVMSVM: support vector machine speaker verification methodology. 221-224 - Mohamed Faouzi BenZeghiba, Hervé Bourlard:
Hybrid HMM/ANN and GMM combination for user-customized password speaker verification. 225-228 - Daniel Garcia-Romero, Julian Fiérrez-Aguilar, Joaquín González-Rodríguez, Javier Ortega-Garcia:
Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech. 229-232 - Chun-Nan Hsu, Hau-Chung Yu, Bo-Hou Yang:
Speaker verification without background speaker models. 233-236
Emerging Industrial Applications
- Bogong Su, Jian Wang, Erh-Wen Hu, Joseph B. Manzano:
De-pipeline a software-pipelined loop. 237-240 - Ajay Kumar:
Inspection of surface defects using optimal FIR filters. 241-244 - Xinhao Tian, Jing Lin, Ken R. Fyfe, Ming Jian Zuo:
Gearbox fault diagnosis using independent component analysis in the frequency domain and wavelet filtering. 245-248 - Jonathon C. Ralston, David W. Hainsworth, Ronald J. McPhee, David C. Reid, Chad O. Hargrave:
Application of signal processing technology for automatic underground coal mining machinery. 249-252
Biomedical and Biometric Technology I
- Aziz Umit Batur, Bruce E. Flinchbaugh, Monson H. Hayes III:
A DSP-based approach for the implementation of face recognition algorithms. 253-256 - Kumari L. Fernando, V. John Mathews, Michael W. Varner, Edward B. Clark:
Robust estimation of fetal heart rate variability using Doppler ultrasound. 257-260 - Ayman El-Baz, Aly A. Farag, Robert Falk, Renato La Rocca:
Automatic identification of lung abnormalities in chest spiral CT scans. 261-264 - Pega Zarjam, Mostefa Mesbah, Boualem Boashash:
Detection of newborn EEG seizure using optimal features based on discrete wavelet transform. 265-268 - Philip de Chazal, Richard B. Reilly:
Automatic classification of ECG beats using waveform shape and heart beat interval features. 269-272
Speech Recognition
- Yong-Beom Lee, John R. Deller Jr.:
Heuristic structural modifications to the HMM for efficient resource utilization. 273-276 - Astrid Hagen, João Paulo Neto:
Multi-stream processing using context-independent and context-dependent hybrid systems. 277-280 - Sergey Astrov, Josef G. Bauer, Sorel Stan:
High performance speaker and vocabulary independent ASR technology for mobile phones. 281-284 - Say Wei Foo, Liang Dong:
A boosted multi-HMM classifier for recognition of visual speech elements. 285-288 - Claudio Eccher, Lorenzo Eccher, Daniele Falavigna, Luca Nardelli, Marco Orlandi, Andrea Sboner:
On the usage of automatic voice recognition in an interactive Web based medical application. 289-292 - Xuan Zhu, Yining Chen, Jia Liu, Runsheng Liu:
A novel efficient decoding algorithm for CDHMM-based speech recognizer on chip. 293-296
DSP Architectures
- Toshiyuki Yamane, Yasunao Katayama:
An ultra-fast Reed-Solomon decoder soft-IP with 8-error correcting capability. 297-300 - Jeff H. Derby, Jaime H. Moreno:
A high-performance embedded DSP core with novel SIMD features. 301-304 - Nigel C. Paver, Bradley C. Aldrich, Moinul H. Khan:
Intel® wireless MMX™ technology: a 64-bit SIMD architecture for mobile multimedia. 305-308 - Bipul Das, Swapna Banerjee:
A low complexity architecture for complex discrete wavelet transform. 309-312 - Kar-Lik Wong, Nigel P. Topham:
High performance IDCT realization using complex arithmetic. 313-316 - Mark Rygh, Jeff Fratus, Kevin Lee, Syed Husaini, Vidya Premkumar, Konstantinos Konstantinides:
A DVD processor with dual CPUs and integrated digital front-end for advanced DVD-based consumer appliances. 317-320
Communication Technologies
- Koushik Maharatna, Eckhard Grass, Ulrich Jagdhold:
A novel 64-point FFT/IFFT processor for IEEE 802.11(a) standard. 321-324 - Heping Ding:
Sub-channel below the perceptual threshold in audio. 325-328 - Alexander R. Wright, Patrick A. Naylor:
I/Q mismatch compensation in zero-IF OFDM receivers with application to DAB. 329-332 - Milos Krstic, Alfonso Troya, Koushik Maharatna, Eckhard Grass:
Optimized low-power synchronizer design for the IEEE 802.11a standard. 333-336 - Jim Chou, Kannan Ramchandran, Daniel Grobe Sachs, Douglas L. Jones:
Audio data hiding with application to surround sound. 337-340 - Andrew Fort, Jan-Willem Weijers, Veerle Derudder, Wolfgang Eberle, André Bourdoux:
A performance and complexity comparison of auto-correlation and cross-correlation for OFDM burst synchronization. 341-344
Biomedical and Biometric Technology II
- Heng-Da Cheng, Jingli Wang:
Fuzzy logic and scale space approach to microcalcification detection. 345-348 - Mitsuru Kondo, Daigo Muramatsu, Masahiro Sasaki, Takashi Matsumoto:
Nonlinear separation of signature trajectories for on-line personal authentication. 349-352 - Do-Hyung Kim, Jaeyeon Lee, Jung Soh, YunKoo Chung:
Real-time face verification using multiple feature combination and a support vector machine supervisor. 353-356 - John K. Mell, Donald A. Jordan, Yuping Xiao, Yibin Zheng, Joseph G. Akar, David E. Haines:
Wavelet analysis of atrial fibrillation electrograms. 357-360 - Gail L. Rosen, Jeffrey D. Moore:
Investigation of coding structure in DNA. 361-364 - Tanveer Fathima Syeda-Mahmood:
Detecting salient changes in genomic signals. 365-368 - Mukund Devarajan, Fansheng Meng, Penny Hix, Stephen A. Zahorian:
HMM-neural network monophone models for computer-based articulation training for the hearing impaired. 369-372 - Abed Elhamid Lawabni, Ahmed H. Tewfik:
Detection and screening of sleep apnea using spectral and time domain analysis of heart rate variability. 373-376 - Alper Kanak, Engin Erzin, Yücel Yemez, A. Murat Tekalp:
Joint audio-video processing for biometric speaker identification. 377-380 - Guoqin Cui, Wen Gao:
SVMs for few examples-based face recognition. 381-384 - Wan Mimi Diyana, Julie Larcher, Rosli Besar:
A comparison of clustered microcalcifications automated detection methods in digital mammogram. 385-388 - Hamid Hassanpour, Mostefa Mesbah, Boualem Boashash:
Comparative performance of time-frequency based newborn EEG seizure detection using spike signatures. 389-392
Defense, Tracking and Security Applications
- Jung-Chieh Chen, Ching-Shyang Maa, Jiunn-Tsair Chen:
Factor graphs for mobile position location. 393-396 - Anindya Sao Paul, Arnab K. Shaw, Koel Das, Atindra Mitra:
Improved HRR-ATR using hybridization of HMM and eigen-template-matched filtering. 397-400 - LipChen Alex Chan, Sandor Z. Der, Nasser M. Nasrabadi:
Improved target detector for FLIR imagery. 401-404 - Kun Lu, Jiong Wang, Xingzhao Liu:
A piecewise parametric method based on polynomial phase model to compensate ionospheric phase contamination. 405-408 - Mukesh A. Zaveri, Uday B. Desai, S. N. Merchant:
Tracking multiple maneuvering point targets using multiple filter bank in infrared image sequence. 409-412 - Anand Krishnamurthy, Yiyan Tang, Cathy Xu, Yuke Wang:
An efficient implementation of multi-prime RSA on DSP processor. 413-416 - Heather Yu:
Scalable encryption for multimedia content access control. 417-420 - Kaliappan Gopalan:
Audio steganography using bit modification. 421-424 - Pei Jung Chung, William J. J. Roberts:
Recursive estimation of K-distribution parameters. 425-428 - Ping Han, Renbiao Wu, Yunhong Wang, Zhaohua Wang:
An efficient SAR ATR approach. 429-432
Radio, Telephony and Television
- Nicolas Ventroux, Jean-François Nezan, Mickaël Raulet, Olivier Déforges:
Rapid prototyping for an optimized MPEG-4 decoder implementation over a parallel heterogenous architecture. 433-436 - Azzédine Touzni, Haosong Fu, Mark Fimoff, Wayne Bretl:
Enhanced 8-VSB transmission for North-American HDTV terrestrial broadcast. 437-440 - Ligang Lu, Vadim Sheinin:
Real-time MPEG video coding with information look-ahead. 441-444 - Jon Arnold, Adrian Caldow, Kevin Harman:
A reconfigurable 100 Mchip/s spread spectrum receiver. 445-448 - Wen Xu, Matthias Marke:
On determining soft output of the cellular text telephone modem (CTM) demodulator. 449-452 - Van-Tam Nguyen, Patrick Loumeau, Jean-François Naviner:
Temporal and spectral analysis of time interleaved high pass sigma delta converter. 453-456 - Vasyl Semenov, Alexander Kalyuzhny, Alexander Kovtonyuk:
Efficient calculation of line spectral frequencies based on new method for solution of transcendental equations. 457-460 - Yeqing Qian, Qi Li, Tianren Yao:
Analysis of different predistortion structures and efficient least-square adaptive algorithms. 461-464 - Jung-Min Choi, Jung Su Kim, Jae Hong Park, Jong-Wha Chong:
Fast Kalman/LMS algorithms on the strong multipath channel. 465-468 - Jianfeng Chen, Louis Shue, Hanwu Sun:
A pseudo adaptive microphone array. 469-472 - Bharath Siravara, Mohamed M. Mansour, Randy Cole, Neeraj Magotra:
Comparative study of wideband single reference active noise cancellation algorithms on a fixed-point DSP. 473-476
High Performance Video and Image Processing Architectures
- Magnus Nilsson, Chaminda Weerasinghe, Serge Lichman, Yu Shi, Igor Kharitonenko:
Design and implementation of a CMOS sensor based video camera incorporating a combined AWB/AEC module. 477-480 - Yijun Li, Ramy E. Aly, Magdy A. Bayoumi, Samia A. Mashali:
Parallel high-speed architecture for EBCOT in JPEG2000. 481-484 - Shinsuke Kobayashi, Kentaro Mita, Yoshinori Takeuchi, Masaharu Imai:
Rapid prototyping of JPEG encoder using the ASIP development system: PEAS-III. 485-488 - Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Liang-Gee Chen:
Hardware oriented rate control algorithm and implementation for realtime video coding. 489-492 - Tu-Chih Wang, Yu-Wen Huang, Hung-Chi Fang, Liang-Gee Chen:
Performance analysis of hardware oriented algorithm modifications in H.264. 493-496 - Joseph R. Cavallaro, Mani Vaya:
Viturbo: a reconfigurable architecture for Viterbi and turbo decoding. 497-500
High Performance DSP Computational Kernels
- Markus Püschel:
Cooley-Tukey FFT like algorithms for the DCT. 501-504 - Tapio Saramäki, Mrinmoy Bhattacharya:
Multiplierless realization of recursive digital filters using allpass structures. 505-508 - Ji-Suk Park, Byeong-Kuk Kim, Jin-Gyun Chung, Keshab K. Parhi:
An asynchronous sample-rate converter from CD to DAT. 509-512 - Jinxin Hao, Gang Li:
An improved stability measure for digital filter implementation. 513-516 - Mrinmoy Bhattacharya, Tapio Saramäki:
Some observations leading to multiplierless implementation of linear phase FIR filters. 517-520 - Seonil Choi, Gokul Govindu, Ju-wook Jang, Viktor K. Prasanna:
Energy-efficient and parameterized designs for fast Fourier transform on FPGAs. 521-524
Design Methods for Optimized DSP Architectures
- Adel Baganne, Imed Bennour, Mehrez Elmarzougui, Eric Martin:
A simulation based approach for incorporating virtual components IP cores into multimedia systems design. 525-528 - Changchun Shi, Robert W. Brodersen:
An automated floating-point to fixed-point conversion methodology. 529-532 - Atsushi Hatabu, Takashi Miyazaki, Ichiro Kuroda:
Optimization of decision-timing for early termination of SSDA-based block matching. 533-536 - Franz Franchetti, Markus Püschel:
Short vector code generation and adaptation for DSP algorithms. 537-540 - Aca Gacic, Markus Püschel, José M. F. Moura:
Fast automatic software implementations of FIR filters. 541-544 - Joseph Yeh, John Wawrzynek:
Quality based compute-resource allocation in real-time signal processing. 545-548
Performance Evaluation and Design Methods for DSP Systems
- Jia Wang, Jun Sun, Songyu Yu:
1-D and 2-D transforms from integers to integers. 549-552 - Finbarr O'Regan, Conor Heneghan:
Algorithmic analysis and implementation of a novel natural gradient adaptive filter for sparse systems. 553-556 - Zixue Zhao, Gang Li:
Comparative study of the generalized DFIIt structure and its equivalent state-space realization. 557-560 - Claire Fang Fang, Tsuhan Chen, Rob A. Rutenbar:
Floating-point error analysis based on affine arithmetic. 561-564 - Sang Yoon Park, Nam Ik Cho:
Fixed point error analysis of CORDIC processor based on the variance propagation. 565-568 - Xiaojuan Hu, Linda DeBrunner, Victor E. DeBrunner:
Design of space-efficient, wide- and narrow transition-band, FIR filters. 569-572 - Duy Cuong Nguyen, Parham Aarabi, Ali Sheikholeslami:
Real-time sound localization using field-programmable gate arrays. 573-576 - Victor E. DeBrunner, Ewa Matusiak:
An algorithm to reduce the complexity required to convolve finite length sequences using the Hirschman optimal transform (HOT). 577-580 - Abdsamad Benkrid, Khaled Benkrid, Danny Crookes:
A novel approach for diminishing and predicting the error dynamic range in finite wordlength FIR based architectures. 581-584 - Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan, Hamid Reza Abutalebi, Edmund C. Y. Tam, Peter Iles, Kar Wai Wong:
ETSI AMR-2 VAD: evaluation and ultra low-resource implementation. 585-588 - Miodrag Bolic, Petar M. Djuric, Sangjin Hong:
New resampling algorithms for particle filters. 589-592 - Justin J. Song, Jian Li, Yen-Kuang Chen:
Quality-delay-and-computation trade-off analysis of acoustic echo cancellation on general-purpose CPU. 593-596
Innovative DSP Systems and Applications
- Joe C. Chen, Len Yip, Hanbiao Wang, Daniela Maniezzo, Ralph E. Hudson, Jeremy Elson, Kung Yao, Deborah Estrin:
DSP implementation of a distributed acoustical beamformer on a wireless sensor platform. 597-600 - Scott Morrison, Jeremy S. Parks, Karl S. Gugel:
A high-performance multi-purpose DSP architecture for signal processing research. 601-604 - Margarita Cabrera, Xavier Castell, Rafael Montoliu:
Crack detection system based on spectral analysis of a ultrasonic resonance signals. 605-608 - Zhaohui Liu, John V. McCanny:
Implementation of adaptive beamforming based on QR decomposition for CDMA. 609-612 - Michael J. Thul, Frank Gilbert, Norbert Wehn:
Concurrent interleaving architectures for high-throughput channel coding. 613-616 - Amine Bermak, Dominique Martinez:
A very high density VLSI implementation of threshold network ensembles (TNE). 617-620 - Taeksang Hwang, Wonyong Sung:
Implementation of a digital copier using TMS320C6414 VLIW DSP processor. 621-624 - Adnan Abdul-Aziz Gutub, Mohammad K. Ibrahim:
High radix parallel architecture for GF(p) elliptic curve processor. 625-628 - Zhongfeng Wang, Keshab K. Parhi:
Efficient interleaver memory architectures for serial turbo decoding. 629-632 - Frank Kienle, Gerd Kreiselmaier, Norbert Wehn:
VLSI-implementation issues of turbo trellis-coded modulation. 633-636 - Marco Liem, Otto Manck:
Architecture of a single chip acoustic echo and noise canceller using cross spectral estimation. 637-640 - Chiman Kwan, Zhubing Ren, Roger Xu, Leonard Haynes, Vernon Lenz:
High performance VOX prototype development and experimental results. 641-644
IPS and Architectures for DSP Applications
- Oscal T.-C. Chen, Nan-Ying Shen, Chih-Chien Shen:
A low-power multiplication accumulation calculation unit for multimedia applications. 645-648 - Jeremy Johnson, Xu Xu:
A recursive implementation of the dimensionless FFT. 649-652 - Zhi-Xiu Lin, An-Yeu Wu:
Mixed-scaling-rotation CORDIC (MSR-CORDIC) algorithm and architecture for scaling-free high-performance rotational operations. 653-656 - Eric Tell, Mikael Olausson, Dake Liu:
A general DSP processor at the cost of 23K gates and 1/2 a man-year design time. 657-660 - Jiangmin Gu, Chip-Hong Chang:
Low voltage, low power (5: 2) compressor cell for fast arithmetic circuits. 661-664 - Daisuke Takahashi:
A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method. 665-668 - Bogdan J. Falkowski, Cheng Fu:
Fastest linearly independent arithmetic transforms over GF(3). 669-672 - An-Yeu Wu, I-Hsien Lee, Cheng-Shing Wu:
Angle quantization approach for lattice IIR filter implementation and its trellis de-allocation algorithm. 673-676 - Hyugjin Kwon, Jihong Kim:
A low-power image convolution algorithm for variable voltage processors. 677-680 - John Dunlop, Albert Simpson, Shahid Masud, Moira Wylie, Jonathan Cochrane, Roger Kinkead:
Semiconductor IP core for ultra low power MPEG-4 video decode in system-on-silicon. 681-684 - Sung-Won Lee, In-Cheol Park:
Low-power hybrid structure of digital matched filters for direct sequence spread spectrum systems. 685-688 - Donglai Xu, Rui Gao, Hadj Batatia:
An improved parallel architecture for MPEG-4 motion estimation in 3G mobile applications. 689-692
Neural Models and Systems
- Gen Hori:
A general framework for SVD flows and joint SVD flows. 693-696 - Deniz Erdogmus, Yadunandana N. Rao, M. Can Ozturk, Luis Vielva, José C. Príncipe:
On the convergence of SIPEX: a simultaneous principal components extraction algorithm. 697-700 - Joaquin Quiñonero Candela, Agathe Girard, Jan Larsen, Carl Edward Rasmussen:
Propagation of uncertainty in Bayesian kernel models - application to multiple-step ahead forecasting. 701-704 - Andrew I. Hanna, Ian Yates, Danilo P. Mandic:
Analysis of the class of complex-valued error adaptive normalised nonlinear gradient descent algorithms. 705-708 - Arthur Gretton, Frédéric Desobry:
On-line one-class support vector machines. An application to signal segmentation. 709-712 - Erik McDermott, Shigeru Katagiri:
A new formalization of minimum classification error using a Parzen estimate of classification chance. 713-716
Blind Source Separation and Independent Component Analysis
- Vince D. Calhoun, Tülay Adali:
Complex ICA for fMRI analysis: performance of several approaches. 717-720 - Seungjin Choi:
Differential learning and random walk model. 721-724 - Mirko Knaak, Shoko Araki, Shoji Makino:
Geometrically constraint ICA for convolutive mixtures of sound. 725-728 - Scott C. Douglas, Sun-Yuan Kung:
A nonlinear recursive least-squares algorithm for the blind separation of finite-alphabet sources. 729-732 - Konstantinos I. Diamantaras, Theophilos Papadimitriou:
Blind signal separation using oriented PCA neural models. 733-736 - Ignacio Santamaría, Jesús Ibáñez, Luis Vielva, Carlos Pantaleón:
Blind equalization of constant modulus signals via support vector regression. 737-740
Neural Networks for Speech Processing
- Hemant Misra, Hervé Bourlard, Vivek Tyagi:
New entropy based combination rules in HMM/ANN multi-stream ASR. 741-744 - Man-Wai Mak, Ming-Cheung Cheung, Sun-Yuan Kung:
Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation. 745-748 - Guoning Hu, DeLiang Wang:
Separation of stop consonants. 749-752 - Suryakanth V. Gangashetty, C. Chandra Sekhar, B. Yegnanarayana:
Constraint satisfaction model for enhancement of evidence in recognition of consonant-vowel utterances. 753-756 - Francis F. Li, Trevor J. Cox:
A neural network for blind identification of speech transmission index. 757-760 - Eros Pasero, Alfonso Montuori:
Neural network based arithmetic coding for real-time audio transmission on the TMS320C6000 DSP platform. 761-764
Architectures and Applications of Neural Networks
- Kai-Pui Lam, Sui-Tung Mak:
An FPGA-based eigenfilter using fast Hebbian learning. 765-768 - Stefan Winter, Hiroshi Sawada, Shoji Makino:
Geometrical understanding of the PCA subspace method for overdetermined blind source separation. 769-772 - Rajai El Dajani, Maryvonne Miquel, Pierre Maison-Blanche, Paul Rubel:
Time series prediction using parametric models and multilayer perceptrons: case study on heart signals. 773-776 - Takaya Soma, Kuniaki Yosui, Takashi Matsumoto:
Reconstructions and predictions of nonlinear dynamical systems by Rao-Blackwellised sequential Monte Carlo. 777-780 - Jerónimo Arenas-García, Fernando Pérez-Cruz:
Multi-class support vector machines: a new approach. 781-784 - Anne-Sophie Capelle, Christine Fernandez-Maloigne, Olivier Colot:
Introduction of spatial information within the context of evidence theory. 785-788 - Stefano Squartini, Amir Hussain, Francesco Piazza:
A recurrent multiscale architecture for long-term memory prediction task. 789-792 - Ali A. Hasan, Mohammed A. Hasan:
Constrained gradient descent and line search for solving optimization problem with elliptic constraints. 793-796 - De-Shuang Huang, Horace H. S. Ip:
Finding the maximum modulus roots of polynomials based on constrained neural networks. 797-800 - Richard Kuehnel, Yuke Wang:
A method of generating uniformly distributed sequences over [0, K], where K+1 is not a power of two. 801-804 - Artur Wróblewski, Thomas Erl, Josef A. Nossek:
Bireciprocal lattice wave digital filters with almost linear phase response. 805-808
Neural Networks for Pattern Recognition and Image Processing
- David J. Miller, John Browning:
A mixture model and EM algorithm for robust classification, outlier rejection, and class discovery. 809-812 - Sung-Jung Cho, Michael Perrone, Eugene H. Ratzlaff:
EM mixture model probability table compression. 813-816 - Roongroj Nopsuwanchai, Alain Biem:
Discriminative training of tied mixture density HMMs for online handwritten digit recognition. 817-820 - Songfeng Zheng, Xiaofeng Lu, Nanning Zheng, Weipu Xu:
Unsupervised clustering based reduced support vector machines. 821-824 - Shantanu Chakrabartty, Masakazu Yagi, Tadashi Shibata, Gert Cauwenberghs:
Robust cephalometric landmark identification using support vector machines. 825-828 - Xiaofeng Lu, Songfeng Zheng, Nanning Zheng, Weixiang Liu:
Learning features from examples for face detection. 829-832 - S. Palanivel, B. S. Venkatesh, B. Yegnanarayana:
Real time face recognition system using autoassociative neural network models. 833-836 - Ho-Man Tang, Michael R. Lyu, Irwin King:
Face recognition committee machine. 837-840 - Chunrong Yuan, Heinrich Niemann:
Appearance-based neural image processing for 3-D object recognition and localization. 841-844 - Tat-Seng Chua, HuaMin Feng, A. Chandrashekhara:
An unified framework for shot boundary detection via active learning. 845-848 - Heng-Da Cheng, Muyi Cui:
Mass lesion detection with a fuzzy neural network. 849-852
Volume 3
Image and Video Indexing and Retrieval
- Paisarn Muneesawang, Ling Guan:
Automatic relevance feedback for video retrieval. 1-4 - Wing Ho Leung, Tsuhan Chen:
Retrieval of hand-drawn sketches with partial matching. 5-8 - Wai-Pak Choi, Kin-Man Lam, Wan-Chi Siu:
Maximal disk based histogram for shape retrieval. 9-12 - Bin Luo, Richard C. Wilson, Edwin R. Hancock:
Spectral method for learning structural variations in graphs. 13-16 - Fariborz Mahmoudi, Jamshid Shanbehzadeh, Amir-Masoud Eftekhari-Moghadam, Hamid Soltanian-Zadeh:
A new non-segmentation shape-based image indexing method. 17-20 - Rong Yan, Yan Liu, Rong Jin, Alexander G. Hauptmann:
On predicting rare classes with SVM ensembles in scene classification. 21-24
Human Movement Analysis and Tracking
- Richard D. Green, Ling Guan:
Tracking human movement patterns using particle filtering. 25-28 - Jose Juarez Gonzalez, Ik Soo Lim, Pascal Fua, Daniel Thalmann:
Robust tracking and segmentation of human motion in an image sequence. 29-32 - Naresh P. Cuntoor, Amit A. Kale, Rama Chellappa:
Combining multiple evidences for gait recognition. 33-36 - Yang Ran, Qinfen Zheng:
Multi moving people detection from binocular sequences. 37-40 - R. Venkatesh Babu, K. R. Ramakrishnan:
Compressed domain human motion recognition using motion history information. 41-44 - Henry C. Tan, Ruwan Janapriya, Liyanage C. De Silva:
An automatic system for multiple human tracking and actions recognition in office environment. 45-48
Watermarking I
- Qiang Cheng, Yingge Wang, Thomas S. Huang:
How to design efficient watermarks? 49-52 - Alexia Briassouli, Pierre Moulin:
Detection-theoretic analysis of warping attacks in spread-spectrum watermarking. 53-56 - Micheal Mullarkey, Neil J. Hurley, Guenole C. M. Silvestre, Teddy Furon:
Application of side-informed embedding and polynomial detection to audio watermarking. 57-60 - Jin S. Seo, Jaap Haitsma, Ton Kalker, Chang D. Yoo:
Affine transform resilient image fingerprinting. 61-64 - Tie Liu, Pierre Moulin:
Error exponents for one-bit watermarking. 65-68 - John Barr, Brett Bradley, Brett T. Hannigan:
Using digital watermarks with image signatures to mitigate the threat of the copy attack. 69-72
Video Coding I
- Claudia Mayer:
Motion compensated in-band prediction for wavelet-based spatially scalable video coding. 73-76 - Randa Atta, Mohammed Ghanbari:
A layered video coding scheme with its optimum bit allocation. 77-80 - Mihaela van der Schaar, Deepak S. Turaga:
Unconstrained motion compensated temporal filtering (UMCTF) framework for wavelet video coding. 81-84 - Zhenzhong Chen, King Ngi Ngan:
Improved single video object rate control for MPEG-4. 85-88 - Lifeng Zhao, C.-C. Jay Kuo:
Buffer-constrained R-D optimized rate control for video coding. 89-92 - Zhen Li, Feng Wu, Shipeng Li, Edward J. Delp:
Wavelet video coding via a spatially adaptive lifting structure. 93-96
Image and Video Interpolation
- Xiqun Lu, Paul S. Hong, Mark J. T. Smith:
An efficient directional image interpolation method. 97-100 - Hussein A. Aly, Eric Dubois:
Crafting the observation model for regularized image up-sampling. 101-104 - Takuma Ishida, Shogo Muramatsu, Hisakazu Kikuchi, Tetsuro Kuge:
Invertible deinterlacing with variable coefficients and its lifting implementation. 105-108 - Hasan F. Ates, Michael T. Orchard:
Image interpolation using wavelet-based contour estimation. 109-112 - Tien-Ying Kuo, Lin-Ying Chuang:
Fast global motion-compensated frame interpolator for very low-bit-rate video quality enhancement. 113-116 - Hezerul Abdul Karim, Michel Bister, Mohammad Umar Siddiqi:
Low rate video frame interpolation - challenges and solution. 117-120
Face Recognition
- Jian Li, Shaohua Kevin Zhou, Chandra Shekhar:
A comparison of subspace analysis for face recognition. 121-124 - Juwei Lu, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos:
Regularized D-LDA for face recognition. 125-128 - Xiaogang Wang, Xiaoou Tang:
An improved Bayesian face recognition algorithm in PCA subspace. 129-132 - Juhua Zhu, Bede Liu, Stuart C. Schwartz:
General illumination correction and its application to face normalization. 133-136 - Alberto Albiol, Luis Torres, Edward J. Delp:
The indexing of persons in news sequences using audio-visual data. 137-140 - Haitao Wang, Yangsheng Wang:
Recognizing face images under different lighting conditions. 141-144
Motion Estimation
- Yu-Wen Huang, Bing-Yu Hsieh, Tu-Chih Wang, Shao-Yi Chien, Shyh-Yih Ma, Chun-Fu Shen, Liang-Gee Chen:
Analysis and reduction of reference frames for motion estimation in MPEG-4 AVC/JVT/H.264. 145-148 - Shiloh L. Dockstader, Nikita S. Imennov, A. Murat Tekalp:
Stochastic modeling of motion tracking failures. 149-152 - Yui-Lam Chan, Wan-Chi Siu:
An adaptive partial distortion search for block motion estimation. 153-156 - Ingo Stuke, Til Aach, Cicero Mota, Erhardt Barth:
Linear and regularized solutions for multiple motions. 157-160 - Mingren Shi, Victor Solo:
Empirical choice of smoothing parameters in optical flow with correlated errors. 161-164 - Jesús Chamorro-Martínez, Joaquín Fernández-Valdivia, Jose A. García, Javier Martinez-Baena:
A frequency-domain approach for the extraction of motion patterns. 165-168
Video Summarization
- Baoxin Li, Hao Pan, M. Ibrahim Sezan:
A general framework for sports video summarization with its application to soccer. 169-172 - Ahmet Ekin, A. Murat Tekalp:
Shot type classification by dominant color for sports video segmentation and summarization. 173-176 - Hsuan-Wei Chen, Jin-Hau Kuo, Jen-Hao Yeh, Ja-Ling Wu:
A multi-modal-feature based algorithm for parsing news program videos. 177-180 - Zuzana Cernekova, Constantine Kotropoulos, Ioannis Pitas:
Video shot segmentation using singular value decomposition. 181-184 - Kongwah Wan, Joo-Hwee Lim, Changsheng Xu, Xinguo Yu:
Real-time camera field-view tracking in soccer video. 185-188 - Min Xu, Ling-Yu Duan, Changsheng Xu, Qi Tian:
A fusion scheme of visual and auditory modalities for event detection in sports video. 189-192
Face Analysis
- Gang Pan, Zhaohui Wu, Yunhe Pan:
Automatic 3D face verification from range data. 193-196 - José Luis Landabaso, Montse Pardàs, Antonio Bonafonte:
HMM recognition of expressions in unrestrained video intervals. 197-200 - Wen-Shiung Chen, Shang-Yuan Yuan:
A novel personal biometric authentication technique using human iris based on fractal dimension features. 201-204 - Jianyu Wang, Wen Gao, Shiguang Shan, XiaoPeng Hu:
Facial feature tracking combining model-based and model-free method. 205-208 - Jun Wang, Radhakrishna S. V. Achanta, Mohan S. Kankanhalli, Philippe Mulhem:
A hierarchical framework for face tracking using state vector fusion for compressed video. 209-212 - Heng Liu, Shengye Yan, Xilin Chen, Wen Gao:
Rotated face detection in color images using radial template (RT). 213-216 - Shi-Lin Wang, Wing Hong Lau, Shu-hung Leung:
A new real-time lip contour extraction algorithm. 217-220 - Ying Guo, Geoff Poulton, Jiaming Li, Mark Hedley, Rong-yu Qiao:
Soft margin AdaBoost for face pose classification. 221-224 - Shaohua Kevin Zhou, Rama Chellappa:
Simultaneous tracking and recognition of human faces from video. 225-228 - Xiujuan Chai, Shiguang Shan, Wen Gao, Bo Cao:
Novel example-based shape learning for fast face alignment. 229-232 - Norman Poh Hoon Thian, Sébastien Marcel, Samy Bengio:
Improving face authentication using virtual samples. 233-236
Lossless and Lossy Image Coding
- Guang Deng, Hua Ye:
A general framework for the second-level adaptive prediction. 237-240 - Giovanni Motta, Francesco Rizzo, James A. Storer:
Partitioned vector quantization: application to lossless compression of hyperspectral images. 241-244 - Mehmet Utku Celik, A. Murat Tekalp, Gaurav Sharma:
Level-embedded lossless image compression. 245-248 - Marie Babel, Olivier Déforges:
Lossless and lossy minimal redundancy pyramidal decomposition for scalable image compression technique. 249-252 - Ahmed Abu-Hajar, Ravi Sankar:
Enhanced partial-SPIHT for lossless and lossy image compression. 253-256 - Aysegül Çuhadar, Sinan Tasdoken:
Multiple, arbitrary shape ROI coding with zerotree based wavelet coders. 257-260 - Yick Ming Yeung, Oscar C. Au, Andy Chang:
Successive bit-plane rate allocation technique for JPEG2000 image coding. 261-264 - Chi-Keung Fong, Wai-kuen Cham:
An improved edge-model based representation and its application in image post-processing. 265-268 - Wenhuan Xu, Asoke K. Nandi, Jihong Zhang:
A new fuzzy reinforcement learning vector quantization algorithm for image compression. 269-272 - Chengjie Tu, Trac D. Tran, Jie Liang:
Error resilient pre-/post-filtering for DCT-based block coding systems. 273-276 - Xingsong Hou, Guizhong Liu, Yiyang Zou:
Embedded quadtree-based image compression in DCT domain. 277-280 - Yu Hen Hu, Rajas A. Sambhare:
Constrained texture synthesis for image post processing. 281-284
Multidimensional Signal Processing Theory and Methods
- Mats T. Andersson, Hans Knutsson:
Transformation of local spatio-temporal structure tensor fields. 285-288 - Steven M. Kay, Christopher P. Carbone:
Vector space solution to the multidimensional Yule-Walker equations. 289-292 - Weixiang Liu, Nanning Zheng, Xiaofeng Lu:
Non-negative matrix factorization for visual coding. 293-296 - Erik G. Miller:
A new class of entropy estimators for multi-dimensional densities. 297-300 - Dimitri Van De Ville, Thierry Blu, Michael Unser:
Recursive filtering for splines on hexagonal lattices. 301-304 - Ilya Pollak, Jeffrey Mark Siskind, Mary P. Harper, Charles A. Bouman:
Modeling and estimation of spatial random trees with application to image classification. 305-308 - Ngai-Fong Law, Wan-Chi Siu:
A fast and efficient computational structure for the 2D over-complete wavelet transform. 309-312 - Arlene A. Cole-Rhodes, Abake Adenle:
Automatic image registration by stochastic optimization of mutual information. 313-316 - Subrata Rakshit, Malay Kumar Nema:
Symmetric residue pyramids - an extension of Burt Laplacian pyramids. 317-320 - Takao Hinamoto, Keisuke Higashi, Wu-Sheng Lu:
Jointly optimized error feedback and realization for roundoff noise minimization in two-dimensional state-space digital filters. 321-324 - Eva Dejnozková, Petr Dokládal:
A parallel algorithm for solving the Eikonal equation. 325-328 - Florent Perronnin, Jean-Luc Dugelay, Kenneth Rose:
Iterative decoding of two-dimensional hidden Markov models. 329-332
Image and Video Segmentation
- Yuan Been Chen, Oscal T.-C. Chen:
Robust fully-automatic segmentation based on modified edge-following technique. 333-336 - Mathias Ortner, Xavier Descombes, Josiane Zerubia:
Building extraction from digital elevation models. 337-340 - Gouchol Pok, Jyh-Charn Liu, Keun Ho Ryu:
Fast estimation of the number of texture segments using cooccurrence statistics. 341-344 - Qixiang Ye, Wen Gao, Wei Zeng:
Color image segmentation using density-based clustering. 345-348 - Darren E. Butler, Sridha Sridharan, V. Michael Bove Jr.:
Real-time adaptive background segmentation. 349-352 - Son Lam Phung, Douglas Chai, Abdesselam Bouzerdoum:
Adaptive skin segmentation in color images. 353-356 - Soo-Chang Pei, Jian-Jiun Ding:
The generalized radial Hilbert transform and its applications to 2D edge detection (any direction or specified directions). 357-360 - Shunsuke Kamijo, Masao Sakauchi:
Segmentation of vehicles and pedestrians in traffic scene by spatio-temporal Markov random field model. 361-364 - Hanfeng Chen, Feihu Qi, Su Zhang:
Supervised video object segmentation using a small number of interactions. 365-368 - Su Zhang, Hanfeng Chen, Zheru Chi, Pengfei Shi:
An algorithm for segmenting moving vehicles. 369-372 - Eliza Yingzi Du, Chein-I Chang:
An unsupervised approach to color video thresholding. 373-376 - Day-Fann Shen, Ming-Tsong Huang:
A watershed-based image segmentation using JND property. 377-380
Video Coding II
- Lorenzo Granai, Fulvio Moschetti, Pierre Vandergheynst:
Ridgelet transform applied to motion compensated images. 381-384 - Huipin Zhang, Frank Bossen:
A heuristic search method of adaptive interpolation filters in motion compensated predictive video coding. 385-388 - Bojun Meng, Oscar C. Au:
Fast intra-prediction mode selection for 4×4 blocks in H.264. 389-392 - Habibollah Danyali, Alfred Mertins:
Fully scalable texture coding of arbitrarily shaped video objects. 393-396 - Manoranjan Paul, M. Manzur Murshed, Laurence Dooley:
A new real-time pattern selection algorithm for very low bit-rate video coding focusing on moving regions. 397-400 - Shunan Lin, Anthony Vetro, Yao Wang:
Rate-distortion analysis of the multiple description motion compensation video coding scheme. 401-404 - Sadaatsu Kato, Kazuo Sugimoto, Satoru Adachi, Minoru Etoh:
Structured "truncated Golomb code" for context-based adaptive VLC. 405-408 - Ee Ping Ong, Hua Wang, Ping Xue:
Video coding based on true motion estimation. 409-412 - Andy Chang, Oscar C. Au, Yick Ming Yeung:
A novel approach to fast multi-frame selection for H.264 video coding. 413-416 - Yiannis Andreopoulos, Mihaela van der Schaar, Adrian Munteanu, Joeri Barbarien, Peter Schelkens, Jan Cornelis:
Fully-scalable wavelet video coding using in-band motion compensated temporal filtering. 417-420 - Lap-Pui Chau, Xuan Jing:
Efficient three-step search algorithm for block motion estimation in video coding. 421-424 - Hsi-Tzeng Chan, Chung-Lin Huang:
Multiple description and matching pursuit coding for video transmission over the Internet. 425-428
Image Processing: Applications
- Antoine Roueff, Jérôme I. Mars, Jocelyn Chanussot, Helle Pederson:
Simultaneous group and phase correction for the estimation of dispersive propagating waves in the time-frequency plane. 429-432 - Benayad Nsiri, Thierry Chonavel, Jean-Marc Boucher:
Blind estimation of long impulse response and non-minimum phase wavelets application to seismic data. 433-436 - Qian Du, Sumit Chakrarvarty:
Unsupervised hyperspectral image classification using blind source separation. 437-440 - Yibin Zheng:
A new algorithm for retrieval of 2D exponentials. 441-444 - Anthony Sourice, Guy Plantier, Jean-Louis Saumet:
Two-dimensional frequency estimation using autocorrelation phase fitting. 445-448 - Jingxin Zhang, Jim Schroeder, Nicholas J. Redding:
SAR image enhancement for small target detection. 449-452 - Zhiping Lin, Qiyue Zou, Raimund J. Ober:
The Fisher information matrix for two-dimensional data sets. 453-456 - Damien Muti, Salah Bourennane:
Multidimensional signal processing using lower-rank tensor approximation. 457-460 - Wen-Hung Liao, Dai-Yun Li:
Homomorphic processing techniques for near-infrared images. 461-464 - Huafeng Liu, Lung Ngong Wong, Pengcheng Shi:
Cardiac motion and material properties analysis using data confidence weighted extended Kalman filter framework. 465-468 - Cha Zhang, Tsuhan Chen:
On generalized sampling for image-based rendering data. 469-472 - Chaminda Weerasinghe, Wanqing Li, Philip Ogunbona:
Stereoscopic panoramic video generation using centro-circular projection technique. 473-476
Image and Video Analysis I
- Chip-Hong Chang, Rui Xiao, Thambipillai Srikanthan:
An adaptive initialization technique for color quantization by self organizing feature map. 477-480 - Steve Mann, Corey Manders, James Fung:
The lightspace change constraint equation (LCCE) with practical application to estimation of the projectivity+gain transformation between multiple pictures of the same subject matter. 481-484 - Nilanjan Dasgupta, Lawrence Carin:
Context-based graphical modeling for wavelet domain signal processing. 485-488 - Hui Cheng:
Temporal registration of video sequences. 489-492 - Namrata Vaswani, Amit K. Roy-Chowdhury, Rama Chellappa:
Statistical shape theory for activity modeling. 493-496 - Amit K. Roy-Chowdhury, Amit A. Kale, Rama Chellappa:
Video synthesis of arbitrary views for approximately planar scenes. 497-500 - John N. Carter, Pelopidas Lappas, Robert I. Damper:
Evidence-based object tracking via global energy maximization. 501-504 - Fenghui Yao, Guifeng Shao:
Detection of 3D symmetry axis from fragments of a broken pottery bowl. 505-508 - Minghui Xia, Bede Liu:
"Super-resolution curve" and image registration. 509-512
Watermarking II
- Adnan M. Alattar, Eugene T. Lin, Mehmet Utku Celik:
Watermarking low bit-rate Advanced Simple Profile MPEG-4 bitstreams. 513-516 - Jun Tian:
High capacity reversible data embedding and content authentication. 517-520 - Patrick Bas, Nicolas Le Bihan, Jean-Marc Chassery:
Color image watermarking using quaternion Fourier transform. 521-524 - Shih-Hsuan Yang:
Wavelet filter evaluation for image watermarking. 525-528 - Ming Sun Fu, Oscar C. Au:
A novel method to embed watermark in different halftone images: data hiding by conjugate error diffusion (DHCED). 529-532 - Yanjiang Yang, Feng Bao:
An invertible watermarking scheme for authentication of Electronic Clinical Brain Atlas. 533-536 - Dajun He, Qibin Sun, Qi Tian:
An object based watermarking solution for MPEG4 video authentication. 537-540 - Quan He, Guangchuan Su:
A semi-blind robust watermarking for digital images. 541-544 - Tao Zhang, Xijian Ping:
Reliable detection of LSB steganography based on the difference image histogram. 545-548 - Guorui Feng, Ling-ge Jiang, Chen He, Dong-Jian Wang:
A novel algorithm for embedding and detecting digital watermarks. 549-552 - Slaven Marusic, David B. H. Tay, Guang Deng:
A parametric family of wavelet filters for diversity in watermarking application. 553-556 - Oscar C. Au, Ming Sun Fu:
A symmetric key watermark for halftone images. 557-560
Image and Video Indexing and Retrieval II
- Rozenn Dahyot, Anil C. Kokaram, Niall Rea, Hugh Denman:
Joint audio visual retrieval for tennis broadcasts. 561-564 - Chi-Man Pun:
Invariant content-based image retrieval by wavelet energy signatures. 565-568 - Haim H. Permuter, Joseph M. Francos, Ian H. Jermyn:
Gaussian mixture models of texture and colour for image database retrieval. 569-572 - Akisato Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:
Dynamic-segmentation-based feature dimension reduction for quick audio/video searching. 573-576 - Seong-O Shim, Tae-Sun Choi:
Image indexing by modified color cooccurrence matrix. 577-580 - Jiqiang Song, Min Cai, Michael R. Lyu:
A robust statistic method for classifying color polarity of video text. 581-584 - Ming Hong Pi, Mrinal Mandal, Anup Basu:
Image retrieval based on histogram of new fractal parameters. 585-588 - Anxiang Hong, Zheru Chi, Gang Chen, Zhiyong Wang:
Region-of-interest based flower images retrieval. 589-592 - Michael Hoeynck, Jens-Rainer Ohm:
Shape retrieval with robustness against partial occlusion. 593-596 - Guoping Qiu:
Appearance indexing. 597-600 - Chun-Ho Cheung, Lai-Man Po:
A novel histogram-biasing factor for fast sorted histogram-based measurement in large image database retrieval system. 601-604 - Miki Haseyama, Isao Kondo:
2-D functional AR model for image identification. 605-608 - Xiaokang Yang, Weisi Lin, Zhongkang Lu, Ee Ping Ong, Susu Yao:
Just-noticeable-distortion profile with nonlinear additivity model for perceptual masking in color images. 609-612 - Zhizhong Zhe, Hong Ren Wu, Zhenghua Yu, Tim Ferguson, Damian M. Tan:
Performance evaluation of a perceptual ringing distortion metric for digital video. 613-616 - Zhongkang Lu, Weisi Lin, Ee Ping Ong, Susu Yao, Xiaokang Yang:
Perceptual-quality significance map (PQSM) and its application on video quality distortion metrics. 617-620 - Deepak S. Turaga, Mihaela van der Schaar:
Content-adaptive filtering in the UMCTF framework. 621-624 - Toshiyuki Uto, Masaaki Ikehara:
A smooth extension for the nonexpansive orthogonal wavelet decomposition of finite length signals. 625-628 - Roger Pique, Luis Torres:
Efficient face coding in video sequences combining adaptive principal component analysis and a hybrid codec approach. 629-632 - Hideaki Kimata, Masaki Kitahara, Yoshiyuki Yashima:
3D motion vector coding with block base adaptive interpolation filter on H.264. 633-636 - Jong Chul Ye, Yingwei Chen:
Rate-distortion optimized data partitioning for video using backward adaptation. 637-640 - Gabriella Olmo, Cristiano Cucco, Marco Grangetto, Enrico Magli:
Few decoders in the encoder: a low complexity encoding strategy for H.26L. 641-644 - Òscar Divorra Escoda, Pierre Vandergheynst:
A locally temporal adaptive transform scheme for sub-band video coding. 645-648 - Golam Sorwar, M. Manzur Murshed, Laurence Dooley:
A fully adaptive performance-scalable distance-dependent thresholding search algorithm for video coding. 649-652 - Shing-Chow Chan, King To Ng, Zhi-Feng Gan, Kin-Lok Chan, Heung-Yeung Shum:
The compression of simplified dynamic light fields. 653-656
Image and Video Analysis II
- Lingyan Bi, Kwok Ping Chan, Yinglin Yu:
Modified CLT-domain motion estimation. 657-660 - Xuan Jing, Ce Zhu, Lap-Pui Chau:
Smooth constrained block matching criterion for motion estimation. 661-664 - Hai-Yun Wang, Kai-Kuang Ma:
Motion field discontinuity classification for tensor-based optical flow estimation. 665-668 - Yu-Chan Lim, Kyeong-Yuk Min, Jong-Wha Chong:
A pentagonal fast block matching algorithm for motion estimation using adaptive search range. 669-672 - Miki Haseyama, Atsushi Matsumura:
A trainable retrieval system for cartoon character images. 673-676 - Sangoh Jeong, Chee Sun Won, Robert M. Gray:
Histogram-based image retrieval using Gauss mixture vector quantization. 677-680 - Shan Suthaharan:
A perceptually significant block-edge impairment metric for digital video coding. 681-684 - Li Cheng, Terry Caelli:
Doubly-MRF stereo matching. 685-688 - Yong Yu, Isabelle Bloch, Alain Trouvé:
A unified unsupervised clustering algorithm and its first application to landcover classification. 689-692 - Peng Wang, Yufei Ma, Hong-Jiang Zhang, Shiqiang Yang:
A people similarity based approach to video indexing. 693-696 - Shunren Xia, Weidong Xu, Yutang Shen:
Two intelligent algorithms applied to automatic chromosome incision. 697-700 - Zhonghua Liang, Ping Wang, Zheng Tan:
Moving object detection from MPEG bit stream. 701-704
Image and Video Restoration
- Javier Mateos, Rafael Molina, Aggelos K. Katsaggelos:
Bayesian high resolution image reconstruction with incomplete multisensor low resolution systems. 705-708 - Javier Abad, Miguel Vega, Rafael Molina, Aggelos K. Katsaggelos:
Parameter estimation in super-resolution image reconstruction problems. 709-712 - Ying Fai Ho:
Peer region determination based impulsive noise detection. 713-716 - Euncheol Choi, Moon Gi Kang:
Deblocking algorithm for DCT-based compressed images using anisotropic diffusion. 717-720 - Bogdan Smolka, Konstantinos N. Plataniotis, Rastislav Lukac, Anastasios N. Venetsanopoulos:
New class of impulsive noise reduction filters based on kernel density estimation. 721-724 - Ryo Nakagaki, Aggelos K. Katsaggelos:
A VQ-based blur identification algorithm. 725-728 - Pascal Bourdon, Bertrand Augereau, Christian Olivier, Christian Chatellier:
A PDE-based method for ringing artifact removal on grayscale and color JPEG2000 images. 729-732 - Alexander M. Bronstein, Michael M. Bronstein, Michael Zibulevsky, Yehoshua Y. Zeevi:
Separation of semireflective layers using sparse ICA. 733-736 - Lorenzo Cappellari, Truong Q. Nguyen:
Deblocking of video sequences with lapped embedded IDCT. 737-740 - Ju Jia Zou, Hong Yan:
Model-based smoothing for reducing artifacts in compressed images. 741-744 - Rastislav Lukac, Bogdan Smolka, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos, Pavol Zavarsky:
Angular multichannel sigma filter. 745-748
Signal Processing Education
- Roxana Saint-Nom, Daniel Jacoby:
Switched capacitors: a bridge between analog and digital SP. 749-752 - Yong Lian:
The joy of learning DSP in a large class. 753-756 - Eliathamby Ambikairajah, Julien Epps, Ming Sheng, Branko G. Celler:
Evaluation of a virtual teaching laboratory for signal processing education. 757-760 - Jeng-Kuang Hwang:
Innovative communication design lab based on PC sound card and Matlab: a software-defined-radio OFDM modem example. 761-764 - John Håkon Husøy:
Making a case for iterative linear equation solvers in DSP education. 765-768 - Thad B. Welch, Robert W. Ives, Michael G. Morrow, Cameron H. G. Wright:
Using DSP hardware to teach modem design and analysis techniques. 769-772 - Jesús Ibáñez, Carlos Pantaleón, Luis Vielva, Ignacio Santamaría:
Teaching digital communications: a DSP approach. 773-776 - Swaroop Appadwedula, Richard G. Baraniuk, Matthew Berry, Mark D. Butala, Hyeokho Choi, Mark A. Haun, Douglas L. Jones, Michael L. Kramer, Dima Moussa, Lee C. Potter, Daniel Grobe Sachs, Brian Wade, Raymond S. Wagner:
Open-content signal processing laboratories in connexions. 777-780 - Amir Asif:
Multimedia learning objects for digital signal processing in communications. 781-784