


ICASSP 2005: Philadelphia, Pennsylvania, USA
- 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05, Philadelphia, Pennsylvania, USA, March 18-23, 2005. IEEE 2005, ISBN 0-7803-8874-7
Volume 1
Voice Morphing
- Javier Latorre, Koji Iwano, Sadaoki Furui: Polyglot Synthesis Using a Mixture of Monolingual Corpora. 1-4
- Ashish Verma, Arun Kumar: Introducing Roughness in Individuality Transformation through Jitter Modeling and Modification. 5-8
- Tomoki Toda, Alan W. Black, Keiichi Tokuda: Spectral Conversion Based on Maximum Likelihood Estimation Considering Global Variance of Converted Parameter. 9-12
- David Suendermann, Antonio Bonafonte, Hermann Ney, Harald Höge: A Study on Residual Prediction Techniques for Voice Conversion. 13-16
- Patrick Perrot, Guido Aversano, Raphaël Blouet, Maurice Charbit, Gérard Chollet: Voice Forgery Using ALISP: Indexation in a Client Memory. 17-20
- Long Qin, Gao Peng Chen, Zhen-Hua Ling, Li-Rong Dai: An Improved Spectral and Prosodic Transformation Method in STRAIGHT-based Voice Conversion. 21-24
Spoken Language Understanding and Dialog
- Ryuichiro Higashinaka, Katsuhito Sudoh, Mikio Nakano: Incorporating Discourse Features into Confidence Scoring of Intention Recognition Results in Spoken Dialogue Systems. 25-28
- Christian Raymond, Frédéric Béchet, Nathalie Camelin, Renato De Mori, Géraldine Damnati: Semantic Interpretation With Error Correction. 29-32
- Gang Ji, Jeff A. Bilmes: Dialog Act Tagging Using Graphical Models. 33-36
- Charles Lewis, Giuseppe Di Fabbrizio: A Clarification Algorithm for Spoken Dialogue Systems. 37-40
- Gökhan Tür: Model Adaptation For Spoken Language Understanding. 41-44
- Xiao Li, Asela Gunawardana, Alex Acero: Unsupervised Semantic Intent Discovery from Call Log Acoustics. 45-48
Speech Perception and Psychoacoustics
- Chiharu Morioka, Atsuko Kurashima, Akira Takahashi: Proposal on Objective Speech Quality Assessment for Wideband IP Telephony. 49-52
- Qiang Fu, Mark A. Clements, Klaus Mewes: Neural Cell Type Recognition Between Globus Pallidus Externus and Globus Pallidus Internus By Gaussian Mixture Modeling. 53-56
- Hitoshi Aoki, Akira Takahashi: Analysis of Relationship Between Overall Quality and Psychological Factors Affecting High-Quality Speech Communication Services. 57-60
- Maria Schuster, Elmar Nöth, Tino Haderlein, Stefan Steidl, Anton Batliner, Frank Rosanowski: Can you Understand him? Let's Look at his Word Accuracy - Automatic Evaluation of Tracheoesophageal Speech. 61-64
- Marc A. Boillot, John G. Harris: A Warped Bandwidth Expansion Filter. 65-68
- Sungyub Yoo, J. Robert Boston, John D. Durrant, Kristie Kovacyk, Stacey Karn, Susan Shaiman, Amro El-Jaroudi, Ching-Chung Li: Relative Energy And Intelligibility Of Transient Speech Information. 69-72
Confidence Measures and Rejection Algorithms
- Enrico Bocchieri, Sarangarajan Parthasarathy: Rejection Using Rank Statistics Based on HMM State Shortlists. 73-76
- Taeyoon Kim, Hanseok Ko: Speaker Adaptive Confidence Scoring Using Bayesian Combining. 77-80
- Graham Greenland, Willy Wong, Hans Kunov: Improving utterance verification using additional confidence measures in isolated speech recognition interfaces. 81-84
- Wai Kit Lo, Frank K. Soong: Generalized Posterior Probability for Minimum Error Verification of Recognized Sentences. 85-88
- Soundararajan Srinivasan, DeLiang Wang: Robust Speech Recognition by Integrating Speech Separation and Hypothesis Testing. 89-92
- Yue-wen Fu, Limin Du: Combination of Multiple predictors to Improve Confidence Measure Based on Local Posterior Probabilities. 93-96
Discriminative Training
- Khe Chai Sim, Mark J. F. Gales: Adaptation of Precision Matrix Models on Large Vocabulary Continuous Speech Recognition. 97-100
- Chaojun Liu, Hui Jiang, Xinwei Li: Discriminative Training of CDHMMs for Maximum Relative Separation Margin. 101-104
- Mohamed Afify, Xinwei Li, Hui Jiang: Statistical Performance Analysis of MCE/GPD Learning in Gaussian Classifiers and Hidden Markov Models. 105-108
- Lambert Mathias, Girija Yegnanarayanan, Jürgen Fritsch: Discriminative Training of Acoustic Models Applied to Domains with Unreliable Transcripts. 109-112
- Erik McDermott, Shigeru Katagiri: Minimum Classification Error for Large Scale Speech Recognition Tasks using Weighted Finite State Transducers. 113-116
- Bo Liu, Hui Jiang, Jian-Lai Zhou, Ren-Hua Wang: Discriminative Training Based on the Criterion of Least Phone Competing Tokens for Large Vocabulary Speech Recognition. 117-120
Quantization and Quality Measurement
- Stephen So, Kuldip K. Paliwal: Multi-Frame GMM-Based Block Quantisation of Line Spectral Frequencies for Wideband Speech Coding. 121-124
- Tiago H. Falk, Qingfeng Xu, Wai-Yip Chan: Non-Intrusive GMM-Based Speech Quality Measurement. 125-128
- Stephen D. Voran: A Multiple-Description PCM Speech Coder using Structured Dual Vector Quantizers. 129-132
- Minoru Kohata, Motoyuki Suzuki, Shozo Makino: A New Segment Quantizer for Line Spectral Frequencies Using Lempel-Ziv Algorithm. 133-136
- Hiroyuki Ehara, Toshiyuki Morii, Masahiro Oshikiri, Koji Yoshida: Predictive VQ for Bandwidth Scalable LSP Quantization. 137-140
- Yannis Agiomyrgiannakis, Yannis Stylianou: Coding with Side Information Techniques for LSF Reconstruction in Voice Over IP. 141-144
Speech Enhancement with Noise Reduction
- Changhuai You, Soo Ngee Koh, Susanto Rahardja: Signal Subspace Speech Enhancement for Audible Noise Reduction. 145-148
- Ning Ma, Martin Bouchard, Rafik A. Goubran: A Wavelet Kalman Filter with Perceptual Masking for Speech Enhancement in Colored Noise. 149-152
- Richard C. Hendriks, Richard Heusdens, Jesper Jensen: Adaptive Time Segmentation of Noisy Speech for Improved Speech Enhancement. 153-156
- Cyril Plapous, Claude Marro, Pascal Scalart: Speech Enhancement Using Harmonic Regeneration. 157-160
- Zhong Lin, Rafik A. Goubran: Instant Noise Estimation Using Fourier Transform of AMDF and Variable Start Minima Search. 161-164
- Guo-Hong Ding, Xia Wang, Yang Cao, Feng Ding, Yuezhong Tang: Speech Enhancement Based on Speech Spectral Complex Gaussian Mixture Model. 165-168
Speaker Recognition Using Acoustic and Higher Level Features
- Andrew O. Hatch, Barbara Peskin, Andreas Stolcke: Improved Phonetic Speaker Recognition Using Lattice Decoding. 169-172
- Sachin S. Kajarekar, Luciana Ferrer, Elizabeth Shriberg, M. Kemal Sönmez, Andreas Stolcke, Anand Venkataraman, Jing Zheng: SRI's 2004 NIST Speaker Recognition Evaluation System. 173-176
- Douglas A. Reynolds, William M. Campbell, Terry T. Gleason, Carl Quillen, Douglas E. Sturim, Pedro A. Torres-Carrasquillo, André Adami: The 2004 MIT Lincoln Laboratory Speaker Recognition System. 177-180
- Ka-Yee Leung, Man-Wai Mak, Man-Hung Siu, Sun-Yuan Kung: Speaker Verification Using Adapted Articulatory Feature-based Conditional Pronunciation Modeling. 181-184
- Zi-He Chen, Yuan-Fu Liao, Yau-Tarng Juang: Prosody Modeling and Eigen-Prosody Analysis for Robust Speaker Recognition. 185-188
- André Gustavo Adami: Prosodic Modeling for Speaker Recognition Based on Sub-Band Energy Temporal Trajectories. 189-192
Large Vocabulary ASR
- Jeff Siu-Kei Au-Yeung, Chak-Fai Li, Man-Hung Siu: Sub-phonetic Polynomial Segment Model for Large Vocabulary Continuous Speech Recognition. 193-196
- Olivier Siohan, Bhuvana Ramabhadran, Brian Kingsbury: Constructing Ensembles of ASR Systems Using Randomized Decision Trees. 197-200
- Mike Schuster, Takaaki Hori: Efficient Generation of high-order context-dependent Weighted Finite State Transducers for Speech Recognition. 201-204
- Hagen Soltau, Brian Kingsbury, Lidia Mangu, Daniel Povey, George Saon, Geoffrey Zweig: The IBM 2004 Conversational Telephony System for Rich Transcription. 205-208
- Gunnar Evermann, Ho Yin Chan, Mark J. F. Gales, Bin Jia, David Mrva, Philip C. Woodland, Kai Yu: Training LVCSR Systems on Thousands of Hours of Data. 209-212
- Mark Hasegawa-Johnson, James Baker, Sarah Borys, Ken Chen, Emily Coogan, Steven Greenberg, Amit Juneja, Katrin Kirchhoff, Karen Livescu, Srividya Mohan, Jennifer Muller, M. Kemal Sönmez, Tianyu Wang: Landmark-Based Speech Recognition: Report of the 2004 Johns Hopkins Summer Workshop. 213-216
Novel Methods for Speech Analysis
- Venkatraman Atti, Andreas Spanias: Speech Analysis by Estimating Perceptually Relevant Pole Locations. 217-220
- Steven M. Schimmel, Les E. Atlas: Coherent Envelope Detection for Modulation Filtering of Speech. 221-224
- Kentaro Ishizuka, Hiroko Kato Solvang, Tomohiro Nakatani: Speech Signal Analysis with Exponential Autoregressive Model. 225-228
- Robert W. Morris, Jon A. Arrowood, Mark A. Clements: Comparison of Autoregressive Parameter Estimation Algorithms for Speech Processing and Recognition. 229-232
- Princy Dikshit, Stephen A. Zahorian, Shivaram Nagulapati: An Algorithm for Locating Fundamental Frequency Markers in Speech Signals. 233-236
- Akira Sasou, Masataka Goto, Satoru Hayamizu, Kazuyo Tanaka: An Auto-Regressive, Non-Stationary Excited Signal Parameter Estimation Method and an Evaluation of a Singing-Voice Recognition. 237-240
Noise Robust Speech Recognition
- Chen Yang, Frank K. Soong, Tan Lee: Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR. 241-244
- Weizhong Zhu, Douglas D. O'Shaughnessy: Log-Energy Dynamic Range Normalization for Robust Speech Recognition. 245-248
- Jethran Guinness, Bhiksha Raj, Bent Schmidt-Nielsen, Lorenzo Turicchia, Rahul Sarpeshkar: A Companding Front End for Noise-Robust Automatic Speech Recognition. 249-252
- Hemant Misra, Shajith Ikbal, Sunil Sivadas, Hervé Bourlard: Multi-resolution Spectral Entropy Feature for Robust ASR. 253-256
- Masakiyo Fujimoto, Satoshi Nakamura: Particle Filter Based Non-Stationary Noise Tracking for Robust Speech Recognition. 257-260
- Tor André Myrvoll, Satoshi Nakamura: Online cepstral filtering using a sequential EM approach with Polyak averaging and feedback. 261-264
Prosody and Speech Synthesis
- Brian Langner, Alan W. Black: Improving the Understandability of Speech Synthesis by Modeling Speech in Noise. 265-268
- Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan: An Automatic Prosody Recognizer using a Coupled Multi-Stream Acoustic Model and a Syntactic-Prosodic Language Model. 269-272
- Yoko Kokenawa, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka: F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis. 273-276
- Shinsuke Sakai: Additive Modeling of English F0 Contour for Speech Synthesis. 277-280
- Dan-Ning Jiang, Wei Zhang, Liqin Shen, Lianhong Cai: Prosody Analysis and Modeling for Emotional Speech Synthesis. 281-284
- Jianfeng Li, Guoping Hu, Ren-Hua Wang, Li-Rong Dai: Sliding Window Smoothing For Maximum Entropy Based Intonational Phrase Prediction In Chinese. 285-288
- Wentao Gu, Keikichi Hirose, Hiroya Fujisaki: Identification and Synthesis of Cantonese Tones Based on the Command-Response Model for F0 Contour Generation. 289-292
- Joram Meron, Peter Veprek: Compression of Exception Lexicons for Small Footprint Grapheme-To-Phoneme Conversion. 293-296
- Christina L. Bennett, Alan W. Black: Prediction of Pronunciation Variations for Speech Synthesis: A Data-Driven Approach. 297-300
- Mitsuaki Isogai, Hideyuki Mizuno, Kazunori Mano: Recording Script Design for Corpus-Based TTS System Based on Coverage of Various Phonetic Elements. 301-304
- Jilei Tian, Jani Nurminen, Imre Kiss: Optimal Subset Selection from Text Databases. 305-308
- Jordi Adell, Antonio Bonafonte, Jon Ander Gómez, María José Castro: Comparative study of Automatic Phone Segmentation methods for TTS. 309-312
General Topics in ASR
- Brian Delaney: Increased Robustness Against Bit Errors for Distributed Speech Recognition in Wireless Environments. 313-316
- Stefan Steidl, Michael Levit, Anton Batliner, Elmar Nöth, Heinrich Niemann: "Of All Things the Measure Is Man": Automatic Classification of Emotions and Inter-Labeler Consistency. 317-320
- Lingyun Gu, John G. E. Harris, Rahul Shrivastav, Christine Sapienza: Disordered Speech Evaluation Using Objective Quality Measures. 321-324
- Björn W. Schuller, Raquel Jiménez Villar, Gerhard Rigoll, Manfred K. Lang: Meta-Classifiers in Acoustic and Linguistic Feature Fusion-Based Affect Recognition. 325-328
- Antonio M. Peinado, Angel M. Gomez, Victoria E. Sánchez, José L. Pérez-Córdoba, Antonio J. Rubio: Packet Loss Concealment Based on VQ Replicas and MMSE Estimation Applied to Distributed Speech Recognition. 329-332
- Valentin Ion, Reinhold Haeb-Umbach: A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services. 333-336
- Li Deng, Xiang Li, Dong Yu, Alex Acero: A Hidden Trajectory Model with Bi-directional Target-Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition. 337-340
- Izhak Shafran, Mehryar Mohri: A Comparison of Classifiers for Detecting Emotion from Speech. 341-344
- Alastair Bruce James, Ben Milner: Soft Decoding of Temporal Derivatives for Robust Distributed Speech Recognition in Packet Loss. 345-348
- Xin Lei, Gang Ji, Tim Ng, Jeff A. Bilmes, Mari Ostendorf: DBN-Based Multi-stream Models for Mandarin Toneme Recognition. 349-352
- Amaro A. de Lima, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, Fernando Gil Resende: Sparse KPCA for Feature Extraction in Speech Recognition. 353-356
- Evan Ruzanski, John H. L. Hansen, James Meyerhoff, George Saviolakis, Michael Koenig: Effects of Phoneme Characteristics on TEO Feature-based Automatic Stress Detection in Speech. 357-360
Speech Analysis and Synthesis
- Masatsune Tamura, Tatsuya Mizutani, Takehiko Kagoshima: Scalable Concatenative Speech Synthesis Based on the Plural Unit Selection and Fusion Method. 361-364
- Junichi Yamagishi, Takao Kobayashi: Adaptive Training for Hidden Semi-Markov Model. 365-368
- Mohammad Firouzmand, Laurent Girin: Perceptually Weighted Long Term Modeling of Sinusoidal Speech Amplitude Trajectories. 369-372
- Toshiyuki Sekiya, Tetsunori Kobayashi: Speech recognition in the blind condition based on multiple directivity patterns using a microphone array. 373-376
- Dagen Wang, Shrikanth S. Narayanan: An Unsupervised Quantitative Measure for Word Prominence in Spontaneous Speech. 377-380
- Kostas Kokkinakis, Asoke K. Nandi: Speech Modelling Based On Generalized Gaussian Probability Density Functions. 381-384
- Guo Chen, Vijay Parsa: Bayesian Model Based Non-Intrusive Speech Quality Evaluation. 385-388
- Celia Shahnaz, Wei-Ping Zhu, M. Omair Ahmad: Robust Pitch Estimation At Very Low SNR Exploiting Time and Frequency Domain Cues. 389-392
- Laurence Cnockaert, Francis Grenez, Jean Schoentgen: Fundamental Frequency Estimation and Vocal Tremor Analysis by means of Morlet Wavelet Transforms. 393-396
- Anindya Sarkar, Thippur V. Sreenivas: Automatic Speech Segmentation Using Average Level Crossing Rate Information. 397-400
- Van Tuan Pham, Gernot Kubin: DWT-Based Phonetic Groups Classification Using Neural Networks. 401-404
- Francesco Gianfelici, Giorgio Biagetti, Paolo Crippa, Claudio Turchetti: A Novel KLT Algorithm Optimized for Small Signal Sets. 405-408
- Yasser A. Mahgoub, Richard M. Dansereau: Voicing-State Classification of Co-Channel Speech Using Nonlinear State-Space Reconstruction. 409-412
- Shrikanth S. Narayanan, Dagen Wang: Speech Rate Estimation via Temporal Correlation and Selected Sub-Band Correlation. 413-416
Model-Based Robust Speech Recognition
- Xianyu Zhao, Zhijian Ou, Minhua Chen, Zuoying Wang: Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition. 417-420
- Daniel Willett: Context-Dependent Duration Modeling. 421-424
- André Coy, Jon Barker: Recognising Speech in the Presence of a Competing Speaker using a 'Speech Fragment Decoder'. 425-428
- Jian Wu, Qiang Huo, Donglai Zhu: An Environment Compensated Maximum Likelihood Training Approach Based on Stochastic Vector Mapping. 429-432
- Veronique Stouten, Hugo Van hamme, Patrick Wambacq: Effect of Phase-Sensitive Environment Model and Higher Order VTS on Noisy Speech Feature Enhancement. 433-436
- Pamornpol Jinachitra, Ramon Prieto: Towards Speech Recognition Oriented Dereverberation. 437-440
- Zhipeng Zhang, Sadaoki Furui: Noisy Speech Recognition Based on Robust End-point Detection and Model Adaptation. 441-444
- Hiroshi Fujimura, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura: Analysis of a large in-car speech corpus and its application to the multimodel ASR. 445-448
- Goshu Nagino, Makoto Shozakai: Building an Effective Corpus By Using Acoustic Space Visualization (COSMOS) Method. 449-452
- Shajith Ikbal, Hervé Bourlard, Mathew Magimai-Doss: HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition. 453-456
- András Zolnay, Ralf Schlüter, Hermann Ney: Acoustic Feature Combination for Robust Speech Recognition. 457-460
- Stavros Tsakalidis, William Byrne: Acoustic Training from Heterogeneous Data Sources: Experiments in Mandarin Conversational Telephone Speech Transcription. 461-464
Speech Mining and Audio-Visual Information Processing
- Kishan Thambiratnam, Sridha Sridharan: Dynamic Match Phone-Lattice Searches For Very Fast And Accurate Unrestricted Vocabulary Keyword Spotting. 465-468
- Satoshi Tamura, Koji Iwano, Sadaoki Furui: A Stream-Weight Optimization Method for Multi-Stream HMMs Based on Likelihood Value Normalization. 469-472
- Jesus F. Guitarte Perez, Alejandro F. Frangi, Eduardo Lleida-Solano, Klaus Lukas: Lip Reading for Robust Speech Recognition on Embedded Devices. 473-476
- Simon Tucker, Steve Whittaker: Novel Techniques For Time-Compressing Speech: An Exploratory Study. 477-480
- Peng Yu, Frank Seide: Fast Two-Stage Vocabulary-Independent Search In Spontaneous Speech. 481-484
- Takafumi Koshinaka, Ken-ichi Iso, Akitoshi Okumura: An HMM-based Text Segmentation Method Using Variational Bayes Approach and Its Application to LVCSR for Broadcast News. 485-488
- Daniel Gatica-Perez, Iain McCowan, Dong Zhang, Samy Bengio: Detecting Group Interest-Level in Meetings. 489-492
- Lee Begeja, Harris Drucker, David C. Gibbon, Patrick Haffner, Zhu Liu, Bernard Renger, Behzad Shahraray: Semantic Data Mining of Short Utterances. 493-496
- Alex Park, Timothy J. Hazen, James R. Glass: Automatic Processing of Audio Lectures for Information Retrieval: Vocabulary Selection and Language Modeling. 497-500
- Mohamed Kamal Omar, Upendra V. Chaudhari, Ganesh N. Ramaswamy: Blind Change Detection for Audio Segmentation. 501-504
- Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh: Combining Multiple Subword Representations for Open-Vocabulary Spoken Document Retrieval. 505-508
- Hasan Ertan Çetingül, Yücel Yemez, Engin Erzin, A. Murat Tekalp: Robust Lip-Motion Features For Speaker Identification. 509-512
Feature-Based Robust Speech Recognition
- Fabio Valente, Christian Wellekens: Variational Bayesian Feature Saliency for Audio Type Classification. 513-516
- Muhammad Ghulam, Takashi Fukuda, Junsei Horikawa, Tsuneo Nitta: Pitch-Synchronous ZCPA (PS-ZCPA)-Based Feature Extraction with Auditory Masking. 517-520
- Nicolás Morales, John H. L. Hansen, Doroteo T. Toledano: MFCC Compensation for Improved Recognition of Filtered and Band-Limited Speech. 521-524
- Chia-Ping Chen, Jeff A. Bilmes, Daniel P. W. Ellis: Speech Feature Smoothing for Robust ASR. 525-528
- Vivek Tyagi, Christian Wellekens: On desensitizing the Mel-Cepstrum to spurious spectral components for Robust Speech Recognition. 529-532
- Weifeng Li, Katunobu Itou, Kazuya Takeda, Fumitada Itakura: Two-stage Noise Spectra Estimation and Regression based In-car Speech Recognition using Single Distant Microphone. 533-536
- Sue Harding, Jon Barker, Guy J. Brown: Mask Estimation Based on Sound Localisation for Missing Data Speech Recognition. 537-540
- Rajesh M. Hegde, Hema A. Murthy, G. V. Ramana Rao: Speech Processing Using Joint Features Derived from the Modified Group Delay Function. 541-544
- Benjamin J. Shannon, Kuldip K. Paliwal: Influence of Autocorrelation Lag Ranges on Robust Speech Recognition. 545-548
- R. Muralishankar, Douglas D. O'Shaughnessy: Subspace-based Speaker-independent Vowel Recognition. 549-552
- Rui Zhao, Zuoying Wang: Robust Speech Recognition Based on Spectral Adjusting and Warping. 553-556
- Jaume Padrell, Dusan Macho, Climent Nadeu: Robust Speech Activity Detection Using LDA Applied to FF Parameters. 557-560
Language Modeling and Identification
- Murat Saraclar, Brian Roark: Joint Discriminative Language Modeling and Utterance Classification. 561-564
- Vaibhava Goel, Hong-Kwang Kuo, Sabine Deligne, Cheng Wu: Language Model Estimation for Optimizing End-to-end Performance of a Natural Language Call Routing System. 565-568
- Yasunari Obuchi, Nobuo Sato: Language Identification Using Phonetic and Prosodic HMMs with Feature Normalization. 569-572
- Ruhi Sarikaya, Agustín Gravano, Yuqing Gao: Rapid Language Model Development Using External Resources for New Spoken Dialog Domains. 573-576
- Boon Pang Lim, Haizhou Li, Bin Ma: Using Local & Global Phonotactic Features in Chinese Dialect Identification. 577-580
- Ahmad Emami, Frederick Jelinek: Random Clusterings for Language Modeling. 581-584
- Rongqing Huang, John H. L. Hansen: Dialect/Accent Classification via Boosted Word Modeling. 585-588
- Tim Ng, Mari Ostendorf, Mei-Yuh Hwang, Man-Hung Siu, Ivan Bulyko, Xin Lei: Web-Data Augmented Language Models for Mandarin Conversational Speech Recognition. 589-592
- Zhu Liu: An Efficient Algorithm for Clustering Short Spoken Utterances. 593-596
- Dong Yu, Milind Mahajan, Peter Mau, Alex Acero: Maximum Entropy Based Generic Filter for Language Model Adaptation. 597-600
- Chi-Yueh Lin, Hsiao-Chuan Wang: Language Identification Using Pitch Contour Information. 601-604
- Nick J.-C. Wang: Integrating Multiple Layers of Concept Information into N-gram Modeling for Spoken Language Understanding. 605-608
- S. A. SantoshKumar, V. Ramasubramanian: Automatic Language Identification Using Ergodic HMM. 609-612
Text-Independent Speaker Recognition
- Jérôme Louradour, Khalid Daoudi, Régine André-Obrecht, Paul Sabatier: Discriminative Power of Transient Frames in Speaker Recognition. 613-616
- Ji Ming, Darryl Stewart, Saeed Vaseghi: Speaker Identification in Unknown Noisy Conditions - A Universal Compensation Approach. 617-620
- Balakrishnan Narayanaswamy, Rashmi Gangadharaiah: Extracting Additional Information from Gaussian Mixture Model Probabilities for Improved Text-Independent Speaker Identification. 621-624
- Zhenyu Xiong, Thomas Fang Zheng, Zhanjiang Song, Wenhu Wu: Combining Selection Tree with Observation Reordering Pruning for Efficient Speaker Identification Using GMM-UBM. 625-628
- Alex Solomonoff, William M. Campbell, Ian Boardman: Advances In Channel Compensation For SVM Speaker Recognition. 629-632
- Patrick Kenny, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel: Factor Analysis Simplified. 637-640
- Yusuke Kida, Hiroyoshi Yamamoto, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura: Minimum Classification Error Interactive Training for Speaker Identification. 641-644
- Yih-Ru Wang, Chen-Yu Chiang: A New Common Component GMM-Based Speaker Recognition Method. 645-648
- Yi-Hsiang Chao, Hsin-Min Wang, Ruei-Chuan Chang: GMM-Based Bhattacharyya Kernel Fisher Discriminant Analysis For Speaker Recognition. 649-652
- James H. Nealand, Jason W. Pelecanos, Ran D. Zilca, Ganesh N. Ramaswamy: A Study of the Relative Importance of Temporal Characteristics in Text-Dependent and Text-Constrained Speaker Verification. 653-656
- Zekeriya Tufekci, Sabri Gurbuz: Noise Robust Speaker Verification Using Mel-Frequency Discrete Wavelet Coefficients and Parallel Model Compensation. 657-660
Acoustic Modeling and Clustering Algorithms
- Peder A. Olsen, Karthik Visweswariah, Ramesh Gopinath: Initializing Subspace Constrained Gaussian Mixture Models. 661-664
- Özgür Çetin, Mari Ostendorf: Multi-Rate and Variable-Rate Modeling of Speech At Phone and Syllable Time Scales. 665-668
- Xiao-Bing Li, Frank K. Soong, Tor André Myrvoll, Ren-Hua Wang: Optimal Clustering and Non-Uniform Allocation of Gaussian Kernels in Scalar Dimension for HMM Compression. 669-672
- Hui Lin, Ye Tian, Jian-Lai Zhou, Hui Jiang: Hierarchical Correlation Compensation For Hidden Markov Models. 673-676
- Bing Xiang, Long Nguyen, Spyros Matsoukas, Richard M. Schwartz: Cluster-Dependent Acoustic Modeling. 677-680
- Xianghua Xu, Jie Zhu: Fuzzy Parameter Clustering Method in Speech Recognition. 681-684
- Mark Z. Mao, Vincent Vanhoucke, Brian Strope: Automatic Training Set Segmentation for Multi-pass Speech Recognition. 685-688
- Yuya Akita, Tatsuya Kawahara: Generalized Statistical Modeling of Pronunciation Variations using Variable-length Phone Context. 689-692
- Franz Pernkopf: On Initialization of Gaussian Mixtures: A Hybrid Genetic EM Algorithm. 693-696
- Rusheng Hu, Xiaolong Li, Yunxin Zhao: Acoustic Model Training Using Greedy EM. 697-700
- Konstantin Markov, Satoshi Nakamura: Modeling Successive Frame Dependencies with Hybrid HMM/BN Acoustic Model. 701-704
- Xi Zhou, Ye Tian, Jian-Lai Zhou, Bei-qian Dai: Improved Covariance Modeling For Maximum Likelihood Multiple Subspace Transformations. 705-708
Topics in Speaker Recognition
- Jonas Richiardi, Plamen J. Prodanov, Andrzej Drygajlo: A probabilistic measure of modality reliability in speaker verification. 709-712
- Mikaël Collet, Delphine Charlet, Frédéric Bimbot: A Correlation Metric for Speaker Tracking Using Anchor Models. 713-716
- William M. Campbell, Douglas A. Reynolds, Joseph P. Campbell, Kevin Brady: Estimating and Evaluating Confidence for Forensic Speaker Recognition. 717-720
- Norman Poh, Samy Bengio: F-ratio Client-Dependent Normalisation for Biometric Authentication Tasks. 721-724
- Wei-Ho Tsai, Shih-Sian Cheng, Yi-Hsiang Chao, Hsin-Min Wang: Clustering Speech Utterances by Speaker Using Eigenvoice-Motivated Vector Space Models. 725-728
- Matthieu Hébert, Daniel Boies: T-Norm for Text-Dependent Commercial Speaker Verification Applications: Effect of Lexical Mismatch. 729-732
- Hagai Aronowitz, David Burshtein, Amihood Amir: A Session-GMM Generative Model Using Test Utterance Gaussian Mixture Modeling for Speaker Verification. 733-736
- Jean-François Bonastre, Frédéric Wils, Sylvain Meignier: ALIZE, a free toolkit for speaker recognition. 737-740
- Douglas E. Sturim, Douglas A. Reynolds: Speaker Adaptive Cohort Selection for Tnorm in Text-Independent Speaker Verification. 741-744
- Hyoung-Gook Kim, Daniel Ertelt, Thomas Sikora: Hybrid Speaker-Based Segmentation System Using Model-Level Clustering. 745-748
- Antonio Moreno-Daniel, Biing-Hwang Juang, Juan Arturo Nolazco-Flores: Robustness of Bit-stream Based Features for Speaker Verification. 749-752
- Sue Tranter: Two-Way Cluster Voting to Improve Speaker Diarisation Performance. 753-756
- Daniel Gillick, Stephen Stafford, Barbara Peskin: Speaker Detection Without Models. 757-760
Topics in Speech Coding and Enhancement
- Ali Erdem Ertan, Thomas P. Barnwell III: Improving the 2.4 Kb/s Military Standard MELP (MS-MELP) Coder Using Pitch-Synchronous Analysis and Synthesis Techniques. 761-764
- Matthew Lee, Adriane Swaim Durey, Elliot Moore, Mark Clements: Ultra Low Bit Rate Speech Coding Using an Ergodic Hidden Markov Model. 765-768
- Christopher M. Garrido, Manohar N. Murthi, Søren Vang Andersen: Towards iLBC Speech Coding at Lower Rates Through a New Formulation of the Start State search. 769-772
- Cenk Demiroglu, Thomas P. Barnwell III: A Missing-Data Approach to Noise-Robust LPC Extraction for Voiced Speech Using Auxiliary Sensors. 773-776
- Mark A. Jasiuk, Tenkasi Ramabadran, Udar Mittal, James P. Ashley, Michael J. McLaughlin: A Technique of Multi-Tap Long Term Predictor (LTP) Filter Using Sub-Sample Resolution Delay. 777-780
- Jong Won Shin, Joon-Hyuk Chang, Hwan Sik Yun, Nam Soo Kim: Voice Activity Detection based on Generalized Gamma Distribution. 781-784
- Mohamed Chibani, Philippe Gournay, Roch Lefebvre: Increasing the Robustness of CELP-Based Coders By Constrained Optimization. 785-788
- Udar Mittal, James P. Ashley, Edgardo M. Cruz-Zeno, Mark A. Jasiuk: Joint Optimization of Excitation Parameters in Analysis-By-Synthesis Speech Coders Having Multi-Tap Long Term Predictor. 789-792
- Sheng Yao, Cheung-Fat Chan: Block-based Bandwidth Extension of Narrowband Speech Signal by using CDHMM. 793-796
- Cenk Demiroglu, Sunil D. Kamath, David V. Anderson: Segmentation-Based Speech Enhancement for Intelligibility Improvement in MELP Coders Using Auxiliary Sensors. 797-800
- Marcin Kuropatwinski, W. Bastiaan Kleijn: Stochastic Integration and Long Term Predictor Estimation under Noisy Conditions for Speech Enhancement. 801-804
- Takahiro Unno, Alan McCree: A Robust Narrowband to Wideband Extension System Featuring Enhanced Codebook Mapping. 805-808
- Laura Laaksonen, Juho Kontio, Paavo Alku: Artificial Bandwidth Expansion Method to Improve Intelligibility and Quality of AMR-Coded Narrowband Speech. 809-812
- Xuefeng Zhang, Ying Jia: A Soft Decision Based Noise Cross Power Spectral Density Estimation for Two-Microphone Speech Enhancement Systems. 813-816
Large Vocabulary ASR
- Veera Venkataramani, William Byrne: Lattice Segmentation and Support Vector Machines for Large Vocabulary Continuous Speech Recognition. 817-820
- Viet Bac Le, Laurent Besacier: First Steps in Fast Acoustic Modeling for a New Target Language: Application to Vietnamese. 821-824
- Christian Gollan, Maximilian Bisani, Stephan Kanthak, Ralf Schlüter, Hermann Ney: Cross Domain Automatic Transcription on the TC-STAR EPPS Corpus. 825-828
- René Beutler, Tobias Kaufmann, Beat Pfister: Using Rule-based Knowledge to Improve LVCSR. 829-832
- Javier Dieguez-Tirado, Carmen García-Mateo, Laura Docío Fernández, Antonio Cardenal López: Adaptation Strategies for the Acoustic and Language Models in Bilingual Speech Transcription. 833-836
- Jinyu Li, Yu Tsao, Chin-Hui Lee: A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition. 837-840
- Mark J. F. Gales, Bin Jia, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland, Kai Yu: Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System. 841-844
- Ananth Sankar: Bayesian Model Combination (BAYCOM) for Improved Recognition. 845-848
- Xunying Liu, Mark J. F. Gales, Khe Chai Sim, Kai Yu: Investigation of Acoustic Modeling Techniques for LVCSR Systems. 849-852
- Jian Xue, Yunxin Zhao: Improved Confusion Network Algorithm and Shortest Path Search from Word Lattice. 853-856
- Sinaporn Suebvisai, Paisarn Charoenpornsawat, Alan W. Black, Monika Woszczyna, Tanja Schultz: Thai Automatic Speech Recognition. 857-860
- Do Yeong Kim, Ho Yin Chan, Gunnar Evermann, Mark J. F. Gales, David Mrva, Khe Chai Sim, Philip C. Woodland: Development of the CU-HTK 2004 Broadcast News Transcription Systems. 861-864
- Terrence Martin, Sridha Sridharan: Cross-language Acoustic Model Refinement for the Indonesian Language. 865-868
Speech Analysis and Production
- Eoin O'Leidhin, Peter J. Murphy:
Analysis of Spectral Measures for Voiced Speech with Varying Noise and Pertubation Levels. 869-872 - Nicolas Malyska, Thomas F. Quatieri, Douglas E. Sturim:
Automatic Dysphonia Recognition using Biologically-Inspired Amplitude-Modulation Features. 873-876 - Dhany Arifianto
, Takao Kobayashi:
Voiced/Unvoiced Determination of Speech Signal in Noisy Environment using Harmonicity Measure Based on Instantaneous Frequency. 877-880 - Kazuya Takeda, Tran Huy Dat, Hiroshi Fujimura, Fumitada Itakura:
SNR and Local Noise Power Estimations Based on Gaussian Mixture Modeling on the Log-Power Domain. 881-884 - Alexander Gutkin
, Simon King
:
Detection of Symbolic Gestural Events in Articulatory Data for Use in Structural Representations of Continuous Speech. 885-888 - Nobuaki Minematsu:
Mathematical Evidence of the Acoustic Universal Structure in Speech. 889-892 - Zhaoyan Zhang, Carol Y. Espy-Wilson, Suzanne Boyce, Mark Tiede:
Modeling of the Front Cavity and Sublingual Space in American English Rhotic Sounds. 893-896 - Tom Bäckström, Matti Airas, Laura Lehto, Paavo Alku:
Objective Quality Measures for Glottal Inverse Filtering of Speech Pressure Signals. 897-900 - Huiqun Deng, Rabab K. Ward, Michael P. Beddoes, Murray Hodgson:
Effects of Glottal and Lip Boundary Conditions on Vocal-Tract Area Function Estimates from Speech Signals. 901-904 - Ramdas Kumaresan, Gopi Krishna Allu, Peter Cariani:
Adaptive Filterbanks Inspired By the Auditory System for Speech Feature Extraction. 905-908 - Sadao Hiroya, Takemi Mochida:
Multi-Speaker Articulatory Reconstruction Based on an Eigen Articulatory HMM. 909-912 - Jonathan Malkin, Xiao Li, Jeff A. Bilmes:
A Graphical Model for Formant Tracking. 913-916 - Abdellah Kacha, Francis Grenez, Jean Schoentgen, Khier Benmahammed:
Dysphonic Speech Analysis Using Generalized Variogram. 917-920
Feature Extraction and Modeling
- Michael L. Seltzer, Alex Acero:
Training Wideband Acoustic Models using Mixed-Bandwidth Training Data via Feature Bandwidth Extension. 921-924 - Bing Zhang, Spyros Matsoukas:
Minimum Phoneme Error Based Heteroscedastic Linear Discriminant Analysis for Speech Recognition. 925-928 - Woojay Jeon, Biing-Hwang Juang:
A Study of Auditory Modeling and Processing for Speech Signals. 929-932 - Ghinwa F. Choueiter, James R. Glass:
A Wavelet and Filter Bank Framework For Phonetic Classification. 933-936 - Joseph Tepperman, Shrikanth S. Narayanan:
Automatic Syllable Stress Detection Using Prosodic Features for Pronunciation Evaluation of Language Learners. 937-940 - Jonathan Darch, Ben Milner, Xu Shao, Saeed Vaseghi, Qin Yan:
Predicting Formant Frequencies from MFCC Vectors. 941-944 - Barry Y. Chen, Qifeng Zhu, Nelson Morgan:
Tonotopic Multi-Layered Perceptron: A Neural Network for Learning Long-Term Temporal Features for Speech Recognition. 945-948 - Umit H. Yapanel, John H. L. Hansen:
Towards an Intelligent Acoustic Front-End for Automatic Speech Recognition: Built-In Speaker Normalization (BISN). 949-952 - Frank Diehl, Asunción Moreno:
Quasi-Continuous Local Codebook Features for multilingual Acoustic Phonetic Modelling. 953-956 - Mohamad Abdolahi, Hamidreza Amindavar:
GARCH Coefficients as Feature for Speech Recognition in Persian Isolated Digit. 957-960 - Daniel Povey, Brian Kingsbury, Lidia Mangu, George Saon, Hagen Soltau, Geoffrey Zweig:
fMPE: Discriminatively Trained Features for Speech Recognition. 961-964
Adaptation and Normalization
- Fabio Valente, Christian Wellekens:
Variational Bayesian Adaptation for Speaker Clustering. 965-968 - Matthias Honal, Tanja Schultz:
Automatic Disfluency Removal on Recognized Spontaneous Speech - Rapid Adaptation to Speaker Dependent Disfluencies. 969-972 - Chih-Hsien Huang, Jen-Tzung Chien:
Aggregate a Posteriori Linear Regression for Speaker Adaptation. 973-976 - Jan Stadermann, Gerhard Rigoll:
Two-Stage Speaker Adaptation of Hybrid Tied-Posterior Acoustic Models. 977-980 - Brian Mak, Simon Ka-Lung Ho:
Various Reference Speakers Determination Methods for Embedded Kernel Eigenvoice Speaker Adaptation. 981-984 - Roger Wend-Huu Hsiao, Brian Kan-Wing Mak:
Kernel Eigenspace-based MLLR Adaptation Using Multiple Regression Classes. 985-988 - Florian Metze, Christian Fügen, Yue Pan, Alex Waibel:
Automatically Transcribing Meetings using Distant Microphones. 989-992 - Tie Cai, Jie Zhu:
A Novel Method for Rapid Speaker Adaptation Based on Support Speaker Weighting. 993-996 - Georg Stemmer, Fabio Brugnara, Diego Giuliani:
Adaptive Training Using Simple Target Models. 997-1000 - Daniele Colibro, Luciano Fissore, Cosmin Popovici, Claudio Vair, Pietro Laface:
Learning Pronunciation and Formulation Variants in Continuous Speech Applications. 1001-1004 - Lori Lamel, Jean-Luc Gauvain:
Alternate Phone Models for Conversational Speech. 1005-1008 - Szu-Chen Stan Jou, Tanja Schultz, Alex Waibel:
Whispery Speech Recognition using Adapted Articulatory Features. 1009-1012
Topics in Speech Processing and Systems
- Alexandre Allauzen, Jean-Luc Gauvain:
Open Vocabulary ASR for Audiovisual Document Indexation. 1013-1016 - Bowen Zhou, Stanley F. Chen, Yuqing Gao:
Constrained Phrase-based Translation Using Weighted Finite State Transducer. 1017-1020 - Katsutoshi Ohtsuki, Nobuaki Hiroshima, Masahiro Oku, Akihiro Imamura:
Unsupervised Vocabulary Expansion for Automatic Transcription of Broadcast News. 1021-1024 - Srinivas Bangalore, Owen Rambow:
Classification of Structured Descriptions. 1025-1028 - Heidi Christensen, BalaKrishna Kolluru, Yoshihiko Gotoh, Steve Renals:
Maximum entropy segmentation of broadcast news. 1029-1032 - Vincent Goffin, Cyril Allauzen, Enrico Bocchieri, Dilek Hakkani-Tür, Andrej Ljolje, Sarangarajan Parthasarathy, Mazin G. Rahim, Giuseppe Riccardi, Murat Saraçlar:
The AT&T WATSON Speech Recognizer. 1033-1036 - Ching-Ho Tsai, Nick J.-C. Wang, Patrick Huang, Jia-Lin Shen:
Open Vocabulary Chinese Name Recognition with the Help of Character Description and Syllable Spelling Recognition. 1037-1040 - Dilek Hakkani-Tür, Gökhan Tür, Giuseppe Riccardi, Hong Kook Kim:
Error Prediction in Spoken Dialog: From Signal-to-Noise Ratio to Semantic Confidence Scores. 1041-1044 - Ian R. Lane, Tatsuya Kawahara:
Incorporating Dialogue Context and Topic Clustering in Out-of-Domain Detection. 1045-1048 - Atsushi Sako, Yasuo Ariki:
Structuring Baseball Live Games Based on Speech Recognition Using Task Dependent Knowledge and Emotion State Recognition. 1049-1052 - Hiroaki Nanjo, Tatsuya Kawahara:
A New ASR Evaluation Measure and Minimum Bayes-Risk Decoding for Open-domain Speech Understanding. 1053-1056 - Tatsuhiko Tomita, Yoshiyuki Okimoto, Hirofumi Yamamoto, Yoshinori Sagisaka:
Speech recognition of a named entity. 1057-1060 - Jeremy Ang, Yang Liu, Elizabeth Shriberg:
Automatic Dialog Act Segmentation and Classification in Multiparty Meetings. 1061-1064 - Makoto Hirohata, Yosuke Shinnaka, Koji Iwano, Sadaoki Furui:
Sentence extraction-based presentation summarization techniques and evaluation metrics. 1065-1068
Topics in Speech Enhancement, Separation and Dereverberation
- Takafumi Hikichi, Marc Delcroix, Masato Miyoshi:
Blind Dereverberation based on Estimates of Signal Transmission Channels without Precise Information of Channel Order. 1069-1072 - Keisuke Kinoshita, Tomohiro Nakatani, Masato Miyoshi:
Fast Estimation of a Precise Dereverberation Filter based on Speech Harmonicity. 1073-1076 - Sriram Srinivasan, Jonas Samuelsson, W. Bastiaan Kleijn:
Codebook-Based Bayesian Speech Enhancement. 1077-1080 - Tim Fingscheidt, Christophe Beaugeant, Suhadi Suhadi:
Overcoming the Statistical Independence Assumption w.r.t. Frequency in Speech Enhancement. 1081-1084 - Mingyang Wu, DeLiang Wang:
A Two-Stage Algorithm for Enhancement of Reverberant Speech. 1085-1088 - K. Sharath Rao, Thippur V. Sreenivas:
Matrix Quantization Based Time-Varying Filter Speech Enhancement. 1089-1092 - Zicheng Liu, Amar Subramanya, Zhengyou Zhang, Jasha Droppo, Alex Acero:
Leakage Model and Teeth Clack Removal for Air- and Bone-Conductive Integrated Microphones. 1093-1096 - Bin Chen, Philipos C. Loizou:
Speech Enhancement Using a MMSE Short Time Spectral Amplitude Estimator with Laplacian Speech Modeling. 1097-1100 - Guoning Hu, DeLiang Wang:
Separation of Fricatives and Affricates. 1101-1104 - Nima Mesgarani, Shihab A. Shamma:
Speech Enhancement Based on Filtering the Spectrotemporal Modulations. 1105-1108 - Volodya Grancharov, Jonas Samuelsson, W. Bastiaan Kleijn:
Improved Kalman Filtering for Speech Enhancement. 1109-1112 - Rong Hu, Yunxin Zhao:
Adaptive Decorrelation Filtering Algorithm for Speech Source Separation in Uncorrelated Noises. 1113-1116 - Min-Seok Choi, Hong-Goo Kang:
An improved estimation of a priori speech absence probability for speech enhancement: in perspective of speech perception. 1117-1120 - Jianping Deng, Martin Bouchard, Tet Hin Yeap:
Speech Enhancement Using a Switching Kalman Filter with a Perceptual Post-Filter. 1121-1124
Volume 2
Watermarking
- Qiao Li, Ingemar J. Cox:
Using Perceptual Models to Improve Fidelity and Provide Invariance to Valumetric Scaling for Quantization Index Modulation Watermarking. 1-4 - Abdellatif Zaidi, Pablo Piantanida, Pierre Duhamel:
Scalar scheme for Multiple User Information Embedding. 5-8 - Ramarathnam Venkatesan, Mariusz H. Jakubowski:
Randomized Detection For Spread-Spectrum Watermarking: Defending Against Sensitivity and Other Attacks. 9-12 - Yongdong Wu:
Linear Combination Collusion Attack and its Application on an Anti-Collusion Fingerprinting. 13-16 - Mehmet Utku Celik, Gaurav Sharma, A. Murat Tekalp:
Pitch and Duration Modification for Speech Watermarking. 17-20 - Oktay Altun, Gaurav Sharma, Mehmet Utku Celik, Mark Sterling, Edward L. Titlebaum, Mark Bocko:
Morphological Steganalysis of Audio Signals and the Principle of Diminishing Marginal Distortions. 21-24
Denoising
- Antoni Buades, Bartomeu Coll, Jean-Michel Morel:
Image Denoising By Non-Local Averaging. 25-28 - Keigo Hirakawa, Thomas W. Parks:
Image Denoising for Signal-Dependent Noise. 29-32 - Il Ryeol Kim, Kenneth E. Barner:
Wavelet Domain Partition-Based Image Denoising. 33-36 - Qi Li, Tania Stathaki:
An Improved Image Denoising Algorithm based on Weighted Adaptive Local Bounds. 37-40 - Thomas C. M. Lee, Xiao-Li Meng:
A Self-Consistent Wavelet Method for Denoising Images with Missing Pixels. 41-44 - Amor Elmzoughi, Amel Benazza-Benyahia, Béatrice Pesquet:
An interscale multivariate statistical model for MAP multicomponent image denoising in the wavelet transform domain. 45-48
Video Coding
- Nikola Bozinovic, Janusz Konrad, Wei Zhao, Carlos Vázquez:
On the Importance of Motion Invertibility in MCTF/DWT Video Coding. 49-52 - Mingshi Wang, Mihaela van der Schaar:
Rate-Distortion Modeling for Wavelet Video Coders. 53-56 - Nejat Kamaci, Yucel Altunbasak:
Frame Bit Allocation for H.264 Using Cauchy-Distribution Based Source Modelling. 57-60 - Beibei Wang, Yao Wang, Ivan W. Selesnick, Anthony Vetro:
Video Coding Using 3-D Dual-Tree Discrete Wavelet Transforms. 61-64 - Aditya Mavlankar, Eckehard G. Steinbach:
Multiple Description Video Coding Using Motion-Compensated Lifted 3D Wavelet Decomposition. 65-68 - Cyril Bergeron, Catherine Lamy-Bergot, Béatrice Pesquet-Popescu:
Adaptive M-Band Hierarchical Filterbank for Compliant Temporal Scalability in H.264 Standard. 69-72
Biometrics I
- Shan Du, Rabab K. Ward:
Statistical Non-Uniform Sampling of Gabor Wavelet Coefficients for Face Recognition. 73-76 - Wei Xiong, Changsheng Xu, Sim Heng Ong:
Peg-free Human Hand Shape Analysis and Recognition. 77-80 - Jason Thornton, Pablo Hennings, Jelena Kovacevic, B. V. K. Vijaya Kumar:
Wavelet Packet Correlation Methods in Biometrics. 81-84 - Chunyan Xie, Marios Savvides, B. V. K. Vijaya Kumar:
Quaternion Correlation Filters for Face Recognition in Wavelet Domain. 85-88 - Danny Little, Sreekar Krishna, John A. Black Jr., Sethuraman Panchanathan:
A Methodology for Evaluating Robustness of Face Recognition Algorithms with Respect to Variations in Pose Angle and Illumination Angle. 89-92 - Natalia A. Schmid, Bojan Cukic, Manasi V. Ketkar, Harshinder Singh:
Performance analysis of Iris Based identification system at the matching score level. 93-96
Restoration and Segmentation
- Yonggang Shi, William Clement Karl:
A Fast Level Set Method Without Solving PDEs. 97-100 - Lin-Sen Yu, Tian-Wen Zhang:
An Unsupervised Learning Algorithm for Image Segmentation Based On Finite Mixture Models. 101-104 - Claude Cariou, Kacem Chehdi, Arnault Nagle:
Gravitational Transform for Data Clustering - Application to Multicomponent Image Classification. 105-108 - Yunqiang Chen, Hongcheng Wang, Tong Fang, Jason Tyan:
Image Compounding Based on Independent Noise Constraint. 109-112 - Dalong Li, Russell M. Mersereau, Steven J. Simske:
Blind Image Deconvolution Using Support Vector Regression. 113-116 - Imam Samil Yetik, Arye Nehorai:
Performance Bounds on Image Registration. 117-120
Biomedical Imaging
- Jeffrey R. Fitzsimmons, Rui Yan, Deniz Erdogmus:
MRI Image Reconstruction via Homomorphic Signal Processing. 121-124 - Wei Huang, Yibin Zheng, Janelle A. Molloy:
3D Ultrasound Image Reconstruction from Non-Uniform Resolution Freehand Slices. 125-128 - Nilanjan Ray, Scott T. Acton:
Spatiotemporal Segmentation for Validation of Rolling Leukocyte Tracking Data. 129-132 - Thomas E. Merryman, Jelena Kovacevic, Elvira Garcia Osuna, Robert F. Murphy:
Adaptive Multirate Data Acquisition of 3D Cell Images. 133-136 - Jae-Min Lee, Jing Hu, Jianbo Gao, Keith D. White, Bruce Crosson, Christina E. Wierenga, Keith M. McGregor, Kyung K. Peck:
Identification of brain activity by fractal scaling analysis of functional MRI data. 137-140 - William Scott Hoge, Lei Zhao, Dana H. Brooks, Walid E. Kyriakos:
Sampling Strategies to Enable Computationally Efficient SPACE-RIP for 3D Parallel MR Imaging. 141-144
Image and Video Indexing and Retrieval
- Eli Saber, Yaowu Xu, A. Murat Tekalp:
Object Recognition by Partial Shape Matching Guided Search. 145-148 - Raghavendra Singh, Ravi Kothari:
Use of Orthogonal Arrays to Aid Relevance Feedback in Content Based Image Retrieval Systems. 149-152 - Janne Argillander, Giridharan Iyengar, Harriet J. Nock:
Semantic Annotation of Multimedia using Maximum Entropy Models. 153-156 - Masami Mizutani, Shahram Ebadollahi, Shih-Fu Chang:
Commercial Detection in Heterogeneous Video Streams Using Fused Multi-Modal and Temporal Features. 157-160 - Stephan Reiter, Sascha Schreiber, Gerhard Rigoll:
Multimodal Meeting Analysis by Segmentation and Classification of Meeting Events based on a Higher Level Semantic Approach. 161-164 - Youssef Stitou, Flavius Turcu, Mohamed Najim, Larbi Radouane:
3-D Texture Characterization Based on Wold Decomposition and Higher Order Statistics. 165-168
Video Coding and Transmission
- Chowdary Adsumilli, Sanjit K. Mitra:
Error Concealment In Video Communications Using DPCM Bit Stream Embedding. 169-172 - Hua Yang, Kenneth Rose:
Rate-Distortion Optimized Motion Estimation for Error Resilient Video Coding. 173-176 - Georgia Feideropoulou, Béatrice Pesquet-Popescu, Jean-Claude Belfiore:
Bit Allocation Algorithm for Joint Source-Channel Coding of t+2D Video Sequences. 177-180 - Jie Liang, Chengjie Tu, Trac D. Tran, Lu Gan:
Wiener Filtering for Generalized Error Resilient Time Domain Lapped Transform. 181-184 - Oztan Harmanci, A. Murat Tekalp:
A Zero Error Propagation Extension to H264 for Low Delay Video Communications Over Lossy Channels. 185-188 - Sila Ekmekci, Pascal Frossard, Thomas Sikora:
Distortion Estimation for Temporal Layered Video Coding. 189-192
Image Coding
- Yan Huang, Ilya Pollak, Charles A. Bouman:
Image Compression with Multitree Tilings. 193-196 - Xin Li:
Contour Adaptive Image Coding. 197-200 - Yuan Yuan, Donald M. Monro:
Improved Matching Pursuits Image Coding. 201-204 - Joel Sole, Philippe Salembier:
Adaptive Generalized Prediction for Lifting Schemes. 205-208 - Matthew Gaubatz, Damon M. Chandler, Sheila S. Hemami:
Spatially-Selective Quantization and Coding for Wavelet-Based Image Compression. 209-212 - Jamel Hattay, Amel Benazza-Benyahia, Béatrice Pesquet:
Adaptive lifting for multicomponent image coding through quadtree partitioning. 213-216
Video Segmentation and Tracking
- Jacek Czyz, Branko Ristic, Benoît Macq:
A Color-based Particle Filter for Joint Detection and Tracking of Multiple Objects. 217-220 - Emilio Maggio, Andrea Cavallaro:
Hybrid Particle Filter and Mean Shift tracker with adaptive transition model. 221-224 - Nidhal Bouaynaya, Wei Qu, Dan Schonfeld:
An Online Motion-Based Particle Filter for Head Tracking Applications. 225-228 - Yang Ran, Qinfen Zheng, Isaac Weiss, Larry S. Davis:
Reliable Segmentation of Pedestrians in Moving Scenes. 229-232 - Jie Shao, Shaohua Kevin Zhou, Rama Chellappa:
Tracking Algorithm Using Background-Foreground Motion Models and Multiple Cues. 233-236 - Michail Krinidis, Georgios N. Stamou, Heinz Teutsch, Sascha Spors, Nikolaos Nikolaidis, Rudolf Rabenstein:
An Audio-Visual Database For Evaluating Person Tracking Algorithms. 237-240
Image Feature Extraction and Analysis
- Ke Huang, Selin Aviyente:
Mutual Information Based Subband Selection for Wavelet Packet Based Image Classification. 241-244 - Huibao Lin, Jennie Si, Glen P. Abousleman:
Migrating Orthogonal Rotation-Invariant Moments from Continuous to Discrete Space. 245-248 - Hao Chen, Pramod K. Varshney:
Feature subset selection with applications to hyperspectral data. 249-252 - Mert R. Sabuncu, Peter J. Ramadge:
Gradient Based Optimization of an EMST Image Registration Function. 253-256 - Klas Nordberg, Robert Söderberg:
Detection and Representation of Complex Local Features. 257-260 - Paul B. Albee, George C. Stockman:
Interest Points from the Radial Mass Transform. 261-264
Video Coding and Transmission
- Jun Hou, Xiangzhong Fang, Rong Hou:
Rate Control for Motion JPEG2000 Using Correlation Prediction. 265-268 - Eric Salemi, Claude Desset, Jan Cornelis, Peter Schelkens:
Additive Distortion Modeling for Unequal Error Protection of Scalable Multimedia Content. 269-272 - Alan S. Del Tredici, Joel A. Rosiene, Truong Nguyen:
Low-Latency Methods for Wireless Video Transmission. 273-276 - Grégoire Pau, Béatrice Pesquet-Popescu:
Four-Band Linear-Phase Orthogonal Spatial Filter Bank for Subband Video Coding. 277-280 - Ritesh Sood, Mihaela van der Schaar:
Optimal Upload Policies for P2P Networks with System Imposed Constraints. 281-284 - Chi-Wah Wong, Oscar C. Au, Raymond Chi-Wing Wong, Hong-Kwai Lam:
Piecewise Linear Model for Real-Time Rate Control. 285-288 - Sumohana S. Channappayya, Robert W. Heath Jr., Alan C. Bovik:
Multiple Description Image Coding Using Natural Scene Statistics. 289-292 - Daniel Persson, Per Hedelin:
A Statistical Approach to Packet Loss Concealment for Video. 293-296 - George Partasides, Lisimachos P. Kondi:
Scalable Video Transmission over Multirate GMC-CDMA Wireless Channels. 297-300 - Seishi Takamura, Yoshiyuki Yashima:
H.264-based Lossless Video Coding Using Adaptive Transforms. 301-304 - Chih-Ming Fu, Wen-Liang Hwang, Chung-Lin Huang:
Efficient Post-Compression Error-Resilient 3D-Scalable Video Transmission for Packet Erasure Channels. 305-308 - Tuyet-Trang Lam, Lina J. Karam, Rida A. Bazzi, Glen P. Abousleman:
Reduced-Delay Selective ARQ for Low Bit-Rate Image and Multimedia Data Transmission. 309-312 - Amir Asif, Uyen Trang Nguyen, Guohua Xu, Bin Song:
Streaming video with bandwidth adaptation and error concealment for low bit rate live wireless applications. 313-316
Video Coding
- Hakki A. Ilgin, Luis F. Chaparro:
Low Bit Rate Video Coding Using DCT-Based Fast Decimation/Interpolation and Embedded Zerotree Coding. 317-320 - Ping Li, Xiaokang Yang, Weisi Lin:
Buffer-constrained R-D Model-Based Rate Control for H.264/AVC. 321-324 - En-Hui Yang, Xiang Yu:
On Joint Optimization of Motion Compensation, Quantization and Baseline Entropy Coding in H.264 with Complete Decoder Compatibility. 325-328 - Emanuele Quacchio, Enrico Magli, Gabriella Olmo, Pierpaolo Baccichet, Antonio Chimienti:
Enhancing Whole-Frame Error Concealment with an Intra Motion Vector Estimator in H.264/AVC. 329-332 - Yi Zhao, Stanley C. Ahalt, Jianyu Dong:
Optimal Interleaving for 3-D Zerotree Wavelet Video Packets Over Burst Lossy Channels. 333-336 - Saengrawee Pratoomtong, Yu Hen Hu:
On-Chip Cache Algorithm Design for Multimedia SOC. 337-340 - Maryse R. Stoufs, Joeri Barbarien, Peter Schelkens, Jan Cornelis, Adrian Munteanu:
Robust Motion Vector Coding and Error Concealment in MCTF-Based Video Coding. 341-344 - Takayuki Nakachi, Tetsuro Fujii:
A Study on Non-octave Scalable Coding using Motion Compensated Inter-frame Wavelet Transform. 345-348 - Gökçe Dane, Khaled El-Maleh, Yen-Chi Lee:
Encoder-Assisted Adaptive Video Frame Interpolation. 349-352 - Marta Mrak, Nikola Sprljan, Ebroul Izquierdo:
A Resolution Adaptive Interpolation Technique for Enhanced Decoding of Scalable Coded Video. 353-356 - Byung Cheol Song, Kang Wook Chun:
Noise Power Estimation for Effective De-Noising in a Video Encoder. 357-360
Image Coding
- Nikolay N. Ponomarenko, Karen O. Egiazarian, Vladimir V. Lukin, Jaakko Astola:
Cascade Fractal Image Compression and its Modification. 361-364 - Demin Wang, Liang Zhang, André Vincent:
Improvement of JPEG2000 Using Curved Wavelet Transform. 365-368 - Xiang Xie, Guolin Li, Xiaowen Li, Xinkai Chen, Kun Yang, Chun Zhang, Zhihua Wang:
A New Near-Lossless Image Compression Method in Digital Image Sensors with Bayer Color Filter Arrays. 369-372 - Yong Tian, Xiangwei Kong:
An Improved Shape-Based Arbitrary Shape ROI Coding Method with SA-DWT in JPEG2000. 373-376 - Ko-Cheung Hui, Wan-Chi Siu:
New Pixel-DCT Domain Coding Technique for Object Based and Frame Based Prediction Error. 377-380 - Xiaodong Li, Ezzatollah Salari:
Embedded Image Compression Using a Classified Multistage VQ in Wavelet Domain. 381-384 - Harish Arora, Pramit Singh, Ekram Khan, Farid Ghani:
Memory Efficient Set Partitioning in Hierarchical Tree (MESH) for Wavelet Image Compression. 385-388 - Yan Meng, Linfeng Guo:
Color Image Coding by Utilizing the Crossed Masking. 389-392 - Yun Gong:
Classified Context Quantization of VQ Indexes for Image Compression. 393-396 - Zhibin Pan, Koji Kotani, Tadahiro Ohmi:
Fast Encoding Method for Vector Quantization Based on a New Mixed Pyramid Data Structure. 397-400 - Xiaoli Tang, William A. Pearlman:
Scalable Hyperspectral Image Coding. 401-404
Image and Video Indexing and Retrieval
- Zhihua He, Maja Bystrom:
A Sketch Image Retrieval System Using Directional Projection: DPSIR. 405-408 - Jingrui He, Changshui Zhang, Nanyuan Zhao, Hanghang Tong:
Boosting Web Image Search by Co-Ranking. 409-412 - George Tzagkarakis, Baltasar Beferull-Lozano, Panagiotis Tsakalides:
Rotation-Invariant Texture Retrieval with Gaussianized Steerable Pyramids. 413-416 - Balaji Iyer, Malcolm D. Macleod:
Color image retrieval using the datasieve. 417-420 - Yang Liu, Shuqiang Jiang, Qixiang Ye, Wen Gao, Qingming Huang:
Playfield Detection Using Adaptive GMM and Its Application. 421-424 - Ying Luo, Jenq-Neng Hwang:
A Comprehensive Coarse-To-Fine Sports Video Analysis Framework to Infer 3D Parameters of Video Objects with Application to Tennis Video Sequences. 425-428 - Syed G. Quadri, Sridhar Krishnan, Ling Guan:
Indexing of NFL Video using MPEG-7 Descriptors and MFCC features. 429-432 - Jinjun Wang, Engsiong Chng, Changsheng Xu:
Soccer replay detection using scene transition structure analysis. 433-436 - Jing-Fung Chen, Hong-Yuan Mark Liao, Chia-Wen Lin:
Fast Video Retrieval via the Statistics of Motion. 437-440 - Xue Mei, Mahesh Ramachandran, Shaohua Kevin Zhou:
Video Background Retrieval using Mosaic Images. 441-444 - Mikito Toguro, Kenji Suzuki, Pitoyo Hartono, Shuji Hashimoto:
Video Stream Retrieval Based on Temporal Feature of Frame Difference. 445-448 - Shiyan Hu:
Efficient Video Retrieval by Locality Sensitive Hashing. 449-452 - Haoran Yi, Deepu Rajan, Liang-Tien Chia:
Global Motion Compensated Key Frame Extraction from Compressed Videos. 453-456
Biomedical Imaging
- Mehmet N. Tek, Bahram Shafai:
Enhancing the Depth Resolution of Contactless Electrical Conductivity Imaging. 457-460 - Miki Haseyama, Yukari Sasamura:
Effective apoptotic cell extraction from video microscopy images. 461-464 - Jing Jiang, Ming Dong, E. Mark Haacke:
ARGDYP: an Adaptive Region Growing and DYnamic Programming Algorithm for Stenosis Detection in MRI. 465-468 - Hongqing Zhu, Jian Zhou, Huazhong Shu, Limin Luo:
A Edge-Preserving Minimum Cross-Entropy Algorithm for Pet Image Reconstruction Using Multiphase Level Set Method. 469-472 - Rui Yan, Guojun He, Deniz Erdogmus, Sung-Phil Kim, José C. Príncipe, Yijun Liu:
Separating Spatial and Temporal Activation Patterns in fMRI Using Competitive Subspace Projection. 473-476 - Hao Tan, Yibin Zheng:
Point Spread Function Optimization for MRI Reconstruction. 477-480 - Can Evren Yarman, Birsen Yazici:
Radon Transform Inversion Based on Harmonic Analysis of the Euclidean Motion Group. 481-484 - Saeed Babaeizadeh, Dana H. Brooks, David Isaacson:
A Deformable-radius B-Spline Method for Shape-based Inverse Problems, as Applied to Electrical Impedance Tomography. 485-488 - Magali Sasso, C. Cohen-Bacrie:
medical ultrasound imaging using the fully adaptive beamformer. 489-492 - Yingge Wang, Qiang Cheng, Jie Cheng:
SNR Analysis for Phased-Array MRI. 493-496 - Geovanni Martínez, Jan-Gerd Frerichs, Klaus Joeris, Konstantin Kontantinov, Thomas Scheper:
Cell Density Estimation from a Still Image for In-Situ Microscopy. 497-500
Image/Video Storage, Retrieval and Authentication
- Ying Liu, Dengsheng Zhang, Guojun Lu, Wei-Ying Ma:
Deriving High-Level Concepts Using Fuzzy-ID3 Decision Tree for Image Retrieval. 501-504 - Huijuan Yang, Alex C. Kot:
Data Hiding For Text Document Image Authentication by Connectivity-Preserving. 505-508 - Wei Jiang, Guihua Er, Qionghai Dai, Lian Zhong, Yao Hou:
Relevance Feedback Learning With Feature Selection In Region-Based Image Retrieval. 509-512 - David Liu, Tsuhan Chen:
Probabilistic Relevance Feedback with Binary Semantic Feature Vectors. 513-516 - Dong Wang, Dayong Ding, Le Chen, Shen Zhang, Fuzong Lin, Bo Zhang:
Two kinds of timing cues and their usage in concept detection in news video. 517-520 - Cristian Perra, Daniele D. Giusto:
A Framework for Image Based Authentication. 521-524 - Seungjae Lee, Dalwon Jang, Chang D. Yoo:
An SVD-Based Watermarking Method for Image Content Authentication with Improved Security. 525-528 - Hongmei Gou, Min Wu:
Robust Digital Fingerprinting for Curves. 529-532 - Siyue Chen, Henry Leung:
A Chaotic Authentication Technique for Digital Video Surveillance. 533-536 - Kyle Petrowski, Mehdi Kharrazi, Hüsrev T. Sencar, Nasir D. Memon:
PSteg: steganographic embedding through patching. 537-540 - Jian Zhou, Xiao-Ping Zhang:
Video Shot Boundary Detection Using Independent Component Analysis. 541-544 - Claudio S. V. C. Cavalcanti, Carlos A. B. Mello, Milena P. S. Rodrigues:
Generating True Color Paper Textures of Historical Documents. 545-548 - Marios Kyperountas, Anastasios Tefas, Ioannis Pitas:
Methods for improving discriminant analysis for face authentication. 549-552
Image Formation- Image Representation and Quality Assessment
- Mylène C. Q. Farias, John M. Foley, Sanjit K. Mitra:
Detectability and Annoyance of Synthetic Blockiness, Blurriness, Noisiness, and Ringing in Video Sequences. 553-556 - Mark A. Miller, Nick G. Kingsbury, Richard W. Hobbs:
Seismic Imaging Using Complex Wavelets. 557-560 - Ti-Chiun Chang, Jan P. Allebach:
A new framework for characterization of halftone textures. 561-564 - Zhaohui Sun:
A Method to Generate Halftone Video. 565-568 - Zhanfeng Yue, Rama Chellappa:
Pose-Normalized View Synthesis From Silhouettes. 569-572 - Zhou Wang, Eero P. Simoncelli:
Translation Insensitive Image Similarity in Complex Wavelet Domain. 573-576 - Zhigang Su, Yingning Peng, Xiutan Wang:
Non-Iterative Imaging Algorithm for CLSAR. 577-580 - Ee Ping Ong, Xiaokang Yang, Weisi Lin, Zhongkang Lu, Susu Yao:
Perceptual Quality Metric For Compressed Videos. 581-584 - Athanasios Leontaris, Amy R. Reibman:
Comparison of blocking and blurring metrics for video compression. 585-588 - Ha T. Nguyen, Minh N. Do:
Image-Based Rendering with Depth Information Using the Propagation Algorithm. 589-592 - Sally L. Wood, Bonnie J. Smithson, Dinesh Rajan, Marc P. Christensen:
Performance of a MVE Algorithm for Compound Eye Image Reconstruction Using Lens Diversity. 593-596 - Tejaswini Mirani, Marc P. Christensen, Scott C. Douglas, Dinesh Rajan, Sally L. Wood:
Optimal Co-design Of Computational Imaging System. 597-600 - Bao Guan, Hong Sun:
Turbo Iterative Estimation of Singularity Structure in SAR Image based on Wavelet-domain Hidden Markov Models. 601-604 - Soo Hyun Bae, Moon-Cheol Kim, Biing-Hwang Juang:
3CCD interpolation using selective projection. 605-608
Image Restoration and Enhancement
- Mrityunjay Kumar, Pradeep Ramuhalli:
Dynamic Programming Based Multichannel Image Restoration. 609-612 - Zoran A. Ivanovski, Lina J. Karam, Glen P. Abousleman:
A Motion-Augmented Super-Resolution Scheme for Very Low-Bit-Rate Video Enhancement. 613-616 - Zhonghua Ma, Hong Ren Wu:
Classification Based Adaptive Vector Filter for Color Image Restoration. 617-620 - Gemma Pons Bernad, Laure Blanc-Féraud, Josiane Zerubia:
A Restoration Method for Confocal Microscopy Using Complex Wavelet Transform. 621-624 - Yong Lin, Russell C. Hardie, Kenneth E. Barner:
Subspace Partition Weighted Sum Filters for Image Deconvolution. 625-628 - Hongxin Wu, Liang Ji:
Blind Deconvolution Using a Monotonicity Constraint on the PSF. 629-632 - Blair Silver, Sos S. Agaian, Karen Panetta:
Contrast Entropy Based Image Enhancement and Logarithmic Transform Coefficient Histogram Shifting. 633-636 - Alexia Giannoula, Dimitrios Hatzinakos:
Recursive deconvolution of multisensor imagery using finite mixture distributions. 637-640 - Lixin Shen, Manos Papadakis, Ioannis A. Kakadiaris, Ioannis Konstantinidis, Donald Kouri, David K. Hoffman:
Image Denoising Using a Tight Frame. 641-644 - Zeyong Shan, Selin Aviyente:
Image Denoising Based on the Wavelet Co-Occurrence Matrix. 645-648 - Tai-Wai Chan, Oscar C. Au, Tak-Song Chong, Wing-San Chau:
A novel content-adaptive video denoising filter. 649-652 - Kjersti Engan, Karl Skretting, John Håkon Husøy:
Denoising of Images Using Designed Signal Dependent Frames and Matching Pursuit. 653-656
Video Segmentation and Tracking
- Krishna V. Tangirala, Kamesh Namuduri:
Object Tracking in Video Using Particle Filtering. 657-660 - Wei Qu, Nidhal Bouaynaya, Dan Schonfeld:
Automatic Multi-Head Detection and Tracking System using A Novel Detection-Based Particle Filter and Data Fusion. 661-664 - Jon Barker:
Tracking Facial Markers with an Adaptive Marker Collocation Model. 665-668 - Yigithan Dedeoglu, B. Ugur Töreyin, Ugur Güdükbay, A. Enis Çetin:
Real-Time Fire and Flame Detection in Video. 669-672 - Benjamín Castañeda, Juan C. Cockburn:
Reduced Support Vector Machines Applied to Real-Time Face Tracking. 673-676 - José Luis Landabaso, Montse Pardàs, Li-Qun Xu:
Hierarchical Representation of Scenes Using Activity Information. 677-680 - Shuqun Zhang:
Object Tracking in Unmanned Aerial Vehicle (UAV) Videos Using a Combined Approach. 681-684 - Juhua Zhu, Stuart C. Schwartz, Bede Liu:
A Transform Domain Approach to Real-Time Foreground Segmentation in Video Sequences. 685-688 - Xiaomu Song, Guoliang Fan:
Key-frame extraction for Object-based Video Segmentation. 689-692 - Yang Yu, David S. Doermann:
Model of Object-Based Coding for Surveillance Video. 693-696 - Wai Lok Woo, Kwok-Leung Chan:
Model-based human motion analysis in monocular video. 697-700 - Soumya Hamlaoui, Franck Davoine:
Facial Action Tracking Using an AAM-Based Condensation Approach. 701-704 - Amit K. Agrawal, Rama Chellappa:
Moving Object Segmentation and Dynamic Scene Reconstruction Using Two Frames. 705-708 - Naresh P. Cuntoor, B. Yegnanarayana, Rama Chellappa:
Interpretation of State Sequences in HMM for Activity Representation. 709-712
Image Segmentation
- Qiang Wu, Xiangjian He, Tom Hintz:
Bi-Lateral Filtering Based Edge Detection on Hexagonal Architecture. 713-716 - Nicolas Brunel, Wojciech Pieczynski, Stéphane Derrode:
Copulas in Vectorial Hidden Markov Chains for Multicomponent Image Segmentation. 717-720 - Shengyou Lin, Qiushuang Zhang, Jiaoying Shi:
Alpha Estimation in Perceptual Color Space. 721-724 - Matei Mancas, Bernard Gosselin, Benoît Macq:
Fast and Automatic Tumoral Area Localisation using Symmetry. 725-728 - Li-Qun Xu, José Luis Landabaso, Montse Pardàs:
Shadow Removal with Blob-Based Morphological Reconstruction for Error Correction. 729-732 - Smadar Gefen, Louise Bertrand, Nahum Kiryati, Jonathan Nissanov:
Localization of Sections Within the Brain Via 2D to 3D Image Registration. 733-736 - Nicolas Passat, Christian Ronse, Joseph Baruthio, Jean-Paul Armspach:
Automatic Parameterization of Grey-Level Hit-or-Miss Operators for Brain Vessel Segmentation. 737-740 - Mehrdad Yaghoobi, Hamid R. Rabiee, Mohammed Ghanbari, Mohammad Bagher Shamsollahi:
A New Image Texture Extraction Algorithm Based on Matching Pursuit Gabor Wavelets. 741-744 - Yongsheng Pan, J. Douglas Birdwell, Seddik M. Djouadi:
A New Gradient and Region Based Geometric Snake. 745-748 - Huiyu Zhou, Tangwei Liu, Huosheng Hu, Yusheng Pang, Faquan Lin, Ji Wu:
A hybrid framework for image segmentation. 749-752 - Xiqun Lu, Binwei Yang:
Segmentation of Color Textile Images Based on a Multiscale Context Model. 753-756 - Mona Omidyeganeh, Kambiz Nayebi, Reza Azmi, Abbas Javadtalab:
A New Segmentation Technique for Multi Font Farsi/Arabic Texts. 757-760
Feature Extraction and Analysis
- Hui Kong, Eam Khwang Teoh, Jian-Gang Wang, Ronda Venkateswarlu:
Two Dimensional Fisher Discriminant Analysis: Forget About Small Sample Size Problem. 761-764 - R. Venkateswara Rao, Sumana Gupta:
Texture Analysis and Synthesis using Angular Wavelet Frames. 765-768 - Frans Coetzee, Visvanathan Ramesh:
Semi-Automatic Probabilistic Morphological Detection. 769-772 - Yonghuai Liu, Guoqiang Fei, Baogang Wei, Longzhuang Li:
3D Free Form Surface Matching Based on Orientation Difference Length Distribution. 773-776 - Victor H. S. Ha, José M. F. Moura:
Robust reorientation of 2D shapes using the orientation indicator index. 777-780 - Alaa El. Sagheer, Naoyuki Tsuruta, Rin-Ichiro Taniguchi, Sakashi Maeda:
Visual speech features representation for automatic lip-reading. 781-784 - Jeffrey Ng, Anil A. Bharath:
Multiscale orientation estimation of perceptual boundaries. 785-788 - Turgay Çelik, Cem Direkoglu, Hüseyin Özkaramanli, Hasan Demirel, Mustafa Uyguroglu:
Region-based super-resolution aided facial feature extraction from low-resolution sequences. 789-792
Authentication and Watermarking
- Ming Jiang, Edward K. Wong, Nasir D. Memon, Xiaolin Wu:
Steganalysis of halftone images. 793-796 - Fabrício Ourique, Vinicius Licks, Ramiro Jordan, Fernando Pérez-González:
Angle QIM: a novel watermark embedding scheme robust against amplitude scaling distortions. 797-800 - Vinicius Licks, Fabrício Ourique, Ramiro Jordan, Fernando Pérez-González:
An Exact Expression for the Bit Error Probability in Angle QIM Watermarking Under Simultaneous Amplitude Scaling and AWGN Attacks. 801-804 - Hussein Joumaa, Franck Davoine:
An ICA based algorithm for video watermarking. 805-808 - Paulo Vinicius Koerich Borges, Joceli Mayer:
Informed Positional Embedding for Multi-bit Watermarking. 809-812 - Qian Zhang, Nigel Boston:
A Cryptanalytic Method for Embedding Video Watermarks. 813-816 - Xingliang Huang, Bo Zhang:
Perceptual Watermarking Using a Wavelet Visible Difference Predictor. 817-820 - Mingqiao Wu, Zhongliang Zhu, Shiyao Jin:
Digital Image Steganography Algorithm Based on Iterative Blending. 821-824 - Jing-Ming Guo, Soo-Chang Pei, Hua Lee:
Watermarking in Halftone Images with Parity-Matched Error Diffusion. 825-828 - Xiaojun Qi, Ji Qi:
Image content-based geometric transformation resistant watermarking approach. 829-832 - Bijan G. Mobasseri, Yimin Zhang, Moeness G. Amin, Behzad Mohammadi Dogahe:
Designing robust watermarks using polynomial phase exponentials [image watermarking]. 833-836 - Yongliang Liu:
Commitment based watermark detection protocols [multimedia watermarking applications]. 837-840 - Shiyan Hu:
Document Image Watermarking Algorithm Based on Neighborhood Pixel Ratio. 841-844
Image and Video Processing
- Laurent Condat, Annick Montanvert:
A Framework for Image Magnification: Induction Revisited. 845-848 - Amit K. Roy-Chowdhury:
An algorithm for 3D reconstruction of deformable shape sequences. 849-852 - Saikiran S. Thunuguntla, Bahadir K. Gunturk:
Feature-Based Image Registration in Log-Polar Domain. 853-856 - Toshiyuki Ono, Hiroshi Hasegawa, Isao Yamada, Kohichi Sakaniwa:
An adaptive super-resolution of videos with noise information on camera systems. 857-860 - Rami R. Hagege, Joseph M. Francos:
Parametric estimation of multi-dimensional affine transformations: an exact linear solution [image recognition applications]. 861-864 - Bahadir K. Gunturk:
Handling exposure time in multi-frame image restoration. 865-868 - Hassan Foroosh, Murat Balci, Xiaochun Cao:
Self-calibrated reconstruction of partially viewed symmetric objects. 869-872 - S. Pavan, Sridhar Gangadharpalli, V. Sridhar:
Multivariate entropy detector based hybrid image registration. 873-876 - Xiaoyong Sun, Eric Dubois:
A matching-based view interpolation scheme. 877-880 - Xiaodong Huang, Eric Dubois:
Disparity estimation for the intermediate view interpolation of stereoscopic images. 881-884 - Gulcin Caner, A. Murat Tekalp, Gaurav Sharma, Wendi B. Heinzelman:
An adaptive filtering framework for image registration. 885-888 - Jiseok Liew, S. Lawrence Marple Jr.:
Three-dimensional fast algorithm solution for octant-based three-dimensional Yule-Walker equations. 889-892
Motion Detection and Estimation
- Konstantinos Rapantzikos, Michalis E. Zervakis:
Robust optical flow estimation in MPEG sequences. 893-896 - Toru Yamada, Masao Ikekawa, Ichiro Kuroda:
Fast and accurate motion estimation algorithm by adaptive search range and shape selection. 897-900 - Libo Yang, Keman Yu, Jiang Li, Shipeng Li:
Prediction-based directional fractional pixel motion estimation for H.264 video coding. 901-904 - Hasan F. Ates, Yucel Altunbasak:
SAD reuse in hierarchical motion estimation for the H.264 encoder. 905-908 - Li Song, Hongkai Xiong, Jizheng Xu, Feng Wu, Hui Su:
Adaptive predict based on fading compensation for lifting-based motion compensated temporal filtering. 909-912 - Yu-Lin Chang, Ching-Yeh Chen, Shyh-Feng Lin, Liang-Gee Chen:
Four field variable block size motion compensated adaptive de-interlacing. 913-916 - Xiaoquan Yi, Nam Ling:
Improved partial distortion search algorithm for rapid block motion estimation via dual-halfway-stop. 917-920 - Hoi-Ming Wong, Oscar C. Au, Jinxin Huang, Shiju Zhang, Winnie N. Yan:
Sub-optimal quarter-pixel inter-prediction algorithm (SQIA). 921-924 - Murat Balci, Hassan Foroosh:
Inferring motion from the rank constraint of the phase matrix. 925-928 - Yin Sun, Feng Pan, Ashraf A. Kassim:
Perceptually adaptive rate-distortion optimization for variable block size motion alignment in 3D wavelet coding. 929-932 - Serdar Ince, Janusz Konrad:
Geometry-based estimation of occlusions from video frame pairs. 933-936 - Mingxing Hu, Gordon Dodds, Baozong Yuan:
Improved methods for fundamental matrix estimation based on evolutionary agents [computer vision applications]. 937-940 - Thommen Korah, Christopher Rasmussen:
Aligning sequences from multiple cameras. 941-944 - Dongjiang Xu, Takis Kasparis:
Robust image registration under spatially non-uniform brightness changes. 945-948
Biometrics II
- Wen-Shiung Chen, Kun-Huei Chih, Sheng-Wen Shih, Chih-Ming Hsieh:
Personal identification technique based on human iris recognition with wavelet transform. 949-952 - Sinjini Mitra, Marios Savvides:
Analyzing asymmetry biometric in the frequency domain for face recognition. 953-956 - Chih-Pin Liao, Jen-Tzung Chien:
Nonsingular discriminant feature extraction for face recognition. 957-960 - Yingzi Du, Bradford Bonney, Robert W. Ives, Delores M. Etter, Robert Schultz:
Analysis of partial iris recognition using a 1D approach. 961-964 - Isao Nakanishi, Hiroyuki Sakamoto, Yoshio Itoh, Yutaka Fukui:
Multi-matcher on-line signature verification system in DWT domain. 965-968 - Muhammad Bilal Ahmad, Tae-Sun Choi:
Fast and accurate 3D shape from focus using dynamic programming optimization technique. 969-972 - Jani Mäntyjärvi, Mikko Lindholm, Elena Vildjiounaite, Satu-Marja Mäkelä, Heikki Ailisto:
Identifying users of portable devices from gait pattern with accelerometers. 973-976 - Mahesh Ramachandran, Shaohua Kevin Zhou, Divya Jhalani, Rama Chellappa:
A method for converting a smiling face to a neutral face with applications to face recognition. 977-980 - Bo Du, Shiguang Shan, Laiyun Qing, Wen Gao:
Empirical comparisons of several preprocessing methods for illumination insensitive face recognition. 981-984 - Marius Tico, Markku Vehviläinen, Jukka Saarinen:
A method of fingerprint image enhancement based on second directional derivatives. 985-988
Image Filtering and Modeling
- Shuangteng Zhang, Ezzatollah Salari:
Image denoising using a neural network based non-linear filter in wavelet domain. 989-992 - Yao Nie, Hao-Song Kong, Anthony Vetro, Huifang Sun, Kenneth E. Barner:
Fast adaptive fuzzy post-filtering for coding artifacts removal in interlaced video. 993-996 - Tan Shan, Xiangrong Zhang, Licheng Jiao:
Dual ridgelet frame constructed using biorthogonal wavelet basis. 997-1000 - Luc Klaine, Benoît Vozel, Kacem Chehdi:
An integro-differential method for adaptive filtering of additive or multiplicative noise. 1001-1004 - Jongmyon Kim, Linda M. Wills, D. Scott Wills:
Effective detection and elimination of impulse noise for reliable 4:2:0 YCbCr signals prior to compression encoding. 1005-1008 - Mamoun F. Al-Mistarihi, Emad S. Ebbini:
Quadratic pulse inversion ultrasonic imaging (QPI): detection of low-level harmonic activity of microbubble contrast agents [biomedical applications]. 1009-1012 - Lu Ren, Mengdao Xing, Zheng Bao, Haojun Chen:
Adaptive despeckling SAR images based on scale space correlation. 1013-1016 - Hanzi Wang, David Suter:
A re-evaluation of mixture of Gaussian background modeling [video signal processing applications]. 1017-1020 - Pierre Gacon, Pierre-Yves Coulon, Gérard Bailly:
Statistical active model for mouth components segmentation. 1021-1024 - Lei Qin, Wei Zeng, Wen Gao, Weiqiang Wang:
Local invariant descriptor for image matching. 1025-1028
Multimedia Security and Content Protection
- Shan He, Min Wu:
Improving collusion resistance of error correcting code based multimedia fingerprinting. 1029-1032 - Debargha Mukherjee, Huisheng Wang, Amir Said, Sam Liu:
Format independent encryption of generalized scalable bit-streams enabling arbitrary secure adaptations [multimedia communication applications]. 1033-1036 - Darko Kirovski, Mehmet Kivanç Mihçak:
Bounded Gaussian fingerprints and the gradient collusion attack [multimedia fingerprinting applications]. 1037-1040 - Ashwin Swaminathan, Yinian Mao, Min Wu:
Security of feature extraction in image hashing. 1041-1044 - H. Vicky Zhao, K. J. Ray Liu:
Fair collusion attacks on scalable video fingerprinting systems. 1045-1048 - Anastasios Tefas, Alexia Giannoula, Nikos Nikolaidis, Ioannis Pitas:
Enhanced transform-domain correlation-based audio watermarking. 1049-1052
Content Based Information Retrieval and Pattern Discovery
- Lexing Xie, Lyndon S. Kennedy, Shih-Fu Chang, Ajay Divakaran, Huifang Sun, Ching-Yung Lin:
Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams [video applications]. 1053-1056 - Chung-Yuan Chao, Huang-Chia Shih, Chung-Lin Huang:
Semantics-based highlight extraction of soccer program using DBN. 1057-1060 - Kai-Tat Fung, Wan-Chi Siu:
Diversity and importance measures for video downscaling. 1061-1064 - Tin Lay Nwe, Haizhou Li:
Broadcast news segmentation by audio type analysis. 1065-1068 - Lie Lu, Rui Cai, Alan Hanjalic:
Towards a unified framework for content-based audio analysis. 1069-1072 - Rui Cai, Lie Lu, Lian-Hong Cai:
Unsupervised auditory scene categorization via key audio effects and information-theoretic co-clustering. 1073-1076
Multimedia Signal Processing I
- Shiyu Li, Masahiro Okuda, Shinichi Takahashi:
Kinematics based motion compression for human figure animation. 1077-1080 - Volkan Cevher, James H. McClellan:
Proposal strategies for joint state-space tracking with particle filters. 1081-1084 - Stefan Hoch, Frank Althoff, Gregor McGlaun, Gerhard Rigoll:
Bimodal fusion of emotional data in an automotive environment. 1085-1088 - Dihong Tian, Ghassan Al-Regib:
Progressive streaming of textured 3D models over bandwidth-limited channels. 1089-1092 - Hyunjung Shim, Tsuhan Chen:
A statistical framework for image-based relighting. 1093-1096 - Peter Hinterseer, Eckehard G. Steinbach, Sandra Hirche, Martin Buss:
A novel, psychophysically motivated transmission approach for haptic data streams in telepresence and teleaction systems. 1097-1100 - Guan-Ming Su, Zhu Han, Min Wu, K. J. Ray Liu:
Joint uplink and downlink optimization for video conferencing over wireless LAN. 1101-1104 - Hsu-Feng Hsiao, Aik Chindapol, James A. Ritcey, Yaw-Chung Chen, Jenq-Neng Hwang:
A new multimedia packet loss classification algorithm for congestion control over wired/wireless channels. 1105-1108 - Jari Mäkinen, Bruno Bessette, Stefan Bruhn, Pasi Ojala, Redwan Salami, Anisse Taleb:
AMR-WB+: a new audio coding standard for 3rd generation mobile audio services. 1109-1112 - Li-wei He, Zhengyou Zhang:
Real-time whiteboard capture and processing using a video camera for teleconferencing. 1113-1116 - Carlos Busso, Sergi Hernanz, Chi-Wei Chu, Soonil Kwon, Sung Lee, Panayiotis G. Georgiou, Isaac Cohen, Shrikanth S. Narayanan:
Smart room: participant and speaker localization and identification. 1117-1120 - Yongli Hu, Baocai Yin, Yanfeng Sun:
Multi-lighting 3D face morphable model based on mesh resampling. 1121-1124 - Yongjin Wang, Ling Guan:
Recognizing human emotion from audiovisual information. 1125-1128 - Tsungnan Lin, Chiapin Wang, Po-Chiang Lin:
A neural network based context-aware handoff algorithm for multimedia computing. 1129-1132
Multimedia Signal Processing II
- Anamitra Makur:
Self-embedding and restoration algorithms for document watermark. 1133-1136 - Ihab Amer, Wael M. Badawy, Graham A. Jullien:
A high-performance hardware implementation of the H.264 simplified 8×8 transformation and quantization [video coding]. 1137-1140 - Gouri Landge, Mihaela van der Schaar, Venkatesh Akella:
Complexity metric driven energy optimization framework for implementing MPEG-21 scalable video decoders. 1141-1144 - Masanori Sano, Hideki Sumiyoshi, Masahiro Shibata, Nobuyuki Yagi:
Generating metadata from acoustic and speech data in live broadcasting. 1145-1148 - Chuanjun Li, Balakrishnan Prabhakaran, Si-Qing Zheng:
Similarity measure for multi-attribute data [haptic data recognition]. 1149-1152 - Hideki Asoh, Isao Hara, Futoshi Asano, Kiyoshi Yamamoto:
Tracking human speech events using a particle filter. 1153-1156 - Gee Rittenhouse, Haitao Zheng:
Providing VOIP service in UMTS-HSDPA with frame aggregation. 1157-1160 - HweeHwa Pang, Yongdong Wu:
Evaluation of MPEG-4 IPMP extension. 1161-1164 - Zhiquan Lu, Xiao-Ping Zhang:
Robust image watermarking based on the wavelet contour detection. 1165-1168 - Xi Shao, Namunu C. Maddage, Changsheng Xu, Mohan S. Kankanhalli:
Automatic music summarization based on music structure analysis. 1169-1172 - Yi Wang, Gang Qian, Thanassis Rikakis:
Robust pause detection using 3D motion capture data for interactive dance. 1173-1176 - Lun-Chia Kuo, Sheng-Jyh Wang:
A flexible architecture for feature-based image editing. 1177-1180
Volume 3
Applications to Music
- Jyri Pakarinen, Matti Karjalainen, Vesa Välimäki, Stefan Bilbao:
Energy behavior in time-varying fractional delay filters for physical modeling synthesis of musical instruments. 1-4 - Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
Audio stream segregation of multi-pitch music signal based on time-space clustering using Gaussian kernel 2-dimensional model. 5-8 - Christopher J. C. Burges, Dan Plastina, John C. Platt, Erin Renshaw, Henrique S. Malvar:
Using audio fingerprinting for duplicate detection and thumbnail generation. 9-12 - Line Ørtoft Endelt, Anders la Cour-Harbo:
Comparison of methods for sparse representation of musical signals. 13-16 - Yipeng Li, DeLiang Wang:
Detecting pitch of singing voice in polyphonic audio. 17-20 - Stefan Petrausch, José Escolano, Rudolf Rabenstein:
A general approach to block-based physical modeling with mixed modeling strategies for digital sound synthesis. 21-24
Auditory Modeling and Hearing Aids
- Elisabet Molin, Arne Leijon, Helene Wallsten:
Spectro-temporal discrimination in cochlear implant users. 25-28 - Thomas J. Klasen, Marc Moonen, Tim Van den Bogaert, Jan Wouters:
Preservation of interaural time delay for binaural hearing aids through multi-channel Wiener filtering based noise reduction. 29-32 - Chuping Liu, Qian-Jie Fu:
Relating the acoustic space of vowels to the perceptual space in cochlear implant simulations. 33-36 - Nathan Lesser, Daniel P. W. Ellis:
Clap detection and discrimination for rhythm therapy. 37-40 - Xianbo Xiao, Guangshu Hu, Chunhong Liu:
A spectral cues preserving compression algorithm for digital hearing aid. 41-44 - Michael Wirtzfeld, Vijay Parsa:
On subband adaptive modeling of compression hearing aids. 45-48
Loudspeaker and Microphone Array Signal Processing
- Jingdong Chen, Yiteng Huang, Jacob Benesty:
Time delay estimation via multichannel cross-correlation [audio signal processing applications]. 49-52 - Peter Kassakian:
Magnitude least-squares fitting via semidefinite programming with applications to beamforming and multidimensional filter design. 53-56 - Kazuho Ono, Akio Ando:
A system for separating sound sources propagated in the same direction. 57-60 - Hiroshi Sawada, Shoko Araki, Ryo Mukai, Shoji Makino:
Blind extraction of a dominant source signal from mixtures of many sources [audio source separation applications]. 61-64 - Alan Davis, Siow Yong Low, Sven Nordholm, Nedelko Grbic:
A subband space constrained beamformer incorporating voice activity detection [speech enhancement applications]. 65-68 - Siow Yong Low, Sven Nordholm:
A blind approach to joint noise and acoustic echo cancellation. 69-72 - Vikas C. Raykar, Ramani Duraiswami:
Approximate expressions for the mean and the covariance of the maximum likelihood estimator for acoustic source localization. 73-76 - Wolfgang Herbordt, Satoshi Nakamura, Walter Kellermann:
Joint optimization of LCMV beamforming and acoustic echo cancellation for automatic speech recognition. 77-80 - Shoko Araki, Shoji Makino, Hiroshi Sawada, Ryo Mukai:
Reducing musical noise by a fine-shift overlap-add method applied to source separation using a time-frequency mask. 81-84 - Satoshi Ukai, Tomoya Takatani, Tsuyoki Nishikawa, Hiroshi Saruwatari:
Blind source separation combining SIMO-model-based ICA and adaptive beamforming. 85-88 - Heinz Teutsch, Walter Kellermann:
EB-ESPRIT: 2D localization of multiple wideband acoustic sources using eigen-beams. 89-92 - Yong Rui, Dinei A. F. Florêncio, Warren Lam, Jinyan Su:
Sound source localization for circular arrays of directional microphones. 93-96 - Herbert Buchner, Robert Aichner, Jochen Stenglein, Heinz Teutsch, Walter Kellermann:
Simultaneous localization of multiple sound sources using blind adaptive MIMO filtering. 97-100 - Ivan Tashev, Henrique S. Malvar:
A new beamformer design algorithm for microphone arrays. 101-104
Echo Cancellation, Active Noise Control, and Transducers
- Fabian Kuech, Andreas Mitnacht, Walter Kellermann:
Nonlinear acoustic echo cancellation using adaptive orthogonalized power filters. 105-108 - Giovanni L. Sicuranza, Alberto Carini:
Nonlinear multichannel active noise control using partial updates [acoustic noise control]. 109-112 - Yuriy V. Zakharov, Tim C. Tozer:
Spectral domain B-spline identification in acoustic echo cancellation. 113-116 - Kazuhiro Kondo, Kiyoshi Nakagawa:
Experimental evaluation of an active speech control method. 117-120 - Felix Albu, Constantine Kotropoulos:
Modified Gauss-Seidel affine projection algorithm for acoustic echo cancellation. 121-124 - Dayong Zhou, Victor E. DeBrunner:
ANC algorithms that do not require identifying the secondary path. 125-128 - Ann Spriet, Ian K. Proudler, Marc Moonen, Jan Wouters:
An instrumental variable method for adaptive feedback cancellation in hearing aids. 129-132 - Andy W. H. Khong, Patrick A. Naylor:
A family of selective-tap algorithms for stereo acoustic echo cancellation. 133-136 - Karl-Dirk Kammeyer, Markus Kallinger, Alfred Mertins:
New aspects of combining echo cancellers with beamformers. 137-140 - Per Åhgren, Andreas Jakobsson:
A study of double-talk detection performance in the presence of acoustic echo path changes. 141-144 - Ricardo A. Ribeiro, António Joaquim Serralheiro, Moisés Simões Piedade:
Application of Kalman and RLS adaptive algorithms to non-linear loudspeaker controller parameter estimation: a case study. 145-148 - Yoshinobu Kajikawa, Yasuo Nomura:
Multi-channel active noise control with freely movable error microphones. 149-152 - Akihiko Sugiyama, Jérôme Berclaz, Miki Sato:
Noise-robust double-talk detection based on normalized cross correlation and a noise offset. 153-156
Broadband and Perceptual Coding
- Zeph Landau, Darko Kirovski:
Parameter analysis for GLZ audio compression. 157-160 - Fredrik Nordén, Mads Græsbøll Christensen, Søren Holdt Jensen:
Open loop rate-distortion optimized audio coding. 161-164 - Mads Græsbøll Christensen, Andreas Jakobsson, Søren Vang Andersen, Søren Holdt Jensen:
Linear AM decomposition for sinusoidal audio coding. 165-168 - Rongshan Yu, Xiao Lin, Susanto Rahardja, Chi Chung Ko, Haibin Huang:
Improving coding efficiency for MPEG-4 Audio Scalable Lossless coding. 169-172 - Dong-Yan Huang, Xinrong Su, Arumugam Nallanathan:
Characterization of a cascade LMS predictor. 173-176 - Pim Korten, Jesper Jensen, Richard Heusdens:
High resolution spherical quantization of sinusoidal parameters using a perceptual distortion measure. 177-180 - Stefan Wabnik, Gerald Schuller, Ulrich Kraemer, Jens Hirschfeld:
Frequency warping in low delay audio coding. 181-184 - Anisse Taleb, Patrik Sandgren, Ingemar Johansson, Daniel Enström, Stefan Bruhn:
Partial spectral loss concealment in transform coders. 185-188 - Rahul Vanam, Charles D. Creusere:
Evaluating low bitrate scalable audio quality using advanced version of PEAQ and energy equalization approach. 189-192 - Richard Heusdens, Jesper Jensen:
Jointly optimal time segmentation, component selection and quantization for sinusoidal coding of audio and speech. 193-196 - Ricky Der, Peter Kabal, Wai-Yip Chan:
Rate-distortion allocation for time-frequency dependent audio coding. 197-200 - Mototsugu Abe, Julius O. Smith III:
AM/FM rate estimation for time-varying sinusoidal modeling. 201-204
Applications to Music
- Olivier Gillet, Gaël Richard:
Automatic transcription of drum sequences using audiovisual features. 205-208 - Henri Penttinen, Jaakko Siiskonen, Vesa Välimäki:
Acoustic guitar plucking point estimation in real time. 209-212 - Jin S. Seo, Minho Jin, Sunil Lee, Dalwon Jang, Seungjae Lee, Chang D. Yoo:
Audio fingerprinting based on normalized spectral subband centroids. 213-216 - Jun Yin, Terence Sim, Ye Wang, Arun Shenoy:
Music transcription using an instrument model. 217-220 - Aaron S. Master, Kyogu Lee:
Explicit onset modeling of sinusoids using time reassignment. 221-224 - Chunghsin Yeh, Axel Röbel, Xavier Rodet:
Multiple fundamental frequency estimation of polyphonic music signals. 225-228 - Mathieu Lagrange, Sylvain Marchand, Jean-Bernard Rault:
Tracking partials for the sinusoidal modeling of polyphonic sounds. 229-232