8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - INTERSPEECH 2003, Geneva, Switzerland, September 1-4, 2003.
Kenneth Ward Church
: Speech and language processing: where have we been and where are we going?
: Auditory principles in speech processing - do computers need silicon ears ?
Aurora Noise Robustness on SMALL Vocabulary Databases
, Qiang Huo
: Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks.
ISCA Special Interest Group Session:
"Hot Topics" in Speech Science and Technology
: Strategies for automatic multi-tier annotation of spoken language corpora.
Speech Signal Processing 1-4
G. V. Kiran
, Thippur V. Sreenivas
: A novel method of analysing and comparing responses of hearing aid algorithms using auditory time-frequency representation.
, Issam Bazzi
, Alex Acero
: Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint.
: A new approach to voice activity detection based on self-organizing maps.
, Noureddine Ellouze
: Local regularity analysis at glottal opening and closure instants in electroglottogram signal using wavelet transform modulus maxima.
: Modulation spectrum for pitch and speech pause detection.
, Michael Lenz
: Estimation of the parameters of the quantitative intonation model with continuous wavelet analysis.
: A new HMM-based approach to broad phonetic classification of speech.
David N. Levin
: Blind normalization of speech from different channels.
Phonology and Phonetics I
: Reaction time as an indicator of discrete intonational contrasts in English.
: Independent automatic segmentation by self-learning categorial pronunciation rules.
: Accentual lengthening in standard Chinese: evidence from four-syllable constituents.
: Syllable structure based phonetic units for context-dependent continuous Thai speech recognition.
: An acoustic phonetic analysis of diphthongs in ningbo Chinese.
Topics in Prosody and Emotional Speech
: Automatic prosodic prominence detection in speech using acoustic features: an unsupervised system.
Language Modeling, Discourse and Dialog
: Syllable classification using articulatory-acoustic features.
: Recognition of out-of-vocabulary words with sub-lexical language models.
: A corpus-based decompounding algorithm for German lexical modeling in LVCSR.
, Minhwa Chung
: Modeling cross-morpheme pronunciation variations for korean large vocabulary continuous speech recognition.
Unit Selection 1, 2
Aurora Noise Robustness on LARGE Vocabulary Databases
, Hermann Ney
: Evaluation of quantile based histogram equalization with filter combination on the Aurora 3 and 4 databases.
Multilingual Speech-to-Speech Translation
: The statistical approach to machine translation and a roadmap for speech translation.
: Coupling vs. unifying: modeling techniques for speech-to-speech translation.
, Ahmed Badran
, Alan W. Black
, Robert E. Frederking
, Donna Gates
, Alon Lavie
, Lori S. Levin
, Kevin A. Lenzo
, Laura Mayfield Tomokiyo
, Jürgen Reichert
, Tanja Schultz
, Dorcas Wallace
, Monika Woszczyna
, Jing Zhang
: Speechalator: two-way speech-to-speech translation on a consumer PDA.
: Improving a connectionist based syntactical language model.
Speech Modeling and Features 1-4
, Hisashi Kawai
: Tone pattern discrimination combining parametric modeling and maximum likelihood estimation.
: A DTW-based DAG technique for speech and speaker feature analysis.
: On the role of intonation in the organization of Mandarin Chinese speech prosody.
, Hermann Ney
: A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition.
, Richard M. Stern
: Feature generation based on maximum classification probability for improved speech recognition.
: Harmonic weighting for all-pole modeling of the voiced speech.
: Locus equations determination using the speechdat(II).
Speech Enhancement 1, 2
, Eran Fishler
: Microphone array voice activity detection and noise suppression using wideband generalized likelihood ratio.
, Lin-Shan Lee
: Perceptually-constrained generalized singular value decomposition-based approach for enhancing speech corrupted by colored noise.
D. G. Raza
, C. F. Chan
: Quality enhancement of CELP coded speech by using an MFCC based Gaussian mixture model.
, Lin-Shan Lee
: Speech enhancement and improved recognition accuracy by integrating wavelet transform and spectral subtraction algorithm.
: Speech enhancement for hands-free car phones by adaptive compensation of harmonic engine noise components.
Spoken Dialog Systems 1, 2
, Joakim Gustafson
: Child and adult speaker adaptation during error resolution in a publicly available spoken dialogue system.
Lars Bo Larsen
: Assessment of spoken dialogue system usability - what are we really measuring?
Robust Speech Recognition - Noise Compensation
Forensic Speaker Recognition
: Automated speaker recognition in real world conditions: controlling the uncontrollable.
Emotion in Speech
Dialog System User and Domain Modeling
Topics in Speech Recognition and Segmentation
, Brian Mak
: Joint estimation of thresholds in a bi-threshold verification problem.
, Man-Hung Siu
: A new approach to minimize utterance verification error rate for a specific operating point.
Robust Speech Recognition - Acoustic Modeling
, Qiang Huo
: A switching linear Gaussian hidden Markov model and its application to nonstationary noise compensation for robust speech recognition.
, Peng Ding
, Bo Xu
: Joint model and feature based compensation for robust speech recognition under non-stationary noise environments.
Advanced Machine Learning Algorithms for Speech and Language Processing
Sam T. Roweis
: Factorial models and refiltering for speech separation and denoising.
Multi-Modal Spoken Language Processing
: Audiovisual speech enhancement based on the association between speech envelope and video features.
Speech Coding and Transmission
Wai C. Chu
, Toshio Miki
: Optimization of window and LSF interpolation factor for the ITU-t g.729 speech coding standard.