EUROSPEECH 1993: Berlin, Germany
Third European Conference on Speech Communication and Technology, EUROSPEECH 1993, Berlin, Germany, September 22-25, 1993. ISCA 1993
Keynotes
Janet M. Baker: Dictation, directories, and data bases; emerging PC applications forlarge vocabulary speech recognition.
William J. Barry, Paul Dalsgaard: Speech database annotation. the importance of a multi-lingual approach.
Jeremy Peckham: A new generation of spoken dialogue systems: results and lessons from the sundial project.
Roger K. Moore: Whither a theory of speech pattern processing?
Peter Noll: Speech coding for communications.
Hermann Ney: Modeling and search in continuous speech recognition.
Maxine Eskenazi: Trends in speaking styles research.
John S. Bridle: Models of speech recognition; personal perspectives on particular approaches.
Kai-Fu Lee: The conversational computer: an apple perspective.
Ute Jekosch: Speech quality assessment and evaluation.
Jan P. H. van Santen: Timing in text-to-speech systems.
Speech Coding
Kazunori Ozawa, Masahiro Serizawa, Toshiki Miyano, Toshiyuki Nomura: M-LCELP speech coding at bit-rates below 4kbps.
Eduardo López Gonzalo, Luis A. Hernández-Gomez: Fast vector quantization using neural maps for CELP at 2400bps.
Ulrich Balss, U. Kipper, Herbert Reininger, Dietrich Wolf: Improving the speech quality of CELP-coders by optimizing the long-term delay determination.
Carmen García-Mateo, José Luis Alba-Castro, Luis A. Hernández Gómez: A stochastic speech coder with multi-band long-term prediction.
B. W. M. Wery, Herman J. M. Steeneken: Intelligibility evaluation of 4-5 kbps CELP and MBE vocoders: the hermes program experiment.

Friedhelm Wuppermann, Christiane Antweiler, M. Kappelan: Objective analysis of the GSM half rate speech codec candidates.

Satoshi Miki, Kazunori Mano, Hitoshi Ohmuro, Takehiro Moriya: Pitch synchronous innovation CELP (PSI-CELP).
Nigel Sedgwick: Emulation of a formant vocoder at 600 and 800 bps.
Thierry Dutoit, Henri Leich: An analysis of the performances of the MBE model when used in the context of a text-to-speech system.
C. F. Chan: High-quality synthesis of LPC speech using multiband excitation model.
Yair Shoham: High-quality speech coding at 2.4 kbps based on time-frequency interpolation.
Naomi Asanuma, Hiromi Nagabuchi: A new reference signal for evaluating the quality of speech coded at low bit rates.
Changxue Ma, Douglas D. O'Shaughnessy: A psychophysical study of fourier phase and amplitude coding of speech.
Articulatory Modelling
Denis Beautemps, Pierre Badin, Rafael Laboissière: Recovery of vocal tract midsagittal and area functions from speech signal for vowels and fricative consonants.
Shrikanth S. Narayanan, Abeer A. Alwan: Strange attractors and chaotic dynamics in the production of voiced and voiceless fricatives.
Noël Nguyen, Philip Hoole: Frequency variations of the lowest main spectral peak in sibilant clusters.
Hélène Loevenbruck, Pascal Perrier: Vocalic reduction : prediction of acoustic and articulatory variabilities with invariant motor commands.
Christophe Savariaux, Pascal Perrier, Jean Pierre Orliaguet: Compensating for labial perturbation in a rounded vowel: an acoustic and articulatory study.
Rudolph Sock, Anders Löfqvist: Resistance of bilabials /p, b/ to anticipatory labial and mandibular coarticulation from vowel types /i, a, u/.
Morten Olesen: Derivation of the transfer function for a speech production model including the nasal cavity.
Mats Båvegård, Jesper Högberg: Using artificial neural nets to compare different vocal tract models.
Arne Kjell Foldvik, Ulf Kristiansen, Jorn Kvaerness: A time-evolving three-dimensional vocal tract model by means of magnetic resonance imaging (MRI).
Voice Source Analysis and Modelling
Juergen Schroeter, Bert Cranen: Physiologically-motivated modeling of the voice source in articulatory analysis/synthesis.
Luís C. Oliveira: Estimation of source parameters by frequency analysis.
Jean Schoentgen: Modelling the glottal pulse with a self-excited threshold auto-regressive model.
Joachim Denzler, Ralf Kompe, Andreas Kießling, Heinrich Niemann, Elmar Nöth: Going back to the source: inverse filtering of the speech signal with ANNs.
HMM-Based Recognition System
Manuel A. Leandro, José Manuel Pardo: Low cost speaker dependent isolated word speech preselection system using static phoneme pattern recognition.
Jean-Luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-Decker: Speaker-independent continuous speech dictation.
Ernst Günter Schukat-Talamazzini, Heinrich Niemann, Wieland Eckert, Thomas Kuhn, S. Rieck: Automatic speech recognition without phonemes.
Takashi Seino, Seiichi Nakagawa: Spoken language identification using ergodic HMM with emphasized state transition.
Speech Signal Processing

Philippe Le Cerf, Dirk Van Compernolle: Speaker independent small vocabulary speech recognition using MLPs for phonetic labeling.
Andrzej Drygajlo: Multiresolution time-sequency speech processing based on orthogonal wavelet packet pulse forms.
Eliathamby Ambikairajah, M. Keane, Liam Kilmartin, Graham Tattersall: The application of the wavelet transform for speech processing.

Marc Roelands, Werner Verhelst: Waveform similarity based overlap-add (WSOLA) for time-scale modification of speech: structures and evaluation.
Hsiao-Chuan Wang, Hsiao-Fen Pai: A study on the weighting factors of two-dimensional cepstral distance measure.
Yves Kamp, Changxue Ma: Connection between weighted LPC and higher-order statistics for AR model estimation.
Speaker Recognition
Claude C. Chibelushi, John S. Mason, R. Deravi: Integration of acoustic and visual speech for speaker recognition.
Claude Montacié, Jean-Luc Le Floch: Discriminant AR-vector models for free-text speaker verification.
Frédéric Bimbot, Luc Mathan: Text-free speaker recognition using an arithmetic-harmonic sphericity measure.
Data Bases, Speech Assessment, Noisy Speech
Asunción Moreno, Dolors Poch, Antonio Bonafonte, Eduardo Lleida, Joaquim Llisterri, José B. Mariño, Climent Nadeu: Albayzin speech database: design of the phonetic corpus.
Carlos M. Ribeiro, Isabel Trancoso, António Joaquim Serralheiro: A software tool for speech collection, recognition and reproduction.

Christoph Draxler, Hans G. Tillmann, Barbara Eisen: Prolog tools for accessing the phondat database of spoken German.
Ute Jekosch: Cluster-similarity: a useful database for speech processing.
Giuseppe Castagneri, Giuseppe Di Fabbrizio, A. Massone, Mario Oreglia: SIRVA - a large speech database collected on the Italian telephone network.
Herman J. M. Steeneken, Jan A. Verhave, Tammo Houtgast: Objective assessment of speech communication systems; introduction of a software based procedure.
Sven W. Danielsen: Enhanced direct assessment of speech input systems within the SAM-a esprit project.
Pascale Nicolas, Pascal Romeas: Evaluation of prosody in the French version of multilingual text-to-speech synthesis: neutralising segmental information in preliminary tests.
Alan Wrench, Mary S. Jackson, Mervyn A. Jack, David S. Soutar, A. Gerry Robertson, Janet MacKenzie, John Laver: A speech therapy workstation for the assessment of segmental quality: voiceless fricatives.
Josep M. Salavedra, Enrique Masgrau, Asunción Moreno, Xavier Jove: A speech enhancement system using higher order ar estimation in real environments.
Régine Le Bouquin, Gérard Faucon, A. Akbariazirani: Proposal of a composite measure for the evaluation of noise cancelling methods in speech processing.
P. M. Crozier, Barry M. G. Cheetham, C. Holt, E. Munday: The use of linear prediction and spectral scaling for improving speech enhancement.
Helge B. D. Sørensen, Uwe Hartmann: Robust speaker-independent speech recognition using non-linear spectral subtraction based IMELDA.
Phonetics


Céu Viana, Isabel Trancoso, Carlos M. Ribeiro, Amalia Andrade, Ernesto d'Andrade: The relationship between spelled and spoken portuguese: implications for speech synthesis and recognition.
Mark S. Schmidt, S. Fitt, C. Scott, Mervyn A. Jack: Phonetic transcription standards for european names (ONOMASTICA).
Ove Andersen, Paul Dalsgaard, William J. Barry: Data-driven identification of poly- and mono-phonemes for four european languages.
Sheri Hunnicutt, Helen M. Meng, Stephanie Seneff, Victor W. Zue: Reversible letter-to-sound sound-to-letter generation based on parsing word morphology.
Duncan Young, Gerry Altmann, Anne Cutler, Dennis Norris: Metrical structure and the perception of time-compressed speech.
Valerie Pasdeloup, José Morais, Régine Kolinsky: Are stress and phonemic string processed separately? evidence from speech illusions.
Phoneme Classification and Labelling
R. J. J. H. van Son, Louis C. W. Pols: Vowel identification as influenced by vowel duration and formant track shape.
Milan Stamenkovic, Juraj Bakran, Peter Tancig, Marijan Miletic: Perceptive and spectral volumes of synthesized and natural vowels.
Ryszard Gubrynowicz, Adam Wrzoskowicz: Labeller - a system for automatic labelling of speech continuous signal.
Duration Modelling in HMMs
Nelly Suaudeau, Régine André-Obrecht: Sound duration modelling and time-variable speaking rate in a speech recognition system.
Yifan Gong, William C. Treurniet: Duration of phones as function of utterance length and its use in automatic speech recognition.
M. E. Forsyth, Mervyn A. Jack: Duration modelling and multiple codebooks in semi-continuous HMMs for speaker verification.
Mike Hochberg, Harvey F. Silverman: Constraining model duration variance in HMM-based connected-speech recognition.
Speaker Adaptation and Normalization

Yoshio Ono, Hisashi Wakita, Yunxin Zhao: Speaker normalization using constrained spectra shifts in auditory filter domain.
Yunxin Zhao: Self-learning speaker adaptation based on spectral variation source decomposition.
Tetsuo Kosaka, Edward Willems, Jun-ichi Takami, Shigeki Sagayama: A dynamic approach to speaker adaptation of hidden Markov networks for speech recognition.
Speech Analysis, Articulatory Modelling


Arno J. Klaassen: Grouping of acoustical events using cable neurons and the theory of neuronal group selection.
I. R. Gransden, Steve W. Beet: Computationally efficient methods of calculating instantaneous frequency for auditory analysis.
Krzysztof Marasek: Speech transients analysis using AR-smoothed wigner-ville distribution.
Michel Pitermann, Jean Caelen: Comparison of the variability of formants and formant targets using dynamic modeling.
Jean Schoentgen, Zoubir Azami: Pitch-synchronous formant extraction by means of a compound auto-regressive model.
Bernard Teston: A new air flowmeter design for the investigation of speech production.
Emanuela Magno Caldognetto, Kyriaki Vagges, Giancarlo Ferrigno, Claudio Zmarich: Articulatory dynamics of lips in Italian /'vpv/ and /'vbv/ sequences.
Ahmed M. Elgendy: Restricted distribution of pharyngeal segments: acoustical or mechanical constraints?
Yohan Payan, Pascal Perrier: Vowel normalization by articulatory normalization first attemps for vowel transitions.
Nobuhiro Miki, Naohisa Kamiyama, Nobuo Nagai: Synthesis and analysis of vocal source with vibration of larynx.
Imad Znagui, Sami Boudelaa: Towards an acoustic-phonetic classification of modern standard arabic vowels.

N. R. Ganguli: Spectral characteristics of fricative sound.
Danielle Duez: Second formant locus-nucleus patterns in French and Swedish.
Christine Meunier: Temporal organisation of segments and sub-segments in consonant clusters.

Philip Christov: Normalized vowel system representation for comparative phonetic studies.
Cécile Thilly: Influence of prevocalic consonant on vowel duration in French CV[p] utterances.
Peter E. Czigler: Temporal variation in consonant clusters in Swedish.
Wiktor Jassem: Discriminant analysis of continuous consonantal spectra.
Prosody: Rhythm, Style, Emotion
Edmund Rooney, Miriam Eckert, Steven M. Hiller, Rebecca Vaughan, John Laver: Training consonants in a computer-aided system for pronunciation teaching.
Gitta P. M. Laan, Dick R. van Bergem: The contribution of pitch contour, phoneme durations and spectral features to the character of spontaneous and read aloud speech.
Juan M. Garrido, Joaquim Llisterri, Carme de la Mota, Antonio Rios: Prosodic differences in reading style: isolated vs. contextualized sentences.
Improved Algorithms for HMMs
C. M. Ayer, Melvyn J. Hunt, D. M. Brookes: A discriminatively derived linear transform for improved speech recognition.
Marco Saerens: Hidden Markov models assuming a continuous-time dynamic emission of acoustic vectors.
Yoshiharu Abe, Kunio Nakajima: A bounded transition hidden Markov model for continuous speech recognition.
Fritz Class, Alfred Kaltenmeier, Peter Regel-Brietzmann: Optimization of an HMM - based continuous speech recognizer.
Marco Saerens, Hervé Bourlard: Linear and nonlinear prediction for speech recognition with hidden Markov models.
M. N. Lokbani, Denis Jouvet, Jean Monné: Segmental post-processing of the n-best solutions in a speech recognition system.
Tatsuo Matsuoka, Chin-Hui Lee: A study of on-line Bayesian adaptation for HMM-based speech recognition.
Noisy Speech and Enhancement
Maurizio Omologo, Piergiorgio Svaizer: Talker localization and speech enhancement in a noisy environment using a microphone array based acquisition system.
Takao Kobayashi, Toshio Kanno, Satoshi Imai: Generalized cepstral modeling of speech degraded by additive noise.
Fei Xie, Dirk Van Compernolle: Speech enhancement by nonlinear spectral estimation - a unifying approach.
Speaker Variability
Vincent Pean, Sheila M. Williams, Maxine Eskenazi: The design and recording of icy, a corpus for the study of intraspeaker variability and the characterisation of speaking styles.
Andrej Ljolje: Speaker clustering for improved speech recognition.
Henk van den Heuvel, Bert Cranen, A. C. M. Rietveld: Speaker-variability in spectral bands of dutch vowel segments.
J. Antonio Hernández-Méndez, Aníbal R. Figueiras-Vidal: Measuring similarities among speakers by means of neural networks.
Segmentation and Labelling
Maria Rangoussi, Stylianos Bakamidis, George Carayannis: Robust endpoint detection of speech in the presence of noise.
Bianca Angelini, Fabio Brugnara, Daniele Falavigna, Diego Giuliani, Roberto Gretter, Maurizio Omologo: Automatic segmentation and labeling of English and Italian speech databases.
Azarshid Farhat, Guy Perennou, Régine André-Obrecht: A segmental approach versus a centisecond one for automatic phonetic time-alignment.
I. Heroaez, J. Barandiaran, Enrique Monte, Borja Etxebarria: A segmentation algorithm based on acoustical features using a self organizing neural network.
Piero Cosi: SLAM: segmentation and labelling automatic module.
Barbara Eisen: Reliability of speech segmentation and labelling at different levels of transcription.
Dick R. van Bergem: On the perception of acoustic and lexical vowel reduction.
Andrew R. Nix, M. Gareth Gaskell, William D. Marslen-Wilson: Phonological variation and mismatch in lexical access.
Anne Bonneau, Linda Djezzar, Yves Laprie: Perception of French stop bursts, implications for stop identification.
Zdravko Kacic, Bogomir Horvat: Using isofrequency neural column for harmonic sound scene decomposition.
A. K. Datta: Do ear perceive vowel through formants?
Trupti Vyas, Michael J. Pont, Seyed J. Mashari: Speech recognition using auditory models and neural networks.
Changxue Ma, Armin Kohlrausch: The influence of temporal processes on spectral masking patterns of harmonic complex tones and vowels.
Hisao Kuwabara: Temporal effect on the perception of continuous speech and a possible mechanism in the human auditory system.
Edward Jones, Eliathamby Ambikairajah: Comparison of various adaptation mechanisms in an auditory model for the purpose of speech processing.
I. A. Vartanian, Tatiana V. Chernigovskaya: Sensory-motor manifestations of speech-hearing interaction.
Tatiana V. Chernigovskaya, I. A. Vartanian, T. I. Tokareva: Syllable perception: lateralization of native and foreign languages.
Michael J. Pont: Simulation of short-latency auditory evoked potentials: a pilot study.
Regine Kolinsky, Jose Morais: Intermediate representations in spoken word recognition: a cross-linguistic study of word illusions.
Jianfen Cao: Time - varing manner on formant trajectories of Chinese diphthongs.

Hiroshi Shimodaira, Mitsuru Nakai: Accent phrase segmentation using transition probabilities between pitch pattern templates.
Wolfgang Reichl, Günther Ruske: Syllable segmentation of continuous speech with artificial neural networks.
Prosody: Analysis and Modelling of <i>F</i><sub>0</sub> Contours
Louis ten Bosch: On the automatic classification of pitch movements.
U. Jensen, Roger K. Moore, Paul Dalsgaard, Børge Lindberg: Modelling of intonation contours at the sentence level using CHMMs and the 1961 o'connor and arnold scheme.
Paul Taylor: Automatic recognition of intonation from F0 contours using the rise/fall/connection model.
Edouard Geoffrois: A pitch contour analysis guided by prosodic event detection.
Grazyna Demenko, Ignacy Nowak, Janusz Imiolczyk: Analysis and synthesis of pitch movements in a read polish text.
Speech Recognition in Noise
William A. Ainsworth, Georg F. Meyer: Noise adaptation: speech recognition by auditory models and human listeners.
J. A. Nolazco Flores, Steve J. Young: Adapting a HMM-based recogniser for noisy speech enhanced by spectral subtraction.
Tetsunori Kobayashi, Ryuji Mine, Katsuhiko Shirai: Speech recognition under the unstationary noise based on the noise Markov model and spectral-subtraction.
Laurent Buniet, Dominique Fohr, Yolande Anglade, Jean-Claude Junqua, Jean-Marie Pierrel: Selectively trained neural networks for connected word recognition in noisy environments.
Speaker Independency
Bianca Angelini, Fabio Brugnara, Daniele Falavigna, Diego Giuliani, Roberto Gretter, Maurizio Omologo: A baseline of a speaker independent continuous speech recognizer of Italian.
Lalit R. Bahl, Peter V. de Souza, P. S. Gopalakrishnan, David Nahamoo, Michael Picheny: Word lookahead scheme for cross-word right context models in a stack decoder.
David B. Grayden, Michael S. Scordilis: Recognition of obstruent phonemes in speaker-independent fluent speech using a hierarchical approach.
Bernd Plannerer, Günther Ruske: A continuous speech recognition system using phonotactic constraints.
Speech Synthesis
M. Ouadou, A. Rajouani, M. Zyoute, J. Rosenfeld, M. Najim: Joint arabic-hebrew speech synthesis system.
Eduardo López Gonzalo, Gábor Olaszy, Géza Németh: Improvements of the Spanish version of the multivox text-to-speech system.
Mats Ljungqvist, Hiroya Fujisaki: Generating intonation for Swedish text-to-speech conversion using a quantitative model for the F0 contour.
Peter Meyer, Hans-Wilhelm Rühl, R. Krüger, M. Kugler, L. Vogten, A. Dirksen, Karim Belhoula: PHRITTS - a text-to-speech synthesizer for the German language.
Karim Belhoula: Rule-based grapheme-to-phoneme conversion of names.

Marian J. Macchi, Mary Jo Altom, Dan Kahn, Sharad Singhal, Murray F. Spiegel: Intelligibility as a function of speech coding method for template-based speech synthesis.
Maggie Gaved: Pronunciation and text normalisation in applied text-to-speech systems.
Jill House, Catriona MacDermid, Scott McGlashan, Andrew Simpson, Nick J. Youd: Evaluating synthesised prosody in simulations of an automated telephone enquiry service.
Elissaveta Abadjieva, Iain R. Murray, John L. Arnott: Applying analysis of human emotional speech to enhance synthetic speech.
Robert W. P. Luk, Robert I. Damper: Experiments with silent-e and affix correspondences in stochastic phonographic transduction.
Georg Fries: Phoneme-dependent speech synthesis in the time and frequency domains.
Volker Kraft: Auditory detection of discontinuities in synthesis-by-concatenation.
Briony Williams: Letter-to-sound rules for the welsh language.
Dialogue Structure

Hans Dybkjær, Niels Ole Bernsen, Laila Dybkjær: Wizard-of-oz and the trade-off between naturalness and recogniser constraints.
Cerian Jones, Roberto Garigliano: Dialogue analysis and generation: a theory for modelling natural English dialogue.
Catriona MacDermid: Features of naive callers' dialogues with a simulated speech understanding and dialogue system.
Yoichi Yamashita, Riichiro Mizoguchi: Next utterance prediction based on two kinds of dialog models.
T. Andemach, G. Deville, Luc Mortier: The design of a real world wizard of oz experiment for a speech driven telephone directory information system.
Sheryl R. Young: Dialog structure and plan recognition in spontaneous spoken dialog.
Julia Hirschberg, Christine H. Nakatani: A speech-first model for repair identification in spoken language systems.
Language Modelling
R. Zhao, Patrick Kenny, P. Labute, Douglas D. O'Shaughnessy: Issues in large scale statistical language modeling.
Roberto Garigliano, Kevin Johnson, Russell J. Collingham: A data-driven case for a spontaneous speech grammar.
Reinhard Kneser, Hermann Ney: Improved clustering techniques for class-based statistical language modelling.
Jeremy H. Wright, Gareth J. F. Jones, Harvey Lloyd-Thomas: A consolidated language model for speech recognition.
Michael K. McCandless, James R. Glass: Empirical acquisition of word and phrase classes in the atis domain.
Tung-Hui Chiang, Keh-Yih Su: The effects of parameter smoothing on robust learning in syntactic ambiguity resolution.
Enrique Vidal, Roberto Pieraccini, Esther Levin: Learning associations between grammars: a new approach to natural language understanding.
Michèle Jardino, Gilles Adda: Language modelling for CSR of large corpus using automatic classification of words.
Helmut Lucke: Inference of stochastic context-free grammar rules from example data using the theory of Bayesian belief propagation.
Petra Witschel: Constructing linguistic oriented language models for large vocabulary speech recognition.
Prosody: Prosodic Parameter Manipulation
Eduardo Rodríguez Banga, Carmen García-Mateo: New frequency domain prosodic modification techniques.
H. D. Wang, D. Degryse, Fabrizio Carrara: A prosody modification approach for auditory user feedback in the spell pronunciation teaching system.
Tohru Takagi, Eiichi Miyasaka: A speech prosody conversion system with a high quality speech analysis-synthesis method.
Paul C. Bagshaw, Steven M. Hiller, Mervyn A. Jack: Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching.
New Architectures for Neural Networks
Chakib Tadj, Franck Poirier: Improved DVQ algorithm for speech recognition: a new adaptive learning rule with neurons annihilation.
Taro Sasaki, Tadashi Kitamura, Akira Iwata: Speaker-independent 212 word recognition using combNET-II.
M. Asunción Castaño, Enrique Vidal, Francisco Casacuberta: Learning direct acoustic-to-semantic mappings through simple recurrent networks.
Noise Reduction and Channel Adaption

K. F. Wong, S. H. Leung, H. C. Ng: Noisy speech recognition using singular value decomposition and two-sided linear prediction.
Franck Martin, Kiyohiro Shikano, Yasuhiro Minami: Recognition of noisy speech by composition of hidden Markov models.
Yuqing Gao, Jean Paul Haton: Noise reduction and speech recognition in noise conditions tested on LPNN-based continuous speech recognition system.
Michael Trompf, Ralf Richter, Harald Eckhardt, Heidi Hackbarth: Combination of distortion-robust feature extraction and neural noise reduction for ASR.
Chafic Mokbel, Jean Monné, Denis Jouvet: On-line adaptation of a speech recognizer to variations in telephone line conditions.
Matthias Wittmann, Otto Schmidbauer, Abdulmesih Aktas: Online channel compensation for robust speech recognition.
Patrice Alexandre, Jérôme Boudy, Philip Lockwood: Evaluation of car noise reduction/compensation techniques for digit recognition in a speaker-independent context.
A. Brancaccio, C. Pelaez: Experiments on noise reduction techniques with robust voice detector in car environment.
Word Spotting
Satoshi Nakamura, Toshio Akabane, Seiji Hamaguchi: Robust word spotting in adverse car environments.
Richard C. Rose: Definition of subword acoustic units for wordspotting.
Philippe Jeanrenaud, Kenney Ng, Man-Hung Siu, Jan Robin Rohlicek, Herbert Gish: Phonetic-based word spotter: various configurations and application to event spotting.
Akihiro Imamura, Mikio Kitai: An application of word-spotting in a voice activated service entry system.
Eduardo Lleida, José B. Mariño, Josep M. Salavedra, Antonio Bonafonte, Enrique Monte, A. Martinez: Out-of-vocabulary word modelling and rejection for keyword spotting.
Jean-Marc Boite, Hervé Bourlard, Bart D'hoore, Marc Haesen: A new approach towards keyword spotting.
J. Alvarez-Cercadillo, Luis A. Hernandez-Gomez: Grammar learning and word spotting using recurrent neural networks.
Shigeki Okawa, Tetsunori Kobayashi, Katsuhiko Shirai: Word spotting in conversational speech based on phonemic unit likelihood by mutual information criterion.
Speech Processing and Coding
F. Dohnal: Generalized frequency domain adaptive filter for acoustic echo canceller.
Joel Crestel, Michel Guitton: Estimation of speech signal classification features in a simulated hyperbaric environment.
Peter Heitkämper, Michael Walker II: Adaptive gain control and echo cancellation for hands-free telephone systems.
W. Nick Campbell: Predicting segmental durations for accommodation within a syllable-level timing framework.
Tore Fjällbrant, Fisseha Mekuria, Shahrokh Amirijoo: A filtersank based on physiologically measured characteristics in an auditory model for speech signal processing.
Fu-Rong Jean, Chih-Chung Kuo, Hsiao-Chuan Wang: Spectral sensitivity weighted transform coding for LSP parameters.
Rainer Martin: An efficient algorithm to estimate the instantaneous SNR of speech signals.


Jrgen Paulus, Christiane Antweiler, Christian G. Gerlach: High quality coding of wideband speech at 24 kbit/s.
Oded Gottesman, Yair Shoham: Realtime implementation of high-quality 32 kbps wideband LD-CELP coder.
A. Popescu, D. Vicard, F. Druilhe: A fixed-point implementation of the 16 kb/s LD-CELP speech coding algorithm.
Christian G. Gerlach: Optimality of sequential quantization in analysis-by-synthesis speech codecs.

K. W. Law, C. F. Chan: Split vector quantization of the LPC parameters using weighted lattice structure.
Stefan Bruhn: A new approach to noiseless interframe coding of LPC parameters in vector quantizer applications.
Torbjørn Svendsen: Efficient quantization of speech spectral information.
Stefan Feldes: Enhancing robustness of coded LPC-spectra to channel errors by use of residual redundancy.
S. A. Atungsiri, Ahmet M. Kondoz, Barry G. Evans: Multi-rate source and channel coding for mobile communication systems.
Takehiro Moriya, Satoshi Miki, Kazunori Mano, Hitoshi Ohmuro: Training method of the excitation codebook for CELP.
Prosody: Phrasing
Gösta Bruce, Björn Granström, Kjell Gustafson, David House: Phrasing strategies in prosodic parsing and speech synthesis.
Jan-Roelof de Pijper, Angelien Sanderman: Prosodic cues to the perception of constituent boundaries.
Esther Grabe, Tara Hoist, Francis Nolan, Paul Warren: Acoustic cues to syntactic structure - evidence from prosodic and segmental effects.
Frédéric Beaugendre, Anne Lacheret-Dujour: Automatic generation of French intonation based on a perceptual study and morpho-syntactic information.
MLPs and TDNNs for Speech Recognition
Stephen A. Zahorian, Zaki B. Nossair, Claude A. Norton III: A partitioned neural network approach for vowel classification using smoothed time/frequency features.
Tadashi Kitamura: Speaker-independent 100 word recognition using dynamic spectral features of speech and a neural network.
Ming Zhu, Klaus Fellbaum: Speaker independent isolated word recognition using vector quantization and neural networks.
Kjell Elenius, Hans G. C. Traven: Multi-layer perceptrons and probabilistic neural networks for phoneme recognition.
C. S. Blackburn, Julie Vonwiller, R. W. King: Automatic accent classification using artificial neural networks.
Mark Huckvale: The benefits of tiered segmentation for the recognition of phonetic properties.
David Lubensky: Generalized context-dependent phone modeling using artificial neural networks.
Hermann Hild, Alex Waibel: Speaker-independent connected letter recognition with a multi-state time delay neural network.
Ulrich Bodenhausen, Alex Waibel: Tuning by doing: flexibility through automatic structure optimization.
Christoph Windheuser, Frédéric Bimbot: Phonetic features for spelled letter recognition with a time delay neural network.
Veronika Bappert, Matthias Jobst: Training of a time-delay neural network for speech recognition by solving stiff differential equations.
Speech Translation, Language Identification, Parsers
Shigeki Sagayama, Jun-ichi Takami, Akito Nagai, Harald Singer, Kouichi Yamaguchi, Kazumi Ohkura, Kenji Kita, Akira Kurematsu: ATREUS: a speech recognition front-end for a speech translation system.
Tsuyoshi Morimoto, Toshiyuki Takezawa, Fumihiro Yato, Shigeki Sagayama, Toshihisa Tashiro, Masaaki Nagata, Akira Kurematsu: ATR's speech translation system: ASURA.
Monika Woszczyna, Noah Coccaro, A. Eisele, Alon Lavie, Arthur E. McNair, Thomas Polzin, Ivica Rogina, Carolyn Penstein Rosé, Tilo Sloboda, M. Tomita, J. Tsutsumi, N. Aoki-Waibel, Alex Waibel, Wayne H. Ward: Recent advances in JANUS: a speech translation system.
Manny Rayner, Ivan Bretan, David M. Carter, Michael Collins, Vassilios Digalakis, Björn Gambäck, Jaan Kaja, Jussi Karlgren, Bertil Lyberg, Stephen G. Pulman, Patti Price, Christer Samuelsson: Spoken language translation with MID-90's technology: a case study.
Yeshwant K. Muthusamy, Kay M. Berkling, Takayuki Arai, Ronald A. Cole, Etienne Barnard: A comparison of approaches to automatic language identification using telephone speech.
Ying Cheng, Yves Normandin, Paul Fortier: Integration of neural networks and robust parsers in natural language understanding.
Pierre Dauchy, Christophe Mignot, Claude Valot: Joint speech and gesture analysis some experimental results on multimodal interface.

Jin'ichi Murakami, Hiroki Yamatomo, Shigeki Sagayama: The possibility for acquisition of statistical network grammar using ergodic HMM.
R. T. Dutton, John C. Foster, Mervyn A. Jack, F. W. M. Stentiford: Identifying usability attributes of automated telephone services.
Andrew Hunt: Utilising prosody to perform syntactic disambiguation.
Steven M. Hiller, Edmund Rooney, Jean-Paul Lefèvre, Mervyn A. Jack: Spell: an automated system for computer-aided pronunciation teaching.
Edmund Rooney, Rebecca Vaughan, Steven M. Hiller, Fabrizio Carraro, John Laver: Training vowel pronunciation using a computer-aided teaching system.
Mary Zajicek, Ken Brownsey: Methods for traversing a pre-recorded speech message network to optimise dialogue in telephone answering systems.
Roger Hanes, Jo Salter, Paul Popay, Frances Hedley: Service creation tools for creating speech interactive services.
Julia Hirschberg, Jacques M. B. Terken: Deaccentuation and persistence of grammatical function and surface position.
Stefan Euler, K. Riedel: Design and implementation of a speech server for unix based multimedia applications.
Toffee A. Albina, Erica G. Bernstein, David M. Goblirsch, Douglas E. Lake: A system for clustering spoken documents.
Dialogue Evalution
Nathalie A. Vergeynst, Keith Edwards, John C. Foster, Mervyn A. Jack: Spoken dialogues for human-computer interaction over the telephone: complexity measures.

Cristina Delogu, Andrea Di Carlo, Ciro Sementina, Silvia Stecconi: A methodology for evaluating human-machine spoken language interaction.
Philippe Morin, Jean-Claude Junqua: Error correction and ambiguity resolution in multimodal man-machine dialogue.
Data Bases

Alix de Ginestel-Mailland, Martine de Calmès, Guy Perennou: Multi-level transcription of speech corpora from orthographic forms.
Olivier Boëffard, B. Cherbonnel, F. Emerard, S. White: Automatic segmentation and quality evaluation of speech unit inventories for concatenation-based, multilingual PSOLA text-to-speech systems.
Letter to Sound and Architecture for TTS
Bert Van Coile: On the development of pronunciation rules for text-to-speech synthesis.
Walter Daelemans, Antal van den Bosch: Tabtalk: reusability in data-oriented grapheme-to-phoneme conversion.
Anders Lindström, Mats Ljungqvist, Kjell Gustafson: A modular architecture supporting multiple hypotheses for conversion of text to phonetic and linguistic entities.
Perception
Astrid van Wieringen, John K. Cullen, Louis C. W. Pols: The perceptual relevance of CV- and VC- transitions in identifying stop consonants: cross-language results.
Vincent J. van Heuven, Willy Jongenburger: Perceptual effects of place and voicing assimilation in dutch consonants.
Brit van Ooyen: Detection of vowels and consonants by human listeners: effects of minimising auditory memory load.
Gérard Bailly: Resonances as possible representation of speech in the auditory-to-articulatory transform.
Rob Goedemans, Vincent J. van Heuven: A perceptual explanation of the weightlessness of the syllable onset.
Search Algorithms
Enrico Bocchieri: A study of the beam-search algorithm for large vocabulary continuous speech recognition and methods for improved efficiency.
Lorenzo Fissore, Egidio P. Giachin, Pietro Laface, P. Massafra: Using grammars in forward and backward search.
I. Lee Hetherington, Michael S. Phillips, James R. Glass, Victor W. Zue: A* word network search for continuous speech recognition.
Speech Recognition, HMMs, NNs
M. Inés Torres, Francisco Casacuberta: Multiple codebook Spanish phone recognition using semicontinuous hidden Markov models.
Antonio Bonafonte, Xavier Ros, Jose B. Marifio: An efficient algorithm to find the best state sequence in HSMM.
Isabel Galiano, Francisco Casacuberta: Experiments on Spanish phone recognition using automatically derived phonemic baseforms.
