ICSLP 1992:
Banff,
Alberta,
Canada
The Second International Conference on Spoken Language Processing, ICSLP 1992, Banff, Alberta, Canada, October 13-16, 1992.
ISCA 1992
Word Spotting in ASR
Speech Coding
Speech Production:
Coarticulation
- Antonio Bonafonte, José B. Mariño, Montse Pardàs:
Efficient integration of coarticulation and lexical information in a finite state grammar.
- Herbert A. Leeper, A. P. Rochet, Ian R. A. MacKay:
Characteristics of nasalance in canadian speakers of English and French.
- Christine H. Shadle, Andre Moulinier, Christian U. Dobelke, Celia Scully:
Ensemble averaging applied to the analysis of fricative consonants.
- Andrew Slater, Sarah Hawkins:
Effects of stress and vowel context on velar stops in british English.
- Emanuela Magno Caldognetto, Kyriaki Vagges, Giancarlo Ferrigno, Maria Grazia Busa:
Lip rounding coarticulation in Italian.
Auditory Models
Recognition of Telephone Speech
- Hynek Hermansky, Nelson Morgan:
Towards handling the acoustic environment in spoken language processing.
- Alberto Ciaramella, Davide Clementino, Roberto Pacifici:
Real-time speaker-independent large-vocabulary CDHMM-based continuous telephonic speech recognizer.
- Matthew Lennig, Douglas Sharp, Patrick Kenny, Vishwa Gupta, Kristin Precoda:
Flexible vocabulary recognition of speech.
- Benjamin Chigier, Hong C. Leung:
The effects of signal representations, phonetic classification techniques, and the telephone network.
Text-to-Speech Synthesis 1,
1
- Leon Gulikers, Rijk Willemse:
A lexicon for a text-to-speech system.
- Rijk Willemse, Leon Gulikers:
Word class assignment in a text-to-speech system.
- Gösta Bruce, Björn Granström, Kjell Gustafson, David House:
Aspects of prosodic phrasing in Swedish.
- Kirk P. H. Sullivan, Robert I. Damper:
Synthesis-by-analogy: a bilingual investigation using German and English.
- Leonard C. Manzara, David R. Hill:
Degas: a system for rule-based diphone speech synthesis.
- Shyam Sunder Agrawal, Kenneth N. Stevens:
Towards synthesis of Hindi consonants using KLSYN88.
- Louis C. W. Pols:
Multi-lingual synthesis evaluation methods.
- Björn Granström, Petur Helgason, Hoskuldur Thrainsson:
The interaction of phonetics, phonology and morphology in an icelandic text-to-speech system.
Voice Source Characteristics
Speech Perception:
Higher-Order Processes 1,
1
- David R. Traum, James F. Allen:
A "speech acts" approach to grounding in conversation.
- Sheila Meltzer:
Antecedent activation by empty pronominals in Spanish.
- Ron Smyth:
Multiple feature matching in pronoun resolution: a new look at parallel function.
- Keh-Yih Su, Jing-Shin Chang, Yi-Chung Lin:
A discriminative approach for ambiguity resolution based on a semantic score function.
- Nobuaki Minematsu, Sumio Ohno, Keikichi Hirose, Hiroya Fujisaki:
The influence of semantic and syntactic information on spoken sentence recognition.
- Lynne C. Nygaard, Mitchell Sommers, David B. Pisoni:
Effects of speaking rate and talker variability on the representation of spoken words in memory.
- Hugo Quené, Yvette Smits:
On the absence of word segmentation at "weak" syllables.
- Mitchell Sommers, Lynne C. Nygaard, David B. Pisoni:
Stimulus variability and the perception of spoken words: effects of variations in speaking rate and overall amplitude.
- James M. McQueen, Anne Cutler:
Words within words: lexical statistics and lexical access.
Speaker-Independent Word Recognition
- Stephan Euler, Joachim Zinke:
Experiments on the use of the generalized probabilistic descent method in speech recognition.
- Ricardo de Córdoba, José Manuel Pardo, José Colás:
Improving and optimizing speaker independent, 1000 words speech recognition in Spanish.
- John F. Pitrelli, David Lubensky, Benjamin Chigier, Hong C. Leung:
Multiple-level evaluation of speech recognition systems.
- Tatsuya Kimura, Mitsuru Endo, Shoji Hiraoka, Katsuyuki Niyada:
Speaker independent word recognition using continuous matching of parameters in time-spectral form based on statistical measure.
- R. Roddeman, H. Drexler, Louis Boves:
Automatic derivation of lexical models for a very large vocabulary speech recognition system.
Human Factors
Continuous Speech Recognition 1,
2
- Patrick Kenny, Rene Hollan, Gilles Boulianne, Harinath Garudadri, Yan Ming Cheng, Matthew Lennig, Douglas D. O'Shaughnessy:
Experiments in continuous speech recognition with a 60, 000 word vocabulary.
- Gilles Boulianne, Patrick Kenny, Matthew Lennig, Douglas D. O'Shaughnessy, Paul Mermelstein:
HMM training on unconstrained speech for large vocabulary, continuous speech recognition.
- David Rainion, Shigeki Sagayama:
Appropriate error criterion selection for continuous speech HMM minimum error training.
- Akito Nagai, Kenji Kita, Toshiyuki Hanazawa, Tadashi Suzuki, Tomohiro Iwasaki, Tsuyoshi Kawabata, Kunio Nakajima, Kiyohiro Shikano, Tsuyoshi Morimoto, Shigeki Sagayama, Akira Kurematsu:
Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition.
- Madeleine Bates, Robert J. Bobrow, Pascale Fung, Robert Ingria, Francis Kubala, John Makhoul, Long Nguyen, Richard M. Schwartz, David Stallard:
Design and performance of HARC, the BBN spoken language understanding system.
- Otoya Shirotsuka, G. Kawai, Michael Cohen, Jared Bernstein:
Performance of speaker-independent Japanese recognizer as a function of training set size and diversity.
- Kouichi Yamaguchi, Shigeki Sagayama, Kenji Kita, Frank K. Soong:
Continuous mixture HMM-LR using the a* algorithm for continuous speech recognition.
- Kenji Kita, Tsuyoshi Morimoto, Kazumi Ohkura, Shigeki Sagayama:
Continuously spoken sentence recognition by HMM-LR.
- Akinori Ito, Shozo Makino:
Word pre-selection using a redundant hash addressing method for continuous speech recognition.
- Andrej Ljolje, Michael D. Riley:
Optimal speech recognition using phone recognition and lexical access.
Natural Language Processing and Speech Understanding 1-3
- Nick Waegner, Steve J. Young:
A trellis-based language model for speech recognition.
- Carla B. Zoltowski, Mary P. Harper, Leah H. Jamieson, Randall A. Helzerman:
PARSEC: a constraint-based framework for spoken language understanding.
- Gareth J. F. Jones, Jeremy H. Wright, E. N. Wrigley:
The HMM interface with hybrid grammar-bigram language models for speech recognition.
- Atsuhiko Kai, Seiichi Nakagawa:
A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar.
- Fernando Pereira, David B. Roe:
Empirical properties of finite state approximations for phrase structure grammars.
- Stephanie Seneff, Helen M. Meng, Victor Zue:
Language modelling for recognition and understanding using layered bigrams.
- David Goddeau:
Using probabilistic shift-reduce parsing in speech recognition systems.
- Tim Howells, David Friedman, Mark A. Fanty:
Broca, an integrated parser for spoken language.
- P. V. S. Rao, Nandini Bondale:
Blank slate language processor for speech recognition.
- Eric Jackson:
Integrating two complementary approaches to spoken language understanding.
- Marcello Pelillo, Mario Refice:
Learning compatibility coefficients for word-class disambiguation relaxation processes.
- Kaichiro Hatazaki, Jun Noguchi, Akitoshi Okumura, Kazunaga Yoshida, Takao Watanabe:
INTERTALKER: an experimental automatic interpretation system using conceptual representation.
- Tsuyoshi Morimoto, Toshiyuki Takezawa, Kazumi Ohkura, Masaaki Nagata, Fumihiro Yato, Shigeki Sagayama, Akira Kurematsu:
Enhancement of ATR's spoken language translation system: SL-TRANS2.
- Tsuyoshi Morimoto:
Continuous speech recognition using a combination of syntactic constraints and dependency relationship.
- Roberto Pieraccini, Zakhar Gorelov, Esther Levin, Evelyne Tzoukermann:
Automatic learning in spoken language understanding.
Language Learning and Acquisition 1,
2
Speech Perception:
Units of Processing
Prosody:
The Phrase and Beyond 1,
2
- Agaath M. C. Sluijter, Vincent J. van Heuven, Anneke Neijt:
The influence of focus distribution and lexical stress on the temporal organisation of the syllable.
- Agaath M. C. Sluijter, Jacques M. B. Terken:
The development and perceptive evaluation of a model for paragraph intonation in dutch.
- Nobuyoshi Kaiki, Yoshinori Sagisaka:
Pause characteristics and local phrase-dependency structure in Japanese.
- Bernd Möbius, Matthias Pätzold:
F0 synthesis based on a quantitative model of German intonation.
- Kenneth N. Ross, Mari Ostendorf, Stefanie Shattuck-Hufnagel:
Factors affecting pitch accent placement.
- Marc Swerts, Ronald Geluykens, Jacques M. B. Terken:
Prosodic correlates of discourse units in spontaneous speech.
- Shin'ya Nakajima, James F. Allen:
Prosody as a cue for discourse structure.
- Barbara J. Grosz, Julia Hirschberg:
Some intonational characteristics of discourse structure.
- Hiroya Fujisaki, Keikichi Hirose, Haitao Lei:
Prosody and syntax in spoken sentences of standard Chinese.
- Kathleen Bishop:
Modeling sentential stress in the context of a large vocabulary continuous speech recognizer.
Speaker Adaptation
Pronunciation Training
- Jean-Paul Lefèvre, Mervyn A. Jack, Claudio Maggio, Mario Refice, Fabio Gabrieli, Michelina Savino, Luigi Santangelo:
An interactive system for automated pronunciation improvement.
- Edmund Rooney, Steven M. Hiller, John Laver, Mervyn A. Jack:
Prosodic features for automated pronunciation improvement in the spell system.
- Maria-Gabriella Di Benedetto, Fabrizio Carraro, Steven M. Hiller, Edmund Rooney:
Vowels pronunciation assessment in the spell system.
Self-Organizing Systems in ASR 1,
2
- Franck Poirier:
Self-organizing map with supervision for speech recognition.
- Gregory R. De Haan, Ömer Egecioglu:
Topology preservation for speech recognition.
- Gary Bradshaw, Alan Bell:
Towards the performance limits of connectionist feature detectors.
- Helge B. D. Sørensen:
Context-dependent and -independent self-structuring hidden control models for speech recognition.
- Marie-José Caraty, Claude Montacié, Claude Barras:
Integration of frequential and temporal structurations in a symbolic learning system.
- Enric Monte, José B. Mariño, Eduardo Lleida:
Smoothing hidden Markov models ay means of a self organizing feature map.
- Jyri Mäntysalo, Kari Torkkola, Teuvo Kohonen:
LVQ-based speech recognition with high-dimensional context vectors.
- Mikko Kurimo, Kari Torkkola:
Application of self-organizing maps and LVQ in training continuous density hidden Markov models for phonemes.
- Paul Dalsgaard, Ove Andersen:
Identification of mono- and poly-phonemes using acoustic-phonetic features derived by a self-organising neural network.
- Pekka Utela, Samuel Kaski, Kari Torkkola:
Using phoneme group specific LVQ-codebooks with HMMs.
Speech Synthesis 1-3
- Naoto Iwahashi, Yoshinori Sagisaka:
Speech segment network approach for an optimal synthesis unit set.
- Yoshinori Sagisaka, Nobuyoshi Kaiki, Naoto Iwahashi, Katsuhiko Mimura:
ATR μ-talk speech synthesis system.
- Bert Van Coile, Steven Leys, Luc Mortier:
On the development of a name pronunciation system.
- Inger Karlsson:
Consonants for female speech synthesis.
- Jan P. H. van Santen:
Diagnostic perceptual experiments for text-to-speech system evaluation.
- Marcello Balestri, Enzo Foti, Luciano Nebbia, Mario Oreglia, Pier Luigi Salza, Stefano Sandri:
Comparison of natural and synthetic speech intelligibility for a reverse telephone directory service.
- Richard Sproat, Julia Hirschberg, David Yarowsky:
A corpus-based synthesizer.
- Tomohisa Hirokawa, Kenzo Itoh, Hirokazu Sato:
High quality speech synthesis based on wavelet compilation of phoneme segments.
- David R. Williams, Corine A. Bickley, Kenneth N. Stevens:
Inventory of phonetic contrasts generated by high-level control of a formant synthesizer.
- Mikael Goldstein, Ove Till:
Is % overall error rate a valid measure of speech synthesiser and natural speech performance at the segmental level?
- Willy Jongenburger, Renée van Bezooijen:
Text-to-speech conversion for dutch: comprehensibility and acceptability.
- Masayo Katoh, Shin'ichiro Hashimoto:
The rhythm rules in Japanese based on the centers of energy gravity of vowels.
- Kenzo Itoh, Tomohisa Hirokawa, Hirokazu Sato:
Segmental power control for Japanese speech synthesis.
- Jean Schoentgen:
Glottal waveform synthesis with volterra shapers.
- Ken Ceder, Bertil Lyberg:
Yet another rule compiler for text-to-speech conversion?
- Kazuhiko Iwata, Yukio Mitome:
Prosody generation models constructed by considering speech tempo influence on prosody.
- Alex I. C. Monaghan:
Extracting microprosodic information from diphones - a simple way to model segmental effects on prosody for synthetic speech.
- Arjan van Hessen:
Generation of natural sounding speech stimuli by means of linear cepstral interpolation.
- W. Nick Campbell, Colin W. Wightman:
Prosodic encoding of syntactic structure for speech synthesis.
- Susan R. Hertz, Marie K. Huffman:
A nucleus-based timing model applied to multi-dialect speech synthesis by rule.
- Jill House, Nick J. Youd:
Evaluating the prosody of synthesized utterances within a dialogue system.
- Marcel Tatham, Eric Lewis:
Prosodics in a syllable-based text-to-speech synthesis system.
- Rabia Belrhali, Véronique Aubergé, Louis-Jean Boë:
From lexicon to rules: toward a descriptive method of French text-to-phonetics transcription.
- Marianne Elmlund, Ida Frehr, Niels Reinholt Petersen:
Formant transformation from male to female synthetic voices.
- Panagiotis A. Rentzepopoulos, George K. Kokkinakis:
Multilingual phoneme to grapheme conversion system based on HMM.
- Noriyo Hara, Hisayoshi Tsubaki, Hisashi Wakita:
Fundamental frequency control using linguistic information.
- Andrew P. Breen:
A comparison of statistical and rule based methods of determining segmental durations.
- J. R. Andrews, K. M. Curtis, Volker Kraft:
Generation and extraction of high quality synthesis units.
- T. I. Boogaart, Kim E. A. Silverman:
Evaluating the overall comprehensibility of speech synthesizers.
- Olivier Boëffard, Laurent Miclet, S. White:
Automatic generation of optimized unit dictionaries for text to speech synthesis.
- Hideki Kasuya, Seiki Kasuya:
Relationships between syllable, word and sentence intelligibilities of synthetic speech.
- David R. Hill, Craig-Richard Schock, Leonard C. Manzara:
Unrestricted text-to-speech revisited: rhythm and intonation.
- Anton J. Rozsypal:
Wavelet speech synthesizer in the classroom and speech laboratory.
- Thomas Portele, Birgit Steffan, Rainer Preuß, Walter F. Sendlmeier, Wolfgang Hess:
HADIFIX - a speech synthesis system for German.
- Cristina Delogu, Stella Conte, Andrea Paoloni, Ciro Sementina:
Two different methodologies for evaluating the comprehension of synthetic passages.
- Carlos Gussenhoven, Toni C. M. Rietveld:
A target-interpolation model for the intonation of dutch.
Speech Perception:
Phonetic Processes 1,
2
- Katharine Davis, Patricia K. Kuhl:
Best exemplars of English velar stops: a first report.
- Kenneth N. Stevens, Sharon Y. Manuel, Stefanie Shattuck-Hufnagel, Sharlene Liu:
Implementation of a model for lexical access based on features.
- Dieter Huber:
Perception of aperiodic speech signals.
- Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka:
Acceptability and discrimination threshold for distortion of segmental duration in Japanese words.
- Anne Bonneau, Sylvie Coste, Linda Djezzar, Yves Laprie:
Two level acoustic cues for consistent stop identification.
- Rolf Carlson, James R. Glass:
Vowel classification based on analysis-by-synthesis.
- Maria-Gabriella Di Benedetto, Jean-Sylvain Liénard:
Extrinsic normalization of vowel formant values based on cardinal vowels mapping.
- Terrance M. Nearey:
Applications of generalized linear modeling to vowel data.
- David B. Pisoni:
Some comments on invariance, variability and perceptual normalization in speech perception.
- Stephen D. Goldinger, Thomas J. Palmeri, David B. Pisoni:
Words and voices: perceptual details are preserved in lexical representations.
Speech Enhancement
Speech Recognition 1,
2
- Yolande Anglade, Dominique Fohr, Jean-Claude Junqua:
Selectively trained neural networks for the discrimination of normal and lombard speech.
- Aaron E. Rosenberg, Joel DeLong, Chin-Hui Lee, Biing-Hwang Juang, Frank K. Soong:
The use of cohort normalized scores for speaker verification.
- Tomoko Matsui, Sadaoki Furui:
Speaker recognition using concatenated phoneme models.
- Younès Bennani:
Speaker identification through a modular connectionist architecture: evaluation on the timit database.
- Claude Montacié, Jean-Luc Le Floch:
AR-vector models for free-text speaker recognition.
- Florian Schiel:
Rapid non-supervised speaker adaptation of semicontinuous hidden Markov models.
- D. Ederveen, Louis Boves:
Rule-based recognition of phoneme classes.
- Jie Yi, Kei Miki:
A new method of speaker-independent speech recognition using multiphone HMM.
- Myoung-Wan Koo, Chong Kwan Un:
A speaker adaptation based on corrective training and learning vector quantization.
- Katsuhiko Shirai, Shigeki Okawa, Tetsunori Kobayashi:
Phoneme recognition in continuous speech based on mutual information considering phonemic duration and connectivity.
- Shinji Koga, Ryosuke Isotani, Satoshi Tsukada, Kazunaga Yoshida, Kaichiro Hatazaki, Takao Watanabe:
A real-time speaker-independent continuous speech recognition system based on demi-syllable units.
- Saeed Vaseghi, Ben P. Milner:
Speech recognition in noisy environments.
- Fergus R. McInnes:
An enhanced interpolation technique for context-specific probability estimation in speech and language modelling.
- Lorenzo Fissore, Pietro Laface, Giorgio Micca, G. Sperto:
Channel adaptation for a continuous speech recognizer.
- S. Cifuentes, José Colás, Mohammed H. Savoji, José Manuel Pardo:
A new algorithm for connected digit recognition.
- Günther Ruske, Bernd Plannerer, Tanja Schultz:
Stochastic modeling of syllable-based units for continuous speech recognition.
- David M. Goblirsch, Toffee A. Albina:
HARK: an experimental speech recognition system.
- Akito Nagai, Jun-ichi Takami, Shigeki Sagayama:
The SSS-LR continuous speech recognition system: integrating SSS-derived allophone models and a phoneme-context-dependent LR parser.
- Shinsuke Sakai, Michael S. Phillips:
J-SUMMIT: a Japanese segment-based speech recognition system.
- Shinobu Mizuta, Kunio Nakajima:
Optimal discriminative training for HMMs to recognize noisy speech.
- Shingo Kuroiwa, Kazuya Takeda, Fumihiro Yato, Seiichi Yamamoto, Kunihiko Owa, Makoto Shozakai, Ryuuji Matsumoto:
Architecture and algorithms of a real-time word recognizer for telephone input.
- Hiroyasu Kuwano, Kazuya Nomura, Atsushi Ookumo, Shoji Hiraoka, Taisuke Watanabe, Katsuyuki Niyada:
Speaker independent speech recognition method using word spotting technique and its application to VCR programming.
- S. Lennon, Eliathamby Ambikairajah:
Transputer implementation of front-end processors for speech recognition systems.
- Yasuhiro Minami, Tatsuo Matsuoka, Kiyohiro Shikano:
Phoneme HMM evaluation algorithm without phoneme labeling.
- Andreas Noll, Henning Bergmann, Hans-Hermann Hamer, Annedore Paeseler, Horst Tomaschewski:
Architecture of a configurable application interface for speech recognition systems.
- Mark A. Fanty, John Pochmara, Ronald A. Cole:
An interactive environment for speech recognition research.
- Yoshiharu Abe, Kunio Nakajima:
An approach to unlimited vocabulary continuous speech recognition based on context-dependent phoneme modeling.
- Chuck Wooters, Nelson Morgan:
Acoustic subword models in the berkeley restaurant project.
- Claus Nedergaard Jacobsen:
SIRtrain, an open standard environment for CHMM recognizer development.
- Yutaka Kobayashi, Yasuhisa Niimi:
Segmented trellis algorithms for the continuous speech recognition.
- Bo Xu, Z. W. Lin, Taiyi Huang, D. X. Xu, Y. Q. Gao:
A.
46
,
500 word Chinese speech recognition system. 0- http:
//www.isca-speech.org/archive/icslp_1992/i92_1563.html
- Dao Wen Chen:
Study of the time extension flat net for speech recognition.
Hidden Markov Models
- Frank Fallside:
A hidden Markov model structure for the acquisition of speech by machine, ASM.
- Yasuyuki Masai, Shin'ichi Tanaka, Tsuneo Nitta:
Speaker-independent keyword recognition based on SMQ/HMM.
- Régis Cardin, Diane Goupil, Roxane Lacouture, Evelyne Millien, Charles Snow, Yves Normandin:
CRIM's spontaneous speech recognition system for the ATIS task.
- Fabio Brugnara, Renato de Mori, Diego Giuliani, Maurizio Omologo:
Improved connected digit recognition using spectral variation functions.
- Andrew Tridgell, Bruce Millar, Kim-Anh Do:
Alternative preprocessing techniques for discrete hidden Markov model phoneme recognition.
Dialogue
Prosody and Phonology 1,
2
- Li-chiung Yang:
A semantic and pragmatic analysis of tone and intonation in Mandarin Chinese.
- Yoshimasa Tsukuma:
On prosodic features in speech - comparative studies between Japanese and standard Chinese.
- W. Nick Campbell:
Prosodic encoding of English speech.
- Gunnar Fant, Anita Kruckenberg, Lennart Nord:
Prediction of syllable duration, speech rate and tempo.
- Rolf Carlson, Björn Granström, Lennart Nord:
Experiments with emotive speech - acted utterances and synthesized replicas.
- J. Caspars, Vincent J. van Heuven:
Phonetic properties of dutch accent lending pitch movements under time pressure.
- Jacques M. B. Terken, Karin van den Hombergh:
Judgments of relative prominence for adjacent and non-adjacent accents.
- Frédéric Beaugendre, Christophe d'Alessandro, Anne Lacheret-Dujour, Jacques M. B. Terken:
A perceptual study of French intonation.
- Mark Liberman, J. Michael Schultz, Soonhyun Hong, Vincent Okeke:
The phonetics of IGBO tone.
- Stefanie Shattuck-Hufnagel:
Stress shift as pitch accent placement: within-word early accent placement in american English.
Speaking Styles 1,
2
ASR in Noise
- Claude Lefebvre, Dariusz A. Zwierzynski, David R. Starks, Gary Birch:
Further optimisation of a robust IMELDA speech recogniser for applications with severely degraded speech.
- Richard M. Stern, Fu-Hua Liu, Yoshiaki Ohshima, Thomas M. Sullivan, Alejandro Acero:
Multiple approaches to robust speech recognition.
- Tadashi Kitamura, Satoshi Ando, Etsuro Hayahara:
Speaker-independent spoken digit recognition in noisy environments using dynamic spectral features and neural networks.
- Douglas A. Cairns, John H. L. Hansen:
ICARUS: an mwave-based real-time speech recognition system in noise and lombard effect.
- Chafic Mokbel, L. Barbier, Y. Kerlou, Gérard Chollet:
Word recognition in the car: adapting recognizers to new environments.
Dialogues and Applications
Sampling of Speech Production 1,
2
Labelling of Speech 1,
2
- Alain Marchal, William J. Hardcastle, K. Nicolaidis, Noël Nguyen, Fiona Gibbon:
Non-linear annotation of multi-channel speech data.
- Shingo Fujiwara, Yasuhiro Komori, Masahide Sugiyama:
A phoneme labelling workbench using HMM and spectrogram reading knowledge.
- Michael S. Phillips, Victor Zue:
Automatic discovery of acoustic measurements for phonetic classification.
- Itou Katunobu, Hayamizu Satoru, Tanaka Hozumi:
Detection of unknown words and automatic estimation of their transcriptions in continuous speech recognition.
- Fabio Brugnara, Daniele Falavigna, Maurizio Omologo:
A HMM-based system for automatic segmentation and labeling of speech.
- Robert W. P. Luk, Robert I. Damper:
A modification of the viterbi algorithm for stochastic phonographic transduction.
- Paul C. Bagshaw, Briony J. Williams:
Criteria for labelling prosodic aspects of English speech.
- Yifan Gong, Jean Paul Haton:
DTW-based phonetic labeling using explicit phoneme duration constraints.
- Kim E. A. Silverman, Mary E. Beckman, John F. Pitrelli, Mari Ostendorf, Colin W. Wightman, Patti Price, Janet B. Pierrehumbert, Julia Hirschberg:
TOBI: a standard for labeling English prosody.
- Barbara Eisen, Hans-Günther Tillmann, Christoph Draxler:
Consistency of judgements in manual labelling of phonetic segments: the distinction between clear and unclear cases.
Speech Production:
Models and Theories 1,
2
- Gunnar Fant:
Vocal tract area functions of Swedish vowels and a new three-parameter model.
- Jean-Claude Junqua:
Acoustic and production pilot studies of speech vowels produced in noise.
- Yves Laprie, Marie-Odile Berger:
Active models for regularizing formant trajectories.
- René Carré, Samir Chennoukh, Mohamad Mrayati:
Vowel-consonant-vowel transitions: analysis, modeling, and synthesis.
- Maureen Stone, Subhash Lele:
Representing the tongue surface with curve fits.
- Katherine S. Harris, Eric Vatikiotis-Bateson, Peter J. Alfonso:
Muscle forces in vowel vocal tract formation.
- Makoto Hirayama, Eric Vatikiotis-Bateson, Mitsuo Kawato, Kiyoshi Honda:
Neural network modeling of speech motor control.
- Eric Vatikiotis-Bateson, Makoto Hirayama, Kiyoshi Honda, Mitsuo Kawato:
The articulatory dynamics of running speech: gestures from phonemes?
Phonetics
Speech Data Bases
Stochastic ASR
- Victor Abrash, Horacio Franco, Michael Cohen, Nelson Morgan, Yochai Konig:
Connectionist gender adaptation in a hybrid neural network / hidden Markov model speech recognition system.
- Michael Cohen, Horacio Franco, Nelson Morgan, David E. Rumelhart, Victor Abrash:
Hybrid neural network/hidden Markov model continuous-speech recognition.
- Gernot A. Fink, Franz Kummert, Gerhard Sagerer, Ernst Günter Schukat-Talamazzini, Heinrich Niemann:
Semantic hidden Markov networks.
Analysis of Disfluencies in Speech 1,
2
Linguistic Phonetics:
Reduction
Hearing/Speech Impaired 1,
2
- Christian Benoît, Tayeb Mohamadi:
The lip benefit: auditory and visual intelligibility of French speech in noise.
- Anne-Marie Öster:
Phonological assessment of deaf children's productive knowledge as a basis for speech-training.
- Hideaki Seki, Akiko Hayashi, Satoshi Imaizumi, Takehiko Harada, Hiroshi Hosoi:
Factors affecting voicing distinction of stops for the hearing impaired.
- Arthur Boothroyd, Robin S. Waldstein, Eddy Yeung:
Investigations into the auditory F0 speechreading enhancement effect using a sinusoidal replica of the F0 contour.
- Francesco Cutugno:
Some considerations on pitch and timing control in deaf children.
- Shari R. Baum:
Rate of speech effects in aphasia: an acoustic analysis of voice onset time.
- Parth M. Bhatt:
Fundamental frequency attributes following unilateral left or right temporal lobe lesion.
- Hiroshi Hosoi, Satoshi Imaizumi, Akiko Hayashi, Takehiko Harada, Hideaki Seki:
Cue extraction and integration in speech perception for the hearing impaired.
- Anna K. Nabelek:
The relationship between spectral details in naturally produced vowels and identification errors in noise and reverberation.
- Donald G. Jamieson, Leonard Cornelisse:
Speech processing effects on intelligibility for hearing-impaired listeners.
Unit-Based ASR
Language and Dialect Characterization
Speech Production,
Parception,
and Analysis
- B. Boyanov, Gérard Chollet:
Pathological voice analysis using cepstra, bispectra and group delay functions.
- Qianje Fu, Peyu Xia, Ren-Hua Wang:
Lateralization of speech sounds by binaural distributing processing.
- H. H. Rump:
Timing of pitch movements and perceived vowel duration.
- J. P. Liu, Geneviève Baudoin, Gérard Chollet:
Studies of glottal excitation and vocal tract parameters using inverse filtering and a parameterized input model.
- Dennis Norris, Brit van Ooyen, Anne Cutler:
Speeded detection of vowels and steady-state consonants.
- Elzbieta B. Slawinski:
Temporal factors in the perception of consonants for different age and hearing impairment groups.
- Abeer Alwan:
The role of F3 and F4 in identifying place of articulation for stop consonants.
- Thomas R. Sawallis:
A new measure for perceptual weight of acoustic cues: an experiment on voicing in French intervocalic [t, d].
- Alan Wrench, Mervyn A. Jack, John Laver, Mary S. Jackson, David S. Soutar, A. Gerry Robertson, Janet MacKenzie Beck:
Objective speech quality assessment in patients with intra-oral cancers: voiceless fricatives.
- Bruce Connell:
Tongue contact, active articulators, and coarticulation.
- Makio Kashino, Astrid van Wieringen, Louis C. W. Pols:
Cross-languages differences in the identification of intervocalic stop consonants by Japanese and dutch listeners.
- Minoru Tsuzaki:
Effects of typicality and interstimulus interval on the discrimination of speech stimuli: within-subject comparison.
- Ronald A. Cole, Yeshwant K. Muthusamy:
Perceptual studies on vowels excised from continuous speech.
- Raymond S. Weitzman:
The relative perceptual salience of spectral and durational differences.
- Florien J. Koopmans-van Beinum:
Can 'level words' from one speaking style become teaks' when spliced into another speaking style?
- Beverley Gable, Helen Nemeth, Martin Haran:
Speech errors and task demand.
- John H. Esling, B. Craig Dickson, Roy C. Snell:
Analysis of phonation type using laryngographic techniques.
- Sumi Shigeno:
Effect of prototypes of vowels on speech perception in Japanese and English.
- Tomo-o Morohashi, Tetsuya Shimamura, Hiroyuki Yashima, Jouji Suzuki:
Characteristics of voice picked up from outer skin of larynx.
- Igor V. Nabelek:
Coding of voicing in whispered plosives.
- Margaret F. Cheesman, Shelly Lawrence, Allison Appleyard:
Performance on a nonsense syllable test using the articulation index.
- Donald G. Jamieson, Ketan Ramji, Issam Kheirallah, Terrance M. Nearey:
CSRE: a speech research environment.
Linguistic Phonetics
- Kazue Hata, Yoko Hasegawa:
A study of F0 reset in naturally-read utterances in Japanese.
- H. Samuel Wang, Fu-Dong Chiu:
On the nature of tone sandhi rules in taiwanese.
- Geoffrey S. Nathan:
How shallow is phonology: declarative phonologies meet fast speech.
- Junko Hosaka, Toshiyuki Takezawa, Noriyoshi Uratani:
Analyzing postposition drops in spoken Japanese.
- Jialu Zhang, Xinghui Hu:
Fundamental frequency patterns of Chinese in different speech modes.
- Knut Kvale, Arne Kjell Foldvik:
The multifarious r-sound.
- Zita McRobbie-Utasi:
The role of preaspiration duration in the voicing contrast in skolt sami.
- Eiji Yamada:
Parameter setting for abstract stress in tokyo Japanese.
- Georg Ottesen:
A method for studying prosody in texts read aloud.
- Vincent J. van Heuven:
Linguistic versus phonetic explanation of consonant lengthening after short vowels: a contrastive study of dutch and English.
- Kjell Elenius, Mats Blomberg:
Comparing phoneme and feature based speech recognition using artificial neural networks.
- Eva Strangert:
Prosodic cues to the perception of syntactic boundaries.
- Paul Taylor, Stephen Isard:
A new model of intonation for use with speech synthesis and recognition.
- Rudolf Weiss:
Computerized error detection/correction in teaching German sounds: some problems and solutions.
- Ahmed M. Elgendy:
Velum and epiglottis behavior during the production of Arabic pharyngeals and laryngeals: a fiberscopic study.
- Kim E. A. Silverman, Eleonora Blaauw, Judith Spitz, John F. Pitrelli:
A prosodic comparison of spontaneous speech and read speech.
- John J. Ohala, Maria Grazia Busa, Karen Harrison:
Phonological and psychological evidence that listeners normalize the speech signal.
- Elizabeth A. Hinkelman:
Intonation and the request/question distinction.
- Robert F. Port, Fred Cummins:
The English voicing contrast as velocity perturbation.
- Michael S. Ziolkowski, Mayumi Usami, Karen L. Landahl, Brenda K. Tunnock:
How many phonologies are there in one speaker? some experimental evidence.
- Hirokazu Sato:
Decomposition into syllable complexes and the accenting of Japanese loanwords.
- Jianfen Cao:
Temporal structure in bisyllabic word frame: an evidence for relational invariance and variability from standard Chinese.
- Shih-ping Wang:
The integration of phonetics and phonology: a case study of taiwanese "gemination" and syllable structure.
Perception and Production
- James H. Bradford:
Towards a robust speech interface for teleoperation systems.
- Piero Cosi, Paolo Frasconi, Marco Gori, N. Griggio:
Phonetic recognition experiments with recurrent neural networks.
- Mikael Goldstein, Björn Lindström, Ove Till:
Some aspects on context and response range effects when assessing naturalness of Swedish sentences generated by 4 synthesiser systems.
- Marcello Pelillo, Franca Moro, Mario Refice:
Probabilistic prediction of parts-of-speech from word spelling using decision trees.
- Dieter Barschdorff, Ulrich Gärtner:
Single word detection system with a neural classifier for recognizing speech at variable levels of background noise.
- Sharon L. Oviatt, Philip R. Cohen, Martin Fong, Michael Frank:
A rapid semi-automatic simulation technique for investigating interactive speech and handwriting.
- Sang-Hwa Chung, Dan I. Moldovan:
Speech understanding on a massively parallel computer.
- Chan-Do Lee:
Rationale for "performance phonology".
- Takuya Koizumi, Jyoji Urata, Shuji Taniguchi:
The effect of information feedback on the performance of a phoneme recognizer using kohonen map.
- Yasuharu Asano, Keikichi Hirose, Hiroya Fujisaki:
A method of dialogue management for the speech response system.
- Yumi Takizawa, Eiichi Tsuboka:
Syllable duration prediction for speech recognition.
- F. Canavesio, Giuseppe Castagneri, Giuseppe Di Fabbrizio, Francesco Senia:
Comparison between two methodologies of testing isolated word speech recognizers.
- He Jun, Henri Leich:
Extracting fuzzy features from MLP for recognition of speech.
- Keiji Fukuzawa, Yoshinaga Kato, Masahide Sugiyama:
A fuzzy partition model (FPM) neural network architecture for speaker-independent continuous speech recognition.
- A. Ennaji, Jean Rouat:
Conception of speech filters based on a neural network.
- Hong-Kwang Jeff Kuo, Chin-Hui Lee, Aaron E. Rosenberg:
Speaker set identification through speaker group modeling.
- Stephen Springer, Sara Basson, Judith Spitz:
Identification of principal ergonomic requirements for interactive spoken language systems.
- Thomas E. Jacobs, Eric R. Buhrke:
Performance of the united kingdom intelligent network automatic speech recognition system.
- Guy Deville, Pierre Mousel:
Evaluation of parsing strategies in natural language spoken man-machine dialogue.
- Yasuhisa Niimi, Yutaka Kobayashi:
An information retrieval system with a speech interface.
- J. P. Eatock, J. S. D. Mason:
Phoneme performance in speaker recognition.
- Evelyne Tzoukermann, Roberto Pieraccini, Zakhar Gorelov:
Natural language processing in the chronus system.
- Dominique Francois, Dominique Fohr:
Contribution of neural networks for phoneme identification in the APHODEX expert system.
- Douglas B. Paul:
A CSR-NL interface architecture.
- Pierre Lefebvre, Frank Poirier, G. Duncan:
Speech interface for a man-machine dialog with the unix operating system.
- P. Bardaud, François Capman, Chafic Mokbel, Chakib Tadj, Gérard Chollet:
Transformation of databases for the evaluation of speech recognizers.
- Yoichi Yamashita, Riichiro Mizoguchi:
Dialog management for speech output from concept representation.
- Seiichiro Hangai, Shigetoshi Sugiyama, Kazuhiro Miyauchi:
Speaker verification using locations and sizes of multipulses on neural networks.
- Carlos Teixeira, Isabel Trancoso:
Word rejection using multiple sink models.
- Boerge Lindberg:
Verification of language specific performance factors from recogniser testing on EUROM.1 CVC material.
- Alain Cozannet:
Modeling task driven oral dialogue.
- Wei-ying Li, Kechu Yi, Zheng Hu:
Introducing neural predictor to hidden Markov model for speech recognition.
- Feng Liu, Jianxin Jiang, Jun Cheng, Kechu Yi:
A neural network based on subnets - SNN.
Analysis/Systems
- Ute Ziegenhain:
Syntactic anaphora resolution in a speech understanding system.
- Marion Mast, Ralf Kompe, Franz Kummert, Heinrich Niemann, Elmar Nöth:
The dialog module of the speech recognition and dialog system EVAR.
- Yan Ming Cheng, Douglas D. O'Shaughnessy, Paul Mermelstein:
Statistical recovery of wideband speech from narrowband speech.
- Henk van den Heuvel, Toni C. M. Rietveld:
Speaker related variability in cepstral representations of dutch speech segments.
- Per Rosenbeck, Bo Baungaard:
Experiences from a real-world telephone application: teledialogue.
- K. Y. Lee, P. Ha, J. Rheem, S. Ann, I. Song:
Robust estimation of time-varying LP parameters on speech.
- Javier Hernando, Climent Nadeu, Eduardo Lleida:
On the AR modelling of the one-sided autocorrelation sequence for noisy speech recognition.
- Hiroshi Shimodaira, Mitsuru Nakai:
Robust pitch detection by narrow band spectrum analysis.
- S. Eady, B. Craig Dickson, Roy C. Snell, J. Woolsey, P. Ollek, A. Wynrib, J. Clayards:
A microcomputer-based system for real-time analysis and display of laryngograph signals.
- N. M. Veilleux, Mari Ostendorf, Colin W. Wightman:
Parse scoring with prosodic information.
- Ying Cheng, Paul Fortier, Yves Normandin:
Topic identification using a neural network with a keyword-spotting preprocessor.
- Shane Switzer, Tim Anderson, Matthew Kabrisky, Steven K. Rogers, Bruce W. Suter:
Frequency domain speech coding.
- Raymond Descout, Robert Bergeron, Bernard Meriald:
MEDIATEX-TASF: a closed captioning real-time service in French.
- S. A. Wilde, K. M. Curtis:
The wavelet transform for speech analysis.
- Pablo Aibar, Andrés Marzal, Enrique Vidal, Francisco Casacuberta:
Problems and algorithms in optimal linguistic decoding: a unified formulation.
- Jean Rouat, Sylvain Lemieux, Alain Migneault:
A spectro-temporal analysis of speech based on nonlinear operators.
- Miguel A. Berrojo, Javier Corrales, Jesus Macias, Santiago Aguilera:
A PC graphic tool for speech research based on a DSP board.
- Satoru Hayamizu, Katunobu Itou, Masafumi Tamoto, Kazuyo Tanaka:
A spoken language dialogue system for automatic collection of spontaneous speech.
- Shingo Nishioka, Yoichi Yamashita, Riichiro Mizoguchi:
A powerful disambiguating mechanism for speech understanding systems based on ATMs.
- Najib Naja, Jean-Marc Boucher, Samir Saoudi:
A mixed Gaussian-stochastic code book for CELP coder in LSP speech coding.
- Hiroyuki Kamata, Yoshihisa Ishida:
A method to estimate the transfer function of ARMA model of speech wave using prony method and homomorphic analysis.
- Børge Lindberg, Bjarne Andersen, Anders Baekgaard, Tom Brøndsted, Paul Dalsgaard, Jan Kristiansen:
An integrated dialogue design and continuous speech recognition system environment.
- Alain Marchal, Christine Meunier, P. Gavarry:
The PSH/DISPE helium speech cdrom.
Last update Fri May 25 08:22:58 2012
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page