ICSLP 1994:
Yokohama,
Japan
The 3rd International Conference on Spoken Language Processing, ICSLP 1994, Yokohama, Japan, September 18-22, 1994.
ISCA 1994
Plenary Lectures
- Ilse Lehiste:
Poetic metre, prominence, and the perception of prosody: a case of intersection of art and science of spoken language.
- Shizuo Hiki:
Possibilities of compensating for defects in speech perception and production.
- Willem J. M. Levelt:
On the skill of speaking: how do we access words?
Integration of Speech and Natural Language Processing
- Toshiyuki Takezawa, Tsuyoshi Morimoto:
An efficient predictive LR parser using pause information for continuously spoken sentence recognition.
- Kyunghee Kim, Geunbae Lee, Jong-Hyeok Lee, Hong Jeong:
Integrating TDNN-based diphone recognition with table-driven morphology parsing for understanding of spoken Korean.
- Frank O. Wallerstein, Akio Amano, Nobuo Hataoka:
Implementation issues and parsing speed evaluation of HMM-LR parser.
- Kenji Kita, Yoneo Yano, Tsuyoshi Morimoto:
One-pass continuous speech recognition directed by generalized LR parsing.
- Bernd Plannerer, Tobias Einsele, Martin Beham, Günther Ruske:
A continuous speech recognition system integrating additional acoustic knowledge sources in a data-driven beam search algorithm.
- Michael K. Brown, Bruce Buntschuh:
A context-free grammar compiler for speech understanding systems.
- Katashi Nagao, Kôiti Hasida, Takashi Miyata:
Probabilistic constraint for integrated speech and language processing.
- William H. Edmondson, Jon P. Iles:
A non-linear architecture for speech and natural language processing.
Articulatory Motion
- Donna Erickson, Kevin A. Lenzo, Masashi Sawada:
Manifestations of contrastive emphasis in jaw movement in dialogue.
- Sook-Hyang Lee, Mary E. Beckman, Michel Jackson:
Jaw targets for strident fricatives.
- David J. Ostry, Eric Vatikiotis-Bateson:
Jaw motions in speech are controlled in (at least) three degrees of freedom.
- Mark K. Tiede, Eric Vatikiotis-Bateson:
Extracting articulator movement parameters from a videodisc-based cineradiographic database.
- Maureen C. Stone, Andrew J. Lundberg:
Tongue-palate interactions in consonants vs. vowels.
- Philip Hoole, Christine Mooshammer, Hans G. Tillmann:
Kinematic analysis of vowel production in German.
- Sarah Hawkins, Andrew Slater:
Spread of CV and v-to-v coarticulation in british English: implications for the intelligibility of synthetic speech.
- Mariko Kondo:
Mechanisms of vowel devoicing in Japanese.
Cognitive Models for Spoken Language Processing
Semantic Interpretation of Spoken Messages
- Roland Kuhn, Renato de Mori:
Recent results in automatic learning rules for semantic interpretation.
- Allen L. Gorin:
Semantic associations, acoustic metrics and adaptive language acquisition.
- Wayne Ward:
Extracting information in spontaneous speech.
- Megumi Kameyama, Isao Arima:
Coping with aboutness complexity in information extraction from spoken dialogues.
- Otoya Shirotsuka, Ken'Ya Murakami:
An example-based approach to semantic information extraction from Japanese spontaneous speech.
- Akito Nagai, Yasushi Ishikawa, Kunio Nakajima:
A semantic interpretation based on detecting concepts for spontaneous speech understanding.
- Akira Shimazu, Kiyoshi Kogure, Mikio Nakano:
Cooperative distributed processing for understanding dialogue utterances.
- Michio Okada, Satoshi Kurihara, Ryohei Nakatsu:
Incremental elaboration in generating and interpreting spontaneous speech.
- Wieland Eckert, Heinrich Niemann:
Semantic analysis in a robust spoken dialog system.
- Hiroshi Kanazawa, Shigenobu Seto, Hideki Hashimoto, Hideaki Shinchi, Yoichi Takebayashi:
A user-initiated dialogue model and its implementation for spontaneous human-computer interaction.
Prosody
- Andreas Kießling, Ralf Kompe, Anton Batliner, Heinrich Niemann, Elmar Nöth:
Automatic labeling of phrase accents in German.
- Kikuo Maekawa:
Intonational structure of kumamoto Japanese: a perceptual validation.
- John F. Pitrelli, Mary E. Beckman, Julia Hirschberg:
Evaluation of prosodic transcription labeling reliability in the tobi framework.
- Neil P. McAngus Todd, Guy J. Brown:
A computational model of prosody perception.
- Kuniko Kakita:
Inter-speaker interaction in speech rhythm: some durational properties of sentences and intersentence intervals.
- Bertil Lyberg, Barbro Ekholm:
The final lengthening phenomenon in Swedish - a consequence of default sentence accent?
- Dawn M. Behne, Bente Moxness:
Concurrent effects of focal stress, postvocalic voicing and distinctive vowel length on syllable-internal timing in norwegian.
- Kazuyuki Takagi, Shuichi Itahashi:
Prosodic pattern of utterance units in Japanese spoken dialogs.
- Akira Ichikawa, Shinji Sato:
Some prosodical characteristics in spontaneous spoken dialogue.
Towards Natural Sounding Synthetic Speech
- Inger Karlsson, Johan Liljencrants:
Wrestling the two-mass model to conform with real glottal wave forms.
- Helmer Strik, Lou Boves:
Automatic estimation of voice source parameters.
- Wen Ding, Hideki Kasuya, Shuichi Adachi:
Simultaneous estimation of vocal tract and voice source parameters with application to speech synthesis.
- Pierre Badin, Christine H. Shadle, Y. Pham Thi Ngoc, J. N. Carter, W. S. C. Chiu, Celia Scully, K. Stromberg:
Frication and aspiration noise sources: contribution of experimental data to articulatory synthesis.
- Nobuhiro Miki, Pierre Badin, Y. Pham Thi Ngoc, Yoshihiko Ogawa:
Vocal tract model and 3-dimensional effect of articulation.
- Hisayoshi Suzuki, Jianwu Dang, Takayoshi Nakai, Akira Ishida, Hiroshi Sakakibara:
3-d FEM analysis of sound propagation in the nasal and paranasal cavities.
- Kiyoshi Honda, Hiroyuki Hirai, Jianwu Dang:
A physiological model of speech production and the implication of tongue-larynx interaction.
- Masaaki Honda, Tokihiko Kaburagi:
A dynamical articulatory model using potential task representation.
- Kenneth N. Stevens, Corine A. Bickley, David R. Williams:
Control of a klatt synthesizer by articulatory parameters.
Statistical Methods for Speech Recognition
- Nobuaki Minematsu, Keikichi Hirose:
Speech recognition using HMM with decreased intra-group variation in the temporal structure.
- Yukihiro Osaka, Shozo Makino, Toshio Sone:
Spoken word recognition using phoneme duration information estimated from speaking rate of input speech.
- Yumi Wakita, Eiichi Tsuboka:
State duration constraint using syllable duration for speech recognition.
- Satoru Hayamizu, Kazuyo Tanaka:
Statistical modeling and recognition of rhythm in speech.
- Xinhui Hu, Keikichi Hirose:
Recognition of Chinese tones in monosyllabic and disyllabic speech using HMM.
- Jun Wu, Zuoying Wang, Jiasong Sun, Jin Guo:
Chinese speech understanding and spelling-word translation based on the statistics of corpus.
- Ren-Hua Wang, Hui Jiang:
State-codebook based quasi continuous density hidden Markov model with applications to recognition of Chinese syllables.
- Eluned S. Parris, Michael J. Carey:
Estimating linear discriminant parameters for continuous density hidden Markov models.
- Franz Wolfertstetter, Günther Ruske:
Discriminative state-weighting in hidden Markov models.
- Takao Watanabe, Koichi Shinoda, Keizaburo Takagi, Eiko Yamada:
Speech recognition using tree-structured probability density function.
- David B. Roe, Michael D. Riley:
Prediction of word confusabilities for speech recognition.
- Li Zhao, Hideyuki Suzuki, Seiichi Nakagawa:
A comparison study of output probability functions in HMMs through spoken digit recognition.
- Tomio Takara, Naoto Matayoshi, Kazuya Higa:
Connected spoken word recognition using a many-state Markov model.
- Finn Tore Johansen:
Global optimisation of HMM input transformations.
- Don Sun, Li Deng:
Nonstationary-state hidden Markov model with state-dependent time warping: application to speech recognition.
- Jean-François Mari, Jean Paul Haton:
Automatic word recognition based on second-order hidden Markov models.
- Xixian Chen, Yinong Li, Xiaoming Ma, Lie Zhang:
On the application of multiple transition branch hidden Markov models to Chinese digit recognition.
- M. J. F. Gales, Steve J. Young:
Parallel model combination on a noise corrupted resource management task.
- Jean-Baptiste Puel, Régine André-Obrecht:
Robust signal preprocessing for HMM speech recognition in adverse conditions.
- Masaharu Katoh, Masaki Kohda:
A study on viterbi best-first search for isolated word recognition using duration-controlled HMM.
- Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano:
An HMM duration control algorithm with a low computational cost.
- Peter Beyerlein:
Fast log-likelihood computation for mixture densities in a high-dimensional feature space.
- Nick Cremelie, Jean-Pierre Martens:
Time synchronous heuristic search in a stochastic segment based recognizer.
- Maria-Barbara Wesenick, Florian Schiel:
Applying speech verification to a large data base of German to obtain a statistical survey about rules of pronunciation.
- Denis Jouvet, Katarina Bartkova, A. Stouff:
Structure of allophonic models and reliable estimation of the contextual parameters.
- Christoph Windheuser, Frédéric Bimbot, Patrick Haffner:
A probabilistic framework for word recognition using phonetic features.
- Mohamed Afify, Yifan Gong, Jean Paul Haton:
Nonlinear time alignment in stochastic trajectory models for speech recognition.
- David Lubensky, Ayman Asadi, Jayant M. Naik:
Connected digit recognition using connectionist probability estimators and mixture-Gaussian densities.
- Kazuya Takeda, Tetsunori Murakami, Shingo Kuroiwa, Seiichi Yamamoto:
A trellis-based implementation of minimum error rate training.
- Me Yi:
Concatenated training of subword HMMs using detected labels.
- Chih-Heng Lin, Pao-Chung Chang, Chien-Hsing Wu:
An initial study on speaker adaptation for Mandarin syllable recognition with minimum error discriminative training.
Phonetics & Phonology I,
II
- Yuko Kondo:
Phonetic underspecification in schwa.
- Shin'ichi Tanaka, Haruo Kubozono:
Some remarks on the compound accent rule in Japanese.
- Rodmonga K. Potapova:
Modifications of acoustic features in Russian connected speech.
- Sun-Ah Jun, Mira Oh:
A prosodic analysis of three sentence types with "WH" words in Korean.
- Kazue Hata, Heather Moran, Steve Pearson:
Distinguishing the voiceless fricatives f and TH in English: a study of relevant acoustic properties.
- Kenzo Itoh:
Correlation analysis between speech power and pitch frequency for twenty spoken languages.
- Jongho Jun:
On gestural reduction and gestural overlap in Korean and English /PK/ clusters.
- Carlos Gussenhoven, Toni C. M. Rietveld:
Intonation contours and the prominence of F0 peaks.
- Agnès Belotel-Grenié, Michel Grenié:
Phonation types analysis in standard Chinese.
- Mitsuru Nakai, Hiroshi Shimodaira:
Accent phrase segmentation by finding n-best sequences of pitch pattern templates.
- Bruce L. Derwing, Terrance M. Nearey:
Sound similarity judgments and segment prominence: a cross-linguistic study.
- Hiroya Fujisaki, Sumio Ohno, Kei-ichi Nakamura, Miguelina Guirao, Jorge A. Gurlekian:
Analysis of accent and intonation in Spanish based on a quantitative model.
- Edda Farnetani, Maria Grazia Busa:
Italian clusters in continuous speech.
- Cynthia Grover, Jacques M. B. Terken:
Rhythmic constraints in durational control.
- Kazutaka Kurisu:
Further evidence for bi-moraic foot in Japanese.
- Yuji Sagawa, Masahiro Ito, Noboru Ohnishi, Noboru Sugie:
A model for generating self-repairs.
- Christopher Cleirigh, Julie Vonwiller:
Accent identification with a view to assisting recognition (work in progress).
- K. Nagamma Reddy:
Phonetic, phonological, morpho-syntactic and semantic functons of segmental duration in spoken telugu: acoustic evidence.
- Zita McRobbie-Utasi:
Timing strategies within the paragraph.
- Sotaro Sekimoto:
The effect of the following vowel on the frequency normalization in the perception of voiceless stop consonants.
- Toshiko Muranaka, Noriyo Hara:
Features of prominent particles in Japanese discourse, frequency, functions and acoustic features.
- Shuping Ran, J. Bruce Millar, Iain MacLeod:
Vowel quality assessment based on analysis of distinctive features.
- Cristina Delogu, Stella Conte, Ciro Sementina:
Differences in the fluctuation of attention during the listening of natural and synthetic passages.
- Barbara Heuft, Thomas Portele:
Production and perception of words with identical segmental structure but different number of syllables.
- Caroline B. Huang, Mark A. Son-Bell, David M. Baggett:
Generation of pronunciations from orthographies using transformation-based error-driven learning.
- Hidenori Usuki, Jouji Suzuki, Tetsuya Shimamura:
Characteristics of mispronunciation and hesitation in Japanese tongue twister.
- Jean-Claude Junqua:
A duration study of speech vowels produced in noise.
- Bert Van Coile, L. Van Tichelen, Annemie Vorstermans, J. W. Jang, M. Staessen:
PROTRAN: a prosody transplantation tool for text-to-speech applications.
- Klaus J. Kohler:
Complementary phonology a theoretical frame for labelling an acoustic data base of dialogues.
- Sun-Ah Jun, Mary E. Beckman:
Distribution of devoiced high vowels in Korean.
- Yeo Bom Yoon:
CV as a phonological unit in Korean.
- Manjari Ohala:
Experiments on the syllable in hindi.
- John J. Ohala:
Towards a universal, phonetically-based, theory of vowel harmony.
- John Ingram, Tom Mylne:
Perceptual parsing of nasal vowels.
- Oded Ghitza, M. Mohan Sondhi:
On the perceptual distance between speech segments.
- Masato Akagi, Astrid van Wieringen, Louis C. W. Pols:
Perception of central vowel with pre- and post-anchors.
- Mario Rossi, Evelyne Peter-Defare, Regine Vial:
Phonological mechanisms of French speech errors.
- Mukhlis Abu-Bakar, Nick Chater:
Phonetic prototypes: modelling the effects of speaking rate on the internal structure of a voiceless category using recurrent neural networks.
- William J. Hardcastle:
EPG and acoustic study of some connected speech processes.
- Osamu Fujimura:
Syllable timing computation in the c/d model.
- Tatiana Slama-Cazacu:
Contribution of psycholinguistic perspective for speech technologies.
Adaption and Training for Speech Recognition
- Yutaka Tsurumi, Seiichi Nakagawa:
An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation.
- Koichi Shinoda, Takao Watanabe:
Unsupervised speaker adaptation for speech recognition using demi-syllable HMM.
- Wu Chou, C.-E. Lee, Biing-Hwang Juang:
Minimum error rate training of inter-word context dependent acoustic model units in speech recognition.
- Jia-Lin Shen, Hsin-Min Wang, Ren-Yuan Lyu, Lin-Shan Lee:
Incremental speaker adaptation using phonetically balanced training sentences for Mandarin syllable recognition based on segmental probability models.
- Lorenzo Fissore, Giorgio Micca, Franco Ravera:
Incremental training of a speech recognizer for voice dialling-by-name.
- C. J. Leggetter, Philip C. Woodland:
Speaker adaptation of continuous density HMMs using multivariate linear regression.
- Kazumi Ohkura, Hiroki Ohnishi, Masayuki Iida:
Speaker adaptation based on transfer vectors of multiple reference speakers.
- Nikko Strom:
Experiments with a new algorithm for fast speaker adaptation.
- Tung-Hui Chiang, Yi-Chung Lin, Keh-Yih Su:
A study of applying adaptive learning to a multi-module system.
- Jun'ichi Nakahashi, Eiichi Tsuboka:
Speaker adaptation based on fuzzy vector quantization.
- Myung-Kwang Kong, Seong-Kwon Lee, Soon-Hyob Kim:
A study on the simulated annealing of self organized map algorithm for Korean phoneme recognition.
- Celinda de la Torre, Alejandro Acero:
Discriminative training of garbage model for non-vocabulary utterance rejection.
Science and Technology for Multimodal Interfaces
- Eric Vatikiotis-Bateson, Inge-Marie Eigsti, Sumio Yano:
Listener eye movement behavior during audiovisual speech perception.
- Dominic W. Massaro, Michael M. Cohen:
Auditory/visual speech in multimodal human interfaces.
- Tadahisa Kondo, Kazuhiko Kakehi:
Effects of phonological and semantic information of kanji and kana characters on speech perception.
- Patricia K. Kuhl, Minoru Tsuzaki, Yoh'ichi Tohkura, Andrew N. Meltzoff:
Human processing of auditory-visual information in speech perception: potential for multimodal human-machine interfaces.
- Alex Pentland, Trevor Darrell:
Visual perception of human bodies and faces for multi-modal interfaces.
- Paul Duchnowski, Uwe Meier, Alex Waibel:
See me, hear me: integrating automatic speech recognition and lip-reading.
- Sharon L. Oviatt, Erik Olsen:
Integration themes in multimodal human-computer interaction.
- D. A. Berkley, James L. Flanagan, K. L. Shipley, Lawrence R. Rabiner:
A multimodal teleconferencing system using hands-free voice control.
- Paul Bertelson, Jean Vroomen, Geert Wiegeraad, Béatrice de Gelder:
Exploring the relation between mcgurk interference and ventriloquism.
- Jean-Claude Junqua, Philippe Morin:
Naturalness of the interaction in multimodal applications.
- Haru Ando, Yoshinori Kitahara, Nobuo Hataoka:
Evaluation of multimodal interface using spoken language and pointing gesture on interior design system.
- Kyung-ho Loken-Kim, Fumihiro Yato, Laurel Fais, Tsuyoshi Morimoto, Akira Kurematsu:
Linguistic and paralinguistic differences between multimodal and telephone-only dialogues.
Measurements and Models of Speech Production
- R. C. Rose, Juergen Schroeter, Man Mohan Sondhi:
An investigation of the potential role of speech production models in automatic speech recognition.
- Tokihiko Kaburagi, Masaaki Honda:
A trajectory formation model of articulatory movements based on the motor tasks of phoneme-specific vocal tract shapes.
- Martine George, Paul Jospa, Alain Soquet:
Articulatory trajectories generated by the control of the vocal tract by a neural network.
- Makoto Hirayama, Eric Vatikiotis-Bateson, Vincent Gracco, Mitsuo Kawato:
Neural network prediction of lip shape from muscle EMG in Japanese speech.
- Masahiro Hiraike, Shigehisa Shimizu, Takao Mizutani, Kiyoshi Hashimoto:
Estimation of the lateral shape of a tongue from speech.
- Paul Jospa, Alain Soquet:
The acoustic-articulatory mapping and the variational method.
- Xavier Pelorson, T. Lallouache, S. Tourret, C. Bouffartigue, Pierre Badin:
Aerodynamical, geometrical and mechanical aspects of bilabial plosives production.
- Jianwu Dang, Kiyoshi Honda:
Investigation of the acoustic characteristics of the velum for vowels.
- Kunitoshi Motoki, Pierre Badin, Nobuhiro Miki:
Measurement of acoustic impedance density distribution in the near field of the labial horn.
- Jean Schoentgen, Sorin Ciocea:
Explicit relations between resonance frequencies and vocal tract cross sections in loss-less kelly-lochbaum and distinctive region vocal tract models.
- Vesa Välimäki, Matti Karjalainen:
Improving the kelly-lochbaum vocal tract model using conical tube sections and fractional delay filtering techniques.
- Masafumi Matsumura, Takuya Nukawa, Koji Shimizu, Yasuji Hashimoto, Tatsuya Morita:
Measurement of 3d shapes of vocal tract, dental crown and nasal cavity using MRI: vowels and fricatives.
- Chang-Sheng Yang, Hideki Kasuya:
Accurate measurement of vocal tract shapes from magnetic resonance images of child, female and male subjects.
- Shrikanth Narayanan, Abeer Alwan, Katherine Haker:
An MRI study of fricative consonants.
- Eric Vatikiotis-Bateson, Mark K. Tiede, Yasuhiro Wada, Vincent Gracco, Mitsuo Kawato:
Phoneme extraction using via point estimation of real speech.
- Hiroki Matsuzaki, Nobuhiro Miki, Nobuo Nagai, Tohru Hirohku, Yoshihiko Ogawa:
3d FEM analysis of vocal tract model of elliptic tube with inhomogeneous-wall impedance.
- Yuki Kakita, Hitoshi Okamoto:
Chaotic characteristics of voice fluctuation and its model explanation: normal and pathological voices.
- Tadashige Ikeda, Yuji Matsuzaki:
Flow theory for analysis of phonation with a membrane model of vocal cord.
- B. Craig Dickson, John H. Esling, Roy C. Snell:
Real-time processing of electroglottographic waveforms for the evaluation of phonation types.
- Donna Erickson, Kiyoshi Honda, Hiroyuki Hirai, Mary E. Beckman, Seiji Niimi:
Global pitch range and the production of low tones in English intonation.
- Masafumi Matsumura, Kazuo Kimura, Katsumi Yoshino, Takashi Tachimura, Takeshi Wada:
Measument of palatolingual contact pressure during consonant productions using strain gauge transducer mounted platal plate.
- Kohichi Ogata, Yorinobu Sonoda:
A study of sensor arrangements for detecting movements and inclinations of tongue point during speech.
- Shinobu Masaki, Kiyoshi Honda:
Estimation of temporal processing unit of speech motor programming for Japanese words based on the measurement of reaction time.
Applications of Spoken Language Processing
- Jay G. Wilpon, David B. Roe:
Applications of speech recognition technology in telecommunications.
- Tsuneo Nitta:
Speech recognition applications in Japan.
- Tomohisa Hirokawa:
Trends in the applications of and market for speech synthesis technology.
- Baruch Mazor, Jerome Braun, Bonnie Zeigler, Solomon Lerner, Ming-Whei Feng, Han Zhou:
OASIS - a speech recognition system for telephone service orders.
- Ronald A. Cole, David G. Novick, Mark A. Fanty, Pieter J. E. Vermeulen, Stephen Sutton, Daniel C. Burnett, Johan Schalkwyk:
A prototype voice-response questionnaire for the u.s. census.
- Toshiaki Tsuboi, Shigeru Homma, Shoichi Matsunaga:
A speech-to-text transcription system for medical diagnoses.
- Marc Dymetman, Julie Brousseau, George F. Foster, Pierre Isabelle, Yves Normandin, Pierre Plamondon:
Towards an automatic dictation system for translators : the transtalk project.
- Kamil A. Grajski, Kurt Rodarmer:
Real-time, speaker-independent, continuous Spanish speech recognition for personal computer desktop command & control.
- Jun Noguchi, Shinsuke Sakai, Kaichiro Hatazaki, Ken-ichi Iso, Takao Watanabe:
An automatic voice dialing system developed on PC speech i/o platform.
- Martin Oerder, Harald Aust:
A realtime prototype of an automatic inquiry system.
- David Goddeau, Eric Brill, James R. Glass, Christine Pao, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff, Victor W. Zue:
GALAXY: a human-language interface to on-line travel information.
Speech Synthesis I,
II
- Merle Horne, Marcus Filipsson:
Generating prosodic structure for Swedish text-to-speech.
- Alan W. Black, Paul Taylor:
Assigning intonation elements and prosodic phrasing for English speech synthesis from high level linguistic input.
- Jan P. H. van Santen, Julia Hirschberg:
Segmental effects on timing and height of pitch contours.
- Toshiaki Fukada, Yasuhiro Komori, Takashi Aso, Yasunori Ohora:
A study on pitch pattern generation using HMM-based statistical information.
- Olivier Boëffard, Fábio Violaro:
Using a hybrid model in a text-to-sppech system to enlarge prosodic modifications.
- Akio Ando, Eiichi Miyasaka:
A new method for estimating Japanese speech rate.
- Emmy M. Konst, Lou Boves:
Automatic grapheme-to-phoneme conversion of dutch names.
- Briony Williams:
Diphone synthesis for the welsh language.
- Shinichi Doi, Kazuhiko Iwata, Kazunori Muraki, Yukio Mitome:
Pause control in Japanese text-to-speech conversion system with lexical discourse grammar.
- Naohiro Sakurai, Takerni Mochida, Tetsunori Kobayashi, Katsuhiko Shirai:
Generation of prosody in speech synthesis using large speech data-base.
- Niels-Jorn Dyhr, Marianne Elmlund, Carsten Henriksen:
Preserving naturalness in synthetic voices while minimizing variation in formant frequencies and bandwidths.
- Kazuhiro Takahashi, Kazuhiko Iwata, Yukio Mitome, Keiko Nagano:
Japanese text-to-speech conversion software for personal computers.
- Annemie Vorstermans, Jean-Pierre Martens:
Automatic labeling of speech synthesis corpora.
- Yasushi Ishikawa, Kunio Nakajima:
On synthesis units for Japanese text-to-speech synthesis.
- Judith L. Klavans, Evelyne Tzoukermann:
Inducing concatenative units from machine readable dictionaries and corpora for speech synthesis.
- Thomas Portele, Florian Höfer, Wolfgang J. Hess:
Structure and representation of an inventory for German speech synthesis.
- Anne Lacheret-Dujour, Vincent Pean:
Towards a prosodic cues-based modelling of phonological variability for text-to-speech synthesis.
- Isabel Trancoso, Céu Viana, Fernando M. Silva, Goncalo C. Marques, Luís C. Oliveira:
Rule-based vs neural network-based approaches to letter-to-phone conversion for portuguese common and proper names.
- Benjamin Ao, Chilin Shih, Richard Sproat:
A corpus-based Mandarin text-to-speech synthesizer.
- Kazuo Hakoda, Tomohisa Hirokawa, Kenzo Itoh:
Speech editor based on enhanced user-system interaction for high quality text-to-speech synthesis.
- Mats Ljungqvist, Anders Lindström, Kjell Gustafson:
A new system for text-to-speech conversion, and its application to Swedish.
- Yoshinori Shiga, Yoshiyuki Hara, Tsuneo Nitta:
A novel segment-concatenation algorithm for a cepstrum-based synthesizer.
- Florien J. Koopmans-van Beinum, Louis C. W. Pols:
Naturalness and intelligibility of rule-synthesized speech, supplied with specific spectro-temporal features derived from natural continuous speech.
New Approach for Brain Function Research in Speech Perception and Production/
- Karalyn Patterson, Karen Croot, John R. Hodges:
Speech production: Insights from a study of progressive aphasia.
- Makoto Iwata, Yasuhisa Sakurai, Toshimitsu Momose:
Functional mapping of cerebral mechanism of reading in the Japanese language.
- Dana F. Boatman, Ronald P. Lesser, Barry Gordon:
Cortical representation of speech perception and production, as revealed by direct cortical electrical interference.
- Michael D. Rugg, Catherine J. C. Cox, Michael C. Doyle:
Investigating word recognition and language comprehension with event-related brain potentials.
- Sue Franklin, Julie Morris, Judy Turner:
Dissociations in word deafness.
- Akira Uno, Jun Tanemura, Koichi Higo:
Recovery mechanism of naming disorders in aphasic patients: effects of different training modalities.
Language Modeling for Speech Recognition
- Michael K. Brown, Stephen C. Glinski:
Stochastic context-free language modeling with evolutional grammars.
- Nigel Ward:
A lightweight parser for speech understanding.
- Takeshi Kawabata:
Dynamic probabilistic grammar for spoken language disambiguation.
- Kouichi Yamaguchi, Harald Singer, Shoichi Matsunaga, Shigeki Sagayama:
Speaker-consistent parsing for speaker-independent continuous speech recognition.
- Masaaki Nagata:
A stochastic morphological analyzer for spontaneously spoken languages.
- Jean-Yves Antoine, Jean Caelen, Bertrand Caillaud:
Automatic adaptive understanding of spoken language by cooperation of syntactic parsing and semantic priming.
- Adwait Ratnaparkhi, Salim Roukos, Todd Ward:
A maximum entropy model for parsing.
- Jiro Kiyama, Yoshiaki Itoh, Ryuichi Oka:
Sentence spotting using continuous structuring method.
- Hiroyuki Sakamoto, Shoichi Matsunaga:
Continuous speech recognition using a dialog-conditioned stochastic language model.
- Tatsuya Kawahara, Toshihiko Munetsugu, Norihide Kitaoka, Shuji Doshita:
Keyword and phrase spotting with heuristic language model.
- Jin'ichi Murakami, Shoichi Matsunaga:
A spontaneous speech recognition algorithm using word trigram models and filled-pause procedure.
- Masayuki Yamada, Yasuhiro Komori, Yasunori Ohora:
Active/non-active word control using garbage model, unknown word re-evaluation in speech conversation.
- L. Chase, R. Rosenfeld, Wayne Ward:
Error-responsive modifications to speech recognizers: negative n-grams.
- Bernhard Suhm, Alex Waibel:
Towards better language models for spontaneous speech.
- Michael K. McCandless, James R. Glass:
Empirical acquisition of language models for speech recognition.
- Shigeru Fujio, Yoshinori Sagisaka, Norio Higuchi:
Prediction of prosodic phrase boundaries using stochastic context-free grammar.
- Egidio P. Giachin, Paolo Baggia, Giorgio Micca:
Language models for spontaneous speech recognition: a bootstrap method for learning phrase digrams.
- Monika Woszczyna, Alex Waibel:
Inferring linguistic structure in spoken language.
- Germán Bordel, I. Torrest, Enrique Vidal:
Back-off smoothing in a syntactic approach to language modelling.
- H.-H. Shih, Steve J. Young:
Computer assisted grammar construction.
- Giuliano Antoniol, Fabio Brugnara, Mauro Cettolo, Marcello Federico:
Language model estimations and representations for real-time continuous speech recognition.
- Bruno Jacob, Régine André-Obrecht:
Sub-dictionary statistical modeling for isolated word recognition.
- Michèle Jardino:
A class bigram model for very large corpus.
Models and Systems for Spoken Dialogue
- Akio Amano, Toshiyuki Odaka:
A spoken dialogue system based on hierarchical feedback mechanism.
- Niels Ole Bernsen, Laila Dybkjær, Hans Dybkjær:
A dedicated task-oriented dialogue theory in support of spoken language dialogue systems design.
- Farzad Ehsani, Kaichiro Hatazaki, Jun Noguchi, Takao Watanabe:
Interactive speech dialogue system using simultaneous understanding.
- Masahiro Araki, Taro Watanabe, Felix C. M. Quimbo, Shuji Doshita:
A cooperative man-machine dialogue model for problem solving.
- Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano:
A multi-modal dialogue system for telephone directory assistance.
- Mark Terry, Randall Sparks, Patrick Obenchain:
Automated query identification in English dialogue.
- Keiichi Sakai, Yuji Ikeda, Minoru Fujita:
Robust discourse processing considering misrecognition in spoken dialogue system.
- Keiko Watanuki, Kenji Sakamoto, Fumio Togawa:
Analysis of multimodal interaction data in human communication.
- Kazuhiro Arai:
Changes in user's responses with use of a speech dialog system.
- Katunobu Itou, Tomoyosi Akiba, Osamu Hasegawa, Satoru Hayamizu, Kazuyo Tanaka:
Collecting and analyzing nonverbal elements for maintenance of dialog using a wizard of oz simulation.
- Giovanni Flammia, James R. Glass, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff, Victor W. Zue:
Porting the bilingual voyager system to Italian.
- Gen-ichiro Kikui, Tsuyoshi Morimoto:
Similarity-based identification of repairs in Japanese spoken language.
- Lars Bo Larsen, Anders Baekgaard:
Rapid prototyping of a dialogue system using a generic dialogue development platform.
- Shozo Naito, Akira Shimazu:
Heuristics for generating acoustic stress in dialogues and examination of their validity.
- Jacques Siroux, Mouloud Kharoune, Marc Guyomard:
Application and dialogue in the sundial system.
- Shin-ichiro Kamei, Shinichi Doi, Takako Komatsu, Susumu Akamine, Hitoshi Iida, Kazunori Muraki:
A dialog analysis using information of the previous sentence.
- Kiyoshi Kogure, Akira Shimazu, Mikio Nakano:
Recognizing plans in more natural dialogue utterances.
- Bernd Hildebrandt, Gernot A. Fink, Franz Kummert, Gerhard Sagerer:
Understanding of time constituents in spoken language dialogues.
- Tadahiko Kumamoto, Akira Ito, Tsuyoshi Ebina:
An analysis of Japanese sentences in spoken dialogue and its application to communicative intention recognition.
- Beth Ann Hockey:
Extra propositional focus and belief revision.
- Daniel Schang, Laurent Romary:
Frames, a unified model for the representation of reference and space in a man-machine dialogue.
- Masahito Kawamori, Akira Shimazu, Kiyoshi Kogure:
Roles of interjectory utterances in spoken discourse.
- Yukiko Ishikawa:
Communicative mode dependent contribution from the recipient in information providing dialogue.
- Alain Cozannet, Jacques Siroux:
Strategies for oral dialogue control.
- Astrid Brietzmann, Fritz Class, Ute Ehrlich, Paul Heisterkamp, Alfred Kaltenmeier, Klaus Mecklenburg, Peter Regel-Brietzmann:
Robust speech understanding.
- Yoichi Yamashita, Keiichi Tajima, Yasuo Nomura, Riichiro Mizoguchi:
Dialog context dependencies of utterances generated from concept reperesentation.
- Shu Nakazato, Katsuhiko Shirai:
Effects on utterances caused by knowledge on the hearer.
- A. Ferrieux, M. David Sadek:
An efficient data-driven model for cooperative spoken dialogue.
- James R. Glass, Joseph Polifroni, Stephanie Seneff:
Multilingual language generation across multiple domains.
Speech Recognition in Adverse Environments
- Chafic Mokbel, R. Paches-Leal, Denis Jouvet, Jean Monné:
Compensation of telephone line effects for robust speech recognition.
- Jun-ichi Takahashi, Shigeki Sagayama:
Telephone line characteristic adaptation using vector field smoothing technique.
- Jane Chang, Victor W. Zue:
A study of speech recognition system robustness to microphone variations: experiments in phonetic classification.
- Tadashi Suzuki, Kunio Nakajima, Yoshiharu Abe:
Isolated word recognition using models for acoustic phonetic variability by lombard effect.
- John H. L. Hansen, Brian D. Womack, Levent M. Arslan:
A source generator based production model for environmental robustness in speech recognition.
- Hiroshi Matsumoto, Hiroyuki Imose:
A frequency-weighted continuous density HMM for noisy speech recognition.
- Lee-Min Lee, Hsiao-Chuan Wang:
A study on adaptations of cepstral and delta cepstral coefficients for noisy speech recognition.
- Kuldip K. Paliwal, Bishnu S. Atal:
A comparative study of feature representations for robust speech recognition in adverse environments.
- Hugo Van Hamme:
ARDOSS: autoregressive domain spectral subtraction for robust speech recognition in additive noise.
- Keizaburo Takagi, Hiroaki Hattori, Takao Watanabe:
Speech recognition with rapid environment adaptation by spectrum equalization.
- Richard M. Stern, Fu-Hua Liu, Pedro J. Moreno, Alejandro Acero:
Signal processing for robust speech recognition.
- Olivier Siohan, Yifan Gong, Jean Paul Haton:
A comparison of three noisy speech recognition approaches.
Speech Analysis
Prosody of Discourse and Dialogue
- Shigeru Kiritani, Kikuo Maekawa, Hajime Hirose:
Intonation pattern with focus and related muscle activities in tokyo dialect.
- Jianfen Cao:
The effects of contrastive accent and lexical stress upon temporal distribution in a sentence.
- Henrietta J. Cedergren, Hélène Perreault:
Speech rate and syllable timing in spontaneous speech.
- Hyunbok Lee, Narn-taek Jin, Cheol-jae Seong, Il-jin Jung, Seung-mie Lee:
An experimental phonetic study of speech rhythm in standard Korean.
- Noriko Umeda, Toby Wedmore:
A rhythm theory for spontaneous speech: the role of vowel amplitude in the rhythmic hierarchy.
- Gösta Bruce, Björn Granström, Kjell Gustafson, David House, Paul Touati:
Modelling Swedish prosody in a dialogue framework.
- Hiroya Fujisaki, Sumio Ohno, Masafumi Osame, Mayumi Sakata, Keikichi Hirose:
Prosodic characteristics of a spoken dialogue for information query.
- Shoichi Takeda, Yoshiyuki Itoh, Norifumi Sakuma, Kei Yokosato:
Analysis of prosodic and linguistic features of spontaneous Japanese conversational speech.
- Nick Campbell:
Combining the use of duration and F0 in an automatic analysis of dialogue prosody.
- Gabriele Bakenecker, Hans Ulrich Block, Anton Batliner, Ralf Kompe, Elmar Nöth, Peter Regel-Brietzmann:
Improving parsing by incorporating 'prosodic clause boundaries into a grammar.
- Andrew Hunt:
A prosodic recognition module based on linear discriminant analysis.
- Keikichi Hirose, Atsuhiro Sakurai, Hiroyuki Konno:
Use of prosodic features in the recognition of continuous speech.
Spoken Language Cognition and Its Disorders
- Taeko Nakayama Wydell, Brian Butterworth:
The inconsistency of consistency effects in reading: the case of Japanese kanji phonology.
- Valter Ciocca, Livia Wong, Lydia K. H. So:
An acoustic analysis of unreleased stop consonants in word-final position.
- Jean Vroomen, Béatrice de Gelder:
Speech segmentation in dutch: no role for the syllable.
- James M. McQueen:
Do ambiguous fricatives rhyme? lexical involvement in phonetic decision-making depends on task demands.
- Pierre A. Hallé, Juan Segui:
Moraic segmentation in Japanese revisited.
- Jennifer J. Venditti, Hiroko Yamashita:
Prosodic information and processing of temporarily ambiguous constructions in Japanese.
- Nobuaki Minematsu, Keikichi Hirose:
Role of prosodic features in the human process of speech perception.
- Masahiro Hashimoto, Hideaki Seki:
Limitations of lip-reading advantage by desynchronizing visual and auditory information in speech.
- Sue Franklin, Judy Turner, Julie Morris:
Word meaning deafness: effects of word type.
- Mikio Masukata, Seiichi Nakagawa:
Concept and grammar acquisition based on combining with visual and auditory information.
- Gavin J. Dempster, Sheila M. Williams, Sandra P. Whiteside:
The punch and judy man: a study of phonological / phonetic variation.
- Hartmut Traunmller, Renée van Bezooijen:
The auditory perception of children's age and sex.
- James S. Magnuson, Reiko Akahane-Yamada, Howard C. Nusbaum:
Are representations used for talker identification available for talker normalization?
- Yoko Hasegawa, Kazue Hata:
Non-physiological differences between male and female speech: evidence from the delayed F0 fall phenomenon in Japanese.
- Tatsuya Kitamura, Masato Akagi:
Speaker individualities in speech spectral envelopes.
- Duncan Markham:
Prosodic imitation: productional results.
- Fiona Gibbon, William J. Hardcastle:
Articulatory description of affricate production in speech disordered children using electropalatography (EPG).
- Akira Ujihira, Haruo Kubozono:
A phonetic and phonological analysis of stuttering in Japanese.
- Donald G. Jamieson, Susan Rvachew:
Perception, production and training of new consonant contrasts in children with articulation disorders.
- Sachiko Nakakoshi, Atsushi Mizobuchi, Hiroto Katori:
Cognitive processes of speech sounds in a brain-damaged patient.
- N. Suzuki, H. Dent, Masahiko Wakumoto, Fiona Gibbon, Ken-ich Michi, William J. Hardcastle:
A cross-linguistic study of lateral /s/ using electropalatography (EPG).
- Junko Matsubara, Toshihiro Kashiwagi, Morio Kohno, Hirotaka Tanabe, Asako Kashiwagi:
Prosody of recurrent utterances in aphasic patients.
- Virginia LoCastro:
Intonation and language teaching.
- Tsuyoshi Nara, P. Bhaskararao:
A computer-aided phonetic instruction system for south-asian languages.
- Morio Kohno, Junko Matsubara, Katsuko Higuchi, Toshihiro Kashiwagi:
Rhythm processing by a patient with pure anarthria: some suggestions on the role of rhythm in spoken language processing.
- Nobuko Yamada:
Japanese accentuation of foreign learners and its interlanguage.
- Masato Kaneko:
Mechanisms producing recurring utterances in a patient with slowly progressive aphasia.
- Kiyokata Katoh, Takako Ayusawa, Yukihiro Nishinuma, Richard Harrison, Kikuko Yamashita:
Hypermedia for spoken language education.
- P. Bhaskararao, Venkata N. Peri, Vishwas Udpikar:
A text-to-speech system for application by visually handicapped and illiterate.
Spoken Language Systems and Assessments
- Diego Giuliani, Maurizio Omologo, Piergiorgio Svaizer:
Talker localization and speech recognition using a microphone array and a cross-powerspectrum phase analysis.
- Qiguang Lin, Ea-Ee Jan, ChiWei Che, Bert de Vries:
System of microphone arrays and neural networks for robust speech recognition in multimedia environments.
- Manny Rayner, David M. Carter, Patti Price, Bertil Lyberg:
Estimating performance of pipelined spoken language translation systems.
- Cheol-Woo Jo, Kyung-Tae Kim, Yong-Ju Lee:
Generation of multi-syllable nonsense words for the assessment of Korean text-to-speech system.
- Aruna Bayya, Michael Durian, Lori Meiskey, Rebecca Root, Randall Sparks, Mark Terry:
Voice map: a dialogue-based spoken language information access system.
- Shigenobu Seto, Kazuhiro Kimura:
Development of a document preparation system with speech command using EDR electronic dictionaries.
- Bianca Angelini, Giuliano Antoniol, Fabio Brugnara, Mauro Cettolo, Marcello Federico, Roberto Fiutem, Gianni Lazzari:
Radiological reporting by speech recognition: the a.re.s. system.
- Samir Bennacef, Hélène Bonneau-Maynard, Jean-Luc Gauvain, Lori Lamel, Wolfgang Minker:
A spoken language system for information retrieval.
- Børge Lindberg:
Recogniser response modelling from testing on series of minimal word pairs.
- Toshimitsu Minowa, Yasuhiko Arai, Hisanori Kanasashi, Tatsuya Kimura, Takuji Kawamoto:
A study on the problems for apllication of voice interface based on ford recognition.
- Hiroyuki Kamio, Mika Koorita, Hiroshi Matsu'ura, Masafumi Tamura, Tsuneo Nitta:
A UI design support tool for multimodal spoken dialogue system.
- Takuya Nishirnoto, Nobutoshi Shida, Tetsunori Kobayashi, Katsuhiko Shirai:
Multimodal drawing tool using speech, mouse and key-board.
- Yasuhiko Arai, Toshimitsu Minowa, Hiroko Yoshida, Hirofumi Nishimura, Hiroyvki Kamata, Takashi Honda:
Generation of non-entry words from entries of the natural speech database.
- Pedro Gómez Vilda, Daniel Martinez, Victor Nieto Lluis, Victoria Rodellar:
MECALLSAT: a multimedia environment for computer-aided language learning incorporating speech assessment techniques.
- Arthur E. McNair, Alex Waibel:
Improving recognizer acceptance through robust, natural speech repair.
- David Fay:
User acceptance of automatic speech recognition in telephone services.
- Stephen Love, R. T. Dutton, John C. Foster, Mervyn A. Jack, F. W. M. Stentiford:
Identifying salient usability attributes for automated telephone services.
- Arnd Mariniak:
Word complexity measures in the context of speech intelligibility tests.
- Frank H. Wu, Monica A. Maries:
Recognition accuracy methods and measures.
- Ute Jekosch, Louis C. W. Pols:
A feature-profile for application-specific speech synthesis assessment and evaluation.
- Thomas Hegehofer:
A description model for speech assessment tests with subjects.
- Victoria Rodellar, Antonio Diaz, Jose Gallardo, Virginia Peinado, Victor Nieto Lluis, Pedro Gómez Vilda:
VLSI implementation of a robust hybrid parameter-extractor and neural network for speech decoding.
- Toshiro Watanabe, Shinji Hayashi:
An objective measure for qualitatively assessing low-bit-rate coded speech.
- Kazuhiko Ozeki:
Performance comparison of recognition systems based on the akaike information criterion.
- Nobutoshi Hanai, Richard M. Stern:
Robust speech recognition in the automobile.
- Javier Macías Guarasa, Manuel A. Leandro, José Colás, Alvaro Villegas, Santiago Aguilera, José Manuel Pardo:
On the development of a dictation machine for Spanish: DIVO.
- Yoshiaki Ohshima, Richard M. Stern:
Environmental robustness in automatic speech recognition using physiologic ally-motivated signal processing.
Large Vocabulary/Speaker Independent Speech
- V. Valtchev, J. J. Odell, Philip C. Woodland, Steve J. Young:
Recognition ********* a dynamic network decoder design for large vocabulary speech recognition.
- Hermann Ney, Xavier L. Aubert:
A word graph algorithm for large vocabulary, continuous speech recognition.
- Michael S. Phillips, David Goddeau:
Fast match for segment-based large vocabulary continuous speech recognition.
- Chuck Wooters, Andreas Stolcke:
Multiple-pronunciation lexical modeling in a speaker independent speech understanding system.
- Yves Normandin, Roxane Lacouture, Régis Cardin:
MMIE training for large vocabulary continuous speech recognition.
- Yen-Ju Yang, Sung-Chien Lin, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee:
An intelligent and efficient word-class-based Chinese language model for Mandarin speech recognition with very large vocabulary.
- Tetsuo Kosaka, Shoichi Matsunaga, Shigeki Sagayama:
Tree-structured speaker clustering for speaker-independent continuous speech recognition.
- Tatsuya Kimura, Hiroyasu Kuwano, Akira Ishida, Taisuke Watanabe, Shoji Hiraoka:
Compact-size speaker independent speech recognizer for large vocabulary using "compats" method.
- Yasuyuki Masai, Jun'ichi Iwasaki, Shin'ichi Tanaka, Tsuneo Nitta, Masahiro Yao, Tomohiro Onogi, Akira Nakayama:
A keyword-spotting unit for speaker-independent spontaneous speech recognition.
- Myoung-Wan Koo, Sang-Kyu Park, Kyung-Tae Kong, Sam-joo Doh:
KT-stock: a speaker-independent large-vocabulary speech recognition system over the telephone.
- Bianca Angelini, Fabio Brugnara, Daniele Falavigna, Diego Giuliani, Roberto Gretter, Maurizio Omologo:
Speaker independent continuous speech recognition using an acoustic-phonetic Italian corpus.
Perception and Structure of Spoken Language
- Roy D. Patterson, Timothy R. Anderson, Michael Allerhand:
The auditory image model as a preprocessor for spoken language.
- Hideki Kawahara:
Effects of natural auditory feedback on fundamental frequency control.
- Tomohiro Nakatani, Takeshi Kawabata, Hiroshi G. Okuno:
Unified architecture for auditory scene analysis and spoken language processing.
- Anne Cutler, Duncan Young:
Rhythmic structure of word blends in English.
- Kazuhiko Kakehi, Kazumi Kato:
Perception for VCV speech uttered simultaneously or sequentially by two talkers.
- Shigeaki Amano:
Perception of time-compressed/expanded Japanese words depends on the number of perceived phonemes.
- Monique Radeau, Juan Segui, José Morais:
The effect of overlap position in phonological priming between spoken words.
- Masuzo Yanagida:
A cognitive model of inferring unknown words and uncertain sound sequence.
- Takashi Otake, Kiyoko Yoneyama:
A moraic nasal and a syllable structure in Japanese.
- Paula M. T. Smeele, Anne C. Sittig, Vincent J. van Heuven:
Temporal organization of bimodal speech information.
- Sumi Shigeno:
The use of auditory and phonetic memories in the discrimination of stop consonants under audio-visual presentation.
Voice Quality
- Inger Karlsson:
Controlling voice quality of synthetic speech.
- Louis C. W. Pols:
Voice quality of synthetic speech: representation and evaluation.
- Etsuko Ofuka, Hélène Valbret, Mitch G. Waterman, Nick Campbell, Peter Roach:
The role of F0 and duration in signalling affect in Japanese: anger, kindness and Politeness.
- Gunnar Fant, Anita Kruckenberg, Johan Liljencrants, Mats Båvegård:
Voice source parameters in continuous speech, transformation of LF-parameters.
- Masanobu Abe, Hideyuki Mizuno:
Speaking style conversion by changing prosodic parameters and formant frequencies.
- Hideki Kasuya, Xuan Tan, Chang-Sheng Yang:
Voice source and vocal tract characteristics associated with speaker individuality.
- Sadaoki Furui, Tomoko Matsui:
Phoneme-level voice individuality used in speaker recognition.
- Satoshi Imaizumi, Hartono Abdoerrachman, Seiji Niimi:
Controllability of voice quality: evidence from physiological and acoustic observations.
- Guus de Krom:
Spectral correlates of breathiness and roughness for different types of vowel fragments.
- John H. Esling, Lynn Marie Heap, Roy C. Snell, B. Craig Dickson:
Analysis of pitch dependence of pharyngeal, faucal, and larynx-height voice quality settings.
Neural Network and Connectionist Approaches
- KyungMin Na, JaeYeol Rheem, SouGuil Ann:
Minimum-error-rate training of predictive neural network models.
- Allen L. Gorin, H. Hanek, R. C. Rose, Laura G. Miller:
Spoken language acquisition for automated call routing.
- Eliathamby Ambikairajah, Owen Friel, William Millar:
A speech recognition system using both auditory and afferent pathway signal processing.
- Steve Renals, Mike Hochberg:
Using gamma filters to model temporal dependencies in speech.
- Jan P. Verhasselt, Jean-Pierre Martens:
Phone recognition using a transition-controlled, segment-based dp/mlp hybrid.
- Mike Hochberg, Steve Renals, Anthony J. Robinson, Dan J. Kershaw:
Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system.
- Dong Yu, Taiyi Huang, Dao Wen Chen:
A multi-state NN/HMM hybrid method for high performance speech recognition.
- Fikret S. Gürgen, J. M. Song, R. W. King:
A continuous HMM based preprocessor for modular speech recognition neural networks.
- Ying Cheng, Paul Fortier, Yves Normandin:
System integrating connectionist and ibolic approaches for spoken language understanding.
- Xavier Menéndez-Pidal, Javier Ferreiros, Ricardo de Córdoba, José Manuel Pardo:
Recent work in hybrid neural networks and HMM systems in CSR tasks.
- Jean-François Mari, Dominique Fohr, Yolande Anglade, Jean-Claude Junqua:
Hidden Markov models and selectively trained neural networks for connected confusable word recognition.
- Yochai Konig, Nelson Morgan:
Modeling dynamics in connectionist speech recognition - the time index model.
- Dao Wen Chen, Xiao Dong Li, San Zhu, Dongxin Xu, Taiyi Huang:
Mandarin syllables recognition by subsyllables dynamic neural network.
- Shigeki Okawa, Christoph Windheuser, Frédéric Bimbot, Katsuhiko Shirai:
Evaluation of phonetic feature recognition with a time-delay neural network.
- Enric Monte, Javier Hernando Pericas:
A self organizing feature map based on the fisher discriminant.
- Richard R. Favero, Fikret S. Gürgen:
Using wavelet dyadic grids and neural networks for speech recognition.
- Hiroaki Hattori:
A normalization method of prediction error for neural networks.
- Philippe Le Cerf, Dirk Van Compernolle:
Recurrent neural network word models for small vocabulary speech recognition.
- Yoshinaga Koto, Shigeru Katagiri:
A novel fuzzy partition model architecture for classifying dynamic patterns.
- Martin Cooke, Phil D. Green, Malcolm Crawford:
Handling missing data in speech recognition.
- Patrick Haffner:
A new probabilistic framework for connectionist time alignment.
- Ken-ichi Iso:
A speech recognition model using internal degrees of freedom.
- Dongxin Xu, Dao Wen Chen, Qian Ma, Bo Xu, Taiyi Huang:
Adaptation of neural network model: comparison of multilayer perceptron and LVQ.
- Takuya Koizumi, Shuji Taniguchi, Ken-ichi Hattori, Mikio Mori:
Simplified sub-neural-networks for accurate phoneme recognition.
- Victoria Rodellar, Victor Nieto Lluis, Pedro Gómez Vilda, Daniel Martinez, Mercedes Pérez:
A neural network for phonetically decoding the speech trace.
- Kiyoaki Aikawa, Tsuyoshi Saito:
Noise robust speech recognition using a dynamic-cepstrum.
Speech Analysis and Enhancement
- Toshiyuki Aritsuka, Yoshito Nejime:
Telephone-band speech enhancement based on the fundamental frequency component compensation.
- Nobuyuki Kunieda, Tetsuya Shimamura, Jouji Suzuki, Hiroyuki Yashima:
Reduction of noise level by SPAD (speech processing system by use of auto-difference function).
- Yuki Yoshida, Masanobu Abe:
An algorithm to reconstruct wideband speech from narrowband speech based on codebook mapping.
- C. W. Seymour, M. Niranjan:
An hmm-based cepstral-domain speech enhancement system.
- Naoto Iwahashi, Yoshinori Sagisaka:
Voice adaptation using multi-functional transformation with weighting by radial basis function networks.
- Hong Tang, Xiaoyuan Zhu, Iain MacLeod, J. Bruce Millar, Michael Wagner:
A dynamic-window weighted-RMS averaging filter applied to speaker identification.
- Hiroshi Yasukawa:
Quality enhancement of band limited speech by filtering and multirate techniques.
- Thanh Tung Le, John S. Mason, Tadashi Kitamura:
Characteristics of multi-layer perceptron models in enhancing degraded speech.
- Adam B. Fineberg, Kevin C. Yu:
A time-frequency analysis technique for speech recognition signal processing.
- Paavo Alku, Erkki Vilkman:
Estimation of the glottal pulseform based on discrete all-pole modeling.
- H. Nishi, M. Kitai:
Analysis and detection of double talk in telephone dialogs.
- Ove Andersen, Paul Dalsgaard:
A self-learning approach to transcription of danish proper names.
- Eisuke Horita, Yoshikazu Miyanaga, Koji Tochinai:
A time-varying analysis based on analytic speech signals.
- Takashi Endo, Shun'ichi Yajima:
New spectrum interpolation method for improving quality of synthesized speech.
- Mark Johnson:
Automatic context-sensitive measurement of the acoustic correlates of distinctive features at landmarks.
- Alain Soquet, Marco Saerens:
A comparison of different acoustic and articulatory representations for the determination of place of articulation of plosives.
- Naotoshi Osaka:
An analysis of voice quality using sinusoidal model.
- Alan Wrench, M. M. Watson, David S. Soutar, A. Gerry Robertson, John Laver:
Fast formant estimation of children's speech.
- Josep M. Salavedra, Enrique Masgrau, Asunción Moreno, Joan Estarellas, Javier Hernando:
Some fast higher order AR estimation techniques applied to parametric wiener filtering.
- Mikio Yamaguchi, Shigeharu Toyoda, Katsuhiro Yada:
Applications of a rule-based speech synthesizer module.
- Jon P. Iles, William H. Edmondson:
Quasi-articulatory formant synthesis.
- Knut Kvale:
On the connection between manual segmentation conventions and "errors" made by automatic segmentation.
- Mutsuko Tomokiyo:
Natural utterance segmentation and discourse label assignment.
- Satoshi Yumoto, Jouji Suzuki, Tetsuya Shimamura:
Possibility of speech synthesis by common voice source.
- Changfu Wang, Wenshen Yue, Keikichi Hirose, Hiroya Fujisaki:
A scheme for Chinese speech synthesis by rule based on pitch-synchronous multi-pulse excitation LP method.
- Anders Lindström, Mats Ljungqvist:
Text processing within a speech synthesis system.
- P. Carvalho, P. Lopes, Isabel Trancoso, Luís C. Oliveira:
E-mail to voice-mail conversion using a portuguese text-to-speech system.
- Shigeyoshi Kitazawa, Satoshi Kobayashi, Takao Matsunaga, Hideya Ichikawa:
Tempo estimation by wave envelope for recognition of paralinguistic features in spontaneous speech.
Acquisition of Spoken Language
- Teruaki Tsushima, Osamu Takizawa, Midori Sasaki, Satoshi Shiraki, Kanae Nishi, Morio Kohno, Paula Menyuk, Catherine T. Best:
Discrimination of English /r-l/ and /w-y/ by Japanese infants at 6-12 months: language-specific developmental changes in speech perception abilities.
- Hiroaki Kojima, Kazuyo Tanaka, Satoru Hayamizu:
Generating phoneme models for forming phonological concepts.
- Yoko Shimura, Satoshi Imaizumi:
Infant's expression and perception of emotion through vocalizations.
- Tomohiko Ito:
Transition from two-word to multiple-word stage in the course of language acquisition.
- P. V. S. Rao, Nandini Bondale:
BSLP based language grammars for child speech.
- John Nienart, J. Devin McAuley:
Using prediction to learn pre-linguistic speech characteristics: a connectionist model.
Education of Spoken Language
Speech/Language Database
- Tsuyoshi Morimoto, Noriyoshi Uratani, Toshiyuki Takezawa, Osamu Furuse, Yasuhiro Sobashima, Hitoshi Iida, Atsushi Nakamura, Yoshinori Sagisaka, Norio Higuchi, Yasuhiro Yamazaki:
A speech and language database for speech translation research.
- Lori Lamel, Florian Schiel, Adrian Fourcin, Joseph Mariani, Hans G. Tillmann:
The translanguage English database (TED).
- Ikuo Kudo, Takao Nakama, Nozomi Arai, Nahoko Fujimura:
The data collection of voice across Japan (VAJ) project.
- M. Damhuis, T. I. Boogaart, C. in't Veld, M. Versteijlen, W. Schelvis, L. Bos, Lou Boves:
Creation and analysis of the dutch polyphone corpus.
- Per Rosenbeck, Bo Baungaard, Claus Jacobsen, Dan-Joe Barry:
The design and efficient recording of a 3000 speaker scandinavian telephone speech database: rafael.0.
- Daniel Tapias, Alejandro Acero, J. Esteve, Juan Carlos Torrecilla:
The VESTEL telephone speech database.
- Ronald A. Cole, Mark A. Fanty, Mike Noel, Terri Lander:
Telephone speech corpus development at CSLU1.
- P. E. Kenne, Hamish G. Pearcy, Mary O'Kane:
Derivation of a large speech and natural language database through alignment of court recordings an their transcripts.
- Qiguang Lin, ChiWei Che, Joe French:
Description of the caip speech corpus.
- Rob Kassel:
Automating the design of compact linguistic corpora.
- Kazuyo Tanaka, Kanae Kinebuchi, Naoko Houra, Kazuyuki Takagi, Shuichi Itahashi, Katsunobu Itou, Satoru Hayamizu:
Annotating illocutionary force types and phonological features into a spontaneous dialogue corpus: an experimental study.
Speaker,
Language and Phoneme Recognition
- Aaron E. Rosenberg, Chin-Hui Lee, Frank K. Soong:
Cepstral channel normalization techniques for HMM-based speaker verification.
- Vijay Raman, Jayant M. Naik:
Noise reduction for speech recognition and speaker verification in mobile telephony.
- Eluned S. Parris, Michael J. Carey:
Discriminative phonemes for speaker identification.
- Javier Hernando, Climent Nadeu, Carlos Villagrasa, Enric Monte:
Speaker identification in noisy conditions using linear prediction of the one-sided autocorrelation sequence.
- Jialong He, Li Liu, Günther Palm:
A text-independent speaker identification system based on neural networks.
- Fangxin Chen, Bruce Millar, Michael Wagner:
Hybrid threshold approach in text-independent speaker verification.
- Y. Ariki, K. Doi:
Speaker recognition based on subspace methods.
- Seong-Jin Yun, Yung-Hwan Oh:
Performance improvement of speaker recognition system for small training data.
- B. Yegnanarayana, S. P. Wagh, S. Rajendran:
A speaker verification system using prosodic features.
- William Goldenthal, James R. Glass:
Statistical trajectory models for phonetic recognition.
- Mats Blomberg:
A common phone model representation for speech recognition and synthesis.
- Shubha Kadambe, James Hieronymus:
Spontaneous speech language identification with a knowledge of linguistics.
- Timothy J. Hazen, Victor W. Zue:
Recent improvements in an approach to segment-based automatic language identification.
- Padma Ramesh, David B. Roe:
Language identification with embedded word models.
- Kay M. Berkling, Etienne Barnard:
Language identification of six languages based on a common set of broad phonemes.
- Allan A. Reyes, Takashi Seino, Seiichi Nakagawa:
Three language identification methods based on HMMs.
- Shuichi Itahashi, Jian Xiong Zhou, Kimihito Tanaka:
Spoken language discrimination using speech fundamental frequency.
- Paul Dalsgaard, Ove Andersen:
Application of inter-language phoneme similarities for language identification.
- Hugo Van Hamme, Guido Gallopyn, Ludwig Weynants, Bart D'hoore, Hervé Bourlard:
Comparison of acoustic features and robustness tests of a real-time recogniser using a hardware telephone line simulator.
- Shigeki Okawa, Tetsunori Kobayashi, Katsuhiko Shirai:
Phoneme recognition in various styles of utterance based on mutual information criterion.
- Masakatsu Hoshimi, Maki Yamada, Katsuyuki Niyada:
Speaker independent speech recognition method using phoneme similarity vector.
- Kai Hbener, Julie Carson-Berndsen:
Phoneme recognition using acoustic events.
- Parham Mokhtari, Frantz Clermont:
Contributions of selected spectral regions to vowel classification accuracy.
- Climent Nadeu, Biing-Hwang Juang:
Filtering of spectral parameters for speech recognition.
- Barry Arons:
Pitch-based emphasis detection for segmenting speech recordings.
- Z. Li, Patrick Kenny:
Overlapping phone segments.
- Maurice K. Wong:
Clustering triphones by phonological mapping.
Speech Perception and Speech Related Disorders
- Nelson Morgan, Hervé Bourlard, Steven Greenberg, Hynek Hermansky:
Stochastic perceptual auditory-event-based models for speech recognition.
- Itaru F. Tatsumi, Hiroya Fujisaki:
Auditory perception of filled and empty time intervals, and mechanism of time discrimination.
- Margaret F. Cheesman, Jennifer C. Armitage, Kimberley Marshall:
Speech perception and growth of masking in younger and older adults.
- Toshio Irino, Roy D. Patterson:
A theory of asymmetric intensity enhancement around acoustic transients.
- Hector Javkin, Elizabeth Keate, Norma Antonanzas-Barroso, Ranjun Zou, Karen Youdelman:
Text-to-speech in the speech training of the deaf: adapting models to individual speakers.
- Thomas Holton:
Robust pitch and voicing detection using a model of auditory signal processing.
- Satoshi Imaizumi, Akiko Hayashi, Toshisada Deguchi:
Listener adaptive characteristics in dialogue speech effects of temporal adjustment on emotional aspects of speech.
- Minoru Tsuzaki, Hiroaki Kato, Masako Tanaka:
Effects of acoustic discontinuity and phonemic deviation on the apparent duration of speech segments.
- Chie H. Craig, Richard M. Warren, Tricia B. K. Chirillo:
The influence of context on spoken language perception and processing among elderly and hearing impaired listeners.
- Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka:
Acceptability of temporal modification in consonant and vowel onsets.
- Weizhong Zhu, Yoshinobu Kikuchi, Yasuo Endo, Hideki Kasuya, Minoru Hirano, Masanao Ohashi:
An integrated acoustic evaluation system of pathologic voice.
- Yumiko Fukuda, Wako Ikehara, Erniko Kamikubo, Shizuo Hiki:
An electronic dictionary of Japanese sign language: design of system and organization of database.
- Yasuo Endo, Hideki Kasuya:
Synthesis of pathological voice based on a stochastic voice source model.
- Hiroshi Hosoi, Yoshiaki Tsuta, Takashi Nishida, Kiyotaka Murata, Fumihiko Ohta, Tsuyoshi Mekata, Yumiko Kato:
Hearing aid evaluation using variable - speech - rate audiometry.
- Fred D. Minifie, Daniel Z. Huang, Jordan Green:
Relationship between acoustic measures of vocal perturbation and perceptual judgments of breathiness, harshness, and hoarseness.
- Takashi Ikeda, Kouji Tasaki, Akira Watanabe:
A hearing aid by single resonant analysis for telephonic speech.
- Tsuneo Yamada, Reiko Akahane-Yamada, Winifred Strange:
Perceptual learning of Japanese mora syllables by native speakers of american English an analysis of acquisition processes of speech perception in second language learning.
- Yuichi Ueda, Takayuki Agawa, Akira Watanabe:
A DSP-based amplitude compressor for digital hearing AIDS.
- Amalia Sarabasa:
Perception and production saturation of spoken English as a first phase in reducing a foreign accent.
- Edmund Rooney, Fabrizio Carraro, Will Dempsey, Katie Robertson, Rebecca Vaughan, Mervyn A. Jack, Jonathan Murray:
Harp: an autonomous speech rehabilitation system for hearing-impaired people.
- Reiko Akahane-Yamada, Winifred Strange, James S. Magnuson, John S. Pruitt, William D. Clarke:
The intelligibility of Japanese speakers' production of american English /r/, /i/, and /w/, as evaluated by native speakers of american English.
- Itaru Nagayama, Norio Akamatsu, Toshiki Yoshino:
Phonetic visualization for speech training system by using neural network.
- Elzbieta B. Slawinski:
Perceptual and productive distinction between the English [r] and [l] in prevocalic position by English and Japanese speakers.
- Yasushi Naito, Hidehiko Okazawa, Iwao Honjo, Yosaku Shiomi, Haruo Takahashi, Waka Hoji, Michio Kawano, Hiroshi Ishizu, Sadahiko Nishizawa, Yoshiharu Yonekura, Junji Konishi:
Cortical activation with speech in cochlear implant users: a study with positron emission tomography.
- Kiyoaki Aikawa, Reiko Akahane-Yamada:
Comparative study of spectral representations in measuring the English /r/-/l/ acoustic-perceptual dissimilarity.
- Shigeyoshi Kitazawa, Kazuyuki Muramoto, Juichi Ito:
Acoustic simulation of auditory model based speech processor for cochlear implant system.
- Makio Kashino, Chie H. Craig:
The influence of knowledge and experience during the processing of spoken words: non-native listeners.
- David House:
Perception and production of mood in speech by cochlear implant users.
- Yoshito Nejime, Toshiyuki Aritsuka, Toshiki Imamura, Tohru Ifukube, Jun'ichi Matsushima:
A portable digital speech rate converter and its evaluation by hearing-impaired listeners.
Speech Coding
- Keiichi Funaki, Kazunaga Yoshida, Kazunori Ozawa:
4kb/s speech coding with small computational amount and memory requirement: ULCELP.
- Miguel Angel Ferrer-Ballester, Aníbal R. Figueiras-Vidal:
Improving CELP voice quality by projection similarity measure.
- Hitoshi Ohmuro, Kazunori Mano, Takehiro Moriya:
Variable bit-rate speech coding based on PSI-CELP.
- Sung-Joo Kim, Seung Jong Park, Yung-Hwan Oh:
Complexity reduction methods for vector sum excited linear prediction coding.
- Preeti Rao, Yoshiaki Asakawa, Hidetoshi Sekine:
8 kb/s low-delay speech coding with 4 ms frame size.
- Jey-Hsin Yao, Yoshinori Tanaka:
Low-bit-rate speech coding with mixed-excitation and interpolated LPC coefficients.
- Cheung-Fat Chan:
Multi-band excitation coding of speech at 960 bps using split residual VQ and v/UV decision regeneration.
- Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai:
Speech coding based on adaptive MEL-cepstral analysis for noisy channels.
- Fu-Rong Jean, Hsiao-Chuan Wang:
A two-stage coding of speech LSP parameters based on KLT transform and 2d-prediction.
The Impact of Signal Processing Technologies
- Harry Levitt:
on communication disabilities ********* technologies for signal processing hearing AIDS.
- Futoshi Asano, Yôiti Suzuki, Toshio Sone:
Signal processing techniques applicable to hearing aids.
- Peter Blarney, Gary Dooley, Elvira Parisi:
Combination and comparison of electric stimulation and residual hearing.
- Sotaro Funasaka, Masae Shiroma, Kumiko Yukawa:
Analysis of consonants perception of Japanese 22-channel cochlear implant patients.
- Volker Hohmann, Birger Kollmeier:
Digital hearing aid techniques employing a loudness model for recruitment compensation.
- Akira Nakamura, Nobumasa Seiyama, Atsushi Imai, Tohru Takagi, Eiichi Miyasaka:
A new approach to compensate degeneration of speech intelligibility for elderly listeners.
- Tsuyoshi Mekata, Yoshiyuki Yoshizumi, Yumiko Koto, Etji Noguchi, Yoshinori Yamada:
Development of a portable multi-function digital hearing aid.
- Donald G. Jamieson:
The use of spoken language in the evaluation of assistive listening devices.
Continuous Speech Recognition
- Jean-Luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-Decker:
Continuous speech dictation in French.
- Ronald A. Cole, Beatrice T. Oshika, Mike Noel, Terri Lander, Mark A. Fanty:
Labeler agreement in phonetic labeling of continuous speech.
- Biing-Hwang Juang, Jay G. Wilpon:
Recent technology developments in connected digit speech recognition.
- Daniel Jurafsky, Chuck Wooters, Gary Tajchman, Jonathan Segal, Andreas Stolcke, Eric Foster, Nelson Morgan:
The berkeley restaurant project.
- Volker Steinbiss, Bach-Hiep Tran, Hermann Ney:
Improvements in beam search.
- Kevin Johnson, Roberto Garigliano, Russell J. Collingham:
Data-based control of the search space generated by multiple knowledge bases for speech recognition.
- Atsuhiko Kai, Seiichi Nakagawa:
Evaluation of unknown word processing in a spoken word recognition system.
- Tetsuo Araki, Satoru Ikehara, Hideto Yokokawa:
Using accent information to correctly select Japanese phrases made of strings of syllables.
- Sheryl R. Young:
Estimating recognition confidence: methods for conjoining acoustics, semantics, pragmatics and discourse.
- John McDonough, Herbert Gish:
Issues in topic identification on the switchboard corpus.
- Li Deng, Hossein Sameti:
Automatic speech recognition using dynamically defined speech units.
- M. Jones, Philip C. Woodland:
Modelling syllable characteristics to improve a large vocabulary continuous speech recogniser.
- Natividad Prieto, Emilio Sanchis, Luis Palmero:
Continuous speech understanding based on automatic learning of acoustic and semantic models.
- Kazuhiro Kondo, Yu-Hung Kao, Barbara Wheatley:
On inter-phrase context dependencies in continuously read Japanese speech.
- Gernot A. Fink, Franz Kummert, Gerhard Sagerer:
A close high-level interaction scheme for recognition and interpretation of speech.
- Sylvie Coste-Marquis:
Interaction between most reliable acoustic cues and lexical analysis.
- Y. Ariki, T. Kawamura:
Simultaneous spotting of phonemes and words in continuous speech.
- Man-Hung Siu, Herbert Gish, Jan Robin Rohlicek:
Predicting word spotting performance.
- Sumio Ohno, Hiroya Fujisaki, Keikichi Hirose:
A method for word spotting in continuous speech using both segmental and contextual likelihood scores.
- Renato de Mori, Diego Giuliani, Roberto Gretter:
Phone-based prefiltering for continuous speech recognition.
- Harald Singer, Jun-ichi Takami:
Speech recognition without grammar or vocabulary constraints.
- Javier Macías Guarasa, Manuel A. Leandro, Xavier Menéndez-Pidal, José Colás, Ascension Gallardo, José Manuel Pardo, Santiago Aguilera:
Comparison of three approaches to phonetic string generation for large vocabulary speech recognition.
- Pietro Laface, Lorenzo Fissore, Franco Ravera:
Automatic generation of words toward flexible vocabulary isolated word recognition.
- H. C. Choi, R. W. King:
Fast speaker adaptation through spectral transformation for continuous speech recognition.
- Sekharjit Datta:
Dynamic machine adaptation in a multi-speaker isolated word recognition system.
- Sheryl R. Young:
Discourse structure for spontaneous spoken interactions: multi-speaker vs. human-computer dialogs.
- Hansjörg Mixdorff, Hiroya Fujisaki:
Analysis of voice fundamental frequency contours of German utterances using a quantitative model.
Last update Fri May 25 08:22:59 2012
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page