default search action
INTERSPEECH 2006: Pittsburgh, PA, USA
- Ninth International Conference on Spoken Language Processing, INTERSPEECH-ICSLP 2006, Pittsburgh, PA, USA, September 17-21, 2006. ISCA 2006
Language Modeling for Spoken Dialog Systems
- Matthew Purver, Florin Ratiu, Lawrence Cavedon:
Robust interpretation in dialogue by combining confidence scores with contextual features. - Hui Ye, Steve J. Young:
A clustering approach to semantic decoding. - Teruhisa Misu, Tatsuya Kawahara:
A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts. - Axel Horndasch, Elmar Nöth, Anton Batliner, Volker Warnke:
Phoneme-to-grapheme mapping for spoken inquiries to the semantic web. - Karl Weilhammer, Matthew N. Stuttle, Steve J. Young:
Bootstrapping language models for dialogue systems. - Junlan Feng:
Question answering with discriminative learning algorithms.
Feature Enhancement for Robust ASR
- Patrick Kenny, Vishwa Gupta, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel:
Feature normalization using smoothed mixture transformations. - Chia-Hsin Hsieh, Chung-Hsien Wu, Jun-Yu Lin:
Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition. - Babak Nasersharif, Ahmad Akbari:
A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies. - Friedrich Faubel, Matthias Wölfel:
Coupling particle filters with automatic speech recognition for speech feature enhancement. - Chang-Wen Hsu, Lin-Shan Lee:
Extension and further analysis of higher order cepstral moment normalization (HOCMN) for robust features in speech recognition. - Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto:
An improved mel-wiener filter for mel-LPC based speech recognition.
Dialog and Discourse
- Lluís F. Hurtado, David Griol, Encarna Segarra, Emilio Emilio, Sanchis Sanchis:
A stochastic approach for dialog management based on neural networks. - Mihai Rotaru, Diane J. Litman:
Discourse structure and speech recognition problems. - Satanjeev Banerjee, Alexander I. Rudnicky:
A texttiling based approach to topic boundary detection in meetings. - Stefan Schulz, Hilko Donker:
An user-centered development of an intuitive dialog control for speech-controlled music selection in cars. - Antoine Raux, Dan Bohus, Brian Langner, Alan W. Black, Maxine Eskénazi:
Doing research on a deployed spoken dialogue system: one year of let's go! experience. - Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg:
Detecting question-bearing turns in spoken tutorial dialogues.
The Speech Separation Challenge
- Soundararajan Srinivasan, Yang Shao, Zhaozhang Jin, DeLiang Wang:
A computational auditory scene analysis system for robust speech recognition. - Runqiang Han, Pei Zhao, Qin Gao, Zhiping Zhang, Hao Wu, Xihong Wu:
CASA based speech separation for robust speech recognition. - Mark R. Every, Philip J. B. Jackson:
Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm. - Jon Barker, André Coy, Ning Ma, Martin Cooke:
Recent advances in speech fragment decoding techniques. - Tuomas Virtanen:
Speech recognition using factorial hidden Markov models for separation in the feature space. - Ji Ming, Timothy J. Hazen, James R. Glass:
Combining missing-feature theory, speech enhancement and speaker-dependent/-independent modeling for speech separation. - Trausti T. Kristjansson, John R. Hershey, Peder A. Olsen, Steven J. Rennie, Ramesh A. Gopinath:
Super-human multi-talker speech recognition: the IBM 2006 speech separation challenge system. - Om Deshmukh, Carol Y. Espy-Wilson:
Modified phase opponency based solution to the speech separation challenge.
Multilingual and Multi-Accent Processing
- Jonas Lööf, Maximilian Bisani, Christian Gollan, Georg Heigold, Björn Hoffmeister, Christian Plahl, Ralf Schlüter, Hermann Ney:
The 2006 RWTH parliamentary speeches transcription system. - Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean Paul Haton:
Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints. - Joyce Y. C. Chan, P. C. Ching, Tan Lee, Houwei Cao:
Automatic speech recognition of Cantonese-English code-mixing utterances. - M. Zimmerman, Dilek Hakkani-Tür, James G. Fung, Nikki Mirghafori, Luke R. Gottlieb, Elizabeth Shriberg, Yang Liu:
The ICSI+ multilingual sentence segmentation system. - Yan Ming Cheng, Changxue Ma, Lynette Melnar:
Cross-language evaluation of voice-to-phoneme conversions for voice-tag application in embedded platforms. - Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han:
A multi-space distribution (MSD) approach to speech recognition of tonal languages. - Viet Bac Le, Laurent Besacier:
Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR. - Yi Liu, Pascale Fung:
Multi-accent Chinese speech recognition. - Seyed Ghorshi, Saeed Vaseghi, Qin Yan:
Comparative analysis of formants of British, american and australian accents. - Linquan Liu, Thomas Fang Zheng, Wenhu Wu:
Automatic initial/final generation for dialectal Chinese speech recognition. - Ruhi Sarikaya, Ossama Emam, Imed Zitouni, Yuqing Gao:
Maximum entropy modeling for diacritization of Arabic text. - Slavomír Lihan, Jozef Juhár, Anton Cizmar:
Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models.
Corpora, Annotation, and Assessment Metrics I, II
- Rhys James Jones, Ambrose Choy, Briony Williams:
Integrating Festival and Windows. - Cosmin Munteanu, Gerald Penn, Ronald Baecker, Elaine G. Toms, David James:
Measuring the acceptable word error rate of machine-generated webcast transcripts. - Goshu Nagino, Makoto Shozakai:
Analyzing reusability of speech corpus based on statistical multidimensional scaling method. - Susan Fitt, Korin Richmond:
Redundancy and productivity in the speech technology lexicon - can we do better? - Takeshi Yamada, Masakazu Kumakura, Nobuhiko Kitawaki:
Word intelligibility estimation of noise-reduced speech. - Christoph Draxler:
Exploring the unknown - collecting 1000 speakers over the internet for the ph@ttsessionz database of adolescent speakers. - Timothy Murphy, Dorel Picovici, Abdulhussain E. Mahdi:
A new single-ended measure for assessment of speech quality. - Ailbhe Ní Chasaide, John Wogan, Brian Ó Raghallaigh, Áine Ní Bhriain, Eric Zoerner, Harald Berthelsen, Christer Gobl:
Speech technology for minority languages: the case of Irish (gaelic). - Francisco José Fraga, Carlos Alberto Ynoguti, André Godoi Chiovato:
Further investigations on the relationship between objective measures of speech quality and speech recognition rates in noisy environments. - Volodya Grancharov, David Yuheng Zhao, Jonas Lindblom, W. Bastiaan Kleijn:
Non-intrusive speech quality assessment with low computational complexity. - Min-Siong Liang, Ren-Yuan Lyu, Yuang-Chin Chiang:
Using speech recognition technique for constructing a phonetically transcribed taiwanese (min-nan) text corpus. - Andrej Zgank, Tomaz Rotovnik, Matej Grasic, Marko Kos, Damjan Vlaj, Zdravko Kacic:
Sloparl - slovenian parliamentary speech and text corpus for large vocabulary continuous speech recognition. - Siew Leng Toh, Fan Yang, Peter A. Heeman:
An annotation scheme for agreement analysis. - Hitoshi Aoki, Atsuko Kurashima, Akira Takahashi:
Conversational quality estimation model for wideband IP-telephony services. - Kelley Kilanski, Jonathan Malkin, Xiao Li, Richard Wright, Jeff A. Bilmes:
The vocal joystick data collection effort and vowel corpus. - Dmitry Sityaev, Katherine M. Knill, Tina Burrows:
Comparison of the ITU-t p.85 standard to other methods for the evaluation of text-to-speech systems. - Peter A. Heeman, Andy McMillin, J. Scott Yaruss:
An annotation scheme for complex disfluencies. - Christophe Van Bael, Lou Boves, Henk van den Heuvel, Helmer Strik:
Automatic phonetic transcription of large speech corpora: a comparative study. - Yongmei Shi, Lina Zhou:
Examining knowledge sources for human error correction.
Speech Coding
- Joon-Hyuk Chang, Woohyung Lim, Nam Soo Kim:
Signal modification incorporating perceptual weighting filter. - Jani Nurminen:
Enhanced dynamic codebook reordering for advanced quantizer structures. - Chang-Heon Lee, Sung-Kyo Jung, Thomas Eriksson, Won-Suk Jun, Hong-Goo Kang:
An efficient segment-based speech compression technique for hand-held TTS systems. - V. Ramasubramanian, D. Harish:
An unified unit-selection framework for ultra low bit-rate speech coding. - Jes Thyssen, Juin-Hwey Chen:
Efficient VQ techniques and general noise shaping in noise feedback coding. - Yasheng Qian, Wei-Shou Hsu, Peter Kabal:
Classified comfort noise generation for efficient voice transmission. - Balázs Kövesi, Dominique Massaloux, David Virette, Julien Bensa:
Integration of a CELP coder in the ARDOR universal sound codec. - Saikat Chatterjee, T. V. Sreenivas:
Two stage transform vector quantization of LSFs for wideband speech coding. - Saikat Chatterjee, T. V. Sreenivas:
Comparison of prediction based LSF quantization methods using split VQ. - Konrad Hofbauer, Gernot Kubin:
High-rate data embedding in unvoiced speech. - Kyle D. Anderson, Philippe Gournay:
Pitch resynchronization while recovering from a late frame in a predictive speech decoder.
Speech Enhancement I, II
- Suhadi Suhadi, Sorel Stan, Tim Fingscheidt:
A novel environment-dependent speech enhancement method with optimized memory footprint. - Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan:
Weighted codebook mapping for noisy speech enhancement using harmonic-noise model. - Jesper Jensen, Richard C. Hendriks, Jan S. Erkelens, Richard Heusdens:
MMSE estimation of complex-valued discrete Fourier coefficients with generalized gamma priors. - Amarnag Subramanya, Michael L. Seltzer, Alex Acero:
Automatic removal of typed keystrokes from speech signals. - Erhard Rank, Gernot Kubin:
Lattice LP filtering for noise reduction in speech signals. - Om Deshmukh, Carol Y. Espy-Wilson:
Speech enhancement using modified phase opponency model. - Wen Jin, Michael S. Scordilis:
Single channel speech enhancement by frequency domain constrained optimization and temporal masking. - Jong Won Shin, Seung Yeol Lee, Hwan Sik Yun, Nam Soo Kim:
Speech enhancement based on residual noise shaping. - Hannu Pulakka, Laura Laaksonen, Paavo Alku:
Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages. - Benjamin J. Shannon, Kuldip K. Paliwal:
Role of phase estimation in speech enhancement. - Benjamin J. Shannon, Kuldip K. Paliwal, Climent Nadeu:
Speech enhancement based on spectral estimation from higher-lag autocorrelation. - Nitish Krishnamurthy, John H. L. Hansen:
Noise update modeling for speech enhancement: when do we do enough? - A. Shahina, B. Yegnanarayana:
Mapping neural networks for bandwidth extension of narrowband speech. - Amit Das, John H. L. Hansen:
Decision directed constrained iterative speech enhancement. - Takahiro Murakami, Yoshihisa Ishida:
Adaptive filtering for attenuating musical noise caused by spectral subtraction. - Yi Hu, Philipos C. Loizou:
Evaluation of objective measures for speech enhancement. - Myung-Suk Song, Chang-Heon Lee, Hong-Goo Kang:
Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition.
ASR Other I, II
- Gilles Boulianne, Jean-Francois Beaumont, Maryse Boisvert, Julie Brousseau, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Pierre Ouellet, Frédéric Osterrath:
Computer-assisted closed-captioning of live TV broadcasts in French. - Mohamed Afify, Ruhi Sarikaya, Hong-Kwang Jeff Kuo, Laurent Besacier, Yuqing Gao:
On the use of morphological analysis for dialectal Arabic speech recognition. - Isabel Trancoso, Ricardo Nunes, Luís Neves, Céu Viana, Helena Moniz, Diamantino Caseiro, Ana Isabel Mata:
Recognition of classroom lectures in european portuguese. - Thomas Pellegrini, Lori Lamel:
Investigating automatic decomposition for ASR in less represented languages. - Abdillahi Nimaan, Pascal Nocera, Jean-François Bonastre:
Automatic transcription of Somali language. - Özgür Çetin, Elizabeth Shriberg:
Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: insights for automatic speech recognition. - Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation. - Wooil Kim, John H. L. Hansen:
Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval. - Hahn Koo, Yan Ming Cheng:
Incremental learning of MAP context-dependent edit operations for spoken phone number recognition in an embedded platform. - Yasunari Obuchi, Nobuo Hataoka:
Development and evaluation of speech database in automotive environments for practical speech recognition systems. - Dong Yu, Yun-Cheng Ju, Alex Acero:
An effective and efficient utterance verification technology using word n-gram filler models. - J. M. Górriz, Javier Ramírez, Carlos García Puntonet, José C. Segura:
An efficient bispectrum phase entropy-based algorithm for VAD. - Petr Cerva, Jan Nouza, Jan Silovský:
Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination. - Satoshi Nakamura, Masakiyo Fujimoto, Kazuya Takeda:
CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition. - Cheng-Tao Chu, Yun-Hsuan Sung, Yuan Zhao, Daniel Jurafsky:
Detection of word fragments in Mandarin telephone conversation. - Qiang Huo, Wei Li:
A DTW-based dissimilarity measure for left-to-right hidden Markov models and its application to word confusability analysis. - Angel M. Gomez, Juan J. Ramos-Muñoz, Antonio M. Peinado, Victoria E. Sánchez:
Multi-flow block interleaving applied to distributed speech recognition over IP networks. - Edward C. Lin, Kai Yu, Rob A. Rutenbar, Tsuhan Chen:
Moving speech recognition from software to silicon: the in silico vox project. - Chengyuan Ma, Yu Tsao, Chin-Hui Lee:
A study on detection based automatic speech recognition. - Rahul Chitturi, Mark Hasegawa-Johnson:
Novel time domain multi-class SVMs for landmark detection.
Modeling Prosodic Features
- Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan:
Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling. - Andrew Rosenberg, Julia Hirschberg:
On the correlation between energy and pitch accent in read English speech. - Keikichi Hirose, Yasufumi Asano, Nobuaki Minematsu:
Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses. - Tomás Dubeda:
Prosodic boundaries in Czech: an experiment based on delexicalized speech. - Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao:
Totally data-driven intonation prediction model using a novel F0 contour parametric representation. - Laura Dilley, Mara Breen, Marti Bolivar, John Kraemer, Edward Gibson:
A comparison of inter-transcriber reliability for two systems of prosodic annotation: rap (rhythm and pitch) and toBI (tones and break indices).
Spoken Information Retrieval
- Issac Alphonso, Shuangyu Chang:
Saliency parsing for automated directory assistance. - Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee:
Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity. - Xiang Li, Ea-Ee Jan, Cheng Wu, David M. Lubensky:
Improved topic classification over maximum entropy model using k-norm based new objectives. - Yi-Cheng Pan, Jia-Yu Chen, Yen-shin Lee, Yi-Sheng Fu, Lin-Shan Lee:
Efficient interactive retrieval of spoken documents with key terms ranked by reinforcement learning. - Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki:
Discriminative named entity recognition of speech data using speech recognition confidence. - Ville T. Turunen, Mikko Kurimo:
Using latent semantic indexing for morph-based spoken document retrieval.
Front-End Methods for ASR
- Ralf Schlüter, András Zolnay, Hermann Ney:
Feature combination using linear discriminant analysis and its pitfalls. - Fabio Valente, Hynek Hermansky:
Discriminant linear processing of time-frequency plane. - Esmeralda Uraga, Thomas Hain:
Automatic speech recognition experiments with articulatory data. - Frederik Stouten, Jean-Pierre Martens:
Speech recognition with phonological features: some issues to attend. - Matthias Wölfel, Christian Fügen,