Serdar Yildirim, Shrikanth Narayanan: Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio-Visual Information.
2-12
Shih-Hsiang Lin, Berlin Chen, Yao-Ming Yeh: Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition.
84-94
Radoslaw Mazur, Alfred Mertins: An Approach for Solving the Permutation Problem of Convolutive Blind Source Separation Based on Statistical Signal Models.
117-126
Yasser Hifny, Steve Renals: Speech Recognition Using Augmented Conditional Random Fields.
354-365
J. Hansen, V. Varadarajan: Analysis and Compensation of Lombard Speech Across Noise Type and Levels With Application to In-Set/Out-of-Set Speaker Recognition.
366-378
Jianhua Tao, Le Xin, Panrong Yin: Realistic Visual Speech Synthesis Based on Hybrid Concatenation Method.
469-477
Peng Liu, Frank K. Soong: Graph-Based Partial Hypothesis Fusion for Pen-Aided Speech Input.
478-485
Pui-Yu Hui, Helen M. Meng: Cross-Modality Semantic Integration With Hypothesis Rescoring for Robust Interpretation of Multimodal User Interactions.
486-500
A. Homayoun Kamkar-Parsi, Martin Bouchard: Improved Noise Power Spectrum Density Estimation for Binaural Hearing Aids Operating in a Diffuse Noise Field Environment.
521-533
Hong Kook Kim, Richard C. Rose: Cepstrum-Domain Model Combination Based on Decomposition of Speech and Noise Using MMSE-LSA for ASR in Noisy Environments.
704-713
Mohamed Attia, Mohsen Rashwan, Mohamed Al-Badrashiny: Fassieh®, a Semi-Automatic Visual Interactive Tool for Morphological, PoS-Tags, Phonetic, and Semantic Annotation of Arabic Text Corpora.
916-925
Shmulik Markovich, Sharon Gannot, Israel Cohen: Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals.
1071-1086
Wooil Kim, John H. L. Hansen: Time-Frequency Correlation-Based Missing-Feature Reconstruction for Robust Speech Recognition in Band-Restricted Conditions.
1292-1304
Hung-Yu Su, Chung-Hsien Wu: Improving Structural Statistical Machine Translation for Sign Language With Small Corpus Using Thematic Role Templates as Translation Memory.
1305-1315
Engin Erzin: Improving Throat Microphone Speech Recognition by Joint Analysis of Throat and Acoustic Microphone Recordings.
1316-1324
Giso Grimm, Volker Hohmann, Birger Kollmeier: Increase and Subjective Evaluation of Feedback Stability in Hearing Aids by a Binaural Coherence-Based Noise Reduction Scheme.
1408-1419
Sampo Vesa: Binaural Sound Source Distance Learning in Rooms.
1498-1507
Ioannis Andrianakis, Paul R. White: A Speech Enhancement Algorithm Based on a Chi MRF Model of the Speech STFT Amplitudes.
1508-1517
Emre Özkan, I. Yücel Özbek, Mübeccel Demirekler: Dynamic Speech Spectrum Representation and Tracking Variable Number of Vocal Tract Resonance Frequencies With Time-Varying Dirichlet Process Mixture Models.
1518-1532
Chung-Hsien Wu, Chia-Hsin Hsieh: Story Segmentation and Topic Classification of Broadcast News via a Topic-Based Segmental Model and a Genetic Algorithm.
1612-1623