Ning Ma, Martin Bouchard, Rafik A. Goubran: Speech enhancement using a masking threshold constrained Kalman filter and its heuristic implementations.
19-32
James D. Gordy, Rafik A. Goubran: On the perceptual performance limitations of echo cancellers in wideband telephony.
33-42
Li Deng, Dong Yu, Alex Acero: A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition.
256-265
A. Davis, Sven Nordholm, Roberto Togneri: Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold.
412-424
Li Deng, Alex Acero, Issam Bazzi: Tracking vocal tract resonances using a quantized nonlinear function embeddedin a temporal constraint.
425-434
K. Mustafa, Ian C. Bruce: Robust formant tracking for continuous speech with speaker variability.
435-444
S. Gazor, R. R. Far: Adaptive maximum windowed likelihood multicomponent AM-FM signal decomposition.
479-491
Qiang Fu, Peter Murphy: Robust glottal source estimation based on joint source-filter model optimization.
492-501
E. Fisher, Joseph Tabrikian, Shlomo Dubnov: Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model.
502-510
Anand D. Subramaniam, William R. Gardner, Bhaskar D. Rao: Low-complexity source coding using Gaussian mixture models, lattice vector quantization, and recursive coding with application to speech spectrum quantization.
524-532
S. Ramamohan, S. Dandapat: Sinusoidal model-based analysis and classification of stressed speech.
737-746
Joon-Hyuk Chang, Nam Soo Kim: A new structural approach in system identification with generalized analysis-by-synthesis for robust speech coding.
747-751
Jeih-Weih Hung, Lin-Shan Lee: Optimization of temporal filters for constructing robust features in speech recognition.
808-832
Ji Ming: Noise compensation for speech recognition with arbitrary additive noise.
833-844
Florian Hilger, Hermann Ney: Quantile based histogram equalization for noise robust large vocabulary speech recognition.
845-854
Shinji Watanabe, Atsushi Sako, Atsushi Nakamura: Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition.
855-872
R. Sant'Ana, Rosangela Coelho, Abraham Alcaim: Text-independent speaker recognition based on the Hurst parameter and the multidimensional fractional Brownian motion model.
931-940
J. Mullen, David M. Howard, Damian T. Murphy: Waveguide physical modeling of vocal tract acoustics: flexible formant bandwidth control from increased model dimensionality.
964-971
Harald Viste, Gianpaolo Evangelista: A method for separation of overlapping partials based on similarity of temporal envelopes in multichannel mixtures.
1051-1061
Eva Navas, Inma Hernáez, Iker Luengo: An objective and subjective study of the role of semantics and prosodic features in building corpora for emotional TTS.
1117-1127
M. Schroder: Expressing degree of activation in synthetic speech.
1128-1136
Wentao Gu, Keikichi Hirose, Hiroya Fujisaki: Modeling the effects of emphasis and question on fundamental frequency contours of Cantonese utterances.
1155-1170
N. Campbell: Conversational speech synthesis and the need for some laughter.
1171-1178
Taishih Chi, Shihab A. Shamma: Spectrum restoration from multiscale auditory phase singularities by generalized projections.
1179-1192
A. Watanabe, T. Sakata: Reliable methods for estimating relative vocal tract lengths from formant trajectories of common words.
1193-1204
W. C. Chu: Embedded quantization of line spectral frequencies using a multistage tree-structured vector quantizer.
1205-1217
Rile Hu, Chengqing Zong, Bo Xu: An approach to automatic acquisition of translation templates based on phrase structure extraction and alignment.
1656-1663
Chak-Fai Li, Man-Hung Siu, Jeff Siu-Kei Au-Yeung: Recursive likelihood evaluation and fast search algorithm for polynomial segment model with application to speech recognition.
1704-1718
Jen-Tzung Chien: Association pattern language modeling.
1719-1728
N. Duta, Richard M. Schwartz, John Makhoul: Analysis of the errors produced by the 2004 BBN speech recognition system in the DARPA EARS evaluations.
1745-1753
S. Mathur, Brad H. Story, J. J. Rodriguez: Vocal-tract modeling: fractional elongation of segment lengths in a waveguide model with half-sample delays.
1754-1762
Jithendra Vepa, Simon King: Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis.
1763-1771
Rainer Huber, Birger Kollmeier: PEMO-Q - A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception.
1902-1911
Abhijit Karmakar, Arun Kumar, R. K. Patney: A Multiresolution Model of Auditory Excitation Pattern and Its Application to Objective Evaluation of Perceived Speech Quality.
1912-1923
S. George, S. Zielinski, F. Rumsey: Feature Extraction for the Prediction of Multichannel Spatial Audio Fidelity.
1994-2005
Takeshi Yamada, Masakazu Kumakura, Nobuhiko Kitawaki: Performance Estimation of Speech Recognition System Under Noise Conditions Using Objective Quality Measures and Artificial Voice.
2006-2013
P. Li, Y. Guan, B. Xu, W. Liu: Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Assessment of Speech.
2014-2023