Wooil Kim, Richard M. Stern: Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise.
1-11
Monja A. Knoll, Lisa Scharrer, Alan Costall: "Look at the shark": Evaluation of student- and actress-produced standardised sentences of infant- and foreigner-directed speech.
12-22
Anna Hjalmarsson: The additive effect of turn-taking cues in human and synthetic voice.
23-35
Hiroki Mori, Tomoyuki Satake, Makoto Nakamura, Hideki Kasuya: Constructing a spoken dialogue corpus for studying paralinguistic information in expressive conversation and analyzing its statistical/acoustic characteristics.
36-50
Anthony P. Stark, Kuldip K. Paliwal: Use of speech presence uncertainty with MMSE spectral energy estimation for robust automatic speech recognition.
51-61
Ruili Wang, Jingli Lu: Investigation of golden speakers for second language learners from imitation preference perspective by voice modification.
175-184
Julia Feld, Mitchell Sommers: There goes the neighborhood: Lipreading and the structure of the mental lexicon.
220-228
Peng Dai, Ing Yann Soon: A temporal warped 2D psychoacoustic modeling for robust speech recognition system.
229-241
Geoffrey Stewart Morrison: A comparison of procedures for the calculation of forensic likelihood ratios from acoustic-phonetic data: Multivariate kernel density (MVKD) versus Gaussian mixture model-universal background model (GMM-UBM).
242-256
Yun Lei, John H. L. Hansen: Mismatch modeling and compensation for robust speaker verification.
257-268
Volume 53, Number 3, March 2011
Eero Väyrynen, Juhani Toivanen, Tapio Seppänen: Classification of emotion in spoken Finnish using vowel-length segments: Increasing reliability with a fusion technique.
269-282
Wooil Kim, John H. L. Hansen: Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise.
451-464
Bernd T. Meyer, Birger Kollmeier: Robustness of spectro-temporal features against intrinsic and extrinsic variations in automatic speech recognition.
753-767
Francesc Alías, Lluís Formiga, Xavier Llorà: Efficient and reliable perceptual weight tuning for unit-selection text-to-speech synthesis based on active interactive genetic algorithms: A proof-of-concept.
786-800
Kalle J. Palomäki, Guy J. Brown: A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise.
924-940
Trevor H. Chen, Dominic W. Massaro: Evaluation of synthetic and natural Mandarin visual speech: Initial consonants, single vowels, and syllables.
955-972
Takashi Nose, Takao Kobayashi: Speaker-independent HMM-based voice conversion using adaptive quantization of the fundamental frequency.
973-985
Jianxin Peng, Chengxun Bei, Haitao Sun: Relationship between Chinese speech intelligibility and speech transmission index in rooms based on auralization.
986-990
Volume 53, Number 8, October 2011
Philip N. Garner: Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition.
991-1001
Katherine Forbes-Riley, Diane J. Litman: Benefits and challenges of real-time uncertainty detection and adaptation in a spoken dialogue computer tutor.
1115-1136
Jaime C. Acosta, Nigel G. Ward: Achieving rapport with turn-by-turn, user-responsive emotional coloring.
1137-1148
Ammar Mahdhaoui, Mohamed Chetouani: Supervised and semi-supervised infant-directed speech classification for parent-infant interaction analysis.
1149-1161
Marcel Kockmann, Lukás Burget, Jan Cernocký: Application of speaker- and language identification state-of-the-art techniques for emotion recognition.
1172-1185