


Speech Communication, Volume 54
Volume 54, Number 1, January 2012
- Abhishek Jaywant, Marc D. Pell:
 Categorical processing of negative emotions from speech prosody. 1-10
- Elisabetta Fersini, Enza Messina, Francesco Archetti:
 Emotional states in judicial courtrooms: An experimental investigation. 11-22
- Mouloud Djamah, Douglas D. O'Shaughnessy: 
 Fine granularity scalable speech coding using embedded tree-structured vector quantization. 23-39
- Abhijeet Sangwan, John H. L. Hansen: 
 Automatic analysis of Mandarin accented English using phonological features. 40-54
- Deepu Vijayasenan, Fabio Valente, Hervé Bourlard: 
 Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features. 55-67
- Máire Ní Chiosáin, Pauline Welby, Robert Espesser:
 Is the syllabification of Irish a typological exception? An experimental study. 68-91
- Silke Paulmann, Debra Titone, Marc D. Pell:
 How emotional prosody guides your way: Evidence from eye movements. 92-107
- Peter Jancovic, Xin Zou, Münevver Köküer: 
 Speech enhancement based on Sparse Code Shrinkage employing multiple speech models. 108-118
- Cong-Thanh Do, Dominique Pastor, André Goalic:
 A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech. 119-133
- Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
 Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. 134-146
- Ying-Yee Kong, Ala Mullangi: 
 On the development of a frequency-lowering system that enhances place-of-articulation perception. 147-160
Volume 54, Number 2, February 2012
- Nigel G. Ward, Alejandro Vega, Timo Baumann:
 Prosodic and temporal features for language modeling for dialog. 161-174
- J. Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark: 
 Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis. 175-188
- Sophie Bouton, Pascale Colé, Willy Serniclaes:
 The influence of lexical knowledge on phoneme discrimination in deaf children with cochlear implants. 189-198
- Jón Guðnason, Mark R. P. Thomas, Daniel P. W. Ellis, Patrick A. Naylor:
 Data-driven voice source waveform analysis and synthesis. 199-211
- George Saon, Hagen Soltau:
 Boosting systems for large vocabulary continuous speech recognition. 212-218
- Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ariya Rastrow, Nobuyasu Itoh, Masafumi Nishimura: 
 Acoustically discriminative language model training with pseudo-hypothesis. 219-228
- Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani:
 Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection. 229-244
- Vataya Chunwijitra, Takashi Nose, Takao Kobayashi:
 A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis. 245-255
- Hamid Reza Tohidypour, Seyyed Ali Seyyedsalehi, Hossein Behbood, Hossein Roshandel:
 A new representation for speech frame recognition based on redundant wavelet filter banks. 256-271
- Fei Chen, Philipos C. Loizou:
 Impact of SNR and gain-function over- and under-estimation on speech intelligibility. 272-281
- Kuldip K. Paliwal, Belinda Schwerin, Kamil K. Wójcicki:
 Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator. 282-305
- Andrew Hines, Naomi Harte:
 Speech intelligibility prediction using a Neurogram Similarity Index Measure. 306-320
Volume 54, Number 3, March 2012
- Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel: 
 Improving proper name recognition by means of automatically learned pronunciation variants. 321-340
- Pandurangarao N. Kulkarni, Prem C. Pandey, Dakshayani S. Jangamashetti: 
 Multi-band frequency compression for improving speech perception by listeners with moderate sensorineural hearing loss. 341-350
- Antonio Moreno-Daniel, Jay G. Wilpon, Biing-Hwang Juang: 
 Index-based incremental language model for scalable directory assistance. 351-367
- Daniel Recasens: 
 A cross-language acoustic study of initial and final allophones of /l/. 368-383
- Takashi Nose, Takao Kobayashi:
 Very low bit-rate F0 coding for phonetic vocoders using MSD-HMM with quantized F0 symbols. 384-392
- Amaro A. de Lima, Thiago de M. Prego, Sergio L. Netto, Bowon Lee, Amir Said, Ronald W. Schafer, Ton Kalker, Majid Fozunbal:
 On the quality-assessment of reverberated speech. 393-401
- Peng Dai, Ing Yann Soon:
 A temporal frequency warped (TFW) 2D psychoacoustic filter for robust speech recognition system. 402-413
- Ioulia Grichkovtsova, Michel Morel, Anne Lacheret: 
 The role of voice quality and prosodic contour in affective speech perception. 414-429
- Frank Rudzicz:
 Using articulatory likelihoods in the recognition of dysarthric speech. 430-444
- Je Hun Jeon, Yang Liu: 
 Automatic prosodic event detection using a novel labeling and selection method in co-training. 445-458
- Jordi Adell, David Escudero Mancebo, Antonio Bonafonte:
 Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence. 459-476
- Jae-Hun Choi, Joon-Hyuk Chang: 
 On using acoustic environment classification for statistical model-based speech enhancement. 477-490
- Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran: 
 Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech. 491-502
- Angel M. Gomez, Belinda Schwerin, Kuldip K. Paliwal:
 Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio. 503-515
Volume 54, Number 4, May 2012
- Anis Ben Aicha, Sofia Ben Jebara: 
 Perceptual speech quality measures separating speech distortion and additive noise degradations. 517-528
- Meihong Wu, Huahui Li, Zhiling Hong, Xinchi Xian, Jingyu Li, Xihong Wu, Liang Li: 
 Effects of aging on the ability to benefit from prior knowledge of message content in masked speech recognition. 529-542
- Md. Sahidullah, Goutam Saha:
 Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. 543-565
- David Escudero Mancebo, Lourdes Aguilar, María Vanrell, Pilar Prieto:
 Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system. 566-582
Volume 54, Number 5, June 2012
- William Ricardo Rodríguez, Oscar Saz, Eduardo Lleida:
 A prelingual tool for the education of altered voices. 583-600
- Evaldas Vaiciukynas, Antanas Verikas, Adas Gelzinis, Marija Bacauskiene, Virgilijus Uloza:
 Exploring similarity-based classification of larynx disorders from human voice. 601-610
- David M. Howard, Evelyn Abberton, Adrian Fourcin:
 Disordered voice measurement and auditory analysis. 611-621
- Tiago H. Falk, Wai-Yip Chan, Fraser Shein:
 Characterization of atypical vocal source excitation, temporal dynamics and prosody for objective measurement of dysarthric word intelligibility. 622-631
- Marieke de Bruijn, Louis ten Bosch, Dirk J. Kuik, Birgit I. Witte, Johannes A. Langendijk, C. René Leemans, Irma Verdonck-de Leeuw:
 Acoustic-phonetic and artificial neural network feature analysis to assess speech quality of stop consonants produced by patients treated for oral or oropharyngeal cancer. 632-640
- Sevasti-Zoi Karakozoglou, Nathalie Henrich, Christophe d'Alessandro, Yannis Stylianou:
 Automatic glottal segmentation using local-based active contours and application to glottovibrography. 641-654
- Ali Alpan, Jean Schoentgen, Youri Maryn, Francis Grenez, P. Murphy: 
 Assessment of disordered voice via the first rahmonic. 655-663
- Alain Ghio, Gilles Pouchoulin, Bernard Teston, Serge Pinto, Corinne Fredouille, Céline De Looze, Danièle Robert, François Viallet, Antoine Giovanni:
 How to manage sound, physiological and clinical data of 2500 dysphonic and dysarthric speakers? 664-679
Volume 54, Number 6, July 2012
- Pilar Prieto, María Vanrell, Lluïsa Astruc, Elinor Payne, Brechtje Post:
 Phonotactic and phrasal properties of speech rhythm. Evidence from Catalan, English, and Spanish. 681-702
- Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, Keiichi Tokuda:
 Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping. 703-714
- Tobias Kaufmann, Beat Pfister: 
 Syntactic language modeling with formal grammars. 715-731
- Petr Zelinka, Milan Sigmund, Jiri Schimmel:
 Impact of vocal effort variability on automatic speech recognition. 732-742
- Rigas Kotsakis, George Kalliris, Charalampos Dimoulas:
 Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification. 743-762
- Mohammad Hossein Moattar, Mohammad Mehdi Homayounpour:
 Variational conditional random fields for online speaker detection and tracking. 763-780
- Mirjam Wester: 
 Talker discrimination across languages. 781-790
- Takanobu Oba, Takaaki Hori, Atsushi Nakamura: 
 Efficient training of discriminative language models by sample selection. 791-800
- Herman Kamper, Félicien Jeje Muamba Mukanya, Thomas Niesler:
 Multi-accent acoustic modelling of South African English. 801-813
- Eduardo Pavez, Jorge F. Silva:
 Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition. 814-835
- Ronan Flynn, Edward Jones:
 Feature selection for reduced-bandwidth distributed speech recognition. 836-843
- David M. Howard, Evelyn Abberton, Adrian Fourcin:
 Erratum to "Disordered voice measurement and auditory analysis" [Speech Comm. 54(2012) 611-621]. 844
Volume 54, Number 7, September 2012
- Lan Wang, Hui Chen, Sheng Li, Helen M. Meng:
 Phoneme-level articulatory animation in pronunciation training. 845-856
- Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, Keiichi Tokuda:
 Impacts of machine translation and speech synthesis on speech-to-speech translation. 857-866
- Shajith Ikbal, Hemant Misra, Hynek Hermansky, Mathew Magimai-Doss:
 Phase AutoCorrelation (PAC) features for noise robust speech recognition. 867-880
- Ronan Flynn, Edward Jones:
 Reducing bandwidth for robust distributed speech recognition in conditions of packet loss. 881-892
- Thorsten Smit, Friedrich Türckheim, Robert Mores: 
 Fast and robust formant detection from LP data. 893-902
- Ali Hassan, Robert I. Damper:
 Classification of emotional speech using 3DEC hierarchical classifier. 903-916
- Hugo Quené, Gün Refik Semin, Francesco Foroni:
 Audible smiles and frowns affect speech comprehension. 917-922
Volume 54, Number 8, October 2012
- Yana Yunusova, Melanie Baljko, Grigore Pintilie, Krista Rudy, Petros Faloutsos, John Daskalogiannakis:
 Acquisition of the 3D surface of the palate by in-vivo digitization with Wave. 923-931
- Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu: 
 A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model. 932-945
- Peggy P. K. Mok:
 Effects of consonant cluster syllabification on vowel-to-vowel coarticulation in English. 946-956
- Zhongbo Li, Shenghui Zhao, Stefan Bruhn, Jing Wang, Jingming Kuang: 
 Comparison and optimization of packet loss recovery methods based on AMR-WB for VoIP. 957-974
Volume 54, Number 9, November 2012
- Okko Räsänen:
 Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions. 975-997
- Toshio Irino, Yoshie Aoki, Hideki Kawahara, Roy D. Patterson:
 Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination. 998-1013
- Atsunori Ogawa, Atsushi Nakamura: 
 Joint estimation of confidence and error causes in speech recognition. 1014-1028
- Irene Ayllón Clemente, Martin Heckmann, Britta Wrede:
 Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation. 1029-1048
- Khiet P. Truong, David A. van Leeuwen, Franciska M. G. de Jong: 
 Speech-based recognition of self-reported and observed emotion in a dimensional space. 1049-1063
Volume 54, Number 10, December 2012
- Mohammad Hossein Moattar, Mohammad Mehdi Homayounpour:
 A review on speaker diarization systems and approaches. 1065-1103
- Veena Karjigi, Preeti Rao:
 Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling. 1104-1120
- Edward Ozimek, Dariusz Kutzner, Pawel Libiszewski: 
 Speech intelligibility tested by the Pediatric Matrix Sentence test in 3-6 year old children. 1121-1131
- Doris Baum: 
 Recognising speakers from the topics they talk about. 1132-1142















