default search action
Hervé Bourlard
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2022
- [j65]Hervé Bourlard, Selen Hande Kabil:
Autoencoders reloaded. Biol. Cybern. 116(4): 389-406 (2022) - [c241]Selen Hande Kabil, Hervé Bourlard:
From Undercomplete to Sparse Overcomplete Autoencoders to Improve LF-MMI based Speech Recognition. INTERSPEECH 2022: 1061-1065 - [c240]Cécile Fougeron, Nicolas Audibert, Ina Kodrasi, Parvaneh Janbakhshi, Michaela Pernon, Nathalie Lévêque, Stephanie Borel, Marina Laganaro, Hervé Bourlard, Frédéric Assal:
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech. INTERSPEECH 2022: 2188-2192 - 2021
- [j64]Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard:
Subspace-Based Learning for Automatic Dysarthric Speech Detection. IEEE Signal Process. Lett. 28: 96-100 (2021) - [c239]Deepak Baby, Hervé Bourlard:
Speech Dereverberation Using Variational Autoencoders. ICASSP 2021: 5784-5788 - [c238]Apoorv Vyas, Srikanth R. Madikeri, Hervé Bourlard:
Lattice-Free Mmi Adaptation of Self-Supervised Pretrained Acoustic Models. ICASSP 2021: 6219-6223 - [c237]Ina Kodrasi, Michaela Pernon, Marina Laganaro, Hervé Bourlard:
Automatic And Perceptual Discrimination Between Dysarthria, Apraxia of Speech, and Neurotypical Speech. ICASSP 2021: 7308-7312 - [c236]Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard:
Automatic Dysarthric Speech Detection Exploiting Pairwise Distance-Based Convolutional Neural Networks. ICASSP 2021: 7328-7332 - [c235]Apoorv Vyas, Srikanth R. Madikeri, Hervé Bourlard:
Comparing CTC and LFMMI for Out-of-Domain Adaptation of wav2vec 2.0 Acoustic Model. Interspeech 2021: 2861-2865 - [c234]Srikanth R. Madikeri, Petr Motlícek, Hervé Bourlard:
Multitask Adaptation with Lattice-Free MMI for Multi-Genre Speech Recognition of Low Resource Languages. Interspeech 2021: 4329-4333 - [i12]Apoorv Vyas, Srikanth R. Madikeri, Hervé Bourlard:
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model. CoRR abs/2104.02558 (2021) - 2020
- [j63]Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
On quantifying the quality of acoustic models in hybrid DNN-HMM ASR. Speech Commun. 119: 24-35 (2020) - [j62]Ina Kodrasi, Hervé Bourlard:
Spectro-Temporal Sparsity Characterization for Dysarthric Speech Detection. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1210-1222 (2020) - [j61]Dhananjay Ram, Lesly Miculicich, Hervé Bourlard:
Neural Network Based End-to-End Query by Example Spoken Term Detection. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1416-1427 (2020) - [j60]Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard:
Automatic Pathological Speech Intelligibility Assessment Exploiting Subspace-Based Analyses. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1717-1728 (2020) - [c233]Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard:
Synthetic Speech References for Automatic Pathological Speech Intelligibility Assessment. ICASSP 2020: 6099-6103 - [c232]Banriskhem K. Khonglah, Srikanth R. Madikeri, Subhadeep Dey, Hervé Bourlard, Petr Motlícek, Jayadev Billa:
Incremental Semi-Supervised Learning for Multi-Genre Speech Recognition. ICASSP 2020: 7419-7423 - [c231]Srikanth R. Madikeri, Banriskhem K. Khonglah, Sibo Tong, Petr Motlícek, Hervé Bourlard, Daniel Povey:
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems. INTERSPEECH 2020: 4746-4750 - [c230]Ina Kodrasi, Michaela Pernon, Marina Laganaro, Hervé Bourlard:
Automatic Discrimination of Apraxia of Speech and Dysarthria Using a Minimalistic Set of Handcrafted Features. INTERSPEECH 2020: 4991-4995 - [i11]Srikanth R. Madikeri, Sibo Tong, Juan Zuluaga-Gomez, Apoorv Vyas, Petr Motlícek, Hervé Bourlard:
Pkwrap: a PyTorch Package for LF-MMI Training of Acoustic Models. CoRR abs/2010.03466 (2020) - [i10]Ina Kodrasi, Michaela Pernon, Marina Laganaro, Hervé Bourlard:
Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech. CoRR abs/2011.07542 (2020) - [i9]Apoorv Vyas, Srikanth R. Madikeri, Hervé Bourlard:
Lattice-Free MMI Adaptation Of Self-Supervised Pretrained Acoustic Models. CoRR abs/2012.14252 (2020)
2010 – 2019
- 2019
- [j59]Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-rank and sparse subspace modeling of speech for DNN based acoustic modeling. Speech Commun. 109: 34-45 (2019) - [c229]Dhananjay Ram, Lesly Miculicich, Hervé Bourlard:
Multilingual Bottleneck Features for Query by Example Spoken Term Detection. ASRU 2019: 621-628 - [c228]Sibo Tong, Philip N. Garner, Hervé Bourlard:
An Investigation of Multilingual ASR Using End-to-end LF-MMI. ICASSP 2019: 6061-6065 - [c227]Ina Kodrasi, Hervé Bourlard:
Super-gaussianity of Speech Spectral Coefficients as a Potential Biomarker for Dysarthric Speech Detection. ICASSP 2019: 6400-6404 - [c226]Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard:
Pathological Speech Intelligibility Assessment Based on the Short-time Objective Intelligibility Measure. ICASSP 2019: 6405-6409 - [c225]Apoorv Vyas, Pranay Dighe, Sibo Tong, Hervé Bourlard:
Analyzing Uncertainties in Speech Recognition Using Dropout. ICASSP 2019: 6730-6734 - [c224]François Marelli, Bastian Schnell, Hervé Bourlard, Thierry Dutoit, Philip N. Garner:
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model. ICASSP 2019: 7040-7044 - [c223]Sibo Tong, Apoorv Vyas, Philip N. Garner, Hervé Bourlard:
Unbiased Semi-Supervised LF-MMI Training Using Dropout. INTERSPEECH 2019: 1576-1580 - [c222]Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard:
Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility. INTERSPEECH 2019: 3038-3042 - [i8]Dhananjay Ram, Lesly Miculicich, Hervé Bourlard:
Multilingual Bottleneck Features for Query by Example Spoken Term Detection. CoRR abs/1907.00443 (2019) - [i7]Dhananjay Ram, Lesly Miculicich, Hervé Bourlard:
Neural Network based End-to-End Query by Example Spoken Term Detection. CoRR abs/1911.08332 (2019) - 2018
- [j58]Dhananjay Ram, Afsaneh Asaei, Hervé Bourlard:
Phonetic subspace features for improved query by example spoken term detection. Speech Commun. 103: 27-36 (2018) - [j57]Sibo Tong, Philip N. Garner, Hervé Bourlard:
Cross-lingual adaptation of a CTC-based multilingual acoustic model. Speech Commun. 104: 39-46 (2018) - [j56]Dhananjay Ram, Afsaneh Asaei, Hervé Bourlard:
Sparse Subspace Modeling for Query by Example Spoken Term Detection. IEEE ACM Trans. Audio Speech Lang. Process. 26(6): 1126-1139 (2018) - [c221]Ina Kodrasi, Hervé Bourlard:
Statistical Modeling of Speech Spectral Coefficients in Patients with Parkinson's Disease. ITG Symposium on Speech Communication 2018: 1-5 - [c220]Dhananjay Ram, Lesly Miculicich, Hervé Bourlard:
CNN Based Query by Example Spoken Term Detection. INTERSPEECH 2018: 92-96 - [c219]Ina Kodrasi, Hervé Bourlard:
Single-channel Late Reverberation Power Spectral Density Estimation Using Denoising Autoencoders. INTERSPEECH 2018: 1319-1323 - [c218]Hervé Bourlard:
Evolution of Neural Network Architectures for Speech Recognition. INTERSPEECH 2018: 1767 - [c217]Afsaneh Asaei, Dhananjay Ram, Hervé Bourlard:
Phonological Posterior Hashing for Query by Example Spoken Term Detection. INTERSPEECH 2018: 2067-2071 - [c216]Sibo Tong, Philip N. Garner, Hervé Bourlard:
Fast Language Adaptation Using Phonological Information. INTERSPEECH 2018: 2459-2463 - [c215]Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Far-Field ASR Using Low-Rank and Sparse Soft Targets from Parallel Data. SLT 2018: 581-587 - 2017
- [j55]Afsaneh Asaei, Milos Cernak, Hervé Bourlard:
Perceptual Information Loss due to Impaired Speech Production. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2433-2443 (2017) - [c214]Renars Liepins, Ulrich Germann, Guntis Barzdins, Alexandra Birch, Steve Renals, Susanne Weber, Peggy van der Kreeft, Hervé Bourlard, João Prieto, Ondrej Klejch, Peter Bell, Alexandros Lazaridis, Afonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay B. Cohen, Tomasz Dwojak, Philip N. Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imran, David Nogueira, Ahmed M. Ali, Sebastião Miranda, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, Chris Hernon, Jeff Mitchell:
The SUMMA Platform Prototype. EACL (Software Demonstrations) 2017: 116-119 - [c213]Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-rank and sparse soft targets to learn better DNN acoustic models. ICASSP 2017: 5265-5269 - [c212]Sibo Tong, Philip N. Garner, Hervé Bourlard:
An Investigation of Deep Neural Networks for Multilingual Speech Recognition Training and Adaptation. INTERSPEECH 2017: 714-718 - [c211]Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Exploiting Eigenposteriors for Semi-Supervised Training of DNN Acoustic Models with Sequence Discrimination. INTERSPEECH 2017: 3552-3556 - [i6]Sibo Tong, Philip N. Garner, Hervé Bourlard:
Multilingual Training and Cross-lingual Adaptation on CTC-based Acoustic Model. CoRR abs/1711.10025 (2017) - 2016
- [j54]Raphael Ullmann, Hervé Bourlard:
Predicting the intrusiveness of noise through sparse coding with auditory kernels. Speech Commun. 76: 186-200 (2016) - [j53]Afsaneh Asaei, Hervé Bourlard, Mohammad Javad Taghizadeh, Volkan Cevher:
Computational methods for underdetermined convolutive speech localization and separation via model-based sparse component analysis. Speech Commun. 76: 201-217 (2016) - [j52]Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition. Speech Commun. 76: 230-244 (2016) - [j51]Milos Cernak, Afsaneh Asaei, Hervé Bourlard:
On structured sparsity of phonological posteriors for linguistic parsing. Speech Commun. 84: 36-45 (2016) - [j50]Marc Ferras, Srikanth R. Madikeri, Petr Motlícek, Subhadeep Dey, Hervé Bourlard:
A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition. IEEE Signal Process. Lett. 23(4): 527-531 (2016) - [j49]Marc Ferras, Srikanth R. Madikeri, Hervé Bourlard:
Speaker Diarization and Linking of Meeting Data. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 1935-1945 (2016) - [j48]Afsaneh Asaei, Mohammad Javad Taghizadeh, Saeid Haghighatshoar, Bhiksha Raj, Hervé Bourlard, Volkan Cevher:
Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization. IEEE Trans. Signal Process. 64(3): 567-579 (2016) - [c210]Marc Ferras, Srikanth R. Madikeri, Petr Motlícek, Hervé Bourlard:
System fusion and speaker linking for longitudinal diarization of TV shows. ICASSP 2016: 5495-5499 - [c209]Pranay Dighe, Gil Luyet, Afsaneh Asaei, Hervé Bourlard:
Exploiting low-dimensional structures to enhance DNN based acoustic modeling in speech recognition. ICASSP 2016: 5690-5694 - [c208]Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner, Hervé Bourlard:
Sound Pattern Matching for Automatic Prosodic Event Detection. INTERSPEECH 2016: 170-174 - [c207]Dhananjay Ram, Afsaneh Asaei, Hervé Bourlard:
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection. INTERSPEECH 2016: 918-922 - [c206]Marc Ferras, Srikanth R. Madikeri, Subhadeep Dey, Petr Motlícek, Hervé Bourlard:
Inter-Task System Fusion for Speaker Recognition. INTERSPEECH 2016: 1810-1814 - [c205]Afsaneh Asaei, Gil Luyet, Milos Cernak, Hervé Bourlard:
Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures. INTERSPEECH 2016: 1873-1877 - [c204]Gil Luyet, Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-Rank Representation of Nearest Neighbor Posterior Probabilities to Enhance DNN Based Acoustic Modeling. INTERSPEECH 2016: 3449-3453 - [i5]Milos Cernak, Afsaneh Asaei, Hervé Bourlard:
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing. CoRR abs/1601.05647 (2016) - [i4]Pranay Dighe, Gil Luyet, Afsaneh Asaei, Hervé Bourlard:
Exploiting Low-dimensional Structures to Enhance DNN Based Acoustic Modeling in Speech Recognition. CoRR abs/1601.05936 (2016) - [i3]Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models. CoRR abs/1610.05688 (2016) - 2015
- [j47]Mohammad Javad Taghizadeh, Afsaneh Asaei, Saeid Haghighatshoar, Philip N. Garner, Hervé Bourlard:
Spatial Sound Localization via Multipath Euclidean Distance Matrix Recovery. IEEE J. Sel. Top. Signal Process. 9(5): 802-814 (2015) - [j46]Mohammad Javad Taghizadeh, Reza Parhizkar, Philip N. Garner, Hervé Bourlard, Afsaneh Asaei:
Ad hoc microphone array calibration: Euclidean distance matrix completion algorithm and theoretical guarantees. Signal Process. 107: 123-140 (2015) - [j45]Ashtosh Sapru, Hervé Bourlard:
Automatic Recognition of Emergent Social Roles in Small Group Interactions. IEEE Trans. Multim. 17(5): 746-760 (2015) - [c203]Mohammad Javad Taghizadeh, Saeid Haghighatshoar, Afsaneh Asaei, Philip N. Garner, Hervé Bourlard:
Robust microphone placement for source localization from noisy distance measurements. ICASSP 2015: 2579-2583 - [c202]José F. Velasco, Mohammad Javad Taghizadeh, Afsaneh Asaei, Hervé Bourlard, Carlos Julian Martín-Arguedas, Javier Macías Guarasa, Daniel Pizarro:
Novel GCC-PHAT model in diffuse sound field for microphone array pairwise distance based calibration. ICASSP 2015: 2669-2673 - [c201]Afsaneh Asaei, Nasser Mohammadiha, Mohammad Javad Taghizadeh, Simon Doclo, Hervé Bourlard:
On application of non-negative matrix factorization for ad hoc microphone array calibration from incomplete noisy distances. ICASSP 2015: 2694-2698 - [c200]Srikanth R. Madikeri, Hervé Bourlard:
KL-HMM based speaker diarization system for meetings. ICASSP 2015: 4435-4439 - [c199]Srikanth R. Madikeri, Petr Motlícek, Hervé Bourlard:
Combining SGMM speaker vectors and KL-HMM approach for speaker diarization. ICASSP 2015: 4834-4838 - [c198]Raphael Ullmann, Mathew Magimai-Doss, Hervé Bourlard:
Objective speech intelligibility assessment through comparison of phoneme class conditional probability sequences. ICASSP 2015: 4924-4928 - [c197]Afsaneh Asaei, Milos Cernak, Hervé Bourlard:
On compressibility of neural network phonological features for low bit rate speech coding. INTERSPEECH 2015: 418-422 - [c196]Raphael Ullmann, Ramya Rasipuram, Mathew Magimai-Doss, Hervé Bourlard:
Objective intelligibility assessment of text-to-speech systems through utterance verification. INTERSPEECH 2015: 3501-3505 - [c195]Dhananjay Ram, Afsaneh Asaei, Pranay Dighe, Hervé Bourlard:
Sparse modeling of posterior exemplars for keyword detection. INTERSPEECH 2015: 3690-3694 - 2014
- [j44]Mohammad Javad Taghizadeh, Philip N. Garner, Hervé Bourlard:
Enhanced diffuse field model for ad hoc microphone array calibration. Signal Process. 101: 242-255 (2014) - [j43]David Imseng, Petr Motlícek, Hervé Bourlard, Philip N. Garner:
Using out-of-language data to improve an under-resourced speech recognizer. Speech Commun. 56: 142-151 (2014) - [j42]Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard, Volkan Cevher:
Structured Sparsity Models for Reverberant Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 22(3): 620-633 (2014) - [j41]Sree Harsha Yella, Hervé Bourlard:
Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations. IEEE ACM Trans. Audio Speech Lang. Process. 22(12): 1688-1700 (2014) - [j40]Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard, Qingmin Liao:
Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array. IEEE ACM Trans. Audio Speech Lang. Process. 22(12): 2244-2255 (2014) - [c194]Mohammad Javad Taghizadeh, Afsaneh Asaei, Philip N. Garner, Hervé Bourlard:
Ad-hoc microphone array calibration from partial distance measurements. HSCMA 2014: 1-5 - [c193]Sree Harsha Yella, Hervé Bourlard:
Information bottleneck based speaker diarization of meetings using non-speech as side information. ICASSP 2014: 96-100 - [c192]Ashtosh Sapru, Sree Harsha Yella, Hervé Bourlard:
Improving speaker diarization using social role information. ICASSP 2014: 101-105 - [c191]Srikanth R. Madikeri, Hervé Bourlard:
Filterbank slope based features for speaker diarization. ICASSP 2014: 111-115 - [c190]Afsaneh Asaei, Hervé Bourlard, Mohammad Javad Taghizadeh, Volkan Cevher:
Model-based sparse component analysis for reverberant speech localization. ICASSP 2014: 1439-1443 - [c189]David Imseng, Blaise Potard, Petr Motlícek, Alexandre Nanchen, Hervé Bourlard:
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages. ICASSP 2014: 2322-2326 - [c188]Ngoc Thang Vu, David Imseng, Daniel Povey, Petr Motlícek, Tanja Schultz, Hervé Bourlard:
Multilingual deep neural network based acoustic modeling for rapid language adaptation. ICASSP 2014: 7639-7643 - [c187]Steve Renals, Jean Carletta, Keith Edwards, Hervé Bourlard, Philip N. Garner, Andrei Popescu-Belis, Dietrich Klakow, Andrey Girenko, Volha Petukhova, Philippe Wacker, Andrew Joscelyne, Costis Kompis, Simon Aliwell, William Stevens, Youssef Sabbah:
ROCKIT: Roadmap for Conversational Interaction Technologies. RFMIR@ICMI 2014: 39-42 - [c186]Pranay Dighe, Marc Ferras, Hervé Bourlard:
Detecting and labeling speakers on overlapping speech using vector taylor series. INTERSPEECH 2014: 592-596 - [c185]Sree Harsha Yella, Petr Motlícek, Hervé Bourlard:
Phoneme background model for information bottleneck based speaker diarization. INTERSPEECH 2014: 597-601 - [c184]Marc Ferras, Stefano Masneri, Oliver Schreer, Hervé Bourlard:
Diarizing large corpora using multi-modal speaker linking. INTERSPEECH 2014: 602-606 - [c183]Sara Bahaadini, Afsaneh Asaei, David Imseng, Hervé Bourlard:
Posterior-based sparse representation for automatic speech recognition. INTERSPEECH 2014: 2454-2458 - [c182]Marc Ferras, Hervé Bourlard:
Multi-source posteriors for speech activity detection on public talks. INTERSPEECH 2014: 2529-2532 - [c181]Ashtosh Sapru, Hervé Bourlard:
Detecting speaker roles and topic changes in multiparty conversations using latent topic models. INTERSPEECH 2014: 2882-2886 - [c180]Pranay Dighe, Marc Ferras, Hervé Bourlard:
Modeling Overlapping Speech using Vector Taylor Series. Odyssey 2014: 194-199 - [i2]Mohammad Javad Taghizadeh, Reza Parhizkar, Philip N. Garner, Hervé Bourlard, Afsaneh Asaei:
Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees. CoRR abs/1409.0203 (2014) - 2013
- [j39]Petr Motlícek, Stefan Duffner, Danil Korchagin, Hervé Bourlard, Carl Scheffler, Jean-Marc Odobez, Giovanni Del Galdo, Markus Kallinger, Oliver Thiergart:
Real-Time Audio-Visual Analysis for Multiperson Videoconferencing. Adv. Multim. 2013: 175745:1-175745:21 (2013) - [j38]Sree Hari Krishnan Parthasarathi, Hervé Bourlard, Daniel Gatica-Perez:
Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations. IEEE Trans. Speech Audio Process. 21(1): 83-96 (2013) - [j37]Weifeng Li, Longbiao Wang, Yicong Zhou, Hervé Bourlard, Qingmin Liao:
Robust Log-Energy Estimation and its Dynamic Change Enhancement for In-car Speech Recognition. IEEE Trans. Speech Audio Process. 21(8): 1689-1698 (2013) - [j36]David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss:
Applying Multi- and Cross-Lingual Stochastic Phone Space Transformations to Non-Native Speech Recognition. IEEE Trans. Speech Audio Process. 21(8): 1713-1726 (2013) - [c179]Ashtosh Sapru, Hervé Bourlard:
Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings. ACII 2013: 324-329 - [c178]David Imseng, Petr Motlícek, Philip N. Garner, Hervé Bourlard:
Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition. ASRU 2013: 332-337 - [c177]Marc Ferras, Hervé Bourlard:
MLP-based factor analysis for tandem speech recognition. ICASSP 2013: 6719-6723 - [c176]Sree Harsha Yella, Hervé Bourlard:
Improved overlap speech diarization of meeting recordings using long-term conversational features. ICASSP 2013: 7746-7750 - [c175]David Imseng, Hervé Bourlard:
Speaker adaptive Kullback-Leibler divergence based hidden Markov models. ICASSP 2013: 7913-7917 - [c174]Mohammad Javad Taghizadeh, Reza Parhizkar, Philip N. Garner, Hervé Bourlard:
Euclidean distance matrix completion for ad-hoc microphone array calibration. DSP 2013: 1-7 - [c173]Hervé Bourlard, Marc Ferras, Nikolaos Pappas, Andrei Popescu-Belis, Steve Renals, Fergus McInnes, Peter Bell, Sandy Ingram, Maël Guillemot:
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project. SLAM@INTERSPEECH 2013: 3-8 - [c172]Ashtosh Sapru, Hervé Bourlard:
Automatic social role recognition in professional meetings using conditional random fields. INTERSPEECH 2013: 1530-1534 - 2012
- [j35]Andrei Popescu-Belis, Denis Lalanne, Hervé Bourlard:
Finding Information in Multimedia Meeting Records. IEEE Multim. 19(2): 48-57 (2012) - [j34]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features. Speech Commun. 54(1): 55-67 (2012) - [j33]Hervé Bourlard, Vikram Krishnamurthy, Yan Lindsay Sun, H. Vicky Zhao, K. J. Ray Liu:
A Technical Revolution: Social Learning and Networking [From the Guest Editors]. IEEE Signal Process. Mag. 29(2): 20-21 (2012) - [c171]Afsaneh Asaei, Michael E. Davies, Hervé Bourlard, Volkan Cevher:
Computational methods for structured sparse component analysis of convolutive speech mixtures. ICASSP 2012: 2425-2428 - [c170]David Imseng, Hervé Bourlard, Philip N. Garner:
Using KL-divergence and multilingual information to improve ASR for under-resourced languages. ICASSP 2012: 4869-4872 - [c169]Mohammad Javad Taghizadeh, Philip N. Garner, Hervé Bourlard:
Microphone array beampattern characterization for hands-free speech applications. SAM 2012: 465-468 - [c168]Serena Soldo, Mathew Magimai-Doss, Hervé Bourlard:
Template-based ASR using posterior features and synthetic references: comparing different TTS systems. SAPA@INTERSPEECH 2012: 52-57 - [c167]Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard, Volkan Cevher:
Structured sparse coding for microphone array location calibration. SAPA@INTERSPEECH 2012: 74-79 - [c166]Weifeng Li, Hervé Bourlard:
Sub-band based Log-energy and Its Dynamic Range Stretching for Robust In-car Speech Recognition. INTERSPEECH 2012: 314-317 - [c165]David Imseng, John Dines, Petr Motlícek, Philip N. Garner, Hervé Bourlard:
Comparing different acoustic modeling techniques for multilingual boosting. INTERSPEECH 2012: 1191-1194 - [c164]Milos Cernak, David Imseng, Hervé Bourlard:
Robust triphone mapping for acoustic modeling. INTERSPEECH 2012: 1910-1913 - [c163]Serena Soldo, Mathew Magimai-Doss, Hervé Bourlard:
Synthetic References for Template-based ASR using posterior features. INTERSPEECH 2012: 2146-2149 - [c162]David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé, Alexandre Nanchen:
MediaParl: Bilingual mixed language accented speech database. SLT 2012: 263-268 - [c161]David Imseng, Hervé Bourlard, Philip N. Garner:
Boosting under-resourced speech recognizers by exploiting out-of-language data - case study on Afrikaans. SLTU 2012: 60-67 - [i1]Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard, Volkan Cevher:
Structured Sparsity Models for Multiparty Speech Recovery from Reverberant Recordings. CoRR abs/1210.6766 (2012) - 2011
- [j32]Mazin Gilbert, Alex Acero, Jordan Cohen, Hervé Bourlard, Shih-Fu Chang, Minoru Etoh:
Media Search in Mobile Devices [From the Guest Editors]. IEEE Signal Process. Mag. 28(4): 12-13 (2011) - [j31]Joel Pinto, Garimella S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky, Hervé Bourlard:
Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator. IEEE Trans. Speech Audio Process. 19(2): 225-241 (2011) - [j30]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization. IEEE Trans. Speech Audio Process. 19(2): 431-438 (2011) - [j29]Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard, Mathew Magimai-Doss:
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2538-2551 (2011) - [c160]Hamid Reza Abutalebi, Hedieh Heli, Danil Korchagin, Hervé Bourlard:
A BSS-based approach for localization of simultaneous speakers in reverberant conditions. EUSIPCO 2011: 254-258 - [c159]Afsaneh Asaei, Hervé Bourlard, Volkan Cevher:
Model-based compressive sensing for multi-party distant speech recognition. ICASSP 2011: 4600-4603 - [c158]Serena Soldo, Mathew Magimai-Doss, Joel Pinto, Hervé Bourlard:
Posterior features for template-based ASR. ICASSP 2011: 4864-4867 - [c157]David Imseng, Hervé Bourlard, Mathew Magimai-Doss, John Dines:
Language dependent universal phoneme posterior estimation for mixed language speech recognition. ICASSP 2011: 5012-5015 - [c156]Danil Korchagin, Petr Motlícek, Stefan Duffner, Hervé Bourlard:
Just-in-time multimodal association and fusion from home entertainment. ICME 2011: 1-5 - [c155]Afsaneh Asaei, Mohammad Javad Taghizadeh, Hervé Bourlard, Volkan Cevher:
Multi-Party Speech Recovery Exploiting Structured Sparsity Models. INTERSPEECH 2011: 185-188 - [c154]Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla, Hervé Bourlard:
Grapheme-Based Automatic Speech Recognition Using KL-HMM. INTERSPEECH 2011: 445-448 - [c153]David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss:
Improving Non-Native ASR Through Stochastic Multilingual Phoneme Space Transformations. INTERSPEECH 2011: 537-540 - [c152]Sree Hari Krishnan Parthasarathi, Hervé Bourlard, Daniel Gatica-Perez:
LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization. INTERSPEECH 2011: 1045-1048 - [c151]Joel Pinto, Mathew Magimai-Doss, Hervé Bourlard:
Hierarchical Tandem Features for ASR in Mandarin. INTERSPEECH 2011: 1241-1244 - [e3]Hervé Bourlard, Thomas S. Huang, Enrique Vidal, Daniel Gatica-Perez, Louis-Philippe Morency, Nicu Sebe:
Proceedings of the 13th International Conference on Multimodal Interfaces, ICMI 2011, Alicante, Spain, November 14-18, 2011. ACM 2011, ISBN 978-1-4503-0641-6 [contents] - 2010
- [j28]K. J. Ray Liu, Hervé Bourlard, Vikram Krishnamurthy, Alex Pentland, Stephen B. Wicker:
Introduction to the Special Issue on Signal and Information Processing for Social Networks. IEEE J. Sel. Top. Signal Process. 4(4): 673-676 (2010) - [j27]Hamed Ketabdar, Hervé Bourlard:
Enhanced Phone Posteriors for Improving Speech Recognition Systems. IEEE Trans. Speech Audio Process. 18(6): 1094-1106 (2010) - [c150]Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard, Daniel Gatica-Perez:
Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios. ICASSP 2010: 4474-4477 - [c149]Afsaneh Asaei, Benjamin Picart, Hervé Bourlard:
Analysis of phone posterior feature space exploiting class-specific sparsity and MLP-based similarity measure. ICASSP 2010: 4886-4889 - [c148]Giulia Garau, Hervé Bourlard:
Using audio and visual cues for speaker diarisation initialisation. ICASSP 2010: 4942-4945 - [c147]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Multistream speaker diarization beyond two acoustic feature streams. ICASSP 2010: 4950-4953 - [c146]David Imseng, Hervé Bourlard, Mathew Magimai-Doss:
Towards mixed language speech recognition systems. INTERSPEECH 2010: 278-281 - [c145]Afsaneh Asaei, Hervé Bourlard, Philip N. Garner:
Sparse component analysis for speech recognition in multi-speaker environment. INTERSPEECH 2010: 1704-1707 - [c144]Alfred Dielmann, Giulia Garau, Hervé Bourlard:
Floor holder detection and end of speaker turn prediction in meetings. INTERSPEECH 2010: 2306-2309 - [c143]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Advances in fast multistream diarization based on the information bottleneck framework. INTERSPEECH 2010: 2650-2653 - [c142]Giulia Garau, Alfred Dielmann, Hervé Bourlard:
Audio-visual synchronisation for speaker diarisation. INTERSPEECH 2010: 2654-2657 - [c141]David Imseng, Mathew Magimai-Doss, Hervé Bourlard:
Hierarchical multilayer perceptron based language identification. INTERSPEECH 2010: 2722-2725 - [c140]Alessandro Vinciarelli, Roderick Murray-Smith, Hervé Bourlard:
Mobile social signal processing: vision and research issues. Mobile HCI 2010: 513-516
2000 – 2009
- 2009
- [j26]Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard:
Social signal processing: Survey of an emerging domain. Image Vis. Comput. 27(12): 1743-1759 (2009) - [j25]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
An Information Theoretic Approach to Speaker Diarization of Meeting Data. IEEE Trans. Speech Audio Process. 17(7): 1382-1393 (2009) - [c139]Joel Pinto, Mathew Magimai-Doss, Hervé Bourlard:
MLP based hierarchical system for task adaptation in ASR. ASRU 2009: 365-370 - [c138]Guillermo Aradilla, Hervé Bourlard, Mathew Magimai-Doss:
Posterior features applied to speech recognition tasks with user-defined vocabulary. ICASSP 2009: 3809-3812 - [c137]Weifeng Li, John Dines, Mathew Magimai-Doss, Hervé Bourlard:
Non-linear mapping for multi-channel speech separation and robust overlapping spech recognition. ICASSP 2009: 3921-3924 - [c136]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Mutual information based channel selection for speaker diarization of meetings data. ICASSP 2009: 4065-4068 - [c135]Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez, Hervé Bourlard:
Speaker change detection with privacy-preserving audio cues. ICMI 2009: 343-346 - [c134]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
KL realignment for speaker diarization with multiple feature streams. INTERSPEECH 2009: 1059-1062 - [c133]Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard, Daniel Gatica-Perez:
Investigating privacy-sensitive features for speech detection in multiparty conversations. INTERSPEECH 2009: 2243-2246 - [c132]Giulia Garau, Sileye O. Ba, Hervé Bourlard, Jean-Marc Odobez:
Investigating the use of visual focus of attention for audio-visual speaker diarisation. ACM Multimedia 2009: 681-684 - [p1]Claude Stricker, Jean-Frédéric Wagen, Guillermo Aradilla, Hervé Bourlard, Hynek Hermansky, Joel Pinto, Paul-Henri Rey, Jérôme Théraulaz:
Intelligent Multi-modal Interfaces for Mobile Applications in Hostile Environment(IM-HOST). Human Machine Interaction 2009: 71-102 - 2008
- [c131]Weifeng Li, Mathew Magimai-Doss, John Dines, Hervé Bourlard:
MLP-based log spectral energy mapping for robust overlapping speech recognition. EUSIPCO 2008: 1-5 - [c130]Hamed Ketabdar, Hervé Bourlard:
Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation. ICASSP 2008: 4065-4068 - [c129]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Combination of agglomerative and sequential clustering for speaker diarization. ICASSP 2008: 4361-4364 - [c128]Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard, Alex Pentland:
Social signals, their function, and automatic analysis: a survey. ICMI 2008: 61-68 - [c127]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Integration of TDOA features in information bottleneck framework for fast speaker diarization. INTERSPEECH 2008: 40-43 - [c126]Guillermo Aradilla, Hervé Bourlard, Mathew Magimai-Doss:
Using KL-based acoustic models in a large vocabulary recognition task. INTERSPEECH 2008: 928-931 - [c125]Weifeng Li, John Dines, Mathew Magimai-Doss, Hervé Bourlard:
Neural network based regression for robust overlapping speech recognition using microphone arrays. INTERSPEECH 2008: 2012-2015 - [c124]Weifeng Li, Ken'ichi Kumatani, John Dines, Mathew Magimai-Doss, Hervé Bourlard:
A Neural Network Based Regression Approach for Recognizing Simultaneous Speech. MLMI 2008: 110-118 - [c123]Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard, Alex Pentland:
Social signal processing: state-of-the-art and future perspectives of an emerging domain. ACM Multimedia 2008: 1061-1070 - [e2]Andrei Popescu-Belis, Steve Renals, Hervé Bourlard:
Machine Learning for Multimodal Interaction , 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers. Lecture Notes in Computer Science 4892, Springer 2008, ISBN 978-3-540-78154-7 [contents] - 2007
- [c122]Steve Renals, Thomas Hain, Hervé Bourlard:
Recognition and understanding of meetings the AMI and AMIDA projects. ASRU 2007: 238-247 - [c121]Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Agglomerative information bottleneck for speaker diarization of meetings data. ASRU 2007: 250-255 - [c120]Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard:
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features. ICASSP (4) 2007: 657-660 - [c119]Weifeng Li, Hervé Bourlard:
Non-linear spectral contrast stretching for in-car speech recognition. INTERSPEECH 2007: 1122-1125 - [c118]Hamed Ketabdar, Hervé Bourlard:
In-context phone posteriors as complementary features for tandem ASR. INTERSPEECH 2007: 2069-2072 - [c117]Guillermo Aradilla, Hervé Bourlard:
Posterior-Based Features and Distances in Template Matching for Speech Recognition. MLMI 2007: 204-214 - 2006
- [j24]Vivek Tyagi, Hervé Bourlard, Christian Wellekens:
On variable-scale piecewise stationary spectral analysis of speech signals for ASR. Speech Commun. 48(9): 1182-1191 (2006) - [j23]Mohamed Faouzi BenZeghiba, Hervé Bourlard:
User-customized password speaker verification using multiple reference and background models. Speech Commun. 48(9): 1200-1213 (2006) - [c116]Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard:
Using More Informative Posterior Probabilities for Speech Recognition. ICASSP (1) 2006: 29-32 - [c115]Guillaume Lathoud, Mathew Magimai-Doss, Hervé Bourlard:
Threshold Selection for Unsupervised Detection, With an Application to Microphone Arrays. ICASSP (3) 2006: 285-288 - [c114]Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard:
Using Pitch as Prior Knowledge in Template-Based Speech Recognition. ICASSP (1) 2006: 445-448 - [c113]Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard:
Using posterior-based features in template matching for speech recognition. INTERSPEECH 2006 - [c112]Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard:
Posterior based keyword spotting with a priori thresholds. INTERSPEECH 2006 - [c111]Hemant Misra, Jithendra Vepa, Hervé Bourlard:
Multi-stream ASR: an oracle perspective. INTERSPEECH 2006 - [c110]Marc A. Al-Hames, Thomas Hain, Jan Cernocký, Sascha Schreiber, Mannes Poel, Ronald Müller, Sébastien Marcel, David A. van Leeuwen, Jean-Marc Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlícek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith, Andrew H. C. Thean, Pavel Zemcík:
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. MLMI 2006: 24-35 - [c109]Hervé Bourlard:
Understanding and Modeling Communication Scenes. SLT 2006: 14 - 2005
- [j22]Nelson Morgan, Qifeng Zhu, Andreas Stolcke, M. Kemal Sönmez, Sunil Sivadas, Takahiro Shinozaki, Mari Ostendorf, Pratibha Jain, Hynek Hermansky, Dan Ellis, George R. Doddington, Barry Y. Chen, Özgür Çetin, Hervé Bourlard, Marios Athineos:
Pushing the envelope - aside [speech recognition]. IEEE Signal Process. Mag. 22(5): 81-88 (2005) - [j21]Pere Pujol, Susagna Pol, Climent Nadeu, Astrid Hagen, Hervé Bourlard:
Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system. IEEE Trans. Speech Audio Process. 13(1): 14-22 (2005) - [c108]Hemant Misra, Shajith Ikbal, Sunil Sivadas, Hervé Bourlard:
Multi-resolution Spectral Entropy Feature for Robust ASR. ICASSP (1) 2005: 253-256 - [c107]Shajith Ikbal, Hervé Bourlard, Mathew Magimai-Doss:
HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition. ICASSP (1) 2005: 453-456 - [c106]Vivek Tyagi, Christian Wellekens, Hervé Bourlard:
On variable-scale piecewise stationary spectral analysis of speech signals for ASR. INTERSPEECH 2005: 209-212 - [c105]Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard:
Developing and enhancing posterior based speech recognition systems. INTERSPEECH 2005: 1461-1464 - [c104]Hemant Misra, Hervé Bourlard:
Spectral entropy feature in full-combination multi-stream for robust ASR. INTERSPEECH 2005: 2633-2636 - [c103]Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard:
Improving speech recognition using a data-driven approach. INTERSPEECH 2005: 3333-3336 - [c102]Vivek Tyagi, Christian Wellekens, Hervé Bourlard:
A Variable-Scale Piecewise Stationary Spectral Analysis Technique Applied to ASR. MLMI 2005: 274-284 - [c101]Hamed Ketabdar, Hervé Bourlard, Samy Bengio:
Hierarchical Multi-stream Posterior Based Speech Recognition System. MLMI 2005: 294-306 - 2004
- [j20]Hervé Bourlard, Ioannis Pitas, Kenneth Kin-Man Lam, Yue Wang:
Editorial. EURASIP J. Adv. Signal Process. 2004(4): 427-429 (2004) - [j19]Datong Chen, Jean-Marc Odobez, Hervé Bourlard:
Text detection, recognition in images and video frames. Pattern Recognit. 37(3): 595-608 (2004) - [j18]Jitendra Ajmera, Iain McCowan, Hervé Bourlard:
Robust speaker change detection. IEEE Signal Process. Lett. 11(8): 649-651 (2004) - [j17]Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard:
Speech recognition with auxiliary information. IEEE Trans. Speech Audio Process. 12(3): 189-203 (2004) - [c100]Samy Bengio, Hervé Bourlard:
Multi Channel Sequence Processing. Deterministic and Statistical Methods in Machine Learning 2004: 22-36 - [c99]Mathew Magimai-Doss, Samy Bengio, Hervé Bourlard:
Joint decoding for phoneme-grapheme continuous speech recognition. ICASSP (1) 2004: 177-180 - [c98]Hemant Misra, Shajith Ikbal, Hervé Bourlard, Hynek Hermansky:
Spectral entropy based feature for robust ASR. ICASSP (1) 2004: 193-196 - [c97]Shajith Ikbal, Hemant Misra, Hervé Bourlard, Hynek Hermansky:
Phase autocorrelation (PAC) features in entropy based multi-stream for robust speech recognition. ICASSP (1) 2004: 205-208 - [c96]Mohamed Faouzi BenZeghiba, Hervé Bourlard:
Confidence measures in multiple pronunciations modeling for speaker verification. ICASSP (1) 2004: 389-392 - [c95]Mathew Magimai-Doss, Shajith Ikbal, Todd A. Stephenson, Hervé Bourlard:
Modeling auxiliary features in tandem systems. INTERSPEECH 2004: 1501-1504 - [c94]Jitendra Ajmera, Iain McCowan, Hervé Bourlard:
An online audio indexing system. INTERSPEECH 2004: 1601-1604 - [c93]Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra, Hervé Bourlard:
Spectro-temporal activity pattern (STAP) features for noise robust ASR. INTERSPEECH 2004: 2109-2112 - [c92]Mohamed Faouzi BenZeghiba, Hervé Bourlard:
Posteriori probabilities and likelihoods combination for speech and speaker recognition. INTERSPEECH 2004: 2345-2348 - [c91]Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky, Hervé Bourlard:
Entropy based combination of tandem representations for noise robust ASR. INTERSPEECH 2004: 2553-2556 - [c90]Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard:
Towards Computer Understanding of Human Interactions. MLMI 2004: 56-75 - [c89]Mathew Magimai-Doss, Hervé Bourlard:
On the Adequacy of Baseform Pronunciations and Pronunciation Variants. MLMI 2004: 209-222 - [e1]Samy Bengio, Hervé Bourlard:
Machine Learning for Multimodal Interaction, First International Workshop,MLMI 2004, Martigny, Switzerland, June 21-23, 2004, Revised Selected Papers. Lecture Notes in Computer Science 3361, Springer 2004, ISBN 3-540-24509-X [contents] - 2003
- [j16]Katrin Weber, Shajith Ikbal, Samy Bengio, Hervé Bourlard:
Robust speech recognition and feature extraction using HMM2. Comput. Speech Lang. 17(2-3): 195-211 (2003) - [j15]Jitendra Ajmera, Iain McCowan, Hervé Bourlard:
Speech/music segmentation using entropy and dynamism features in a HMM classification framework. Speech Commun. 40(3): 351-363 (2003) - [j14]Iain McCowan, Hervé Bourlard:
Microphone array post-filter based on noise field coherence. IEEE Trans. Speech Audio Process. 11(6): 709-716 (2003) - [c88]Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard:
Towards Computer Understanding of Human Interactions. EUSAI 2003: 235-251 - [c87]Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard:
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks. ICASSP (1) 2003: 20-23 - [c86]Shajith Ikbal, Hemant Misra, Hervé Bourlard:
Phase autocorrelation (PAC) derived robust speech features. ICASSP (2) 2003: 133-136 - [c85]Mohamed Faouzi BenZeghiba, Hervé Bourlard:
Hybrid HMM/ANN and GMM combination for user-customized password speaker verification. ICASSP (2) 2003: 225-228 - [c84]Hemant Misra, Hervé Bourlard, Vivek Tyagi:
New entropy based combination rules in HMM/ANN multi-stream ASR. ICASSP (2) 2003: 741-744 - [c83]Iain McCowan, Samy Bengio, Daniel Gatica-Perez, Guillaume Lathoud, Florent Monay, Darren Moore, Pierre Wellner, Hervé Bourlard:
Modeling human interaction in meetings. ICASSP (4) 2003: 748-751 - [c82]Mark Barnard, Samy Bengio, Hervé Bourlard, Daniel Gatica-Perez, Iain McCowan:
On automatic annotation of meeting databases. ICIP (3) 2003: 629-632 - [c81]Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux, Sébastien Marcel:
Speech & face based biometric authentication at IDIAP. ICME 2003: 1-4 - [c80]Vivek Tyagi, Iain McCowan, Hervé Bourlard, Hemant Misra:
On factorizing spectral dynamics for robust speech recognition. INTERSPEECH 2003: 981-984 - [c79]Mohamed Faouzi BenZeghiba, Hervé Bourlard:
On the combination of speech and speaker recognition. INTERSPEECH 2003: 1361-1364 - [c78]Mathew Magimai-Doss, Todd A. Stephenson, Hervé Bourlard:
Using pitch frequency information in speech recognition. INTERSPEECH 2003: 2525-2528 - 2002
- [j13]Sebastian Möller, Hervé Bourlard:
Analytic assessment of telephone transmission impact on ASR performance using a simulation model. Speech Commun. 38(3-4): 441-459 (2002) - [c77]Jitendra Ajmera, Iain McCowan, Hervé Bourlard:
Robust HMM-based speech/music segmentation. ICASSP 2002: 297-300 - [c76]Iain McCowan, Hervé Bourlard:
Microphone array post-filter for diffuse noise field. ICASSP 2002: 905-908 - [c75]Katrin Weber, Samy Bengio, Hervé Bourlard:
Increasing speech recognition robustness with HMM2. ICASSP 2002: 929-932 - [c74]Datong Chen, Jean-Marc Odobez, Hervé Bourlard:
Text Segmentation and Recognition in Complex Background Based on Markov Random Field. ICPR (4) 2002: 227-230 - [c73]Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard:
Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition. ICPR (4) 2002: 293- - [c72]Hervé Bourlard:
Some Recent Advances in Speech Recognition with Potential Applications in Other Statistical Pattern Recognition Areas. ICPR (3) 2002: 727-727 - [c71]Jitendra Ajmera, Hervé Bourlard, I. Lapidot, Iain McCowan:
Unknown-multiple speaker clustering using HMM. INTERSPEECH 2002: 573-576 - [c70]Andrew C. Morris, Simon Payne, Hervé Bourlard:
Low cost duration modelling for noise robust speech recognition. INTERSPEECH 2002: 1025-1028 - [c69]Pere Pujol Marsal, Susagna Pol, Astrid Hagen, Hervé Bourlard, Climent Nadeu:
Comparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system. INTERSPEECH 2002: 1057-1060 - [c68]Mohamed Faouzi BenZeghiba, Hervé Bourlard:
User-customized password speaker verification based on HMM/ANN and GMM models. INTERSPEECH 2002: 1325-1328 - [c67]Katrin Weber, Febe de Wet, Bert Cranen, Lou Boves, Samy Bengio, Hervé Bourlard:
Evaluation of formant-like features for ASR. INTERSPEECH 2002: 2101-2104 - [c66]Iain McCowan, Andrew C. Morris, Hervé Bourlard:
Improving speech recognition performance of small microphone arrays using missing data techniques. INTERSPEECH 2002: 2181-2184 - [c65]Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard:
Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition. INTERSPEECH 2002: 2665-2668 - [c64]Todd A. Stephenson, Jaume Escofet, Mathew Magimai-Doss, Hervé Bourlard:
Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables. NNSP 2002: 637-646 - [c63]Shajith Ikbal, Katrin Weber, Hervé Bourlard:
Speaker normalization using HMM2. NNSP 2002: 647-656 - 2001
- [j12]Andrew C. Morris, Astrid Hagen, Hervé Glotin, Hervé Bourlard:
Multi-stream adaptive evidence combination for noise robust ASR. Speech Commun. 34(1-2): 25-40 (2001) - [c62]Datong Chen, Hervé Bourlard, Jean-Philippe Thiran:
Text Identification in Complex Background Using SVM. CVPR (2) 2001: 621-627 - [c61]Astrid Hagen, Hervé Bourlard, Andrew C. Morris:
Adaptive ML-weighting in multi-band recombination of Gaussian mixture ASR. ICASSP 2001: 257-260 - [c60]Datong Chen, Kim Shearer, Hervé Bourlard:
Text Enhancement with Asymmetric Filter for Video OCR. ICIAP 2001: 192-197 - [c59]Andrew C. Morris, Astrid Hagen, Hervé Bourlard:
MAP combination of multi-stream HMM or HMM/ANN experts. INTERSPEECH 2001: 225-228 - [c58]Astrid Hagen, Hervé Bourlard:
Error correcting posterior combination for robust multi-band speech recognition. INTERSPEECH 2001: 587-590 - [c57]Katrin Weber, Samy Bengio, Hervé Bourlard:
HMM2- extraction of formant structures and their use for robust ASR. INTERSPEECH 2001: 607-610 - [c56]Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard:
Modeling auxiliary information in Bayesian network based ASR. INTERSPEECH 2001: 2765-2768 - 2000
- [c55]Hervé Bourlard:
Will the spoken words be back to libraries? (invited talk - Abstract not available). DELOS 2000 - [c54]Marius-Calin Silaghi, Hervé Bourlard:
A new keyword spotting approach based on iterative dynamic programming. ICASSP 2000: 1831-1834 - [c53]Katrin Weber, Samy Bengio, Hervé Bourlard:
HMM2- a novel approach to HMM emission probability estimation. INTERSPEECH 2000: 147-150 - [c52]Astrid Hagen, Hervé Bourlard:
Using multiple time scales in the framework of multi-stream speech recognition. INTERSPEECH 2000: 349-352 - [c51]Andrew C. Morris, Ljubomir Josifovski, Hervé Bourlard, Martin Cooke, Phil D. Green:
A neural network for classification with incomplete data: application to robust ASR. INTERSPEECH 2000: 409-412 - [c50]Sebastian Möller, Hervé Bourlard:
Real-time telephone transmission simulation for speech recognizer and dialogue system evaluation and improvement. INTERSPEECH 2000: 750-753 - [c49]Todd A. Stephenson, Hervé Bourlard, Samy Bengio, Andrew C. Morris:
Automatic speech recognition using dynamic bayesian networks with both acoustic and articulatory variables. INTERSPEECH 2000: 951-954 - [c48]Giulia Bernardis, Hervé Bourlard, Martin Rajman, Jean-Cédric Chappelier:
Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers. LREC 2000 - [c47]Hervé Bourlard, Samy Bengio, Katrin Weber:
New Approaches Towards Robust, Adaptive Speech Recognition (invited paper). NIPS 2000: 751-757
1990 – 1999
- 1999
- [c46]Andrew C. Morris, Astrid Hagen, Hervé Bourlard:
The full combination sub-bands approach to noise robust HMM/ANN based ASR. EUROSPEECH 1999: 599-602 - 1998
- [c45]Giulia Bernardis, Hervé Bourlard:
Improving posterior based confidence measures in hybrid HMM/ANN speech recognition systems. ICSLP 1998 - [c44]Frédéric Berthommier, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard:
Interfacing of CASA and partial recognition based on a multistream technique. ICSLP 1998 - 1997
- [c43]Hervé Bourlard:
State-of-the-Art and Recent Progress in Hybrid HMM/ANN Speech Recognition. ICANN 1997: 875-884 - [c42]Hervé Bourlard, Stéphane Dupont:
Subband-based speech recognition. ICASSP 1997: 1251-1254 - [c41]Vincent Fontaine, Hervé Bourlard:
Speaker-dependent speech recognition based on phone-like units models-application to voice dialling. ICASSP 1997: 1527-1530 - [c40]Stéphane Dupont, Hervé Bourlard, Olivier Deroo, Vincent Fontaine, Jean-Marc Boite:
Hybrid HMM/ANN systems for training independent tasks: experiments on Phonebook and related improvements. ICASSP 1997: 1767-1770 - [c39]Stéphane Dupont, Hervé Bourlard:
Using multiple time scales in a multi-stream speech recognition system. EUROSPEECH 1997: 3-6 - [c38]Jean Hennebert, Christophe Ris, Hervé Bourlard, Steve Renals, Nelson Morgan:
Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems. EUROSPEECH 1997: 1951-1954 - [c37]Hervé Bourlard, Nelson Morgan:
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions. Summer School on Neural Networks 1997: 389-417 - 1996
- [j11]Hervé Bourlard, Hynek Hermansky, Nelson Morgan:
Towards increasing speech recognition error rates. Speech Commun. 18(3): 205-231 (1996) - [j10]Hervé Bourlard, Yochai Konig, Nelson Morgan:
A training algorithm for statistical sequence recognition with applications to transition-based speech recognition. IEEE Signal Process. Lett. 3(7): 203-205 (1996) - [c36]Hervé Bourlard, Stéphane Dupont, Hynek Hermansky, Nelson Morgan:
Towards subband-based speech recognition. EUSIPCO 1996: 1-4 - [c35]Hervé Bourlard, Yochai Konig, Nelson Morgan, Christophe Ris:
A new training algorithm for hybrid HMM/ANN speech recognition systems. EUSIPCO 1996: 1-4 - [c34]Yochai Konig, Hervé Bourlard, Nelson Morgan:
REMAP-experiments with speech recognition. ICASSP 1996: 3350-3353 - [c33]Hervé Bourlard, Stéphane Dupont:
A new ASR approach based on independent processing and recombination of partial frequency bands. ICSLP 1996: 426-429 - [c32]Jeff A. Bilmes, Nelson Morgan, Su-Lin Wu, Hervé Bourlard:
Stochastic perceptual speech models with durational dependence. ICSLP 1996: 1301-1304 - 1995
- [j9]Nelson Morgan, Hervé Bourlard:
Neural networks for statistical recognition of continuous speech. Proc. IEEE 83(5): 742-772 (1995) - [j8]Johan de Veth, Hervé Bourlard:
Comparison of hidden Markov model techniques for automatic speaker verification in real-world conditions. Speech Commun. 17(1-2): 81-90 (1995) - [j7]Nelson Morgan, Hervé Bourlard:
Continuous speech recognition. IEEE Signal Process. Mag. 12(3): 24-42 (1995) - [c31]Nelson Morgan, Hervé Bourlard, Steven Greenberg, Hynek Hermansky, Su-Lin Wu:
Stochastic perceptual models of speech. ICASSP 1995: 397-400 - [c30]Nelson Morgan, Su-Lin Wu, Hervé Bourlard:
Digit recognition with stochastic perceptual speech models. EUROSPEECH 1995: 771-774 - [c29]Hervé Bourlard:
Towards increasing speech recognition error rates. EUROSPEECH 1995: 883-894 - [c28]Hervé Bourlard, Yochai Konig, Nelson Morgan:
REMAP: recursive estimation and maximization of a posteriori probabilities in connectionist speech recognition. EUROSPEECH 1995: 1663-1666 - [c27]Yochai Konig, Hervé Bourlard, Nelson Morgan:
REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech Recognition. NIPS 1995: 388-394 - 1994
- [j6]Steve Renals, Nelson Morgan, Hervé Bourlard, Michael Cohen, Horacio Franco:
Connectionist probability estimators in HMM speech recognition. IEEE Trans. Speech Audio Process. 2(1): 161-174 (1994) - [c26]Hervé Bourlard, Bart D'hoore, Jean-Marc Boite:
Optimizing recognition and rejection performance in wordspotting systems. ICASSP (1) 1994: 373-376 - [c25]Jean-Marc Boite, Hervé Bourlard, Bart D'hoore, Sari Accaino, Johan Vantieghem:
Task independent and dependent training: performance comparison of HMM and hybrid HMM/MLP approaches. ICASSP (1) 1994: 617-620 - [c24]Hugo Van hamme, Guido Gallopyn, Ludwig Weynants, Bart D'hoore, Hervé Bourlard:
Comparison of acoustic features and robustness tests of a real-time recogniser using a hardware telephone line simulator. ICSLP 1994: 1907-1910 - [c23]Nelson Morgan, Hervé Bourlard, Steven Greenberg, Hynek Hermansky:
Stochastic perceptual auditory-event-based models for speech recognition. ICSLP 1994: 1943-1946 - 1993
- [j5]Nelson Morgan, Hervé Bourlard, Steve Renals, Michael Cohen, Horacio Franco:
Hybrid Neural Network/Hidden Markov Model Systems for Continuous Speech Recognition. Int. J. Pattern Recognit. Artif. Intell. 7(4): 899-916 (1993) - [j4]Hervé Bourlard, Nelson Morgan:
Continuous speech recognition by connectionist statistical methods. IEEE Trans. Neural Networks 4(6): 893-909 (1993) - [c22]Johan de Veth, Guido Gallopyn, Hervé Bourlard:
Limited parameter hidden Markov models for connected digit speaker verification over telephone channels. ICASSP (2) 1993: 247-250 - [c21]Marco Saerens, Hervé Bourlard:
Linear and nonlinear prediction for speech recognition with hidden Markov models. EUROSPEECH 1993: 807-810 - [c20]Jean-Marc Boite, Hervé Bourlard, Bart D'hoore, Marc Haesen:
A new approach towards keyword spotting. EUROSPEECH 1993: 1273-1276 - [c19]Philipp Schmid, Ronald A. Cole, Mark A. Fanty, Hervé Bourlard, M. Haessen:
Real-time, neural network-based, French alphabet recognition with telephone speech. EUROSPEECH 1993: 1723-1726 - [c18]Hervé Bourlard, Jean-Marc Boite, Bart D'hoore, Marco Saerens:
Performance comparison of hidden Markov models and neural networks for task dependent and independent isolated word recognition. EUROSPEECH 1993: 1925-1928 - [c17]Tony Robinson, Luís B. Almeida, Jean-Marc Boite, Hervé Bourlard, Frank Fallside, Mike Hochberg, Dan J. Kershaw, Phil Kohn, Yochai Konig, Nelson Morgan, João Paulo Neto, Steve Renals, Marco Saerens, Chuck Wooters:
A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the WERNICKE project. EUROSPEECH 1993: 1941-1944 - [c16]Johan de Veth, Guido Gallopyn, Hervé Bourlard:
Speaker verification over telephone channels based on concatenated phonemic hidden Markov models. EUROSPEECH 1993: 2279-2282 - 1992
- [j3]Nelson Morgan, Hervé Bourlard:
Factoring Networks by a Statistical Method. Neural Comput. 4(6): 835-838 (1992) - [j2]Hervé Bourlard, Nelson Morgan, Steve Renals:
Neural nets and hidden Markov models: Review and generalizations. Speech Commun. 11(2-3): 237-246 (1992) - [c15]Hervé Bourlard, Nelson Morgan, Chuck Wooters, Steve Renals:
CDNN: a context dependent neural network for continuous speech recognition. ICASSP 1992: 349-352 - 1991
- [c14]Nelson Morgan, Hynek Hermansky, Hervé Bourlard, Phil Kohn, Chuck Wooters:
Continuous speech recognition using PLP analysis with multilayer perceptrons. ICASSP 1991: 49-52 - [c13]Nelson Morgan, Hervé Bourlard, Chuck Wooters, Phil Kohn, Michael Cohen:
Phonetic context in hybrid HMM/MLP continuous speech recognition. EUROSPEECH 1991: 109-112 - [c12]Hervé Bourlard:
Neural nets and hidden Markov models: review and generalizations. EUROSPEECH 1991: 363-369 - [c11]Steve Renals, Nelson Morgan, Hervé Bourlard, Horacio Franco, Michael Cohen:
Connectionist Optimisation of Tied Mixture Hidden Markov Models. NIPS 1991: 167-174 - 1990
- [j1]Hervé Bourlard, Christian Wellekens:
Links Between Markov Models and Multilayer Perceptrons. IEEE Trans. Pattern Anal. Mach. Intell. 12(12): 1167-1178 (1990) - [c10]Nelson Morgan, Hervé Bourlard:
Continuous speech recognition using multilayer perceptrons with hidden Markov models. ICASSP 1990: 413-416 - [c9]Nelson Morgan, Chuck Wooters, Hervé Bourlard, Michael Cohen:
Continuous speech recognition on the resource management database using connectionist probability estimation. ICSLP 1990: 1337-1340 - [c8]Hervé Bourlard, Nelson Morgan, Chuck Wooters:
Connectionist Approaches to the Use of Markov Models for Speech Recognition. NIPS 1990: 213-219
1980 – 1989
- 1989
- [c7]Hervé Bourlard, Christian J. Wellekens:
Speech dynamics and recurrent neural networks. ICASSP 1989: 33-36 - [c6]Hervé Bourlard, Nelson Morgan, Christian Wellekens:
Statistical Inference in Multilayer Perceptrons and Hidden Markov Models with Applications in Continuous Speech Recognition. NATO Neurocomputing 1989: 217-226 - [c5]Hervé Bourlard, Nelson Morgan:
A Continuous Speech Recognition System Embedding MLP into HMM. NIPS 1989: 186-193 - [c4]Nelson Morgan, Hervé Bourlard:
Generalization and Parameter Estimation in Feedforward Netws: Some Experiments. NIPS 1989: 630-637 - 1988
- [c3]Hervé Bourlard, Christian Wellekens:
Links Between Markov Models and Multilayer Perceptrons. NIPS 1988: 502-510 - 1985
- [c2]Hervé Bourlard, Yves G. Kamp, Christian Wellekens:
Speaker dependent connected speech recognition via phonetic Markov models. ICASSP 1985: 1213-1216 - 1984
- [c1]Hervé Bourlard, Christian Wellekens, Hermann Ney:
Connected digit recognition using vector quantization. ICASSP 1984: 413-416
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-20 00:43 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint