


Andreas Stolcke
Person information

- affiliation: Microsoft Research, Mountain View, CA, USA
- affiliation: Microsoft Research
2020 – today
- 2023
- [i52]Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh:
Adaptive Endpointing with Deep Contextual Multi-armed Bandits. CoRR abs/2303.13407 (2023) - [i51]Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran:
Cross-utterance ASR Rescoring with Graph-based Label Propagation. CoRR abs/2303.15132 (2023) - [i50]Rahul Pandey, Roger Ren, Qi Luo, Jing Liu, Ariya Rastrow, Ankur Gandhe, Denis Filimonov, Grant P. Strimel, Andreas Stolcke, Ivan Bulyko:
PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers. CoRR abs/2303.17131 (2023) - [i49]Denis Filimonov, Prabhat Pandey, Ariya Rastrow, Ankur Gandhe, Andreas Stolcke:
Streaming Speech-to-Confusion Network Speech Recognition. CoRR abs/2306.03778 (2023) - [i48]Aakriti Agrawal, Milind Rao, Anit Kumar Sahu, Gopinath Chennupati, Andreas Stolcke:
Learning When to Trust Which Teacher for Weakly Supervised ASR. CoRR abs/2306.12012 (2023) - 2022
- [c191]Scott Novotney, Sreeparna Mukherjee, Zeeshan Ahmed, Andreas Stolcke:
CUE Vectors: Modular Training of Language Models Conditioned on Diverse Contextual Signals. ACL (Findings) 2022: 3368-3379 - [c190]Liyan Xu, Yile Gu, Jari Kolehmainen, Haidar Khan, Ankur Gandhe, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
RescoreBERT: Discriminative Speech Recognition Rescoring With Bert. ICASSP 2022: 6117-6121 - [c189]Metehan Cekic, Ruirui Li, Zeya Chen, Yuguang Yang, Andreas Stolcke, Upamanyu Madhow:
Self-Supervised Speaker Recognition Training using Human-Machine Dialogues. ICASSP 2022: 6132-6136 - [c188]Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko:
Mitigating Closed-Model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition. ICASSP 2022: 6302-6306 - [c187]K. C. Kishan, Zhenning Tan, Long Chen, Minho Jin, Eunjung Han, Andreas Stolcke, Chul Lee:
OpenFEAT: Improving Speaker Identification by Open-Set Few-Shot Embedding Adaptation with Transformer. ICASSP 2022: 7062-7066 - [c186]Hua Shen, Yuguang Yang, Guoli Sun, Ryan Langman, Eunjung Han, Jasha Droppo, Andreas Stolcke:
Improving Fairness in Speaker Verification via Group-Adapted Fusion Network. ICASSP 2022: 7077-7081 - [c185]Xin Zhang, Minho Jin, Roger Cheng, Ruirui Li, Eunjung Han, Andreas Stolcke:
Contrastive-mixup Learning for Improved Speaker Verification. ICASSP 2022: 7652-7656 - [c184]Aparna Khare, Eunjung Han, Yuguang Yang, Andreas Stolcke:
ASR-Aware End-to-End Neural Diarization. ICASSP 2022: 8092-8096 - [c183]Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke:
Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities. INTERSPEECH 2022: 1268-1272 - [c182]Viet Anh Trinh, Pegah Ghahremani, Brian John King, Jasha Droppo, Andreas Stolcke, Roland Maas:
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation. INTERSPEECH 2022: 1298-1302 - [c181]Minho Jin, Chelsea Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke:
Adversarial Reweighting for Speaker Verification Fairness. INTERSPEECH 2022: 4800-4804 - [c180]Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke:
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification. INTERSPEECH 2022: 4805-4809 - [c179]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. SLT 2022: 1074-1080 - [i47]Liyan Xu, Yile Gu, Jari Kolehmainen, Haidar Khan, Ankur Gandhe, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
RescoreBERT: Discriminative Speech Recognition Rescoring with BERT. CoRR abs/2202.01094 (2022) - [i46]Aparna Khare, Eunjung Han, Yuguang Yang, Andreas Stolcke:
ASR-Aware End-to-end Neural Diarization. CoRR abs/2202.01286 (2022) - [i45]Metehan Cekic, Ruirui Li, Zeya Chen, Yuguang Yang, Andreas Stolcke, Upamanyu Madhow:
Self-supervised Speaker Recognition Training Using Human-Machine Dialogues. CoRR abs/2202.03484 (2022) - [i44]Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko:
Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition. CoRR abs/2202.08532 (2022) - [i43]Xin Zhang, Minho Jin, Roger Cheng, Ruirui Li, Eunjung Han, Andreas Stolcke:
Contrastive-mixup learning for improved speaker verification. CoRR abs/2202.10672 (2022) - [i42]Hua Shen, Yuguang Yang, Guoli Sun, Ryan Langman, Eunjung Han, Jasha Droppo, Andreas Stolcke:
Improving fairness in speaker verification via Group-adapted Fusion Network. CoRR abs/2202.11323 (2022) - [i41]Scott Novotney, Sreeparna Mukherjee, Zeeshan Ahmed, Andreas Stolcke:
CUE Vectors: Modular Training of Language Models Conditioned on Diverse Contextual Signals. CoRR abs/2203.08774 (2022) - [i40]Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke:
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification. CoRR abs/2207.04081 (2022) - [i39]Minho Jin, Chelsea J.-T. Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke:
Adversarial Reweighting for Speaker Verification Fairness. CoRR abs/2207.07776 (2022) - [i38]Viet Anh Trinh, Pegah Ghahremani, Brian John King, Jasha Droppo, Andreas Stolcke, Roland Maas:
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation. CoRR abs/2207.07850 (2022) - [i37]Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke:
Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities. CoRR abs/2207.11345 (2022) - [i36]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. CoRR abs/2210.05614 (2022) - [i35]Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran:
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech. CoRR abs/2211.09731 (2022) - 2021
- [c178]Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe:
Attention-based Contextual Language Model Adaptation for Speech Recognition. ACL/IJCNLP (Findings) 2021: 1994-2003 - [c177]Zhenning Tan, Yuguang Yang, Eunjung Han, Andreas Stolcke:
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets. ASRU 2021: 1124-1131 - [c176]Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang:
Contrastive Unsupervised Learning for Speech Emotion Recognition. ICASSP 2021: 6329-6333 - [c175]Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas:
REDAT: Accent-Invariant Representation for End-To-End ASR by Domain Adversarial Training with Relabeling. ICASSP 2021: 6408-6412 - [c174]Eunjung Han, Chul Lee, Andreas Stolcke:
BW-EDA-EEND: streaming END-TO-END Neural Speaker Diarization for a Variable Number of Speakers. ICASSP 2021: 7193-7197 - [c173]Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann:
Joint ASR and Language Identification Using RNN-T: An Efficient Approach to Dynamic Language Switching. ICASSP 2021: 7218-7222 - [c172]Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
Personalization Strategies for End-to-End Speech Recognition Systems. ICASSP 2021: 7348-7352 - [c171]Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke:
DO as I Mean, Not as I Say: Sequence Loss Training for Spoken Language Understanding. ICASSP 2021: 7473-7477 - [c170]Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
wav2vec-C: A Self-Supervised Model for Speech Representation Learning. Interspeech 2021: 711-715 - [c169]Yi-Chieh Liu, Eunjung Han, Chul Lee, Andreas Stolcke:
End-to-End Neural Diarization: From Transformer to Conformer. Interspeech 2021: 3081-3085 - [c168]Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo:
Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End. Interspeech 2021: 3455-3459 - [c167]Long Chen, Venkatesh Ravichandran, Andreas Stolcke:
Graph-Based Label Propagation for Semi-Supervised Speaker Identification. Interspeech 2021: 4588-4592 - [c166]Ruirui Li, Chelsea J.-T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, Andreas Stolcke:
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition. Interspeech 2021: 4593-4597 - [c165]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs. SLT 2021: 881-888 - [i34]Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang:
Contrastive Unsupervised Learning for Speech Emotion Recognition. CoRR abs/2102.06357 (2021) - [i33]Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke:
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding. CoRR abs/2102.06750 (2021) - [i32]Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
Personalization Strategies for End-to-End Speech Recognition Systems. CoRR abs/2102.07739 (2021) - [i31]Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
Wav2vec-C: A Self-supervised Model for Speech Representation Learning. CoRR abs/2103.08393 (2021) - [i30]Wen Wang, Andreas Stolcke, Jing Zheng:
Reranking Machine Translation Hypotheses with Structured and Web-based Language Models. CoRR abs/2104.12277 (2021) - [i29]Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo:
Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End. CoRR abs/2105.07071 (2021) - [i28]Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe:
Attention-based Contextual Language Model Adaptation for Speech Recognition. CoRR abs/2106.01451 (2021) - [i27]Yi-Chieh Liu, Eunjung Han, Chul Lee, Andreas Stolcke:
End-to-end Neural Diarization: From Transformer to Conformer. CoRR abs/2106.07167 (2021) - [i26]Long Chen, Venkatesh Ravichandran, Andreas Stolcke:
Graph-based Label Propagation for Semi-Supervised Speaker Identification. CoRR abs/2106.08207 (2021) - [i25]Ruirui Li, Chelsea J.-T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, Andreas Stolcke:
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition. CoRR abs/2106.10169 (2021) - [i24]Zhenning Tan, Yuguang Yang, Eunjung Han, Andreas Stolcke:
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets. CoRR abs/2109.02576 (2021) - 2020
- [c164]Dave Makhervaks, William Hinthorn, Dimitrios Dimitriadis, Andreas Stolcke:
Combining Acoustics, Content and Interaction Features to Find Hot Spots in Meetings. ICASSP 2020: 8054-8058 - [c163]Ruirui Li, Jyun-Yu Jiang, Xian Wu, Chu-Cheng Hsieh, Andreas Stolcke:
Speaker Identification for Household Scenarios with Self-Attention and Adversarial Training. INTERSPEECH 2020: 2272-2276 - [c162]Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas:
Efficient Minimum Word Error Rate Training of RNN-Transducer for End-to-End Speech Recognition. INTERSPEECH 2020: 2807-2811 - [c161]Andreas Stolcke:
Improving Diarization Robustness using Diversification, Randomization and the DOVER Algorithm. Odyssey 2020: 95-101 - [i23]Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas:
Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition. CoRR abs/2007.13802 (2020) - [i22]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs. CoRR abs/2011.01997 (2020) - [i21]Eunjung Han, Chul Lee, Andreas Stolcke:
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers. CoRR abs/2011.02678 (2020) - [i20]Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gökçe Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas:
REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling. CoRR abs/2012.07353 (2020)
2010 – 2019
- 2019
- [c160]Andreas Stolcke, Takuya Yoshioka:
Dover: A Method for Combining Diarization Outputs. ASRU 2019: 757-763 - [c159]Bryan Li, Dimitrios Dimitriadis, Andreas Stolcke:
Acoustic and Lexical Sentiment Analysis for Customer Service Calls. ICASSP 2019: 5876-5880 - [c158]Takuya Yoshioka, Dimitrios Dimitriadis, Andreas Stolcke, William Hinthorn, Zhuo Chen, Michael Zeng, Xuedong Huang:
Meeting Transcription Using Asynchronous Distant Microphones. INTERSPEECH 2019: 2968-2972 - [i19]Takuya Yoshioka, Zhuo Chen, Dimitrios Dimitriadis, William Hinthorn, Xuedong Huang, Andreas Stolcke, Michael Zeng:
Meeting Transcription Using Virtual Microphone Arrays. CoRR abs/1905.02545 (2019) - [i18]Andreas Stolcke, Takuya Yoshioka:
DOVER: A Method for Combining Diarization Outputs. CoRR abs/1909.08090 (2019) - [i17]Dave Makhervaks, William Hinthorn, Dimitrios Dimitriadis, Andreas Stolcke:
Combining Acoustics, Content and Interaction Features to Find Hot Spots in Meetings. CoRR abs/1910.10869 (2019) - [i16]Andreas Stolcke:
Improving Diarization Robustness using Diversification, Randomization and the DOVER Algorithm. CoRR abs/1910.11691 (2019) - 2018
- [j27]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Mispronunciation Detection in Children's Reading of Sentences. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1203-1215 (2018) - [c157]Wayne Xiong, Lingfeng Wu, Jun Zhang, Andreas Stolcke:
Session-level Language Modeling for Conversational Speech. EMNLP 2018: 2764-2768 - [c156]Wayne Xiong, Lingfeng Wu, Fil Alleva, Jasha Droppo, Xuedong Huang, Andreas Stolcke:
The Microsoft 2017 Conversational Speech Recognition System. ICASSP 2018: 5934-5938 - 2017
- [j26]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Automatic evaluation of reading aloud performance in children. Speech Commun. 94: 1-14 (2017) - [j25]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Michael L. Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
Toward Human Parity in Conversational Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2410-2423 (2017) - [c155]Geoffrey Zweig, Chengzhu Yu, Jasha Droppo, Andreas Stolcke:
Advances in all-neural speech recognition. ICASSP 2017: 4805-4809 - [c154]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
The Microsoft 2016 conversational speech recognition system. ICASSP 2017: 5255-5259 - [c153]Andreas Stolcke, Jasha Droppo:
Comparing Human and Machine Errors in Conversational Speech Transcription. INTERSPEECH 2017: 137-141 - [c152]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Detection of Mispronunciations and Disfluencies in Children Reading Aloud. INTERSPEECH 2017: 1437-1441 - [c151]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Automatic Evaluation of Children Reading Aloud on Sentences and Pseudowords. INTERSPEECH 2017: 2749-2753 - [i15]Wayne Xiong, Lingfeng Wu, Fil Alleva, Jasha Droppo, Xuedong Huang, Andreas Stolcke:
The Microsoft 2017 Conversational Speech Recognition System. CoRR abs/1708.06073 (2017) - [i14]Andreas Stolcke, Jasha Droppo:
Comparing Human and Machine Errors in Conversational Speech Transcription. CoRR abs/1708.08615 (2017) - 2016
- [j24]T. J. Tsai, Andreas Stolcke:
Robust and Efficient Multiple Alignment of Unsynchronized Meeting Recordings. IEEE ACM Trans. Audio Speech Lang. Process. 24(5): 833-845 (2016) - [c150]Suman V. Ravuri, Andreas Stolcke:
A comparative study of recurrent neural network models for lexical domain classification. ICASSP 2016: 6075-6079 - [c149]Dong Yu, Wayne Xiong, Jasha Droppo, Andreas Stolcke, Guoli Ye, Jinyu Li, Geoffrey Zweig:
Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention. INTERSPEECH 2016: 17-21 - [c148]Jorge Proença, Dirce Celorico, Carla Lopes, Miguel Sales Dias, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Design and Analysis of a Database to Evaluate Children's Reading Aloud Performance. PROPOR 2016: 385-395 - [i13]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
The Microsoft 2016 Conversational Speech Recognition System. CoRR abs/1609.03528 (2016) - [i12]Geoffrey Zweig, Chengzhu Yu, Jasha Droppo, Andreas Stolcke:
Advances in All-Neural Speech Recognition. CoRR abs/1609.05935 (2016) - [i11]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
Achieving Human Parity in Conversational Speech Recognition. CoRR abs/1610.05256 (2016) - 2015
- [j23]T. J. Tsai, Andreas Stolcke, Malcolm Slaney:
A Study of Multimodal Addressee Detection in Human-Human-Computer Interaction. IEEE Trans. Multim. 17(9): 1550-1561 (2015) - [c147]Abdel-rahman Mohamed, Frank Seide, Dong Yu, Jasha Droppo, Andreas Stolcke, Geoffrey Zweig, Gerald Penn:
Deep bi-directional recurrent networks over spectral windows. ASRU 2015: 78-83 - [c146]Suman V. Ravuri, Andreas Stolcke:
A comparative study of neural network models for lexical intent classification. ASRU 2015: 368-374 - [c145]T. J. Tsai, Andreas Stolcke, Malcolm Slaney:
Multimodal addressee detection in multiparty dialogue systems. ICASSP 2015: 2314-2318 - [c144]Michael Levit, Andreas Stolcke, Shuangyu Chang, Sarangarajan Parthasarathy:
Token-level interpolation for class-based language models. ICASSP 2015: 5426-5430 - [c143]Suman V. Ravuri, Andreas Stolcke:
Recurrent neural network and LSTM models for lexical utterance classification. INTERSPEECH 2015: 135-139 - [c142]Michael Levit, Andreas Stolcke, R. Subba, Sarangarajan Parthasarathy, Shuangyu Chang, S. Xie, T. Anastasakos, Benoît Dumoulin:
Personalization of word-phrase-entity language models. INTERSPEECH 2015: 448-452 - [c141]T. J. Tsai, Andreas Stolcke:
Aligning meeting recordings via adaptive fingerprinting. INTERSPEECH 2015: 786-790 - [c140]Sree Harsha Yella, Andreas Stolcke:
A comparison of neural network feature transforms for speaker diarization. INTERSPEECH 2015: 3026-3030 - 2014
- [c139]Malcolm Slaney, Rahul Rajan, Andreas Stolcke, Partha Parthasarathy:
Gaze-enhanced speech recognition. ICASSP 2014: 3236-3240 - [c138]Andreas Stolcke, Neville Ryant, Vikramjit Mitra, Jiahong Yuan, Wen Wang, Mark Liberman:
Highly accurate phonetic segmentation using boundary correction models and system fusion. ICASSP 2014: 5552-5556 - [c137]Malcolm Slaney, Andreas Stolcke, Dilek Hakkani-Tür:
The Relation of Eye Gaze and Face Pose: Potential Impact on Speech Recognition. ICMI 2014: 144-147 - [c136]Suman V. Ravuri, Andreas Stolcke:
Neural network models for lexical addressee detection. INTERSPEECH 2014: 298-302 - [c135]Michael Levit, Sarangarajan Parthasarathy, Shuangyu Chang, Andreas Stolcke, Benoît Dumoulin:
Word-phrase-entity language models: getting more mileage out of n-grams. INTERSPEECH 2014: 666-670 - [c134]Sree Harsha Yella, Andreas Stolcke, Malcolm Slaney:
Artificial neural network features for speaker diarization. SLT 2014: 402-406 - 2013
- [c133]Vikramjit Mitra, Wen Wang, Andreas Stolcke, Hosung Nam, Colleen Richey, Jiahong Yuan, Mark Liberman:
Articulatory trajectories for large-vocabulary speech recognition. ICASSP 2013: 7145-7149 - [c132]Mark Liberman, Jiahong Yuan, Andreas Stolcke, Wen Wang, Vikramjit Mitra:
Using multiple versions of speech input in phone recognition. ICASSP 2013: 7591-7595 - [c131]Jiahong Yuan, Neville Ryant, Mark Liberman, Andreas Stolcke, Vikramjit Mitra, Wen Wang:
Automatic phonetic segmentation using boundary models. INTERSPEECH 2013: 2306-2310 - [c130]Elizabeth Shriberg, Andreas Stolcke, Suman V. Ravuri:
Addressee detection for dialog systems using temporal and spectral dimensions of speaking style. INTERSPEECH 2013: 2559-2563 - [c129]Heeyoung Lee, Andreas Stolcke, Elizabeth Shriberg:
Using Out-of-Domain Data for Lexical Addressee Detection in Human-Human-Computer Dialog. HLT-NAACL 2013: 221-229 - [c128]Wen Wang, Andreas Stolcke, Jiahong Yuan, Mark Liberman:
A Cross-language Study on Automatic Speech Disfluency Detection. HLT-NAACL 2013: 703-708 - 2012
- [c127]Andreas Stolcke, Arindam Mandal, Elizabeth Shriberg:
Speaker recognition with region-constrained MLLR transforms. ICASSP 2012: 4397-4400 - [c126]Elizabeth Shriberg, Andreas Stolcke, Dilek Hakkani-Tür, Larry P. Heck:
Learning When to Listen: Detecting System-Addressed Speech in Human-Human-Computer Dialog. INTERSPEECH 2012: 334-337 - [c125]