default search action

combined dblp search
author search
venue search
publication search

ask others

George Saon

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Books and Theses

see FAQ

What is the meaning of the colors in the publication lists?

1997
[b1]
- view
  - electronic edition @ archives-ouvertes.fr
  - details & citations
- export record
  dblp key:
  - phd/hal/Saon97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/hal/Saon97
George Saon:
Modèles markoviens uni- et bidimensionnels pour la reconnaissance de l'écriture manuscrite hors-ligne. (One and two-dimensional Markov models for off-line handwriting recognition). Henri Poincaré University, Nancy, France, 1997

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2021
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/CuiZKLFKSK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/CuiZKLFKSK21
Xiaodong Cui, Wei Zhang, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung:
Asynchronous Decentralized Distributed Training of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3565-3576 (2021)
2020
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/CuiZFSPK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/CuiZFSPK20
Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David S. Kung:
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies. IEEE Signal Process. Mag. 37(3): 39-49 (2020)
2017
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/ibmrd/SaonP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ibmrd/SaonP17
George Saon, Michael Picheny:
Recent advances in conversational speech recognition using convolutional and recurrent neural networks. IBM J. Res. Dev. 61(4-5): 1:1-1:10 (2017)
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/ibmrd/AudhkhasiRSSRCP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ibmrd/AudhkhasiRSSRCP17
Kartik Audhkhasi, Andrew Rosenberg, George Saon, Abhinav Sethy, Bhuvana Ramabhadran, Stanley F. Chen, Michael Picheny:
Recent progress in deep end-to-end models for spoken language processing. IBM J. Res. Dev. 61(4-5): 2:1-2:10 (2017)
2015
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/SainathKSSMDR15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/SainathKSSMDR15
Tara N. Sainath, Brian Kingsbury, George Saon, Hagen Soltau, Abdel-rahman Mohamed, George E. Dahl, Bhuvana Ramabhadran:
Deep Convolutional Neural Networks for Large-scale Speech Tasks. Neural Networks 64: 39-48 (2015)
2012
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/SaonS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/SaonS12
George Saon, Hagen Soltau:
Boosting systems for large vocabulary continuous speech recognition. Speech Commun. 54(2): 212-218 (2012)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/SaonC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/SaonC12
George Saon, Jen-Tzung Chien:
Large-Vocabulary Continuous Speech Recognition Systems: A Look at Some Recent Advances. IEEE Signal Process. Mag. 29(6): 18-33 (2012)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SaonC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SaonC12
George Saon, Jen-Tzung Chien:
Bayesian Sensing Hidden Markov Models. IEEE Trans. Speech Audio Process. 20(1): 43-54 (2012)
2011
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/ibmrd/PichenyNGKRRS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ibmrd/PichenyNGKRRS11
Michael Picheny, David Nahamoo, Vaibhava Goel, Brian Kingsbury, Bhuvana Ramabhadran, Steven J. Rennie, George Saon:
Trends and advances in speech recognition. IBM J. Res. Dev. 55(5): 2 (2011)
2009
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SoltauSKKMPE09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SoltauSKKMPE09
Hagen Soltau, George Saon, Brian Kingsbury, Hong-Kwang Jeff Kuo, Lidia Mangu, Daniel Povey, Ahmad Emami:
Advances in Arabic Speech Transcription at IBM Under the DARPA GALE Program. IEEE Trans. Speech Audio Process. 17(5): 884-894 (2009)
2006
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenKMPSSZ06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenKMPSSZ06
Stanley F. Chen, Brian Kingsbury, Lidia Mangu, Daniel Povey, George Saon, Hagen Soltau, Geoffrey Zweig:
Advances in speech transcription at IBM under the DARPA EARS program. IEEE Trans. Speech Audio Process. 14(5): 1596-1608 (2006)
2004
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/YvonZS04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/YvonZS04
François Yvon, Geoffrey Zweig, George Saon:
Arc minimization in finite-state decoding graphs with cross-word acoustic context. Comput. Speech Lang. 18(4): 397-415 (2004)
2002
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/PadmanabhanSHKM02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PadmanabhanSHKM02
Mukund Padmanabhan, George Saon, Jing Huang, Brian Kingsbury, Lidia Mangu:
Automatic speech recognition performance on a voicemail transcription task. IEEE Trans. Speech Audio Process. 10(7): 433-442 (2002)
2001
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SaonP01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SaonP01
George Saon, Mukund Padmanabhan:
Data-driven approach to designing compound words for continuous speech recognition. IEEE Trans. Speech Audio Process. 9(4): 327-332 (2001)
1999
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ijdar/Saon99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijdar/Saon99
George Saon:
Cursive word recognition using a random field based hidden Markov model. Int. J. Document Anal. Recognit. 1(4): 199-208 (1999)
1997
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijprai/SaonB97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijprai/SaonB97
George Saon, Abdel Belaïd:
High Performance Unconstrained Word Recognition System Combining HMMs and Markov Random Fields. Int. J. Pattern Recognit. Artif. Intell. 11(5): 771-788 (1997)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/UdagawaSKMS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/UdagawaSKMS24
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon:
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems. ICASSP 2024: 10176-10180
[c112]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AroraS0K24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AroraS0K24
Siddhant Arora, George Saon, Shinji Watanabe, Brian Kingsbury:
Semi-Autoregressive Streaming ASR with Label Context. ICASSP 2024: 11681-11685
2023
[c111]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/MittalSJSK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/MittalSJSK23
Ashish R. Mittal, Sunita Sarawagi, Preethi Jyothi, George Saon, Gakuto Kurata:
Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries. EMNLP 2023: 14820-14835
[c110]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonGC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonGC23
George Saon, Ankit Gupta, Xiaodong Cui:
Diagonal State Space Augmented Transformers for Speech Recognition. ICASSP 2023: 1-5
[c109]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ThomasKSK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ThomasKSK23
Samuel Thomas, Hong-Kwang Jeff Kuo, George Saon, Brian Kingsbury:
Multi-Speaker Data Augmentation for Improved end-to-end Automatic Speech Recognition. ICASSP 2023: 1-5
[c108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiSK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiSK23
Xiaodong Cui, George Saon, Brian Kingsbury:
Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition. INTERSPEECH 2023: 1299-1303
2022
[c107]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BohnstinglGWSEP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BohnstinglGWSEP22
Thomas Bohnstingl, Ayush Garg, Stanislaw Wozniak, George Saon, Evangelos Eleftheriou, Angeliki Pantazi:
Speech Recognition Using Biologically-Inspired Neural Networks. ICASSP 2022: 6992-6996
[c106]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KuoTTKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KuoTTKS22
Hong-Kwang Jeff Kuo, Zoltán Tüske, Samuel Thomas, Brian Kingsbury, George Saon:
Improving End-to-end Models for Set Prediction in Spoken Language Understanding. ICASSP 2022: 7162-7166
[c105]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ThomasKKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ThomasKKS22
Samuel Thomas, Hong-Kwang Jeff Kuo, Brian Kingsbury, George Saon:
Towards Reducing the Need for Speech Training Data to Build Spoken Language Understanding Systems. ICASSP 2022: 7932-7936
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ThomasKSK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ThomasKSK22
Samuel Thomas, Brian Kingsbury, George Saon, Hong-Kwang Jeff Kuo:
Integrating Text Inputs for Training and Adapting RNN Transducer ASR Models. ICASSP 2022: 8127-8131
[c103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KonsAMDK0S22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KonsAMDK0S22
Zvi Kons, Hagai Aronowitz, Edmilson da Silva Morais, Matheus Damasceno, Hong-Kwang Kuo, Samuel Thomas, George Saon:
Extending RNN-T-based speech recognition systems with emotion and language classification. INTERSPEECH 2022: 546-549
[c102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShiSH0K22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiSH0K22
Jiatong Shi, George Saon, David Haws, Shinji Watanabe, Brian Kingsbury:
VQ-T: RNN Transducers using Vector-Quantized Prediction Network States. INTERSPEECH 2022: 1656-1660
[c101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FasoliCSVSCKG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FasoliCSVSCKG22
Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan:
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization. INTERSPEECH 2022: 2038-2042
[c100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiSNSFKK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiSNSFKK22
Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata:
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. INTERSPEECH 2022: 2638-2642
[c99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Fukuda0SKSK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Fukuda0SKSK22
Takashi Fukuda, Samuel Thomas, Masayuki Suzuki, Gakuto Kurata, George Saon, Brian Kingsbury:
Global RNN Transducer Models For Multi-dialect Speech Recognition. INTERSPEECH 2022: 3138-3142
[c98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/UdagawaSKIS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/UdagawaSKIS22
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon:
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems. INTERSPEECH 2022: 3919-3923
2021
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonTBK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonTBK21
George Saon, Zoltán Tüske, Daniel Bolaños, Brian Kingsbury:
Advancing RNN Transducer Technology for Speech Recognition. ICASSP 2021: 5654-5658
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0001KSTKKKH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0001KSTKKKH21
Samuel Thomas, Hong-Kwang Jeff Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory:
RNN Transducer Models for Spoken Language Understanding. ICASSP 2021: 7493-7497
[c95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Ganhotra0KJSTK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Ganhotra0KJSTK21
Jatin Ganhotra, Samuel Thomas, Hong-Kwang Jeff Kuo, Sachindra Joshi, George Saon, Zoltán Tüske, Brian Kingsbury:
Integrating Dialog History into End-to-End Spoken Language Understanding Systems. Interspeech 2021: 1254-1258
[c94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiKSHT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiKSHT21
Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltán Tüske:
Reducing Exposure Bias in Training Recurrent Neural Network Transducers. Interspeech 2021: 1802-1806
[c93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KurataSKHT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KurataSKHT21
Gakuto Kurata, George Saon, Brian Kingsbury, David Haws, Zoltán Tüske:
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio. Interspeech 2021: 2027-2031
[c92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeSK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeSK21
Zoltán Tüske, George Saon, Brian Kingsbury:
On the Limit of English Conversational Speech Recognition. Interspeech 2021: 2062-2066
[c91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FasoliCSSWVSCK021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FasoliCSSWVSCK021
Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan:
4-Bit Quantization of LSTM-Based Speech Recognition Models. Interspeech 2021: 2586-2590
2020
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCKLFKSMBDK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCKLFKSMBDK20
Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David S. Kung, Michael Picheny:
Improving Efficiency in Large-Scale Decentralized Distributed Training. ICASSP 2020: 3022-3026
[c89]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonTA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonTA20
George Saon, Zoltán Tüske, Kartik Audhkhasi:
Alignment-Length Synchronous Decoding for RNN Transducer. ICASSP 2020: 7804-7808
[c88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeSAK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeSAK20
Zoltán Tüske, George Saon, Kartik Audhkhasi, Brian Kingsbury:
Single Headed Attention Based Sequence-to-Sequence Model for State-of-the-Art Results on Switchboard. INTERSPEECH 2020: 551-555
[c87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KurataS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KurataS20
Gakuto Kurata, George Saon:
Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition. INTERSPEECH 2020: 2117-2121
2019
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SaonTAKPT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaonTAKPT19
George Saon, Zoltán Tüske, Kartik Audhkhasi, Brian Kingsbury, Michael Picheny, Samuel Thomas:
Simplified LSTMS for Speech Recognition. ASRU 2019: 547-553
[c85]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCFKSKP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCFKSKP19
Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung, Michael Picheny:
Distributed Deep Learning Strategies for Automatic Speech Recognition. ICASSP 2019: 5706-5710
[c84]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonTAK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonTAK19
George Saon, Zoltán Tüske, Kartik Audhkhasi, Brian Kingsbury:
Sequence Noise Injected Training for End-to-end Speech Recognition. ICASSP 2019: 6261-6265
[c83]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ThomasSHKTSKPDK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ThomasSHKTSKPDK19
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltán Tüske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko:
English Broadcast News Speech Recognition by Humans and Machines. ICASSP 2019: 6455-6459
[c82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PichenyTKACS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PichenyTKACS19
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. INTERSPEECH 2019: 326-330
[c81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AudhkhasiSTKP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiSTKP19
Kartik Audhkhasi, George Saon, Zoltán Tüske, Brian Kingsbury, Michael Picheny:
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition. INTERSPEECH 2019: 2618-2622
[c80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangCFSKBK0P19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangCFSKBK0P19
Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David S. Kung, Michael Picheny:
A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition. INTERSPEECH 2019: 2628-2632
[c79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeAS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeAS19
Zoltán Tüske, Kartik Audhkhasi, George Saon:
Advancing Sequence-to-Sequence Based Speech Recognition. INTERSPEECH 2019: 3780-3784
2018
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AudhkhasiKRSP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AudhkhasiKRSP18
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny:
Building Competitive Direct Acoustics-to-Word Models for English Conversational Speech Recognition. ICASSP 2018: 4759-4763
2017
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KurataRSS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KurataRSS17
Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy:
Language modeling with highway LSTM. ASRU 2017: 244-251
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CuiKRSSASNR17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CuiKRSSASNR17
Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Tom Sercu, Kartik Audhkhasi, Abhinav Sethy, Markus Nußbaum-Thom, Andrew Rosenberg:
Knowledge distillation across ensembles of multilingual models for low-resource languages. ICASSP 2017: 4825-4829
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SercuSCCRKS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SercuSCCRKS17
Tom Sercu, George Saon, Jia Cui, Xiaodong Cui, Bhuvana Ramabhadran, Brian Kingsbury, Abhinav Sethy:
Network architectures for multilingual speech representation learning. ICASSP 2017: 5295-5299
[c74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiGS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiGS17
Xiaodong Cui, Vaibhava Goel, George Saon:
Embedding-Based Speaker Adaptive Training of Deep Neural Networks. INTERSPEECH 2017: 122-126
[c73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonKSATDCRPLRH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonKSATDCRPLRH17
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. INTERSPEECH 2017: 132-136
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KurataSRS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KurataSRS17
Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, George Saon:
Empirical Exploration of Novel Architectures and Objectives for Language Models. INTERSPEECH 2017: 279-283
[c71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AudhkhasiRSPN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiRSPN17
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo:
Direct Acoustics-to-Word Models for English Conversational Speech Recognition. INTERSPEECH 2017: 959-963
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/sc/CongKGSZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sc/CongKGSZ17
Guojing Cong, Brian Kingsbury, Soumyadip Gosh, George Saon, Fan Zhou:
Accelerating deep neural network learning for speech recognition on a cluster of GPUs. MLHPC@SC 2017: 3:1-3:8
2016
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HawsDSTP16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HawsDSTP16
David Haws, Dimitrios Dimitriadis, George Saon, Samuel Thomas, Michael Picheny:
On the importance of event detection for ASR. ICASSP 2016: 5705-5709
[c68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonSRK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonSRK16
George Saon, Tom Sercu, Steven J. Rennie, Hong-Kwang Jeff Kuo:
The IBM 2016 English Conversational Telephone Speech Recognition System. INTERSPEECH 2016: 7-11
[c67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SuzukiTTRS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SuzukiTTRS16
Masayuki Suzuki, Ryuki Tachibana, Samuel Thomas, Bhuvana Ramabhadran, George Saon:
Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings. INTERSPEECH 2016: 1588-1592
2015
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ThomasSSN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ThomasSSN15
Samuel Thomas, George Saon, Maarten Van Segbroeck, Shrikanth S. Narayanan:
Improvements to the IBM speech activity detection system for the DARPA RATS program. ICASSP 2015: 4500-4504
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KeskarS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KeskarS15
Nitish Shirish Keskar, George Saon:
A nonmonotone learning rate strategy for SGD training of deep neural networks. ICASSP 2015: 4974-4978
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ManguSPK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ManguSPK15
Lidia Mangu, George Saon, Michael Picheny, Brian Kingsbury:
Order-free spoken term detection. ICASSP 2015: 5331-5335
[c63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonKRP15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonKRP15
George Saon, Hong-Kwang Jeff Kuo, Steven J. Rennie, Michael Picheny:
The IBM 2015 English conversational telephone speech recognition system. INTERSPEECH 2015: 3140-3144
[c62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ThomasSKM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ThomasSKM15
Samuel Thomas, George Saon, Hong-Kwang Jeff Kuo, Lidia Mangu:
The IBM BOLT speech transcription system. INTERSPEECH 2015: 3150-3153
[c61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiSRK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiSRK15
Jia Cui, George Saon, Bhuvana Ramabhadran, Brian Kingsbury:
A multi-region deep neural network model in speech recognition. INTERSPEECH 2015: 3244-3248
2014
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ThomasGSS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ThomasGSS14
Samuel Thomas, Sriram Ganapathy, George Saon, Hagen Soltau:
Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions. ICASSP 2014: 2519-2523
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonS14
George Saon, Hagen Soltau:
A comparison of two optimization techniques for sequence discriminative training of deep neural networks. ICASSP 2014: 5567-5571
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SoltauSS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SoltauSS14
Hagen Soltau, George Saon, Tara N. Sainath:
Joint training of convolutional and non-convolutional neural networks. ICASSP 2014: 5572-5576
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SainathKMSR14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathKMSR14
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George Saon, Bhuvana Ramabhadran:
Improvements to filterbank and delta learning within a deep neural network framework. ICASSP 2014: 6839-6843
[c56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonSEP14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonSEP14
George Saon, Hagen Soltau, Ahmad Emami, Michael Picheny:
Unfolded recurrent neural networks for speech recognition. INTERSPEECH 2014: 343-347
[c55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SainathCRPGKSAC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathCRPGKSAC14
Tara N. Sainath, I-Hsin Chung, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Brian Kingsbury, George Saon, Vernon Austel, Upendra V. Chaudhari:
Parallel deep neural network training for LVCSR tasks using blue gene/Q. INTERSPEECH 2014: 1048-1052
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Saon14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Saon14
George Saon:
A distributed architecture for fast SGD sequence discriminative training of DNN acoustic models. SLT 2014: 183-188
2013
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SaonSNP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaonSNP13
George Saon, Hagen Soltau, David Nahamoo, Michael Picheny:
Speaker adaptation of neural network acoustic models using i-vectors. ASRU 2013: 55-59
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ManguSKS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ManguSKS13
Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, George Saon:
The IBM keyword search system for the DARPA RATS program. ASRU 2013: 204-209
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SainathKMDSSBAR13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to Deep Convolutional Neural Networks for LVCSR. ASRU 2013: 315-320
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ManguSKKS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ManguSKKS13
Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, Brian Kingsbury, George Saon:
Exploiting diversity for spoken term detection. ICASSP 2013: 8282-8286
[c49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SoltauKMSB13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SoltauKMSB13
Hagen Soltau, Hong-Kwang Kuo, Lidia Mangu, George Saon, Tomás Beran:
Neural network acoustic models for the DARPA RATS program. INTERSPEECH 2013: 3092-3096
[c48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonTSGK13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonTSGK13
George Saon, Samuel Thomas, Hagen Soltau, Sriram Ganapathy, Brian Kingsbury:
The IBM speech activity detection system for the DARPA RATS program. INTERSPEECH 2013: 3497-3501
2012
[c47]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/SaonC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/SaonC12
George Saon, Jen-Tzung Chien:
Recent developments in large vocabulary continuous speech recognition. APSIPA 2012: 1-6
[c46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonK12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonK12
George Saon, Brian Kingsbury:
Discriminative feature-space transforms using deep neural networks. INTERSPEECH 2012: 14-17
[c45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiASG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiASG12
Xiaodong Cui, Mohamed Afify, George Saon, Vaibhava Goel:
Sparse Bayesian Factor Analysis for Stereo-based Stochastic Mapping. INTERSPEECH 2012: 795-798
2011
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SaonC11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaonC11
George Saon, Jen-Tzung Chien:
Some properties of Bayesian sensing hidden Markov models. ASRU 2011: 65-70
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KuoAMS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KuoAMS11
Hong-Kwang Jeff Kuo, Ebru Arisoy, Lidia Mangu, George Saon:
Minimum Bayes risk discriminative language models for Arabic speech recognition. ASRU 2011: 208-213
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ManguKCKSSB11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ManguKCKSSB11
Lidia Mangu, Hong-Kwang Kuo, Stephen M. Chu, Brian Kingsbury, George Saon, Hagen Soltau, Fadi Biadsy:
The IBM 2011 GALE Arabic speech transcription system. ASRU 2011: 272-277
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KingsburySSCKMRMJ11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KingsburySSCKMRMJ11
Brian Kingsbury, Hagen Soltau, George Saon, Stephen M. Chu, Hong-Kwang Kuo, Lidia Mangu, Suman V. Ravuri, Nelson Morgan, Adam Janin:
The IBM 2009 GALE Arabic speech transcription system. ICASSP 2011: 4672-4675
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonC11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonC11
George Saon, Jen-Tzung Chien:
Bayesian sensing hidden Markov models for speech recognition. ICASSP 2011: 5056-5059
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonC11a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonC11a
George Saon, Jen-Tzung Chien:
Discriminative training for Bayesian sensing hidden Markov models. ICASSP 2011: 5316-5319
[c38]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mlslp/SaonC11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlslp/SaonC11
George Saon, Jen-Tzung Chien:
Bayesian sensing hidden Markov models for speech recognition. MLSLP 2011
2010
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonSCCKKMP10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonSCCKKMP10
George Saon, Hagen Soltau, Upendra V. Chaudhari, Stephen M. Chu, Brian Kingsbury, Hong-Kwang Kuo, Lidia Mangu, Daniel Povey:
The IBM 2008 GALE Arabic speech transcription system. ICASSP 2010: 4378-4381
[c36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonS10
George Saon, Hagen Soltau:
Boosting systems for LVCSR. INTERSPEECH 2010: 1341-1344
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SoltauSK10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SoltauSK10
Hagen Soltau, George Saon, Brian Kingsbury:
The IBM Attila speech recognition toolkit. SLT 2010: 97-102
2009
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SoltauS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SoltauS09
Hagen Soltau, George Saon:
Dynamic network decoding revisited. ASRU 2009: 276-281
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonPS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonPS09
George Saon, Daniel Povey, Hagen Soltau:
Large margin semi-tied covariance transforms for discriminative training. ICASSP 2009: 3753-3756
2008
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PoveyKKRSV08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PoveyKKRSV08
Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Karthik Visweswariah:
Boosted MMI for model and feature-space discriminative training. ICASSP 2008: 4057-4060
[c31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonP08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonP08
George Saon, Daniel Povey:
Penalty function maximization for large margin HMM training. INTERSPEECH 2008: 920-923
2007
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SaonP07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaonP07
George Saon, Michael Picheny:
Lattice-based Viterbi decoding techniques for speech translation. ASRU 2007: 386-389
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SoltauSKKMPZ07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SoltauSKKMPZ07
Hagen Soltau, George Saon, Brian Kingsbury, Hong-Kwang Jeff Kuo, Lidia Mangu, Daniel Povey, Geoffrey Zweig:
The IBM 2006 Gale Arabic ASR System. ICASSP (4) 2007: 349-352
2006
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Saon06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Saon06
George Saon:
A Non-Linear Speaker Adaptation Technique using Kernel Ridge Regression. ICASSP (1) 2006: 225-228
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZweigSSRPMK06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZweigSSRPMK06
Geoffrey Zweig, Olivier Siohan, George Saon, Bhuvana Ramabhadran, Daniel Povey, Lidia Mangu, Brian Kingsbury:
Automated Quality Monitoring in the Call Center with ASR and Maximum Entropy. ICASSP (1) 2006: 589-592
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PoveyS06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PoveyS06
Daniel Povey, George Saon:
Feature and model space speaker adaptation with full covariance Gaussians. INTERSPEECH 2006
[c25]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/naacl/ZweigSSRPMK06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/ZweigSSRPMK06
Geoffrey Zweig, Olivier Siohan, George Saon, Bhuvana Ramabhadran, Daniel Povey, Lidia Mangu, Brian Kingsbury:
Automated Quality Monitoring for Call Centers using Speech and NLP Technologies. HLT-NAACL 2006
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SaonRZ06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SaonRZ06
George Saon, Bhuvana Ramabhadran, Geoffrey Zweig:
On the Effect Ofword Error Rate on Automated Quality Monitoring. SLT 2006: 106-109
2005
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SoltauKMPSZ05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SoltauKMPSZ05
Hagen Soltau, Brian Kingsbury, Lidia Mangu, Daniel Povey, George Saon, Geoffrey Zweig:
The IBM 2004 Conversational Telephony System for Rich Transcription. ICASSP (1) 2005: 205-208
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PoveyKMSSZ05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PoveyKMSSZ05
Daniel Povey, Brian Kingsbury, Lidia Mangu, George Saon, Hagen Soltau, Geoffrey Zweig:
fMPE: Discriminatively Trained Features for Speech Recognition. ICASSP (1) 2005: 961-964
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonPZ05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonPZ05
George Saon, Daniel Povey, Geoffrey Zweig:
Anatomy of an extremely fast LVCSR decoder. INTERSPEECH 2005: 549-552
2004
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SarikayaGS04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SarikayaGS04
Ruhi Sarikaya, Yuqing Gao, George Saon:
Fractional Fourier transform features for speech recognition. ICASSP (1) 2004: 52
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonDP04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonDP04
George Saon, Satya Dharanipragada, Daniel Povey:
Feature space Gaussianization. ICASSP (1) 2004: 329-332
2003
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KingsburyMSZAGVP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KingsburyMSZAGVP03
Brian Kingsbury, Lidia Mangu, George Saon, Geoffrey Zweig, Scott Axelrod, Vaibhava Goel, Karthik Visweswariah, Michael Picheny:
Toward domain-independent conversational speech recognition. INTERSPEECH 2003: 1881-1884
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonZKMC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonZKMC03
George Saon, Geoffrey Zweig, Brian Kingsbury, Lidia Mangu, Upendra V. Chaudhari:
An architecture for rapid decoding of large vocabulary conversational speech. INTERSPEECH 2003: 1977-1980
2002
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FineSG02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FineSG02
Shai Fine, George Saon, Ramesh A. Gopinath:
Digit recognition in noisy environments via a sequential GMM/SVM system. ICASSP 2002: 49-52
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KingsburySMPS02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KingsburySMPS02
Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan, Ruhi Sarikaya:
Robust speech recognition in Noisy Environments: The 2001 IBM spine evaluation system. ICASSP 2002: 53-56
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZweigSY02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZweigSY02
Geoffrey Zweig, George Saon, François Yvon:
Arc minimization in finite state decoding graphs with cross-word acoustic context. INTERSPEECH 2002: 389-392
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonH02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonH02
George Saon, Juan M. Huerta:
Improvements to the IBM Aurora 2 multi-condition system. INTERSPEECH 2002: 469-472
2001
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonZP01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonZP01
George Saon, Geoffrey Zweig, Mukund Padmanabhan:
Linear feature space projections for speaker adaptation. ICASSP 2001: 325-328
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AaronCCDEFLLMMMNOPPRSSTVY01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AaronCCDEFLLMMMNOPPRSSTVY01
Andrew Aaron, Scott Saobing Chen, Paul S. Cohen, Satya Dharanipragada, Ellen Eide, Martin Franz, Jean-Michel LeRoux, X. Luo, Benoît Maison, Lidia Mangu, T. Mathes, Miroslav Novak, Peder A. Olsen, Michael Picheny, Harry Printz, Bhuvana Ramabhadran, Andrej Sakrajda, George Saon, Borivoj Tydlitát, Karthik Visweswariah, D. Yuk:
Speech recognition for DARPA Communicator. ICASSP 2001: 489-492
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonHJ01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonHJ01
George Saon, Juan M. Huerta, Ea-Ee Jan:
Robust digit recognition in noisy environments: the IBM Aurora 2 system. INTERSPEECH 2001: 629-632
2000
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonPGC00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonPGC00
George Saon, Mukund Padmanabhan, Ramesh A. Gopinath, Scott Saobing Chen:
Maximum likelihood discriminant feature spaces. ICASSP 2000: 1129-1132
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonP00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonP00
George Saon, Mukund Padmanabhan:
Minimum Bayes error feature selection. INTERSPEECH 2000: 75-78
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangKMPSZ00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangKMPSZ00
Jing Huang, Brian Kingsbury, Lidia Mangu, Mukund Padmanabhan, George Saon, Geoffrey Zweig:
Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard). INTERSPEECH 2000: 338-341
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JanOSR00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JanOSR00
Ea-Ee Jan, Jaime Botella Ordinas, George Saon, Salim Roukos:
Real-time multilingual HMM training robust to channel variations. INTERSPEECH 2000: 925-928
[c5]
- view
- export record
  dblp key:
  - conf/nips/SaonP00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SaonP00
George Saon, Mukund Padmanabhan:
Minimum Bayes Error Feature Selection for Continuous Speech Recognition. NIPS 2000: 800-806
1999
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/interspeech/PadmanabhanSBHZ99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PadmanabhanSBHZ99
Mukund Padmanabhan, George Saon, Sankar Basu, Jing Huang, Geoffrey Zweig:
Recent improvements in voicemail transcription. EUROSPEECH 1999: 503-506
1997
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonB97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonB97
George Saon, Abdel Belaïd:
Binary pattern recognition using Markov random fields and HMMs. ICASSP 1997: 3725-3728
1995
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icdar/SaonBG95
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdar/SaonBG95
George Saon, Abdel Belaïd, Yifan Gong:
Stochastic trajectory modeling for recognition of unconstrained handwritten words. ICDAR 1995: 508-511
1994
[c1]
- view
  - electronic edition @ u-tokyo.ac.jp (open access)
  - details & citations
- export record
  dblp key:
  - conf/mva/SaonBG94
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mva/SaonBG94
George Saon, Abdel Belaïd, Yifan Gong:
Off-line Handwriting Recognition by Statistical Correlation. MVA 1994: 371-374

Parts in Books or Collections

see FAQ

What is the meaning of the colors in the publication lists?

2014
[p1]
- view
  authority control:
- export record
  dblp key:
  - series/tanlp/SoltauSMKKCB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/series/tanlp/SoltauSMKKCB14
Hagen Soltau, George Saon, Lidia Mangu, Hong-Kwang Kuo, Brian Kingsbury, Stephen M. Chu, Fadi Biadsy:
Automatic Speech Recognition. NLP of Semitic Languages 2014: 409-459

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-00235
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-00235
Ankit Gupta, George Saon, Brian Kingsbury:
Exploring the limits of decoder-only models trained on public speech recognition corpora. CoRR abs/2402.00235 (2024)
2023
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14120
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14120
George Saon, Ankit Gupta, Xiaodong Cui:
Diagonal State Space Augmented Transformers for Speech Recognition. CoRR abs/2302.14120 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-04031
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-04031
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon:
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems. CoRR abs/2309.04031 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10926
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10926
Siddhant Arora, George Saon, Shinji Watanabe, Brian Kingsbury:
Semi-Autoregressive Streaming ASR With Label Context. CoRR abs/2309.10926 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-12727
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-12727
Xiaodong Cui, Ashish R. Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury:
Soft Random Sampling: A Theoretical and Empirical Analysis. CoRR abs/2311.12727 (2023)
2022
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-12105
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-12105
Hong-Kwang Jeff Kuo, Zoltán Tüske, Samuel Thomas, Brian Kingsbury, George Saon:
Improving End-to-End Models for Set Prediction in Spoken Language Understanding. CoRR abs/2201.12105 (2022)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-13155
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-13155
Samuel Thomas, Brian Kingsbury, George Saon, Hong-Kwang Jeff Kuo:
Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models. CoRR abs/2202.13155 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-00006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-00006
Samuel Thomas, Hong-Kwang Jeff Kuo, Brian Kingsbury, George Saon:
Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems. CoRR abs/2203.00006 (2022)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15176
Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata:
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. CoRR abs/2203.15176 (2022)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-00212
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-00212
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon:
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems. CoRR abs/2204.00212 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07882
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07882
Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan:
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization. CoRR abs/2206.07882 (2022)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-13965
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-13965
Zvi Kons, Hagai Aronowitz, Edmilson da Silva Morais, Matheus Damasceno, Hong-Kwang Kuo, Samuel Thomas, George Saon:
Extending RNN-T-based speech recognition systems with emotion and language classification. CoRR abs/2207.13965 (2022)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-01818
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-01818
Jiatong Shi, George Saon, David Haws, Shinji Watanabe, Brian Kingsbury:
VQ-T: RNN Transducers using Vector-Quantized Prediction Network States. CoRR abs/2208.01818 (2022)
2021
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-09935
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-09935
George Saon, Zoltán Tüske, Daniel Bolaños, Brian Kingsbury:
Advancing RNN Transducer Technology for Speech Recognition. CoRR abs/2103.09935 (2021)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-03842
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-03842
Samuel Thomas, Hong-Kwang Jeff Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory:
RNN Transducer Models For Spoken Language Understanding. CoRR abs/2104.03842 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-00982
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-00982
Zoltán Tüske, George Saon, Brian Kingsbury:
On the limit of English conversational speech recognition. CoRR abs/2105.00982 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-08405
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-08405
Jatin Ganhotra, Samuel Thomas, Hong-Kwang Jeff Kuo, Sachindra Joshi, George Saon, Zoltán Tüske, Brian Kingsbury:
Integrating Dialog History into End-to-End Spoken Language Understanding Systems. CoRR abs/2108.08405 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-10803
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-10803
Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltán Tüske:
Reducing Exposure Bias in Training Recurrent Neural Network Transducers. CoRR abs/2108.10803 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-12074
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-12074
Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan:
4-bit Quantization of LSTM-based Speech Recognition Models. CoRR abs/2108.12074 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-02743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-02743
Thomas Bohnstingl, Ayush Garg, Stanislaw Wozniak, George Saon, Evangelos Eleftheriou, Angeliki Pantazi:
Towards efficient end-to-end speech recognition with biologically-inspired neural networks. CoRR abs/2110.02743 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-11199
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-11199
Xiaodong Cui, Wei Zhang, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung:
Asynchronous Decentralized Distributed Training of Acoustic Models. CoRR abs/2110.11199 (2021)
2020
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-07263
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-07263
Zoltán Tüske, George Saon, Kartik Audhkhasi, Brian Kingsbury:
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300. CoRR abs/2001.07263 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-01119
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-01119
Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David S. Kung, Michael Picheny:
Improving Efficiency in Large-Scale Decentralized Distributed Training. CoRR abs/2002.01119 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-10502
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-10502
Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David S. Kung:
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition. CoRR abs/2002.10502 (2020)
2019
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-04956
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04956
Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung, Michael Picheny:
Distributed Deep Learning Strategies For Automatic Speech Recognition. CoRR abs/1904.04956 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-13258
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-13258
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltán Tüske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko:
English Broadcast News Speech Recognition by Humans and Machines. CoRR abs/1904.13258 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-05701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-05701
Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David S. Kung, Michael Picheny:
A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition. CoRR abs/1907.05701 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-03455
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-03455
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. CoRR abs/1908.03455 (2019)
2017
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SaonKSATDCRPLRH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SaonKSATDCRPLRH17
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. CoRR abs/1703.02136 (2017)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/AudhkhasiRSPN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AudhkhasiRSPN17
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo:
Direct Acoustics-to-Word Models for English Conversational Speech Recognition. CoRR abs/1703.07754 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1709-06436
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-06436
Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy:
Language Modeling with Highway LSTM. CoRR abs/1709.06436 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-06937
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-06937
Xiaodong Cui, Vaibhava Goel, George Saon:
Embedding-Based Speaker Adaptive Training of Deep Neural Networks. CoRR abs/1710.06937 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1712-03133
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-03133
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny:
Building competitive direct acoustics-to-word models for English conversational speech recognition. CoRR abs/1712.03133 (2017)
2016
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SaonSRK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SaonSRK16
George Saon, Tom Sercu, Steven J. Rennie, Hong-Kwang Jeff Kuo:
The IBM 2016 English Conversational Telephone Speech Recognition System. CoRR abs/1604.08242 (2016)
2015
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SaonKRP15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SaonKRP15
George Saon, Hong-Kwang Jeff Kuo, Steven J. Rennie, Michael Picheny:
The IBM 2015 English Conversational Telephone Speech Recognition System. CoRR abs/1505.05899 (2015)
2013
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SainathKMDSSBAR13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to deep convolutional neural networks for LVCSR. CoRR abs/1309.1501 (2013)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.