


Остановите войну!
for scientists:
Daniel Povey
Person information

- affiliation (former): Johns Hopkins University, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2021
- [j11]Hang Lv
, Daniel Povey, Mahsa Yarmohammadi, Ke Li, Yiming Wang
, Lei Xie, Sanjeev Khudanpur
:
LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation. IEEE Signal Process. Lett. 28: 703-707 (2021) - [c145]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. ICASSP 2021: 5864-5868 - [c144]Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder for Automatic Speech Recognition. ICASSP 2021: 6019-6023 - [c143]Ke Li, Daniel Povey, Sanjeev Khudanpur:
A Parallelizable Lattice Rescoring Strategy with Neural Language Models. ICASSP 2021: 6518-6522 - [c142]Guoguo Chen, Shuzhou Chai, Guan-Bo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Zhao You, Zhiyong Yan:
GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio. Interspeech 2021: 3670-3674 - [c141]Junbo Zhang, Zhiwen Zhang, Yongqing Wang, Zhiyong Yan, Qiong Song, Yukai Huang, Ke Li, Daniel Povey, Yujun Wang:
speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment. Interspeech 2021: 3710-3714 - [c140]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs. SLT 2021: 881-888 - [i16]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. CoRR abs/2102.04488 (2021) - [i15]Ke Li, Daniel Povey, Sanjeev Khudanpur:
A Parallelizable Lattice Rescoring Strategy with Neural Language Models. CoRR abs/2103.05081 (2021) - [i14]Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder For Automatic Speech Recognition. CoRR abs/2103.09063 (2021) - [i13]Junbo Zhang, Zhiwen Zhang, Yongqing Wang, Zhiyong Yan, Qiong Song, Yukai Huang, Ke Li, Daniel Povey, Yujun Wang:
speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment. CoRR abs/2104.01378 (2021) - [i12]Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan:
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio. CoRR abs/2106.06909 (2021) - [i11]Piotr Zelasko, Daniel Povey, Jan "Yenda" Trmal, Sanjeev Khudanpur:
Lhotse: a speech data representation library for the modern deep learning ecosystem. CoRR abs/2110.12561 (2021) - 2020
- [c139]Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR. ICASSP 2020: 6334-6338 - [c138]Zili Huang, Shinji Watanabe
, Yusuke Fujita, Paola García, Yiwen Shao, Daniel Povey, Sanjeev Khudanpur:
Speaker Diarization with Region Proposal Network. ICASSP 2020: 6514-6518 - [c137]Hugo Braun, Justin Luitjens, Ryan Leary, Tim Kaldewey, Daniel Povey:
Gpu-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition. ICASSP 2020: 7874-7878 - [c136]Ke Li, Zhe Liu, Tianxing He, Hongzhao Huang, Fuchun Peng, Daniel Povey, Sanjeev Khudanpur:
An Empirical Study of Transformer-Based Neural Language Model Adaptation. ICASSP 2020: 7934-7938 - [c135]Yiwen Shao, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR. INTERSPEECH 2020: 561-565 - [c134]Pegah Ghahramani, Hossein Hadian, Daniel Povey, Hynek Hermansky, Sanjeev Khudanpur:
An Alternative to MFCCs for ASR. INTERSPEECH 2020: 1664-1667 - [c133]Ke Li, Daniel Povey, Sanjeev Khudanpur:
Neural Language Modeling with Implicit Cache Pointers. INTERSPEECH 2020: 3625-3629 - [c132]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Alignment-Free Lattice-Free MMI. INTERSPEECH 2020: 4258-4262 - [c131]Srikanth R. Madikeri, Banriskhem K. Khonglah, Sibo Tong, Petr Motlícek, Hervé Bourlard, Daniel Povey:
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems. INTERSPEECH 2020: 4746-4750 - [c130]Ruizhe Huang, Ke Li, Ashish Arora, Daniel Povey, Sanjeev Khudanpur:
Efficient MDI Adaptation for n-Gram Language Models. INTERSPEECH 2020: 4916-4920 - [i10]Zili Huang, Shinji Watanabe, Yusuke Fujita, Paola García, Yiwen Shao, Daniel Povey, Sanjeev Khudanpur:
Speaker Diarization with Region Proposal Network. CoRR abs/2002.06220 (2020) - [i9]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Alignment-Free Lattice-Free MMI. CoRR abs/2005.08347 (2020) - [i8]Yiwen Shao, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR. CoRR abs/2005.09824 (2020) - [i7]Ruizhe Huang, Ke Li, Ashish Arora, Daniel Povey, Sanjeev Khudanpur:
Efficient MDI Adaptation for n-gram Language Models. CoRR abs/2008.02385 (2020) - [i6]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs. CoRR abs/2011.01997 (2020) - [i5]Desh Raj, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur:
Frustratingly Easy Noise-aware Training of Acoustic Models. CoRR abs/2011.02090 (2020)
2010 – 2019
- 2019
- [c129]Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur:
Incremental Lattice Determinization for WFST Decoders. ASRU 2019: 1-7 - [c128]Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur:
Probing the Information Encoded in X-Vectors. ASRU 2019: 726-733 - [c127]David Snyder, Daniel Garcia-Romero, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
Speaker Recognition for Multi-speaker Conversations Using X-vectors. ICASSP 2019: 5796-5800 - [c126]Chun-Chieh Chang, Ashish Arora, Leibny Paola García-Perera, David Etter, Daniel Povey, Sanjeev Khudanpur:
Optical Character Recognition with Chinese and Korean Character Decomposition. WML@ICDAR 2019: 134-139 - [c125]Ashish Arora, Paola García, Shinji Watanabe
, Vimal Manohar, Yiwen Shao, Sanjeev Khudanpur, Chun-Chieh Chang, Babak Rekabdar, Bagher BabaAli, Daniel Povey, David Etter, Desh Raj, Hossein Hadian
, Jan Trmal:
Using ASR Methods for OCR. ICDAR 2019: 663-668 - [c124]Fei Wu, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network. INTERSPEECH 2019: 1-5 - [c123]Jiamin Xie, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
Multi-PLDA Diarization on Children's Speech. INTERSPEECH 2019: 376-380 - [c122]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, Najim Dehak
:
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. INTERSPEECH 2019: 1488-1492 - [c121]Daniel Garcia-Romero, David Snyder, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition. INTERSPEECH 2019: 1493-1496 - [c120]Daniel Garcia-Romero, David Snyder, Shinji Watanabe
, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
Speaker Recognition Benchmark Using the CHiME-5 Corpus. INTERSPEECH 2019: 1506-1510 - [c119]David Snyder, Jesús Villalba, Nanxin Chen, Daniel Povey, Gregory Sell, Najim Dehak
, Sanjeev Khudanpur:
The JHU Speaker Recognition System for the VOiCES 2019 Challenge. INTERSPEECH 2019: 2468-2472 - [c118]Yiming Wang, David Snyder, Hainan Xu, Vimal Manohar, Phani Sankar Nidadavolu, Daniel Povey, Sanjeev Khudanpur:
The JHU ASR System for VOiCES from a Distance Challenge 2019. INTERSPEECH 2019: 2488-2492 - [c117]Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, Najim Dehak
:
Improving Emotion Identification Using Phone Posteriors in Raw Speech Waveform Based DNN. INTERSPEECH 2019: 3925-3929 - [c116]Mahsa Yarmohammadi, Xutai Ma, Sorami Hisamoto, Muhammad Rahman, Yiming Wang, Hainan Xu, Daniel Povey, Philipp Koehn, Kevin Duh:
Robust Document Representations for Cross-Lingual Information Retrieval in Low-Resource Settings. MTSummit (1) 2019: 12-20 - [i4]Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur:
Probing the Information Encoded in x-vectors. CoRR abs/1909.06351 (2019) - 2018
- [j10]Vijayaditya Peddinti, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs. IEEE Signal Process. Lett. 25(3): 373-377 (2018) - [j9]Hossein Hadian
, Hossein Sameti
, Daniel Povey, Sanjeev Khudanpur:
Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 1949-1961 (2018) - [c115]Zili Huang, L. Paola García-Perera, Jesús Villalba, Daniel Povey, Najim Dehak:
JHU Diarization System Description. IberSPEECH 2018: 236-239 - [c114]Vimal Manohar, Hossein Hadian
, Daniel Povey, Sanjeev Khudanpur:
Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI. ICASSP 2018: 4844-4848 - [c113]David Snyder, Daniel Garcia-Romero, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:
X-Vectors: Robust DNN Embeddings for Speaker Recognition. ICASSP 2018: 5329-5333 - [c112]Daniel Povey, Hossein Hadian
, Pegah Ghahremani, Ke Li, Sanjeev Khudanpur:
A Time-Restricted Self-Attention Layer for ASR. ICASSP 2018: 5874-5878 - [c111]Hainan Xu, Tongfei Chen, Dongji Gao, Yiming Wang, Ke Li, Nagendra Goel, Yishay Carmiel, Daniel Povey, Sanjeev Khudanpur:
A Pruned Rnnlm Lattice-Rescoring Algorithm for Automatic Speech Recognition. ICASSP 2018: 5929-5933 - [c110]Hainan Xu, Ke Li, Yiming Wang, Jian Wang, Shiyin Kang, Xie Chen, Daniel Povey, Sanjeev Khudanpur:
Neural Network Language Modeling with Letter-Based Features and Importance Sampling. ICASSP 2018: 6109-6113 - [c109]Hossein Hadian
, Hossein Sameti, Daniel Povey, Sanjeev Khudanpur:
End-to-end Speech Recognition Using Lattice-free MMI. INTERSPEECH 2018: 12-16 - [c108]Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur, Najim Dehak
:
End-to-end Deep Neural Network Age Estimation. INTERSPEECH 2018: 277-281 - [c107]Pegah Ghahremani, Hossein Hadian
, Hang Lv, Daniel Povey, Sanjeev Khudanpur:
Acoustic Modeling from Frequency Domain Representations of Speech. INTERSPEECH 2018: 1596-1600 - [c106]Gaofeng Cheng, Daniel Povey, Lu Huang, Ji Xu, Sanjeev Khudanpur, Yonghong Yan:
Output-Gate Projected Gated Recurrent Unit for Speech Recognition. INTERSPEECH 2018: 1793-1797 - [c105]Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
A GPU-based WFST Decoder with Exact Lattice Generation. INTERSPEECH 2018: 2212-2216 - [c104]Gregory Sell, David Snyder, Alan McCree, Daniel Garcia-Romero, Jesús Villalba, Matthew Maciejewski, Vimal Manohar, Najim Dehak
, Daniel Povey, Shinji Watanabe
, Sanjeev Khudanpur:
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. INTERSPEECH 2018: 2808-2812 - [c103]Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, Najim Dehak
:
Emotion Identification from Raw Speech Signals Using DNNs. INTERSPEECH 2018: 3097-3101 - [c102]Ke Li, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition. INTERSPEECH 2018: 3373-3377 - [c101]Yingke Zhu, Tom Ko, David Snyder, Brian Mak, Daniel Povey:
Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification. INTERSPEECH 2018: 3573-3577 - [c100]Daniel Povey, Gaofeng Cheng, Yiming Wang, Ke Li, Hainan Xu, Mahsa Yarmohammadi, Sanjeev Khudanpur:
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks. INTERSPEECH 2018: 3743-3747 - [c99]David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:
Spoken Language Recognition using X-vectors. Odyssey 2018: 105-111 - [c98]Hossein Hadian
, Daniel Povey, Hossein Sameti, Jan Trmal, Sanjeev Khudanpur:
Improving LF-MMI Using Unconstrained Supervisions for ASR. SLT 2018: 43-47 - [c97]Vimal Manohar, Pegah Ghahremani, Daniel Povey, Sanjeev Khudanpur:
A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models. SLT 2018: 250-257 - [i3]Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
A GPU-based WFST Decoder with Exact Lattice Generation. CoRR abs/1804.03243 (2018) - 2017
- [c96]Pegah Ghahremani, Vimal Manohar, Hossein Hadian
, Daniel Povey, Sanjeev Khudanpur:
Investigation of transfer learning for ASR using LF-MMI trained neural networks. ASRU 2017: 279-286 - [c95]Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning. ASRU 2017: 346-352 - [c94]Daniel Garcia-Romero, David Snyder, Gregory Sell, Daniel Povey, Alan McCree:
Speaker diarization using deep neural network embeddings. ICASSP 2017: 4930-4934 - [c93]Tom Ko
, Vijayaditya Peddinti, Daniel Povey, Michael L. Seltzer, Sanjeev Khudanpur:
A study on data augmentation of reverberant speech for robust speech recognition. ICASSP 2017: 5220-5224 - [c92]Hossein Hadian, Daniel Povey, Hossein Sameti, Sanjeev Khudanpur:
Phone Duration Modeling for LVCSR Using Neural Networks. INTERSPEECH 2017: 518-522 - [c91]David Snyder, Daniel Garcia-Romero, Daniel Povey, Sanjeev Khudanpur:
Deep Neural Network Embeddings for Text-Independent Speaker Verification. INTERSPEECH 2017: 999-1003 - [c90]Gaofeng Cheng, Vijayaditya Peddinti, Daniel Povey, Vimal Manohar, Sanjeev Khudanpur, Yonghong Yan:
An Exploration of Dropout with LSTMs. INTERSPEECH 2017: 1586-1590 - [c89]Yiming Wang, Vijayaditya Peddinti, Hainan Xu, Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
Backstitch: Counteracting Finite-Sample Bias via Negative Steps. INTERSPEECH 2017: 1631-1635 - [c88]Xiaohui Zhang, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework. INTERSPEECH 2017: 2541-2545 - [c87]Jan Trmal, Matthew Wiesner, Vijayaditya Peddinti, Xiaohui Zhang, Pegah Ghahremani, Yiming Wang, Vimal Manohar, Hainan Xu, Daniel Povey, Sanjeev Khudanpur:
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. INTERSPEECH 2017: 3597-3601 - [i2]Xiaohui Zhang, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework. CoRR abs/1706.03747 (2017) - 2016
- [c86]Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Acoustic data-driven pronunciation lexicon generation for logographic languages. ICASSP 2016: 5350-5354 - [c85]Vijayaditya Peddinti, Vimal Manohar, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
Far-Field ASR Without Parallel Data. INTERSPEECH 2016: 1996-2000 - [c84]Daniel Povey, Vijayaditya Peddinti, Daniel Galvez, Pegah Ghahremani, Vimal Manohar, Xingyu Na, Yiming Wang, Sanjeev Khudanpur:
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI. INTERSPEECH 2016: 2751-2755 - [c83]Pegah Ghahremani, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic Modelling from the Signal Domain Using CNNs. INTERSPEECH 2016: 3434-3438 - [c82]David Snyder, Pegah Ghahremani, Daniel Povey, Daniel Garcia-Romero, Yishay Carmiel, Sanjeev Khudanpur:
Deep neural network-based speaker embeddings for end-to-end speaker verification. SLT 2016: 165-170 - 2015
- [c81]David Snyder, Daniel Garcia-Romero, Daniel Povey:
Time delay deep neural network-based universal background models for speaker recognition. ASRU 2015: 92-97 - [c80]Vijayaditya Peddinti, Guoguo Chen, Vimal Manohar, Tom Ko
, Daniel Povey, Sanjeev Khudanpur:
JHU ASpIRE system: Robust LVCSR with TDNNS, iVector adaptation and RNN-LMS. ASRU 2015: 539-546 - [c79]Gaurav Kumar, Graeme W. Blackwood, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation. EMNLP 2015: 1902-1907 - [c78]Vassil Panayotov, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Librispeech: An ASR corpus based on public domain audio books. ICASSP 2015: 5206-5210 - [c77]Guoguo Chen, Hainan Xu, Minhua Wu, Daniel Povey, Sanjeev Khudanpur:
Pronunciation and silence probability modeling for ASR. INTERSPEECH 2015: 533-537 - [c76]Hainan Xu, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Modeling phonetic context with non-random forests for speech recognition. INTERSPEECH 2015: 2117-2121 - [c75]Vijayaditya Peddinti, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur:
Reverberation robust acoustic modeling using i-vectors with time delay neural networks. INTERSPEECH 2015: 2440-2444 - [c74]Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Semi-supervised maximum mutual information training of deep neural network acoustic models. INTERSPEECH 2015: 2630-2634 - [c73]Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur:
A time delay neural network architecture for efficient modeling of long temporal contexts. INTERSPEECH 2015: 3214-3218 - [c72]Tom Ko, Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur:
Audio augmentation for speech recognition. INTERSPEECH 2015: 3586-3589 - [c71]Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
A diversity-penalizing ensemble training method for deep learning. INTERSPEECH 2015: 3590-3594 - [c70]Daniel Povey, Xiaohui Zhang, Sanjeev Khudanpur:
Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging. ICLR (Workshop) 2015 - [i1]David Snyder, Guoguo Chen, Daniel Povey:
MUSAN: A Music, Speech, and Noise Corpus. CoRR abs/1510.08484 (2015) - 2014
- [c69]Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Improving deep neural network acoustic models using generalized maxout networks. ICASSP 2014: 215-219 - [c68]Pegah Ghahremani, Bagher BabaAli, Daniel Povey, Korbinian Riedhammer, Jan Trmal, Sanjeev Khudanpur:
A pitch extraction algorithm tuned for automatic speech recognition. ICASSP 2014: 2494-2498 - [c67]Gaurav Kumar, Matt Post, Daniel Povey, Sanjeev Khudanpur:
Some insights from translating conversational telephone speech. ICASSP 2014: 3231-3235 - [c66]Ngoc Thang Vu, David Imseng, Daniel Povey, Petr Motlícek
, Tanja Schultz
, Hervé Bourlard:
Multilingual deep neural network based acoustic modeling for rapid language adaptation. ICASSP 2014: 7639-7643 - [c65]David Nolden, Hagen Soltau, Daniel Povey, Pegah Ghahremani, Lidia Mangu, Hermann Ney:
Removing redundancy from lattices. INTERSPEECH 2014: 656-660 - [c64]Justin T. Chiu, Yun Wang, Jan Trmal, Daniel Povey, Guoguo Chen, Alexander I. Rudnicky:
Combination of FST and CN search in spoken term detection. INTERSPEECH 2014: 2784-2788 - [c63]Gaurav Kumar, Yuan Cao, Ryan Cotterell, Chris Callison-Burch, Daniel Povey, Sanjeev Khudanpur:
Translations of the Callhome Egyptian Arabic corpus for conversational speech translation. IWSLT 2014 - [c62]Daniel Garcia-Romero, Xiaohui Zhang, Alan McCree, Daniel Povey:
Improving speaker recognition performance in the domain adaptation challenge using deep neural networks. SLT 2014: 378-383 - [c61]Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze:
A keyword search system using open source software. SLT 2014: 530-535 - 2013
- [c60]Guoguo Chen, Oguz Yilmaz, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Using proxies for OOV keywords in the keyword search task. ASRU 2013: 416-421 - [c59]Mirko Hannemann, Daniel Povey, Geoffrey Zweig:
Combining forward and backward search in decoding. ICASSP 2013: 6739-6743 - [c58]Petr Motlícek
, Daniel Povey, Martin Karafiát
:
Feature and score level combination of subspace Gaussinas in LVCSR task. ICASSP 2013: 7604-7608 - [c57]Guoguo Chen, Sanjeev Khudanpur, Daniel Povey, Jan Trmal, David Yarowsky, Oguz Yilmaz:
Quantifying the value of pronunciation lexicons for keyword search in lowresource languages. ICASSP 2013: 8560-8564 - [c56]Shakti P. Rath, Daniel Povey, Karel Veselý, Jan Cernocký:
Improved feature processing for deep neural networks. INTERSPEECH 2013: 109-113 - [c55]Karel Veselý, Arnab Ghoshal, Lukás Burget, Daniel Povey:
Sequence-discriminative training of deep neural networks. INTERSPEECH 2013: 2345-2349 - 2012
- [j8]Daniel Povey, Kaisheng Yao:
A basis representation of constrained MLLR transforms for robust adaptation. Comput. Speech Lang. 26(1): 35-51 (2012) - [c54]Oriol Vinyals, Suman V. Ravuri, Daniel Povey:
Revisiting Recurrent Neural Networks for robust ASR. ICASSP 2012: 4085-4088 - [c53]Daniel Povey, Mirko Hannemann, Gilles Boulianne, Lukás Burget
, Arnab Ghoshal, Milos Janda, Martin Karafiát
, Stefan Kombrink, Petr Motlícek
, Yanmin Qian, Korbinian Riedhammer, Karel Veselý, Ngoc Thang Vu:
Generating exact lattices in the WFST framework. ICASSP 2012: 4213-4216 - [c52]Ngoc Thang Vu, Tanja Schultz
, Daniel Povey:
Modeling gender dependency in the Subspace GMM framework. ICASSP 2012: 4345-4348 - [c51]Korbinian Riedhammer, Tobias Bocklet, Arnab Ghoshal, Daniel Povey:
Revisiting semi-continuous hidden Markov models. ICASSP 2012: 4721-4724 - [c50]Chao Weng, Biing-Hwang Juang,