default search action
Zhong Meng
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Books and Theses
- 2018
- [b1]Zhong Meng:
Discriminative and adaptive training for robust speech recognition and understanding. Georgia Institute of Technology, Atlanta, GA, USA, 2018
Journal Articles
- 2020
- [j1]Zhong Meng, Muhammad Umair Bin Altaf, Biing-Hwang Fred Juang:
Active voice authentication. Digit. Signal Process. 101: 102672 (2020)
Conference and Workshop Papers
- 2024
- [c56]Khe Chai Sim, Zhouyuan Huo, Tsendsuren Munkhdalai, Nikhil Siddhartha, Adam Stooke, Zhong Meng, Bo Li, Tara N. Sainath:
A Comparison of Parameter-Efficient ASR Domain Adaptation Methods for Universal Speech and Language Models. ICASSP 2024: 6900-6904 - [c55]Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. ICASSP 2024: 11816-11820 - [c54]Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting Conformers With Structured State-Space Sequence Models For Online Speech Recognition. ICASSP 2024: 12221-12225 - [c53]Zelin Wu, Gan Song, Christopher Li, Pat Rondon, Zhong Meng, Xavier Velez, Weiran Wang, Diamantino Caseiro, Golan Pundak, Tsendsuren Munkhdalai, Angad Chandorkar, Rohit Prabhavalkar:
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR. NAACL (Industry Track) 2024: 315-323 - [c52]Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217 - 2023
- [c51]Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. ICASSP 2023: 1-5 - [c50]Cal Peyser, Zhong Meng, Rohit Prabhavalkar, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho, Ke Hu:
Improving Joint Speech-Text Representations Without Alignment. INTERSPEECH 2023: 1354-1358 - [c49]Shaan Bijwadia, Shuo-Yiin Chang, Weiran Wang, Zhong Meng, Hao Zhang:
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models. INTERSPEECH 2023: 1409-1413 - 2022
- [c48]Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. ICASSP 2022: 6017-6021 - [c47]Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End Speaker-Attributed ASR. ICASSP 2022: 8082-8086 - [c46]Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li:
Factorized Neural Transducer for Efficient Language Model Adaptation. ICASSP 2022: 8132-8136 - [c45]Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. INTERSPEECH 2022: 521-525 - [c44]Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong:
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. INTERSPEECH 2022: 2608-2612 - [c43]Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. INTERSPEECH 2022: 3774-3778 - [c42]Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022: 5383-5387 - [c41]Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. SLT 2022: 197-204 - 2021
- [c40]Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. ASRU 2021: 296-303 - [c39]Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng:
Continuous Speech Separation with Ad Hoc Microphone Arrays. EUSIPCO 2021: 1100-1104 - [c38]Eric Sun, Liang Lu, Zhong Meng, Yifan Gong:
Sequence-Level Self-Teaching Regularization. ICASSP 2021: 2945-2949 - [c37]Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka:
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR. ICASSP 2021: 6503-6507 - [c36]Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Hypothesis Stitcher for End-to-End Speaker-Attributed ASR on Long-Form Multi-Talker Recordings. ICASSP 2021: 6763-6767 - [c35]Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong:
Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition. ICASSP 2021: 7338-7342 - [c34]Yan Deng, Rui Zhao, Zhong Meng, Xie Chen, Bing Liu, Jinyu Li, Yifan Gong, Lei He:
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS. Interspeech 2021: 751-755 - [c33]Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong:
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition. Interspeech 2021: 2596-2600 - [c32]Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone. Interspeech 2021: 3430-3434 - [c31]Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong:
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer. Interspeech 2021: 3435-3439 - [c30]Eric Sun, Jinyu Li, Zhong Meng, Yu Wu, Jian Xue, Shujie Liu, Yifan Gong:
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions. Interspeech 2021: 3470-3474 - [c29]Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
End-to-End Speaker-Attributed ASR with Transformer. Interspeech 2021: 4413-4417 - [c28]Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong:
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition. SLT 2021: 243-250 - [c27]Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Investigation of End-to-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings. SLT 2021: 809-816 - [c26]Xiaofei Wang, Naoyuki Kanda, Yashesh Gaur, Zhuo Chen, Zhong Meng, Takuya Yoshioka:
Exploring End-to-End Multi-Channel ASR with Bias Information for Meeting Transcription. SLT 2021: 833-840 - 2020
- [c25]Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li:
Continuous Speech Separation: Dataset and Analysis. ICASSP 2020: 7284-7288 - [c24]Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. ICASSP 2020: 7389-7393 - [c23]Jinyu Li, Rui Zhao, Eric Sun, Jeremy Heng Meng Wong, Amit Das, Zhong Meng, Yifan Gong:
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model. ICASSP 2020: 7699-7703 - [c22]Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka:
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers. INTERSPEECH 2020: 36-40 - [c21]Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Serialized Output Training for End-to-End Overlapped Speech Recognition. INTERSPEECH 2020: 2797-2801 - [c20]Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. INTERSPEECH 2020: 3590-3594 - 2019
- [c19]Peidong Wang, Zhuo Chen, Xiong Xiao, Zhong Meng, Takuya Yoshioka, Tianyan Zhou, Liang Lu, Jinyu Li:
Speech Separation Using Speaker Inventory. ASRU 2019: 230-236 - [c18]Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong:
Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition. ASRU 2019: 268-275 - [c17]Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Character-Aware Attention-Based End-to-End Speech Recognition. ASRU 2019: 949-955 - [c16]Zhong Meng, Jinyu Li, Yifan Gong:
Adversarial Speaker Adaptation. ICASSP 2019: 5721-5725 - [c15]Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. ICASSP 2019: 6216-6220 - [c14]Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-student Learning. ICASSP 2019: 6445-6449 - [c13]Zhong Meng, Jinyu Li, Yifan Gong:
Attentive Adversarial Learning for Domain-invariant Training. ICASSP 2019: 6740-6744 - [c12]Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Speaker Adaptation for Attention-Based End-to-End Speech Recognition. INTERSPEECH 2019: 241-245 - [c11]Yashesh Gaur, Jinyu Li, Zhong Meng, Yifan Gong:
Acoustic-to-Phrase Models for Speech Recognition. INTERSPEECH 2019: 2240-2244 - 2018
- [c10]Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation. ICASSP 2018: 5949-5953 - [c9]Zhong Meng, Jinyu Li, Zhuo Chen, Yang Zhao, Vadim Mazalov, Yifan Gong, Biing-Hwang Juang:
Speaker-Invariant Training Via Adversarial Learning. ICASSP 2018: 5969-5973 - [c8]Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Cycle-Consistent Speech Enhancement. INTERSPEECH 2018: 1165-1169 - [c7]Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Adversarial Feature-Mapping for Speech Enhancement. INTERSPEECH 2018: 3259-3263 - 2017
- [c6]Zhong Meng, Zhuo Chen, Vadim Mazalov, Jinyu Li, Yifan Gong:
Unsupervised adaptation with domain separation networks for robust speech recognition. ASRU 2017: 214-221 - [c5]Zhong Meng, Shinji Watanabe, John R. Hershey, Hakan Erdogan:
Deep long short-term memory adaptive beamforming networks for multichannel robust speech recognition. ICASSP 2017: 271-275 - [c4]Zhong Meng, Biing-Hwang Juang:
Minimum Semantic Error Cost Training of Deep Long Short-Term Memory Networks for Topic Spotting on Conversational Speech. INTERSPEECH 2017: 2496-2500 - [c3]Zhong Meng, Biing-Hwang Juang:
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting. INTERSPEECH 2017: 3547-3551 - 2016
- [c2]Zhong Meng, Biing-Hwang Juang:
Non-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting. INTERSPEECH 2016: 770-774 - [c1]Zhong Meng, Biing-Hwang Juang:
Statistical Modeling of Speaker's Voice with Temporal Co-Location for Active Voice Authentication. INTERSPEECH 2016: 1725-1729
Informal and Other Publications
- 2024
- [i53]Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. CoRR abs/2402.17184 (2024) - [i52]Zelin Wu, Gan Song, Christopher Li, Pat Rondon, Zhong Meng, Xavier Velez, Weiran Wang, Diamantino Caseiro, Golan Pundak, Tsendsuren Munkhdalai, Angad Chandorkar, Rohit Prabhavalkar:
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR. CoRR abs/2404.10180 (2024) - [i51]Lun Wang, Om Thakkar, Zhong Meng, Nicole Rafidi, Rohit Prabhavalkar, Arun Narayanan:
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping. CoRR abs/2406.02004 (2024) - [i50]Zhong Meng, Zelin Wu, Rohit Prabhavalkar, Cal Peyser, Weiran Wang, Nanxin Chen, Tara N. Sainath, Bhuvana Ramabhadran:
Text Injection for Neural Contextual Biasing. CoRR abs/2406.02921 (2024) - [i49]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Neeraj Gaur, Zhong Meng:
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions. CoRR abs/2406.14701 (2024) - 2023
- [i48]Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. CoRR abs/2302.08583 (2023) - [i47]Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023) - [i46]Cal Peyser, Zhong Meng, Ke Hu, Rohit Prabhavalkar, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho:
Improving Joint Speech-Text Representations Without Alignment. CoRR abs/2308.06125 (2023) - [i45]Shaan Bijwadia, Shuo-Yiin Chang, Weiran Wang, Zhong Meng, Hao Zhang, Tara N. Sainath:
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models. CoRR abs/2308.07395 (2023) - [i44]Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting conformers with structured state space models for online speech recognition. CoRR abs/2309.08551 (2023) - [i43]Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023) - [i42]Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara N. Sainath, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. CoRR abs/2310.00178 (2023) - [i41]Mingqiu Wang, Wei Han, Izhak Shafran, Zelin Wu, Chung-Cheng Chiu, Yuan Cao, Yongqiang Wang, Nanxin Chen, Yu Zhang, Hagen Soltau, Paul K. Rubenstein, Lukas Zilka, Dian Yu, Zhong Meng, Golan Pundak, Nikhil Siddhartha, Johan Schalkwyk, Yonghui Wu:
SLM: Bridge the thin gap between speech and text foundation models. CoRR abs/2310.00230 (2023) - 2022
- [i40]Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. CoRR abs/2202.00842 (2022) - [i39]Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. CoRR abs/2203.16685 (2022) - [i38]Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. CoRR abs/2210.17049 (2022) - 2021
- [i37]Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings. CoRR abs/2101.01853 (2021) - [i36]Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong:
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition. CoRR abs/2102.01380 (2021) - [i35]Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng:
Continuous Speech Separation with Ad Hoc Microphone Arrays. CoRR abs/2103.02378 (2021) - [i34]Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone. CoRR abs/2103.16776 (2021) - [i33]Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
End-to-End Speaker-Attributed ASR with Transformer. CoRR abs/2104.02128 (2021) - [i32]Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong:
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition. CoRR abs/2106.02302 (2021) - [i31]Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. CoRR abs/2107.02852 (2021) - [i30]Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li:
Factorized Neural Transducer for Efficient Language Model Adaptation. CoRR abs/2110.01500 (2021) - [i29]Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR. CoRR abs/2110.03151 (2021) - [i28]Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong:
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. CoRR abs/2110.05354 (2021) - [i27]Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-Wise Permutation Invariant Training. CoRR abs/2110.14142 (2021) - [i26]Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. CoRR abs/2110.14838 (2021) - 2020
- [i25]Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Character-Aware Attention-Based End-to-End Speech Recognition. CoRR abs/2001.01795 (2020) - [i24]Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong:
Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition. CoRR abs/2001.01798 (2020) - [i23]Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li:
Continuous speech separation: dataset and analysis. CoRR abs/2001.11482 (2020) - [i22]Jinyu Li, Rui Zhao, Eric Sun, Jeremy Heng Meng Wong, Amit Das, Zhong Meng, Yifan Gong:
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model. CoRR abs/2003.07482 (2020) - [i21]Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Serialized Output Training for End-to-End Overlapped Speech Recognition. CoRR abs/2003.12687 (2020) - [i20]Zhong Meng, M. Umair Bin Altaf, Biing-Hwang Juang:
Active Voice Authentication. CoRR abs/2004.12071 (2020) - [i19]Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. CoRR abs/2004.13480 (2020) - [i18]Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka:
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers. CoRR abs/2006.10930 (2020) - [i17]Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. CoRR abs/2007.15188 (2020) - [i16]Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings. CoRR abs/2008.04546 (2020) - [i15]Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong:
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer. CoRR abs/2010.12673 (2020) - [i14]Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong:
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition. CoRR abs/2011.01991 (2020) - [i13]Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka:
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR. CoRR abs/2011.02921 (2020) - [i12]Xiaofei Wang, Naoyuki Kanda, Yashesh Gaur, Zhuo Chen, Zhong Meng, Takuya Yoshioka:
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription. CoRR abs/2011.03110 (2020) - 2019
- [i11]Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-Student Learning. CoRR abs/1904.12399 (2019) - [i10]Zhong Meng, Jinyu Li, Yifan Gong:
Attentive Adversarial Learning for Domain-Invariant Training. CoRR abs/1904.12400 (2019) - [i9]Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. CoRR abs/1904.12406 (2019) - [i8]Zhong Meng, Jinyu Li, Yifan Gong:
Adversarial Speaker Adaptation. CoRR abs/1904.12407 (2019) - [i7]Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Speaker Adaptation for Attention-Based End-to-End Speech Recognition. CoRR abs/1911.03762 (2019) - 2018
- [i6]Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation. CoRR abs/1804.00644 (2018) - [i5]Zhong Meng, Jinyu Li, Zhuo Chen, Yong Zhao, Vadim Mazalov, Yifan Gong, Biing-Hwang Juang:
Speaker-Invariant Training via Adversarial Learning. CoRR abs/1804.00732 (2018) - [i4]Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Feature-Mapping for Speech Enhancement. CoRR abs/1809.02251 (2018) - [i3]Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Cycle-Consistent Speech Enhancement. CoRR abs/1809.02253 (2018) - 2017
- [i2]Zhong Meng, Zhuo Chen, Vadim Mazalov, Jinyu Li, Yifan Gong:
Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition. CoRR abs/1711.08010 (2017) - [i1]Zhong Meng, Shinji Watanabe, John R. Hershey, Hakan Erdogan:
Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition. CoRR abs/1711.08016 (2017)
Coauthor Index
aka: Biing-Hwang Fred Juang
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint