default search action

combined dblp search
author search
venue search
publication search

ask others

Zhong Meng

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Books and Theses

see FAQ

What is the meaning of the colors in the publication lists?

2018
[b1]
- view
  - electronic edition via handle.net
  - details & citations
  authority control:
- export record
  dblp key:
  - phd/basesearch/Meng18a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/basesearch/Meng18a
Zhong Meng:
Discriminative and adaptive training for robust speech recognition and understanding. Georgia Institute of Technology, Atlanta, GA, USA, 2018

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/dsp/MengAJ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dsp/MengAJ20
Zhong Meng, Muhammad Umair Bin Altaf, Biing-Hwang Fred Juang:
Active voice authentication. Digit. Signal Process. 101: 102672 (2020)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimHMSSM0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimHMSSM0S24
Khe Chai Sim, Zhouyuan Huo, Tsendsuren Munkhdalai, Nikhil Siddhartha, Adam Stooke, Zhong Meng, Bo Li, Tara N. Sainath:
A Comparison of Parameter-Efficient ASR Domain Adaptation Methods for Universal Speech and Language Models. ICASSP 2024: 6900-6904
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PrabhavalkarMWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PrabhavalkarMWS24
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. ICASSP 2024: 11816-11820
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShanGMWCS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShanGMWCS24
Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting Conformers With Structured State-Space Sequence Models For Online Speech Recognition. ICASSP 2024: 12221-12225
[c53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WuSLRMVWCPMCP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WuSLRMVWCPMCP24
Zelin Wu, Gan Song, Christopher Li, Pat Rondon, Zhong Meng, Xavier Velez, Weiran Wang, Diamantino Caseiro, Golan Pundak, Tsendsuren Munkhdalai, Angad Chandorkar, Rohit Prabhavalkar:
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR. NAACL (Industry Track) 2024: 315-323
[c52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WangPSMHLS0QCSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangPSMHLS0QCSZ24
Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217
2023
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengWPSCVZLRR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengWPSCVZLRR23
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. ICASSP 2023: 1-5
[c50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PeyserMPRSPCH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PeyserMPRSPCH23
Cal Peyser, Zhong Meng, Rohit Prabhavalkar, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho, Ke Hu:
Improving Joint Speech-Text Representations Without Alignment. INTERSPEECH 2023: 1354-1358
[c49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BijwadiaCWMZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BijwadiaCWMZ23
Shaan Bijwadia, Shuo-Yiin Chang, Weiran Wang, Zhong Meng, Hao Zhang:
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models. INTERSPEECH 2023: 1409-1413
2022
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCWYWML22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCWYWML22
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. ICASSP 2022: 6017-6021
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KandaXGWMCY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KandaXGWMCY22
Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End Speaker-Attributed ASR. ICASSP 2022: 8082-8086
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenMPL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenMPL22
Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li:
Factorized Neural Transducer for Efficient Language Model Adaptation. ICASSP 2022: 8132-8136
[c45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kanda0WXMWG00Y22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kanda0WXMWG00Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. INTERSPEECH 2022: 521-525
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengGK0CW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengGK0CW022
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong:
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. INTERSPEECH 2022: 2608-2612
[c43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaWWXMWGC0Y22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaWWXMWGC0Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. INTERSPEECH 2022: 3774-3778
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Zhang0K00EYXMQW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Zhang0K00EYXMQW22
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022: 5383-5387
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MengCPZWAESRHVHM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MengCPZWAESRHVHM22
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. SLT 2022: 197-204
2021
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KandaXWZGWMCY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KandaXWZGWMCY21
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. ASRU 2021: 296-303
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/WangYCWZM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/WangYCWZM21
Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng:
Continuous Speech Separation with Ad Hoc Microphone Arrays. EUSIPCO 2021: 1100-1104
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SunLM021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SunLM021
Eric Sun, Liang Lu, Zhong Meng, Yifan Gong:
Sequence-Level Self-Teaching Regularization. ICASSP 2021: 2945-2949
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KandaMLGWCY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KandaMLGWCY21
Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka:
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR. ICASSP 2021: 6503-6507
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangKGWMY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangKGWMY21
Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Hypothesis Stitcher for End-to-End Speaker-Attributed ASR on Long-Form Multi-Talker Recordings. ICASSP 2021: 6763-6767
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengKGPSLC0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengKGPSLC0021
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong:
Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition. ICASSP 2021: 7338-7342
[c34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DengZMCLL0H21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DengZMCLL0H21
Yan Deng, Rui Zhao, Zhong Meng, Xie Chen, Bing Liu, Jinyu Li, Yifan Gong, Lei He:
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS. Interspeech 2021: 751-755
[c33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Meng0K0CYSL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Meng0K0CYSL021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong:
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition. Interspeech 2021: 2596-2600
[c32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaYWGWMCY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaYWGWMCY21
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone. Interspeech 2021: 3430-3434
[c31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001MKL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001MKL021
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong:
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer. Interspeech 2021: 3435-3439
[c30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Sun0MWX0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Sun0MWX0021
Eric Sun, Jinyu Li, Zhong Meng, Yu Wu, Jian Xue, Shujie Liu, Yifan Gong:
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions. Interspeech 2021: 3470-3474
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaYGWMCY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaYGWMCY21
Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
End-to-End Speaker-Attributed ASR with Transformer. Interspeech 2021: 4413-4417
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MengPSGKLCZLG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MengPSGKLCZLG21
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong:
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition. SLT 2021: 243-250
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/KandaCGWMCY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/KandaCGWMCY21
Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Investigation of End-to-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings. SLT 2021: 809-816
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/WangKGCMY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/WangKGCMY21
Xiaofei Wang, Naoyuki Kanda, Yashesh Gaur, Zhuo Chen, Zhong Meng, Takuya Yoshioka:
Exploring End-to-End Multi-Channel ASR with Bias Information for Meeting Transcription. SLT 2021: 833-840
2020
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenYLZMLWXL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenYLZMLWXL20
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li:
Continuous Speech Separation: Dataset and Analysis. ICASSP 2020: 7284-7288
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengHLL0GL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengHLL0GL20
Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. ICASSP 2020: 7389-7393
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiZSWDMG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiZSWDMG20
Jinyu Li, Rui Zhao, Eric Sun, Jeremy Heng Meng Wong, Amit Das, Zhong Meng, Yifan Gong:
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model. ICASSP 2020: 7699-7703
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaGWMCZY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaGWMCZY20
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka:
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers. INTERSPEECH 2020: 36-40
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaGWMY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaGWMY20
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Serialized Output Training for End-to-End Overlapped Speech Recognition. INTERSPEECH 2020: 2797-2801
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZMLWPMWHZG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZMLWPMWHZG20
Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. INTERSPEECH 2020: 3590-3594
2019
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WangCXMYZLL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WangCXMYZLL19
Peidong Wang, Zhuo Chen, Xiong Xiao, Zhong Meng, Takuya Yoshioka, Tianyan Zhou, Liang Lu, Jinyu Li:
Speech Separation Using Speaker Inventory. ASRU 2019: 230-236
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MengLGG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MengLGG19
Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong:
Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition. ASRU 2019: 268-275
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MengGLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MengGLG19
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Character-Aware Attention-Based End-to-End Speech Recognition. ASRU 2019: 949-955
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLG19
Zhong Meng, Jinyu Li, Yifan Gong:
Adversarial Speaker Adaptation. ICASSP 2019: 5721-5725
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengZLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengZLG19
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. ICASSP 2019: 6216-6220
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLZG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLZG19
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-student Learning. ICASSP 2019: 6445-6449
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLG19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLG19a
Zhong Meng, Jinyu Li, Yifan Gong:
Attentive Adversarial Learning for Domain-invariant Training. ICASSP 2019: 6740-6744
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengGLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengGLG19
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Speaker Adaptation for Attention-Based End-to-End Speech Recognition. INTERSPEECH 2019: 241-245
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaurLMG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaurLMG19
Yashesh Gaur, Jinyu Li, Zhong Meng, Yifan Gong:
Acoustic-to-Phrase Models for Speech Recognition. INTERSPEECH 2019: 2240-2244
2018
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLGJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLGJ18
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation. ICASSP 2018: 5949-5953
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLCZMGJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLCZMGJ18
Zhong Meng, Jinyu Li, Zhuo Chen, Yang Zhao, Vadim Mazalov, Yifan Gong, Biing-Hwang Juang:
Speaker-Invariant Training Via Adversarial Learning. ICASSP 2018: 5969-5973
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengLGJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengLGJ18
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Cycle-Consistent Speech Enhancement. INTERSPEECH 2018: 1165-1169
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengLGJ18a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengLGJ18a
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Adversarial Feature-Mapping for Speech Enhancement. INTERSPEECH 2018: 3259-3263
2017
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MengCMLG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MengCMLG17
Zhong Meng, Zhuo Chen, Vadim Mazalov, Jinyu Li, Yifan Gong:
Unsupervised adaptation with domain separation networks for robust speech recognition. ASRU 2017: 214-221
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengWHE17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengWHE17
Zhong Meng, Shinji Watanabe, John R. Hershey, Hakan Erdogan:
Deep long short-term memory adaptive beamforming networks for multichannel robust speech recognition. ICASSP 2017: 271-275
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengJ17
Zhong Meng, Biing-Hwang Juang:
Minimum Semantic Error Cost Training of Deep Long Short-Term Memory Networks for Topic Spotting on Conversational Speech. INTERSPEECH 2017: 2496-2500
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengJ17a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengJ17a
Zhong Meng, Biing-Hwang Juang:
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting. INTERSPEECH 2017: 3547-3551
2016
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengJ16
Zhong Meng, Biing-Hwang Juang:
Non-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting. INTERSPEECH 2016: 770-774
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengJ16a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengJ16a
Zhong Meng, Biing-Hwang Juang:
Statistical Modeling of Speaker's Voice with Temporal Co-Location for Active Voice Authentication. INTERSPEECH 2016: 1725-1729

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-17184
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-17184
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. CoRR abs/2402.17184 (2024)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-10180
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-10180
Zelin Wu, Gan Song, Christopher Li, Pat Rondon, Zhong Meng, Xavier Velez, Weiran Wang, Diamantino Caseiro, Golan Pundak, Tsendsuren Munkhdalai, Angad Chandorkar, Rohit Prabhavalkar:
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR. CoRR abs/2404.10180 (2024)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02004
Lun Wang, Om Thakkar, Zhong Meng, Nicole Rafidi, Rohit Prabhavalkar, Arun Narayanan:
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping. CoRR abs/2406.02004 (2024)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02921
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02921
Zhong Meng, Zelin Wu, Rohit Prabhavalkar, Cal Peyser, Weiran Wang, Nanxin Chen, Tara N. Sainath, Bhuvana Ramabhadran:
Text Injection for Neural Contextual Biasing. CoRR abs/2406.02921 (2024)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-14701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-14701
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Neeraj Gaur, Zhong Meng:
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions. CoRR abs/2406.14701 (2024)
2023
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08583
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. CoRR abs/2302.08583 (2023)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01037
Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-06125
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06125
Cal Peyser, Zhong Meng, Ke Hu, Rohit Prabhavalkar, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho:
Improving Joint Speech-Text Representations Without Alignment. CoRR abs/2308.06125 (2023)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-07395
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-07395
Shaan Bijwadia, Shuo-Yiin Chang, Weiran Wang, Zhong Meng, Hao Zhang, Tara N. Sainath:
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models. CoRR abs/2308.07395 (2023)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08551
Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara N. Sainath:
Augmenting conformers with structured state space models for online speech recognition. CoRR abs/2309.08551 (2023)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12963
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00178
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00178
Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara N. Sainath, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. CoRR abs/2310.00178 (2023)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00230
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00230
Mingqiu Wang, Wei Han, Izhak Shafran, Zelin Wu, Chung-Cheng Chiu, Yuan Cao, Yongqiang Wang, Nanxin Chen, Yu Zhang, Hagen Soltau, Paul K. Rubenstein, Lukas Zilka, Dian Yu, Zhong Meng, Golan Pundak, Nikhil Siddhartha, Johan Schalkwyk, Yonghui Wu:
SLM: Bridge the thin gap between speech and text foundation models. CoRR abs/2310.00230 (2023)
2022
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-00842
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-00842
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. CoRR abs/2202.00842 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16685
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16685
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. CoRR abs/2203.16685 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17049
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17049
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. CoRR abs/2210.17049 (2022)
2021
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-01853
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-01853
Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings. CoRR abs/2101.01853 (2021)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-01380
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-01380
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong:
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition. CoRR abs/2102.01380 (2021)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-02378
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-02378
Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng:
Continuous Speech Separation with Ad Hoc Microphone Arrays. CoRR abs/2103.02378 (2021)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-16776
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-16776
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone. CoRR abs/2103.16776 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02128
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02128
Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
End-to-End Speaker-Attributed ASR with Transformer. CoRR abs/2104.02128 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-02302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02302
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong:
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition. CoRR abs/2106.02302 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-02852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-02852
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. CoRR abs/2107.02852 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-01500
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-01500
Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li:
Factorized Neural Transducer for Efficient Language Model Adaptation. CoRR abs/2110.01500 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03151
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03151
Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR. CoRR abs/2110.03151 (2021)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05354
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05354
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong:
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. CoRR abs/2110.05354 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14142
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14142
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-Wise Permutation Invariant Training. CoRR abs/2110.14142 (2021)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14838
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. CoRR abs/2110.14838 (2021)
2020
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-01795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-01795
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Character-Aware Attention-Based End-to-End Speech Recognition. CoRR abs/2001.01795 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-01798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-01798
Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong:
Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition. CoRR abs/2001.01798 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-11482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-11482
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li:
Continuous speech separation: dataset and analysis. CoRR abs/2001.11482 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-07482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07482
Jinyu Li, Rui Zhao, Eric Sun, Jeremy Heng Meng Wong, Amit Das, Zhong Meng, Yifan Gong:
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model. CoRR abs/2003.07482 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-12687
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-12687
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka:
Serialized Output Training for End-to-End Overlapped Speech Recognition. CoRR abs/2003.12687 (2020)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-12071
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-12071
Zhong Meng, M. Umair Bin Altaf, Biing-Hwang Juang:
Active Voice Authentication. CoRR abs/2004.12071 (2020)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-13480
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-13480
Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. CoRR abs/2004.13480 (2020)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-10930
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-10930
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka:
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers. CoRR abs/2006.10930 (2020)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-15188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-15188
Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. CoRR abs/2007.15188 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-04546
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-04546
Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings. CoRR abs/2008.04546 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12673
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12673
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong:
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer. CoRR abs/2010.12673 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-01991
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-01991
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong:
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition. CoRR abs/2011.01991 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02921
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02921
Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka:
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR. CoRR abs/2011.02921 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-03110
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-03110
Xiaofei Wang, Naoyuki Kanda, Yashesh Gaur, Zhuo Chen, Zhong Meng, Takuya Yoshioka:
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription. CoRR abs/2011.03110 (2020)
2019
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12399
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-Student Learning. CoRR abs/1904.12399 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12400
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12400
Zhong Meng, Jinyu Li, Yifan Gong:
Attentive Adversarial Learning for Domain-Invariant Training. CoRR abs/1904.12400 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12406
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12406
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. CoRR abs/1904.12406 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12407
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12407
Zhong Meng, Jinyu Li, Yifan Gong:
Adversarial Speaker Adaptation. CoRR abs/1904.12407 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-03762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-03762
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Speaker Adaptation for Attention-Based End-to-End Speech Recognition. CoRR abs/1911.03762 (2019)
2018
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00644
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00644
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation. CoRR abs/1804.00644 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00732
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00732
Zhong Meng, Jinyu Li, Zhuo Chen, Yong Zhao, Vadim Mazalov, Yifan Gong, Biing-Hwang Juang:
Speaker-Invariant Training via Adversarial Learning. CoRR abs/1804.00732 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-02251
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-02251
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Feature-Mapping for Speech Enhancement. CoRR abs/1809.02251 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-02253
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-02253
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Cycle-Consistent Speech Enhancement. CoRR abs/1809.02253 (2018)
2017
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-08010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-08010
Zhong Meng, Zhuo Chen, Vadim Mazalov, Jinyu Li, Yifan Gong:
Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition. CoRR abs/1711.08010 (2017)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-08016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-08016
Zhong Meng, Shinji Watanabe, John R. Hershey, Hakan Erdogan:
Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition. CoRR abs/1711.08016 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.