default search action
Yichong Leng
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2024
- [j1]Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Sheng Zhao, Tao Qin, Frank K. Soong, Tie-Yan Liu:
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality. IEEE Trans. Pattern Anal. Mach. Intell. 46(6): 4234-4245 (2024)
Conference and Workshop Papers
- 2024
- [c19]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. ACL (1) 2024: 1979-1998 - [c18]Yichong Leng, Zhifang Guo, Kai Shen, Zeqian Ju, Xu Tan, Eric Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiangyang Li, Sheng Zhao, Tao Qin, Jiang Bian:
PromptTTS 2: Describing and Generating Voices with Text Prompt. ICLR 2024 - [c17]Kai Shen, Zeqian Ju, Xu Tan, Eric Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian:
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers. ICLR 2024 - [c16]Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiangyang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. ICML 2024 - [c15]Jingxuan Wei, Linzhuang Sun, Yichong Leng, Xu Tan, Bihui Yu, Ruifeng Guo:
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation. IJCAI 2024: 6531-6540 - 2023
- [c14]Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu:
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition. AAAI 2023: 13034-13042 - [c13]Zixin Zeng, Rui Wang, Yichong Leng, Junliang Guo, Shufang Xie, Xu Tan, Tao Qin, Tie-Yan Liu:
Extract and Attend: Improving Entity Translation in Neural Machine Translation. ACL (Findings) 2023: 1697-1710 - [c12]Zhifang Guo, Yichong Leng, Yihan Wu, Sheng Zhao, Xu Tan:
Prompttts: Controllable Text-To-Speech With Text Descriptions. ICASSP 2023: 1-5 - 2022
- [c11]Kai Shen, Yichong Leng, Xu Tan, Siliang Tang, Yuan Zhang, Wenjie Liu, Edward Lin:
Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction. EMNLP 2022: 10367-10380 - [c10]Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee:
A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System. ICASSP 2022: 6087-6091 - [c9]Jin Xu, Xu Tan, Kaitao Song, Renqian Luo, Yichong Leng, Tao Qin, Tie-Yan Liu, Jian Li:
Analyzing and Mitigating Interference in Neural Architecture Search. ICML 2022: 24646-24662 - [c8]Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo P. Mandic, Lei He, Xiangyang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu:
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis. NeurIPS 2022 - [c7]Kaitao Song, Yichong Leng, Xu Tan, Yicheng Zou, Tao Qin, Dongsheng Li:
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling. NeurIPS 2022 - 2021
- [c6]Yichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Wenjie Liu, Linquan Liu, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu:
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition. EMNLP (Findings) 2021: 4328-4337 - [c5]Yichong Leng, Xu Tan, Sheng Zhao, Frank K. Soong, Xiang-Yang Li, Tao Qin:
MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network. ICASSP 2021: 391-395 - [c4]Jiawei Chen, Xu Tan, Yichong Leng, Jin Xu, Guihua Wen, Tao Qin, Tie-Yan Liu:
Speech-T: Transducer for Text to Speech and Beyond. NeurIPS 2021: 6621-6633 - [c3]Yichong Leng, Xu Tan, Linchen Zhu, Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiangyang Li, Edward Lin, Tie-Yan Liu:
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition. NeurIPS 2021: 21708-21719 - 2019
- [c2]Yichong Leng, Xu Tan, Tao Qin, Xiang-Yang Li, Tie-Yan Liu:
Unsupervised Pivot Translation for Distant Languages. ACL (1) 2019: 175-183 - [c1]Yingce Xia, Xu Tan, Fei Tian, Fei Gao, Di He, Weicong Chen, Yang Fan, Linyuan Gong, Yichong Leng, Renqian Luo, Yiren Wang, Lijun Wu, Jinhua Zhu, Tao Qin, Tie-Yan Liu:
Microsoft Research Asia's Systems for WMT19. WMT (2) 2019: 424-433
Informal and Other Publications
- 2024
- [i23]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. CoRR abs/2402.07729 (2024) - [i22]Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. CoRR abs/2403.03100 (2024) - [i21]Jingxuan Wei, Linzhuang Sun, Yichong Leng, Xu Tan, Bihui Yu, Ruifeng Guo:
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation. CoRR abs/2404.14827 (2024) - [i20]Yunfei Chu, Jin Xu, Qian Yang, Haojie Wei, Xipin Wei, Zhifang Guo, Yichong Leng, Yuanjun Lv, Jinzheng He, Junyang Lin, Chang Zhou, Jingren Zhou:
Qwen2-Audio Technical Report. CoRR abs/2407.10759 (2024) - 2023
- [i19]Kai Shen, Zeqian Ju, Xu Tan, Yanqing Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian:
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers. CoRR abs/2304.09116 (2023) - [i18]Hande Dong, Jiayi Lin, Yichong Leng, Jiawei Chen, Yutao Xie:
Retriever and Ranker Framework with Probabilistic Hard Negative Sampling for Code Search. CoRR abs/2305.04508 (2023) - [i17]Zixin Zeng, Rui Wang, Yichong Leng, Junliang Guo, Xu Tan, Tao Qin, Tie-Yan Liu:
Extract and Attend: Improving Entity Translation in Neural Machine Translation. CoRR abs/2306.02242 (2023) - [i16]Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian:
PromptTTS 2: Describing and Generating Voices with Text Prompt. CoRR abs/2309.02285 (2023) - 2022
- [i15]Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Frank K. Soong, Tao Qin, Sheng Zhao, Tie-Yan Liu:
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality. CoRR abs/2205.04421 (2022) - [i14]Kaitao Song, Yichong Leng, Xu Tan, Yicheng Zou, Tao Qin, Dongsheng Li:
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling. CoRR abs/2205.12986 (2022) - [i13]Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo P. Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu:
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis. CoRR abs/2205.14807 (2022) - [i12]Zhifang Guo, Yichong Leng, Yihan Wu, Sheng Zhao, Xu Tan:
PromptTTS: Controllable Text-to-Speech with Text Descriptions. CoRR abs/2211.12171 (2022) - [i11]Kai Shen, Yichong Leng, Xu Tan, Siliang Tang, Yuan Zhang, Wenjie Liu, Edward Lin:
Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction. CoRR abs/2211.13252 (2022) - [i10]Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu:
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition. CoRR abs/2212.01039 (2022) - [i9]Zehua Chen, Yihan Wu, Yichong Leng, Jiawei Chen, Haohe Liu, Xu Tan, Yang Cui, Ke Wang, Lei He, Sheng Zhao, Jiang Bian, Danilo P. Mandic:
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech. CoRR abs/2212.14518 (2022) - 2021
- [i8]Yichong Leng, Xu Tan, Sheng Zhao, Frank K. Soong, Xiangyang Li, Tao Qin:
MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network. CoRR abs/2103.00110 (2021) - [i7]Yichong Leng, Xu Tan, Linchen Zhu, Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiang-Yang Li, Edward Lin, Tie-Yan Liu:
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition. CoRR abs/2105.03842 (2021) - [i6]Jin Xu, Xu Tan, Kaitao Song, Renqian Luo, Yichong Leng, Tao Qin, Tie-Yan Liu, Jian Li:
Analyzing and Mitigating Interference in Neural Architecture Search. CoRR abs/2108.12821 (2021) - [i5]Yichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Linquan Liu, Tao Qin, Xiang-Yang Li, Edward Lin, Tie-Yan Liu:
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition. CoRR abs/2109.14420 (2021) - [i4]Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee:
A study on the efficacy of model pre-training in developing neural text-to-speech system. CoRR abs/2110.03857 (2021) - 2019
- [i3]Yichong Leng, Xu Tan, Tao Qin, Xiang-Yang Li, Tie-Yan Liu:
Unsupervised Pivot Translation for Distant Languages. CoRR abs/1906.02461 (2019) - [i2]Yingce Xia, Xu Tan, Fei Tian, Fei Gao, Weicong Chen, Yang Fan, Linyuan Gong, Yichong Leng, Renqian Luo, Yiren Wang, Lijun Wu, Jinhua Zhu, Tao Qin, Tie-Yan Liu:
Microsoft Research Asia's Systems for WMT19. CoRR abs/1911.06191 (2019) - [i1]Xu Tan, Yichong Leng, Jiale Chen, Yi Ren, Tao Qin, Tie-Yan Liu:
A Study of Multilingual Neural Machine Translation. CoRR abs/1912.11625 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-30 21:36 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint