Ye Bai
2020 – today
- 2024
- [c33] Qianqian Dong, Zhiying Huang, Qi Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang:
PolyVoice: Language Models for Speech to Speech Translation. ICLR 2024
- [c32] Ye Bai, Chenxing Li, Hao Li, Yuanyuan Zhao, Xiaorui Wang:
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation. ICME 2024: 1-6
- [i22] Ye Bai, Chenxing Li, Hao Li, Yuanyuan Zhao, Xiaorui Wang:
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation. CoRR abs/2404.11275 (2024)
- [i21] Junzuo Zhou, Jiangyan Yi, Tao Wang, Jianhua Tao, Ye Bai, Chu Yuan Zhang, Yong Ren, Zhengqi Wen:
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking. CoRR abs/2406.04840 (2024)
- [i20] Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou:
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition. CoRR abs/2407.04675 (2024)
- [i19] Minglun Han, Ye Bai, Chen Shen, Youjia Huang, Mingkun Huang, Zehua Lin, Linhao Dong, Lu Lu, Yuxuan Wang:
NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training. CoRR abs/2409.08680 (2024)
- [i18] Ye Bai, Haonan Chen, Jitong Chen, Zhuo Chen, Yi Deng, Xiaohong Dong, Lamtharn Hantrakul, Weituo Hao, Qingqing Huang, Zhongyi Huang, Dongya Jia, Feihu La, Duc Le, Bochen Li, Chumin Li, Hui Li, Xingxing Li, Shouda Liu, Wei-Tsung Lu, Yiqing Lu, Andrew Shaw, Janne Spijkervet, Yakun Sun, Bo Wang, Ju-Chiang Wang, Yuping Wang, Yuxuan Wang, Ling Xu, Yifeng Yang, Chao Yao, Shuo Zhang, Yang Zhang, Yilin Zhang, Hang Zhao, Ziyi Zhao, Dejian Zhong, Shicen Zhou, Pei Zou:
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation. CoRR abs/2409.09214 (2024)
- 2023
- [j8] Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Cunhang Fan:
Transfer knowledge for punctuation prediction via adversarial training. Speech Commun. 149: 1-10 (2023)
- [c31] Chenxing Li, Ye Bai, Yang Wang, Feng Deng, Yuanyuan Zhao, Zhuo Zhang, Xiaorui Wang:
Image-driven Audio-visual Universal Source Separation. INTERSPEECH 2023: 3729-3733
- [c30] Zeyu Jin, Zixuan Wang, Qixin Wang, Jia Jia, Ye Bai, Yi Zhao, Hao Li, Xiaorui Wang:
HoloSinger: Semantics and Music Driven Motion Generation with Octahedral Holographic Projection. ACM Multimedia 2023: 9393-9395
- [i17] Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang:
PolyVoice: Language Models for Speech to Speech Translation. CoRR abs/2306.02982 (2023)
- 2022
- [c29] Ying Zhang, Peng Yang, Jinba Xiao, Ye Bai, Hao Che, Xiaorui Wang:
K-Converter: An Unsupervised Singing Voice Conversion System. ICASSP 2022: 6662-6666
- [c28] Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li:
ADD 2022: the first Audio Deep Synthesis Detection Challenge. ICASSP 2022: 9216-9220
- [c27] Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang:
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition. INTERSPEECH 2022: 1676-1680
- [i16] Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu:
ADD 2022: the First Audio Deep Synthesis Detection Challenge. CoRR abs/2202.08433 (2022)
- [i15] Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang:
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition. CoRR abs/2209.08326 (2022)
- 2021
- [j7] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Shuai Zhang:
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1340-1351 (2021)
- [j6] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Fast End-to-End Speech Recognition Via Non-Autoregressive Models and Cross-Modal Knowledge Transferring From BERT. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1897-1911 (2021)
- [c26] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition. APSIPA ASC 2021: 454-459
- [c25] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen:
Decoupling Pronunciation and Language for End-to-End Code-Switching Automatic Speech Recognition. ICASSP 2021: 6249-6253
- [c24] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Xuefei Liu, Zhengqi Wen:
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition. Interspeech 2021: 266-270
- [c23] Haoxin Ma, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Chenglong Wang:
Continual Learning for Fake Audio Detection. Interspeech 2021: 886-890
- [c22] Jiangyan Yi, Ye Bai, Jianhua Tao, Haoxin Ma, Zhengkun Tian, Chenglong Wang, Tao Wang, Ruibo Fu:
Half-Truth: A Partially Fake Audio Detection Dataset. Interspeech 2021: 1654-1658
- [c21] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization. Interspeech 2021: 4034-4038
- [c20] Chenglong Wang, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian:
Hierarchically Attending Time-Frequency and Channel Features for Improving Speaker Verification. ISCSLP 2021: 1-5
- [c19] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Ye Bai:
Rnn-transducer With Language Bias For End-to-end Mandarin-English Code-switching Speech Recognition. ISCSLP 2021: 1-5
- [i14] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Fast End-to-End Speech Recognition via a Non-Autoregressive Model and Cross-Modal Knowledge Transferring from BERT. CoRR abs/2102.07594 (2021)
- [i13] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen, Xuefei Liu:
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition. CoRR abs/2104.01522 (2021)
- [i12] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization. CoRR abs/2104.02882 (2021)
- [i11] Jiangyan Yi, Ye Bai, Jianhua Tao, Zhengkun Tian, Chenglong Wang, Tao Wang, Ruibo Fu:
Half-Truth: A Partially Fake Audio Detection Dataset. CoRR abs/2104.03617 (2021)
- [i10] Haoxin Ma, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Chenglong Wang:
Continual Learning for Fake Audio Detection. CoRR abs/2104.07286 (2021)
- 2020
- [j5] Bocheng Zhao, Jianhua Tao, Minghao Yang, Zhengkun Tian, Cunhang Fan, Ye Bai:
Deep imitator: Handwriting calligraphy imitation via deep attention networks. Pattern Recognit. 104: 107080 (2020)
- [j4] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Cunhang Fan:
A Public Chinese Dataset for Language Model Adaptation. J. Signal Process. Syst. 92(8): 839-851 (2020)
- [c18] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
Synchronous Transformers for end-to-end Speech Recognition. ICASSP 2020: 7884-7888
- [c17] Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Ye Bai, Cunhang Fan:
Focal Loss for Punctuation Prediction. INTERSPEECH 2020: 721-725
- [c16] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition. INTERSPEECH 2020: 3381-3385
- [c15] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen:
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition. INTERSPEECH 2020: 5026-5030
- [i9] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Ye Bai:
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition. CoRR abs/2002.08126 (2020)
- [i8] Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Cunhang Fan:
Adversarial Transfer Learning for Punctuation Restoration. CoRR abs/2004.00248 (2020)
- [i7] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition. CoRR abs/2005.04862 (2020)
- [i6] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen:
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition. CoRR abs/2005.07903 (2020)
- [i5] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen:
Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition. CoRR abs/2010.14798 (2020)
2010 – 2019
- 2019
- [j3] Ye Bai:
Research on the effect of psychological stress intervention in music students based on Diffie-Hellman key exchange algorithm. Clust. Comput. 22(6): 13723-13729 (2019)
- [j2] Srikanth Gururajan, Ye Bai:
Autonomous "Figure-8" Flights of a Quadcopter: Experimental Datasets. Data 4(1): 39 (2019)
- [j1] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ye Bai:
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 621-630 (2019)
- [c14] Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Ye Bai:
Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network. APSIPA 2019: 662-666
- [c13] Haoxin Ma, Ye Bai, Jiangyan Yi, Jianhua Tao:
Hypersphere Embedding and Additive Margin for Query-by-example Keyword Spotting. APSIPA 2019: 868-872
- [c12] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Bin Liu:
Voice Activity Detection Based on Time-Delay Neural Networks. APSIPA 2019: 1173-1178
- [c11] Jiangyan Yi, Jianhua Tao, Ye Bai:
Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition. ICASSP 2019: 6071-6075
- [c10] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Chenghao Zhao, Cunhang Fan:
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting. INTERSPEECH 2019: 2190-2194
- [c9] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen:
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition. INTERSPEECH 2019: 3795-3799
- [c8] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengqi Wen:
Self-Attention Transducers for End-to-End Speech Recognition. INTERSPEECH 2019: 4395-4399
- [i4] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen:
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition. CoRR abs/1907.06017 (2019)
- [i3] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengqi Wen:
Self-Attention Transducers for End-to-End Speech Recognition. CoRR abs/1909.13037 (2019)
- [i2] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang:
Integrating Whole Context to Sequence-to-sequence Speech Recognition. CoRR abs/1912.01777 (2019)
- [i1] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen:
Synchronous Transformers for End-to-End Speech Recognition. CoRR abs/1912.02958 (2019)
- 2018
- [c7] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ye Bai:
Adversarial Multilingual Training for Low-Resource Speech Recognition. ICASSP 2018: 4899-4903
- [c6] Cunhang Fan, Bin Liu, Jianhua Tao, Zhengqi Wen, Jiangyan Yi, Ye Bai:
Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation. ISCSLP 2018: 26-30
- [c5] Ye Bai, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Cunhang Fan:
CLMAD: A Chinese Language Model Adaptation Dataset. ISCSLP 2018: 275-279
- 2016
- [c4] Ye Bai, Jiangyan Yi, Hao Ni, Zhengqi Wen, Bin Liu, Ya Li, Jianhua Tao:
End-to-end keywords spotting based on connectionist temporal classification for Mandarin. ISCSLP 2016: 1-5
- 2013
- [c3] Ye Bai, Xueli Sheng, Chunyan Sun, Jin Han:
Study of a speech coding algorithm based on a contact conduction transmitter in a complicated water area. WUWNet 2013: 19:1-19:2
- 2011
- [c2] Yunliang Yu, Tingting Zhang, Ye Bai, Jianqiang Wang:
Method of the Road Lines Recognition in the Maps of Digital Material Based on Improvemented BP Neural Network. CSISE (1) 2011: 113-117
- [c1] Yunliang Yu, Ye Bai, Tingting Zhang, Jianqiang Wang:
The Heavy Mineral Analysis Based on Immune Self-organizing Neural Network. CSISE (1) 2011: 119-123
last updated on 2024-11-15 20:42 CET by the dblp team
all metadata released as open data under CC0 1.0 license