


Lei Xie 0001
Person information

- affiliation: Northwestern Polytechnical University, School of Computer Science, Xi'an, China
- affiliation (2006 - 2007): The Chinese University of Hong Kong, Department of Systems Engineering and Engineering Management, Hong Kong
- affiliation (2004 - 2006): City University of Hong Kong, School of Creative Media, Hong Kong
- affiliation (PhD 2004): Northwestern Polytechnical University, Xi'an, China
- affiliation (2001 - 2002): Vrije Universiteit Brussel, Department of Electronics and Information Processing, Belgium
Other persons with the same name
- Lei Xie — disambiguation page
- Lei Xie 0002 — Xi'an Jiaotong University, China
- Lei Xie 0003 — Zhejiang University, College of Information Science and Electronic Engineering, Hangzhou, China
- Lei Xie 0004 — Nanjing University, State Key Laboratory for Novel Software Technology, China
- Lei Xie 0005 — Delft University of Technology, Laboratory of Computer Engineering, The Netherlands
- Lei Xie 0006 — City University of New York, Department of Computer Science, Hunter College, NY, USA (and 1 more)
- Lei Xie 0007 — Zhejiang University, State Key Laboratory of Industrial Control Technology, Hangzhou, China (and 2 more)
2020 – today
- 2022
- [j45]Hongqiang Du, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. Neural Networks 148: 74-84 (2022) - [j44]Chenggang Mi, Lei Xie, Yanning Zhang:
Improving data augmentation for low resource speech-to-text translation with diverse paraphrasing. Neural Networks 148: 194-205 (2022) - [j43]Jingyong Hou, Lei Xie, Shilei Zhang:
Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution. Neural Networks 150: 28-42 (2022) - [j42]Xiaochun An, Frank K. Soong, Lei Xie:
Disentangling Style and Speaker Attributes for TTS Style Transfer. IEEE ACM Trans. Audio Speech Lang. Process. 30: 646-658 (2022) - [j41]Yi Lei, Shan Yang, Xinsheng Wang, Lei Xie:
MsEmoTTS: Multi-Scale Emotion Transfer, Prediction, and Control for Emotional Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 30: 853-864 (2022) - [j40]Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Lei Xie:
Cross-Speaker Emotion Disentangling and Transfer for End-to-End Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1448-1460 (2022) - [c131]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 6167-6171 - [c130]Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng:
WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition. ICASSP 2022: 6182-6186 - [c129]Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma:
Conversational Speech Recognition by Learning Conversation-Level Characteristics. ICASSP 2022: 6752-6756 - [c128]Zhichao Wang, Qicong Xie, Tao Li, Hongqiang Du, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
One-Shot Voice Conversion For Style Transfer Based On Speaker Adaptation. ICASSP 2022: 6792-6796 - [c127]Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis. ICASSP 2022: 7237-7241 - [c126]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c125]Yukai Ju, Wei Rao, Xiaopeng Yan, Yihui Fu, Shubo Lv, Luyao Cheng, Yannan Wang, Lei Xie, Shidong Shang:
TEA-PSE: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System for ICASSP 2022 DNS Challenge. ICASSP 2022: 9291-9295 - [i48]Yi Lei, Shan Yang, Xinsheng Wang, Lei Xie:
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis. CoRR abs/2201.06460 (2022) - [i47]Xiaochun An, Frank K. Soong, Lei Xie:
Disentangling Style and Speaker Attributes for TTS Style Transfer. CoRR abs/2201.09472 (2022) - [i46]Hongqiang Du, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. CoRR abs/2201.10693 (2022) - [i45]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i44]Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma:
Conversational Speech Recognition By Learning Conversation-level Characteristics. CoRR abs/2202.07855 (2022) - [i43]Qijie Shao, Jinghao Yan, Jian Kang, Pengcheng Guo, Xian Shi, Pengfei Hu, Lei Xie:
Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition. CoRR abs/2204.03398 (2022) - 2021
- [j39]Liumeng Xue, Shifeng Pan, Lei He, Lei Xie, Frank K. Soong:
Cycle consistent network for end-to-end style transfer TTS training. Neural Networks 140: 223-236 (2021) - [j38]Xiaochun An, Frank K. Soong, Shan Yang, Lei Xie:
Effective and direct control of neural TTS prosody by removing interactions between different attributes. Neural Networks 143: 250-260 (2021) - [j37]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Factorized WaveNet for voice conversion with limited data. Speech Commun. 130: 45-54 (2021) - [j36]Hang Lv, Daniel Povey, Mahsa Yarmohammadi, Ke Li, Yiming Wang, Lei Xie, Sanjeev Khudanpur:
LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation. IEEE Signal Process. Lett. 28: 703-707 (2021) - [c124]Qijie Shao, Jingyong Hou, Yanxin Hu, Qing Wang, Lei Xie, Xin Lei:
Target Speaker Extraction for Customizable Query-by-Example Keyword Spotting. APSIPA ASC 2021: 672-678 - [c123]Li Zhang, Qing Wang, Lei Xie:
Duality Temporal-Channel-Frequency Attention Enhanced Speaker Representation Learning. ASRU 2021: 206-213 - [c122]Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, Yingying Gao, Leijing Hou, Shilei Zhang:
Boundary and Context Aware Training for CIF-Based Non-Autoregressive End-to-End ASR. ASRU 2021: 328-334 - [c121]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. ICASSP 2021: 5864-5868 - [c120]Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder for Automatic Speech Recognition. ICASSP 2021: 6019-6023 - [c119]Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu:
The Multi-Speaker Multi-Style Voice Cloning Challenge 2021. ICASSP 2021: 8613-8617 - [c118]Zhichao Wang, Xinyong Zhou, Fengyu Yang, Tao Li, Hongqiang Du, Lei Xie, Wendong Gan, Haitao Chen, Hai Li:
Enriching Source Style Transfer in Recognition-Synthesis Based Non-Parallel Voice Conversion. Interspeech 2021: 831-835 - [c117]Jian Cong, Shan Yang, Lei Xie, Dan Su:
Glow-WaveGAN: Learning Speech Representations from GAN-Based Variational Auto-Encoder for High Fidelity Flow-Based Speech Synthesis. Interspeech 2021: 2182-2186 - [c116]Pengcheng Guo, Xuankai Chang, Shinji Watanabe, Lei Xie:
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain. Interspeech 2021: 3720-3724 - [c115]Jian Cong, Shan Yang, Na Hu, Guangzhi Li, Lei Xie, Dan Su:
Controllable Context-Aware Conversational Speech Synthesis. Interspeech 2021: 4658-4662 - [c114]Xiaochun An, Frank K. Soong, Lei Xie:
Improving Performance of Seen and Unseen Speech Style Transfer in End-to-End Neural TTS. Interspeech 2021: 4688-4692 - [c113]Tao Li, Shan Yang, Liumeng Xue, Lei Xie:
Controllable Emotion Transfer For End-to-End Speech Synthesis. ISCSLP 2021: 1-5 - [c112]Zhichao Wang, Wenshuo Ge, Xiong Wang, Shan Yang, Wendong Gan, Haitao Chen, Hai Li, Lei Xie, Xiulin Li:
Accent and Speaker Disentanglement in Many-to-many Voice Conversion. ISCSLP 2021: 1-5 - [c111]Qing Wang, Wei Rao, Pengcheng Guo, Lei Xie:
Adversarial Training for Multi-domain Speaker Recognition. ISCSLP 2021: 1-5 - [c110]Kun Wei, Pengcheng Guo, Hang Lv, Zhen Tu, Lei Xie:
Context-aware RNNLM Rescoring for Conversational Speech Recognition. ISCSLP 2021: 1-5 - [c109]Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He, Lei Xie:
Conversational End-to-End TTS for Voice Agents. SLT 2021: 403-409 - [c108]Yi Lei, Shan Yang, Lei Xie:
Fine-Grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis. SLT 2021: 423-430 - [c107]Geng Yang, Shan Yang, Kai Liu, Peng Fang, Wei Chen, Lei Xie:
Multi-Band Melgan: Faster Waveform Generation For High-Quality Text-To-Speech. SLT 2021: 492-498 - [c106]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Optimizing Voice Conversion Network with Cycle Consistency Loss of Speaker Identity. SLT 2021: 507-513 - [c105]Heyang Xue, Shan Yang, Yi Lei, Lei Xie, Xiulin Li:
Learn2Sing: Target Speaker Singing Voice Synthesis by Learning from a Singing Teacher. SLT 2021: 522-529 - [i42]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Streaming Transformers. CoRR abs/2102.04488 (2021) - [i41]Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
An Asynchronous WFST-Based Decoder For Automatic Speech Recognition. CoRR abs/2103.09063 (2021) - [i40]Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, Yingying Gao, Leijing Hou, Shilei Zhang:
Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR. CoRR abs/2104.04702 (2021) - [i39]Pengcheng Guo, Xuankai Chang, Shinji Watanabe, Lei Xie:
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain. CoRR abs/2106.08595 (2021) - [i38]Zhichao Wang, Xinyong Zhou, Fengyu Yang, Tao Li, Hongqiang Du, Lei Xie, Wendong Gan, Haitao Chen, Hai Li:
Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion. CoRR abs/2106.08741 (2021) - [i37]Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li:
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification. CoRR abs/2106.09320 (2021) - [i36]Xiaochun An, Frank K. Soong, Lei Xie:
Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS. CoRR abs/2106.10003 (2021) - [i35]Hongqiang Du, Lei Xie:
Improving robustness of one-shot voice conversion with deep discriminative speaker encoder. CoRR abs/2106.10406 (2021) - [i34]Jian Cong, Shan Yang, Na Hu, Guangzhi Li, Lei Xie, Dan Su:
Controllable Context-aware Conversational Speech Synthesis. CoRR abs/2106.10828 (2021) - [i33]Jian Cong, Shan Yang, Lei Xie, Dan Su:
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis. CoRR abs/2106.10831 (2021) - [i32]Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Lei Xie:
Controllable cross-speaker emotion transfer for end-to-end speech synthesis. CoRR abs/2109.06733 (2021) - [i31]Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng:
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition. CoRR abs/2110.03370 (2021) - [i30]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. CoRR abs/2110.07393 (2021) - [i29]Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis. CoRR abs/2110.08813 (2021) - [i28]Zhichao Wang, Qicong Xie, Tao Li, Hongqiang Du, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
One-shot Voice Conversion For Style Transfer Based On Speaker Adaptation. CoRR abs/2111.12277 (2021) - [i27]Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie, Guoqiao Yu, Guanglu Wan:
Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios. CoRR abs/2112.12743 (2021) - 2020
- [j35]Shan Yang, Heng Lu, Shiyin Kang, Liumeng Xue, Jinba Xiao, Dan Su, Lei Xie, Dong Yu:
On the localness modeling for the self-attention based end-to-end speech synthesis. Neural Networks 125: 121-130 (2020) - [j34]Shan Yang, Yuxuan Wang, Lei Xie:
Adversarial Feature Learning and Unsupervised Clustering Based Speech Synthesis for Found Data With Acoustic and Textual Noise. IEEE Signal Process. Lett. 27: 1730-1734 (2020) - [j33]Chenggang Mi, Lei Xie, Yanning Zhang:
Loanword Identification in Low-Resource Languages with Minimal Supervision. ACM Trans. Asian Low Resour. Lang. Inf. Process. 19(3): 43:1-43:22 (2020) - [j32]Yougen Yuan, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Bin Ma:
Fast Query-by-Example Speech Search Using Attention-Based Deep Binary Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1988-2000 (2020) - [j31]Chenggang Mi, Lei Xie, Yanning Zhang:
Improving Adversarial Neural Machine Translation for Morphologically Rich Language. IEEE Trans. Emerg. Top. Comput. Intell. 4(4): 417-426 (2020) - [c104]Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li:
Time-Domain Neural Network Approach for Speech Bandwidth Extension. ICASSP 2020: 866-870 - [c103]Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie:
Mining Effective Negative Training Samples for Keyword Spotting. ICASSP 2020: 7444-7448 - [c102]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Effective Wavenet Adaptation for Voice Conversion with Limited Data. ICASSP 2020: 7779-7783 - [c101]Jian Cong, Shan Yang, Lei Xie, Guoqiao Yu, Guanglu Wan:
Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training. INTERSPEECH 2020: 811-815 - [c100]Fengyu Yang, Shan Yang, Qinghua Wu, Yujun Wang, Lei Xie:
Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis. INTERSPEECH 2020: 3436-3440 - [c99]Qing Wang, Pengcheng Guo, Lei Xie:
Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition. INTERSPEECH 2020: 4228-4232 - [c98]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Alignment-Free Lattice-Free MMI. INTERSPEECH 2020: 4258-4262 - [c97]Jing Shi, Xuankai Chang, Pengcheng Guo, Shinji Watanabe, Yusuke Fujita, Jiaming Xu, Bo Xu, Lei Xie:
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals. NeurIPS 2020 - [i26]Shan Yang, Yuxuan Wang, Lei Xie:
Adversarial Feature Learning and Unsupervised Clustering based Speech Synthesis for Found Data with Acoustic and Textual Noise. CoRR abs/2004.13595 (2020) - [i25]Geng Yang, Shan Yang, Kai Liu, Peng Fang, Wei Chen, Lei Xie:
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech. CoRR abs/2005.05106 (2020) - [i24]Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur:
Wake Word Detection with Alignment-Free Lattice-Free MMI. CoRR abs/2005.08347 (2020) - [i23]Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He, Lei Xie:
Conversational End-to-End TTS for Voice Agent. CoRR abs/2005.10438 (2020) - [i22]Qing Wang, Pengcheng Guo, Lei Xie:
Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition. CoRR abs/2005.10637 (2020) - [i21]Jing Shi, Xuankai Chang, Pengcheng Guo, Shinji Watanabe, Yusuke Fujita, Jiaming Xu, Bo Xu, Lei Xie:
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals. CoRR abs/2006.14150 (2020) - [i20]Fengyu Yang, Shan Yang, Qinghua Wu, Yujun Wang, Lei Xie:
Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis. CoRR abs/2008.00613 (2020) - [i19]Jian Cong, Shan Yang, Lei Xie, Guoqiao Yu, Guanglu Wan:
Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training. CoRR abs/2008.04265 (2020) - [i18]Heyang Xue, Shan Yang, Yi Lei, Lei Xie, Xiulin Li:
Learn2Sing: Target Speaker Singing Voice Synthesis by learning from a Singing Teacher. CoRR abs/2011.08467 (2020) - [i17]Yi Lei, Shan Yang, Lei Xie:
Fine-grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis. CoRR abs/2011.08477 (2020) - [i16]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Optimizing voice conversion network with cycle consistency loss of speaker identity. CoRR abs/2011.08548 (2020) - [i15]Zhichao Wang, Wenshuo Ge, Xiong Wang, Shan Yang, Wendong Gan, Haitao Chen, Hai Li, Lei Xie, Xiulin Li:
Accent and Speaker Disentanglement in Many-to-many Voice Conversion. CoRR abs/2011.08609 (2020) - [i14]Qing Wang, Wei Rao, Pengcheng Guo, Lei Xie:
Adversarial Training for Multi-domain Speaker Recognition. CoRR abs/2011.08623 (2020) - [i13]Tao Li, Shan Yang, Liumeng Xue, Lei Xie:
Controllable Emotion Transfer For End-to-End Speech Synthesis. CoRR abs/2011.08679 (2020) - [i12]Kun Wei, Pengcheng Guo, Hang Lv, Zhen Tu, Lei Xie:
Context-aware RNNLM Rescoring for Conversational Speech Recognition. CoRR abs/2011.09301 (2020) - [i11]Haohan Guo, Heng Lu, Na Hu, Chunlei Zhang, Shan Yang, Lei Xie, Dan Su, Dong Yu:
Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training. CoRR abs/2012.01837 (2020)
2010 – 2019
- 2019
- [j30]Xiaolian Zhu, Yuchao Zhang, Shan Yang, Liumeng Xue, Lei Xie:
Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-to-End Speech Synthesis. IEEE Access 7: 65955-65964 (2019) - [j29]Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma:
Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context. IEEE Access 7: 67656-67665 (2019) - [j28]Jingyong Hou, Yangyang Shi, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie:
Region Proposal Network Based Small-Footprint Keyword Spotting. IEEE Signal Process. Lett. 26(10): 1471-1475 (2019) - [j27]Sining Sun, Pengcheng Guo, Lei Xie, Mei-Yuh Hwang:
Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1826-1838 (2019) - [c96]Sining Sun, Shuran Zhou, Mei-Yuh Hwang, Lei Xie, Qin Li, Xin Lei:
Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition. APSIPA 2019: 633-637 - [c95]Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur:
Incremental Lattice Determinization for WFST Decoders. ASRU 2019: 1-7 - [c94]Yiming Wang, Sanjeev Khudanpur, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe:
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit. ASRU 2019: 136-143 - [c93]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
WaveNet Factorization with Singular Value Decomposition for Voice Conversion. ASRU 2019: 152-159 - [c92]Xiaochun An, Yuxuan Wang, Shan Yang, Zejun Ma, Lei Xie:
Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis. ASRU 2019: 184-191 - [c91]Xiaolian Zhu, Shan Yang, Geng Yang, Lei Xie:
Controlling Emotion Strength with Relative Attribute for End-to-End Speech Synthesis. ASRU 2019: 192-199 - [c90]Fengyu Yang, Shan Yang, Pengcheng Zhu, Pengju Yan, Lei Xie:
Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias. ASRU 2019: 208-213 - [c89]Xiong Wang, Sining Sun, Lei Xie:
Virtual Adversarial Training for DS-CNN Based Small-Footprint Keyword Spotting. ASRU 2019: 607-612 - [c88]Yougen Yuan, Zhiqiang Lv, Shen Huang, Lei Xie:
Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings. ASRU 2019: 613-620 - [c87]Ke Wang, Frank K. Soong, Lei Xie:
A Pitch-aware Approach to Single-channel Speech Separation. ICASSP 2019: 296-300 - [c86]Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, Lei Xie:
Component Fusion: Learning Replaceable Language Model Component for End-to-end Speech Recognition System. ICASSP 2019: 5631-5635 - [c85]Changhao Shan, Chao Weng, Guangsen Wang, Dan Su, Min Luo, Dong Yu, Lei Xie:
Investigating End-to-end Speech Recognition for Mandarin-english Code-switching. ICASSP 2019: 6056-6060 - [c84]Xiong Wang, Sining Sun, Changhao Shan, Jingyong Hou, Lei Xie, Shen Li, Xin Lei:
Adversarial Examples for Improving End-to-end Attention-based Small-footprint Keyword Spotting. ICASSP 2019: 6366-6370 - [c83]Xiang Hao, Changhao Shan, Yong Xu, Sining Sun, Lei Xie:
An Attention-based Neural Network Approach for Single Channel Speech Enhancement. ICASSP 2019: 6895-6899 - [c82]Shan Yang, Heng Lu, Shiying Kang, Lei Xie, Dong Yu:
Enhancing Hybrid Self-attention Structure with Relative-position-aware Bias for Speech Synthesis. ICASSP 2019: 6910-6914 - [c81]Jingyong Hou, Pengcheng Guo, Sining Sun, Frank K. Soong, Wenping Hu, Lei Xie:
Domain Adversarial Training for Improving Keyword Spotting Performance of ESL Speech. ICASSP 2019: 8122-8126 - [c80]Yougen Yuan, Wei Tang, Minhao Fan, Yue Cao, Peng Zhang, Lei Xie:
Deep Audio-visual System for Closed-set Word-level Speech Recognition. ICMI 2019: 540-545 - [c79]Pengcheng Guo, Sining Sun, Lei Xie:
Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition. INTERSPEECH 2019: 749-753 - [c78]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
A New GAN-Based End-to-End TTS Training Algorithm. INTERSPEECH 2019: 1288-1292 - [c77]Qing Wang, Pengcheng Guo, Sining Sun, Lei Xie, John H. L. Hansen:
Adversarial Regularization for End-to-End Robust Speaker Verification. INTERSPEECH 2019: 4010-4014 - [c76]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS. INTERSPEECH 2019: 4460-4464 - [i10]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS. CoRR abs/1904.04764 (2019) - [i9]Haohan Guo, Frank K. Soong, Lei He, Lei Xie:
A New GAN-based End-to-End TTS Training Algorithm. CoRR abs/1904.04775 (2019) - [i8]