


default search action
Jixun Yao
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [c19]Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, Jie Ding, Lei Xie:
Drop the Beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation. AAAI 2025: 24966-24974 - [c18]Jixun Yao, Yuguang Yang, Yu Pan, Ziqian Ning, Jianhao Ye, Hongbin Zhou, Lei Xie:
StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching. AAAI 2025: 25669-25677 - [c17]Jixun Yao, Hexin Liu, Chen Chen, Yuchen Hu, EngSiong Chng, Lei Xie:
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling. ICLR 2025 - [i24]Qing Wang, Jixun Yao, Zhaokai Sun, Pengcheng Guo, Lei Xie, John H. L. Hansen:
DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification. CoRR abs/2501.05127 (2025) - [i23]Jixun Yao, Hexin Liu, Chen Chen, Yuchen Hu, Chng Eng Siong, Lei Xie:
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling. CoRR abs/2502.02942 (2025) - [i22]Jixun Yao, Yuguang Yang, Yu Pan, Yuan Feng, Ziqian Ning, Jianhao Ye, Hongbin Zhou, Lei Xie:
Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech. CoRR abs/2502.02950 (2025) - 2024
- [j2]Jixun Yao
, Qing Wang
, Pengcheng Guo
, Ziqian Ning, Lei Xie
:
Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2944-2956 (2024) - [c16]Yu Pan, Yanni Hu, Yuguang Yang, Wen Fei, Jixun Yao, Heng Lu, Lei Ma, Jianjun Zhao:
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition. ICASSP 2024: 10021-10025 - [c15]Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie:
Promptvc: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts. ICASSP 2024: 10571-10575 - [c14]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
Dualvc 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. ICASSP 2024: 11106-11110 - [c13]Kangxiang Xia, Dake Guo, Jixun Yao, Liumeng Xue, Hanzhao Li, Shuai Wang, Zhao Guo, Lei Xie, Qingqing Zhang, Lei Luo, Minghui Dong, Peng Sun:
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings. ISCSLP 2024: 506-510 - [c12]Dake Guo, Jixun Yao, Xinfa Zhu, Kangxiang Xia, Zhao Guo, Ziyu Zhang, Yao Wang, Jie Liu, Lei Xie:
The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge. ISCSLP 2024: 616-620 - [i21]Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, Jie Ding, Lei Xie:
Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation. CoRR abs/2408.15474 (2024) - [i20]Sijing Chen, Yuan Feng, Laipeng He, Tianwei He, Wendi He, Yanni Hu, Bin Lin, Yiting Lin, Yu Pan, Pengfei Tan, Chengwei Tian, Chen Wang, Zhicheng Wang, Ruoye Xie, Jixun Yao, Quanlei Yan, Yuguang Yang, Jianhao Ye, Jingjing Yin, Yanzhen Yu, Huimin Zhang, Xiang Zhang, Guangcheng Zhao, Hongbin Zhou, Pengpeng Zou:
Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models. CoRR abs/2409.12139 (2024) - [i19]Yuguang Yang, Yu Pan, Jixun Yao, Xiang Zhang, Jianhao Ye, Hongbin Zhou, Lei Xie, Lei Ma, Jianjun Zhao:
Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling. CoRR abs/2410.01350 (2024) - [i18]Nikita Kuzmin, Hieu-Thi Luong, Jixun Yao, Lei Xie, Kong Aik Lee, Eng Siong Chng:
NTU-NPU System for Voice Privacy 2024 Challenge. CoRR abs/2410.02371 (2024) - [i17]Dake Guo, Jixun Yao, Xinfa Zhu, Kangxiang Xia, Zhao Guo, Ziyu Zhang, Yao Wang, Jie Liu, Lei Xie:
The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge. CoRR abs/2410.23815 (2024) - [i16]Kangxiang Xia, Dake Guo, Jixun Yao, Liumeng Xue, Hanzhao Li, Shuai Wang, Zhao Guo, Lei Xie, Qingqing Zhang, Lei Luo, Minghui Dong, Peng Sun:
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings. CoRR abs/2411.00064 (2024) - [i15]Yu Pan, Yuguang Yang, Jixun Yao, Jianhao Ye, Hongbin Zhou, Lei Ma, Jianjun Zhao:
CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching. CoRR abs/2411.02026 (2024) - [i14]Yuke Li, Xinfa Zhu, Hanzhao Li, Jixun Yao, Wenjie Tian, XiPeng Yang, YunLin Chen, Zhifei Li, Lei Xie:
CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion. CoRR abs/2411.18918 (2024) - [i13]Jixun Yao, Yuguang Yang, Yu Pan, Ziqian Ning, Jiaohao Ye, Hongbin Zhou, Lei Xie:
StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching. CoRR abs/2412.04724 (2024) - 2023
- [j1]Qing Wang
, Jixun Yao
, Li Zhang
, Pengcheng Guo
, Lei Xie
:
Timbre-Reserved Adversarial Attack in Speaker Identification. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3848-3858 (2023) - [c11]Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su:
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis. AAAI 2023: 13025-13033 - [c10]Yuanjun Lv, Jixun Yao, Peikun Chen, Hongbin Zhou, Heng Lu, Lei Xie:
Salt: Distinguishable Speaker Anonymization Through Latent Space Transformation. ASRU 2023: 1-8 - [c9]Ziqian Wang, Qing Wang, Jixun Yao, Lei Xie:
The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge. DADA@IJCAI 2023: 64-69 - [c8]Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi:
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features. ICASSP 2023: 1-5 - [c7]Jixun Yao
, Yi Lei, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie, Hai Li, Junhui Liu, Danming Xie:
Preserving Background Sound in Noise-Robust Voice Conversion Via Multi-Task Learning. ICASSP 2023: 1-5 - [c6]Jixun Yao
, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, Jie Liu:
Distinguishable Speaker Anonymization Based on Formant and Fundamental Frequency Scaling. ICASSP 2023: 1-5 - [c5]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. INTERSPEECH 2023: 2063-2067 - [c4]Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie:
Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification. INTERSPEECH 2023: 3994-3998 - [c3]Guofeng Yi
, Yuguang Yang
, Yu Pan
, Yuhang Cao
, Jixun Yao
, Xiang Lv
, Cunhang Fan
, Zhao Lv, Jianhua Tao, Shan Liang
, Heng Lu
:
Exploring the Power of Cross-Contextual Large Language Model in Mimic Emotion Prediction. MuSe@ACM Multimedia 2023: 19-26 - [i12]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. CoRR abs/2305.12425 (2023) - [i11]Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie:
Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification. CoRR abs/2305.19020 (2023) - [i10]Yu Pan, Yanni Hu, Yuguang Yang, Jixun Yao, Wen Fei, Lei Ma, Heng Lu:
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition. CoRR abs/2306.07848 (2023) - [i9]Qing Wang, Jixun Yao, Li Zhang, Pengcheng Guo, Lei Xie:
Timbre-reserved Adversarial Attack in Speaker Identification. CoRR abs/2309.00929 (2023) - [i8]Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie:
PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts. CoRR abs/2309.09262 (2023) - [i7]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. CoRR abs/2309.15496 (2023) - [i6]Yuanjun Lv, Jixun Yao, Peikun Chen, Hongbin Zhou, Heng Lu, Lei Xie:
SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation. CoRR abs/2310.05051 (2023) - 2022
- [c2]Renmingyue Du, Jixun Yao
:
High Quality and Similarity One-Shot Voice Conversion Using End-to-End Model. CSAI 2022: 284-288 - [i5]Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, Lei Xie:
NWPU-ASLP System for the VoicePrivacy 2022 Challenge. CoRR abs/2209.11969 (2022) - [i4]Jixun Yao, Yi Lei, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie, Hai Li, Junhui Liu, Danming Xie:
Preserving background sound in noise-robust voice conversion via multi-task learning. CoRR abs/2211.03036 (2022) - [i3]Jixun Yao, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, Jie Liu:
Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling. CoRR abs/2211.03038 (2022) - [i2]Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi:
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features. CoRR abs/2211.04710 (2022) - [i1]Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su:
UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis. CoRR abs/2212.01546 (2022) - 2020
- [c1]Jixun Yao, Xiaoan Li, Dengshan Huang:
A Reward Shaping Method based on Meta-LSTM for Continuous Control of Robot. CSAI 2020: 122-127
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-05-15 22:36 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint