default search action
Zexin Cai
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j3]Zexin Cai, Ming Li:
Integrating frame-level boundary detection and deepfake detection for locating manipulated regions in partially spoofed audio forgery attacks. Comput. Speech Lang. 85: 101597 (2024) - [c15]Zexin Cai, Ming Li:
Invertible Voice Conversion with Parallel Data. ICASSP 2024: 10041-10045 - [i15]Danwei Cai, Zexin Cai, Ming Li:
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning. CoRR abs/2401.01473 (2024) - [i14]Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization. CoRR abs/2409.03655 (2024) - [i13]Henry Li Xinyuan, Zexin Cai, Ashi Garg, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
HLTCOE JHU Submission to the Voice Privacy Challenge 2024. CoRR abs/2409.08913 (2024) - 2023
- [j2]Yaogen Yang, Haozhe Zhang, Zexin Cai, Yao Shi, Ming Li, Dong Zhang, Xiaojun Ding, Jianhua Deng, Jie Wang:
Electrolaryngeal speech enhancement based on a two stage framework with bottleneck feature refinement and voice conversion. Biomed. Signal Process. Control. 80(Part): 104279 (2023) - [j1]Zexin Cai, Yaogen Yang, Ming Li:
Cross-lingual multi-speaker speech synthesis with limited bilingual training data. Comput. Speech Lang. 77: 101427 (2023) - [c14]Danwei Cai, Zexin Cai, Ming Li:
Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification Systems. ICASSP 2023: 1-5 - [c13]Zexin Cai, Weiqing Wang, Ming Li:
Waveform Boundary Detection for Partially Spoofed Audio. ICASSP 2023: 1-5 - [i12]Zexin Cai, Weiqing Wang, Yikang Wang, Ming Li:
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023. CoRR abs/2308.10281 (2023) - 2022
- [c12]Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li:
SIG-VC: A Speaker Information Guided Zero-Shot Voice Conversion System for Both Human Beings and Machines. ICASSP 2022: 6567-65571 - [i11]Zexin Cai, Ming Li:
Invertible Voice Conversion. CoRR abs/2201.10687 (2022) - [i10]Danwei Cai, Zexin Cai, Ming Li:
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems. CoRR abs/2206.09103 (2022) - [i9]Zexin Cai, Weiqing Wang, Ming Li:
Waveform Boundary Detection for Partially Spoofed Audio. CoRR abs/2211.00226 (2022) - 2021
- [i8]Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li:
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines. CoRR abs/2111.03811 (2021) - 2020
- [c11]Zexin Cai, Ming Li:
The Duke Entry for 2020 Blizzard Challenge. Blizzard Challenge / Voice Conversion Challenge 2020 - [c10]Zexin Cai, Chuxiong Zhang, Ming Li:
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint. INTERSPEECH 2020: 3974-3978 - [i7]Zexin Cai, Chuxiong Zhang, Ming Li:
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint. CoRR abs/2005.04587 (2020) - [i6]Zexin Cai, Yaogen Yang, Ming Li:
Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario. CoRR abs/2005.10441 (2020) - [i5]Yan Jia, Zexin Cai, Murong Ma, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li:
Training Wake Word Detection with Synthesized Speech Data on Confusion Words. CoRR abs/2011.01460 (2020)
2010 – 2019
- 2019
- [c9]Zexin Cai, Chuxiong Zhang, Yaogen Yang, Ming Li:
The DKU Speech Synthesis System for 2019 Blizzard Challenge. Blizzard Challenge 2019 - [c8]Zexin Cai, Zhicheng Xu, Ming Li:
F0 Contour Estimation Using Phonetic Feature in Electrolaryngeal Speech Enhancement. ICASSP 2019: 6490-6494 - [c7]Zexin Cai, Yaogen Yang, Chuxiong Zhang, Xiaoyi Qin, Ming Li:
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-Level Embedding Features. INTERSPEECH 2019: 2110-2114 - [i4]Zexin Cai, Yaogen Yang, Chuxiong Zhang, Xiaoyi Qin, Ming Li:
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features. CoRR abs/1907.01749 (2019) - 2018
- [c6]Danwei Cai, Zexin Cai, Ming Li:
Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition. APSIPA 2018: 1478-1482 - [c5]Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li:
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification. ICASSP 2018: 5189-5193 - [c4]Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li:
Insights in-to-End Learning Scheme for Language Identification. ICASSP 2018: 5209-5213 - [c3]Haiwei Wu, Ming Li, Zexin Cai, Haibin Zhong:
Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. ISCSLP 2018: 1-5 - [c2]Zexin Cai, Xiaoyi Qin, Danwei Cai, Ming Li, Xinzhong Liu, Haibin Zhong:
The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion. ISCSLP 2018: 235-239 - [c1]Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:
End-to-end Language Identification using NetFV and NetVLAD. ISCSLP 2018: 319-323 - [i3]Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li:
Insights into End-to-End Learning Scheme for Language Identification. CoRR abs/1804.00381 (2018) - [i2]Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li:
A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification. CoRR abs/1804.00385 (2018) - [i1]Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:
End-to-end Language Identification using NetFV and NetVLAD. CoRR abs/1809.02906 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 21:14 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint