


Остановите войну!
for scientists:


default search action
Kong-Aik Lee
Kong Aik Lee
Person information

- affiliation: Institute for Infocomm Research, Singapore
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2022
- [j30]Hongning Zhu
, Kong Aik Lee
, Haizhou Li
:
Discriminative speaker embedding with serialized multi-layer multi-head attention. Speech Commun. 144: 89-100 (2022) - [j29]Tianchi Liu
, Rohan Kumar Das
, Kong Aik Lee
, Haizhou Li
:
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022) - [c128]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
A Randomized Link Transformer for Diverse Open-Domain Dialogue Generation. ConvAI@ACL 2022: 1-11 - [c127]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation. ICAART (2) 2022: 193-202 - [c126]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Speaker Recognition with Loss-Gated Learning. ICASSP 2022: 6142-6146 - [c125]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents. ICASSP 2022: 7052-7056 - [c124]Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang, Hui Chen:
Learning Domain-Invariant Transformation for Speaker Verification. ICASSP 2022: 7177-7181 - [c123]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances. ICASSP 2022: 7517-7521 - [c122]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c121]Qiongqiong Wang, Kong Aik Lee, Tianchi Liu:
Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA? INTERSPEECH 2022: 600-604 - [c120]Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng
, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang:
Deep Spectro-temporal Artifacts for Detecting Synthesized Speech. DDAM@MM 2022: 69-75 - [c119]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. Odyssey 2022: 330-337 - [c118]Lin Li
, Kaixi Hu
, Turghun Tayir, Jianquan Liu, Kong Aik Lee:
Noise-Robust Semi-supervised Multi-modal Machine Translation. PRICAI (2) 2022: 155-168 - [i41]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances. CoRR abs/2202.01624 (2022) - [i40]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i39]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents. CoRR abs/2202.05971 (2022) - [i38]Qiongqiong Wang, Kong Aik Lee, Tianchi Liu:
Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA? CoRR abs/2204.03965 (2022) - [i37]Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans:
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. CoRR abs/2204.09976 (2022) - [i36]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. CoRR abs/2208.08042 (2022) - [i35]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. CoRR abs/2210.02437 (2022) - [i34]Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang:
Deep Spectro-temporal Artifacts for Detecting Synthesized Speech. CoRR abs/2210.05254 (2022) - [i33]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs. CoRR abs/2210.15385 (2022) - [i32]Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li:
Speaker recognition with two-step multi-modal deep cleansing. CoRR abs/2210.15903 (2022) - [i31]Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022) - 2021
- [j28]Meng Liu, Longbiao Wang, Jianwu Dang, Kong Aik Lee
, Seiichi Nakagawa:
Replay attack detection using variable-frequency resolution phase and magnitude features. Comput. Speech Lang. 66: 101161 (2021) - [j27]Kong Aik Lee
, Ville Vestman
, Tomi Kinnunen
:
ASVtorch toolkit: Speaker verification with deep neural networks. SoftwareX 14: 100697 (2021) - [j26]Kong Aik Lee
, Qiongqiong Wang, Takafumi Koshinaka
:
Xi-Vector Embedding for Speaker Recognition. IEEE Signal Process. Lett. 28: 1385-1389 (2021) - [j25]Andreas Nautsch
, Xin Wang
, Nicholas W. D. Evans, Tomi H. Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado
, Md. Sahidullah, Junichi Yamagishi
, Kong Aik Lee
:
ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech. IEEE Trans. Biom. Behav. Identity Sci. 3(2): 252-265 (2021) - [c117]Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li:
PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction. ASRU 2021: 106-113 - [c116]Meng Liu, Longbiao Wang, Kong Aik Lee, Hanyi Zhang, Chang Zeng
, Jianwu Dang:
DeepLip: A Benchmark for Deep Learning-Based Audio-Visual Lip Biometrics. ASRU 2021: 122-129 - [c115]Qiongqiong Wang, Kong Aik Lee
, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto:
Task-aware Warping Factors in Mask-based Speech Enhancement. EUSIPCO 2021: 476-480 - [c114]Lin Li, Kaixi Hu, Yunpei Zheng, Jianquan Liu, Kong Aik Lee
:
COOPNet: Multi-Modal Cooperative Gender Prediction in Social Media User Profiling. ICASSP 2021: 4310-4314 - [c113]Hanyi Zhang, Longbiao Wang, Kong Aik Lee
, Meng Liu, Jianwu Dang, Hui Chen:
Meta-Learning for Cross-Channel Speaker Verification. ICASSP 2021: 5839-5843 - [c112]Meng Liu, Longbiao Wang, Kong Aik Lee
, Xuanda Chen, Jianwu Dang:
Replay-Attack Detection Using Features With Adaptive Spectro-Temporal Resolution. ICASSP 2021: 6374-6378 - [c111]Hongning Zhu, Kong Aik Lee
, Haizhou Li:
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding. Interspeech 2021: 106-110 - [c110]Yibo Wu, Longbiao Wang, Kong Aik Lee
, Meng Liu, Jianwu Dang:
Joint Feature Enhancement and Speaker Recognition with Multi-Objective Task-Oriented Network. Interspeech 2021: 1089-1093 - [c109]Li Zhang, Qing Wang, Kong Aik Lee
, Lei Xie, Haizhou Li:
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification. Interspeech 2021: 1094-1098 - [c108]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee
:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. Interspeech 2021: 4299-4303 - [i30]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech. CoRR abs/2102.05889 (2021) - [i29]Meng Liu, Longbiao Wang, Kong Aik Lee, Hanyi Zhang, Chang Zeng, Jianwu Dang:
Exploring Deep Learning for Joint Audio-Visual Lip Biometrics. CoRR abs/2104.08510 (2021) - [i28]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. CoRR abs/2106.06362 (2021) - [i27]Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li:
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification. CoRR abs/2106.09320 (2021) - [i26]Hongning Zhu, Kong Aik Lee, Haizhou Li:
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding. CoRR abs/2107.06493 (2021) - [i25]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
Generating Personalized Dialogue via Multi-Task Meta-Learning. CoRR abs/2108.03377 (2021) - [i24]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
Xi-Vector Embedding for Speaker Recognition. CoRR abs/2108.05679 (2021) - [i23]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto:
Task-aware Warping Factors in Mask-based Speech Enhancement. CoRR abs/2108.12128 (2021) - [i22]Jean-François Bonastre, Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noé, Jose Patino, Md. Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia A. Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi:
Benchmarking and challenges in security and privacy for voice biometrics. CoRR abs/2109.00281 (2021) - [i21]Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Jose Patino, Md. Sahidullah, Massimiliano Todisco, Xin Wang, Junichi Yamagishi:
ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan. CoRR abs/2109.00535 (2021) - [i20]Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md. Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas W. D. Evans, Héctor Delgado:
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection. CoRR abs/2109.00537 (2021) - [i19]Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li:
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction. CoRR abs/2110.00940 (2021) - [i18]Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan:
DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation. CoRR abs/2111.11363 (2021) - 2020
- [j24]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee
:
Voice biometrics security: Extrapolating false alarm rate via hierarchical Bayesian modeling of speaker verification scores. Comput. Speech Lang. 60 (2020) - [j23]Kong Aik Lee
, Hitoshi Yamamoto, Koji Okabe, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda:
NEC-TT System for Mixed-Bandwidth and Multi-Domain Speaker Recognition. Comput. Speech Lang. 61: 101033 (2020) - [j22]Kong Aik Lee
, Seyed Omid Sadjadi, Haizhou Li
, Douglas A. Reynolds:
Two decades into Speaker Recognition Evaluation - are we there yet? Comput. Speech Lang. 61: 101058 (2020) - [j21]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee
, Lauri Juvela
, Paavo Alku
, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao
, Hsin-Min Wang
, Sébastien Le Maguer
, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020) - [j20]Ivan Kukanov
, Trung Ngo Trong, Ville Hautamäki
, Sabato Marco Siniscalchi
, Valerio Mario Salerno, Kong Aik Lee
:
Maximal Figure-of-Merit Framework to Detect Multi-Label Phonetic Features for Spoken Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 682-695 (2020) - [j19]Tomi Kinnunen
, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee
, Ville Vestman
, Andreas Nautsch
, Massimiliano Todisco, Xin Wang
, Md. Sahidullah
, Junichi Yamagishi
, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2195-2210 (2020) - [c107]Qiongqiong Wang, Koji Okabe, Kong Aik Lee
, Takafumi Koshinaka:
A Generalized Framework for Domain Adaptation of PLDA in Speaker Recognition. ICASSP 2020: 6619-6623 - [c106]Dao Zhou, Longbiao Wang, Kong Aik Lee
, Meng Liu, Jianwu Dang:
Deep Discriminative Embedding with Ranked Weight for Speaker Verification. ICONIP (5) 2020: 79-86 - [c105]Hossein Zeinali, Kong Aik Lee
, Jahangir Alam, Lukás Burget
:
SdSV Challenge 2020: Large-Scale Evaluation of Short-Duration Speaker Verification. INTERSPEECH 2020: 731-735 - [c104]Hanyi Zhang, Longbiao Wang, Yunchun Zhang, Meng Liu, Kong Aik Lee
, Jianguo Wei:
Adversarial Separation Network for Speaker Recognition. INTERSPEECH 2020: 951-955 - [c103]Kosuke Akimoto, Seng Pei Liew, Sakiko Mishima, Ryo Mizushima, Kong Aik Lee
:
POCO: A Voice Spoofing and Liveness Detection Corpus Based on Pop Noise. INTERSPEECH 2020: 1081-1085 - [c102]Kong Aik Lee
, Koji Okabe, Hitoshi Yamamoto, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Keisuke Ishikawa, Koichi Shinoda:
NEC-TT Speaker Verification System for SRE'19 CTS Challenge. INTERSPEECH 2020: 2227-2231 - [c101]Dao Zhou, Longbiao Wang, Kong Aik Lee
, Yibo Wu, Meng Liu, Jianwu Dang, Jianguo Wei:
Dynamic Margin Softmax Loss for Speaker Verification. INTERSPEECH 2020: 3800-3804 - [c100]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee
:
Extrapolating False Alarm Rates in Automatic Speaker Verification. INTERSPEECH 2020: 4218-4222 - [c99]Ville Vestman, Kong Aik Lee, Tomi Kinnunen:
Neural i-vectors. Odyssey 2020: 67-74 - [c98]Liping Chen, Kong-Aik Lee, Lei He, Frank K. Soong:
On Early-stop Clustering for Speaker Diarization. Odyssey 2020: 110-116 - [c97]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka:
Using Multi-Resolution Feature Maps with Convolutional Neural Networks for Anti-Spoofing in ASV. Odyssey 2020: 138-142 - [c96]Leibny Paola García-Perera, Jesús Villalba
, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak
:
Speaker Detection in the Wild: Lessons Learned from JSALT 2019. Odyssey 2020: 415-422 - [e2]Kong-Aik Lee, Takafumi Koshinaka, Koichi Shinoda:
Odyssey 2020: The Speaker and Language Recognition Workshop, 1-5 November 2020, Tokyo, Japan. ISCA 2020 [contents] - [i17]Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen:
Neural i-vectors. CoRR abs/2004.01559 (2020) - [i16]Tomi Kinnunen, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. CoRR abs/2007.05979 (2020) - [i15]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee:
Extrapolating false alarm rates in automatic speaker verification. CoRR abs/2008.03590 (2020) - [i14]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka:
A Generalized Framework for Domain Adaptation of PLDA in Speaker Recognition. CoRR abs/2008.08815 (2020) - [i13]Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka:
Using Multi-Resolution Feature Maps with Convolutional Neural Networks for Anti-Spoofing in ASV. CoRR abs/2008.08865 (2020)
2010 – 2019
- 2019
- [c95]Kong Aik Lee
, Qiongqiong Wang, Takafumi Koshinaka:
The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA. ICASSP 2019: 5821-5825 - [c94]Ville Vestman, Kong Aik Lee
, Tomi H. Kinnunen, Takafumi Koshinaka:
Unleashing the Unused Potential of i-Vectors Enabled by GPU Acceleration. INTERSPEECH 2019: 351-355 - [c93]Hitoshi Yamamoto, Kong Aik Lee
, Koji Okabe, Takafumi Koshinaka:
Speaker Augmentation and Bandwidth Extension for Deep Speaker Embedding. INTERSPEECH 2019: 406-410 - [c92]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch
, Junichi Yamagishi, Nicholas W. D. Evans, Tomi H. Kinnunen, Kong Aik Lee
:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. INTERSPEECH 2019: 1008-1012 - [c91]Kong Aik Lee
, Ville Hautamäki
, Tomi H. Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das
, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Massimiliano Todisco:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. INTERSPEECH 2019: 1497-1501 - [c90]Kong Aik Lee
, Hitoshi Yamamoto, Koji Okabe, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda:
The NEC-TT 2018 Speaker Verification System. INTERSPEECH 2019: 4355-4359 - [p1]Md. Sahidullah
, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee
:
Introduction to Voice Presentation Attack Detection and Recent Advances. Handbook of Biometric Anti-Spoofing, 2nd Ed. 2019: 321-361 - [i12]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Kong-Aik Lee:
Introduction to Voice Presentation Attack Detection and Recent Advances. CoRR abs/1901.01085 (2019) - [i11]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. CoRR abs/1904.05441 (2019) - [i10]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i9]Ville Vestman, Kong Aik Lee, Tomi H. Kinnunen, Takafumi Koshinaka:
Unleashing the Unused Potential of I-Vectors Enabled by GPU Acceleration. CoRR abs/1906.08556 (2019) - [i8]Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Kong Aik Lee:
Voice Biometrics Security: Extrapolating False Alarm Rate via Hierarchical Bayesian Modeling of Speaker Verification Scores. CoRR abs/1911.01182 (2019) - [i7]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - [i6]Paola García, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux
, Kong Aik Lee, Najim Dehak:
Speaker detection in the wild: Lessons learned from JSALT 2019. CoRR abs/1912.00938 (2019) - [i5]Hossein Zeinali, Kong Aik Lee, Jahangir Alam, Lukás Burget:
Short-duration Speaker Verification (SdSV) Challenge 2020: the Challenge Evaluation Plan. CoRR abs/1912.06311 (2019) - 2018
- [j18]Jianbo Ma
, Vidhyasaharan Sethu
, Eliathamby Ambikairajah
, Kong-Aik Lee
:
Generalized Variability Model for Speaker Verification. IEEE Signal Process. Lett. 25(12): 1775-1779 (2018) - [j17]Longting Xu
, Kong-Aik Lee
, Haizhou Li
, Zhen Yang:
Generalizing I-Vector Estimation for Rapid Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(4): 749-759 (2018) - [c89]Yanping Li, Kong-Aik Lee, Yougen Yuan, Haizhou Li
, Zhen Yang:
Many-to-Many Voice Conversion based on Bottleneck Features with Variational Autoencoder for Non-parallel Training Data. APSIPA 2018: 829-833 - [c88]Ivan Kukanov, Ville Hautamäki
, Kong-Aik Lee
:
Maximal Figure-of-Merit Embedding for Multi-Label Audio Classification. ICASSP 2018: 136-140 - [c87]Karthika Vijayan
, Haizhou Li
, Hanwu Sun, Kong-Aik Lee
:
On the Importance of Analytic Phase of Speech Signals in Spoken Language Recognition. ICASSP 2018: 5194-5198 - [c86]Jianbo Ma, Vidhyasaharan Sethu
, Eliathamby Ambikairajah
, Kong-Aik Lee
:
Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification. ICASSP 2018: 5264-5268 - [c85]Massimiliano Todisco, Héctor Delgado, Kong-Aik Lee
, Md. Sahidullah
, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi:
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion. INTERSPEECH 2018: 77-81 - [c84]Longting Xu, Kong-Aik Lee
, Haizhou Li
, Zhen Yang:
Co-whitening of I-vectors for Short and Long Duration Speaker Verification. INTERSPEECH 2018: 1066-1070 - [c83]Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Kong-Aik Lee, Junichi Yamagishi:
ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements. Odyssey 2018: 296-303 - [c82]Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. Odyssey 2018: 312-319 - [c81]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Hitoshi Yamamoto, Takafumi Koshinaka:
Attention Mechanism in Speaker Recognition: What Does it Learn in Deep Speaker Embedding? SLT 2018: 1052-1059 - [i4]Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. CoRR abs/1804.09618 (2018) - [i3]Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Hitoshi Yamamoto, Takafumi Koshinaka:
Attention Mechanism in Speaker Recognition: What Does It Learn in Deep Speaker Embedding? CoRR abs/1809.09311 (2018) - [i2]Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka:
The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA. CoRR abs/1812.10260 (2018) - 2017
- [j16]Aleksandr Sizov, Kong-Aik Lee
, Tomi Kinnunen:
Direct Optimization of the Detection Cost for I-Vector-Based Spoken Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 588-597 (2017) - [c80]Hanwu Sun, Kong-Aik Lee, Trung Hieu Nguyen, Bin Ma, Haizhou Li
:
I2R-NUS submission to oriental language recognition AP16-OL7 challenge. APSIPA 2017: 1574-1578 - [c79]Liping Chen, Kong-Aik Lee
, Bin Ma, Long Ma, Haizhou Li
, Li-Rong Dai:
Adaptation of PLDA for multi-source text-independent speaker verification. ICASSP 2017: 5380-5384 - [c78]