default search action

combined dblp search
author search
venue search
publication search

ask others

Haizhou Li 0001

李海洲

> Home > Persons

Person information

unicode name: 李海洲
affiliation: Chinese University of Hong Kong (Shenzhen), China
affiliation: National University of Singapore, Department of Electrical and Computer Engineering, Singapore
affiliation (2006 - 2016): Nanyang Technological University, Singapore
affiliation (2003 - 2016): Institute for Infocomm Research, A*STAR, Singapore
affiliation (2011): University of New South Wales, Sydney, Australia
affiliation (2009): University of Eastern Finland, Kuopio, Finland
affiliation (PhD 1990): South China University of Technology, Guangzhou, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[j210]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/CaiLLWWZSL26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/CaiLLWWZSL26
Siqi Cai, Zheyuan Lin, Xiaoli Liu, Wenjie Wei, Shuai Wang, Malu Zhang, Tanja Schultz, Haizhou Li:
Spiking neural networks for EEG signal analysis: From theory to practice. Neural Networks 194: 108127 (2026)
[j209]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/LiuJYHL26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LiuJYHL26
Rui Liu, Zhenqi Jia, Jie Yang, Yifan Hu, Haizhou Li:
Emphasis rendering for conversational text-to-speech with multi-modal multi-scale context modeling. Speech Commun. 178: 103353 (2026)
[j208]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/JiangCWQL26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/JiangCWQL26
Ziyang Jiang, Xueyan Chen, Shuai Wang, Xinyuan Qian, Haizhou Li:
TPEech: Target Speaker Extraction and Noise Suppression With Historical Dialogue Text Cues. IEEE Signal Process. Lett. 33: 351-355 (2026)
[j207]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/SongZCWL26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/SongZCWL26
Zeyang Song, Shimin Zhang, Yuhong Chou, Jibin Wu, Haizhou Li:
IML-Spikeformer: Input-Aware Multilevel Spiking Transformer for Speech Processing. IEEE Trans. Neural Networks Learn. Syst. 37(3): 1377-1389 (2026)
[c794]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/KeXYWJL26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/KeXYWJL26
Rui Ke, Jiahui Xu, Shenghao Yang, Kuang Wang, Feng Jiang, Haizhou Li:
CATCH: A Controllable Theme Detection Framework with Contextualized Clustering and Hierarchical Generation. AAAI 2026: 31419-31428
[i286]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2601-05564
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2601-05564
Zhixian Zhao, Shuiyuan Wang, Guojian Li, Hongfei Xue, Chengyou Wang, Shuai Wang, Longshuai Xiao, Zihan Zhang, Hui Bu, Xin Xu, Xinsheng Wang, Hexin Liu, Eng Siong Chng, Hung-yi Lee, Haizhou Li, Lei Xie:
The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era. CoRR abs/2601.05564 (2026)
[i285]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2601-22873
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2601-22873
Li Zhou, Hao Jiang, Junjie Li, Tianrui Wang, Haizhou Li:
EmoShift: Lightweight Activation Steering for Enhanced Emotion-Aware Speech Synthesis. CoRR abs/2601.22873 (2026)
[i284]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-10656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-10656
Jingru Lin, Chen Zhang, Tianrui Wang, Haizhou Li:
AudioRAG: A Challenging Benchmark for Audio Reasoning and Information Retrieval. CoRR abs/2602.10656 (2026)
[i283]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-19166
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-19166
Qibing Bai, Shuhao Shi, Shuai Wang, Yukai Ju, Yannan Wang, Haizhou Li:
CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data. CoRR abs/2602.19166 (2026)
[i282]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-20548
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-20548
Shuai Wang, Malu Zhang, Yulin Jiang, Dehao Zhang, Ammar Belatreche, Yu Liang, Yimeng Shan, Zijian Zhou, Yang Yang, Haizhou Li:
Robust Spiking Neural Networks Against Adversarial Attacks. CoRR abs/2602.20548 (2026)
[i281]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-23266
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-23266
Siyuan Liu, Jiahui Xu, Feng Jiang, Kuang Wang, Zefeng Zhao, Chu-Ren Huang, Jinghang Gu, Changqing Yin, Haizhou Li:
Discourse-Aware Dual-Track Streaming Response for Low-Latency Spoken Dialogue Systems. CoRR abs/2602.23266 (2026)
2025
[j206]
- view
  authority control:
- export record
  dblp key:
  - journals/inffus/LiuJBL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/inffus/LiuJBL25
Rui Liu, Zhenqi Jia, Feilong Bao, Haizhou Li:
Retrieval-Augmented Dialogue Knowledge Aggregation for expressive conversational speech synthesis. Inf. Fusion 118: 102948 (2025)
[j205]
- view
  authority control:
- export record
  dblp key:
  - journals/inffus/LiuYGL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/inffus/LiuYGL25
Rui Liu, Hongyu Yuan, Guanglai Gao, Haizhou Li:
Listening and seeing again: Generative error correction for audio-visual speech recognition. Inf. Fusion 120: 103077 (2025)
[j204]
- view
  authority control:
- export record
  dblp key:
  - journals/inffus/LiuZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/inffus/LiuZL25
Rui Liu, Jinhua Zhang, Haizhou Li:
Hierarchical multi-source cues fusion for mono-to-binaural based Audio Deepfake Detection. Inf. Fusion 120: 103097 (2025)
[j203]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/QianGZZLGL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/QianGZZLGL25
Xinyuan Qian, Jiaran Gao, Yaodan Zhang, Qiquan Zhang, Hexin Liu, Leibny Paola García-Perera, Haizhou Li:
SAV-SE: Scene-Aware Audio-Visual Speech Enhancement With Selective State Space Model. IEEE J. Sel. Top. Signal Process. 19(4): 623-634 (2025)
[j202]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/WuCWWMWML25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/WuCWWMWML25
Wenxuan Wu, Xueyuan Chen, Shuai Wang, Jiadong Wang, Lingwei Meng, Xixin Wu, Helen Meng, Haizhou Li:
$C^{2}$AV-TSE: Context and Confidence-Aware Audio Visual Target Speaker Extraction. IEEE J. Sel. Top. Signal Process. 19(4): 646-657 (2025)
[j201]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/pami/GraumanWBCCFGHJKLLMNRRR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/GraumanWBCCFGHJKLLMNRRR25
Kristen Grauman, Andrew Westbury, Eugene Byrne, Vincent Cartillier, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Devansh Kukreja, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran K. Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3,600 Hours of Egocentric Video. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 9468-9509 (2025)
[j200]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/QianYWZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/QianYWZL25
Xinyuan Qian, Xianghu Yue, Jiadong Wang, Huiping Zhuang, Haizhou Li:
Analytic Class Incremental Learning for Sound Source Localization With Privacy Protection. IEEE Signal Process. Lett. 32: 726-730 (2025)
[j199]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/MaWLL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/MaWLL25
Yi Ma, Shuai Wang, Tianchi Liu, Haizhou Li:
ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification. IEEE Signal Process. Lett. 32: 731-735 (2025)
[j198]
- view
  authority control:
- export record
  dblp key:
  - journals/taffco/SunZLL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/SunZLL25
Qiyuan Sun, Haolin Zuo, Rui Liu, Haizhou Li:
Connecting Cross-Modal Representations for Compact and Robust Multimodal Sentiment Analysis With Sentiment Word Substitution Error. IEEE Trans. Affect. Comput. 16(3): 1265-1276 (2025)
[j197]
- view
  authority control:
- export record
  dblp key:
  - journals/taffco/InoueZWL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/InoueZWL25
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Control of Emotion Rendering in Speech Synthesis. IEEE Trans. Affect. Comput. 16(4): 3316-3328 (2025)
[j196]
- view
  authority control:
- export record
  dblp key:
  - journals/tc/LiuWWYPL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tc/LiuWWYPL25
Qianhui Liu, Jiadong Wang, Yang Wang, Xin Yang, Gang Pan, Haizhou Li:
Human-Inspired Computing for Robust and Efficient Audio-Visual Speech Recognition. IEEE Trans. Computers 74(9): 2950-2961 (2025)
[j195]
- view
  authority control:
- export record
  dblp key:
  - journals/tce/CaiZZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tce/CaiZZL25
Siqi Cai, Ran Zhang, Hongxu Zhu, Haizhou Li:
Modeling the Temporal Dynamics of EEG Signals in Selective Listening. IEEE Trans. Consumer Electron. 71(1): 1115-1124 (2025)
[j194]
- view
  authority control:
- export record
  dblp key:
  - journals/tdsc/TianQLZV25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tdsc/TianQLZV25
Hui Tian, Yiqin Qiu, Haizhou Li, Xinpeng Zhang, Athanasios V. Vasilakos:
Universal Low Bit-Rate Speech Steganalysis Integrating Domain-Specific and Domain-Shared Knowledge. IEEE Trans. Dependable Secur. Comput. 22(5): 5382-5396 (2025)
[j193]
- view
  authority control:
- export record
  dblp key:
  - journals/tifs/LiuTDLL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tifs/LiuTDLL25
Tianchi Liu, Duc-Tuan Truong, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-Spoofing. IEEE Trans. Inf. Forensics Secur. 20: 12005-12018 (2025)
[j192]
- view
  authority control:
- export record
  dblp key:
  - journals/tip/ZhangZWLYLY25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tip/ZhangZWLYLY25
Jiqing Zhang, Malu Zhang, Yuanchen Wang, Qianhui Liu, Baocai Yin, Haizhou Li, Xin Yang:
Spiking Neural Networks With Adaptive Membrane Time Constant for Event-Based Tracking. IEEE Trans. Image Process. 34: 1009-1021 (2025)
[j191]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/TaoQDGWL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/TaoQDGWL25
Ruijie Tao, Xinyuan Qian, Rohan Kumar Das, Xiaoxue Gao, Jiadong Wang, Haizhou Li:
Enhancing Real-World Active Speaker Detection With Multi-Modal Extraction Pre-Training. IEEE Trans. Multim. 27: 2362-2373 (2025)
[j190]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/JiLGL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/JiLGL25
Ruihang Ji, Dongyu Li, Shuzhi Sam Ge, Haizhou Li:
Tunnel Prescribed Control of Nonlinear Systems With Unknown Control Directions. IEEE Trans. Neural Networks Learn. Syst. 36(1): 1383-1395 (2025)
[j189]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/ZhangLWBCYL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/ZhangLWBCYL25
Malu Zhang, Xiaoling Luo, Jibin Wu, Ammar Belatreche, Siqi Cai, Yang Yang, Haizhou Li:
Toward Building Human-Like Sequential Memory Using Brain-Inspired Spiking Neural Models. IEEE Trans. Neural Networks Learn. Syst. 36(6): 10143-10155 (2025)
[j188]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/XiaoJWZHL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/XiaoJWZHL25
Yan Xiao, Yaochu Jin, Bin Wang, Yan Zhang, Kuangrong Hao, Haizhou Li:
Zero-Shot Relation Classification Through Inference on Category Attributes. IEEE Trans. Neural Networks Learn. Syst. 36(7): 13135-13148 (2025)
[c793]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/0008HH025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/0008HH025
Rui Liu, Shuwei He, Yifan Hu, Haizhou Li:
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech. AAAI 2025: 24632-24640
[c792]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/YangWCYTGXZZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YangWCYTGXZZL25
Chenyu Yang, Shuai Wang, Hangting Chen, Jianwei Yu, Wei Tan, Rongzhi Gu, Yaoxun Xu, Yizhi Zhou, Haina Zhu, Haizhou Li:
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor. AAAI 2025: 25597-25605
[c791]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/0020C0TGT025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/0020C0TGT025
Chen Zhang, Dading Chong, Feng Jiang, Chengguang Tang, Anningzhe Gao, Guohua Tang, Haizhou Li:
Aligning Language Models Using Follow-up Likelihood as Reward Signal. AAAI 2025: 25832-25841
[c790]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/Hu0R0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Hu0R0025
Yifan Hu, Rui Liu, Yi Ren, Xiang Yin, Haizhou Li:
Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis. ACL (Findings) 2025: 1988-2003
[c789]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZhuHLLTAA0HWYCM25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhuHLLTAA0HWYCM25
Jianqing Zhu, Huang Huang, Zhihang Lin, Juhao Liang, Zhengyang Tang, Khalid Almubarak, Mosen Alharthi, Bang An, Juncai He, Xiangbo Wu, Fei Yu, Junying Chen, Zhuoheng Ma, Yuhao Du, He Zhang, Saied Alshahrani, Emad A. Alghamdi, Lian Zhang, Ruoyu Sun, Haizhou Li, Benyou Wang, Jinchao Xu:
Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion. ACL (1) 2025: 2025-2042
[c788]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZhangLBZW025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangLBZW025
Yuhao Zhang, Zhiheng Liu, Fan Bu, Ruiyu Zhang, Benyou Wang, Haizhou Li:
Soundwave: Less is More for Speech-Text Alignment in LLMs. ACL (1) 2025: 18718-18738
[c787]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/JiangCJXWZYZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/JiangCJXWZYZ025
Yidi Jiang, Qian Chen, Shengpeng Ji, Yu Xi, Wen Wang, Chong Zhang, Xianghu Yue, Shiliang Zhang, Haizhou Li:
UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook. ACL (1) 2025: 19112-19124
[c786]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/WangL000025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangL000025
Kuang Wang, Xianfei Li, Shenghao Yang, Li Zhou, Feng Jiang, Haizhou Li:
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles. ACL (1) 2025: 21082-21107
[c785]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/0004TWJSZL025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/0004TWJSZL025
Tianchi Liu, Ruijie Tao, Qiongqiong Wang, Yidi Jiang, Hardik B. Sailor, Ke Zhang, Jingru Lin, Haizhou Li:
Interpolating Speaker Identities in Embedding Space for Data Expansion. APSIPA 2025: 589-594
[c784]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YildirimTWA025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YildirimTWA025
Mehmet Sinan Yildirim, Ruijie Tao, Wupeng Wang, Junyi Ao, Haizhou Li:
Leveraging Language Information for Target Language Extraction. APSIPA 2025: 837-842
[c783]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/Xian0S025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Xian0S025
Huhong Xian, Rui Liu, Berrak Sisman, Haizhou Li:
NE-PADD: Leveraging Named Entity Knowledge for Robust Partial Audio Deepfake Detection via Attention Aggregation. APSIPA 2025: 2199-2204
[c782]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TaoSJ0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TaoSJ0025
Ruijie Tao, Zhan Shi, Yidi Jiang, Tianchi Liu, Haizhou Li:
Voice Conversion Augmentation for Speaker Recognition on Defective Datasets. APSIPA 2025: 2529-2534
[c781]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ChenLL0DT025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ChenLL0DT025
Liping Chen, Kong-Aik Lee, Zhen-Hua Ling, Xin Wang, Rohan Kumar Das, Tomoki Toda, Haizhou Li:
Speaker Privacy and Security in the Big Data Era: Protection and Defense Against Deepfake. APSIPA 2025: 2570-2575
[c780]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/JiaLSL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/JiaLSL25
Zhenqi Jia, Rui Liu, Berrak Sisman, Haizhou Li:
Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis. EMNLP 2025: 8852-8858
[c779]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/ChenCLJWHRGLXR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChenCLJWHRGLXR25
Simin Chen, Yiming Chen, Zexin Li, Yifan Jiang, Zhongwei Wan, Yixin He, Dezhi Ran, Tianle Gu, Haizhou Li, Tao Xie, Baishakhi Ray:
Benchmarking Large Language Models Under Data Contamination: A Survey from Static to Dynamic Evaluation. EMNLP 2025: 10080-10098
[c778]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/DaiZWL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/DaiZWL25
Xunlian Dai, Li Zhou, Benyou Wang, Haizhou Li:
From Word to World: Evaluate and Mitigate Culture Bias in LLMs via Word Association Test. EMNLP 2025: 24510-24526
[c777]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhouYXCLL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhouYXCLL25
Li Zhou, Lutong Yu, Dongchu Xie, Shaohuan Cheng, Wenyan Li, Haizhou Li:
Hanfu-Bench: A Multimodal Benchmark on Cross-Temporal Cultural Understanding and Transcreation. EMNLP 2025: 24616-24638
[c776]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0008XL0SZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0008XL0SZ25
Rui Liu, Xiaofen Xing, Zheng Lian, Haizhou Li, Björn W. Schuller, Haolin Zuo:
MEIJU - The 1st Multimodal Emotion and Intent Joint Understanding Challenge. ICASSP 2025: 1-2
[c775]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BorsdorfPH0S25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BorsdorfPH0S25
Marvin Borsdorf, Zexu Pan, Pascal Himmelmann, Haizhou Li, Tanja Schultz:
Speech Separation for Low-Resource Languages. ICASSP 2025: 1-5
[c774]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InoueWW0B025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InoueWW0B025
Sho Inoue, Shuai Wang, Wanxing Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion. ICASSP 2025: 1-5
[c773]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuW0B025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuW0B025
Zhijun Liu, Shuai Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
E1 TTS: Simple and Fast Non-Autoregressive TTS. ICASSP 2025: 1-5
[c772]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PahujaICSS025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PahujaICSS025
Saurav Pahuja, Gabriel Ivucic, Siqi Cai, Dashanka De Silva, Tanja Schultz, Haizhou Li:
ATGnet: Adaptive Temporal Graph Network for EEG-enabled Sound Source Tracking in Cocktail Party Scenarios. ICASSP 2025: 1-5
[c771]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangL0WWW025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangL0WWW025
Ke Zhang, Junjie Li, Shuai Wang, Yangjie Wei, Yi Wang, Yannan Wang, Haizhou Li:
Multi-Level Speaker Representation for Target Speaker Extraction. ICASSP 2025: 1-5
[c770]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/QiuZZWCG0SY025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/QiuZZWCG0SY025
Xuerui Qiu, Malu Zhang, Jieyuan Zhang, Wenjie Wei, Honglin Cao, Junsheng Guo, Rui-Jie Zhu, Yimeng Shan, Yang Yang, Haizhou Li:
Quantized Spike-driven Transformer. ICLR 2025
[c769]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/LiZWLML25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/LiZWLML25
Junjie Li, Ke Zhang, Shuai Wang, Kong Aik Lee, Man-Wai Mak, Haizhou Li:
MoMuSE: Momentum Multi-modal Target Speaker Extraction for Real-time Scenarios with Impaired Visual Cues. ICME 2025: 1-6
[c768]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/Cao0WLBZZY025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Cao0WLBZZY025
Honglin Cao, Zijian Zhou, Wenjie Wei, Yu Liang, Ammar Belatreche, Dehao Zhang, Malu Zhang, Yang Yang, Haizhou Li:
Binary Event-Driven Spiking Transformer. IJCAI 2025: 4110-4118
[c767]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008GXSB025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008GXSB025
Rui Liu, Pu Gao, Jiatian Xi, Berrak Sisman, Carlos Busso, Haizhou Li:
Towards Emotionally Consistent Text-Based Speech Editing: Introducing EmoCorrector and The ECD-TSE Dataset. INTERSPEECH 2025
[c766]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BaiIWJWL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BaiIWJWL25
Qibing Bai, Sho Inoue, Shuai Wang, Zhongjie Jiang, Yannan Wang, Haizhou Li:
Accent Normalization Using Self-Supervised Discrete Tokens with Non-Parallel Data. INTERSPEECH 2025
[c765]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Inoue0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Inoue0025
Sho Inoue, Shuai Wang, Haizhou Li:
PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs. INTERSPEECH 2025
[c764]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiWHZW025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiWHZW025
Shaole Li, Shuai Wang, Jiangyu Han, Ke Zhang, Wupeng Wang, Haizhou Li:
REAL-T: Real Conversational Mixtures for Target Speaker Extraction. INTERSPEECH 2025
[c763]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiWLJWL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiWLJWL25
Sirui Li, Shuai Wang, Zhijun Liu, Zhongjie Jiang, Yannan Wang, Haizhou Li:
SpeechRefiner: Towards Perceptual Quality Refinement for Front-End Algorithms. INTERSPEECH 2025
[c762]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Lin0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Lin0025
Zheyuan Lin, Siqi Cai, Haizhou Li:
Decoding Listener's Identity: Person Identification from EEG Signals Using a Lightweight Spiking Transformer. INTERSPEECH 2025
[c761]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PahujaI0S0S25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PahujaI0S0S25
Saurav Pahuja, Gabriel Ivucic, Siqi Cai, Dashanka De Silva, Haizhou Li, Tanja Schultz:
GTAnet: Geometry-Guided Temporal Attention for EEG-Based Sound Source Tracking in Cocktail Party Scenarios. INTERSPEECH 2025
[c760]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Silva0PS025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Silva0PS025
Dashanka De Silva, Siqi Cai, Saurav Pahuja, Tanja Schultz, Haizhou Li:
NeuroSpex+: Dual-Task Training of Neuro-Guided Speaker Extraction with Speech Envelope and Waveform. INTERSPEECH 2025
[c759]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuWWM025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuWWM025
Wenxuan Wu, Shuai Wang, Xixin Wu, Helen Meng, Haizhou Li:
Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction. INTERSPEECH 2025
[c758]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangCWZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangCWZ025
Chenyu Yang, Hangting Chen, Shuai Wang, Haina Zhu, Haizhou Li:
TVC-MusicGen: Time-Varying Structure Control for Background Music Generation via Self-Supervised Training. INTERSPEECH 2025
[c757]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZLWL0G025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZLWL0G025
Xueyi Zhang, Peiyin Zhu, Yuan Liao, Xiyu Wang, Mingrui Lao, Siqi Cai, Yanming Guo, Haizhou Li:
TrustCLIP: Learning from Noisy Labels via Semantic Label Verification and Trust-aligned Gradient Projection. ACM Multimedia 2025: 4388-4397
[c756]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangSZYXCL025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangSZYXCL025
Xueyi Zhang, Jialu Sun, Chengwei Zhang, Xianghu Yue, Tianfang Xiao, Siqi Cai, Mingrui Lao, Haizhou Li:
EventLip: Enhancing Event-Based Lip Reading via Frequency-Aware Spatiotemporal Hypergraph Modeling. ACM Multimedia 2025: 8263-8272
[c755]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/000400Y025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/000400Y025
Yifan Hu, Rui Liu, Yi Ren, Xiang Yin, Haizhou Li:
UniTalker: Conversational Speech-Visual Synthesis. ACM Multimedia 2025: 10248-10257
[c754]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/LiDHKL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/LiDHKL25
Chuang Li, Yang Deng, Hengchang Hu, Min-Yen Kan, Haizhou Li:
ChatCRS: Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems. NAACL (Findings) 2025: 295-312
[c753]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/LiuKLJL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/LiuKLJL25
Ziche Liu, Rui Ke, Yajiao Liu, Feng Jiang, Haizhou Li:
Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models. NAACL (Long Papers) 2025: 6595-6611
[c752]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/ZhouKLGCCLH25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/ZhouKLGCCLH25
Li Zhou, Taelin Karidi, Wanlong Liu, Nicolas Garneau, Yong Cao, Wenyu Chen, Haizhou Li, Daniel Hershcovich:
Does Mapo Tofu Contain Coffee? Probing LLMs for Food-related Cultural Knowledge. NAACL (Long Papers) 2025: 9840-9867
[c751]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/LaoLGZ0D025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/LaoLGZ0D025
Mingrui Lao, Zheng Li, Yanming Guo, Xueyi Zhang, Siqi Cai, Zhaoyun Ding, Haizhou Li:
Boosting Discriminability for Robust Multimodal Entity Linking with Visual Modality Missing. SIGIR 2025: 989-999
[c750]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/ZhangCSLZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/ZhangCSLZL25
Qiquan Zhang, Moran Chen, Zeyang Song, Hexin Liu, Xiangyu Zhang, Haizhou Li:
Long-Context Modeling Networks for Monaural Speech Enhancement: A Comparative Study. WASPAA 2025: 1-5
[c749]
- view
  authority control:
- export record
  dblp key:
  - conf/www/ChengZJW025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/ChengZJW025
Zihao Cheng, Li Zhou, Feng Jiang, Benyou Wang, Haizhou Li:
Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement. WWW 2025: 2677-2688
[e25]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/2024innobiz
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/2024innobiz
Haizhou Li, Tanja Schultz, Yalei Bi, Jian Zhu, Hongsheng He, Jun Ma, Siqi Cai, Wanyue Jiang, Shuzhi Sam Ge:
Social Robotics - 16th International Conference, ICSR + InnoBiz 2024, Shenzhen, China, September 25-28, 2024, Proceedings. Lecture Notes in Computer Science 15170, Springer 2025, ISBN 978-981-96-1150-8 [contents]
[i280]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-04038
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-04038
Rui Liu, Hongyu Yuan, Haizhou Li:
Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition. CoRR abs/2501.04038 (2025)
[i279]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-05729
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-05729
Yi Ma, Shuai Wang, Tianchi Liu, Haizhou Li:
ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification. CoRR abs/2501.05729 (2025)
[i278]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-05904
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-05904
Honglin Cao, Zijian Zhou, Wenjie Wei, Ammar Belatreche, Yu Liang, Dehao Zhang, Malu Zhang, Yang Yang, Haizhou Li:
Binary Event-Driven Spiking Transformer. CoRR abs/2501.05904 (2025)
[i277]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-06467
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-06467
Rui Liu, Zhenqi Jia, Feilong Bao, Haizhou Li:
Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech Synthesis. CoRR abs/2501.06467 (2025)
[i276]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-09352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-09352
Xianghu Yue, Yiming Chen, Xueyi Zhang, Xiaoxue Gao, Mengling Feng, Mingrui Lao, Huiping Zhuang, Haizhou Li:
PAL: Prompting Analytic Learning with Missing Modality for Multi-Modal Class-Incremental Learning. CoRR abs/2501.09352 (2025)
[i275]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-13492
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-13492
Xuerui Qiu, Jieyuan Zhang, Wenjie Wei, Honglin Cao, Junsheng Guo, Rui-Jie Zhu, Yimeng Shan, Yang Yang, Malu Zhang, Haizhou Li:
Quantized Spike-driven Transformer. CoRR abs/2501.13492 (2025)
[i274]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-03260
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-03260
Qiquan Zhang, Buddhi Wickramasinghe, Eliathamby Ambikairajah, Vidhyasaharan Sethu, Haizhou Li:
Should Audio Front-ends be Adaptive? Comparing Learnable and Adaptive Front-ends. CoRR abs/2502.03260 (2025)
[i273]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-11193
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-11193
Li Zhou, Ruijie Zhang, Xunlian Dai, Daniel Hershcovich, Haizhou Li:
Large Language Models Penetration in Scholarly Writing and Peer Review. CoRR abs/2502.11193 (2025)
[i272]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-12900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-12900
Yuhao Zhang, Zhiheng Liu, Fan Bu, Ruiyu Zhang, Benyou Wang, Haizhou Li:
Soundwave: Less is More for Speech-Text Alignment in LLMs. CoRR abs/2502.12900 (2025)
[i271]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-17521
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-17521
Simin Chen, Yiming Chen, Zexin Li, Yifan Jiang, Zhongwei Wan, Yixin He, Dezhi Ran, Tianle Gu, Haizhou Li, Tao Xie, Baishakhi Ray:
Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation. CoRR abs/2502.17521 (2025)
[i270]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-18968
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-18968
Kuang Wang, Xianfei Li, Shenghao Yang, Li Zhou, Feng Jiang, Haizhou Li:
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles. CoRR abs/2502.18968 (2025)
[i269]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-20067
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-20067
Yidi Jiang, Qian Chen, Shengpeng Ji, Yu Xi, Wen Wang, Chong Zhang, Xianghu Yue, Shiliang Zhang, Haizhou Li:
UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook. CoRR abs/2502.20067 (2025)
[i268]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-05085
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-05085
Feng Jiang, Zhiyu Lin, Fan Bu, Yuhao Du, Benyou Wang, Haizhou Li:
S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information. CoRR abs/2503.05085 (2025)
[i267]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-12589
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-12589
Wupeng Wang, Zexu Pan, Jingru Lin, Shuai Wang, Haizhou Li:
Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation. CoRR abs/2503.12589 (2025)
[i266]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-15338
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-15338
Junyi Ao, Dekun Chen, Xiaohai Tian, Wenjie Feng, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context. CoRR abs/2503.15338 (2025)
[i265]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-00750
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-00750
Wenxuan Wu, Xueyuan Chen, Shuai Wang, Jiadong Wang, Lingwei Meng, Xixin Wu, Helen Meng, Haizhou Li:
C²/AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction. CoRR abs/2504.00750 (2025)
[i264]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-02302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-02302
Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation. CoRR abs/2504.02302 (2025)
[i263]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-05657
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-05657
Tianchi Liu, Duc-Tuan Truong, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing. CoRR abs/2504.05657 (2025)
[i262]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-05794
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-05794
Renjie Li, Wenjie Wei, Qi Xin, Xiaoli Liu, Sixuan Mao, Erik Ma, Zijian Chen, Malu Zhang, Haizhou Li, Zhaoyu Zhang:
What Is Next for LLMs? Next-Generation AI Computing Hardware Using Photonic Chips. CoRR abs/2505.05794 (2025)
[i261]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-12597
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-12597
Yifan Hu, Rui Liu, Yi Ren, Xiang Yin, Haizhou Li:
Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis. CoRR abs/2505.12597 (2025)
[i260]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-14356
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-14356
Sho Inoue, Shuai Wang, Haizhou Li:
PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs. CoRR abs/2505.14356 (2025)
[i259]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-18562
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-18562
Xunlian Dai, Li Zhou, Benyou Wang, Haizhou Li:
From Word to World: Evaluate and Mitigate Culture Bias via Word Association Test. CoRR abs/2505.18562 (2025)
[i258]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-20341
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-20341
Rui Liu, Pu Gao, Jiatian Xi, Berrak Sisman, Carlos Busso, Haizhou Li:
Towards Emotionally Consistent Text-Based Speech Editing: Introducing EmoCorrector and The ECD-TSE Dataset. CoRR abs/2505.20341 (2025)
[i257]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-01565
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-01565
Li Zhou, Lutong Yu, Dongchu Xie, Shaohuan Cheng, Wenyan Li, Haizhou Li:
Hanfu-Bench: A Multimodal Benchmark on Cross-Temporal Cultural Understanding and Transcreation. CoRR abs/2506.01565 (2025)
[i256]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-07634
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-07634
Chenyu Yang, Shuai Wang, Hangting Chen, Wei Tan, Jianwei Yu, Haizhou Li:
SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement. CoRR abs/2506.07634 (2025)
[i255]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-09792
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-09792
Wenxuan Wu, Shuai Wang, Xixin Wu, Helen Meng, Haizhou Li:
Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction. CoRR abs/2506.09792 (2025)
[i254]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-13709
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-13709
Sirui Li, Shuai Wang, Zhijun Liu, Zhongjie Jiang, Yannan Wang, Haizhou Li:
SpeechRefiner: Towards Perceptual Quality Refinement for Front-End Algorithms. CoRR abs/2506.13709 (2025)
[i253]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-21682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-21682
Li Zhou, Hao Jiang, Junjie Li, Zefeng Zhao, Feng Jiang, Wenyu Chen, Haizhou Li:
Do We Really Need GNNs with Explicit Structural Modeling? MLPs Suffice for Language Model Representations. CoRR abs/2506.21682 (2025)
[i252]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-04598
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-04598
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Multi-Step Prediction and Control of Hierarchical Emotion Distribution in Text-to-Speech Synthesis. CoRR abs/2507.04598 (2025)
[i251]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-07384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-07384
Yu Chen, Xinyuan Qian, Hongxu Zhu, Jiadong Wang, Kainan Chen, Haizhou Li:
VP-SelDoA: Visual-prompted Selective DoA Estimation of Target Sound via Semantic-Spatial Matching. CoRR abs/2507.07384 (2025)
[i250]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-07396
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-07396
Zeyang Song, Shimin Zhang, Yuhong Chou, Jibin Wu, Haizhou Li:
IML-Spikeformer: Input-aware Multi-Level Spiking Transformer for Speech Processing. CoRR abs/2507.07396 (2025)
[i249]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-15294
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-15294
Junjie Li, Wenxuan Wu, Shuai Wang, Zexu Pan, Kong Aik Lee, Helen Meng, Haizhou Li:
MeMo: Attentional Momentum for Real-time Audio-visual Speaker Extraction under Impaired Visual Conditions. CoRR abs/2507.15294 (2025)
[i248]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-17735
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-17735
Qibing Bai, Sho Inoue, Shuai Wang, Zhongjie Jiang, Yannan Wang, Haizhou Li:
Accent Normalization Using Self-Supervised Discrete Tokens with Non-Parallel Data. CoRR abs/2507.17735 (2025)
[i247]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-13889
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-13889
Chuang Li, Yang Deng, Hengchang Hu, See-Kiong Ng, Min-Yen Kan, Haizhou Li:
CARE: Contextual Adaptation of Recommenders for LLM-based Conversational Recommendation. CoRR abs/2508.13889 (2025)
[i246]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-14706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-14706
Junying Chen, Zhenyang Cai, Zhiheng Liu, Yunjin Yang, Rongsheng Wang, Qingying Xiao, Xiangyi Feng, Zhan Su, Jing Guo, Xiang Wan, Guangjun Yu, Haizhou Li, Benyou Wang:
ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine. CoRR abs/2508.14706 (2025)
[i245]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-19210
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-19210
Tianchi Liu, Ruijie Tao, Qiongqiong Wang, Yidi Jiang, Hardik B. Sailor, Ke Zhang, Jingru Lin, Haizhou Li:
Interpolating Speaker Identities in Embedding Space for Data Expansion. CoRR abs/2508.19210 (2025)
[i244]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-03829
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-03829
Huhong Xian, Rui Liu, Berrak Sisman, Haizhou Li:
NE-PADD: Leveraging Named Entity Knowledge for Robust Partial Audio Deepfake Detection via Attention Aggregation. CoRR abs/2509.03829 (2025)
[i243]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-06074
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-06074
Zhenqi Jia, Rui Liu, Berrak Sisman, Haizhou Li:
Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis. CoRR abs/2509.06074 (2025)
[i242]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-09174
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-09174
Yuhao Zhang, Yuhao Du, Zhanchen Dai, Xiangnan Ma, Kaiqi Kou, Benyou Wang, Haizhou Li:
EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs. CoRR abs/2509.09174 (2025)
[i241]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-14804
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-14804
Mingchen Shao, Bingshen Mu, Chengyou Wang, Haizhou Li, Ying Yan, Zhonghua Fu, Lei Xie:
Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages. CoRR abs/2509.14804 (2025)
[i240]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-17186
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-17186
Dehao Zhang, Malu Zhang, Shuai Wang, Jingya Wang, Wenjie Wei, Zeyu Ma, Guoyin Wang, Yang Yang, Haizhou Li:
Dendritic Resonate-and-Fire Neuron for Effective and Efficient Long Sequence Modeling. CoRR abs/2509.17186 (2025)
[i239]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-24266
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-24266
Wenjie Wei, Malu Zhang, Jieyuan Zhang, Ammar Belatreche, Shuai Wang, Yimeng Shan, Hanwen Liu, Honglin Cao, Guoqing Wang, Yang Yang, Haizhou Li:
S²NN: Sub-bit Spiking Neural Networks. CoRR abs/2509.24266 (2025)
[i238]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-24700
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-24700
Suli Wang, Yang-yang Li, Siqi Cai, Haizhou Li:
A Robust Multi-Scale Framework with Test-Time Adaptation for sEEG-Based Speech Decoding. CoRR abs/2509.24700 (2025)
[i237]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-00032
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-00032
Ziyi Zeng, Zhenyang Cai, Yixi Cai, Xidong Wang, Junying Chen, Rongsheng Wang, Yipeng Liu, Siqi Cai, Benyou Wang, Zhiguo Zhang, Haizhou Li:
WaveMind: Towards a Conversational EEG Foundation Model Aligned to Textual and Visual Modalities. CoRR abs/2510.00032 (2025)
[i236]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-13910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-13910
Jingru Lin, Chen Zhang, Stephen Y. Liu, Haizhou Li:
RAGCap-Bench: Benchmarking Capabilities of LLMs in Agentic Retrieval Augmented Generation Systems. CoRR abs/2510.13910 (2025)
[i235]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-17879
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-17879
Zheyuan Lin, Siqi Cai, Haizhou Li:
Decoding Listeners Identity: Person Identification from EEG Signals Using a Lightweight Spiking Transformer. CoRR abs/2510.17879 (2025)
[i234]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-18206
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-18206
Hanyu Meng, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Qiquan Zhang, Haizhou Li:
Adaptive Per-Channel Energy Normalization Front-end for Robust Audio Signal Processing. CoRR abs/2510.18206 (2025)
[i233]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-21403
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-21403
Jieyuan Zhang, Xiaolong Zhou, Shuai Wang, Wenjie Wei, Hanwen Liu, Qian Sun, Malu Zhang, Yang Yang, Haizhou Li:
Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks. CoRR abs/2510.21403 (2025)
[i232]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-22758
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-22758
Li Zhou, Lutong Yu, You Lyu, Yihang Lin, Zefeng Zhao, Junyi Ao, Yuhao Zhang, Benyou Wang, Haizhou Li:
EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models. CoRR abs/2510.22758 (2025)
[i231]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-01652
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-01652
Mehmet Sinan Yildirim, Ruijie Tao, Wupeng Wang, Junyi Ao, Haizhou Li:
Leveraging Language Information for Target Language Extraction. CoRR abs/2511.01652 (2025)
[i230]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-06288
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-06288
Wenxuan Wu, Shuai Wang, Xixin Wu, Helen Meng, Haizhou Li:
ELEGANCE: Efficient LLM Guidance for Audio-Visual Target Speech Extraction. CoRR abs/2511.06288 (2025)
[i229]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-02459
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-02459
Qianhui Liu, Jing Yang, Miao Yu, Trevor E. Carlson, Gang Pan, Haizhou Li, Zhumin Chen:
Efficient Eye-based Emotion Recognition via Neural Architecture Search of Time-to-First-Spike-Coded Spiking Neural Networks. CoRR abs/2512.02459 (2025)
[i228]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-21715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-21715
Rui Ke, Jiahui Xu, Shenghao Yang, Kuang Wang, Feng Jiang, Haizhou Li:
CATCH: A Controllable Theme Detection Framework with Contextualized Clustering and Hierarchical Generation. CoRR abs/2512.21715 (2025)
2024
[j187]
- view
  authority control:
- export record
  dblp key:
  - journals/isci/LiuGL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/isci/LiuGL24
Qianhui Liu, Meng Ge, Haizhou Li:
Intelligent event-based lip reading word classification with spiking neural networks using spatio-temporal attention features and triplet loss. Inf. Sci. 675: 120660 (2024)
[j186]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/YanLZFMLP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/YanLZFMLP24
Jiaqi Yan, Qianhui Liu, Malu Zhang, Lang Feng, De Ma, Haizhou Li, Gang Pan:
Efficient spiking neural network design via neural architecture search. Neural Networks 173: 106172 (2024)
[j185]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/ChenYWLT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/ChenYWLT24
Xinyi Chen, Qu Yang, Jibin Wu, Haizhou Li, Kay Chen Tan:
A Hybrid Neural Coding Approach for Pattern Recognition With Spiking Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 3064-3078 (2024)
[j184]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/WangCHWLZXDRSQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/WangCHWLZXDRSQL24
Shuai Wang, Zhengyang Chen, Bing Han, Hongji Wang, Chengdong Liang, Binbin Zhang, Xu Xiang, Wen Ding, Johan Rohdin, Anna Silnova, Yanmin Qian, Haizhou Li:
Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Commun. 162: 103104 (2024)
[j183]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/LinGWLF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/LinGWLF24
Jingru Lin, Meng Ge, Wupeng Wang, Haizhou Li, Mengling Feng:
Selective HuBERT: Self-Supervised Pre-Training for Target Speaker in Clean and Mixture Speech. IEEE Signal Process. Lett. 31: 1014-1018 (2024)
[j182]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/MaYAGL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/MaYAGL24
Duo Ma, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li:
Text-Guided HuBERT: Self-Supervised Speech Pre-Training via Generative Adversarial Networks. IEEE Signal Process. Lett. 31: 2055-2059 (2024)
[j181]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/GaoLCLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/GaoLCLL24
Xiaoxue Gao, Zexin Li, Yiming Chen, Cong Liu, Haizhou Li:
Transferable Adversarial Attacks Against ASR. IEEE Signal Process. Lett. 31: 2200-2204 (2024)
[j180]
- view
  authority control:
- export record
  dblp key:
  - journals/taffco/LiuZLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/LiuZLSL24
Rui Liu, Haolin Zuo, Zheng Lian, Björn W. Schuller, Haizhou Li:
Contrastive Learning Based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition With Missing Modalities. IEEE Trans. Affect. Comput. 15(4): 1856-1873 (2024)
[j179]
- view
  authority control:
- export record
  dblp key:
  - journals/tamd/YangZWTL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tamd/YangZWTL24
Qu Yang, Malu Zhang, Jibin Wu, Kay Chen Tan, Haizhou Li:
LC-TTFS: Toward Lossless Network Conversion for Spiking Neural Networks With TTFS Coding. IEEE Trans. Cogn. Dev. Syst. 16(5): 1626-1639 (2024)
[j178]
- view
  authority control:
- export record
  dblp key:
  - journals/tamd/CaiZZWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tamd/CaiZZWL24
Siqi Cai, Ran Zhang, Malu Zhang, Jibin Wu, Haizhou Li:
EEG-Based Auditory Attention Detection With Spiking Graph Convolutional Network. IEEE Trans. Cogn. Dev. Syst. 16(5): 1698-1706 (2024)
[j177]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/YoshinoCCKLHMFLZFZKLJPGHDGHSZLSDB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YoshinoCCKLHMFLZFZKLJPGHDGHSZLSDB24
Koichiro Yoshino, Yun-Nung Chen, Paul A. Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia, Seungwhan Moon, Zhengcong Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou, Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Dilek Hakkani-Tur, Babak Damavandi, Alborz Geramifard, Chiori Hori, Ankit Shah, Chen Zhang, Haizhou Li, João Sedoc, Luis F. D'Haro, Rafael E. Banchs, Alexander Rudnicky:
Overview of the Tenth Dialog System Technology Challenge: DSTC10. IEEE ACM Trans. Audio Speech Lang. Process. 32: 765-778 (2024)
[j176]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuLL24
Lei Liu, Li Liu, Haizhou Li:
Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1559-1572 (2024)
[j175]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhouZZWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhouZZWL24
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis With Limited Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1699-1711 (2024)
[j174]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuSGL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuSGL24
Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li:
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2188-2201 (2024)
[j173]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuLWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuLWL24
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2324-2337 (2024)
[j172]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SunTTLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SunTTLQ24
Congcong Sun, Hui Tian, Peng Tian, Haizhou Li, Zhenxing Qian:
Multi-Agent Deep Learning for the Detection of Multiple Speech Steganography Methods. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2957-2972 (2024)
[j171]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangZRZYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangZRZYL24
Mingyang Zhang, Yi Zhou, Yi Ren, Chen Zhang, Xiang Yin, Haizhou Li:
RefXVC: Cross-Lingual Voice Conversion With Enhanced Reference Leveraging. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4146-4156 (2024)
[j170]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangPLWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangPLWL24
Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation With Pretrained Frontend to Minimize Domain Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4184-4198 (2024)
[j169]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/PanBCSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PanBCSL24
Zexu Pan, Marvin Borsdorf, Siqi Cai, Tanja Schultz, Haizhou Li:
NeuroHeed: Neuro-Steered Speaker Extraction Using EEG Signals. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4456-4470 (2024)
[j168]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GuZXLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuZXLW24
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu:
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4569-4579 (2024)
[j167]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangCLQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangCLQL24
Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4971-4998 (2024)
[j166]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tbe/CaiSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tbe/CaiSL24
Siqi Cai, Tanja Schultz, Haizhou Li:
Brain Topology Modeling With EEG-Graphs for Auditory Spatial Attention Detection. IEEE Trans. Biomed. Eng. 71(1): 171-182 (2024)
[j165]
- view
  authority control:
- export record
  dblp key:
  - journals/tcsv/LiuWQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/LiuWQL24
Miao Liu, Jing Wang, Xinyuan Qian, Haizhou Li:
Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss. IEEE Trans. Circuits Syst. Video Technol. 34(8): 6937-6948 (2024)
[j164]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/WengZLLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/WengZLLL24
Zhenyu Weng, Huiping Zhuang, Fulin Luo, Haizhou Li, Zhiping Lin:
Few-Shot Contrastive Transfer Learning With Pretrained Model for Masked Face Verification. IEEE Trans. Multim. 26: 3871-3883 (2024)
[j163]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/QianXZTL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/QianXZTL24
Xinyuan Qian, Wei Xue, Qiquan Zhang, Ruijie Tao, Haizhou Li:
Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech. IEEE Trans. Multim. 26: 4480-4489 (2024)
[j162]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tnn/CaiLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/CaiLL24
Siqi Cai, Peiwen Li, Haizhou Li:
A Bio-Inspired Spiking Attentional Neural Network for Attentional Selection in the Listening Brain. IEEE Trans. Neural Networks Learn. Syst. 35(12): 17387-17397 (2024)
[j161]
- view
  authority control:
- export record
  dblp key:
  - journals/tsmc/JiGZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsmc/JiGZL24
Ruihang Ji, Shuzhi Sam Ge, Kai Zhao, Haizhou Li:
Event-Triggered Tracking Control for Nonlinear Systems With Prescribed Performance. IEEE Trans. Syst. Man Cybern. Syst. 54(6): 3547-3557 (2024)
[c748]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangYMW0T24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangYMW0T24
Shimin Zhang, Qu Yang, Chenxiang Ma, Jibin Wu, Haizhou Li, Kay Chen Tan:
TC-LIF: A Two-Compartment Spiking Neuron Model for Long-Term Sequential Modelling. AAAI 2024: 16838-16847
[c747]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiuH0Y024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiuH0Y024
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling. AAAI 2024: 18698-18706
[c746]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WangPZT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangPZT024
Jiadong Wang, Zexu Pan, Malu Zhang, Robby T. Tan, Haizhou Li:
Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition. AAAI 2024: 19144-19152
[c745]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangDCZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangDCZL24
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Malu Zhang, Haizhou Li:
A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators. AAAI 2024: 19515-19524
[c744]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ChenZLDT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChenZLDT024
Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li:
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models. ACL (Findings) 2024: 1359-1375
[c743]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/Inoue00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Inoue00024
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Fine-Grained Quantitative Emotion Editing for Speech Generation. APSIPA 2024: 1-6
[c742]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/JiangLCLZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/JiangLCLZ024
Feng Jiang, Weihao Liu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu, Haizhou Li:
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark. LREC/COLING 2024: 495-506
[c741]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/LuoZZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/LuoZZL24
Danqing Luo, Chen Zhang, Yan Zhang, Haizhou Li:
CrossTune: Black-Box Few-Shot Classification with Label Enhancement. LREC/COLING 2024: 4185-4197
[c740]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/FanJL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/FanJL024
Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li:
Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study. LREC/COLING 2024: 16998-17010
[c739]
- view
  authority control:
- export record
  dblp key:
  - conf/embc/IvucicPPC0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/embc/IvucicPPC0S24
Gabriel Ivucic, Saurav Pahuja, Felix Putze, Siqi Cai, Haizhou Li, Tanja Schultz:
The Impact of Cross-Validation Schemes for EEG-Based Auditory Attention Detection with Deep Neural Networks. EMBC 2024: 1-4
[c738]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/0020TCSTJ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/0020TCSTJ024
Chen Zhang, Chengguang Tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li:
TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models. EMNLP (Findings) 2024: 8926-8946
[c737]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ChenYG0DT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChenYG0DT024
Yiming Chen, Xianghu Yue, Xiaoxue Gao, Chen Zhang, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li:
Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models. EMNLP (Findings) 2024: 10917-10930
[c736]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/Pan0ZL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/Pan0ZL0024
Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li:
DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models. EMNLP 2024: 14686-14695
[c735]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangLLGS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangLLGS024
Qu Yang, Qianhui Liu, Nan Li, Meng Ge, Zeyang Song, Haizhou Li:
SVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks. ICASSP 2024: 221-225
[c734]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SongWZS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SongWZS024
Zeyang Song, Jibin Wu, Malu Zhang, Mike Zheng Shou, Haizhou Li:
Spiking-Leaf: A Learnable Auditory Front-End for Spiking Neural Networks. ICASSP 2024: 226-230
[c733]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangGZASN024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangGZASN024
Qiquan Zhang, Meng Ge, Hongxu Zhu, Eliathamby Ambikairajah, Qi Song, Zhaoheng Ni, Haizhou Li:
An Empirical Study on the Impact of Positional Encoding in Transformer-Based Monaural Speech Enhancement. ICASSP 2024: 1001-1005
[c732]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CaiZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaiZ024
Siqi Cai, Ran Zhang, Haizhou Li:
Robust Decoding of the Auditory Attention from EEG Recordings Through Graph Convolutional Networks. ICASSP 2024: 2320-2324
[c731]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenQPC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenQPC024
Yu Chen, Xinyuan Qian, Zexu Pan, Kainan Chen, Haizhou Li:
LOCSELECT: Target Speaker Localization with an Auditory Selective Hearing Mechanism. ICASSP 2024: 8696-8700
[c730]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Inoue0W024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Inoue0W024
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. ICASSP 2024: 10601-10605
[c729]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiTPGW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiTPGW024
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech. ICASSP 2024: 10666-10670
[c728]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangBLYCHQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangBLYCHQ024
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition. ICASSP 2024: 10901-10905
[c727]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiangCTDQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiangCTDQ024
Yidi Jiang, Zhengyang Chen, Ruijie Tao, Liqun Deng, Yanmin Qian, Haizhou Li:
Prompt-Driven Target Speech Diarization. ICASSP 2024: 11086-11090
[c726]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MaLHGL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaLHGL24
Yi Ma, Kong Aik Lee, Ville Hautamäki, Meng Ge, Haizhou Li:
Gradient Weighting for Speaker Verification in Extremely Low Signal-to-Noise Ratio. ICASSP 2024: 11311-11315
[c725]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/JiangYLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/JiangYLL24
Feng Jiang, Lingyi Yang, Yu Lu, Haizhou Li:
Tailored Domain-Specific Summaries: A Two-Stage Method Combining Extractive and Abstractive Summarization Models. ICONIP (9) 2024: 347-362
[c724]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/LiuYZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/LiuYZ0024
Qianhui Liu, Jiaqi Yan, Malu Zhang, Gang Pan, Haizhou Li:
LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization. IJCAI 2024: 3097-3105
[c723]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/WangMBWS00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/WangMBWS00024
Yang Wang, Haiyang Mei, Qirui Bao, Ziqi Wei, Mike Zheng Shou, Haizhou Li, Bo Dong, Xin Yang:
Apprenticeship-Inspired Elegance: Synergistic Knowledge Distillation Empowers Spiking Neural Networks for Efficient Single-Eye Emotion Recognition. IJCAI 2024: 3160-3168
[c722]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/WuCWLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/WuCWLM24
Wenxuan Wu, Xueyuan Chen, Xixin Wu, Haizhou Li, Helen Meng:
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy. IJCNN 2024: 1-8
[c721]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004ZDMT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004ZDMT024
Tianchi Liu, Lin Zhang, Rohan Kumar Das, Yi Ma, Ruijie Tao, Haizhou Li:
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio? INTERSPEECH 2024
[c720]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008X0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008X0024
Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li:
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency. INTERSPEECH 2024
[c719]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BorsdorfP0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BorsdorfP0S24
Marvin Borsdorf, Zexu Pan, Haizhou Li, Tanja Schultz:
wTIMIT2mix: A Cocktail Party Mixtures Database to Study Target Speaker Extraction for Normal and Whispered Speech. INTERSPEECH 2024
[c718]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EwertB0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EwertB0S24
Iva Ewert, Marvin Borsdorf, Haizhou Li, Tanja Schultz:
Does the Lombard Effect Matter in Speech Separation? Introducing the Lombard-GRID-2mix Dataset. INTERSPEECH 2024
[c717]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LinGAD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinGAD024
Jingru Lin, Meng Ge, Junyi Ao, Liqun Deng, Haizhou Li:
SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech. INTERSPEECH 2024
[c716]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LinHC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinHC024
Zijie Lin, Tianyu He, Siqi Cai, Haizhou Li:
ASA: An Auditory Spatial Attention Dataset with Multiple Speaking Locations. INTERSPEECH 2024
[c715]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PahujaIHCS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PahujaIHCS024
Saurav Pahuja, Gabriel Ivucic, Pascal Himmelmann, Siqi Cai, Tanja Schultz, Haizhou Li:
Leveraging Graphic and Convolutional Neural Networks for Auditory Attention Detection with EEG. INTERSPEECH 2024
[c714]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SongLYP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SongLYP024
Zeyang Song, Qianhui Liu, Qu Yang, Yizhou Peng, Haizhou Li:
ED-sKWS: Early-Decision Spiking Neural Networks for Rapid, and Energy-Efficient Keyword Spotting. INTERSPEECH 2024
[c713]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangZLLWG0Q024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangZLLWG0Q024
Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. INTERSPEECH 2024
[c712]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangZ0A024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangZ0A024
Qiquan Zhang, Hongxu Zhu, Xinyuan Qian, Eliathamby Ambikairajah, Haizhou Li:
An Exploration of Length Generalization in Transformer-Based Speech Enhancement. INTERSPEECH 2024
[c711]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/Bai0LZRW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Bai0LZRW024
Qibing Bai, Shuai Wang, Zhijun Liu, Mingyang Zhang, Wei Rao, Yannan Wang, Haizhou Li:
Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion. ISCSLP 2024: 284-288
[c710]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/Hu0G024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Hu0G024
Yifan Hu, Rui Liu, Guanglai Gao, Haizhou Li:
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis. ISCSLP 2024: 299-303
[c709]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhaoWLP0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhaoWLP0Z24
Peng Zhao, Ruicong Wang, Zijie Lin, Zexu Pan, Haizhou Li, Xueyi Zhang:
Ensemble Deep Learning Models for EEG-Based Auditory Attention Decoding. ISCSLP 2024: 339-343
[c708]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YueZCZLZQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YueZCZLZQ024
Xianghu Yue, Xueyi Zhang, Yiming Chen, Chengwei Zhang, Mingrui Lao, Huiping Zhuang, Xinyuan Qian, Haizhou Li:
MMAL: Multi-Modal Analytic Learning for Exemplar-Free Audio-Visual Class Incremental Tasks. ACM Multimedia 2024: 2428-2437
[c707]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuLL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuLL0024
Weizhi Liu, Yue Li, Dongdong Lin, Hui Tian, Haizhou Li:
GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis. ACM Multimedia 2024: 3294-3302
[c706]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0008H00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0008H00024
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Generative Expressive Conversational Speech Synthesis. ACM Multimedia 2024: 4187-4196
[c705]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuWQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuWQ024
Miao Liu, Jing Wang, Xinyuan Qian, Haizhou Li:
ListenFormer: Responsive Listening Head Generation with Non-autoregressive Transformers. ACM Multimedia 2024: 7094-7103
[c704]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoSJTCA024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoSJTCA024
Ruijie Tao, Zhan Shi, Yidi Jiang, Duc-Tuan Truong, Eng Siong Chng, Massimo Alioto, Haizhou Li:
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization. ACM Multimedia 2024: 11342-11347
[c703]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/LiZKL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/LiZKL24
Chuang Li, Yan Zhang, Min-Yen Kan, Haizhou Li:
UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking. NAACL-HLT (Findings) 2024: 2972-2983
[c702]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WangCS0CXCJLWW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangCS0CXCJLWW024
Xidong Wang, Guiming Chen, Dingjie Song, Zhiyi Zhang, Zhihong Chen, Qingying Xiao, Junying Chen, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li:
CMB: A Comprehensive Medical Benchmark in Chinese. NAACL-HLT 2024: 6184-6205
[c701]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/HuangYZSCSCAAHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/HuangYZSCSCAAHL24
Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Mosen Alharthi, Bang An, Juncai He, Ziche Liu, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu:
AceGPT, Localizing Large Language Models in Arabic. NAACL-HLT 2024: 8139-8163
[c700]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/AoWTCZ0W0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/AoWTCZ0W0024
Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. NeurIPS 2024
[c699]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiangCZHZAAHZ0W24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiangCZHZAAHZ0W24
Juhao Liang, Zhenyang Cai, Jianqing Zhu, Huang Huang, Kewei Zong, Bang An, Mosen Alharthi, Juncai He, Lian Zhang, Haizhou Li, Benyou Wang, Jinchao Xu:
Alignment at Pre-training! Towards Native Alignment for Arabic LLMs. NeurIPS 2024
[c698]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangLZ0GCY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangLZ0GCY024
Xueyi Zhang, Mingrui Lao, Peng Zhao, Jun Tang, Yanming Guo, Siqi Cai, Xianghu Yue, Haizhou Li:
Language Without Borders: A Dataset and Benchmark for Code-Switching Lip Reading. NeurIPS 2024
[c697]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/0003SB0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/0003SB0024
Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li:
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion. Odyssey 2024: 180-186
[c696]
- view
  authority control:
- export record
  dblp key:
  - conf/ram/YangCLHC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ram/YangCLHC024
Hongli Yang, Xinyi Chen, Junjie Li, Hao Huang, Siqi Cai, Haizhou Li:
Listen to the Speaker in Your Gaze. CIS-RAM 2024: 380-385
[c695]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiZWLML24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiZWLML24
Junjie Li, Ke Zhang, Shuai Wang, Haizhou Li, Man-Wai Mak, Kong Aik Lee:
On the Effectiveness of Enrollment Speech Augmentation For Target Speaker Extraction. SLT 2024: 325-332
[c694]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SilvaCPSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SilvaCPSL24
Dashanka De Silva, Siqi Cai, Saurav Pahuja, Tanja Schultz, Haizhou Li:
Neurospex: Neuro-Guided Speaker Extraction With Cross-Modal Fusion. SLT 2024: 341-348
[c693]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/WangWLZQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/WangWLZQL24
Jiahe Wang, Shuai Wang, Junjie Li, Ke Zhang, Yanmin Qian, Haizhou Li:
Enhancing Speaker Extraction Through Rectifying Target Confusion. SLT 2024: 349-356
[c692]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ZhangXGWLHWLCZFCTZWHCLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhangXGWLHWLCZFCTZWHCLW24
Xueyao Zhang, Liumeng Xue, Yicheng Gu, Yuancheng Wang, Jiaqi Li, Haorui He, Chaoren Wang, Songting Liu, Xi Chen, Junan Zhang, Zihao Fang, Haopeng Chen, Tze Ying Tang, Lexiao Zou, Mingxuan Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu:
Amphion: an Open-Source Audio, Music, and Speech Generation Toolkit. SLT 2024: 879-884
[c691]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/JiangZJLCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/JiangZJLCL24
Lichuan Jiang, Jiani Zhong, Muqing Jian, Xuanzhuo Liu, Siqi Cai, Haizhou Li:
The Impact of Synchronized Visual and Auditory Attention on Human Perception. ICSR + InnoBiz 2024: 41-50
[c690]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/QianLZCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/QianLZCL24
Xinyuan Qian, Chen Lu, Yating Zhang, Kainan Chen, Haizhou Li:
Semi-supervised Speaker Localization with Gaussian-Like Pseudo-labeling. ICSR + InnoBiz 2024: 146-155
[c689]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/WangZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/WangZL24
Shuai Wang, Pengcheng Zhu, Haizhou Li:
M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions. ICSR + InnoBiz 2024: 303-311
[c688]
- view
  authority control:
- export record
  dblp key:
  - conf/www/LiuHGZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/LiuHGZ024
Ganjun Liu, Xiaohui Hou, Meng Ge, Tao Zhang, Haizhou Li:
A Non-Intrusive Approach to Assessing Dysarthria Severity: Advancing Clinical Diagnosis. WWW (Companion Volume) 2024: 1134-1137
[i227]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-02626
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-02626
Yi Ma, Kong Aik Lee, Ville Hautamäki, Meng Ge, Haizhou Li:
Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio. CoRR abs/2401.02626 (2024)
[i226]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-09150
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-09150
Feng Jiang, Kuang Wang, Haizhou Li:
Bridging Research and Readers: A Multi-Modal Automated Academic Papers Interpretation System. CoRR abs/2401.09150 (2024)
[i225]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-09686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-09686
Qiquan Zhang, Meng Ge, Hongxu Zhu, Eliathamby Ambikairajah, Qi Song, Zhaoheng Ni, Haizhou Li:
An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement. CoRR abs/2401.09686 (2024)
[i224]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-12264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-12264
Xianghu Yue, Xiaohai Tian, Malu Zhang, Zhizheng Wu, Haizhou Li:
CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing. CoRR abs/2401.12264 (2024)
[i223]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-14652
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-14652
Qianhui Liu, Jiaqi Yan, Malu Zhang, Gang Pan, Haizhou Li:
LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization. CoRR abs/2401.14652 (2024)
[i222]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-17604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-17604
Lei Liu, Li Liu, Haizhou Li:
Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition. CoRR abs/2401.17604 (2024)
[i221]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-00270
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-00270
Wenjie Wei, Malu Zhang, Jilin Zhang, Ammar Belatreche, Jibin Wu, Zijing Xu, Xuerui Qiu, Hong Chen, Yang Yang, Haizhou Li:
Event-Driven Learning for Spiking Neural Networks. CoRR abs/2403.00270 (2024)
[i220]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-02002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-02002
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Fine-Grained Quantitative Emotion Editing for Speech Generation. CoRR abs/2403.02002 (2024)
[i219]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-03640
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-03640
Xidong Wang, Nuo Chen, Junyin Chen, Yan Hu, Yidong Wang, Xiangbo Wu, Anningzhe Gao, Xiang Wan, Haizhou Li, Benyou Wang:
Apollo: An Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People. CoRR abs/2403.03640 (2024)
[i218]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-05772
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-05772
Qu Yang, Qianhui Liu, Nan Li, Meng Ge, Zeyang Song, Haizhou Li:
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks. CoRR abs/2403.05772 (2024)
[i217]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-12468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12468
Danqing Luo, Chen Zhang, Yan Zhang, Haizhou Li:
CrossTune: Black-Box Few-Shot Classification with Label Enhancement. CoRR abs/2403.12468 (2024)
[i216]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-16078
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-16078
Wenxuan Wu, Xueyuan Chen, Xixin Wu, Haizhou Li, Helen Meng:
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy. CoRR abs/2403.16078 (2024)
[i215]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-17161
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-17161
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu:
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder. CoRR abs/2404.17161 (2024)
[i214]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-18501
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-18501
Ruijie Tao, Xinyuan Qian, Yidi Jiang, Junjie Li, Jiadong Wang, Haizhou Li:
Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention. CoRR abs/2404.18501 (2024)
[i213]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-01868
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-01868
Chuang Li, Yang Deng, Hengchang Hu, Min-Yen Kan, Haizhou Li:
Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems. CoRR abs/2405.01868 (2024)
[i212]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-09171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-09171
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. CoRR abs/2405.09171 (2024)
[i211]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-12609
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-12609
Xiangyu Zhang, Qiquan Zhang, Hexin Liu, Tianyi Xiao, Xinyuan Qian, Beena Ahmed, Eliathamby Ambikairajah, Haizhou Li, Julien Epps:
Mamba in Speech: Towards an Alternative to Self-Attention. CoRR abs/2405.12609 (2024)
[i210]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-14646
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14646
Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li:
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models. CoRR abs/2405.14646 (2024)
[i209]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-19799
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19799
Jiahui Xu, Feng Jiang, Anningzhe Gao, Haizhou Li:
Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation. CoRR abs/2405.19799 (2024)
[i208]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-20215
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-20215
Chen Zhang, Chengguang Tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li:
TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models. CoRR abs/2405.20215 (2024)
[i207]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02483
Tianchi Liu, Lin Zhang, Rohan Kumar Das, Yi Ma, Ruijie Tao, Haizhou Li:
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio? CoRR abs/2406.02483 (2024)
[i206]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05551
Zhijun Liu, Shuai Wang, Sho Inoue, Qibing Bai, Haizhou Li:
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis. CoRR abs/2406.05551 (2024)
[i205]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07198
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07198
Yidi Jiang, Ruijie Tao, Zhengyang Chen, Yanmin Qian, Haizhou Li:
Target Speech Diarization with Multimodal Prompts. CoRR abs/2406.07198 (2024)
[i204]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10844
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10844
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhiwu Li, Haizhou Li:
Multi-Scale Accent Modeling with Disentangling for Multi-Speaker Multi-Accent TTS Synthesis. CoRR abs/2406.10844 (2024)
[i203]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-12726
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-12726
Zeyang Song, Qianhui Liu, Qu Yang, Yizhou Peng, Haizhou Li:
ED-sKWS: Early-Decision Spiking Neural Networks for Rapid,and Energy-Efficient Keyword Spotting. CoRR abs/2406.12726 (2024)
[i202]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13340
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13340
Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. CoRR abs/2406.13340 (2024)
[i201]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-14115
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-14115
Ziche Liu, Rui Ke, Feng Jiang, Haizhou Li:
Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models. CoRR abs/2406.14115 (2024)
[i200]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-01009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-01009
Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li:
DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models. CoRR abs/2407.01009 (2024)
[i199]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02751
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02751
Rui Liu, Haolin Zuo, Zheng Lian, Xiaofen Xing, Björn W. Schuller, Haizhou Li:
Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset. CoRR abs/2407.02751 (2024)
[i198]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-09521
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-09521
Yang Wang, Haiyang Mei, Qirui Bao, Ziqi Wei, Mike Zheng Shou, Haizhou Li, Bo Dong, Xin Yang:
Apprenticeship-Inspired Elegance: Synergistic Knowledge Distillation Empowers Spiking Neural Networks for Efficient Single-Eye Emotion Recognition. CoRR abs/2407.09521 (2024)
[i197]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-10471
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10471
Weizhi Liu, Yue Li, Dongdong Lin, Hui Tian, Haizhou Li:
GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis. CoRR abs/2407.10471 (2024)
[i196]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-15188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-15188
Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. CoRR abs/2407.15188 (2024)
[i195]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-21491
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-21491
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Generative Expressive Conversational Speech Synthesis. CoRR abs/2407.21491 (2024)
[i194]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-16564
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16564
Qianhui Liu, Jiadong Wang, Yang Wang, Xin Yang, Gang Pan, Haizhou Li:
Human-Inspired Audio-Visual Speech Recognition: Spike Activity, Cueing Interaction and Causal Processing. CoRR abs/2408.16564 (2024)
[i193]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-02489
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-02489
Dashanka De Silva, Siqi Cai, Saurav Pahuja, Tanja Schultz, Haizhou Li:
NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention. CoRR abs/2409.02489 (2024)
[i192]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-07224
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-07224
Xinyuan Qian, Xianghu Yue, Jiadong Wang, Huiping Zhuang, Haizhou Li:
Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection. CoRR abs/2409.07224 (2024)
[i191]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-09351
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-09351
Zhijun Liu, Shuai Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
E1 TTS: Simple and Fast Non-Autoregressive TTS. CoRR abs/2409.09351 (2024)
[i190]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-09352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-09352
Sho Inoue, Shuai Wang, Wanxing Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion. CoRR abs/2409.09352 (2024)
[i189]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-09589
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-09589
Junjie Li, Ke Zhang, Shuai Wang, Haizhou Li, Man-Wai Mak, Kong Aik Lee:
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction. CoRR abs/2409.09589 (2024)
[i188]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-13948
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-13948
Chen Zhang, Dading Chong, Feng Jiang, Chengguang Tang, Anningzhe Gao, Guohua Tang, Haizhou Li:
Aligning Language Models Using Follow-up Likelihood as Reward Signal. CoRR abs/2409.13948 (2024)
[i187]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-15782
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15782
Shuai Wang, Pengcheng Zhu, Haizhou Li:
M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions. CoRR abs/2409.15782 (2024)
[i186]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-15799
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15799
Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. CoRR abs/2409.15799 (2024)
[i185]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-18680
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-18680
Yiming Chen, Xianghu Yue, Xiaoxue Gao, Chen Zhang, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li:
Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models. CoRR abs/2409.18680 (2024)
[i184]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-03719
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-03719
Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li:
FluentEditor+: Text-based Speech Editing by Modeling Local Hierarchical Acoustic Smoothness and Global Prosody Consistency. CoRR abs/2410.03719 (2024)
[i183]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-09524
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-09524
Rui Liu, Zhenqi Jia, Jie Yang, Yifan Hu, Haizhou Li:
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling. CoRR abs/2410.09524 (2024)
[i182]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-13268
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-13268
Fan Bu, Yuhao Zhang, Xidong Wang, Benyou Wang, Qun Liu, Haizhou Li:
Roadmap towards Superhuman Speech Understanding using Large Language Models. CoRR abs/2410.13268 (2024)
[i181]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-14101
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-14101
Shuwei He, Rui Liu, Haizhou Li:
Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech. CoRR abs/2410.14101 (2024)
[i180]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-14259
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-14259
Zihao Cheng, Li Zhou, Feng Jiang, Benyou Wang, Haizhou Li:
Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement. CoRR abs/2410.14259 (2024)
[i179]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-16059
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-16059
Ke Zhang, Junjie Li, Shuai Wang, Yangjie Wei, Yi Wang, Yannan Wang, Haizhou Li:
Multi-Level Speaker Representation for Target Speaker Extraction. CoRR abs/2410.16059 (2024)
[i178]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-17196
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-17196
Yiming Chen, Xianghu Yue, Chen Zhang, Xiaoxue Gao, Robby T. Tan, Haizhou Li:
VoiceBench: Benchmarking LLM-Based Voice Assistants. CoRR abs/2410.17196 (2024)
[i177]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-03085
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-03085
Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation with Pretrained Frontend to Minimize Domain Mismatch. CoRR abs/2411.03085 (2024)
[i176]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-07751
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-07751
Xinyuan Qian, Jiaran Gao, Yaodan Zhang, Qiquan Zhang, Hexin Liu, Leibny Paola García, Haizhou Li:
SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model. CoRR abs/2411.07751 (2024)
[i175]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-09220
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-09220
Xiaoxue Gao, Zexin Li, Yiming Chen, Cong Liu, Haizhou Li:
Transferable Adversarial Attacks against ASR. CoRR abs/2411.09220 (2024)
[i174]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-03253
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-03253
Juhao Liang, Zhenyang Cai, Jianqing Zhu, Huang Huang, Kewei Zong, Bang An, Mosen Alharthi, Juncai He, Lian Zhang, Haizhou Li, Benyou Wang, Jinchao Xu:
Alignment at Pre-training! Towards Native Alignment for Arabic LLMs. CoRR abs/2412.03253 (2024)
[i173]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-08247
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-08247
Junjie Li, Ke Zhang, Shuai Wang, Kong Aik Lee, Haizhou Li:
MoMuSE: Momentum Multi-modal Target Speaker Extraction for Real-time Scenarios with Impaired Visual Cues. CoRR abs/2412.08247 (2024)
[i172]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-11409
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-11409
Rui Liu, Shuwei He, Yifan Hu, Haizhou Li:
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech. CoRR abs/2412.11409 (2024)
[i171]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-12310
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-12310
Jianqing Zhu, Huang Huang, Zhihang Lin, Juhao Liang, Zhengyang Tang, Khalid Almubarak, Abdulmohsen Alharthik, Bang An, Juncai He, Xiangbo Wu, Fei Yu, Junying Chen, Zhuoheng Ma, Yuhao Du, He Zhang, Emad A. Alghamdi, Lian Zhang, Ruoyu Sun, Haizhou Li, Benyou Wang, Jinchao Xu:
Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion. CoRR abs/2412.12310 (2024)
[i170]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-12498
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-12498
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Control of Emotion Rendering in Speech Synthesis. CoRR abs/2412.12498 (2024)
[i169]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-13786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-13786
Chenyu Yang, Shuai Wang, Hangting Chen, Jianwei Yu, Wei Tan, Rongzhi Gu, Yaoxun Xu, Yizhi Zhou, Haina Zhu, Haizhou Li:
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor. CoRR abs/2412.13786 (2024)
2023
[j160]
- view
  authority control:
- export record
  dblp key:
  - journals/cacm/LuoWGDCLJY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cacm/LuoWGDCLJY23
Tao Luo, Weng-Fai Wong, Rick Siow Mong Goh, Anh Tuan Do, Zhixian Chen, Haizhou Li, Wenyu Jiang, Weiyun Yau:
Achieving Green AI with Energy-Efficient Deep Learning Using Neuromorphic Computing. Commun. ACM 66(7): 52-57 (2023)
[j159]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/WickramasingheASELD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/WickramasingheASELD23
Buddhi Wickramasinghe, Eliathamby Ambikairajah, Vidhyasaharan Sethu, Julien Epps, Haizhou Li, Ting Dang:
DNN controlled adaptive front-end for replay attack detection systems. Speech Commun. 154: 102973 (2023)
[j158]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/WangPGYL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/WangPGYL23
Tingting Wang, Zexu Pan, Meng Ge, Zhen Yang, Haizhou Li:
Time-Domain Speech Separation Networks With Graph Encoding Auxiliary. IEEE Signal Process. Lett. 30: 110-114 (2023)
[j157]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/ZhouWZTL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhouWZTL23
Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li:
TTS-Guided Training for Accent Conversion Without Parallel Data. IEEE Signal Process. Lett. 30: 533-537 (2023)
[j156]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ZhangZWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhangZWL23
Mingyang Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li:
Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951 (2023)
[j155]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taffco/ZhouSRSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/ZhouSRSL23
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect. Comput. 14(1): 31-48 (2023)
[j154]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taffco/ZhouSRSL23a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/ZhouSRSL23a
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis With Mixed Emotions. IEEE Trans. Affect. Comput. 14(4): 3120-3134 (2023)
[j153]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TianQMLQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TianQMLQ23
Hui Tian, Yiqin Qiu, Wojciech Mazurczyk, Haizhou Li, Zhenxing Qian:
STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams. IEEE ACM Trans. Audio Speech Lang. Process. 31: 277-289 (2023)
[j152]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangQNNAL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangQNNAL23
Qiquan Zhang, Xinyuan Qian, Zhaoheng Ni, Aaron Nicolson, Eliathamby Ambikairajah, Haizhou Li:
A Time-Frequency Attention Module for Neural Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 462-475 (2023)
[j151]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/QianWWGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianWWGL23
Xinyuan Qian, Zhengdong Wang, Jiadong Wang, Guohui Guan, Haizhou Li:
Audio-Visual Cross-Attention Network for Robotic Speaker Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 31: 550-562 (2023)
[j150]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangDZFL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangDZFL23
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li:
PoE: A Panel of Experts for Generalized Automatic Dialogue Assessment. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1234-1250 (2023)
[j149]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/TaoLDHL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TaoLDHL23
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1706-1719 (2023)
[j148]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhouWTL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhouWTL23
Yi Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li:
Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1916-1926 (2023)
[j147]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/GaoGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GaoGL23
Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
PoLyScriber: Integrated Fine-Tuning of Extractor and Lyrics Transcriber for Polyphonic Music. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1968-1981 (2023)
[j146]
- view
  authority control:
- export record
  dblp key:
  - journals/tcsv/WengZLRML23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/WengZLRML23
Zhenyu Weng, Huiping Zhuang, Haizhou Li, Balakrishnan Ramalingam, Rajesh Elara Mohan, Zhiping Lin:
Online Multi-Face Tracking With Multi-Modality Cascaded Matching. IEEE Trans. Circuits Syst. Video Technol. 33(6): 2738-2752 (2023)
[j145]
- view
  authority control:
- export record
  dblp key:
  - journals/tifs/QiuTLCV23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tifs/QiuTLCV23
Yiqin Qiu, Hui Tian, Haizhou Li, Chin-Chen Chang, Athanasios V. Vasilakos:
Separable Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech Steganalysis. IEEE Trans. Inf. Forensics Secur. 18: 2737-2750 (2023)
[j144]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/WuCZLLT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/WuCZLLT23
Jibin Wu, Yansong Chua, Malu Zhang, Guoqi Li, Haizhou Li, Kay Chen Tan:
A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 34(1): 446-460 (2023)
[c687]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ChenCL0LT023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChenCL0LT023
Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li:
Dynamic Transformers Provide a False Sense of Efficiency. ACL (1) 2023: 7164-7180
[c686]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhangZWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhangZWL23
Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Zero-shot multi-speaker accent TTS with limited accent data. APSIPA ASC 2023: 1931-1936
[c685]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/DuJTZ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/DuJTZ023
Jiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li:
Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation. CVPR 2023: 3749-3758
[c684]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/WangQZT023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangQZT023
Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li:
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert. CVPR 2023: 14653-14662
[c683]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/dada/YiTFYWWZZZRXZGW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dada/YiTFYWWZZZRXZGW23
Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li:
ADD 2023: the Second Audio Deepfake Detection Challenge. DADA@IJCAI 2023: 125-130
[c682]
- view
  authority control:
- export record
  dblp key:
  - conf/embc/CaiLY023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/embc/CaiLY023
Siqi Cai, Jia Li, Hongmeng Yang, Haizhou Li:
RGCnet: An Efficient Recursive Gated Convolutional Network for EEG-based Auditory Attention Detection. EMBC 2023: 1-4
[c681]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhangDTST023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangDTST023
Chen Zhang, Luis F. D'Haro, Chengguang Tang, Ke Shi, Guohua Tang, Haizhou Li:
xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark. EMNLP (Findings) 2023: 5579-5601
[c680]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/0004FTL023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/0004FTL023
Yan Zhang, Zhaopeng Feng, Zhiyang Teng, Zuozhu Liu, Haizhou Li:
How Well Do Text Embedding Models Understand Syntax? EMNLP (Findings) 2023: 9717-9728
[c679]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhangCJYCCLWZXW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangCJYCCLWZXW23
Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Guiming Chen, Jianquan Li, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li:
HuatuoGPT, Towards Taming Language Model to Be a Doctor. EMNLP (Findings) 2023: 10859-10885
[c678]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BorsdorfPICLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BorsdorfPICLS23
Marvin Borsdorf, Saurav Pahuja, Gabriel Ivucic, Siqi Cai, Haizhou Li, Tanja Schultz:
Multi-Head Attention and GRU for Improved Match-Mismatch Classification of Speech Stimulus and EEG Response. ICASSP 2023: 1-2
[c677]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaoYL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaoYL23
Xiaoxue Gao, Xianghu Yue, Haizhou Li:
Self-Transriber: Few-Shot Lyrics Transcription With Self-Training. ICASSP 2023: 1-5
[c676]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PanWBL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PanWBL23
Zexu Pan, Wupeng Wang, Marvin Borsdorf, Haizhou Li:
ImagineNet: Target Speaker Extraction with Intermittent Visual Cue Through Embedding Inpainting. ICASSP 2023: 1-5
[c675]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TaoLSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TaoLSL23
Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li:
Speaker Recognition with Two-Step Multi-Modal Deep Cleansing. ICASSP 2023: 1-5
[c674]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YueAGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YueAGL23
Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li:
Token2vec: A Joint Self-Supervised Pre-Training Framework Using Unpaired Speech and Text. ICASSP 2023: 1-5
[c673]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangZSQNL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangZSQNL23
Qiquan Zhang, Hongxu Zhu, Qi Song, Xinyuan Qian, Zhaoheng Ni, Haizhou Li:
Ripple Sparse Self-Attention for Monaural Speech Enhancement. ICASSP 2023: 1-5
[c672]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZuoLZGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZuoLZGL23
Haolin Zuo, Rui Liu, Jinming Zhao, Guanglai Gao, Haizhou Li:
Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities. ICASSP 2023: 1-5
[c671]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/SiZLWWDCL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/SiZLWWDCL23
Yuke Si, Yan Zhang, Yuhang Li, Xiaobao Wang, Longbiao Wang, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Local and Global Context Modeling with Relation Matching Task for Dialog Act Recognition. IJCNN 2023: 1-8
[c670]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008ZHG023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008ZHG023
Rui Liu, Haolin Zuo, De Hu, Guanglai Gao, Haizhou Li:
Explicit Intensity Control for Accented Text-to-speech. INTERSPEECH 2023: 22-26
[c669]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangC023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangC023
Ruicong Wang, Siqi Cai, Haizhou Li:
EEG-based Auditory Attention Detection with Spatiotemporal Graph and Graph Convolutional Network. INTERSPEECH 2023: 1144-1148
[c668]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengAKW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengAKW023
Chutong Meng, Junyi Ao, Tom Ko, Mingxuan Wang, Haizhou Li:
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning. INTERSPEECH 2023: 2978-2982
[c667]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LinYA023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinYA023
Jingru Lin, Xianghu Yue, Junyi Ao, Haizhou Li:
Self-Supervised Acoustic Word Embedding Learning via Correspondence Transformer Encoder. INTERSPEECH 2023: 2988-2992
[c666]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangTP023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangTP023
Yidi Jiang, Ruijie Tao, Zexu Pan, Haizhou Li:
Target Active Speaker Detection with Audio-visual Cues. INTERSPEECH 2023: 3152-3156
[c665]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangBP0WW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangBP0WW23
Ke Zhang, Marvin Borsdorf, Zexu Pan, Haizhou Li, Yangjie Wei, Yi Wang:
Speaker Extraction with Detection of Presence and Absence of Target Speakers. INTERSPEECH 2023: 3714-3718
[c664]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuG0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuG0023
Qinghua Liu, Meng Ge, Zhizheng Wu, Haizhou Li:
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network. INTERSPEECH 2023: 3719-3723
[c663]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008ZG023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008ZG023
Rui Liu, Jinhua Zhang, Guanglai Gao, Haizhou Li:
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion. INTERSPEECH 2023: 3999-4003
[c662]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuS0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuS0023
Junchen Lu, Berrak Sisman, Mingyang Zhang, Haizhou Li:
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units. INTERSPEECH 2023: 5536-5540
[c661]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/kars/LiHZK023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kars/LiHZK023
Chuang Li, Hengchang Hu, Yan Zhang, Min-Yen Kan, Haizhou Li:
A Conversation is Worth A Thousand Recommendations: A Survey of Holistic Conversational Recommendation Systems. KaRS@RecSys 2023: 7-20
[c660]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhangZWTLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangZWTLL23
Xueyi Zhang, Chengwei Zhang, Tao Wang, Jun Tang, Songyang Lao, Haizhou Li:
Slow-Fast Time Parameter Aggregation Network for Class-Incremental Lip Reading. ACM Multimedia 2023: 747-756
[c659]
- view
  authority control:
- export record
  dblp key:
  - conf/ner/PahujaCSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ner/PahujaCSL23
Saurav Pahuja, Siqi Cai, Tanja Schultz, Haizhou Li:
XAnet: Cross-Attention Between EEG of Left and Right Brain for Auditory Attention Decoding. NER 2023: 1-4
[c658]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0004LW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0004LW023
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Disentangling Voice and Content with Self-Supervision for Speaker Recognition. NeurIPS 2023
[c657]
- view
  authority control:
- export record
  dblp key:
  - conf/nlpcc/FanJLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nlpcc/FanJLL23
Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li:
GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning. NLPCC (3) 2023: 69-80
[c656]
- view
  authority control:
- export record
  dblp key:
  - conf/rep4nlp/Wang023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rep4nlp/Wang023
Bin Wang, Haizhou Li:
Relational Sentence Embedding for Flexible Semantic Matching. RepL4NLP@ACL 2023: 238-252
[c655]
- view
  authority control:
- export record
  dblp key:
  - conf/smc/PahujaIPC0S23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/smc/PahujaIPC0S23
Saurav Pahuja, Gabriel Ivucic, Felix Putze, Siqi Cai, Haizhou Li, Tanja Schultz:
Enhancing Subject-Independent EEG-Based Auditory Attention Decoding with WGAN and Pearson Correlation Coefficient. SMC 2023: 3715-3720
[e24]
- view
- export record
  dblp key:
  - conf/dada/2023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dada/2023
Jianhua Tao, Haizhou Li, Jiangyan Yi, Cunhang Fan:
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), Macao, China, August 19, 2023. CEUR Workshop Proceedings 3597, CEUR-WS.org 2023 [contents]
[i168]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-17480
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-17480
Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li:
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert. CoRR abs/2303.17480 (2023)
[i167]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-10453
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-10453
Zhihong Chen, Feng Jiang, Junying Chen, Tiannan Wang, Fei Yu, Guiming Chen, Hongbo Zhang, Juhao Liang, Chen Zhang, Zhiyi Zhang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li:
Phoenix: Democratizing ChatGPT across Languages. CoRR abs/2304.10453 (2023)
[i166]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-04816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-04816
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis with Limited Data. CoRR abs/2305.04816 (2023)
[i165]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-08541
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-08541
Qiquan Zhang, Hongxu Zhu, Qi Song, Xinyuan Qian, Zhaoheng Ni, Haizhou Li:
Ripple sparse self-attention for monaural speech enhancement. CoRR abs/2305.08541 (2023)
[i164]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12228
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12228
Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li:
Dynamic Transformers Provide a False Sense of Efficiency. CoRR abs/2305.12228 (2023)
[i163]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12831
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12831
Yidi Jiang, Ruijie Tao, Zexu Pan, Haizhou Li:
Target Active Speaker Detection with Audio-visual Cues. CoRR abs/2305.12831 (2023)
[i162]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13755
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13755
Feng Jiang, Longwang He, Peifeng Li, Qiaoming Zhu, Haizhou Li:
Topic-driven Distant Supervision Framework for Macro-level Discourse Parsing. CoRR abs/2305.13755 (2023)
[i161]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13774
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13774
Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li:
ADD 2023: the Second Audio Deepfake Detection Challenge. CoRR abs/2305.13774 (2023)
[i160]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13785
Danqing Luo, Chen Zhang, Jiahui Xu, Bin Wang, Yiming Chen, Yan Zhang, Haizhou Li:
Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation. CoRR abs/2305.13785 (2023)
[i159]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-14790
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14790
Feng Jiang, Weihao Liu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu, Haizhou Li:
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark. CoRR abs/2305.14790 (2023)
[i158]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15075
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15075
Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Jianquan Li, Guiming Chen, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li:
HuatuoGPT, towards Taming Language Model to Be a Doctor. CoRR abs/2305.15075 (2023)
[i157]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16353
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16353
Rui Liu, Jinhua Zhang, Guanglai Gao, Haizhou Li:
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion. CoRR abs/2305.16353 (2023)
[i156]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16594
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16594
Xinyi Chen, Qu Yang, Jibin Wu, Haizhou Li, Kay Chen Tan:
A Hybrid Neural Coding Approach for Pattern Recognition with Spiking Neural Networks. CoRR abs/2305.16594 (2023)
[i155]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03612
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03612
Zhenyu Weng, Huiping Zhuang, Haizhou Li, Zhiping Lin:
Constant Sequence Extension for Fast Search Using Weighted Hamming Distance. CoRR abs/2306.03612 (2023)
[i154]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-17005
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-17005
Junchen Lu, Berrak Sisman, Mingyang Zhang, Haizhou Li:
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units. CoRR abs/2306.17005 (2023)
[i153]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-07231
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-07231
Shimin Zhang, Qu Yang, Chenxiang Ma, Jibin Wu, Haizhou Li, Kay Chen Tan:
Long Short-term Memory with Two-Compartment Spiking Neuron. CoRR abs/2307.07231 (2023)
[i152]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-11380
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-11380
Lingyi Yang, Feng Jiang, Haizhou Li:
Is ChatGPT Involved in Texts? Measure the Polish Ratio to Detect ChatGPT-Generated Text. CoRR abs/2307.11380 (2023)
[i151]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-13923
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-13923
Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li:
GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning. CoRR abs/2307.13923 (2023)
[i150]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-08833
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-08833
Xidong Wang, Guiming Hardy Chen, Dingjie Song, Zhiyi Zhang, Zhihong Chen, Qingying Xiao, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li:
CMB: A Comprehensive Medical Benchmark in Chinese. CoRR abs/2308.08833 (2023)
[i149]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-13250
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-13250
Shimin Zhang, Qu Yang, Chenxiang Ma, Jibin Wu, Haizhou Li, Kay Chen Tan:
TC-LIF: A Two-Compartment Spiking Neuron Model for Long-term Sequential Modelling. CoRR abs/2308.13250 (2023)
[i148]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-14774
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14774
Hongxu Zhu, Siqi Cai, Yidi Jiang, Qiquan Zhang, Haizhou Li:
EEG-Derived Voice Signature for Attended Speaker Detection. CoRR abs/2308.14774 (2023)
[i147]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-06723
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-06723
Qinghua Liu, Meng Ge, Zhizheng Wu, Haizhou Li:
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network. CoRR abs/2309.06723 (2023)
[i146]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07682
Chuang Li, Hengchang Hu, Yan Zhang, Min-Yen Kan, Haizhou Li:
A Conversation is Worth A Thousand Recommendations: A Survey of Holistic Conversational Recommender Systems. CoRR abs/2309.07682 (2023)
[i145]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08408
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech. CoRR abs/2309.08408 (2023)
[i144]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09469
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09469
Zeyang Song, Jibin Wu, Malu Zhang, Mike Zheng Shou, Haizhou Li:
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks. CoRR abs/2309.09469 (2023)
[i143]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10674
Junyi Ao, Mehmet Sinan Yildirim, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li:
USED: Universal Speaker Extraction and Diarization. CoRR abs/2309.10674 (2023)
[i142]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-11724
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-11724
Rui Liu, Bin Liu, Haizhou Li:
Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech. CoRR abs/2309.11724 (2023)
[i141]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-11725
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-11725
Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li:
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency. CoRR abs/2309.11725 (2023)
[i140]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-11730
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-11730
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition. CoRR abs/2309.11730 (2023)
[i139]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12053
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12053
Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu:
AceGPT, Localizing Large Language Models in Arabic. CoRR abs/2309.12053 (2023)
[i138]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-01128
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-01128
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Disentangling Voice and Content with Self-Supervision for Speaker Recognition. CoRR abs/2310.01128 (2023)
[i137]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-08958
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-08958
Chen Zhang, Luis Fernando D'Haro, Chengguang Tang, Ke Shi, Guohua Tang, Haizhou Li:
xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark. CoRR abs/2310.08958 (2023)
[i136]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-10492
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10492
Chuang Li, Yan Zhang, Min-Yen Kan, Haizhou Li:
UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking. CoRR abs/2310.10492 (2023)
[i135]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-10497
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10497
Yu Chen, Xinyuan Qian, Zexu Pan, Kainan Chen, Haizhou Li:
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism. CoRR abs/2310.10497 (2023)
[i134]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11722
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11722
Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li:
Quantify Health-Related Atomic Knowledge in Chinese Medical Large Language Models: A Computational Analysis. CoRR abs/2310.11722 (2023)
[i133]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-14978
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-14978
Qu Yang, Malu Zhang, Jibin Wu, Kay Chen Tan, Haizhou Li:
LC-TTFS: Towards Lossless Network Conversion for Spiking Neural Networks with TTFS Coding. CoRR abs/2310.14978 (2023)
[i132]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-07996
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-07996
Yan Zhang, Zhaopeng Feng, Zhiyang Teng, Zuozhu Liu, Haizhou Li:
How Well Do Text Embedding Models Understand Syntax? CoRR abs/2311.07996 (2023)
[i131]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-09774
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-09774
Junying Chen, Xidong Wang, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang:
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs. CoRR abs/2311.09774 (2023)
[i130]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-03620
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-03620
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification. CoRR abs/2312.03620 (2023)
[i129]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09911
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09911
Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu:
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit. CoRR abs/2312.09911 (2023)
[i128]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-11947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-11947
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling. CoRR abs/2312.11947 (2023)
[i127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-15407
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-15407
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Malu Zhang, Haizhou Li:
A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators. CoRR abs/2312.15407 (2023)
[i126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-16002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-16002
Meng Ge, Yizhou Peng, Yidi Jiang, Jingru Lin, Junyi Ao, Mehmet Sinan Yildirim, Shuai Wang, Haizhou Li, Mengling Feng:
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge. CoRR abs/2312.16002 (2023)
2022
[j143]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jstsp/YueLGL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/YueLGL22
Xianghu Yue, Jingru Lin, Fabian Ritter Gutierrez, Haizhou Li:
Self-Supervised Learning With Segmental Masking for Speech Representation. IEEE J. Sel. Top. Signal Process. 16(6): 1367-1379 (2022)
[j142]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/DuXL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/DuXL22
Hongqiang Du, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. Neural Networks 148: 74-84 (2022)
[j141]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/pami/WuXHZZLT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/WuXHZZLT22
Jibin Wu, Chenglin Xu, Xiao Han, Daquan Zhou, Malu Zhang, Haizhou Li, Kay Chen Tan:
Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 7824-7840 (2022)
[j140]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/ZhouSLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ZhouSLL22
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Emotional voice conversion: Theory, databases and ESD. Speech Commun. 137: 1-18 (2022)
[j139]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/ZhuLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ZhuLL22
Hongning Zhu, Kong Aik Lee, Haizhou Li:
Discriminative speaker embedding with serialized multi-layer multi-head attention. Speech Commun. 144: 89-100 (2022)
[j138]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/LiuDLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/LiuDLL22
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022)
[j137]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/PanQL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/PanQL22
Zexu Pan, Xinyuan Qian, Haizhou Li:
Speaker Extraction With Co-Speech Gestures Cue. IEEE Signal Process. Lett. 29: 1467-1471 (2022)
[j136]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/Li22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/Li22
Haizhou Li:
A Unique ICASSP 2022: During an Unusual Time [Conference Highlights]. IEEE Signal Process. Mag. 39(2): 159-160 (2022)
[j135]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/PanTXL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PanTXL22
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li:
Selective Listening by Synchronizing Speech With Lips. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1650-1664 (2022)
[j134]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuSGL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuSGL22
Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li:
Decoding Knowledge Transfer for Neural Text-to-Speech Training. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1789-1802 (2022)
[j133]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GaoGL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GaoGL22
Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
Automatic Lyrics Transcription of Polyphonic Music With Lyrics-Chord Multi-Task Learning. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2280-2294 (2022)
[j132]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/GuptaLG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuptaLG22
Chitralekha Gupta, Haizhou Li, Masataka Goto:
Deep Learning Approaches in Topics of Singing Information Processing. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2422-2451 (2022)
[j131]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/PanGL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PanGL22
Zexu Pan, Meng Ge, Haizhou Li:
USEV: Universal Speaker Extraction With Visual Cue. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3032-3045 (2022)
[j130]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tbe/SuCXLS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tbe/SuCXLS22
Enze Su, Siqi Cai, Longhan Xie, Haizhou Li, Tanja Schultz:
STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From EEG. IEEE Trans. Biomed. Eng. 69(7): 2233-2242 (2022)
[j129]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/thms/CaiSXL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/thms/CaiSXL22
Siqi Cai, Enze Su, Longhan Xie, Haizhou Li:
EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention. IEEE Trans. Hum. Mach. Syst. 52(2): 256-266 (2022)
[j128]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/ZhangWWBAZMQCCL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/ZhangWWBAZMQCCL22
Malu Zhang, Jiadong Wang, Jibin Wu, Ammar Belatreche, Burin Amornpaisannon, Zhixuan Zhang, Venkata Pavan Kumar Miriyala, Hong Qu, Yansong Chua, Trevor E. Carlson, Haizhou Li:
Rectified Linear Postsynaptic Potential Function for Backpropagation in Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 33(5): 1947-1958 (2022)
[c654]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangDF022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangDF022
Chen Zhang, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li:
MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation. AAAI 2022: 11657-11666
[c653]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZhaoZ0LJW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhaoZ0LJW022
Jinming Zhao, Tenggan Zhang, Jingwen Hu, Yuchen Liu, Qin Jin, Xinchao Wang, Haizhou Li:
M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. ACL (1) 2022: 5699-5710
[c652]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/WangK022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangK022
Bin Wang, C.-C. Jay Kuo, Haizhou Li:
Just Rank: Rethinking Evaluation with Word and Sentence Similarities. ACL (1) 2022: 6060-6077
[c651]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/GraumanWBCFGH0L22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/GraumanWBCFGH0L22
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran K. Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3, 000 Hours of Egocentric Video. CVPR 2022: 18973-18990
[c650]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhangDZF022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangDZF022
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li:
FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation. EMNLP 2022: 3336-3355
[c649]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/WangZZC022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangZZC022
Bin Wang, Chen Zhang, Yan Zhang, Yiming Chen, Haizhou Li:
Analyzing and Evaluating Faithfulness in Dialogue Summarization. EMNLP 2022: 4897-4908
[c648]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ChenZWL022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChenZWL022
Yiming Chen, Yan Zhang, Bin Wang, Zuozhu Liu, Haizhou Li:
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework. EMNLP 2022: 8150-8161
[c647]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaoGL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaoGL22
Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
Genre-Conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. ICASSP 2022: 791-795
[c646]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BorsdorfSLS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BorsdorfSLS22
Marvin Borsdorf, Kevin Scheck, Haizhou Li, Tanja Schultz:
Experts Versus All-Rounders: Target Language Extraction for Multiple Target Languages. ICASSP 2022: 846-850
[c645]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoLJWL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoLJWL22
Jinming Zhao, Ruichen Li, Qin Jin, Xinchao Wang, Haizhou Li:
Memobert: Pre-Training Model with Prompt-Based Learning for Multimodal Emotion Recognition. ICASSP 2022: 4703-4707
[c644]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TaoLDHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TaoLDHL22
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Speaker Recognition with Loss-Gated Learning. ICASSP 2022: 6142-6146
[c643]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GeXWCDL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GeXWCDL22
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. ICASSP 2022: 7287-7291
[c642]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuDLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuDLL22
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances. ICASSP 2022: 7517-7521
[c641]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangSNNL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangSNNL22
Qiquan Zhang, Qi Song, Zhaoheng Ni, Aaron Nicolson, Haizhou Li:
Time-Frequency Attention for Monaural Speech Enhancement. ICASSP 2022: 7852-7856
[c640]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuSLZL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuSLZL22
Junchen Lu, Berrak Sisman, Rui Liu, Mingyang Zhang, Haizhou Li:
Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. ICASSP 2022: 8032-8036
[c639]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangWZLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangWZLL22
Jiadong Wang, Jibin Wu, Malu Zhang, Qi Liu, Haizhou Li:
A Hybrid Learning Framework for Deep Spiking Neural Networks with One-Spike Temporal Coding. ICASSP 2022: 8942-8946
[c638]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YiFTNMWWTBFLWZY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YiFTNMWWTBFLWZY22
Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li:
ADD 2022: the first Audio Deep Synthesis Detection Challenge. ICASSP 2022: 9216-9220
[c637]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BorsdorfS0S22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BorsdorfS0S22
Marvin Borsdorf, Kevin Scheck, Haizhou Li, Tanja Schultz:
Blind Language Separation: Disentangling Multilingual Cocktail Party Voices by Language. INTERSPEECH 2022: 256-260
[c636]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangBAZXWZKL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangBAZXWZKL22
Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li:
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT. INTERSPEECH 2022: 1686-1690
[c635]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PanG022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PanG022
Zexu Pan, Meng Ge, Haizhou Li:
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. INTERSPEECH 2022: 1786-1790
[c634]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DuSZ022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DuSZ022
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion. INTERSPEECH 2022: 2603-2607
[c633]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AoZZ00K00QW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AoZZ00K00QW22
Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. INTERSPEECH 2022: 2658-2662
[c632]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangL022a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangL022a
Qu Yang, Qi Liu, Haizhou Li:
Deep residual spiking neural network for keyword spotting in low-resource settings. INTERSPEECH 2022: 3023-3027
[c631]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SongLY022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SongLY022
Zeyang Song, Qi Liu, Qu Yang, Haizhou Li:
Knowledge distillation for In-memory keyword spotting model. INTERSPEECH 2022: 4128-4132
[c630]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008SSG022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008SSG022
Rui Liu, Berrak Sisman, Björn W. Schuller, Guanglai Gao, Haizhou Li:
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning. INTERSPEECH 2022: 5493-5497
[c629]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoYFFLZ0M0A22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoYFFLZ0M0A22
Jianhua Tao, Jiangyan Yi, Cunhang Fan, Ruibo Fu, Shan Liang, Pengyuan Zhang, Haizhou Li, Helen Meng, Dong Yu, Masato Akagi:
DDAM '22: 1st International Workshop on Deepfake Detection for Audio Multimedia. ACM Multimedia 2022: 7405-7406
[c628]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangWZCW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangWZCW022
Qu Yang, Jibin Wu, Malu Zhang, Yansong Chua, Xinchao Wang, Haizhou Li:
Training Spiking Neural Networks with Local Tandem Learning. NeurIPS 2022
[c627]
- view
  authority control:
- export record
  dblp key:
  - conf/ococosda/LiSLCXL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ococosda/LiSLCXL22
Peiwen Li, Enze Su, Jia Li, Siqi Cai, Longhan Xie, Haizhou Li:
Esaa: An Eeg-Speech Auditory Attention Detection Database. O-COCOSDA 2022 2022: 1-6
[e23]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/2022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/2022
Rong Tong, Yanfeng Lu, Minghui Dong, Wengao Gong, Haizhou Li:
International Conference on Asian Language Processing, IALP 2022, Singapore, October 27-28, 2022. IEEE 2022, ISBN 978-1-6654-7674-4 [contents]
[e22]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/2021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/2021
Svetlana Stoyanchev, Stefan Ultes, Haizhou Li:
Conversational AI for Natural Human-Centric Interaction - 12th International Workshop on Spoken Dialogue System Technology, IWSDS 2021, Singapore. Lecture Notes in Electrical Engineering 943, Springer 2022, ISBN 978-981-19-5537-2 [contents]
[e21]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/2022ddam
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/2022ddam
Jianhua Tao, Haizhou Li, Helen Meng, Dong Yu, Masato Akagi, Jiangyan Yi, Cunhang Fan, Ruibo Fu, Shan Lian, Pengyuan Zhang:
DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, Lisboa, Portugal, 14 October 2022. ACM 2022, ISBN 978-1-4503-9496-3 [contents]
[i125]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-03967
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-03967
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. CoRR abs/2201.03967 (2022)
[i124]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-10693
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-10693
Hongqiang Du, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. CoRR abs/2201.10693 (2022)
[i123]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01624
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances. CoRR abs/2202.01624 (2022)
[i122]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-08433
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-08433
Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu:
ADD 2022: the First Audio Deep Synthesis Detection Challenge. CoRR abs/2202.08433 (2022)
[i121]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-09995
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09995
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. CoRR abs/2202.09995 (2022)
[i120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-02679
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-02679
Bin Wang, C.-C. Jay Kuo, Haizhou Li:
Just Rank: Rethinking Evaluation with Word and Sentence Similarities. CoRR abs/2203.02679 (2022)
[i119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15610
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15610
Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li:
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT. CoRR abs/2203.15610 (2022)
[i118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16840
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16840
Zexu Pan, Xinyuan Qian, Haizhou Li:
Speaker Extraction with Co-Speech Gestures Cue. CoRR abs/2203.16840 (2022)
[i117]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16843
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16843
Zexu Pan, Meng Ge, Haizhou Li:
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. CoRR abs/2203.16843 (2022)
[i116]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-17113
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-17113
Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. CoRR abs/2203.17113 (2022)
[i115]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-03307
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-03307
Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
Genre-conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. CoRR abs/2204.03307 (2022)
[i114]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-10237
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-10237
Jinming Zhao, Tenggan Zhang, Jingwen Hu, Yuchen Liu, Qin Jin, Xinchao Wang, Haizhou Li:
M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. CoRR abs/2205.10237 (2022)
[i113]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07229
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07229
Rui Liu, Berrak Sisman, Björn W. Schuller, Guanglai Gao, Haizhou Li:
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning. CoRR abs/2206.07229 (2022)
[i112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-07336
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07336
Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
PoLyScribers: Joint Training of Vocal Extractor and Lyrics Transcriber for Polyphonic Music. CoRR abs/2207.07336 (2022)
[i111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-05890
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-05890
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis with Mixed Emotions. CoRR abs/2208.05890 (2022)
[i110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-01768
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-01768
Jiadong Wang, Xinyuan Qian, Haizhou Li:
Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception. CoRR abs/2209.01768 (2022)
[i109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-10804
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-10804
Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li:
Controllable Accented Text-to-Speech Synthesis. CoRR abs/2209.10804 (2022)
[i108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-11433
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-11433
Qutang Cai, Guoqiang Hong, Zhijian Ye, Ximin Li, Haizhou Li:
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2209.11433 (2022)
[i107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-11910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-11910
Bin Wang, Chen Zhang, Chengwei Wei, Haizhou Li:
A Focused Study on Sequence Length for Dialogue Summarization. CoRR abs/2209.11910 (2022)
[i106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-04062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-04062
Chutong Meng, Junyi Ao, Tom Ko, Mingxuan Wang, Haizhou Li:
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning. CoRR abs/2210.04062 (2022)
[i105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-04532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-04532
Qu Yang, Jibin Wu, Malu Zhang, Yansong Chua, Xinchao Wang, Haizhou Li:
Training Spiking Neural Networks with Local Tandem Learning. CoRR abs/2210.04532 (2022)
[i104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-11777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11777
Bin Wang, Chen Zhang, Yan Zhang, Yiming Chen, Haizhou Li:
Analyzing and Evaluating Faithfulness in Dialogue Summarization. CoRR abs/2210.11777 (2022)
[i103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-13756
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13756
Kun Zhou, Berrak Sisman, Carlos Busso, Haizhou Li:
Mixed Emotion Modelling for Emotional Voice Conversion. CoRR abs/2210.13756 (2022)
[i102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-13832
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13832
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li:
FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation. CoRR abs/2210.13832 (2022)
[i101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15359
Haolin Zuo, Rui Liu, Jinming Zhao, Guanglai Gao, Haizhou Li:
Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities. CoRR abs/2210.15359 (2022)
[i100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15360
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15360
Yifan Hu, Rui Liu, Guanglai Gao, Haizhou Li:
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis. CoRR abs/2210.15360 (2022)
[i99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15364
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15364
Rui Liu, Haolin Zuo, De Hu, Guanglai Gao, Haizhou Li:
Explicit Intensity Control for Accented Text-to-speech. CoRR abs/2210.15364 (2022)
[i98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15385
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15385
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs. CoRR abs/2210.15385 (2022)
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15903
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15903
Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li:
Speaker recognition with two-step multi-modal deep cleansing. CoRR abs/2210.15903 (2022)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-16755
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16755
Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li:
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text. CoRR abs/2210.16755 (2022)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-16798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16798
Yiming Chen, Yan Zhang, Bin Wang, Zuozhu Liu, Haizhou Li:
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework. CoRR abs/2210.16798 (2022)
[i94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00109
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00109
Zexu Pan, Wupeng Wang, Marvin Borsdorf, Haizhou Li:
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting. CoRR abs/2211.00109 (2022)
[i93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01091
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01091
Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-10152
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-10152
Xiaoxue Gao, Xianghu Yue, Haizhou Li:
Self-Transriber: Few-shot Lyrics Transcription with Self-training. CoRR abs/2211.10152 (2022)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-11004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11004
Jiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li:
Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation. CoRR abs/2211.11004 (2022)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-08802
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08802
Bin Wang, Haizhou Li:
Relational Sentence Embedding for Flexible Semantic Matching. CoRR abs/2212.08802 (2022)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-08992
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08992
Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li:
PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment. CoRR abs/2212.08992 (2022)
2021
[j127]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/WuLZPLT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/WuLZPLT21
Jibin Wu, Qi Liu, Malu Zhang, Zihan Pan, Haizhou Li, Kay Chen Tan:
HuRAI: A brain-inspired computational model for human-robot auditory interface. Neurocomputing 465: 103-113 (2021)
[j126]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/LiuSLL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/LiuSLL21
Rui Liu, Berrak Sisman, Yixing Lin, Haizhou Li:
FastTalker: A neural text-to-speech architecture with shallow and group autoregression. Neural Networks 141: 306-314 (2021)
[j125]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/DuTXL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/DuTXL21
Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Factorized WaveNet for voice conversion with limited data. Speech Commun. 130: 45-54 (2021)
[j124]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/GunendradasanAE21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/GunendradasanAE21
Tharshini Gunendradasan, Eliathamby Ambikairajah, Julien Epps, Vidhyasaharan Sethu, Haizhou Li:
An adaptive transmission line cochlear model based front-end for replay attack detection. Speech Commun. 132: 114-122 (2021)
[j123]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/SharmaGVTL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/SharmaGVTL21
Bidisha Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li:
NHSS: A speech and singing parallel database. Speech Commun. 133: 9-22 (2021)
[j122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/QianLWL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/QianLWL21
Xinyuan Qian, Qi Liu, Jiadong Wang, Haizhou Li:
Three-Dimensional Speaker Localization: Audio-Refined Visual Scaling Factor Estimation. IEEE Signal Process. Lett. 28: 1405-1409 (2021)
[j121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/SismanYKL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SismanYKL21
Berrak Sisman, Junichi Yamagishi, Simon King, Haizhou Li:
An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 29: 132-157 (2021)
[j120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuSBYGL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuSBYGL21
Rui Liu, Berrak Sisman, Feilong Bao, Jichen Yang, Guanglai Gao, Haizhou Li:
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 274-285 (2021)
[j119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangZZL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangZZL21
Mingyang Zhang, Yi Zhou, Li Zhao, Haizhou Li:
Transfer Learning From Speech Synthesis to Voice Conversion With Non-Parallel Training Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1290-1302 (2021)
[j118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuSGL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuSGL21
Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li:
Expressive TTS Training With Frame and Style Reconstruction Loss. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1806-1818 (2021)
[j117]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangLDL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangLDL21
Chen Zhang, Grandee Lee, Luis Fernando D'Haro, Haizhou Li:
D-Score: Holistic Dialogue Evaluation Without Reference. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2502-2516 (2021)
[j116]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/PanZWWL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PanZWWL21
Zihan Pan, Malu Zhang, Jibin Wu, Jiadong Wang, Haizhou Li:
Multi-Tone Phase Coding of Interaural Time Difference for Sound Source Localization With Spiking Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2656-2670 (2021)
[j115]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/XuRWL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XuRWL21
Chenglin Xu, Wei Rao, Jibin Wu, Haizhou Li:
Target Speaker Verification With Selective Auditory Attention for Single and Multi-Talker Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2696-2709 (2021)
[j114]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhouTL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhouTL21
Yi Zhou, Xiaohai Tian, Haizhou Li:
Language Agnostic Speaker Embedding for Cross-Lingual Personalized Speech Generation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3427-3439 (2021)
[c626]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/0004HLB020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/0004HLB020
Yan Zhang, Ruidan He, Zuozhu Liu, Lidong Bing, Haizhou Li:
Bootstrapped Unsupervised Sentence Representation Learning. ACL/IJCNLP (1) 2021: 5168-5180
[c625]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZhangCDZFL020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangCDZFL020
Chen Zhang, Yiming Chen, Luis Fernando D'Haro, Yan Zhang, Thomas Friedrichs, Grandee Lee, Haizhou Li:
DynaEval: Unifying Turn and Dialogue Level Evaluation. ACL/IJCNLP (1) 2021: 5676-5689
[c624]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/LiGL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiGL21
Jinhu Li, Chitralekha Gupta, Haizhou Li:
Training Explainable Singing Quality Assessment Network with Augmented Data. APSIPA ASC 2021: 904-911
[c623]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/GuptaLL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/GuptaLL21
Chitralekha Gupta, Jinhu Li, Haizhou Li:
Towards Reference-Independent Rhythm Assessment of Solo Singing. APSIPA ASC 2021: 912-919
[c622]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MaLHL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MaLHL21
Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li:
PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction. ASRU 2021: 106-113
[c621]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SharmaMZL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SharmaMZL21
Bidisha Sharma, Maulik C. Madhavi, Xuehao Zhou, Haizhou Li:
Exploring Teacher-Student Learning Approach for Multi-Lingual Speech-to-Intent Classification. ASRU 2021: 419-426
[c620]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DuSZL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DuSZL21
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer. ASRU 2021: 594-601
[c619]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/NikonorovSZL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/NikonorovSZL21
Sergey Nikonorov, Berrak Sisman, Mingyang Zhang, Haizhou Li:
DEEPA: A Deep Neural Analyzer for Speech and Singing Vocoding. ASRU 2021: 618-625
[c618]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/BorsdorfLS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/BorsdorfLS21
Marvin Borsdorf, Haizhou Li, Tanja Schultz:
Target Language Extraction at Multilingual Cocktail Parties. ASRU 2021: 717-724
[c617]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0003ZZ0LS021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0003ZZ0LS021
Mingyang Zhang, Xuehao Zhou, Kun Zhou, Rui Liu, Perry Lam, Berrak Sisman, Haizhou Li:
SUTD-NUS System for Blizzard Challenge 2021. Blizzard Challenge 2021
[c616]
- view
  authority control:
- export record
  dblp key:
  - conf/embc/SuCLXL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/embc/SuCLXL21
Enze Su, Siqi Cai, Peiwen Li, Longhan Xie, Haizhou Li:
Auditory Attention Detection with EEG Channel Attention. EMBC 2021: 5804-5807
[c615]
- view
  authority control:
- export record
  dblp key:
  - conf/embc/CaiSSL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/embc/CaiSSL21
Siqi Cai, Pengcheng Sun, Tanja Schultz, Haizhou Li:
Low-Latency Auditory Spatial Attention Detection Based on Spectro-Spatial Features from EEG. EMBC 2021: 5812-5815
[c614]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ChenZZLC021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChenZZLC021
Yiming Chen, Yan Zhang, Chen Zhang, Grandee Lee, Ran Cheng, Haizhou Li:
Revisiting Self-training for Few-shot Learning of Language Model. EMNLP (1) 2021: 9125-9135
[c613]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HouXC021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HouXC021
Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. ICASSP 2021: 666-670
[c612]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouS0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouS0021
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset. ICASSP 2021: 920-924
[c611]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QianMPW021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QianMPW021
Xinyuan Qian, Maulik C. Madhavi, Zexu Pan, Jiadong Wang, Haizhou Li:
Multi-Target DoA Estimation with an Audio-Visual Fusion Mechanism. ICASSP 2021: 4280-4284
[c610]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0008S021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0008S021
Rui Liu, Berrak Sisman, Haizhou Li:
Graphspeech: Syntax-Aware Graph Attention Network for Neural Speech Synthesis. ICASSP 2021: 6059-6063
[c609]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GeXWCD021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GeXWCD021
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals. ICASSP 2021: 6109-6113
[c608]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuoWXDC021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoWXDC021
Lili Guo, Longbiao Wang, Chenglin Xu, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition. ICASSP 2021: 6304-6308
[c607]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DasY021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasY021
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Data Augmentation with Signal Companding for Detection of Logical Access Attacks. ICASSP 2021: 6349-6353
[c606]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PanTX021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PanTX021
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li:
Muse: Multi-Modal Target Speaker Extraction with Visual Cues. ICASSP 2021: 6678-6682
[c605]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SharmaM021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SharmaM021
Bidisha Sharma, Maulik C. Madhavi, Haizhou Li:
Leveraging Acoustic and Linguistic Embeddings from Pretrained Speech and Language Models for Intent Classification. ICASSP 2021: 7498-7502
[c604]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XieTLSXWLSLHBX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XieTLSXWLSLHBX21
Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu:
The Multi-Speaker Multi-Style Voice Cloning Challenge 2021. ICASSP 2021: 8613-8617
[c603]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ZhuangWLT0L21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhuangWLT0L21
Huiping Zhuang, Zhenyu Weng, Fulin Luo, Kar-Ann Toj, Haizhou Li, Zhiping Lin:
Accumulated Decoupled Learning with Gradient Staleness Mitigation for Convolutional Neural Networks. ICML 2021: 12935-12944
[c602]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/WangQPZ021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/WangQPZ021
Jiadong Wang, Xinyuan Qian, Zihan Pan, Malu Zhang, Haizhou Li:
GCC-PHAT with Speech-oriented Attention for Robotic Sound Source Localization. ICRA 2021: 5876-5883
[c601]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/YangWL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/YangWL21
Qu Yang, Jibin Wu, Haizhou Li:
Rethinking Benchmarks for Neuromorphic Learning Algorithms. IJCNN 2021: 1-8
[c600]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuL021
Hongning Zhu, Kong Aik Lee, Haizhou Li:
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding. Interspeech 2021: 106-110
[c599]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangSNL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangSNL021
Qiquan Zhang, Qi Song, Aaron Nicolson, Tian Lan, Haizhou Li:
Temporal Convolutional Network with Frequency Dimension Adaptive Attention for Speech Enhancement. Interspeech 2021: 166-170
[c598]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Yue021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Yue021
Xianghu Yue, Haizhou Li:
Phonetically Motivated Self-Supervised Speech Representation Learning. Interspeech 2021: 746-750
[c597]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouSL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouSL21
Kun Zhou, Berrak Sisman, Haizhou Li:
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training. Interspeech 2021: 811-815
[c596]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasM021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasM021
Rohan Kumar Das, Maulik C. Madhavi, Haizhou Li:
Diagnosis of COVID-19 Using Auditory Acoustic Cues. Interspeech 2021: 921-925
[c595]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangWLX021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangWLX021
Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li:
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification. Interspeech 2021: 1094-1098
[c594]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouTW021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouTW021
Yi Zhou, Xiaohai Tian, Zhizheng Wu, Haizhou Li:
Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation. Interspeech 2021: 1374-1378
[c593]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BorsdorfX0S21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BorsdorfX0S21
Marvin Borsdorf, Chenglin Xu, Haizhou Li, Tanja Schultz:
Universal Speaker Extraction in the Presence and Absence of Target Speakers for Speech of One and Two Talkers. Interspeech 2021: 1469-1473
[c592]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangXG021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangXG021
Wupeng Wang, Chenglin Xu, Meng Ge, Haizhou Li:
Neural Speaker Extraction with Speaker-Speech Cross-Attention Network. Interspeech 2021: 3535-3539
[c591]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BorsdorfX0S21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BorsdorfX0S21a
Marvin Borsdorf, Chenglin Xu, Haizhou Li, Tanja Schultz:
GlobalPhone Mix-To-Separate Out of 2: A Multilingual 2000 Speakers Mixtures Database for Speech Separation. Interspeech 2021: 3905-3909
[c590]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008S021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008S021
Rui Liu, Berrak Sisman, Haizhou Li:
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability. Interspeech 2021: 4648-4652
[c589]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangSM021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangSM021
Yidi Jiang, Bidisha Sharma, Maulik C. Madhavi, Haizhou Li:
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification. Interspeech 2021: 4713-4717
[c588]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/OuyangDY021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/OuyangDY021
Meidan Ouyang, Rohan Kumar Das, Jichen Yang, Haizhou Li:
Capsule Network based End-to-end System for Detection of Replay Attacks. ISCSLP 2021: 1-5
[c587]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/ZhangDCF021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/ZhangDCF021
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Thomas Friedrichs, Haizhou Li:
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation. IWSDS 2021: 291-306
[c586]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoPDQS021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoPDQS021
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. ACM Multimedia 2021: 3927-3935
[c585]
- view
  authority control:
- export record
  dblp key:
  - conf/ococosda/QianSA021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ococosda/QianSA021
Xinyuan Qian, Bidisha Sharma, Amine El Abridi, Haizhou Li:
SLoClas: A Database for Joint Sound Localization and Classification. O-COCOSDA 2021: 128-133
[c584]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/sigdial/LiLYGSCVDWL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigdial/LiLYGSCVDWL21
Haizhou Li, Gina-Anne Levow, Zhou Yu, Chitralekha Gupta, Berrak Sisman, Siqi Cai, David Vandyke, Nina Dethlefs, Yan Wu, Junyi Jessy Li:
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue. SIGDIAL 2021
[c583]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ZhouS021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhouS021
Kun Zhou, Berrak Sisman, Haizhou Li:
Vaw-Gan For Disentanglement And Recomposition Of Emotional Elements In Speech. SLT 2021: 415-422
[c582]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/DuTX021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/DuTX021
Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Optimizing Voice Conversion Network with Cycle Consistency Loss of Speaker Identity. SLT 2021: 507-513
[e20]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/2021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/2021
Deyi Xiong, Ridong Jiang, Yanfeng Lu, Minghui Dong, Haizhou Li:
International Conference on Asian Language Processing, IALP 2021, Singapore, December 11-13, 2021. IEEE 2021, ISBN 978-1-6654-8311-7 [contents]
[e19]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/2019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/2019
Erik Marchi, Sabato Marco Siniscalchi, Sandro Cumani, Valerio Mario Salerno, Haizhou Li:
Increasing Naturalness and Flexibility in Spoken Dialogue Interaction - 10th International Workshop on Spoken Dialogue Systems, IWSDS 2019, Syracuse, Sicily, Italy, 24-26 April 2019. Lecture Notes in Electrical Engineering 714, Springer 2021, ISBN 978-981-15-9322-2 [contents]
[e18]
- view
- export record
  dblp key:
  - conf/sigdial/2021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigdial/2021
Haizhou Li, Gina-Anne Levow, Zhou Yu, Chitralekha Gupta, Berrak Sisman, Siqi Cai, David Vandyke, Nina Dethlefs, Yan Wu, Junyi Jessy Li:
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGdial 2021, Singapore and Online, July 29-31, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-81-7 [contents]
[e17]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/2021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/2021
Haizhou Li, Shuzhi Sam Ge, Yan Wu, Agnieszka Wykowska, Hongsheng He, Xiaorui Liu, Dongyu Li, Jairo Pérez-Osorio:
Social Robotics - 13th International Conference, ICSR 2021, Singapore, November 10-13, 2021, Proceedings. Lecture Notes in Computer Science 13086, Springer 2021, ISBN 978-3-030-90524-8 [contents]
[i88]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-07370
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-07370
Bidisha Sharma, Maulik C. Madhavi, Haizhou Li:
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification. CoRR abs/2102.07370 (2021)
[i87]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-03621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-03621
Siqi Cai, Pengcheng Sun, Tanja Schultz, Haizhou Li:
Low-latency auditory spatial attention detection based on spectro-spatial features from EEG. CoRR abs/2103.03621 (2021)
[i86]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-16269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-16269
Chenglin Xu, Wei Rao, Jibin Wu, Haizhou Li:
Target Speaker Verification with Selective Auditory Attention for Single and Multi-talker Speech. CoRR abs/2103.16269 (2021)
[i85]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-16809
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-16809
Kun Zhou, Berrak Sisman, Haizhou Li:
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training. CoRR abs/2103.16809 (2021)
[i84]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-01408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-01408
Rui Liu, Berrak Sisman, Haizhou Li:
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability. CoRR abs/2104.01408 (2021)
[i83]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-06107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-06107
Xinyuan Qian, Maulik C. Madhavi, Zexu Pan, Jiadong Wang, Haizhou Li:
Multi-target DoA Estimation with an Audio-visual Fusion Mechanism. CoRR abs/2105.06107 (2021)
[i82]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-14762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-14762
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Emotional Voice Conversion: Theory, Databases and ESD. CoRR abs/2105.14762 (2021)
[i81]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-01112
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-01112
Chen Zhang, Yiming Chen, Luis Fernando D'Haro, Yan Zhang, Thomas Friedrichs, Grandee Lee, Haizhou Li:
DynaEval: Unifying Turn and Dialogue Level Evaluation. CoRR abs/2106.01112 (2021)
[i80]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-09320
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-09320
Li Zhang, Qing Wang, Kong Aik Lee, Lei Xie, Haizhou Li:
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification. CoRR abs/2106.09320 (2021)
[i79]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-03748
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-03748
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer. CoRR abs/2107.03748 (2021)
[i78]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-06493
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-06493
Hongning Zhu, Kong Aik Lee, Haizhou Li:
Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding. CoRR abs/2107.06493 (2021)
[i77]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-06592
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-06592
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. CoRR abs/2107.06592 (2021)
[i76]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-02539
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-02539
Xinyuan Qian, Bidisha Sharma, Amine El Abridi, Haizhou Li:
SLoClas: A Database for Joint Sound Localization and Classification. CoRR abs/2108.02539 (2021)
[i75]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-02598
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-02598
Yidi Jiang, Bidisha Sharma, Maulik C. Madhavi, Haizhou Li:
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification. CoRR abs/2108.02598 (2021)
[i74]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-13486
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-13486
Bidisha Sharma, Maulik C. Madhavi, Xuehao Zhou, Haizhou Li:
Exploring Teacher-Student Learning Approach for Multi-lingual Speech-to-Intent Classification. CoRR abs/2109.13486 (2021)
[i73]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-14831
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-14831
Zexu Pan, Meng Ge, Haizhou Li:
USEV: Universal Speaker Extraction with Visual Cue. CoRR abs/2109.14831 (2021)
[i72]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-00940
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-00940
Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li:
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction. CoRR abs/2110.00940 (2021)
[i71]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-01256
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-01256
Yiming Chen, Yan Zhang, Chen Zhang, Grandee Lee, Ran Cheng, Haizhou Li:
Revisiting Self-Training for Few-Shot Learning of Language Model. CoRR abs/2110.01256 (2021)
[i70]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-01895
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-01895
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Thomas Friedrichs, Haizhou Li:
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation. CoRR abs/2110.01895 (2021)
[i69]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03156
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03156
Rui Liu, Berrak Sisman, Haizhou Li:
StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis. CoRR abs/2110.03156 (2021)
[i68]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03342
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03342
Junchen Lu, Berrak Sisman, Rui Liu, Mingyang Zhang, Haizhou Li:
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. CoRR abs/2110.03342 (2021)
[i67]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06434
Sergey Nikonorov, Berrak Sisman, Mingyang Zhang, Haizhou Li:
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding. CoRR abs/2110.06434 (2021)
[i66]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-07058
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07058
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran K. Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3, 000 Hours of Egocentric Video. CoRR abs/2110.07058 (2021)
[i65]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10326
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity. CoRR abs/2110.10326 (2021)
[i64]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-00865
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-00865
Jinming Zhao, Ruichen Li, Qin Jin, Xinchao Wang, Haizhou Li:
MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition. CoRR abs/2111.00865 (2021)
[i63]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-07518
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-07518
Qiquan Zhang, Qi Song, Zhaoheng Ni, Aaron Nicolson, Haizhou Li:
Time-Frequency Attention for Monaural Speech Enhancement. CoRR abs/2111.07518 (2021)
[i62]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-07194
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-07194
Chen Zhang, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li:
MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation. CoRR abs/2112.07194 (2021)
2020
[j113]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/LeeSLR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/LeeSLR20
Kong Aik Lee, Seyed Omid Sadjadi, Haizhou Li, Douglas A. Reynolds:
Two decades into Speaker Recognition Evaluation - are we there yet? Comput. Speech Lang. 61: 101058 (2020)
[j112]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/ZhangWBPXCLQL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/ZhangWBPXCLQL20
Malu Zhang, Jibin Wu, Ammar Belatreche, Zihan Pan, Xiurui Xie, Yansong Chua, Guoqi Li, Hong Qu, Haizhou Li:
Supervised learning in spiking neural networks with synaptic delay-weight plasticity. Neurocomputing 409: 103-118 (2020)
[j111]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ZhangLCWBPQL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ZhangLCWBPQL20
Malu Zhang, Xiaoling Luo, Yi Chen, Jibin Wu, Ammar Belatreche, Zihan Pan, Hong Qu, Haizhou Li:
An Efficient Threshold-Driven Aggregate-Label Learning Algorithm for Multimodal Information Processing. IEEE J. Sel. Top. Signal Process. 14(3): 592-602 (2020)
[j110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/ZhangSZL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ZhangSZL20
Mingyang Zhang, Berrak Sisman, Li Zhao, Haizhou Li:
DeepConversion: Voice conversion with limited parallel training data. Speech Commun. 122: 31-43 (2020)
[j109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/ZhouTL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhouTL20
Yi Zhou, Xiaohai Tian, Haizhou Li:
Multi-Task WaveRNN With an Integrated Architecture for Cross-Lingual Voice Conversion. IEEE Signal Process. Lett. 27: 1310-1314 (2020)
[j108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/LiuSBGL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/LiuSBGL20
Rui Liu, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li:
Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS. IEEE Signal Process. Lett. 27: 1470-1474 (2020)
[j107]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GuptaLW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuptaLW20
Chitralekha Gupta, Haizhou Li, Ye Wang:
Automatic Leaderboard: Evaluation of Singing Quality Without a Standard Reference. IEEE ACM Trans. Audio Speech Lang. Process. 28: 13-26 (2020)
[j106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/XuRCL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XuRCL20
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1370-1384 (2020)
[j105]
- view
  authority control:
- export record
  dblp key:
  - journals/tifs/YangDL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tifs/YangDL20
Jichen Yang, Rohan Kumar Das, Haizhou Li:
Significance of Subband Features for Synthetic Speech Detection. IEEE Trans. Inf. Forensics Secur. 15: 2160-2170 (2020)
[c581]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LeeL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LeeL20
Grandee Lee, Haizhou Li:
Modeling Code-Switch Languages Using Bilingual Parallel Corpus. ACL 2020: 860-870
[c580]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/HuangG020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HuangG020
Lin Huang, Chitralekha Gupta, Haizhou Li:
Spectral Features and Pitch Histogram for Automatic Singing Quality Evaluation with CRNN. APSIPA 2020: 492-499
[c579]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/DuZS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DuZS020
Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li:
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN. APSIPA 2020: 507-513
[c578]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/LuZS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LuZS020
Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data. APSIPA 2020: 514-519
[c577]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/DasTYRY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DasTYRY020
Rohan Kumar Das, Ruijie Tao, Jichen Yang, Wei Rao, Cheng Yu, Haizhou Li:
HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation. APSIPA 2020: 605-609
[c576]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/Das020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Das020
Rohan Kumar Das, Haizhou Li:
Classification of Speech with and without Face Mask using Acoustic Features. APSIPA 2020: 747-752
[c575]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0020TZ0LLS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0020TZ0LLS020
Yi Zhou, Xiaohai Tian, Xuehao Zhou, Mingyang Zhang, Grandee Lee, Riu Liu, Berrak Sisman, Haizhou Li:
NUS-HLT System for Blizzard Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020
[c574]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/Tian0YZD00ZS0020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/Tian0YZD00ZS0020
Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li:
The NUS & NWPU system for Voice Conversion Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020
[c573]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/LinMD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/LinMD020
Wanqiu Lin, Maulik C. Madhavi, Rohan Kumar Das, Haizhou Li:
Transformer-based Arabic Dialect Identification. IALP 2020: 192-196
[c572]
- view
  authority control:
- export record
  dblp key:
  - conf/icarcv/WengZLL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icarcv/WengZLL20
Zhenyu Weng, Yuesheng Zhu, Zhiping Lin, Haizhou Li:
Real-Time Multiple Object Tracking with Discriminative Features. ICARCV 2020: 309-314
[c571]
- view
  authority control:
- export record
  dblp key:
  - conf/icarcv/PengZHLL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icarcv/PengZHLL20
Xinggan Peng, Huiping Zhuang, Guang-Bin Huang, Haizhou Li, Zhiping Lin:
Robust Real-time Face Tracking for People Wearing Face Masks. ICARCV 2020: 779-783
[c570]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuptaY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuptaY020
Chitralekha Gupta, Emre Yilmaz, Haizhou Li:
Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background Music Help? ICASSP 2020: 496-500
[c569]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HaoXHXC020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HaoXHXC020
Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li:
Time-Domain Neural Network Approach for Speech Bandwidth Extension. ICASSP 2020: 866-870
[c568]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0008SLBG020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0008SLBG020
Rui Liu, Berrak Sisman, Jingdong Li, Feilong Bao, Guanglai Gao, Haizhou Li:
Teacher-Student Training For Robust Tacotron-Based TTS. ICASSP 2020: 6274-6278
[c567]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DasY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasY020
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Assessing the Scope of Generalized Countermeasures for Anti-Spoofing. ICASSP 2020: 6589-6593
[c566]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PhamXKZCNM020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PhamXKZCNM020
Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063
[c565]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Das020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Das020
Rohan Kumar Das, Haizhou Li:
On the Importance of Vocal Tract Constriction for Speaker Characterization: The Whispered Speech Study. ICASSP 2020: 7119-7123
[c564]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouTLD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouTLD020
Xuehao Zhou, Xiaohai Tian, Grandee Lee, Rohan Kumar Das, Haizhou Li:
End-to-End Code-Switching TTS with Cross-Lingual Language Model. ICASSP 2020: 7614-7618
[c563]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DuT0020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DuT0020
Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Effective Wavenet Adaptation for Voice Conversion with Limited Data. ICASSP 2020: 7779-7783
[c562]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PanLY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PanLY020
Zexu Pan, Zhaojie Luo, Jichen Yang, Haizhou Li:
Multi-Modal Attention for Speech Emotion Recognition. INTERSPEECH 2020: 364-368
[c561]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouYLL020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouYLL020
Xinyuan Zhou, Emre Yilmaz, Yanhua Long, Yijie Li, Haizhou Li:
Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition. INTERSPEECH 2020: 1042-1046
[c560]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuDY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuDY020
Zhenzong Wu, Rohan Kumar Das, Jichen Yang, Haizhou Li:
Light Convolutional Neural Network with Feature Genuinization for Detection of Synthetic Speech Attacks. INTERSPEECH 2020: 1101-1105
[c559]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GeXWCD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GeXWCD020
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. INTERSPEECH 2020: 1406-1410
[c558]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TaoD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TaoD020
Ruijie Tao, Rohan Kumar Das, Haizhou Li:
Audio-Visual Speaker Recognition with a Cross-Modal Discriminative Network. INTERSPEECH 2020: 2242-2246
[c557]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YilmazGWCM020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YilmazGWCM020
Emre Yilmaz, Özgür Bora Gevrek, Jibin Wu, Yuxiang Chen, Xuanbo Meng, Haizhou Li:
Deep Convolutional Spiking Neural Networks for Keyword Spotting. INTERSPEECH 2020: 2557-2561
[c556]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CaiSSX020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiSSX020
Siqi Cai, Enze Su, Yonghao Song, Longhan Xie, Haizhou Li:
Low Latency Auditory Attention Detection with Common Spatial Pattern Analysis of EEG Signals. INTERSPEECH 2020: 2772-2776
[c555]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouS0020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouS0020
Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. INTERSPEECH 2020: 3416-3420
[c554]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QinLBRDN020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QinLBRDN020
Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. INTERSPEECH 2020: 3456-3460
[c553]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HouXPZC020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HouXPZC020
Nana Hou, Chenglin Xu, Van Tung Pham, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network. INTERSPEECH 2020: 4064-4068
[c552]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HouXZC020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HouXZC020
Nana Hou, Chenglin Xu, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Multi-Task Learning for End-to-End Noise-Robust Bandwidth Extension. INTERSPEECH 2020: 4069-4073
[c551]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasTK020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasTK020
Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. INTERSPEECH 2020: 4213-4217
[c550]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004DMS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004DMS020
Tianchi Liu, Rohan Kumar Das, Maulik C. Madhavi, Shengmei Shen, Haizhou Li:
Speaker-Utterance Dual Attention for Speaker and Utterance Verification. INTERSPEECH 2020: 4293-4297
[c549]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouLYLL020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouLYLL020
Xinyuan Zhou, Grandee Lee, Emre Yilmaz, Yanhua Long, Jiaen Liang, Haizhou Li:
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR. INTERSPEECH 2020: 5016-5020
[c548]
- view
  - electronic edition @ ismir.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/GuptaH020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/GuptaH020
Chitralekha Gupta, Lin Huang, Haizhou Li:
Automatic Rank-Ordering of Singing Vocals with Twin-Neural Network. ISMIR 2020: 416-423
[c547]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/ZhangDBFL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/ZhangDBFL20
Chen Zhang, Luis Fernando D'Haro, Rafael E. Banchs, Thomas Friedrichs, Haizhou Li:
Deep AM-FM: Toolkit for Automatic Dialogue Evaluation. IWSDS 2020: 53-69
[c546]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/TianD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/TianD020
Xiaohai Tian, Rohan Kumar Das, Haizhou Li:
Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion. Odyssey 2020: 159-164
[c545]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/ZhouS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/ZhouS020
Kun Zhou, Berrak Sisman, Haizhou Li:
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data. Odyssey 2020: 230-237
[c544]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/Sisman020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/Sisman020
Berrak Sisman, Haizhou Li:
Generative Adversarial Networks for Singing Voice Conversion with and without Parallel Data. Odyssey 2020: 238-244
[c543]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/0008SBG020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/0008SBG020
Rui Liu, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li:
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss. Odyssey 2020: 245-251
[c542]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/GaoTZD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/GaoTZD020
Xiaoxue Gao, Xiaohai Tian, Yi Zhou, Rohan Kumar Das, Haizhou Li:
Personalized Singing Voice Generation Using WaveRNN. Odyssey 2020: 252-258
[i61]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00198
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00198
Kun Zhou, Berrak Sisman, Haizhou Li:
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data. CoRR abs/2002.00198 (2020)
[i60]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00387
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00387
Xiaoyi Qin, Ming Li, Hui Bu, Rohan Kumar Das, Wei Rao, Shrikanth Narayanan, Haizhou Li:
The FFSVC 2020 Evaluation Plan. CoRR abs/2002.00387 (2020)
[i59]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00417
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00417
Rui Liu, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li:
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss. CoRR abs/2002.00417 (2020)
[i58]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-11837
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-11837
Malu Zhang, Jiadong Wang, Zhixuan Zhang, Ammar Belatreche, Jibin Wu, Yansong Chua, Hong Qu, Haizhou Li:
Spike-Timing-Dependent Back Propagation in Deep Spiking Neural Networks. CoRR abs/2003.11837 (2020)
[i57]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-08326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-08326
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. CoRR abs/2004.08326 (2020)
[i56]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-08849
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-08849
Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. CoRR abs/2004.08849 (2020)
[i55]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-14762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-14762
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-domain speaker extraction network. CoRR abs/2004.14762 (2020)
[i54]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-04686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-04686
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. CoRR abs/2005.04686 (2020)
[i53]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-07025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-07025
Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. CoRR abs/2005.07025 (2020)
[i52]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08046
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08046
Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. CoRR abs/2005.08046 (2020)
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-09982
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-09982
Srivatsa P, Kyle Timothy Ng Chu, Yaswanth Tavva, Jibin Wu, Malu Zhang, Haizhou Li, Trevor E. Carlson:
You Only Spike Once: Improving Energy-Efficient Neuromorphic Inference to ANN-Level Accuracy. CoRR abs/2006.09982 (2020)
[i50]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-10407
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-10407
Xinyuan Zhou, Grandee Lee, Emre Yilmaz, Yanhua Long, Jiaen Liang, Haizhou Li:
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-based LVCSR. CoRR abs/2006.10407 (2020)
[i49]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-10414
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-10414
Xinyuan Zhou, Emre Yilmaz, Yanhua Long, Yijie Li, Haizhou Li:
Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition. CoRR abs/2006.10414 (2020)
[i48]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-01204
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-01204
Jibin Wu, Chenglin Xu, Daquan Zhou, Haizhou Li, Kay Chen Tan:
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks. CoRR abs/2007.01204 (2020)
[i47]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-03274
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-03274
Zihan Pan, Malu Zhang, Jibin Wu, Haizhou Li:
Multi-Tones' Phase Coding (MTPC) of Interaural Time Difference by Spiking Neural Network. CoRR abs/2007.03274 (2020)
[i46]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-01490
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-01490
Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li:
Expressive TTS Training with Frame and Style Reconstruction Loss. CoRR abs/2008.01490 (2020)
[i45]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-03648
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-03648
Berrak Sisman, Junichi Yamagishi, Simon King, Haizhou Li:
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning. CoRR abs/2008.03648 (2020)
[i44]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-03992
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-03992
Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data. CoRR abs/2008.03992 (2020)
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-04562
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-04562
Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li:
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN. CoRR abs/2008.04562 (2020)
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-05284
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-05284
Rui Liu, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li:
Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS. CoRR abs/2008.05284 (2020)
[i41]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-08901
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-08901
Tianchi Liu, Rohan Kumar Das, Maulik C. Madhavi, Shengmei Shen, Haizhou Li:
Speaker-Utterance Dual Attention for Speaker and Utterance Verification. CoRR abs/2008.08901 (2020)
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-04107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-04107
Zexu Pan, Zhaojie Luo, Jichen Yang, Haizhou Li:
Multi-modal Attention for Speech Emotion Recognition. CoRR abs/2009.04107 (2020)
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-14399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-14399
Mingyang Zhang, Yi Zhou, Li Zhao, Haizhou Li:
Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data. CoRR abs/2009.14399 (2020)
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-03905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03905
Rohan Kumar Das, Ruijie Tao, Jichen Yang, Wei Rao, Cheng Yu, Haizhou Li:
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation. CoRR abs/2010.03905 (2020)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-03907
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03907
Rohan Kumar Das, Haizhou Li:
Classification of Speech with and without Face Mask using Acoustic Features. CoRR abs/2010.03907 (2020)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-07775
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-07775
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li:
Muse: Multi-modal target speaker extraction with visual cues. CoRR abs/2010.07775 (2020)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12423
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12423
Rui Liu, Berrak Sisman, Haizhou Li:
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis. CoRR abs/2010.12423 (2020)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-14794
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14794
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset. CoRR abs/2010.14794 (2020)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02314
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02314
Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech. CoRR abs/2011.02314 (2020)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08548
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08548
Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Optimizing voice conversion network with cycle consistency loss of speaker identity. CoRR abs/2011.08548 (2020)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-09624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-09624
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals. CoRR abs/2011.09624 (2020)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-00337
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-00337
Bidisha Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li:
NHSS: A Speech and Singing Parallel Database. CoRR abs/2012.00337 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j104]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/DHaroBHL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/DHaroBHL19
Luis Fernando D'Haro, Rafael E. Banchs, Chiori Hori, Haizhou Li:
Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics. Comput. Speech Lang. 55: 200-215 (2019)
[j103]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/VijayanLT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/VijayanLT19
Karthika Vijayan, Haizhou Li, Tomoki Toda:
Speech-to-Singing Voice Conversion: The Challenges and Strategies for Improving Vocal Conversion Processes. IEEE Signal Process. Mag. 36(1): 95-102 (2019)
[j102]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SismanZL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SismanZL19
Berrak Sisman, Mingyang Zhang, Haizhou Li:
Group Sparse Representation With WaveNet Vocoder Adaptation for Spectrum and Prosody Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 1085-1097 (2019)
[j101]
- view
  authority control:
- export record
  dblp key:
  - journals/tcyb/00050T19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcyb/00050T19
Qiang Yu, Haizhou Li, Kay Chen Tan:
Spike Timing or Rate? Neurons Learn to Make Decisions for Both Through Threshold-Driven Plasticity. IEEE Trans. Cybern. 49(6): 2178-2189 (2019)
[j100]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/0003T0H19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/0003T0H19
Chong Zhang, Kay Chen Tan, Haizhou Li, Geok Soon Hong:
A Cost-Sensitive Deep Belief Network for Imbalanced Classification. IEEE Trans. Neural Networks Learn. Syst. 30(1): 109-122 (2019)
[c541]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangWCLPL019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangWCLPL019
Malu Zhang, Jibin Wu, Yansong Chua, Xiaoling Luo, Zihan Pan, Dan Liu, Haizhou Li:
MPD-AL: An Efficient Membrane Potential Driven Aggregate-Label Learning Algorithm for Spiking Neurons. AAAI 2019: 1327-1334
[c540]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/SismanVD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/SismanVD019
Berrak Sisman, Karthika Vijayan, Minghui Dong, Haizhou Li:
SINGAN: Singing Voice Conversion with Generative Adversarial Networks. APSIPA 2019: 112-118
[c539]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/GaoTDZ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/GaoTDZ019
Xiaoxue Gao, Xiaohai Tian, Rohan Kumar Das, Yi Zhou, Haizhou Li:
Speaker-independent Spectral Mapping for Speech-to-Singing Conversion. APSIPA 2019: 159-164
[c538]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/HouXC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HouXC019
Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Domain Adversarial Training for Speech Enhancement. APSIPA 2019: 667-672
[c537]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/LiuD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiuD019
Yitong Liu, Rohan Kumar Das, Haizhou Li:
Multi-band Spectral Entropy Information for Detection of Replay Attacks. APSIPA 2019: 838-843
[c536]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/VijayanM019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/VijayanM019
Karthika Vijayan, K. Sri Rama Murty, Haizhou Li:
Allpass Modeling of Phase Spectrum of Speech Signals for Formant Tracking. APSIPA 2019: 1190-1196
[c535]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhouTD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhouTD019
Yi Zhou, Xiaohai Tian, Rohan Kumar Das, Haizhou Li:
Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network. APSIPA 2019: 1282-1287
[c534]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DasY019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DasY019
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech. APSIPA 2019: 1630-1635
[c533]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SismanZDL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SismanZDL19
Berrak Sisman, Mingyang Zhang, Minghui Dong, Haizhou Li:
On the Study of Generative Adversarial Networks for Cross-Lingual Voice Conversion. ASRU 2019: 144-151
[c532]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DuTXL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DuTXL19
Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
WaveNet Factorization with Singular Value Decomposition for Voice Conversion. ASRU 2019: 152-159
[c531]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhouTYDL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhouTYDL19
Yi Zhou, Xiaohai Tian, Emre Yilmaz, Rohan Kumar Das, Haizhou Li:
A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion. ASRU 2019: 160-167
[c530]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/XuRCL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/XuRCL19
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-Domain Speaker Extraction Network. ASRU 2019: 327-334
[c529]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/YueLYDL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/YueLYDL19
Xianghu Yue, Grandee Lee, Emre Yilmaz, Fang Deng, Haizhou Li:
End-to-End Code-Switching ASR for Low-Resourced Language Pairs. ASRU 2019: 972-979
[c528]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DasYL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DasYL19
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Long Range Acoustic and Deep Features Perspective on ASVspoof 2019. ASRU 2019: 1018-1025
[c527]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SharmaG0W19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SharmaG0W19
Bidisha Sharma, Chitralekha Gupta, Haizhou Li, Ye Wang:
Automatic Lyrics-to-audio Alignment on Polyphonic Music Using Singing-adapted Acoustic Models. ICASSP 2019: 396-400
[c526]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WickramasingheA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WickramasingheA19
Buddhi Wickramasinghe, Eliathamby Ambikairajah, Julien Epps, Vidhyasaharan Sethu, Haizhou Li:
Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection. ICASSP 2019: 6011-6015
[c525]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Lee019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Lee019
Grandee Lee, Haizhou Li:
Word and Class Common Space Embedding for Code-switch Language Modelling. ICASSP 2019: 6086-6090
[c524]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouTXD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouTXD019
Yi Zhou, Xiaohai Tian, Haihua Xu, Rohan Kumar Das, Haizhou Li:
Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling. ICASSP 2019: 6790-6794
[c523]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuRC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuRC019
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. ICASSP 2019: 6990-6994
[c522]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/PanWZ0C19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/PanWZ0C19
Zihan Pan, Jibin Wu, Malu Zhang, Haizhou Li, Yansong Chua:
Neural Population Coding for Effective Temporal Classification. IJCNN 2019: 1-8
[c521]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/WuCZYL019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/WuCZYL019
Jibin Wu, Yansong Chua, Malu Zhang, Qu Yang, Guoqi Li, Haizhou Li:
Deep Spiking Neural Network with Spike Count based Learning Rule. IJCNN 2019: 1-6
[c520]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/WuZ0C19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/WuZ0C19
Jibin Wu, Malu Zhang, Haizhou Li, Yansong Chua:
Competitive STDP-based Feature Representation Learning for Sound Event Classification. IJCNN 2019: 1-8
[c519]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianC019
Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data. INTERSPEECH 2019: 201-205
[c518]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YilmazDZHB0L19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YilmazDZHB0L19
Emre Yilmaz, Adem Derinel, Kun Zhou, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen:
Large-Scale Speaker Diarization of Radio Broadcast Archives. INTERSPEECH 2019: 411-415
[c517]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Sharma019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Sharma019
Bidisha Sharma, Haizhou Li:
A Combination of Model-Based and Feature-Based Strategy for Speech-to-Singing Alignment. INTERSPEECH 2019: 624-628
[c516]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasY019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasY019
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Long Range Acoustic Features for Spoofed Speech Detection. INTERSPEECH 2019: 1058-1062
[c515]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TjandraS0S0019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TjandraS0S0019
Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019. INTERSPEECH 2019: 1118-1122
[c514]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RaoXC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RaoXC019
Wei Rao, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Target Speaker Extraction for Multi-Talker Speaker Verification. INTERSPEECH 2019: 1273-1277
[c513]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/00030F0Y19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/00030F0Y19
Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi:
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet. INTERSPEECH 2019: 1298-1302
[c512]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeHKYOV0DSLD0R19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeHKYOV0DSLD0R19
Kong Aik Lee, Ville Hautamäki, Tomi H. Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Massimiliano Todisco:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. INTERSPEECH 2019: 1497-1501
[c511]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SharmaD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SharmaD019
Bidisha Sharma, Rohan Kumar Das, Haizhou Li:
Multi-Level Adaptive Speech Activity Detector for Speech in Naturalistic Environments. INTERSPEECH 2019: 2015-2019
[c510]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SharmaD019a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SharmaD019a
Bidisha Sharma, Rohan Kumar Das, Haizhou Li:
On the Importance of Audio-Source Separation for Singer Identification in Polyphonic Music. INTERSPEECH 2019: 2020-2024
[c509]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuptaY019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuptaY019
Chitralekha Gupta, Emre Yilmaz, Haizhou Li:
Acoustic Modeling for Automatic Lyrics-to-Audio Alignment. INTERSPEECH 2019: 2040-2044
[c508]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZengKPXC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZengKPXC019
Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition. INTERSPEECH 2019: 2165-2169
[c507]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/GuptaVSG019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuptaVSG019
Chitralekha Gupta, Karthika Vijayan, Bidisha Sharma, Xiaoxue Gao, Haizhou Li:
NUS Speak-to-Sing: A Web Platform for Personalized Speech-to-Singing Conversion. INTERSPEECH 2019: 2376-2377
[c506]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Das019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Das019
Rohan Kumar Das, Haizhou Li:
Instantaneous Phase and Long-Term Acoustic Cues for Orca Activity Detection. INTERSPEECH 2019: 2418-2422
[c505]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GunendradasanAE19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GunendradasanAE19
Tharshini Gunendradasan, Eliathamby Ambikairajah, Julien Epps, Haizhou Li:
An Adaptive-Q Cochlear Model for Replay Spoofing Detection. INTERSPEECH 2019: 2918-2922
[c504]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/WuPZDC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuPZDC019
Jibin Wu, Zihan Pan, Malu Zhang, Rohan Kumar Das, Yansong Chua, Haizhou Li:
Robust Sound Recognition: A Neuromorphic Approach. INTERSPEECH 2019: 3667-3668
[c503]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeY019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeY019
Grandee Lee, Xianghu Yue, Haizhou Li:
Linguistically Motivated Parallel Data Augmentation for Code-Switch Language Modeling. INTERSPEECH 2019: 3730-3734
[c502]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangYD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangYD019
Qinyi Wang, Emre Yilmaz, Adem Derinel, Haizhou Li:
Code-Switching Detection Using ASR-Generated Language Posteriors. INTERSPEECH 2019: 3740-3744
[c501]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YilmazCYL019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YilmazCYL019
Emre Yilmaz, Samuel Cohen, Xianghu Yue, David A. van Leeuwen, Haizhou Li:
Multi-Graph Decoding for Code-Switching ASR. INTERSPEECH 2019: 3750-3754
[c500]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuMD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuMD019
Tianchi Liu, Maulik C. Madhavi, Rohan Kumar Das, Haizhou Li:
A Unified Framework for Speaker and Utterance Verification. INTERSPEECH 2019: 4320-4324
[c499]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/MadhaviZ0Y19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/MadhaviZ0Y19
Maulik C. Madhavi, Tong Zhan, Haizhou Li, Min Yuan:
First Leap Towards Development of Dialogue System for Autonomous Bus. IWSDS 2019: 393-400
[c498]
- view
  authority control:
- export record
  dblp key:
  - conf/ococosda/000119a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ococosda/000119a
Haizhou Li:
Country Report - Singapore. O-COCOSDA 2019: 1-6
[c497]
- view
  authority control:
- export record
  dblp key:
  - conf/ococosda/SheelvantSMDP019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ococosda/SheelvantSMDP019
Rohan Sheelvant, Bidisha Sharma, Maulik C. Madhavi, Rohan Kumar Das, S. R. M. Prasanna, Haizhou Li:
RSL2019: A Realistic Speech Localization Corpus. O-COCOSDA 2019: 1-6
[e16]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/2018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/2018
Luis Fernando D'Haro, Rafael E. Banchs, Haizhou Li:
9th International Workshop on Spoken Dialogue System Technology, IWSDS 2018, Singapore, April 18-20, 2018. Lecture Notes in Electrical Engineering 579, Springer 2019, ISBN 978-981-13-9442-3 [contents]
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-02546
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-02546
Wei Rao, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification. CoRR abs/1902.02546 (2019)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-03705
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-03705
Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data. CoRR abs/1902.03705 (2019)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-05705
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-05705
Jibin Wu, Yansong Chua, Malu Zhang, Qu Yang, Guoqi Li, Haizhou Li:
Deep Spiking Neural Network with Spike Count based Learning Rule. CoRR abs/1902.05705 (2019)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-09952
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-09952
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. CoRR abs/1903.09952 (2019)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-12389
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-12389
Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi:
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet. CoRR abs/1903.12389 (2019)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-07386
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-07386
Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-11449
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-11449
Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019. CoRR abs/1905.11449 (2019)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-07523
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-07523
Emre Yilmaz, Samuel Cohen, Xianghu Yue, David A. van Leeuwen, Haizhou Li:
Multi-Graph Decoding for Code-Switching ASR. CoRR abs/1906.07523 (2019)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-07955
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-07955
Emre Yilmaz, Adem Derinel, Kun Zhou, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen:
Large-Scale Speaker Diarization of Radio Broadcast Archives. CoRR abs/1906.07955 (2019)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-08003
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-08003
Qinyi Wang, Emre Yilmaz, Adem Derinel, Haizhou Li:
Code-Switching Detection Using ASR-Generated Language Posteriors. CoRR abs/1906.08003 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-10369
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-10369
Chitralekha Gupta, Emre Yilmaz, Haizhou Li:
Acoustic Modeling for Automatic Lyrics-to-Audio Alignment. CoRR abs/1906.10369 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-01167
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-01167
Jibin Wu, Yansong Chua, Malu Zhang, Guoqi Li, Haizhou Li, Kay Chen Tan:
A Hybrid Learning Rule for Efficient and Rapid Inference with Spiking Neural Networks. CoRR abs/1907.01167 (2019)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-01302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-01302
Zihan Pan, Yansong Chua, Jibin Wu, Malu Zhang, Haizhou Li, Eliathamby Ambikairajah:
An efficient and perceptually motivated auditory neural encoding and decoding algorithm for spiking neural networks. CoRR abs/1909.01302 (2019)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-08018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-08018
Zihan Pan, Jibin Wu, Yansong Chua, Malu Zhang, Haizhou Li:
Neural Population Coding for Effective Temporal Classification. CoRR abs/1909.08018 (2019)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-10200
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-10200
Chitralekha Gupta, Emre Yilmaz, Haizhou Li:
Automatic Lyrics Transcription in Polyphonic Music: Does Background Music Help? CoRR abs/1909.10200 (2019)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-12681
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-12681
Xianghu Yue, Grandee Lee, Emre Yilmaz, Fang Deng, Haizhou Li:
End-to-End Code-Switching ASR for Low-Resourced Language Pairs. CoRR abs/1909.12681 (2019)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-02839
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-02839
Rui Liu, Berrak Sisman, Jingdong Li, Feilong Bao, Guanglai Gao, Haizhou Li:
Teacher-Student Training for Robust Tacotron-based TTS. CoRR abs/1911.02839 (2019)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-08373
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-08373
Jibin Wu, Emre Yilmaz, Malu Zhang, Haizhou Li, Kay Chen Tan:
Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition. CoRR abs/1911.08373 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-00863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-00863
Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent language modeling architecture for end-to-end ASR. CoRR abs/1912.00863 (2019)
2018
[j99]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/IrtzaSAL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/IrtzaSAL18
Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li:
Using language cluster models in hierarchical language identification. Speech Commun. 100: 30-40 (2018)
[j98]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/PhamXXCCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/PhamXXCCL18
Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Re-ranking spoken term detection with acoustic exemplars of keywords. Speech Commun. 104: 12-23 (2018)
[j97]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XuLLY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XuLLY18
Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang:
Generalizing I-Vector Estimation for Rapid Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(4): 749-759 (2018)
[j96]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Li18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Li18
Haizhou Li:
Farewell Editorial. IEEE ACM Trans. Audio Speech Lang. Process. 26(12): 2489 (2018)
[c496]
- view
  authority control:
- export record
  dblp key:
  - conf/aclnews/LiWACL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/LiWACL18
Zhongwei Li, Xuancong Wang, AiTi Aw, Eng Siong Chng, Haizhou Li:
Named-Entity Tagging and Domain adaptation for Better Customized Translation. NEWS@ACL 2018: 41-46
[c495]
- view
  authority control:
- export record
  dblp key:
  - conf/aclnews/ChenDZBL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/ChenDZBL18
Nancy F. Chen, Xiangyu Duan, Min Zhang, Rafael E. Banchs, Haizhou Li:
NEWS 2018 Whitepaper. NEWS@ACL 2018: 47-54
[c494]
- view
  authority control:
- export record
  dblp key:
  - conf/aclnews/ChenBZDL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/ChenBZDL18
Nancy F. Chen, Rafael E. Banchs, Min Zhang, Xiangyu Duan, Haizhou Li:
Report of NEWS 2018 Named Entity Transliteration Shared Task. NEWS@ACL 2018: 55-73
[c493]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhangSR0Z18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhangSR0Z18
Mingyang Zhang, Berrak Sisman, Sai Sirisha Rallabandi, Haizhou Li, Li Zhao:
Error Reduction Network for DBLSTM-based Voice Conversion. APSIPA 2018: 823-828
[c492]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/LiLY0Y18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiLY0Y18
Yanping Li, Kong-Aik Lee, Yougen Yuan, Haizhou Li, Zhen Yang:
Many-to-Many Voice Conversion based on Bottleneck Features with Variational Autoencoder for Non-parallel Training Data. APSIPA 2018: 829-833
[c491]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/GuptaLW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/GuptaLW18
Chitralekha Gupta, Haizhou Li, Ye Wang:
Automatic Evaluation of Singing Quality without a Reference. APSIPA 2018: 990-997
[c490]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YangD018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YangD018
Jichen Yang, Rohan Kumar Das, Haizhou Li:
Extended Constant-Q Cepstral Coefficients for Detection of Spoofing Attacks. APSIPA 2018: 1024-1029
[c489]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/Das018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Das018
Rohan Kumar Das, Haizhou Li:
Instantaneous Phase and Excitation Source Features for Detection of Replay Attacks. APSIPA 2018: 1030-1037
[c488]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/SuthokumarSSWA018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/SuthokumarSSWA018
Gajan Suthokumar, Kaavya Sriskandaraja, Vidhyasaharan Sethu, Chamith Wijenayake, Eliathamby Ambikairajah, Haizhou Li:
Use of Claimed Speaker Models for Replay Detection. APSIPA 2018: 1038-1046
[c487]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/FernandoSA018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/FernandoSA018
Sarith Fernando, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li:
Second Order Factorized Model Adaptation for Short Duration Language Identification. APSIPA 2018: 1440-1447
[c486]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DasM018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DasM018
Rohan Kumar Das, Maulik C. Madhavi, Haizhou Li:
Compensating Utterance Information in Fixed Phrase Speaker Verification. APSIPA 2018: 1708-1712
[c485]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/VijayanG018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/VijayanG018
Karthika Vijayan, Xiaoxue Gao, Haizhou Li:
Analysis of Speech and Singing Signals for Temporal Alignment. APSIPA 2018: 1893-1898
[c484]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/XiaoY0SH0D018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/XiaoY0SH0D018
Jinba Xiao, Shan Yang, Mingyang Zhang, Berrak Sisman, Dongyan Huang, Lei Xie, Minghui Dong, Haizhou Li:
The I2R-NWPU-NUS Text-to-Speech System for Blizzard Challenge 2018. Blizzard Challenge 2018
[c483]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuRXC018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuRXC018
Chenglin Xu, Wei Rao, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM. ICASSP 2018: 6-10
[c482]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangRSXCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangRSXCL18
Qing Wang, Wei Rao, Sining Sun, Lei Xie, Eng Siong Chng, Haizhou Li:
Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition. ICASSP 2018: 4889-4893
[c481]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Vijayan0SL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Vijayan0SL18
Karthika Vijayan, Haizhou Li, Hanwu Sun, Kong-Aik Lee:
On the Importance of Analytic Phase of Speech Signals in Spoken Language Recognition. ICASSP 2018: 5194-5198
[c480]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/IrtzaSA018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/IrtzaSA018
Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li:
End-to-End Hierarchical Language Identification System. ICASSP 2018: 5199-5203
[c479]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/Pan0WC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/Pan0WC18
Zihan Pan, Haizhou Li, Jibin Wu, Yansong Chua:
An Event-Based Cochlear Filter Temporal Encoding Scheme for Speech Signals. IJCNN 2018: 1-8
[c478]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/WuC018a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/WuC018a
Jibin Wu, Yansong Chua, Haizhou Li:
A Biologically Plausible Speech Recognition Framework Based on Spiking Neural Networks. IJCNN 2018: 1-8
[c477]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SismanL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SismanL18
Berrak Sisman, Haizhou Li:
Wavelet Analysis of Speaker Dependent and Independent Prosody for Voice Conversion. INTERSPEECH 2018: 52-56
[c476]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuanLXCML18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuanLXCML18
Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search. INTERSPEECH 2018: 97-101
[c475]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/XuPKLCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuPKLCL18
Haihua Xu, Van Tung Pham, Zin Tun Kyaw, Zhi Hao Lim, Eng Siong Chng, Haizhou Li:
Mandarin-English Code-switching Speech Recognition. INTERSPEECH 2018: 554-555
[c474]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuLLY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuLLY18
Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang:
Co-whitening of I-vectors for Short and Long Duration Speaker Verification. INTERSPEECH 2018: 1066-1070
[c473]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuptaLW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuptaLW18
Chitralekha Gupta, Haizhou Li, Ye Wang:
Automatic Pronunciation Evaluation of Singing. INTERSPEECH 2018: 1507-1511
[c472]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SismanZL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SismanZL18
Berrak Sisman, Mingyang Zhang, Haizhou Li:
A Voice Conversion Framework with Tandem Feature Sparse Representation and Speaker-Adapted WaveNet Vocoder. INTERSPEECH 2018: 1978-1982
[c471]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuRCL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuRCL18
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning. INTERSPEECH 2018: 3479-3483
[c470]
- view
  - electronic edition @ ircam.fr (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/GuptaT0W18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/GuptaT0W18
Chitralekha Gupta, Rong Tong, Haizhou Li, Ye Wang:
Semi-supervised Lyrics and Solo-singing Alignment. ISMIR 2018: 600-607
[c469]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/TianWXC018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/TianWXC018
Xiaohai Tian, Junchao Wang, Haihua Xu, Eng Siong Chng, Haizhou Li:
Average Modeling Approach to Voice Conversion with Non-Parallel Data. Odyssey 2018: 227-232
[c468]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/SismanL018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/SismanL018
Berrak Sisman, Grandee Lee, Haizhou Li:
Phonetically Aware Exemplar-Based Prosody Transformation. Odyssey 2018: 267-274
[c467]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SismanZS0018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SismanZS0018
Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
Adaptive Wavenet Vocoder for Residual Compensation in GAN-Based Voice Conversion. SLT 2018: 282-289
[c466]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/XuDYY018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/XuDYY018
Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li:
Generative X-Vectors for Text-Independent Speaker Verification. SLT 2018: 1014-1020
[e15]
- view
- export record
  dblp key:
  - conf/aclnews/2018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/2018
Nancy F. Chen, Rafael E. Banchs, Xiangyu Duan, Min Zhang, Haizhou Li:
Proceedings of the Seventh Named Entities Workshop, NEWS@ACL 2018, Melbourne, Australia, July 20, 2018. Association for Computational Linguistics 2018, ISBN 978-1-948087-37-7 [contents]
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-10801
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-10801
Chong Zhang, Kay Chen Tan, Haizhou Li, Geok Soon Hong:
A Cost-Sensitive Deep Belief Network for Imbalanced Classification. CoRR abs/1804.10801 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-00367
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-00367
Chong Zhang, Geok Soon Hong, Jun-Hong Zhou, Kay Chen Tan, Haizhou Li, Huan Xu, Jihoon Hong, Hian-Leng Chan:
A Multi-State Diagnosis and Prognosis Framework with Feature Learning for Tool Condition Monitoring. CoRR abs/1805.00367 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-03621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-03621
Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search. CoRR abs/1806.03621 (2018)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-01013
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-01013
Laxmi R. Iyer, Yansong Chua, Haizhou Li:
Is Neuromorphic MNIST neuromorphic? Analyzing the discriminative power of neuromorphic datasets in the time domain. CoRR abs/1807.01013 (2018)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-06798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-06798
Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li:
Generative x-vectors for text-independent speaker verification. CoRR abs/1809.06798 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-00241
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-00241
Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition. CoRR abs/1811.00241 (2018)
2017
[j95]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/SriskandarajaSA17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/SriskandarajaSA17
Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li:
Front-End for Antispoofing Countermeasures in Speaker Verification: Scattering Spectral Decomposition. IEEE J. Sel. Top. Signal Process. 11(4): 632-643 (2017)
[j94]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ChenLXML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ChenLXML17
Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection. IEEE J. Sel. Top. Signal Process. 11(8): 1329-1339 (2017)
[j93]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenXLLML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenXLLML17
Hongjie Chen, Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma, Haizhou Li:
Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 108-119 (2017)
[j92]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TianLWCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TianLWCL17
Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li:
An Exemplar-Based Approach to Frequency Warping for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1863-1876 (2017)
[c465]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DHaroNCNBKL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DHaroNCNBKL17
Luis Fernando D'Haro, Andreea I. Niculescu, Caixia Cai, Suraj Nair, Rafael E. Banchs, Alois C. Knoll, Haizhou Li:
An integrated framework for multimodal human-robot interaction. APSIPA 2017: 76-82
[c464]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/GuptaLW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/GuptaLW17
Chitralekha Gupta, Haizhou Li, Ye Wang:
Perceptual evaluation of singing quality. APSIPA 2017: 577-586
[c463]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ChenLDPNXHCXSCM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ChenLDPNXHCXSCM17
Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. APSIPA 2017: 1322-1327
[c462]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/SismanLT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/SismanLT17
Berrak Sisman, Haizhou Li, Kay Chen Tan:
Transformation of prosody in voice conversion. APSIPA 2017: 1537-1546
[c461]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/VijayanDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/VijayanDL17
Karthika Vijayan, Minghui Dong, Haizhou Li:
A dual alignment scheme for improved speech-to-singing voice conversion. APSIPA 2017: 1547-1555
[c460]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/SunLNML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/SunLNML17
Hanwu Sun, Kong-Aik Lee, Trung Hieu Nguyen, Bin Ma, Haizhou Li:
I2R-NUS submission to oriental language recognition AP16-OL7 challenge. APSIPA 2017: 1574-1578
[c459]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZengXCCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZengXCCL17
Zhiping Zeng, Haihua Xu, Tze Yuang Chong, Eng Siong Chng, Haizhou Li:
Improving N-gram language modeling for code-switching speech recognition. APSIPA 2017: 1596-1601
[c458]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/CicmanLT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/CicmanLT17
Berrak Sisman, Haizhou Li, Kay Chen Tan:
Sparse representation of phonetic features for voice conversion with and without parallel data. ASRU 2017: 677-684
[c457]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/YangXCLZHL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/YangXCLZHL17
Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li:
Statistical parametric speech synthesis using generative adversarial networks under a multi-task learning framework. ASRU 2017: 685-691
[c456]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ChenLXML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ChenLXML17
Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Multilingual bottle-neck feature learning from untranscribed speech. ASRU 2017: 727-733
[c455]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/YuanLXCML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/YuanLXCML17
Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Extracting bottleneck features and word-like pairs from untranscribed speech for feature representation. ASRU 2017: 734-739
[c454]
- view
  authority control:
- export record
  dblp key:
  - conf/etfa/0003HXTZCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/etfa/0003HXTZCL17
Chong Zhang, Geok Soon Hong, Huan Xu, Kay Chen Tan, Jun-Hong Zhou, Hian-Leng Chan, Haizhou Li:
A data-driven prognostics framework for tool remaining useful life estimation in tool condition monitoring. ETFA 2017: 1-8
[c453]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/SismanLLT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/SismanLLT17
Berrak Sisman, Grandee Lee, Haizhou Li, Kay Chen Tan:
On the analysis and evaluation of prosody conversion techniques. IALP 2017: 44-47
[c452]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/HouTCML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/HouTCML17
Nana Hou, Xiaohai Tian, Eng Siong Chng, Bin Ma, Haizhou Li:
Improving air traffic control speech intelligibility by reducing speaking rate effectively. IALP 2017: 197-200
[c451]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/LeeHCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/LeeHCL17
Grandee Lee, Thi-Nga Ho, Eng Siong Chng, Haizhou Li:
A review of the mandarin-english code-switching corpus: SEAME. IALP 2017: 210-213
[c450]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/LiCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/LiCL17
Zhongwei Li, Eng Siong Chng, Haizhou Li:
Named entity transliteration with sequence-to-sequence neural network. IALP 2017: 374-378
[c449]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoZJCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoZJCL17
Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition. ICASSP 2017: 3246-3250
[c448]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLMMLD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLMMLD17
Liping Chen, Kong-Aik Lee, Bin Ma, Long Ma, Haizhou Li, Li-Rong Dai:
Adaptation of PLDA for multi-source text-independent speaker verification. ICASSP 2017: 5380-5384
[c447]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuanLXCML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuanLXCML17
Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection. ICASSP 2017: 5645-5649
[c446]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/Li17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Li17
Haizhou Li:
ISCA Medal for Scientific Achievement. INTERSPEECH 2017: 1
[c445]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangDXMDYL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangDXMDYL17
Dong-Yan Huang, Wan Ding, Mingyu Xu, Huaiping Ming, Minghui Dong, Xinguo Yu, Haizhou Li:
Multimodal Prediction of Affective Dimensions via Fusing Multiple Regression Techniques. INTERSPEECH 2017: 162-165
[c444]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeHKLa17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeHKLa17
Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332
[c443]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeL17
Kong-Aik Lee, Haizhou Li:
Gain Compensation for Fast i-Vector Extraction Over Short Duration. INTERSPEECH 2017: 1527-1531
[c442]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuXSRCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuXSRCL17
Chenglin Xu, Xiong Xiao, Sining Sun, Wei Rao, Eng Siong Chng, Haizhou Li:
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source. INTERSPEECH 2017: 1894-1898
[c441]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IrtzaSAL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IrtzaSAL17
Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li:
Investigating Scalability in Hierarchical Language Identification System. INTERSPEECH 2017: 2581-2585
[c440]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuHXL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuHXL17
Jie Wu, Dong-Yan Huang, Lei Xie, Haizhou Li:
Denoising Recurrent Neural Network for Deep Bidirectional LSTM Based Voice Conversion. INTERSPEECH 2017: 3379-3383
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/YangXCLZHL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/YangXCLZHL17
Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li:
Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework. CoRR abs/1707.01670 (2017)
2016
[j91]
- view
  authority control:
- export record
  dblp key:
  - journals/cim/HuTTL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cim/HuTTL16
Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li:
How the Brain Formulates Memory: A Spatio-Temporal Model Research Frontier. IEEE Comput. Intell. Mag. 11(2): 56-68 (2016)
[j90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasp/XiaoZNZJCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasp/XiaoZNZJCL16
Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation. EURASIP J. Adv. Signal Process. 2016: 4 (2016)
[j89]
- view
  authority control:
- export record
  dblp key:
  - journals/mta/WuL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/WuL16
Zhizheng Wu, Haizhou Li:
On the study of replay and voice conversion attacks to text-dependent speaker verification. Multim. Tools Appl. 75(9): 5311-5327 (2016)
[j88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/ChenWTML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ChenWTML16
Nancy F. Chen, Darren Wee, Rong Tong, Bin Ma, Haizhou Li:
Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL. Speech Commun. 84: 46-56 (2016)
[j87]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ShepstoneLLTJ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ShepstoneLLTJ16
Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen:
Total Variability Modeling Using Source-Specific Priors. IEEE ACM Trans. Audio Speech Lang. Process. 24(3): 504-517 (2016)
[j86]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/NguyenXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NguyenXCL16
Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(6): 1006-1019 (2016)
[j85]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/YuYTTL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/YuYTTL16
Qiang Yu, Rui Yan, Huajin Tang, Kay Chen Tan, Haizhou Li:
A Spiking Neural Network System for Robust Sequence Recognition. IEEE Trans. Neural Networks Learn. Syst. 27(3): 621-635 (2016)
[j84]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/UedaWKXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/UedaWKXCL16
Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization. J. Signal Process. Syst. 82(2): 151-161 (2016)
[j83]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/ChenLMGLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/ChenLMGLD16
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Exploration of Local Variability in Text-Independent Speaker Verification. J. Signal Process. Syst. 82(2): 217-228 (2016)
[c439]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KimBL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KimBL16
Seokhwan Kim, Rafael E. Banchs, Haizhou Li:
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling for Dialogue Topic Tracking. ACL (1) 2016
[c438]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aclnews/JiangBL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/JiangBL16
Ridong Jiang, Rafael E. Banchs, Haizhou Li:
Evaluating and Combining Name Entity Recognition Systems. NEWS@ACM 2016: 21-27
[c437]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aclnews/DuanZLBK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/DuanZLBK16
Xiangyu Duan, Min Zhang, Haizhou Li, Rafael E. Banchs, A. Kumaran:
Whitepaper of NEWS 2016 Shared Task on Machine Transliteration. NEWS@ACM 2016: 49-57
[c436]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aclnews/DuanBZLK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/DuanBZLK16
Xiangyu Duan, Rafael E. Banchs, Min Zhang, Haizhou Li, A. Kumaran:
Report of NEWS 2016 Machine Transliteration Shared Task. NEWS@ACM 2016: 58-72
[c435]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ChenL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ChenL16
Nancy F. Chen, Haizhou Li:
Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning. APSIPA 2016: 1-7
[c434]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TianXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TianXCL16
Xiaohai Tian, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing speech detection using temporal convolutional neural network. APSIPA 2016: 1-6
[c433]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XiaoWCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XiaoWCL16
Xiong Xiao, Shinji Watanabe, Eng Siong Chng, Haizhou Li:
Beamforming networks using spatial covariance features for far-field speech recognition. APSIPA 2016: 1-6
[c432]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XuRXHCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XuRXHCL16
Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
I-vector based deep neural network acoustic model adaptation using multilingual language resource. APSIPA 2016: 1-5
[c431]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection from a feature representation perspective. ICASSP 2016: 2119-2123
[c430]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MingHXZDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MingHXZDL16
Huaiping Ming, Dong-Yan Huang, Lei Xie, Shaofei Zhang, Minghui Dong, Haizhou Li:
Exemplar-based sparse representation of timbre and prosody for voice conversion. ICASSP 2016: 5175-5179
[c429]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLCMLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLCMLD16
Liping Chen, Kong-Aik Lee, Eng Siong Chng, Bin Ma, Haizhou Li, Li-Rong Dai:
Content-aware local variability vector for speaker verification with short utterance. ICASSP 2016: 5485-5489
[c428]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/IrtzaSBAL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/IrtzaSBAL16
Saad Irtza, Vidhyasaharan Sethu, Haris Bavattichalil, Eliathamby Ambikairajah, Haizhou Li:
A hierarchical framework for language identification. ICASSP 2016: 5820-5824
[c427]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NiLWLRLCML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NiLWLRLCML16
Chongjia Ni, Cheung-Chi Leung, Lei Wang, Haibo Liu, Feng Rao, Li Lu, Nancy F. Chen, Bin Ma, Haizhou Li:
Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search. ICASSP 2016: 6015-6019
[c426]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuHXPLWDLXMCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuHXPLWDLXMCL16
Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li:
Approximate search of audio queries by using DTW with phone time boundary and data augmentation. ICASSP 2016: 6030-6034
[c425]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PhamXXCCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PhamXXCCL16
Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Keyword search using query expansion for graph-based rescoring of hypothesized detections. ICASSP 2016: 6035-6039
[c424]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenPXXDNCSLCML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenPXXDNCSLCML16
Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li:
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili. ICASSP 2016: 6040-6044
[c423]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoZNJCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoZNJCL16
Xiong Xiao, Shengkui Zhao, Thi Ngoc Tho Nguyen, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
An expectation-maximization eigenvector clustering approach to direction of arrival estimation of multiple speech sources. ICASSP 2016: 6330-6334
[c422]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangDL16
Dong-Yan Huang, Minghui Dong, Haizhou Li:
Combining multiple kernel models for automatic intelligibility detection of pathological speech. ICASSP 2016: 6485-6489
[c421]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/DingXHLDYL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/DingXHLDYL16
Wan Ding, Mingyu Xu, Dong-Yan Huang, Weisi Lin, Minghui Dong, Xinguo Yu, Haizhou Li:
Audio and face video emotion recognition in the wild using deep neural networks and small datasets. ICMI 2016: 506-513
[c420]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuanLXML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuanLXML16
Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Learning Neural Network Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information. INTERSPEECH 2016: 788-792
[c419]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLXML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLXML16
Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection. INTERSPEECH 2016: 923-927
[c418]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PhamXXCCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PhamXXCCL16
Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples. INTERSPEECH 2016: 933-937
[c417]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChanDHL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChanDHL16
Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li:
SERAPHIM: A Wavetable Synthesis System with 3D Lip Animation for Real-Time Speech and Singing Applications on Mobile Platforms. INTERSPEECH 2016: 1225-1229
[c416]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuSNXHCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuSNXHCL16
Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions. INTERSPEECH 2016: 1315-1319
[c415]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuXXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuXXCL16
Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li:
A DNN-HMM Approach to Story Segmentation. INTERSPEECH 2016: 1527-1531
[c414]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenTWLML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenTWLML16
Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li:
SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese. INTERSPEECH 2016: 1545-1549
[c413]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions. INTERSPEECH 2016: 1715-1719
[c412]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/ChanDHL16a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChanDHL16a
Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li:
SERAPHIM Live! - Singing Synthesis for the Performer, the Composer, and the 3D Game Developer. INTERSPEECH 2016: 1966-1967
[c411]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MingHXWDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MingHXWDL16
Huaiping Ming, Dong-Yan Huang, Lei Xie, Jie Wu, Minghui Dong, Haizhou Li:
Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion. INTERSPEECH 2016: 2453-2457
[c410]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongCML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongCML16
Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li:
Context Aware Mispronunciation Detection for Mandarin Pronunciation Training. INTERSPEECH 2016: 3112-3116
[c409]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeLDHRXLSNWSCK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeLDHRXLSNWSCK16
Kong-Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Wei Rao, Xiong Xiao, Anthony Larcher, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov, Amir Hossein Poorjam, Trung Ngo Trong, Chenglin Xu, Haihua Xu, Bin Ma, Eng Siong Chng, Sylvain Meignier:
The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS. INTERSPEECH 2016: 3211-3215
[c408]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IrtzaSFAL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IrtzaSFAL16
Saad Irtza, Vidhyasaharan Sethu, Sarith Fernando, Eliathamby Ambikairajah, Haizhou Li:
Out of Set Language Modelling in Hierarchical Language Identification. INTERSPEECH 2016: 3270-3274
[c407]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NiWLRLML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NiWLRLML16
Chongjia Ni, Lei Wang, Cheung-Chi Leung, Feng Rao, Li Lu, Bin Ma, Haizhou Li:
Rapid Update of Multilingual Deep Neural Network for Low-Resource Keyword Search. INTERSPEECH 2016: 3698-3702
[c406]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeungWXHPLXXNMC16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeungWXHPLXXNMC16
Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li:
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis. INTERSPEECH 2016: 3703-3707
[c405]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/RaoXXXLCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/RaoXXXLCL16
Wei Rao, Xiong Xiao, Chenglin Xu, Haihua Xu, Kong-Aik Lee, Eng Siong Chng, Haizhou Li:
Neural networks based channel compensation for i-vector speaker verification. ISCSLP 2016: 1-5
[c404]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhangXWDICL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhangXWDICL16
Zhaofeng Zhang, Xiong Xiao, Longbiao Wang, Jianwu Dang, Masahiro Iwahashi, Eng Siong Chng, Haizhou Li:
Multi-channel feature adaptation for robust speech recognition. ISCSLP 2016: 1-5
[c403]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/WangNLYXXXNCML16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/WangNLYXXXNCML16
Lei Wang, Chongjia Ni, Cheung-Chi Leung, Changhuai You, Lei Xie, Haihua Xu, Xiong Xiao, Tin Lay Nwe, Eng Siong Chng, Bin Ma, Haizhou Li:
The NNI Vietnamese Speech Recognition System for MediaEval 2016. MediaEval 2016
[c402]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/odyssey/Li16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/Li16
Haizhou Li:
Voice conversion and spoofing countermeasures for speaker verification. Odyssey 2016
[c401]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/XuLLY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/XuLLY16
Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang:
Rapid Computation of I-vector. Odyssey 2016: 47-52
[c400]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/SunNWLML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/SunNWLML15
Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Kong-Aik Lee, Bin Ma, Haizhou Li:
I2R Submission to the 2015 NIST Language Recognition I-vector Challenge. Odyssey 2016: 311-318
[c399]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/HuangXLWMTZDLHD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/HuangXLWMTZDLHD16
Dong-Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Nguyen Quy Hy, Minghui Dong, Haizhou Li:
An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity. SSW 2016: 44-51
[e14]
- view
- export record
  dblp key:
  - conf/aclnews/2016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/2016
Xiangyu Duan, Rafael E. Banchs, Min Zhang, Haizhou Li, A. Kumaran:
Proceedings of the Sixth Named Entity Workshop, NEWS@ACL 2016, Berlin, Germany, August 12, 2016. Association for Computational Linguistics 2016, ISBN 978-1-945626-16-6 [contents]
[e13]
- view
- export record
  dblp key:
  - conf/ialp/2016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/2016
Minghui Dong, Yuen-Hsien Tseng, Yanfeng Lu, Liang-Chih Yu, Lung-Hao Lee, Chung-Hsien Wu, Haizhou Li:
2016 International Conference on Asian Language Processing, IALP 2016, Tainan, Taiwan, November 21-23, 2016. IEEE 2016, ISBN 978-1-5090-0922-0 [contents]
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/LeeHLRSNWSKPTXX16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeeHLRSNWSKPTXX16
Kong-Aik Lee, Ville Hautamäki, Anthony Larcher, Wei Rao, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Ivan Kukanov, Amir Hossein Poorjam, Trung Ngo Trong, Xiong Xiao, Chenglin Xu, Haihua Xu, Bin Ma, Haizhou Li, Sylvain Meignier:
Fantastic 4 system for NIST 2015 Language Recognition Evaluation. CoRR abs/1602.01929 (2016)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection under noisy conditions: a preliminary investigation and an initial database. CoRR abs/1602.02950 (2016)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ZhangXWCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangXWCL16
Zhaofeng Zhang, Xiong Xiao, Longbiao Wang, Eng Siong Chng, Haizhou Li:
Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And ChannelWeighting. CoRR abs/1604.03276 (2016)
2015
[j82]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/YouLL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/YouLL15
Chang Huai You, Haizhou Li, Kong-Aik Lee:
Relevance factor of maximum a posteriori adaptation for GMM-NAP-SVM in speaker and language recognition. Comput. Speech Lang. 30(1): 116-134 (2015)
[j81]
- view
  authority control:
- export record
  dblp key:
  - journals/ijhr/LiXWYTL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijhr/LiXWYTL15
Liyuan Li, Qianli Xu, Gang S. Wang, Xinguo Yu, Yeow Kee Tan, Haizhou Li:
Visual Perception Based Engagement Awareness for Multiparty Human-Robot Interaction. Int. J. Humanoid Robotics 12(4): 1550019:1-1550019:28 (2015)
[j80]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/DoXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/DoXCL15
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages. Int. J. Asian Lang. Process. 23(1): 21-33 (2015)
[j79]
- view
  authority control:
- export record
  dblp key:
  - journals/lre/LyuTCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/lre/LyuTCL15
Dau-Cheng Lyu, Tien Ping Tan, Engsiong Chng, Haizhou Li:
Mandarin-English code-switching speech corpus in South-East Asia: SEAME. Lang. Resour. Evaluation 49(3): 581-600 (2015)
[j78]
- view
  authority control:
- export record
  dblp key:
  - journals/mta/WuCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/WuCL15
Zhizheng Wu, Engsiong Chng, Haizhou Li:
Exemplar-based voice conversion using joint nonnegative matrix factorization. Multim. Tools Appl. 74(22): 9943-9958 (2015)
[j77]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/WuEKYAL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/WuEKYAL15
Zhizheng Wu, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li:
Spoofing and countermeasures for speaker verification: A survey. Speech Commun. 66: 130-153 (2015)
[j76]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ChenLDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ChenLDL15
Liping Chen, Kong-Aik Lee, Li-Rong Dai, Haizhou Li:
Quasi-Factorial Prior for i-vector Extraction. IEEE Signal Process. Lett. 22(12): 2484-2488 (2015)
[j75]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangLLML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangLLML15
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Acoustic Segment Modeling with Spectral Clustering Methods. IEEE ACM Trans. Audio Speech Lang. Process. 23(2): 264-277 (2015)
[j74]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiFHMT15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiFHMT15
Haizhou Li, Marcello Federico, Xiaodong He, Helen M. Meng, Isabel Trancoso:
Introduction to the Special Section on Continuous Space and Related Methods in Natural Language Processing. IEEE ACM Trans. Audio Speech Lang. Process. 23(3): 427-430 (2015)
[j73]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/BanchsDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/BanchsDL15
Rafael E. Banchs, Luis F. D'Haro, Haizhou Li:
Adequacy-Fluency Metrics: Evaluating MT in the Continuous Space Model Framework. IEEE ACM Trans. Audio Speech Lang. Process. 23(3): 472-482 (2015)
[j72]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChongBCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChongBCL15
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
Decoupling Word-Pair Distance and Co-occurrence Information for Effective Long History Context Language Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 23(7): 1221-1232 (2015)
[j71]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DennisDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DennisDL15
Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Generalized Hough Transform for Speech Pattern Classification. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1963-1972 (2015)
[c398]
- view
  authority control:
- export record
  dblp key:
  - books/sp/15/DHaroKYJNBL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/15/DHaroKYJNBL15
Luis Fernando D'Haro, Seokhwan Kim, Kheng Hui Yeo, Ridong Jiang, Andreea I. Niculescu, Rafael E. Banchs, Haizhou Li:
CLARA: A Multifunctional Virtual Agent for Conference Support and Touristic Information. IWSDS 2015: 233-239
[c397]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/YuanTSTL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YuanTSTL15
Miaolong Yuan, Bo Tian, Vui Ann Shim, Huajin Tang, Haizhou Li:
An Entorhinal-Hippocampal Model for Simultaneous Cognitive Map Building. AAAI 2015: 586-592
[c396]
- view
  authority control:
- export record
  dblp key:
  - conf/acii/MingHDLXZ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acii/MingHDLXZ15
Huaiping Ming, Dong-Yan Huang, Minghui Dong, Haizhou Li, Lei Xie, Shaofei Zhang:
Fundamental frequency modeling using wavelets for emotional voice conversion. ACII 2015: 804-809
[c395]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aclnews/ZhangLBK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/ZhangLBK15
Min Zhang, Haizhou Li, Rafael E. Banchs, A. Kumaran:
Whitepaper of NEWS 2015 Shared Task on Machine Transliteration. NEWS@ACL 2015: 1-9
[c394]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aclnews/BanchsZDLK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/BanchsZDLK15
Rafael E. Banchs, Min Zhang, Xiangyu Duan, Haizhou Li, A. Kumaran:
Report of NEWS 2015 Machine Transliteration Shared Task. NEWS@ACL 2015: 10-23
[c393]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DoXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DoXCL15
Van Hai Do, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Distance metric learning for kernel density-based acoustic model under limited training data conditions. APSIPA 2015: 54-58
[c392]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YuXXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YuXXCL15
Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, Haizhou Li:
A density peak clustering approach to unsupervised acoustic subword units discovery. APSIPA 2015: 178-183
[c391]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhangHXCLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhangHXCLD15
Shaofei Zhang, Dong-Yan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong:
Non-negative matrix factorization using stable alternating direction method of multipliers for source separation. APSIPA 2015: 222-228
[c390]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/PhamXDCXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/PhamXDCXCL15
Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng, Haizhou Li:
On the study of very low-resource language keyword search. APSIPA 2015: 358-364
[c389]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DongYLEHMTLL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DongYLEHMTLL15
Minghui Dong, Chenyu Yang, Yanfeng Lu, Jochen Walter Ehnes, Dong-Yan Huang, Huaiping Ming, Rong Tong, Siu Wa Lee, Haizhou Li:
Mapping frames with DNN-HMM recognizer for non-parallel voice conversion. APSIPA 2015: 488-494
[c388]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DoXXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DoXXCL15
Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, Haizhou Li:
Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation. APSIPA 2015: 594-98
[c387]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhaoXZNZRWJCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhaoXZNZRWJCL15
Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren, Longbiao Wang, Douglas L. Jones, Engsiong Chng, Haizhou Li:
Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction. ASRU 2015: 460-467
[c386]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/XuXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/XuXCL15
Haihua Xu, Xiong Xiao, Engsiong Chng, Haizhou Li:
On statistical machine translation method for lexicon refinement in speech recognition. ChinaSIP 2015: 25-29
[c385]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/TianDXXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/TianDXXCL15
Xiaohai Tian, Steven Du, Xiong Xiao, Haihua Xu, Engsiong Chng, Haizhou Li:
Detecting synthetic speech using long term magnitude and phase information. ChinaSIP 2015: 611-615
[c384]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/KimBL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KimBL15
Seokhwan Kim, Rafael E. Banchs, Haizhou Li:
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constraints from Wikipedia. EMNLP 2015: 2225-2229
[c383]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/WuWZAL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/WuWZAL15
Kui Wu, Xuancong Wang, Nina Zhou, AiTi Aw, Haizhou Li:
Joint Chinese word segmentation and punctuation prediction using deep recurrent neural network for social media data. IALP 2015: 41-44
[c382]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/ChuaCPCDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/ChuaCPCDL15
Gillian Chua, Qian Ci Chang, Ye Won Park, Paul Yaozhu Chan, Minghui Dong, Haizhou Li:
The expression of singing emotion - contradicting the constraints of song. IALP 2015: 98-102
[c381]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/YuLHDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/YuLHDL15
Yang Yu, Weisi Lin, Dong-Yan Huang, Minghui Dong, Haizhou Li:
Performance scoring of singing voice. IALP 2015: 119-122
[c380]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/JiangKBL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/JiangKBL15
Ridong Jiang, Seokhwan Kim, Rafael E. Banchs, Haizhou Li:
Towards improving the performance of Vector Space Model for Chinese Frequently Asked Question Answering. IALP 2015: 136-139
[c379]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DennisDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DennisDL15
Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Combining robust spike coding with spiking neural networks for sound event classification. ICASSP 2015: 176-180
[c378]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoZZJCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoZZJCL15
Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li:
A learning-based approach to direction of arrival estimation in noisy and reverberant environments. ICASSP 2015: 2814-2818
[c377]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShepstoneLLTJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShepstoneLLTJ15
Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen:
Source-specific informative prior for i-vector extraction. ICASSP 2015: 4185-4189
[c376]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuYXXLCYLWLMCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuYXXLCYLWLMCL15
Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Engsiong Chng, Haizhou Li:
Language independent query-by-example spoken term detection using N-best phone sequences and partial matching. ICASSP 2015: 5191-5195
[c375]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLMGLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLMGLD15
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Channel adaptation of plda for text-independent speaker verification. ICASSP 2015: 5251-5255
[c374]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongCLML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongCLML15
Rong Tong, Nancy F. Chen, Boon Pang Lim, Bin Ma, Haizhou Li:
Tokenizing fundamental frequency variation for Mandarin tone error detection. ICASSP 2015: 5361-5365
[c373]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenNCSPXXLLLL015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenNCSPXXLLLL015
Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao, Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina Goh, Engsiong Chng, Bin Ma, Haizhou Li:
Low-resource keyword search strategies for tamil. ICASSP 2015: 5366-5370
[c372]
- view
  authority control:
- export record
  dblp key:
  - conf/icdsp/ChanDLTCYCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdsp/ChanDLTCYCL15
Paul Yaozhu Chan, Minghui Dong, Yi Qian Lim, Ashleigh Toh, Elliot Chong, Mantita Yeo, Megan Chua, Haizhou Li:
Formant excursion in singing synthesis. DSP 2015: 168-172
[c371]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLMGLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLMGLD15
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Phone-centric local variability vector for text-constrained speaker verification. INTERSPEECH 2015: 229-233
[c370]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenTWLML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenTWLML15
Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li:
iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent. INTERSPEECH 2015: 324-328
[c369]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongCML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongCML15
Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li:
Goodness of tone (GOT) for non-native Mandarin tone recognition. INTERSPEECH 2015: 801-805
[c368]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IrtzaSLAL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IrtzaSLAL15
Saad Irtza, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah, Haizhou Li:
Phonemes frequency based PLLR dimensionality reduction for language recognition. INTERSPEECH 2015: 997-1001
[c367]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuLLY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuLLY15
Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang:
Sparse coding of total variability matrix. INTERSPEECH 2015: 1022-1026
[c366]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChongBCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChongBCL15
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
TDTO language modeling with feedforward neural networks. INTERSPEECH 2015: 1458-1462
[c365]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangHXCLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangHXCLD15
Shaofei Zhang, Dong-Yan Huang, Lei Xie, Engsiong Chng, Haizhou Li, Minghui Dong:
Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation. INTERSPEECH 2015: 1498-1502
[c364]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DennisDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DennisDL15
Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Spiking neural networks and the generalised hough transform for speech pattern detection. INTERSPEECH 2015: 1997-2001
[c363]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoTDXCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoTDXCL15
Xiong Xiao, Xiaohai Tian, Steven Du, Haihua Xu, Engsiong Chng, Haizhou Li:
Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge. INTERSPEECH 2015: 2052-2056
[c362]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/LeeWNSNTML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeWNSNTML15
Kong-Aik Lee, Guangsen Wang, Kam Pheng Ng, Hanwu Sun, Trung Hieu Nguyen, Ngoc Thuy Huong Thai, Bin Ma, Haizhou Li:
The reddots platform for mobile crowd-sourcing of speech data. INTERSPEECH 2015: 2603-2604
[c361]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangDL15
Dong-Yan Huang, Minghui Dong, Haizhou Li:
A real-time variable-q non-stationary Gabor transform for pitch shifting. INTERSPEECH 2015: 2744-2748
[c360]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeLWKBLAKVMLSA15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeLWKBLAKVMLSA15
Kong-Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer, David A. van Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos Stafylakis, Md. Jahangir Alam, Albert Swart, Javier Perez:
The reddots data collection for speaker recognition. INTERSPEECH 2015: 2996-3000
[c359]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLXML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLXML15
Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Parallel inference of dirichlet process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study. INTERSPEECH 2015: 3189-3193
[c358]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MingHXLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MingHXLD15
Huaiping Ming, Dong-Yan Huang, Lei Xie, Haizhou Li, Minghui Dong:
An alternating optimization approach for phase retrieval. INTERSPEECH 2015: 3426-3430
[c357]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoZZJCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoZZJCL15
Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li:
Learning to estimate reverberation time in noisy and reverberant rooms. INTERSPEECH 2015: 3431-3435
[c356]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NgoCNML15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgoCNML15
Hoang Gia Ngo, Nancy F. Chen, Binh Minh Nguyen, Bin Ma, Haizhou Li:
Phonology-augmented statistical transliteration for low-resource languages. INTERSPEECH 2015: 3670-3674
[c355]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/HouPL0XLXFNXCZS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/HouPL0XLXFNXCZS15
Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Engsiong Chng, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2015. MediaEval 2015
[c354]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoL15
Sheng Gao, Haizhou Li:
Octave-dependent Probabilistic Latent Semantic Analysis to Chorus Detection of Popular Song. ACM Multimedia 2015: 979-982
[c353]
- view
  authority control:
- export record
  dblp key:
  - conf/mmsp/GaoL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmsp/GaoL15
Sheng Gao, Haizhou Li:
Popular song summarization using chorus section detection from audio signal. MMSP 2015: 1-6
[c352]
- view
  authority control:
- export record
  dblp key:
  - conf/sigdial/KimBL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigdial/KimBL15
Seokhwan Kim, Rafael E. Banchs, Haizhou Li:
Towards Improving Dialogue Topic Tracking Performances with Wikification of Concept Mentions. SIGDIAL Conference 2015: 124-128
[p1]
- view
  authority control:
- export record
  dblp key:
  - series/lnsn/ZhuGPLDS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/series/lnsn/ZhuGPLDS15
Linhong Zhu, Sheng Gao, Sinno Jialin Pan, Haizhou Li, Dingxiong Deng, Cyrus Shahabi:
The Pareto Principle Is Everywhere: Finding Informative Sentences for Opinion Summarization Through Leader Detection. Recommendation and Search in Social Networks 2015: 165-187
[e12]
- view
- export record
  dblp key:
  - conf/aclnews/2015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/2015
Xiangyu Duan, Rafael E. Banchs, Min Zhang, Haizhou Li, A. Kumaran:
Proceedings of the Fifth Named Entity Workshop, NEWS@ACL 2015, Beijing, China, July 31, 2015. Association for Computational Linguistics 2015, ISBN 978-1-941643-65-5 [contents]
2014
[j70]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/DoXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/DoXCL14
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages. IEICE Trans. Inf. Syst. 97-D(2): 285-295 (2014)
[j69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/LarcherLML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LarcherLML14
Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Text-dependent speaker verification: Classifiers, databases and RSR2015. Speech Commun. 60: 56-77 (2014)
[j68]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuVCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuVCL14
Zhizheng Wu, Tuomas Virtanen, Engsiong Chng, Haizhou Li:
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1506-1521 (2014)
[j67]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/YuanTL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/YuanTL14
Miaolong Yuan, Huajin Tang, Haizhou Li:
Real-Time Keypoint Recognition Using Restricted Boltzmann Machine. IEEE Trans. Neural Networks Learn. Syst. 25(11): 2119-2126 (2014)
[c351]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KimBL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KimBL14
Seokhwan Kim, Rafael E. Banchs, Haizhou Li:
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia. ACL (2) 2014: 19-23
[c350]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/HuangLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HuangLD14
Dong-Yan Huang, Haizhou Li, Minghui Dong:
Ensemble Nyström method for predicting conflict level from speech. APSIPA 2014: 1-5
[c349]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/HuangXXXSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HuangXXXSL14
Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Chng Eng Siong, Haizhou Li:
Multi-view features in a DNN-CRF model for improved sentence unit detection on English broadcast news. APSIPA 2014: 1-9
[c348]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/LiuHLDLO14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiuHLDLO14
Shuojun Liu, Dong-Yan Huang, Weisi Lin, Minghui Dong, Haizhou Li, Ee Ping Ong:
Emotional facial expression transfer based on temporal restricted Boltzmann machines. APSIPA 2014: 1-7
[c347]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WuGCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuGCL14
Zhizheng Wu, Sheng Gao, Engsiong Chng, Haizhou Li:
A study on replay attack and anti-spoofing for text-dependent speaker verification. APSIPA 2014: 1-5
[c346]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XuPCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XuPCL14
Haihua Xu, Van Tung Pham, Engsiong Chng, Haizhou Li:
Towards better keyword search performance on Malay broadcast news data. APSIPA 2014: 1-5
[c345]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/MingHXL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/MingHXL14
Huaiping Ming, Dong-Yan Huang, Lei Xie, Haizhou Li:
Learning optimal features for music transcription. ChinaSIP 2014: 105-109
[c344]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimBL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimBL14
Seokhwan Kim, Rafael E. Banchs, Haizhou Li:
Wikipedia-based Kernels for dialogue topic tracking. ICASSP 2014: 131-135
[c343]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LarcherLML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LarcherLML14
Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Modelling the alternative hypothesis for text-dependent speaker verification. ICASSP 2014: 734-738
[c342]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LarcherLML14a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LarcherLML14a
Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Imposture classification for text-dependent speaker verification. ICASSP 2014: 739-743
[c341]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL14
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Feature compensation using linear combination of speaker and environment dependent correction vectors. ICASSP 2014: 1720-1724
[c340]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NguyenXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NguyenXCL14
Duc Hoang Ha Nguyen, Xiong Xiao, Engsiong Chng, Haizhou Li:
Generalization of temporal filter and linear transformation for robust speech recognition. ICASSP 2014: 1730-1734
[c339]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DennisDLC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DennisDLC14
Jonathan William Dennis, Tran Huy Dat, Haizhou Li, Engsiong Chng:
A discriminatively trained Hough Transform for frame-level phoneme recognition. ICASSP 2014: 2514-2518
[c338]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangDL14
Dong-Yan Huang, Minghui Dong, Haizhou Li:
Intelligibility detection of pathological speech using asymmetric sparse kernel partial least squares classifier. ICASSP 2014: 3744-3748
[c337]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLMGLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLMGLD14
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Minimum divergence estimation of speaker prior in multi-session PLDA scoring. ICASSP 2014: 4007-4011
[c336]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenSLNXPML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenSLNXPML14
Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Hoang Gia Ngo, Haihua Xu, Van Tung Pham, Bin Ma, Haizhou Li:
Strategies for Vietnamese keyword search. ICASSP 2014: 4121-4125
[c335]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChongBCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChongBCL14
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
Improving language modeling by using distance and co-occurrence information of word-pairs and its application to LVCSR. ICASSP 2014: 4883-4887
[c334]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongLCML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongLCML14
Rong Tong, Boon Pang Lim, Nancy F. Chen, Bin Ma, Haizhou Li:
Subspace Gaussian mixture model for computer-assisted language learning. ICASSP 2014: 5347-5351
[c333]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PhamXCSLCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PhamXCSLCL14
Van Tung Pham, Haihua Xu, Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Engsiong Chng, Haizhou Li:
Discriminative score normalization for keyword search decision. ICASSP 2014: 7078-7082
[c332]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoXSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoXSL14
Van Hai Do, Xiong Xiao, Chng Eng Siong, Haizhou Li:
Kernel density-based acoustic model with cross-lingual bottleneck features for resource limited LVCSR. INTERSPEECH 2014: 6-10
[c331]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLLML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLLML14
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
A graph-based Gaussian component clustering approach to unsupervised acoustic modeling. INTERSPEECH 2014: 875-879
[c330]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LarcherLMNML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LarcherLMNML14
Anthony Larcher, Kong-Aik Lee, Pablo Luis Sordo Martinez, Trung Hieu Nguyen, Bin Ma, Haizhou Li:
Extended RSR2015 for text-dependent speaker verification over VHF channel. INTERSPEECH 2014: 1322-1326
[c329]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NgoCSML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgoCSML14
Hoang Gia Ngo, Nancy F. Chen, Sunil Sivadas, Bin Ma, Haizhou Li:
A minimal-resource transliteration framework for vietnamese. INTERSPEECH 2014: 1410-1414
[c328]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangLXML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangLXML14
Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection. INTERSPEECH 2014: 1722-1726
[c327]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuSSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuSSL14
Haihua Xu, Hang Su, Chng Eng Siong, Haizhou Li:
Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems. INTERSPEECH 2014: 2078-2082
[c326]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/DongLLCPEH14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DongLLCPEH14
Minghui Dong, Siu Wa Lee, Haizhou Li, Paul Y. Chan, Xuejian Peng, Jochen Walter Ehnes, Dong-Yan Huang:
I²r speech2singing perfects everyone's singing. INTERSPEECH 2014: 2148-2149
[c325]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeWDTL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeWDTL14
Siu Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, Haizhou Li:
A comparative study of spectral transformation techniques for singing voice synthesis. INTERSPEECH 2014: 2499-2503
[c324]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuSL14
Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Joint nonnegative matrix factorization for exemplar-based voice conversion. INTERSPEECH 2014: 2509-2513
[c323]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuXHXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuXHXCL14
Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Engsiong Chng, Haizhou Li:
A deep neural network approach for sentence boundary detection in broadcast news. INTERSPEECH 2014: 2887-2891
[c322]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongML14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongML14
Rong Tong, Bin Ma, Haizhou Li:
Virtual example for phonotactic language recognition. INTERSPEECH 2014: 3017-3021
[c321]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/ShimTYTL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/ShimTYTL14
Vui Ann Shim, Bo Tian, Miaolong Yuan, Huajin Tang, Haizhou Li:
Direction-driven navigation using cognitive map for mobile robots. IROS 2014: 2639-2646
[c320]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChenLMGLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChenLMGLD14
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Local variability vector for text-independent speaker verification. ISCSLP 2014: 54-58
[c319]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/UedaWKXCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/UedaWKXCL14
Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization. ISCSLP 2014: 379-383
[c318]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/Poon-FengHDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Poon-FengHDL14
Kelvin Poon-Feng, Dong-Yan Huang, Minghui Dong, Haizhou Li:
Acoustic emotion recognition based on fusion of multiple feature-dependent deep Boltzmann machines. ISCSLP 2014: 584-588
[c317]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/YangXXXLCYL0LMSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/YangXXXLCYL0LMSL14
Peng Yang, Haihua Xu, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei Wang, Su Jun Leow, Bin Ma, Chng Eng Siong, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2014. MediaEval 2014
[c316]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/LeeM0CGD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LeeM0CGD14
Kong Aik Lee, Bin Ma, Haizhou Li, Liping Chen, Wu Guo, Li-Rong Dai:
Local Variability Modeling for Text-Independent Speaker Verification. Odyssey 2014: 54-59
[c315]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/YouLM014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/YouLM014
Changhuai You, Kong Aik Lee, Bin Ma, Haizhou Li:
Text-Dependent Speaker Verification System in VHF Communication Channel. Odyssey 2014: 216-223
[c314]
- view
  authority control:
- export record
  dblp key:
  - conf/ro-man/MirnigTCCDLT14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ro-man/MirnigTCCDLT14
Nicole Mirnig, Yeow Kee Tan, Tai Wen Chang, Yuanwei Chua, Tran Anh Dung, Haizhou Li, Manfred Tscheligi:
Screen feedback in human-robot interaction: How to enhance robot expressiveness. RO-MAN 2014: 224-230
[c313]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/PhamCSXCNCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/PhamCSXCNCL14
Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Engsiong Chng, Haizhou Li:
System and keyword dependent fusion for spoken term detection. SLT 2014: 430-435
[c312]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/NiculescuBL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/NiculescuBL14
Andreea I. Niculescu, Rafael E. Banchs, Haizhou Li:
Why Industrial Robots Should Become More Social - On the Design of a Natural Language Interface for an Interactive Robot Welder. ICSR 2014: 276-278
[c311]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/sspr/HautamakiPKLLF14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sspr/HautamakiPKLLF14
Ville Hautamäki, Antti Pöllänen, Tomi Kinnunen, Kong-Aik Lee, Haizhou Li, Pasi Fränti:
A Comparison of Categorical Attribute Data Clustering Methods. S+SSPR 2014: 53-62
[e11]
- view
  authority control:
- export record
  dblp key:
  - conf/interspeech/2014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/2014
Haizhou Li, Helen M. Meng, Bin Ma, Engsiong Chng, Lei Xie:
15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014, Singapore, September 14-18, 2014. ISCA 2014 [contents]
[e10]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/2014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/2014
Minghui Dong, Jianhua Tao, Haizhou Li, Thomas Fang Zheng, Yanfeng Lu:
The 9th International Symposium on Chinese Spoken Language Processing, Singapore, September 12-14, 2014. IEEE 2014, ISBN 978-1-4799-4220-6 [contents]
2013
[j66]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/SaktiPFSVKHSNPWXRALL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/SaktiPFSVKHSNPWXRALL13
Sakriani Sakti, Michael Paul, Andrew M. Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
A-STAR: Toward translating Asian spoken languages. Comput. Speech Lang. 27(2): 509-527 (2013)
[j65]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/YuTLS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/YuTLS13
Jiali Yu, Huajin Tang, Haizhou Li, Luping Shi:
Dynamical properties of continuous attractor neural network with background tuning. Neurocomputing 99: 439-447 (2013)
[j64]
- view
  authority control:
- export record
  dblp key:
  - journals/ijsr/NiculescuDNLL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijsr/NiculescuDNLL13
Andreea I. Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, See Swee Lan:
Making Social Robots More Attractive: The Effects of Voice Pitch, Humor and Empathy. Int. J. Soc. Robotics 5(2): 171-191 (2013)
[j63]
- view
  authority control:
- export record
  dblp key:
  - journals/nca/YuTL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nca/YuTL13
Jiali Yu, Huajin Tang, Haizhou Li:
Continuous attractors of discrete-time recurrent neural networks. Neural Comput. Appl. 23(1): 89-96 (2013)
[j62]
- view
  authority control:
- export record
  dblp key:
  - journals/neco/HuTTLS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/neco/HuTTLS13
Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, Luping Shi:
A Spike-Timing-Based Integrated Model for Pattern Recognition. Neural Comput. 25(2): 450-472 (2013)
[j61]
- view
  authority control:
- export record
  dblp key:
  - journals/pieee/OShaughnessyDL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pieee/OShaughnessyDL13
Douglas D. O'Shaughnessy, Li Deng, Haizhou Li:
Speech Information Processing: Theory and Applications [Scanning the Issue]. Proc. IEEE 101(5): 1034-1037 (2013)
[j60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/pieee/LiML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pieee/LiML13
Haizhou Li, Bin Ma, Kong-Aik Lee:
Spoken Language Recognition: From Fundamentals to Practice. Proc. IEEE 101(5): 1136-1159 (2013)
[j59]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/WangLLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/WangLLML13
Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Shifted-Delta MLP Features for Spoken Language Recognition. IEEE Signal Process. Lett. 20(1): 15-18 (2013)
[j58]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HautamakiKSLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HautamakiKSLML13
Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong-Aik Lee, Bin Ma, Haizhou Li:
Sparse Classifier Fusion for Speaker Verification. IEEE Trans. Speech Audio Process. 21(8): 1622-1631 (2013)
[j57]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/NgLLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NgLLML13
Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Spoken Language Recognition With Prosodic Features. IEEE Trans. Speech Audio Process. 21(9): 1841-1853 (2013)
[j56]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WrightKDHHL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WrightKDHHL13
Stephen J. Wright, Dimitri Kanevsky, Li Deng, Xiaodong He, Georg Heigold, Haizhou Li:
Optimization Algorithms and Applications for Speech and Language Processing. IEEE Trans. Speech Audio Process. 21(11): 2231-2243 (2013)
[j55]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/YuTL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/YuTL13
Jiali Yu, Huajin Tang, Haizhou Li:
Dynamics Analysis of a Population Decoding Model. IEEE Trans. Neural Networks Learn. Syst. 24(3): 498-503 (2013)
[j54]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/YuTTL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/YuTTL13
Qiang Yu, Huajin Tang, Kay Chen Tan, Haizhou Li:
Rapid Feedforward Computation by Temporal Encoding and Learning With Spiking Neurons. IEEE Trans. Neural Networks Learn. Syst. 24(10): 1539-1552 (2013)
[c310]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/LuXLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LuXLML13
Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions. ACL (2) 2013: 190-195
[c309]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ChongBCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChongBCL13
Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li:
Modeling of term-distance and term-occurrence information for improving n-gram language model performance. ACL (2) 2013: 233-237
[c308]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/NguyenMXCLL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/NguyenMXCLL13
Duc Hoang Ha Nguyen, Aleem Mushtaq, Xiong Xiao, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A particle filter compensation approach to robust LVCSR. APSIPA 2013: 1-7
[c307]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WuL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuL13
Zhizheng Wu, Haizhou Li:
Voice conversion and spoofing attack on speaker verification systems. APSIPA 2013: 1-9
[c306]
- view
  authority control:
- export record
  dblp key:
  - conf/asunam/ZhuGPLDS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asunam/ZhuGPLDS13
Linhong Zhu, Sheng Gao, Sinno Jialin Pan, Haizhou Li, Dingxiong Deng, Cyrus Shahabi:
Graph-based informative-sentence selection for opinion summarization. ASONAM 2013: 408-412
[c305]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/WuCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/WuCL13
Zhizheng Wu, Engsiong Chng, Haizhou Li:
Conditional restricted Boltzmann machine for voice conversion. ChinaSIP 2013: 104-108
[c304]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/LyuCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/LyuCL13
Dau-Cheng Lyu, Engsiong Chng, Haizhou Li:
Language diarization for conversational code-switch speech with pronunciation dictionary adaptation. ChinaSIP 2013: 147-150
[c303]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/XiaoCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/XiaoCL13
Xiong Xiao, Engsiong Chng, Haizhou Li:
Constrained adaptation of histogram equalization for robust speech recognition. ChinaSIP 2013: 360-364
[c302]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/discomt/WilliamsBL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/discomt/WilliamsBL13
Jennifer Williams, Rafael Enrique Banchs, Haizhou Li:
Meaning Unit Segmentation in English and Chinese: a New Approach to Discourse Phenomena. DiscoMT@ACL 2013: 1-9
[c301]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DennisYTDL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DennisYTDL13
Jonathan William Dennis, Qiang Yu, Huajin Tang, Tran Huy Dat, Haizhou Li:
Temporal coding of local spectrogram features for robust sound recognition. ICASSP 2013: 803-807
[c300]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuXCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuXCL13
Zhizheng Wu, Xiong Xiao, Engsiong Chng, Haizhou Li:
Synthetic speech detection using temporal modulation feature. ICASSP 2013: 7234-7238
[c299]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LyuCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LyuCL13
Dau-Cheng Lyu, Engsiong Chng, Haizhou Li:
Language diarization for code-switch conversational speech. ICASSP 2013: 7314-7318
[c298]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LarcherLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LarcherLML13
Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances. ICASSP 2013: 7673-7677
[c297]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YouLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YouLML13
Chang Huai You, Haizhou Li, Bin Ma, Kong-Aik Lee:
A study on GMM-SVM with adaptive relevance factor and its comparison with i-vector and JFA for speaker recognition. ICASSP 2013: 7683-7687
[c296]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoCL13
Xiong Xiao, Engsiong Chng, Haizhou Li:
Temporal filter design by minimum KL divergence criterion for robust speech recognition. ICASSP 2013: 7908-7912
[c295]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenML13
Nancy F. Chen, Bin Ma, Haizhou Li:
Minimal-resource phonetic language models to summarize untranscribed speech. ICASSP 2013: 8357-8361
[c294]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AdelVKSLS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AdelVKSLS13
Heike Adel, Ngoc Thang Vu, Franziska Kraus, Tim Schlippe, Haizhou Li, Tanja Schultz:
Recurrent neural network language modeling for code switching conversational speech. ICASSP 2013: 8411-8415
[c293]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuLXML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuLXML13
Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Broadcast news story segmentation using latent topics on data manifold. ICASSP 2013: 8465-8469
[c292]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLLML13
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection. ICASSP 2013: 8545-8549
[c291]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/HuangDL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/HuangDL13
Dong-Yan Huang, Minghui Dong, Haizhou Li:
A dynamic Gaussian process for voice conversion. ICME Workshops 2013: 1-4
[c290]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SethuEAL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SethuEAL13
Vidhyasaharan Sethu, Julien Epps, Eliathamby Ambikairajah, Haizhou Li:
GMM based speaker variability compensated system for interspeech 2013 compare emotion challenge. INTERSPEECH 2013: 205-209
[c289]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoXCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoXCL13
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context-dependent phone mapping for LVCSR of under-resourced languages. INTERSPEECH 2013: 500-504
[c288]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoCL13
Xiong Xiao, Engsiong Chng, Haizhou Li:
Attribute-based histogram equalization (HEQ) and its adaptation for robust speech recognition. INTERSPEECH 2013: 876-880
[c287]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuLLCKL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuLLCKL13
Zhizheng Wu, Anthony Larcher, Kong-Aik Lee, Engsiong Chng, Tomi Kinnunen, Haizhou Li:
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints. INTERSPEECH 2013: 950-954
[c286]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/aizhouHBMMA13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/aizhouHBMMA13
Rahim Saeidi, Kong-Aik Lee, Tomi Kinnunen, Tawfik Hasan, Benoit G. B. Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Jia Min Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilçi, Billy Braithwaite, Rosa González Hautamäki, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, Navid Shokouhi, Driss Matrouf, Laurent El Shafey, Pejman Mowlaee, Julien Epps, Tharmarajah Thiruvaran, David A. van Leeuwen, Bin Ma, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Sébastien Marcel, John S. D. Mason, Eliathamby Ambikairajah:
I4u submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification. INTERSPEECH 2013: 1986-1990
[c285]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLLML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLLML13
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams. INTERSPEECH 2013: 2297-2301
[c284]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenSHML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenSHML13
Nancy F. Chen, Vivaek Shivakumar, Mahesh Harikumar, Bin Ma, Haizhou Li:
Large-scale characterization of Mandarin pronunciation errors made by native speakers of European languages. INTERSPEECH 2013: 2370-2374
[c283]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LarcherBFLLLMP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LarcherBFLLLMP13
Anthony Larcher, Jean-François Bonastre, Benoit G. B. Fauve, Kong-Aik Lee, Christophe Lévy, Haizhou Li, John S. D. Mason, Jean-Yves Parfait:
ALIZE 3.0 - open source toolkit for state-of-the-art speaker recognition. INTERSPEECH 2013: 2768-2772
[c282]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. INTERSPEECH 2013: 3057-3061
[c281]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeLYML13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeLYML13
Kong-Aik Lee, Anthony Larcher, Chang Huai You, Bin Ma, Haizhou Li:
Multi-session PLDA scoring of i-vector for partially open-set speaker detection. INTERSPEECH 2013: 3651-3655
[c280]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/TianSYSTL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/TianSYSTL13
Bo Tian, Vui Ann Shim, Miaolong Yuan, Chithra Srinivasan, Huajin Tang, Haizhou Li:
RGB-D based cognitive map building and navigation. IROS 2013: 1562-1567
[c279]
- view
  authority control:
- export record
  dblp key:
  - conf/ococosda/ChongXXTPLSL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ococosda/ChongXXTPLSL13
Tze Yuang Chong, Xiong Xiao, Haihua Xu, Tien Ping Tan, Chau Khoa Pham, Dau-Cheng Lyu, Chng Eng Siong, Haizhou Li:
The development and analysis of a Malay broadcasr news corpus. O-COCOSDA/CASLRE 2013: 1-5
[c278]
- view
  authority control:
- export record
  dblp key:
  - conf/ro-man/MirnigTHLT13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ro-man/MirnigTHLT13
Nicole Mirnig, Yeow Kee Tan, Boon Siew Han, Haizhou Li, Manfred Tscheligi:
Screen feedback: How to overcome the expressive limitations of a social robot. RO-MAN 2013: 348-349
[c277]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/LiTGL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/LiTGL13
Yanan Li, Keng Peng Tee, Shuzhi Sam Ge, Haizhou Li:
Building Companionship through Human-Robot Collaboration. ICSR 2013: 1-7
[c276]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ssw/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li:
Exemplar-based voice conversion using non-negative spectrogram deconvolution. SSW 2013: 201-206
2012
[j53]
- view
  authority control:
- export record
  dblp key:
  - journals/cim/YanTCLT12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cim/YanTCLT12
Rui Yan, Keng Peng Tee, Yuanwei Chua, Haizhou Li, Huajin Tang:
Gesture Recognition Based on Localist Attractor Networks with Application to Robot Control [Application Notes]. IEEE Comput. Intell. Mag. 7(1): 64-74 (2012)
[j52]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/Li12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/Li12
Haizhou Li:
Foreword. IEICE Trans. Inf. Syst. 95-D(5): 1181 (2012)
[j51]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/WangXLMCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/WangXLMCL12
Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, Haizhou Li:
Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features. IEICE Trans. Inf. Syst. 95-D(5): 1206-1215 (2012)
[j50]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/LengDKL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/LengDKL12
Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Selective Gammatone Envelope Feature for Robust Sound Event Recognition. IEICE Trans. Inf. Syst. 95-D(5): 1229-1237 (2012)
[j49]
- view
  authority control:
- export record
  dblp key:
  - journals/ijhr/TeeYCHL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijhr/TeeYCHL12
Keng Peng Tee, Rui Yan, Yuanwei Chua, Zhiyong Huang, Haizhou Li:
Modular IK: a Robust Inverse Kinematic Algorithm for Gesture Imitation in an Upper-Body Humanoid Robot. Int. J. Humanoid Robotics 9(2) (2012)
[j48]
- view
  authority control:
- export record
  dblp key:
  - journals/ipm/KuoL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ipm/KuoL12
Jin-Shea Kuo, Haizhou Li:
Learning regional transliteration variants. Inf. Process. Manag. 48(1): 154-169 (2012)
[j47]
- view
  authority control:
- export record
  dblp key:
  - journals/prl/DehzangiMCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/prl/DehzangiMCL12
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Discriminative feature extraction for speech recognition using continuous output codes. Pattern Recognit. Lett. 33(13): 1703-1709 (2012)
[j46]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/WuKCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/WuKCL12
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion. IEEE Signal Process. Lett. 19(12): 914-917 (2012)
[j45]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/NweSML12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NweSML12
Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker Clustering and Cluster Purification Methods for RT07 and RT09 Evaluation Meeting Data. IEEE Trans. Speech Audio Process. 20(2): 461-473 (2012)
[j44]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenKZTZWTL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenKZTZWTL12
Wenliang Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, Haizhou Li:
Bitext Dependency Parsing With Auto-Generated Bilingual Treebank. IEEE Trans. Speech Audio Process. 20(5): 1461-1472 (2012)
[j43]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KinnunenSSLSHL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KinnunenSSLSHL12
Tomi Kinnunen, Rahim Saeidi, Filip Sedlak, Kong-Aik Lee, Johan Sandberg, Maria Hansson-Sandsten, Haizhou Li:
Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification. IEEE Trans. Speech Audio Process. 20(7): 1990-2001 (2012)
[j42]
- view
  authority control:
- export record
  dblp key:
  - journals/tsmc/LiYYTL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsmc/LiYYTL12
Liyuan Li, Shuicheng Yan, Xinguo Yu, Yeow Kee Tan, Haizhou Li:
Robust Multiperson Detection and Tracking for Mobile Service and Social Robots. IEEE Trans. Syst. Man Cybern. Part B 42(5): 1398-1412 (2012)
[c275]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/BanchsL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/BanchsL12
Rafael E. Banchs, Haizhou Li:
IRIS: a Chat-oriented Dialogue System based on the Vector Space Model. ACL (System Demonstrations) 2012: 37-42
[c274]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ChenZL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChenZL12
Wenliang Chen, Min Zhang, Haizhou Li:
Utilizing Dependency Language Models for Graph-based Dependency Parsing Models. ACL (1) 2012: 213-222
[c273]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/XiongZL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiongZL12
Deyi Xiong, Min Zhang, Haizhou Li:
Modeling the Translation of Predicate-Argument Structure for SMT. ACL (1) 2012: 902-911
[c272]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/ZhangLKL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/ZhangLKL12
Min Zhang, Haizhou Li, A. Kumaran, Ming Liu:
Whitepaper of NEWS 2012 Shared Task on Machine Transliteration. NEWS@ACL 2012: 1-9
[c271]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/ZhangLKL12a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/ZhangLKL12a
Min Zhang, Haizhou Li, A. Kumaran, Ming Liu:
Report of NEWS 2012 Machine Transliteration Shared Task. NEWS@ACL 2012: 10-20
[c270]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/AmbikairajahKSL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/AmbikairajahKSL12
Eliathamby Ambikairajah, Jia Min Karen Kua, Vidhyasaharan Sethu, Haizhou Li:
PNCC-ivector-SRC based speaker verification. APSIPA 2012: 1-7
[c269]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/WuKCLA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuKCLA12
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li, Eliathamby Ambikairajah:
A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case. APSIPA 2012: 1-5
[c268]
- view
  authority control:
- export record
  dblp key:
  - conf/hri/LiYLWSTL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hri/LiYLWSTL12
Liyuan Li, Xinguo Yu, Jun Li, Gang S. Wang, Ji Yu Shi, Yeow Kee Tan, Haizhou Li:
Vision-based attention estimation and selection for social robot to perform natural interaction in the open world. HRI 2012: 183-184
[c267]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/DoXCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/DoXCL12
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
A Phone Mapping Technique for Acoustic Modeling of Under-Resourced Languages. IALP 2012: 233-236
[c266]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeeADL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeeADL12
Siu Wa Lee, Shen Ting Ang, Minghui Dong, Haizhou Li:
Generalized F0 modelling with absolute and relative pitch features for singing voice synthesis. ICASSP 2012: 429-432
[c265]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL12
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Lasso environment model combination for robust speech recognition. ICASSP 2012: 4305-4308
[c264]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoCL12
Xiong Xiao, Engsiong Chng, Haizhou Li:
Joint spectral and temporal normalization of features for robust recognition of noisy and reverberated speech. ICASSP 2012: 4325-4328
[c263]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KinnunenWLSCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KinnunenWLSCL12
Tomi Kinnunen, Zhizheng Wu, Kong-Aik Lee, Filip Sedlak, Engsiong Chng, Haizhou Li:
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech. ICASSP 2012: 4401-4404
[c262]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LarcherBLMLB12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LarcherBLMLB12
Anthony Larcher, Pierre-Michel Bousquet, Kong-Aik Lee, Driss Matrouf, Haizhou Li, Jean-François Bonastre:
I-vectors in the context of phonetically-constrained short utterances for speaker verification. ICASSP 2012: 4773-4776
[c261]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VuLWTSBCSL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VuLWTSBCSL12
Ngoc Thang Vu, Dau-Cheng Lyu, Jochen Weiner, Dominic Telaar, Tim Schlippe, Fabian Blaicher, Engsiong Chng, Tanja Schultz, Haizhou Li:
A first speech recognition system for Mandarin-English code-switch conversational speech. ICASSP 2012: 4889-4892
[c260]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MisuMKNL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MisuMKNL12
Teruhisa Misu, Etsuo Mizukami, Hideki Kashioka, Satoshi Nakamura, Haizhou Li:
A bootstrapping approach for SLU portability to a new language by inducting unannotated user queries. ICASSP 2012: 4961-4964
[c259]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhengLXML12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhengLXML12
Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Acoustic TextTiling for story segmentation of spoken documents. ICASSP 2012: 5121-5124
[c258]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLLML12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLLML12
Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
An acoustic segment modeling approach to query-by-example spoken term detection. ICASSP 2012: 5157-5160
[c257]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LarcherLML12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LarcherLML12
Anthony Larcher, Kong-Aik Lee, Bin Ma, Haizhou Li:
RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases. INTERSPEECH 2012: 1580-1583
[c256]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangLTMLL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangLTMLL12
Ye Jiang, Kong-Aik Lee, Zhenmin Tang, Bin Ma, Anthony Larcher, Haizhou Li:
PLDA Modeling in I-Vector and Supervector Space for Speaker Verification. INTERSPEECH 2012: 1680-1683
[c255]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuSL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuSL12
Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition. INTERSPEECH 2012: 1700-1703
[c254]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YouLML12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YouLML12
Changhuai You, Haizhou Li, Bin Ma, Kong-Aik Lee:
Effect of Relevance Factor of Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition. INTERSPEECH 2012: 2065-2068
[c253]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/TeeGYL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/TeeGYL12
Keng Peng Tee, Shuzhi Sam Ge, Rui Yan, Haizhou Li:
Adaptive control for robot manipulators under ellipsoidal task space constraints. IROS 2012: 1167-1172
[c252]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/DoXCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/DoXCL12
Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context dependant phone mapping for cross-lingual acoustic modeling. ISCSLP 2012: 16-20
[c251]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LeungML12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LeungML12
Cheung-Chi Leung, Bin Ma, Haizhou Li:
Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers. ISCSLP 2012: 108-111
[c250]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/NguyenXCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/NguyenXCL12
Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong, Haizhou Li:
An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition. ISCSLP 2012: 131-135
[c249]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LeeDL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LeeDL12
Siu Wa Lee, Minghui Dong, Haizhou Li:
A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis. ISCSLP 2012: 150-154
[c248]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/MisuMMKL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/MisuMMKL12
Teruhisa Misu, Shigeki Matsuda, Etsuo Mizukami, Hideki Kashioka, Haizhou Li:
Efficient Language Model Construction for Spoken Dialog Systems by Inducting Language Resources of Different Languages. IWSDS 2012: 101-110
[c247]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/JiangTLDL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/JiangTLDL12
Ridong Jiang, Yeow Kee Tan, Dilip Kumar Limbu, Tran Anh Dung, Haizhou Li:
Component Pluggable Dialogue Framework and Its Application to Social Robots. IWSDS 2012: 225-237
[c246]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/odyssey/HautamakiLLKML12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/HautamakiLLKML12
Ville Hautamäki, Kong-Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, Haizhou Li:
Variational Bayes logistic regression as regularized fusion for NIST SRE 2010. Odyssey 2012: 268-274
[c245]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/odyssey/YouLALM12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/YouLALM12
Chang Huai You, Haizhou Li, Eliathamby Ambikairajah, Kong-Aik Lee, Bin Ma:
Bhattacharyya-based GMM-SVM system with adaptive relevance factor for pair language recognition. Odyssey 2012: 338-345
[c244]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/sltu/WeinerVTMSLCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sltu/WeinerVTMSLCL12
Jochen Weiner, Ngoc Thang Vu, Dominic Telaar, Florian Metze, Tanja Schultz, Dau-Cheng Lyu, Engsiong Chng, Haizhou Li:
Integration of language identification into a recognition system for spoken conversations containing code-Switches. SLTU 2012: 76-79
[e9]
- view
- export record
  dblp key:
  - conf/aclnews/2012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/2012
Min Zhang, Haizhou Li, A. Kumaran:
Proceedings of the 4th Named Entity Workshop, NEWS@ACL 2012, Jeju, Korea, July 12, 2012. Association for Computational Linguistics 2012, ISBN 978-1-937284-40-4 [contents]
[e8]
- view
- export record
  dblp key:
  - conf/odyssey/2012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/2012
Haizhou Li, Bin Ma, Kong-Aik Lee:
Odyssey 2012: The Speaker and Language Recognition Workshop, Singapore, June 25-28, 2012. ISCA 2012 [contents]
[e7]
- view
- export record
  dblp key:
  - conf/siggraph/2012asiaposters
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/siggraph/2012asiaposters
Qunsheng Peng, Haizhou Li:
SIGGRAPH Asia 2012 Poster Proceedings, Singapore, Singapore, November 28 - December 01, 2012. ACM 2012, ISBN 978-1-4503-1911-9 [contents]
2011
[j41]
- view
  authority control:
- export record
  dblp key:
  - journals/cim/TangL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cim/TangL11
Huajin Tang, Haizhou Li:
Information Theoretic Learning: Reny's Entropy and Kernel Perspectives (Principe, J.; 2010) [Book Review]. IEEE Comput. Intell. Mag. 6(3): 60-62 (2011)
[j40]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/DehzangiMCL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/DehzangiMCL11
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Error Corrective Fusion of Classifier Scores for Spoken Language Recognition. IEICE Trans. Inf. Syst. 94-D(12): 2503-2512 (2011)
[j39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ijsr/LiCT11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijsr/LiCT11
Haizhou Li, John-John Cabibihan, Yeow Kee Tan:
Towards an Effective Design of Social Robots. Int. J. Soc. Robotics 3(4): 333-335 (2011)
[j38]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/DennisDL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/DennisDL11
Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions. IEEE Signal Process. Lett. 18(2): 130-133 (2011)
[j37]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhuML11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhuML11
Donglai Zhu, Bin Ma, Haizhou Li:
Speaker Verification With Feature-Space MAPLR Parameters. IEEE Trans. Speech Audio Process. 19(3): 505-515 (2011)
[j36]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LeeYLKS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LeeYLKS11
Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, Khe Chai Sim:
Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 19(4): 861-870 (2011)
[j35]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TranL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TranL11
H. D. Tran, Haizhou Li:
Sound Event Recognition With Probabilistic Distance SVMs. IEEE Trans. Speech Audio Process. 19(6): 1556-1568 (2011)
[j34]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XiongZL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XiongZL11
Deyi Xiong, Min Zhang, Haizhou Li:
A Maximum-Entropy Segmentation Model for Statistical Machine Translation. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2494-2505 (2011)
[j33]
- view
  authority control:
- export record
  dblp key:
  - journals/tomccap/MaddageL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tomccap/MaddageL11
Namunu Chinthaka Maddage, Haizhou Li:
Beat space segmentation and octave scale cepstral feature for sung language recognition in pop music. ACM Trans. Multim. Comput. Commun. Appl. 7(4): 37:1-37:19 (2011)
[c243]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/BanchsL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/BanchsL11
Rafael E. Banchs, Haizhou Li:
AM-FM: A Semantic Framework for Translation Quality Assessment. ACL (2) 2011: 153-158
[c242]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/XiongZL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiongZL11
Deyi Xiong, Min Zhang, Haizhou Li:
Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers. ACL 2011: 1288-1297
[c241]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/ZhangLKL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/ZhangLKL11
Min Zhang, Haizhou Li, A. Kumaran, Ming Liu:
Report of NEWS 2011 Machine Transliteration Shared Task. NEWS@IJCNLP 2011: 1-13
[c240]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/ZhangKL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/ZhangKL11
Min Zhang, A. Kumaran, Haizhou Li:
Whitepaper of NEWS 2011 Shared Task on Machine Transliteration. NEWS@IJCNLP 2011: 14-22
[c239]
- view
  authority control:
- export record
  dblp key:
  - conf/cikm/GaoL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cikm/GaoL11
Sheng Gao, Haizhou Li:
A cross-domain adaptation method for sentiment classification using probabilistic latent analysis. CIKM 2011: 1047-1052
[c238]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ChenKZTZWTL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChenKZTZWTL11
Wenliang Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, Haizhou Li:
SMT Helps Bitext Dependency Parsing. EMNLP 2011: 73-83
[c237]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/LiZCLCL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LiZCLCL11
Zhenghua Li, Min Zhang, Wanxiang Che, Ting Liu, Wenliang Chen, Haizhou Li:
Joint Models for Chinese POS Tagging and Dependency Parsing. EMNLP 2011: 1180-1191
[c236]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DatL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DatL11
Tran Huy Dat, Haizhou Li:
Probabilistic distance SVM with Hellinger-Exponential Kernel for sound event classification. ICASSP 2011: 2272-2275
[c235]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DatL11a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DatL11a
Tran Huy Dat, Haizhou Li:
Jump Function Kolmogorov for overlapping audio event classification. ICASSP 2011: 3696-3699
[c234]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgLLML11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgLLML11
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Score fusion and calibration in multiple language detectors with large performance variation. ICASSP 2011: 4404-4407
[c233]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SedlakKHLL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SedlakKHLL11
Filip Sedlak, Tomi Kinnunen, Ville Hautamäki, Kong-Aik Lee, Haizhou Li:
Classifier subset selection and fusion for speaker verification. ICASSP 2011: 4544-4547
[c232]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLMLGD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLMLGD11
Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai:
Factored covariance modeling for text-independent speaker verification. ICASSP 2011: 4856-4859
[c231]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL11
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Maximum likelihood adaptation of histogram equalization with constraint for robust speech recognition. ICASSP 2011: 5480-5483
[c230]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcnlp/TangXZLZ11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/TangXZLZ11
Guoyu Tang, Yunqing Xia, Min Zhang, Haizhou Li, Fang Zheng:
CLGVSM: Adapting Generalized Vector Space Model to Cross-lingual Document Clustering. IJCNLP 2011: 580-588
[c229]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcnlp/ZhangDLXL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/ZhangDLXL11
Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, Haizhou Li:
Joint Alignment and Artificial Data Generation: An Empirical Study of Pivot-based Machine Transliteration. IJCNLP 2011: 1207-1215
[c228]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LengDKL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LengDKL11
Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Alternative Frequency Scale Cepstral Coefficient for Robust Sound Event Recognition. INTERSPEECH 2011: 297-300
[c227]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoLSL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoLSL11
Xiong Xiao, Jinyu Li, Chng Eng Siong, Haizhou Li:
Feature Normalization Using Structured Full Transforms for Robust Speech Recognition. INTERSPEECH 2011: 693-696
[c226]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangMLW11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangMLW11
Chien-Lin Huang, Bin Ma, Haizhou Li, Chung-Hsien Wu:
Speech Indexing Using Semantic Context Inference. INTERSPEECH 2011: 717-720
[c225]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLS11
Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong:
Target-Aware Lattice Rescoring for Dialect Recognition. INTERSPEECH 2011: 733-736
[c224]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuLXML11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuLXML11
Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li:
Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation. INTERSPEECH 2011: 1109-1112
[c223]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SamXBCLS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SamXBCLS11
Sethserey Sam, Xiong Xiao, Laurent Besacier, Eric Castelli, Haizhou Li, Chng Eng Siong:
Speech Modulation Features for Robust Nonnative Speech Accent Detection. INTERSPEECH 2011: 2417-2420
[c222]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DennisDL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DennisDL11
Jonathan William Dennis, Tran Huy Dat, Haizhou Li:
Image Representation of the Subband Power Distribution for Robust Sound Classification. INTERSPEECH 2011: 2437-2440
[c221]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HautamakiLKML11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HautamakiLKML11
Ville Hautamäki, Kong-Aik Lee, Tomi Kinnunen, Bin Ma, Haizhou Li:
Regularized Logistic Regression Fusion for Speaker Verification. INTERSPEECH 2011: 2745-2748
[c220]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YouLL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YouLL11
Chang Huai You, Haizhou Li, Kong-Aik Lee:
Study on the Relevance Factor of Maximum a Posteriori with GMM for Language Recognition. INTERSPEECH 2011: 2893-2896
[c219]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeYHLL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeYHLL11
Kong-Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, Haizhou Li:
Spoken Language Recognition in the Latent Topic Simplex. INTERSPEECH 2011: 2933-2936
[c218]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/LeeLTML11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeLTML11
Kong-Aik Lee, Anthony Larcher, Helen Thai, Bin Ma, Haizhou Li:
Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home. INTERSPEECH 2011: 3317-3318
[c217]
- view
  authority control:
- export record
  dblp key:
  - conf/mmm/GaoL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmm/GaoL11
Sheng Gao, Haizhou Li:
Effective Large Scale Text Retrieval via Learning Risk-Minimization and Dependency-Embedded Model. MMM (2) 2011: 99-110
[e6]
- view
- export record
  dblp key:
  - conf/aclnews/2011
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/2011
Min Zhang, Haizhou Li, A. Kumaran:
Proceedings of the 3rd Named Entities Workshop, NEWS@IJCNLP 2011, Chiang Mai, Thailand, November 12, 2011. Asian Federation of Natural Language Processing 2011 [contents]
2010
[j32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/coling/XiongZAL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/coling/XiongZAL10
Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li:
Linguistically Annotated Reordering: Evaluation and Analysis. Comput. Linguistics 36(3): 535-568 (2010)
[j31]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/CenDCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/CenDCL10
Ling Cen, Minghui Dong, Paul Y. Chan, Haizhou Li:
Feature Integration and Dimension Reduction in Unit Selection TTS. Int. J. Asian Lang. Process. 20(1): 35-42 (2010)
[j30]
- view
  authority control:
- export record
  dblp key:
  - journals/neco/TangLY10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/neco/TangLY10
Huajin Tang, Haizhou Li, Rui Yan:
Memory Dynamics in Attractor Networks with Saliency Weights. Neural Comput. 22(7): 1899-1926 (2010)
[j29]
- view
  authority control:
- export record
  dblp key:
  - journals/prl/WangCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/prl/WangCL10
Lei Wang, Engsiong Chng, Haizhou Li:
A tree-construction search approach for multivariate time series motifs discovery. Pattern Recognit. Lett. 31(9): 869-875 (2010)
[j28]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/KinnunenL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/KinnunenL10
Tomi Kinnunen, Haizhou Li:
An overview of text-independent speaker recognition: From features to supervectors. Speech Commun. 52(1): 12-40 (2010)
[j27]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/LiM10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/LiM10
Haizhou Li, Bin Ma:
TechWare: Speaker and Spoken Language Recognition Resources [Best of the Web]. IEEE Signal Process. Mag. 27(6): 139-142 (2010)
[j26]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XiaoLCLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XiaoLCLL10
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 18(6): 1158-1169 (2010)
[j25]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YouLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YouLL10
Chang Huai You, Kong-Aik Lee, Haizhou Li:
GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition. IEEE Trans. Speech Audio Process. 18(6): 1300-1312 (2010)
[j24]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/TangLY10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/TangLY10
Huajin Tang, Haizhou Li, Zhang Yi:
A discrete-time neural network for optimization problems with hybrid constraints. IEEE Trans. Neural Networks 21(7): 1184-1189 (2010)
[j23]
- view
  authority control:
- export record
  dblp key:
  - journals/tois/ChiaSLN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tois/ChiaSLN10
Tee Kiah Chia, Khe Chai Sim, Haizhou Li, Hwee Tou Ng:
Statistical lattice-based spoken document retrieval. ACM Trans. Inf. Syst. 28(1): 2:1-2:30 (2010)
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/tomccap/MaddageSL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tomccap/MaddageSL10
Namunu Chinthaka Maddage, Khe Chai Sim, Haizhou Li:
Word level automatic alignment of music and lyrics using vocal synthesis. ACM Trans. Multim. Comput. Commun. Appl. 6(3): 19:1-19:16 (2010)
[c216]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/DuanZL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/DuanZL10
Xiangyu Duan, Min Zhang, Haizhou Li:
Pseudo-Word for Phrase-Based Machine Translation. ACL 2010: 148-156
[c215]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/XiongZL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiongZL10
Deyi Xiong, Min Zhang, Haizhou Li:
Error Detection for Statistical Machine Translation Using Linguistic Features. ACL 2010: 604-611
[c214]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZhangZL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangZL10
Min Zhang, Hui Zhang, Haizhou Li:
Convolution Kernel over Packed Parse Forest. ACL 2010: 875-885
[c213]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/LiKZP10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/LiKZP10
Haizhou Li, A. Kumaran, Min Zhang, Vladimir Pervouchine:
Report of NEWS 2010 Transliteration Generation Shared Task. NEWS@ACL 2010: 1-11
[c212]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/LiKZP10a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/LiKZP10a
Haizhou Li, A. Kumaran, Min Zhang, Vladimir Pervouchine:
Whitepaper of NEWS 2010 Shared Task on Transliteration Generation. NEWS@ACL 2010: 12-20
[c211]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/KumaranKL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/KumaranKL10
A. Kumaran, Mitesh M. Khapra, Haizhou Li:
Report of NEWS 2010 Transliteration Mining Shared Task. NEWS@ACL 2010: 21-28
[c210]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/KumaranKL10a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/KumaranKL10a
A. Kumaran, Mitesh M. Khapra, Haizhou Li:
Whitepaper of NEWS 2010 Shared Task on Transliteration Mining. NEWS@ACL 2010: 29-38
[c209]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/DongCC0010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/DongCC0010
Minghui Dong, Paul Y. Chan, Ling Cen, Bin Ma, Haizhou Li:
I2R Text-to-Speech System for Blizzard Challenge 2010. Blizzard Challenge 2010
[c208]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/LeeAZL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/LeeAZL10
Lianhau Lee, AiTi Aw, Min Zhang, Haizhou Li:
EM-based Hybrid Model for Bilingual Terminology Extraction from Comparable Corpora. COLING (Posters) 2010: 639-646
[c207]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/PervouchineZLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/PervouchineZLL10
Vladimir Pervouchine, Min Zhang, Ming Liu, Haizhou Li:
Improving Name Origin Recognition with Context Features and Unlabelled Data. COLING (Posters) 2010: 972-978
[c206]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/ZhangDPL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ZhangDPL10
Min Zhang, Xiangyu Duan, Vladimir Pervouchine, Haizhou Li:
Machine Transliteration: Leveraging on Third Languages. COLING (Posters) 2010: 1444-1452
[c205]
- view
  authority control:
- export record
  dblp key:
  - conf/ecce/NiculescuDNLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecce/NiculescuDNLL10
Andreea I. Niculescu, Betsy van Dijk, Anton Nijholt, See Swee Lan, Haizhou Li:
How humans behave and evaluate a social robot in real-environment settings. ECCE 2010: 351-352
[c204]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ZhangZLC10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangZLC10
Hui Zhang, Min Zhang, Haizhou Li, Engsiong Chng:
Non-Isomorphic Forest Pair Translation. EMNLP 2010: 440-450
[c203]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/LeeLYKS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/LeeLYKS10
Kong-Aik Lee, Haizhou Li, Chang Huai You, Tomi Kinnunen, Khe Chai Sim:
Discrete expected likelihood kernel for SVM-based speaker verification. EUSIPCO 2010: 591-595
[c202]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/YouLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/YouLL10
Chang Huai You, Haizhou Li, Kong-Aik Lee:
A GMM-supervector approach to language recognition with adaptive relevance factor. EUSIPCO 2010: 1993-1997
[c201]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DatLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DatLL10
Tran Huy Dat, Yi Ren Leng, Haizhou Li:
Feature integration for heart sound biometrics. ICASSP 2010: 1714-1717
[c200]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DehzangiMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DehzangiMCL10
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Error corrective classifier fusion for spoken Language Recognition. ICASSP 2010: 1994-1997
[c199]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TsaoSLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TsaoSLL10
Yu Tsao, Hanwu Sun, Haizhou Li, Chin-Hui Lee:
An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition. ICASSP 2010: 4422-4425
[c198]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SunMKL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SunMKL10
Hanwu Sun, Bin Ma, Swe Zin Kalayar Khine, Haizhou Li:
Speaker diarization system for RT07 and RT09 meeting room audio. ICASSP 2010: 4982-4985
[c197]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhuML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhuML10
Donglai Zhu, Bin Ma, Haizhou Li:
Soft margin estimation of Gaussian mixture model parameters for spoken language recognition. ICASSP 2010: 4990-4993
[c196]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KumarLTMBC10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarLTMBC10
C. Santhosh Kumar, Haizhou Li, Rong Tong, Pavel Matejka, Lukás Burget, Jan Cernocký:
Tuning phone decoders for language identification. ICASSP 2010: 5010-5013
[c195]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgLLML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgLLML10
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Prosodic attribute model for spoken language identification. ICASSP 2010: 5022-5025
[c194]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BaiHML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BaiHML10
Shuanhu Bai, Chien-Lin Huang, Bin Ma, Haizhou Li:
Semi-supervised learning of language model using unsupervised topic model. ICASSP 2010: 5386-5389
[c193]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/NweDCWML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/NweDCWML10
Tin Lay Nwe, Minghui Dong, Paul Y. Chan, Xi Wang, Bin Ma, Haizhou Li:
Voice conversion: From spoken vowels to singing vowels. ICME 2010: 1421-1426
[c192]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/DehzangiMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/DehzangiMCL10
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Framewise Phone Classification Using Weighted Fuzzy Classification Rules. ICPR 2010: 4186-4189
[c191]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/TeeYL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/TeeYL10
Keng Peng Tee, Rui Yan, Haizhou Li:
Adaptive admittance control of a robot manipulator under task space constraint. ICRA 2010: 5181-5186
[c190]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SunMHNL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SunMHNL10
Hanwu Sun, Bin Ma, Chien-Lin Huang, Trung Hieu Nguyen, Haizhou Li:
The IIR NIST SRE 2008 and 2010 summed channel speaker recognition systems. INTERSPEECH 2010: 366-369
[c189]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangSML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangSML10
Chien-Lin Huang, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker characterization using long-term and temporal information. INTERSPEECH 2010: 370-373
[c188]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLC10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLC10
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Selecting phonotactic features for language recognition. INTERSPEECH 2010: 737-740
[c187]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLMLGD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLMLGD10
Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai:
The estimation and kernel metric of spectral correlation for text-independent speaker verification. INTERSPEECH 2010: 1065-1068
[c186]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangXMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangXMCL10
Xiaoxuan Wang, Lei Xie, Bin Ma, Engsiong Chng, Haizhou Li:
Phoneme lattice based texttiling towards multilingual story segmentation. INTERSPEECH 2010: 1305-1308
[c185]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuMLLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuMLLL10
Donglai Zhu, Bin Ma, Kong-Aik Lee, Cheung-Chi Leung, Haizhou Li:
MAP estimation of subspace transform for speaker recognition. INTERSPEECH 2010: 1465-1468
[c184]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HautamakiKNLML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HautamakiKNLML10
Ville Hautamäki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong-Aik Lee, Bin Ma, Haizhou Li:
Approaching human listener accuracy with modern speaker verification. INTERSPEECH 2010: 1473-1476
[c183]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NweSML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NweSML10
Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li:
Speaker diarization in meeting audio for single distant microphone. INTERSPEECH 2010: 1505-1508
[c182]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuKCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuKCL10
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Text-independent F0 transformation with non-parallel data for voice conversion. INTERSPEECH 2010: 1732-1735
[c181]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NgLHLML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgLHLML10
Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma, Haizhou Li:
Towards long-range prosodic attribute modeling for language recognition. INTERSPEECH 2010: 1792-1795
[c180]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LyuTCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LyuTCL10
Dau-Cheng Lyu, Tien Ping Tan, Engsiong Chng, Haizhou Li:
SEAME: a Mandarin-English code-switching speech corpus in south-east asia. INTERSPEECH 2010: 1986-1989
[c179]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DehzangiMCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DehzangiMCL10
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
A discriminative performance metric for GMM-UBM speaker identification. INTERSPEECH 2010: 2114-2117
[c178]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LengDKL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LengDKL10
Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Selective gammatone filterbank feature for robust sound event recognition. INTERSPEECH 2010: 2246-2249
[c177]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeungZLML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeungZLML10
Cheung-Chi Leung, Donglai Zhu, Kong-Aik Lee, Bin Ma, Haizhou Li:
Incorporating MAP estimation and covariance transform for SVM based speaker recognition. INTERSPEECH 2010: 2318-2321
[c176]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YouLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YouLL10
Chang Huai You, Haizhou Li, Kong-Aik Lee:
A hybrid modeling strategy for GMM-SVM speaker recognition with adaptive relevance factor. INTERSPEECH 2010: 2746-2749
[c175]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DongCCLTK10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DongCCLTK10
Minghui Dong, Paul Y. Chan, Ling Cen, Haizhou Li, Jason Teo, Ping Jen Kua:
Phonetic segmentation of singing voice using MIDI and parallel speech. INTERSPEECH 2010: 2890-2893
[c174]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/DongCCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/DongCCL10
Minghui Dong, Paul Y. Chan, Ling Cen, Haizhou Li:
Aligning singing voice with MIDI melody using synthesized audio signal. ISCSLP 2010: 95-98
[c173]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/HuangL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/HuangL10
Chien-Lin Huang, Haizhou Li:
UBM data selection for effective speaker modeling. ISCSLP 2010: 162-165
[c172]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangGDLML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangGDLML10
Eryu Wang, Wu Guo, Li-Rong Dai, Kong-Aik Lee, Bin Ma, Haizhou Li:
Factor analysis based spatial correlation modeling for speaker verification. ISCSLP 2010: 166-170
[c171]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/BaiLHML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/BaiLHML10
Shuanhu Bai, Cheung-Chi Leung, Chien-Lin Huang, Bin Ma, Haizhou Li:
Building topic mixture language models using the document soft classification notion of topic models. ISCSLP 2010: 229-232
[c170]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChanDCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChanDCL10
Paul Yaozhu Chan, Minghui Dong, Ling Cen, Haizhou Li:
The psychoacoustic approach towards enhancing speech intelligibility in noise. ISCSLP 2010: 238-241
[c169]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/SunML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/SunML10
Hanwu Sun, Bin Ma, Haizhou Li:
Frame selection of interview channel for NIST speaker recognition evaluation. ISCSLP 2010: 305-308
[c168]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/CenCDL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/CenCDL10
Ling Cen, Paul Y. Chan, Minghui Dong, Haizhou Li:
Generating emotional speech from neutral speech. ISCSLP 2010: 383-386
[c167]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/DuanBLXAZL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/DuanBLXAZL10
Xiangyu Duan, Rafael E. Banchs, Jun Lang, Deyi Xiong, AiTi Aw, Min Zhang, Haizhou Li:
I²r's machine translation system for IWSLT 2010. IWSLT 2010: 67-72
[c166]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/naacl/XiongZL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/XiongZL10
Deyi Xiong, Min Zhang, Haizhou Li:
Learning Translation Boundaries for Phrase-Based Decoding. HLT-NAACL 2010: 136-144
[c165]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/odyssey/NgLLML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/NgLLML10
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li:
Detection target dependent score calibration for language recognition. Odyssey 2010: 18
[c164]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/odyssey/LeungML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LeungML10
Cheung-Chi Leung, Bin Ma, Haizhou Li:
Parallel Acoustic Model Adaptation for Improving Phonotactic Language Recognition. Odyssey 2010: 41
[c163]
- view
  authority control:
- export record
  dblp key:
  - conf/ro-man/HanWTL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ro-man/HanWTL10
Boon Siew Han, Alvin Hong Yee Wong, Yeow Kee Tan, Haizhou Li:
Using design methodology to enhance interaction for a robotic receptionist. RO-MAN 2010: 797-802
[c162]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/sltu/Li10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sltu/Li10
Haizhou Li:
BISTRA: Malay-English bidirectional speech translation. SLTU 2010: 1
[c161]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/sltu/SamBCMLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sltu/SamBCMLL10
Sethserey Sam, Laurent Besacier, Eric Castelli, Bin Ma, Cheung-Chi Leung, Haizhou Li:
Autonomous acoustic model adaptation for multilingual meeting transcription involving high- and low-resourced languages. SLTU 2010: 116-121
[c160]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/YanTL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/YanTL10
Rui Yan, Keng Peng Tee, Haizhou Li:
Nonlinear Control of a Robot Manipulator with Time-Varying Uncertainties. ICSR 2010: 202-211
[c159]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ssw/DongCCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/DongCCL10
Minghui Dong, Ling Cen, Paul Y. Chan, Haizhou Li:
Considering readability in text-to-speech recording script design. SSW 2010: 312-316
[e5]
- view
- export record
  dblp key:
  - conf/aclnews/2010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/2010
A. Kumaran, Haizhou Li:
Proceedings of the 2010 Named Entities Workshop, NEWS@ACL 2010, Uppsala, Sweden, July 16, 2010. Association for Computational Linguistics 2010, ISBN 978-1-932432-78-7 [contents]
[e4]
- view
  authority control:
- export record
  dblp key:
  - conf/socrob/2010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/socrob/2010
Shuzhi Sam Ge, Haizhou Li, John-John Cabibihan, Yeow Kee Tan:
Social Robotics - Second International Conference on Social Robotics, ICSR 2010, Singapore, November 23-24, 2010. Proceedings. Lecture Notes in Computer Science 6414, Springer 2010, ISBN 978-3-642-17247-2 [contents]

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j21]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/DongCCL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/DongCCL09
Minghui Dong, Ling Cen, Paul Y. Chan, Haizhou Li:
Readability Consideration in Speech Synthesis Recording Script Selection. Int. J. Asian Lang. Process. 19(2): 45-54 (2009)
[j20]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/HuangLM09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/HuangLM09
Chien-Lin Huang, Haizhou Li, Bin Ma:
Speaker Characterization using Average Filtering and Two Space Fusions. Int. J. Asian Lang. Process. 19(3): 85-94 (2009)
[j19]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/NgLLML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/NgLLML09
Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Analysis and Selection of Prosodic Features for Asian Language Recognition. Int. J. Asian Lang. Process. 19(4): 139-152 (2009)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/YouLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/YouLL09
Chang Huai You, Kong-Aik Lee, Haizhou Li:
An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition. IEEE Signal Process. Lett. 16(1): 49-52 (2009)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/talip/WuL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/talip/WuL09
Chung-Hsien Wu, Haizhou Li:
Introduction to the Special Issue on Recent Advances in Asian Language Spoken Document Retrieval. ACM Trans. Asian Lang. Inf. Process. 8(1): 1:1-1:3 (2009)
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TongMLS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TongMLS09
Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong:
A Target-Oriented Phonotactic Front-End for Spoken Language Recognition. IEEE Trans. Speech Audio Process. 17(7): 1335-1347 (2009)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/tsp/DatL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsp/DatL09
Tran Huy Dat, Haizhou Li:
Jump function Kolmogorov for audio classification in noise-mismatch conditions. IEEE Trans. Signal Process. 57(8): 2908-2918 (2009)
[c158]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/LeeAVMZL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LeeAVMZL09
Lianhau Lee, AiTi Aw, Thuy Vu, Sharifah Aljunied Mahani, Min Zhang, Haizhou Li:
MARS: Multilingual Access and Retrieval System with Enhanced Query Translation and Document Retrieval. ACL/IJCNLP (Software Demonstrations) 2009: 21-24
[c157]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/PervouchineLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/PervouchineLL09
Vladimir Pervouchine, Haizhou Li, Bo Lin:
Transliteration Alignment. ACL/IJCNLP 2009: 136-144
[c156]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZhangZLAT09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangZLAT09
Hui Zhang, Min Zhang, Haizhou Li, AiTi Aw, Chew Lim Tan:
Forest-based Tree Sequence to String Translation Model. ACL/IJCNLP 2009: 172-180
[c155]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/XiongZAL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiongZAL09
Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li:
A Syntax-Driven Bracketing Model for Phrase-Based Translation. ACL/IJCNLP 2009: 315-323
[c154]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/SetiawanKLR09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/SetiawanKLR09
Hendra Setiawan, Min-Yen Kan, Haizhou Li, Philip Resnik:
Topological Ordering of Function Words in Hierarchical Phrase-based Translation. ACL/IJCNLP 2009: 324-332
[c153]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ChenZLA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChenZLA09
Boxing Chen, Min Zhang, Haizhou Li, AiTi Aw:
A Comparative Study of Hypothesis Alignment and its Improvement for Machine Translation System Combination. ACL/IJCNLP 2009: 941-948
[c152]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/LiKPZ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/LiKPZ09
Haizhou Li, A. Kumaran, Vladimir Pervouchine, Min Zhang:
Report of NEWS 2009 Machine Transliteration Shared Task. NEWS@IJCNLP 2009: 1-18
[c151]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/aclnews/LiKZP09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/LiKZP09
Haizhou Li, A. Kumaran, Min Zhang, Vladimir Pervouchine:
Whitepaper of NEWS 2009 Machine Transliteration Shared Task. NEWS@IJCNLP 2009: 19-26
[c150]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/XiaoLCLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/XiaoLCLL09
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A study on hidden Markov model's generalization capability for speech recognition. ASRU 2009: 255-260
[c149]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SaktiKPHSNPWXRALL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaktiKPHSNPWXRALL09
Sakriani Sakti, Noriyuki Kimura, Michael Paul, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
The Asian network-based speech-to-speech translation system. ASRU 2009: 507-512
[c148]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/DongCCHZ0009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/DongCCHZ0009
Minghui Dong, Ling Cen, Paul Y. Chan, Dongyan Huang, Donglai Zhu, Bin Ma, Haizhou Li:
I2R Text-to-Speech System for Blizzard Challenge 2009. Blizzard Challenge 2009
[c147]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/YanLDT09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/YanLDT09
Rui Yan, Haizhou Li, Zhao Yang Dong, Huajin Tang:
Nonlinear control approaches for SI engine model with uncertainties. CDC 2009: 5440-5445
[c146]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ZhangL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangL09
Min Zhang, Haizhou Li:
Tree Kernel-based SVM with Structured Syntactic Knowledge for BTG-based Phrase Reordering. EMNLP 2009: 698-707
[c145]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ZhangZLT09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangZLT09
Hui Zhang, Min Zhang, Haizhou Li, Chew Lim Tan:
Fast Translation Rule Matching for Syntax-based Statistical Machine Translation. EMNLP 2009: 1037-1045
[c144]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ZhangZTL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangZTL09
Hui Zhang, Min Zhang, Chew Lim Tan, Haizhou Li:
K-Best Combination of Syntactic Parsers. EMNLP 2009: 1552-1560
[c143]
- view
  authority control:
- export record
  dblp key:
  - conf/fira/LimbuTWJWLHYD009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/fira/LimbuTWJWLHYD009
Dilip Kumar Limbu, Yeow Kee Tan, Chern Yuen Wong, Ridong Jiang, Hengxin Wu, Liyuan Li, Kah Eng Hoe, Xinguo Yu, Li Dong, Haizhou Li:
Experiences with a Barista Robot, FusionBot. FIRA 2009: 140-151
[c142]
- view
  authority control:
- export record
  dblp key:
  - conf/hci/TanKJLHYDWL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hci/TanKJLHYDWL09
Yeow Kee Tan, Dilip Kumar Limbu, Ridong Jiang, Liyuan Li, Kah Eng Hoe, Xinguo Yu, Li Dong, Chern Yuen Wong, Haizhou Li:
An Interactive Robot Butler. HCI (2) 2009: 385-394
[c141]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/NgLLML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/NgLLML09
Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li:
Analysis and Selection of Prosodic Features for Language Identification. IALP 2009: 123-128
[c140]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/DongCCL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/DongCCL09
Minghui Dong, Ling Cen, Paul Y. Chan, Haizhou Li:
Refining Unit Boundaries for Mandarin Text-to-Speech Database. IALP 2009: 245-248
[c139]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/BaiZL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/BaiZL09
Shuanhu Bai, Min Zhang, Haizhou Li:
Semi-supervised Learning of Domain-Specific Language Models from General Domain Data. IALP 2009: 273-279
[c138]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/LeungTML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/LeungTML09
Cheung-Chi Leung, Rong Tong, Bin Ma, Haizhou Li:
A Lattice-Based Phonotactic Language Recognition System with CMLLR Adaptation and Its Implementation Issues. IALP 2009: 285-288
[c137]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DatL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DatL09
Tran Huy Dat, Haizhou Li:
Sound event classification based on Feature Integration, Recursive Feature Elimination and Structured Classification. ICASSP 2009: 177-180
[c136]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhuML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhuML09
Donglai Zhu, Bin Ma, Haizhou Li:
Joint map adaptation of feature transformation and Gaussian Mixture Model for speaker recognition. ICASSP 2009: 4045-4048
[c135]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NweSLR09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NweSLR09
Tin Lay Nwe, Hanwu Sun, Haizhou Li, Susanto Rahardja:
Speaker diarization in meeting audio. ICASSP 2009: 4073-4076
[c134]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NguyenLS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NguyenLS09
Trung Hieu Nguyen, Haizhou Li, Chng Eng Siong:
Cluster criterion functions in spectral subspace and their application in speaker clustering. ICASSP 2009: 4085-4088
[c133]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Kärkkäinen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Li-Rong Dai, Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Chng Eng Siong, Tanja Schultz, Qin Jin:
The I4U system in NIST 2008 speaker recognition evaluation. ICASSP 2009: 4201-4204
[c132]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YouLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YouLL09
Chang Huai You, Kong-Aik Lee, Haizhou Li:
A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition. ICASSP 2009: 4221-4224
[c131]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LongMLGSD09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LongMLGSD09
Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Chng Eng Siong, Li-Rong Dai:
Exploiting prosodic information for Speaker Recognition. ICASSP 2009: 4225-4228
[c130]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NosratighodsTEAML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NosratighodsTEAML09
Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Bin Ma, Haizhou Li:
Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE. ICASSP 2009: 4233-4236
[c129]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SunML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SunML09
Hanwu Sun, Bin Ma, Haizhou Li:
Cross-validation of multiple language recognition systems using pseudo keys. ICASSP 2009: 4353-4356
[c128]
- view
  authority control:
- export record
  dblp key:
  - conf/iccpol/KuoLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccpol/KuoLL09
Jin-Shea Kuo, Haizhou Li, Chih-Lung Lin:
Harvesting Regional Transliteration Variants with Guided Search. ICCPOL 2009: 133-144
[c127]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangSL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangSL09
Lei Wang, Chng Eng Siong, Haizhou Li:
Efficient sparse self-similarity matrix construction for repeating sequence detection. ICME 2009: 458-461
[c126]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/MaZL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/MaZL09
Bin Ma, Donglai Zhu, Haizhou Li:
Acoustic segment modeling for speaker recognition. ICME 2009: 1668-1671
[c125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLCL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLCL09
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng, Kong-Aik Lee:
Target-aware language models for spoken language recognition. INTERSPEECH 2009: 200-203
[c124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SunNML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SunNML09
Hanwu Sun, Tin Lay Nwe, Bin Ma, Haizhou Li:
Speaker diarization for meeting room audio. INTERSPEECH 2009: 900-903
[c123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CenDCL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CenDCL09
Ling Cen, Minghui Dong, Paul Y. Chan, Haizhou Li:
Unit selection based speech synthesis for poor channel condition. INTERSPEECH 2009: 2075-2078
[c122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuML09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuML09
Donglai Zhu, Bin Ma, Haizhou Li:
Large margin estimation of Gaussian mixture model parameters with extended baum-welch for spoken language recognition. INTERSPEECH 2009: 2179-2182
[c121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DehzangiMCL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DehzangiMCL09
Omid Dehzangi, Bin Ma, Engsiong Chng, Haizhou Li:
Discriminative feature transformation using output coding for speech recognition. INTERSPEECH 2009: 2979-2982
[c120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimL09
Khe Chai Sim, Haizhou Li:
Stream-based context-sensitive phone mapping for cross-lingual speech recognition. INTERSPEECH 2009: 3019-3022
[c119]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/DuanXZZL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/DuanXZZL09
Xiangyu Duan, Deyi Xiong, Hui Zhang, Min Zhang, Haizhou Li:
I²r's machine translation system for IWSLT 2009. IWSLT 2009: 50-54
[c118]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/DuanXZZL09a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/DuanXZZL09a
Xiangyu Duan, Deyi Xiong, Hui Zhang, Min Zhang, Haizhou Li:
I2R's machine translation system for IWSLT 2009. IWSLT (Evaluation Campaign) 2009
[c117]
- view
  authority control:
- export record
  dblp key:
  - conf/micad/WongLLLW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micad/WongLLLW09
D. W. K. Wong, J. Liu, J. H. Lim, Haizhou Li, Tien Yin Wong:
Automated detection of kinks from blood vessels for optic cup segmentation in retinal images. Computer-Aided Diagnosis 2009: 72601J
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/micad/LiuWLLTW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micad/LiuWLLTW09
J. Liu, Damon Wing Kee Wong, J. H. Lim, Haizhou Li, Ngan Meng Tan, Tien Yin Wong:
ARGALI: an automatic cup-to-disc ratio measurement system for glaucoma detection and AnaLysIs framework. Computer-Aided Diagnosis 2009: 72603K
[c115]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mtsummit/XiongZAL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mtsummit/XiongZAL09
Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li:
Efficient Beam Thresholding for Statistical Machine Translation. MTSummit 2009
[c114]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mtsummit/XiongZAL09a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mtsummit/XiongZAL09a
Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li:
A Source Dependency Model for Statistical Machine Translation. MTSummit 2009
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/ro-man/HanHTNYCYL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ro-man/HanHTNYCYL09
Boon Siew Han, Wee Kiat Ho, Adrian Hwang Jian Tay, Tzer Liang Ng, Ai Ping Yow, I-Ming Chen, Song Huat Yeo, Haizhou Li:
A life-size robotic lion dance system with integrated motion control. RO-MAN 2009: 687-692
[e3]
- view
- export record
  dblp key:
  - conf/aclnews/2009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aclnews/2009
Haizhou Li, A. Kumaran:
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, NEWS@IJCNLP 2009, Singapore, August 7, 2009. Association for Computational Linguistics 2009, ISBN 978-1-932432-57-2 [contents]
[e2]
- view
- export record
  dblp key:
  - conf/ialp/2009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/2009
Min Zhang, Haizhou Li, Kim-Teng Lua, Minghui Dong:
2009 International Conference on Asian Language Processing, IALP 2009, Singapore, December 7-9, 2009. IEEE Computer Society 2009, ISBN 978-0-7695-3904-1 [contents]
2008
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/ijcpol/KwongL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijcpol/KwongL08
Oi Yee Kwong, Haizhou Li:
Guest Editors' Introduction. Int. J. Comput. Process. Orient. Lang. 21(2): 97-99 (2008)
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/ijcpol/LiKSL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijcpol/LiKSL08
Haizhou Li, Jin-Shea Kuo, Jian Su, Chih-Lung Lin:
Mining Live Transliterations Using Incremental Learning Algorithms. Int. J. Comput. Process. Orient. Lang. 21(2): 183-203 (2008)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/jasis/KuoLY08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jasis/KuoLY08
Jin-Shea Kuo, Haizhou Li, Ying-Kuei Yang:
Active learning for constructing transliteration lexicons from the Web. J. Assoc. Inf. Sci. Technol. 59(1): 126-135 (2008)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SimL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SimL08
Khe Chai Sim, Haizhou Li:
On Acoustic Diversification Front-End for Spoken Language Identification. IEEE Trans. Speech Audio Process. 16(5): 1029-1037 (2008)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhuLML08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhuLML08
Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Optimizing the Performance of Spoken Language Recognition With Discriminative Training. IEEE Trans. Speech Audio Process. 16(8): 1642-1653 (2008)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XiaoSL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XiaoSL08
Xiong Xiao, Chng Eng Siong, Haizhou Li:
Normalization of the Speech Modulation Spectra for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 16(8): 1662-1674 (2008)
[c112]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/XiongZAL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiongZAL08
Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li:
A Linguistically Annotated Reordering Model for BTG-based Statistical Machine Translation. ACL (2) 2008: 149-152
[c111]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ChenZAL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChenZAL08
Boxing Chen, Min Zhang, AiTi Aw, Haizhou Li:
Exploiting N-best Hypotheses for SMT Self-Enhancement. ACL (2) 2008: 157-160
[c110]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZhangJALTL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangJALTL08
Min Zhang, Hongfei Jiang, AiTi Aw, Haizhou Li, Chew Lim Tan, Sheng Li:
A Tree Sequence Alignment-based Tree-to-Tree Translation Model. ACL 2008: 559-567
[c109]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/biostec/PervouchineLZCL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/biostec/PervouchineLZCL08
Vladimir Pervouchine, Graham Leedham, Haishan Zhong, David Cho, Haizhou Li:
Comparative Study of Several Novel Acoustic Features for Speaker Recognition. BIOSIGNALS (1) 2008: 220-223
[c108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/DongZ0008
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/DongZ0008
Minghui Dong, Donglai Zhu, Bin Ma, Haizhou Li:
I2R's Submission to Blizzard Challenge 2008. Blizzard Challenge 2008
[c107]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/ChenZAL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ChenZAL08
Boxing Chen, Min Zhang, AiTi Aw, Haizhou Li:
Regenerating Hypotheses for Statistical Machine Translation. COLING 2008: 105-112
[c106]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/XiongZAL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/XiongZAL08
Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li:
Linguistically Annotated BTG for Statistical Machine Translation. COLING 2008: 1009-1016
[c105]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/ZhangJLAL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ZhangJLAL08
Min Zhang, Hongfei Jiang, Haizhou Li, AiTi Aw, Sheng Li:
Grammar Comparison Study for Translational Equivalence Modeling and Statistical Machine Translation. COLING 2008: 1097-1104
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KhineNL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KhineNL08
Swe Zin Kalayar Khine, Tin Lay Nwe, Haizhou Li:
Singing voice detection in pop songs using co-training algorithm. ICASSP 2008: 1629-1632
[c103]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NweL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NweL08
Tin Lay Nwe, Haizhou Li:
On fusion of timbre-motivated features for singing voice detection and singer identification. ICASSP 2008: 2225-2228
[c102]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DatL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DatL08
Tran Huy Dat, Haizhou Li:
Jump function komogorov and its application for audio stream segmentation and classification. ICASSP 2008: 3353-3356
[c101]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeeYL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeeYL08
Kong-Aik Lee, Changhuai You, Haizhou Li:
Spoken Language recognition using support vector machines with generative front-end. ICASSP 2008: 4153-4156
[c100]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhuLML08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhuLML08
Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Discriminative learning for optimizing detection performance in spoken language recognition. ICASSP 2008: 4161-4164
[c99]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongMLC08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongMLC08
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Target-oriented phone tokenizers for spoken language recognition. ICASSP 2008: 4221-4224
[c98]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimL08
Khe Chai Sim, Haizhou Li:
Robust phone set mapping using decision tree clustering for cross-lingual phone recognition. ICASSP 2008: 4309-4312
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/YouRL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/YouRL08
Chang Huai You, Susanto Rahardja, Haizhou Li:
Speech enhancement for telephony name speech recognition. ICME 2008: 973-976
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/HuangWLHM08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/HuangWLHM08
Chien-Lin Huang, Chung-Hsien Wu, Haizhou Li, Chia-Hsin Hsieh, Bin Ma:
Unsupervised pronunciation grammar growing using knowledge-based and data-driven approaches. ICME 2008: 1097-1100
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/DehzangiMCL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/DehzangiMCL08
Omid Dehzangi, Bin Ma, Chng Eng Siong, Haizhou Li:
Fuzzy rule selection using Iterative Rule Learning for speech data classification. ICPR 2008: 1-4
[c94]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcnlp/KuoLL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/KuoLL08
Jin-Shea Kuo, Haizhou Li, Chih-Lung Lin:
Mining Transliterations from Web Query Results: An Incremental Approach. IJCNLP 2008: 16-23
[c93]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcnlp/ZhangSLATW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/ZhangSLATW08
Min Zhang, Chengjie Sun, Haizhou Li, AiTi Aw, Chew Lim Tan, Xiaolong Wang:
Name Origin Recognition Using Maximum Entropy Model and Diverse Features. IJCNLP 2008: 56-63
[c92]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcnlp/KuoL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/KuoL08
Jin-Shea Kuo, Haizhou Li:
Multi-View Co-Training of Transliteration Model. IJCNLP 2008: 373-380
[c91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NguyenCL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NguyenCL08
Trung Hieu Nguyen, Engsiong Chng, Haizhou Li:
T-test distance and clustering criterion for speaker diarization. INTERSPEECH 2008: 36-39
[c90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TongMLC08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TongMLC08
Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng:
Target-oriented phone selection from universal phone set for spoken language recognition. INTERSPEECH 2008: 715-718
[c89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KhineNL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KhineNL08
Swe Zin Kalayar Khine, Tin Lay Nwe, Haizhou Li:
Speech/laughter classification in meeting audio. INTERSPEECH 2008: 793-796
[c88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuML08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuML08
Donglai Zhu, Bin Ma, Haizhou Li:
Using MAP estimation of feature transformation for speaker recognition. INTERSPEECH 2008: 849-852
[c87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeYLKZ08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeYLKZ08
Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen, Donglai Zhu:
Characterizing speech utterances for speaker verification with sequence kernel SVM. INTERSPEECH 2008: 1397-1400
[c86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DatL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DatL08
Tran Huy Dat, Haizhou Li:
Speaker identification in noise mismatch conditions based on jump function Kolmogorov analysis in wavelet domain. INTERSPEECH 2008: 1469-1472
[c85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangMWML08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangMWML08
Chien-Lin Huang, Bin Ma, Chung-Hsien Wu, Brian Mak, Haizhou Li:
Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions. INTERSPEECH 2008: 1897-1900
[c84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NweDKL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NweDKL08
Tin Lay Nwe, Minghui Dong, Swe Zin Kalayar Khine, Haizhou Li:
Multi-speaker meeting audio segmentation. INTERSPEECH 2008: 2522-2525
[c83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaddageL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaddageL08
Namunu Chinthaka Maddage, Haizhou Li:
Rhythm based music segmentation and octave scale cepstral features for sung language recognition. INTERSPEECH 2008: 2526-2529
[c82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimL08
Khe Chai Sim, Haizhou Li:
Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition. INTERSPEECH 2008: 2715-2718
[c81]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XiaoSL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XiaoSL08
Xiong Xiao, Chng Eng Siong, Haizhou Li:
Effect of Feature Smoothing for Robust Speech Recognition. ISCSLP 2008: 73-76
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/DehzangiMSL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/DehzangiMSL08
Omid Dehzangi, Bin Ma, Chng Eng Siong, Haizhou Li:
Discriminative Output Coding Features for Speech Recognition. ISCSLP 2008: 89-92
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/DongL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/DongL08
Minghui Dong, Haizhou Li:
Predicting Spectral and Prosodic Parameters for Unit Selection in Speech Synthesis. ISCSLP 2008: 133-136
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/SunML08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/SunML08
Hanwu Sun, Bin Ma, Haizhou Li:
Using Pseudo-Key for Language Recognition System Design. ISCSLP 2008: 173-176
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/YouLML08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YouLML08
Chang Huai You, Kong-Aik Lee, Bin Ma, Haizhou Li:
Self-Organized Clustering for Feature Mapping in Language Recognition. ISCSLP 2008: 177-180
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/SunML08a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/SunML08a
Hanwu Sun, Bin Ma, Haizhou Li:
An Efficient Feature Selection Method for Speaker Recognition. ISCSLP 2008: 181-184
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/BaiL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/BaiL08
Shuanhu Bai, Haizhou Li:
PLSA Based Topic Mixture Language Modeling Approach. ISCSLP 2008: 185-188
[c74]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/ChenXZAL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/ChenXZAL08
Boxing Chen, Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li:
I²r multi-pass machine translation system for IWSLT 2008. IWSLT 2008: 46-51
[c73]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/KhalilovCHFHMBCZAL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/KhalilovCHFHMBCZAL08
Maxim Khalilov, Marta R. Costa-jussà, Carlos A. Henríquez Q., José A. R. Fonollosa, Adolfo Hernández, José B. Mariño, Rafael E. Banchs, Boxing Chen, Min Zhang, AiTi Aw, Haizhou Li:
The TALP&I2r SMT systems for IWSLT 2008. IWSLT 2008: 116-123
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/mmm/MaddageKL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmm/MaddageKL08
Namunu Chinthaka Maddage, Mohan S. Kankanhalli, Haizhou Li:
Effectiveness of Signal Segmentation for Music Content Representation. MMM 2008: 477-486
[c71]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/odyssey/KinnunenLL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/KinnunenLL08
Tomi Kinnunen, Kong-Aik Lee, Haizhou Li:
Dimension reduction of the modulation spectrogram for speaker verification. Odyssey 2008: 30
[c70]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/paclic/LiMLSSTZY08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/paclic/LiMLSSTZY08
Haizhou Li, Bin Ma, Kong-Aik Lee, Khe Chai Sim, Hanwu Sun, Rong Tong, Donglai Zhu, Changhuai You:
NIST 2007 Language Recognition Evaluation: From the Perspective of IIR. PACLIC 2008: 46-57
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/ChiaSLN08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/ChiaSLN08
Tee Kiah Chia, Khe Chai Sim, Haizhou Li, Hwee Tou Ng:
A lattice-based approach to query-by-example spoken document retrieval. SIGIR 2008: 363-370
2007
[j8]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/DongLN07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/DongLN07
Minghui Dong, Haizhou Li, Tin Lay Nwe:
Evaluating Prosody of Mandarin Speech for Language Learning. J. Chin. Lang. Comput. 17(4): 219-226 (2007)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/XiaoSL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/XiaoSL07
Xiong Xiao, Chng Eng Siong, Haizhou Li:
Temporal Structure Normalization of Speech Feature for Robust Speech Recognition. IEEE Signal Process. Lett. 14(7): 500-503 (2007)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/talip/KuoLY07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/talip/KuoLY07
Jin-Shea Kuo, Haizhou Li, Ying-Kuei Yang:
A phonetic similarity model for automatic extraction of transliteration pairs. ACM Trans. Asian Lang. Inf. Process. 6(2): 6 (2007)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiML07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiML07
Haizhou Li, Bin Ma, Chin-Hui Lee:
A Vector Space Modeling Approach to Spoken Language Identification. IEEE Trans. Speech Audio Process. 15(1): 271-284 (2007)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/NweL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/NweL07
Tin Lay Nwe, Haizhou Li:
Exploring Vibrato-Motivated Acoustic Features for Singer Identification. IEEE Trans. Speech Audio Process. 15(2): 519-530 (2007)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MaLT07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MaLT07
Bin Ma, Haizhou Li, Rong Tong:
Spoken Language Recognition Using Ensemble Classifiers. IEEE Trans. Speech Audio Process. 15(7): 2053-2062 (2007)
[c68]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/LiSKD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiSKD07
Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, Minghui Dong:
Semantic Transliteration of Personal Names. ACL 2007
[c67]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/SetiawanKL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/SetiawanKL07
Hendra Setiawan, Min-Yen Kan, Haizhou Li:
Ordering Phrases with Function Words. ACL 2007
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/clear/KohSNNMCLR07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/clear/KohSNNMCLR07
Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Chng Eng Siong, Haizhou Li, Susanto Rahardja:
Speaker Diarization Using Direction of Arrival Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007 Evaluation. CLEAR 2007: 484-496
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/cmmr/KhineNL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cmmr/KhineNL07
Swe Zin Kalayar Khine, Tin Lay Nwe, Haizhou Li:
Exploring Perceptual Based Timbre Feature for Singer Identification. CMMR 2007: 159-171
[c64]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ChiaLN07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChiaLN07
Tee Kiah Chia, Haizhou Li, Hwee Tou Ng:
A Statistical Language Modeling Approach to Lattice-Based Spoken Document Retrieval. EMNLP-CoNLL 2007: 810-818
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhuMLH07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhuMLH07
Donglai Zhu, Bin Ma, Haizhou Li, Qiang Huo:
A Generalized Feature Transformation Approach for Channel Robust Speaker Verification. ICASSP (4) 2007: 61-64
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongLMCC07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongLMCC07
Rong Tong, Haizhou Li, Bin Ma, Engsiong Chng, Siu-Yeung Cho:
Spoken Language Recognition with Relevance Feedback. ICASSP (4) 2007: 861-864
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MaTL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaTL07
Bin Ma, Rong Tong, Haizhou Li:
Discriminative Vector for Spoken Language Recognition. ICASSP (4) 2007: 1001-1004
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoCL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoCL07
Xiong Xiao, Engsiong Chng, Haizhou Li:
Normalizing the Speech Modulation Spectrum for Robust Speech Recognition. ICASSP (4) 2007: 1021-1024
[c59]
- view
  - electronic edition via handle.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icmc/KhineNL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmc/KhineNL07
Swe Zin Kalayar Khine, Tin Lay Nwe, Haizhou Li:
On Timbre Based perceptual Feature for Singer identification. ICMC 2007
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangLC07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangLC07
Lei Wang, Haizhou Li, Engsiong Chng:
A Vector-Based Approach to Broadcast Audio Database Indexing and Retrieval. ICME 2007: 512-515
[c57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimL07
Khe Chai Sim, Haizhou Li:
Fusion of contrastive acoustic models for parallel phonotactic spoken language identification. INTERSPEECH 2007: 170-173
[c56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeYLK07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeYLK07
Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen:
A GMM-based probabilistic sequence kernel for speaker verification. INTERSPEECH 2007: 294-297
[c55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoCL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoCL07
Xiong Xiao, Engsiong Chng, Haizhou Li:
Evaluating the temporal structure normalisation technique on the Aurora-4 task. INTERSPEECH 2007: 1070-1073
[c54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KohSNNMCLR07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KohSNNMCLR07
Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Engsiong Chng, Haizhou Li, Susanto Rahardja:
Using direction of arrival estimate and acoustic feature information in speaker diarization. INTERSPEECH 2007: 2149-2152
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/NweL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/NweL07
Tin Lay Nwe, Haizhou Li:
Singing voice detection using perceptually-motivated features. ACM Multimedia 2007: 309-312
2006
[j2]
- view
  - electronic edition @ aclclp.org.tw (open access)
  - details & citations
- export record
  dblp key:
  - journals/ijclclp/MaL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijclclp/MaL06
Bin Ma, Haizhou Li:
A Comparative Study of Four Language Identification Systems. Int. J. Comput. Linguistics Chin. Lang. Process. 11(2) (2006)
[j1]
- view
  - electronic edition @ colips.org
  - details & citations
- export record
  dblp key:
  - journals/jclc/DongLL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jclc/DongLL06
Minghui Dong, Kim-Teng Lua, Haizhou Li:
A Unit Selection-based Speech Synthesis Approach for Mandarin Chinese. J. Chin. Lang. Comput. 16(3): 135-144 (2006)
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KuoLY06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KuoLY06
Jin-Shea Kuo, Haizhou Li, Ying-Kuei Yang:
Learning Transliteration Lexicons from the Web. ACL 2006
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TongMZLC06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TongMZLC06
Rong Tong, Bin Ma, Donglai Zhu, Haizhou Li, Engsiong Chng:
Integrating Acoustic, Prosodic and Phonotactic Features for Spoken Language Identification. ICASSP (1) 2006: 205-208
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiN06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiN06
Haizhou Li, Tin Lay Nwe:
Vibrato-Motivated Acoustic Features for Singger Identification. ICASSP (5) 2006: 533-536
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BaiL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BaiL06
Shuanhu Bai, Haizhou Li:
Bayesian Learning of N-Gram Statistical Language Modeling. ICASSP (1) 2006: 1045-1048
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/MaddageKL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/MaddageKL06
Namunu Chinthaka Maddage, Mohan S. Kankanhalli, Haizhou Li:
A Hierarchical Approach for Music Chord Modeling Based on the Analysis of Tonal Characteristics. ICME 2006: 945-948
[c47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DongLN06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DongLN06
Minghui Dong, Haizhou Li, Tin Lay Nwe:
Evaluating prosody of Mandarin speech for language learning. INTERSPEECH 2006
[c46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiMT06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiMT06
Haizhou Li, Bin Ma, Rong Tong:
Vector-based spoken language recognition using output coding. INTERSPEECH 2006
[c45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaZTL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaZTL06
Bin Ma, Donglai Zhu, Rong Tong, Haizhou Li:
Speaker cluster based GMM tokenization for speaker recognition. INTERSPEECH 2006
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NweLD06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NweLD06
Tin Lay Nwe, Haizhou Li, Minghui Dong:
Analysis and detection of speech under sleep deprivation. INTERSPEECH 2006
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XiaoLC06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XiaoLC06
Xiong Xiao, Haizhou Li, Engsiong Chng:
Vector Autoregressive Model for Missing Feature Reconstruction. ISCSLP (Selected Papers) 2006: 315-324
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/GiulianiNL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/GiulianiNL06
Manuel Giuliani, Tin Lay Nwe, Haizhou Li:
Meeting Segmentation Using Two-Layer Cascaded Subband Filters. ISCSLP (Selected Papers) 2006: 672-682
[c41]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/KinnunenK00C06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/KinnunenK00C06
Tomi Kinnunen, Chin-Wei Eugene Koh, Lei Wang, Haizhou Li, Eng Siong Chng:
Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification. ISCSLP 2006
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LeeSTMDYZKWKEL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LeeSTMDYZKWKEL06
Kong-Aik Lee, Hanwu Sun, Rong Tong, Bin Ma, Minghui Dong, Changhuai You, Donglai Zhu, Chin-Wei Eugene Koh, Lei Wang, Tomi Kinnunen, Chng Eng Siong, Haizhou Li:
The IIR Submission to CSLP 2006 Speaker Recognition Evaluation. ISCSLP (Selected Papers) 2006: 494-505
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/TongMLYZKSDCL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/TongMLYZKSDCL06
Rong Tong, Bin Ma, Kong-Aik Lee, Changhuai You, Donglai Zhu, Tomi Kinnunen, Hanwu Sun, Minghui Dong, Chng Eng Siong, Haizhou Li:
Fusion of Acoustic and Tokenization Features for Speaker Recognition. ISCSLP (Selected Papers) 2006: 566-577
[c38]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/ZhuT0006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhuT0006
Donglai Zhu, Rong Tong, Bin Ma, Haizhou Li:
Minimum Classification Error Based Optimal Linear Combination for Spoken Language Identification. ISCSLP 2006
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/IskandarWKL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/IskandarWKL06
Denny Iskandar, Ye Wang, Min-Yen Kan, Haizhou Li:
Syllabic level automatic synchronization of music signals and text lyrics. ACM Multimedia 2006: 659-662
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/odyssey/LiYLMTZL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LiYLMTZL06
Jinyu Li, Sibel Yaman, Chin-Hui Lee, Bin Ma, Rong Tong, Donglai Zhu, Haizhou Li:
Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier Fusion. Odyssey 2006: 1-5
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/MaddageLK06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/MaddageLK06
Namunu Chinthaka Maddage, Haizhou Li, Mohan S. Kankanhalli:
Music structure based vector space retrieval. SIGIR 2006: 67-74
[e1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/2006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/2006
Qiang Huo, Bin Ma, Chng Eng Siong, Haizhou Li:
Chinese Spoken Language Processing, 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006, Selected Papers. Lecture Notes in Computer Science 4274, Springer 2006, ISBN 3-540-49665-3 [contents]
2005
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LiM05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiM05
Haizhou Li, Bin Ma:
A Phonotactic Language Model for Spoken Language Identification. ACL 2005: 515-522
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LimLM05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LimLM05
Boon Pang Lim, Haizhou Li, Bin Ma:
Using Local & Global Phonotactic Features in Chinese Dialect Identification. ICASSP (1) 2005: 577-580
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NweL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NweL05
Tin Lay Nwe, Haizhou Li:
Broadcast news segmentation by audio type analysis. ICASSP (2) 2005: 1065-1068
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnlp/SetiawanLZO05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/SetiawanLZO05
Hendra Setiawan, Haizhou Li, Min Zhang, Beng Chin Ooi:
Phrase-Based Statistical Machine Translation: A Level of Detail Approach. IJCNLP 2005: 576-587
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnlp/ZhangLSS05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/ZhangLSS05
Min Zhang, Haizhou Li, Jian Su, Hendra Setiawan:
A Phrase-Based Context-Dependent Joint Probability Model for Named Entity Translation. IJCNLP 2005: 600-611
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NweL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NweL05
Tin Lay Nwe, Haizhou Li:
Identifying singers of popular songs. INTERSPEECH 2005: 129-132
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaLL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaLL05
Bin Ma, Haizhou Li, Chin-Hui Lee:
An acoustic segment modeling approach to automatic language identification. INTERSPEECH 2005: 2829-2832
[c27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoMLL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoMLL05
Sheng Gao, Bin Ma, Haizhou Li, Chin-Hui Lee:
A text categorization approach to automatic language identification. INTERSPEECH 2005: 2837-2840
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DongLL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DongLL05
Minghui Dong, Kim-Teng Lua, Haizhou Li:
A probabilistic approach to prosodic word prediction for Mandarin Chinese TTS. INTERSPEECH 2005: 3245-3248
[c25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KumarML05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarML05
C. Santhosh Kumar, V. P. Mohandas, Haizhou Li:
Multilingual speech recognition: a unified approach. INTERSPEECH 2005: 3357-3360
[c24]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/maveba/ManickamL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/maveba/ManickamL05
Kathiresan Manickam, Haizhou Li:
Complexity analysis of normal and deaf infant cry acoustic waves. MAVEBA 2005: 105-108
[c23]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mtsummit/SetiawanLZ05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mtsummit/SetiawanLZ05
Hendra Setiawan, Haizhou Li, Min Zhang:
Learning Phrase Translation using Level of Detail Approach. MTSummit 2005: 243-250
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/MaL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/MaL05
Bin Ma, Haizhou Li:
A phonotactic-semantic paradigm for automatic spoken document classification. SIGIR 2005: 369-376
2004
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LiZS04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiZS04
Haizhou Li, Min Zhang, Jian Su:
A Joint Source-Channel Model for Machine Transliteration. ACL 2004: 159-166
[c20]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/ZhangLS04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ZhangLS04
Min Zhang, Haizhou Li, Jian Su:
Direct Orthographical Mapping for Machine Transliteration. COLING 2004
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuFL04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuFL04
Jun Xu, Guohong Fu, Haizhou Li:
Grapheme-to-phoneme conversion for Chinese text-to-speech. INTERSPEECH 2004: 1885-1888
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LimLC04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LimLC04
Boon Pang Lim, Haizhou Li, Yu Chen:
Language identification through large vocabulary continuous speech recognition. ISCSLP 2004: 49-52
2003
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuCDGL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuCDGL03
Jun Xu, Thomas Choy, Minghui Dong, Cuntai Guan, Haizhou Li:
On unit analysis for Cantonese corpus-based TTS. INTERSPEECH 2003: 269-272
2002
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaGLL02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaGLL02
Bin Ma, Cuntai Guan, Haizhou Li, Chin-Hui Lee:
Multilingual speech recognition with language identification. INTERSPEECH 2002: 505-508
[c15]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/000102
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/000102
Haizhou Li:
Concatenative Chinese speech synthesis and quality evaluation. ISCSLP 2002
[c14]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/0001G002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/0001G002
Bin Ma, Cuntai Guan, Haizhou Li:
Likelihood probability mismatch analysis and normalization in multilingual speech applications. ISCSLP 2002
[c13]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/0005G002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/0005G002
Min Zhang, Cuntai Guan, Haizhou Li:
Equivalent node-based speech grammar optimization. ISCSLP 2002
2000
[c12]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/0005C000
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/0005C000
Min Zhang, Engsiong Chng, Haizhou Li:
Semi-class-based N-gram Language Modeling for Chinese Dictation. ISCSLP 2000

1990 – 1999

see FAQ

What is the meaning of the colors in the publication lists?

1998
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BaiLLY98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BaiLLY98
Shuanhu Bai, Haizhou Li, Zhiwei Lin, Baosheng Yuan:
Building class-based language models with contextual statistics. ICASSP 1998: 173-176
[c10]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/0001LB98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/0001LB98
Haizhou Li, Zhiwei Lin, Shuanhu Bai:
Chinese Sentence Tokenization Using Viterbi Decoder. ISCSLP 1998
[c9]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/Guan0YL98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Guan0YL98
Cuntai Guan, Haizhou Li, Baosheng Yuan, Zhiwei Lin:
Data-driven Acoustic Modeling Approach for Chinese LVCSR. ISCSLP 1998
[c8]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/YuanGL098
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YuanGL098
Baosheng Yuan, Cuntai Guan, Gareth Loudon, Haizhou Li:
Optimization of Parameter Tying for Chinese Acoustic Modeling. ISCSLP 1998
[c7]
- view
  - electronic edition via handle.net
  - details & citations
- export record
  dblp key:
  - conf/paclic/LiY98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/paclic/LiY98
Haizhou Li, Baosheng Yuan:
Chinese Word Segmentation. PACLIC 1998: 212-217
1996
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SuLHN96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SuLHN96
Jian Su, Haizhou Li, Jean-Paul Haton, Kai-Tat Ng:
Speaker time-drifting adaptation using trajectory mixture hidden Markov models. ICASSP 1996: 709-712
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiGH96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiGH96
Haizhou Li, Yifan Gong, Jean-Paul Haton:
Probabilistic mapping networks for speaker recognition. ICASSP 1996: 3374-3377
1995
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NgLH95
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgLH95
Kai Tat Ng, Haizhou Li, Jean Paul Haton:
Some nonparametric distance measures in speaker verification. EUROSPEECH 1995: 317-320
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiHG95
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiHG95
Haizhou Li, Jean Paul Haton, Yifan Gong:
On MMI learning of Gaussian mixture for speaker models. EUROSPEECH 1995: 363-366
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiHSG95
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiHSG95
Haizhou Li, Jean Paul Haton, Jian Su, Yifan Gong:
Speaker recognition with temporal transition models. EUROSPEECH 1995: 617-620
1993
[c1]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/seke/LeungL93
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/seke/LeungL93
Ping Hung Karl R. Leung, Haizhou Li:
Structured Specifications, Semantics, and System Semantics. SEKE 1993: 324-326

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.