default search action
Shengkui Zhao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c30]Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance. ICASSP 2024: 326-330 - [c29]Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. ICASSP 2024: 10356-10360 - [c28]Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-Shot Learners for Speech Recognition? ICASSP 2024: 10366-10370 - [i15]Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis. CoRR abs/2406.02009 (2024) - [i14]Jia Qi Yip, Shengkui Zhao, Dianwen Ng, Eng Siong Chng, Bin Ma:
Towards Audio Codec-based Speech Separation. CoRR abs/2406.12434 (2024) - [i13]Kun Zhou, You Zhang, Shengkui Zhao, Hao Wang, Zexu Pan, Dianwen Ng, Chong Zhang, Chongjia Ni, Yukun Ma, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions. CoRR abs/2409.16681 (2024) - 2023
- [c27]Shengkui Zhao, Bin Ma:
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network Using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement. ICASSP 2023: 1-5 - [c26]Shengkui Zhao, Bin Ma:
MossFormer: Pushing the Performance Limit of Monaural Speech Separation Using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions. ICASSP 2023: 1-5 - [c25]Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Qian Chen, Wen Wang, Eng Siong Chng, Bin Ma:
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition. INTERSPEECH 2023: 1319-1323 - [c24]Jia Qi Yip, Duc-Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. INTERSPEECH 2023: 1938-1942 - [i12]Shengkui Zhao, Bin Ma:
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions. CoRR abs/2302.11824 (2023) - [i11]Shengkui Zhao, Bin Ma:
D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement. CoRR abs/2302.11832 (2023) - [i10]Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. CoRR abs/2305.12121 (2023) - [i9]Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-shot Learners for Speech Recognition? CoRR abs/2309.09413 (2023) - [i8]Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for enhanced speech separation performance. CoRR abs/2309.12608 (2023) - [i7]Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. CoRR abs/2312.11825 (2023) - 2022
- [c23]Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma:
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression. ICASSP 2022: 656-660 - [c22]Shengkui Zhao, Bin Ma, Karn N. Watcharasupat, Woon-Seng Gan:
FRCRN: Boosting Feature Representation Using Frequency Recurrence for Monaural Speech Enhancement. ICASSP 2022: 9281-9285 - [i6]Shengkui Zhao, Bin Ma, Karn N. Watcharasupat, Woon-Seng Gan:
FRCRN: Boosting Feature Representation using Frequency Recurrence for Monaural Speech Enhancement. CoRR abs/2206.07293 (2022) - 2021
- [c21]Shengkui Zhao, Hao Wang, Trung Hieu Nguyen, Bin Ma:
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. ICASSP 2021: 5969-5973 - [c20]Shengkui Zhao, Trung Hieu Nguyen, Bin Ma:
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses. ICASSP 2021: 6648-6652 - [i5]Shengkui Zhao, Hao Wang, Trung Hieu Nguyen, Bin Ma:
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. CoRR abs/2102.01991 (2021) - [i4]Shengkui Zhao, Trung Hieu Nguyen, Bin Ma:
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses. CoRR abs/2102.01993 (2021) - [i3]Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma:
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression. CoRR abs/2110.00745 (2021) - 2020
- [c19]Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma:
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion. INTERSPEECH 2020: 2927-2931 - [i2]Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma:
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion. CoRR abs/2010.08136 (2020)
2010 – 2019
- 2019
- [c18]Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma:
Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks. INTERSPEECH 2019: 689-693 - [c17]Shengkui Zhao, Chongjia Ni, Rong Tong, Bin Ma:
Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition. INTERSPEECH 2019: 1238-1242 - 2017
- [c16]Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition. ICASSP 2017: 3246-3250 - [c15]Thi Ngoc Tho Nguyen, Cagdas Tuna, Shengkui Zhao, Douglas L. Jones:
A novel sparse model for multi-source localization using distributed microphone array. ICASSP 2017: 3256-3260 - 2016
- [j10]Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation. EURASIP J. Adv. Signal Process. 2016: 4 (2016) - [c14]Shengkui Zhao, Cagdas Tuna, Thi Ngoc Tho Nguyen, Douglas L. Jones:
Large region acoustic source mapping: A generalized sparse constrained deconvolution approach. ICASSP 2016: 3186-3190 - [c13]Xiong Xiao, Shengkui Zhao, Thi Ngoc Tho Nguyen, Douglas L. Jones, Eng Siong Chng, Haizhou Li:
An expectation-maximization eigenvector clustering approach to direction of arrival estimation of multiple speech sources. ICASSP 2016: 6330-6334 - 2015
- [j9]Viet Anh Nguyen, Jiangbo Lu, Shengkui Zhao, Dung T. Vu, Hongsheng Yang, Douglas L. Jones, Minh N. Do:
ITEM: Immersive Telepresence for Entertainment and Meetings - A Practical Approach. IEEE J. Sel. Top. Signal Process. 9(3): 546-561 (2015) - [c12]Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren, Longbiao Wang, Douglas L. Jones, Engsiong Chng, Haizhou Li:
Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction. ASRU 2015: 460-467 - [c11]Shengkui Zhao, Thi Ngoc Tho Nguyen, Douglas L. Jones:
Large region acoustic source mapping using movable arrays. ICASSP 2015: 2589-2593 - [c10]Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li:
A learning-based approach to direction of arrival estimation in noisy and reverberant environments. ICASSP 2015: 2814-2818 - [c9]Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li:
Learning to estimate reverberation time in noisy and reverberant rooms. INTERSPEECH 2015: 3431-3435 - 2014
- [j8]Shengkui Zhao, Douglas L. Jones, Suiyang Khoo, Zhihong Man:
New Variable Step-Sizes Minimizing Mean-Square Deviation for the LMS-Type Algorithms. Circuits Syst. Signal Process. 33(7): 2251-2265 (2014) - [j7]Shengkui Zhao, Tigran Saluev, Douglas L. Jones:
Underdetermined direction of arrival estimation using acoustic vector sensor. Signal Process. 100: 160-168 (2014) - [j6]Viet Anh Nguyen, Jiangbo Lu, Shengkui Zhao, Douglas L. Jones, Minh N. Do:
Teleimmersive Audio-Visual Communication Using Commodity Hardware [Applications Corner]. IEEE Signal Process. Mag. 31(6): 118-136 (2014) - [c8]Nguyen Thi Ngoc Tho, Shengkui Zhao, Douglas L. Jones:
Robust DOA estimation of multiple speech sources. ICASSP 2014: 2287-2291 - [c7]Shengkui Zhao, Douglas L. Jones:
A new auxiliary-vector algorithm with conjugate orthogonality for speech enhancement. INTERSPEECH 2014: 2660-2664 - [i1]Viet Anh Nguyen, Jiangbo Lu, Shengkui Zhao, Tien Dung Vu, Hongsheng Yang, Douglas L. Jones, Minh N. Do:
ITEM: Immersive Telepresence for Entertainment and Meetings - A Practical Approach. CoRR abs/1408.0605 (2014) - 2013
- [c6]Viet Anh Nguyen, Shengkui Zhao, Tien Dung Vu, Douglas L. Jones, Minh N. Do:
Spatialized audio multiparty teleconferencing with commodity miniature microphone array. ACM Multimedia 2013: 553-556 - 2012
- [j5]Suiyang Khoo, Shengkui Zhao, Zhihong Man:
Adaptive fast finite-time multiple-surface sliding control for a class of uncertain non-linear systems. Int. J. Model. Identif. Control. 16(4): 392-400 (2012) - [c5]Yun Liang, Zheng Cui, Shengkui Zhao, Kyle Rupnow, Yihao Zhang, Douglas L. Jones, Deming Chen:
Real-time implementation and performance optimization of 3D sound localization on GPUs. DATE 2012: 832-835 - [c4]Shengkui Zhao, Douglas L. Jones:
A Fast-Converging Adaptive Frequency-Domain MVDR Beamformer for Speech Enhancement. INTERSPEECH 2012: 1930-1933 - 2010
- [c3]Shengkui Zhao, Jianfei Cai, Zhihong Man:
Nonlinear image restoration using recurrent radial basis function network. ISCAS 2010: 1161-1164
2000 – 2009
- 2009
- [b1]Shengkui Zhao:
Performance analysis and enhancements of adaptive algorithms and their applications. Nanyang Technological University, Singapore, 2009 - [j4]Shengkui Zhao, Zhihong Man, Suiyang Khoo, Hong Ren Wu:
Variable step-size LMS algorithm with a quotient form. Signal Process. 89(1): 67-76 (2009) - [j3]Shengkui Zhao, Zhihong Man, Suiyang Khoo:
A generalized data windowing scheme for adaptive conjugate gradient algorithms. Signal Process. 89(5): 894-900 (2009) - [j2]Shengkui Zhao, Zhihong Man, Suiyang Khoo, Hong Ren Wu:
Stability and Convergence Analysis of Transform-Domain LMS Adaptive Filters With Second-Order Autoregressive Process. IEEE Trans. Signal Process. 57(1): 119-130 (2009) - 2008
- [j1]Suiyang Khoo, Zhihong Man, Shengkui Zhao:
Comments on "Adaptive multiple-surface sliding control for non-autonomous systems with mismatched uncertainties". Autom. 44(11): 2995-2998 (2008) - 2006
- [c2]Suiyang Khoo, Zhihong Man, Shengkui Zhao:
Sliding Mode Control of Fuzzy Dynamic Systems. ICARCV 2006: 1-6 - [c1]Shengkui Zhao, Zhihong Man, Suiyang Khoo:
Modified LMS and NLMS Algorithms with a New Variable Step Size. ICARCV 2006: 1-6
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 21:18 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint