


default search action
Yu Tsao 0001
Person information
- affiliation: Academia Sinica, Research Center for Information Technology Innovation, Taipei, Taiwan
Other persons with the same name
- Yu Tsao — disambiguation page
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
[j83]Muhammad Salman Khan
, Valerio Mario Salerno
, Moreno La Quatra
, Kuo-Hsuan Hung, Szu-Wei Fu, Yu Tsao
, Sabato Marco Siniscalchi
:
Foundation Models for Speech Enhancement Leveraging Consistency Constraints and Contrast Stretching. IEEE Access 13: 175718-175732 (2025)
[j82]Ming-Chi Yen, Chia-Hua Wu, Shu-Wei Tsai, Jyh-Shing Roger Jang, Yu Tsao
, Amir Hussain
, Hsin-Min Wang
:
Mandarin Electrolaryngeal Speech Voice Conversion with Speech Encoder Loss Learning and Seq2seq Modeling. IEEE Internet Things Mag. 8(4): 22-28 (2025)
[j81]Amir Hussain
, Yu Tsao
, John H. L. Hansen
, Naomi Harte, Shinji Watanabe
, Isabel Trancoso, Shixiong Zhang:
Guest Editorial: IEEE JSTSP Special Issue on Deep Multimodal Speech Enhancement and Separation (DEMSES). IEEE J. Sel. Top. Signal Process. 19(4): 596-599 (2025)
[j80]Ammarah Hashmi
, Sahibzada Adil Shahzad
, Chia-Wen Lin
, Yu Tsao
, Hsin-Min Wang
:
AVTENet: A Human-Cognition-Inspired Audio-Visual Transformer-Based Ensemble Network for Video Deepfake Detection. IEEE Trans. Cogn. Dev. Syst. 17(6): 1360-1376 (2025)
[j79]Tun-Yu Chang
, Jeng-Bang Wang
, Yu-Hsuan Tsai
, Yu Tsao
, Chia-Hsiang Yang
:
A 40-nm 3.9mW, 200words/Min Neural Signal Processor in Speech Decoding for Brain-Machine Interface. IEEE Trans. Biomed. Circuits Syst. 19(6): 1065-1077 (2025)
[j78]Sahibzada Adil Shahzad
, Ammarah Hashmi
, Yan-Tsung Peng
, Yu Tsao
, Hsin-Min Wang
:
AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Deepfake Detection of Frontal Face Videos. IEEE Trans. Hum. Mach. Syst. 55(6): 973-982 (2025)
[j77]Ying-Ren Chien
, Po-Heng Chou
, You-Jie Peng
, Chun-Yuan Huang
, Hen-Wai Tsao
, Yu Tsao
:
NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications. IEEE Trans. Instrum. Meas. 74: 1-15 (2025)
[j76]Kuan-Chen Wang, Kai-Chun Liu
, Ping-Cheng Yeh, Sheng-Yu Peng
, Yu Tsao
:
TrustEMG-Net: Using Representation-Masking Transformer With U-Net for Surface Electromyography Enhancement. IEEE J. Biomed. Health Informatics 29(4): 2506-2520 (2025)
[j75]Po-Heng Chou
, Bo-Ren Zheng, Wan-Jen Huang
, Walid Saad
, Yu Tsao
, Ronald Y. Chang
:
Deep Reinforcement Learning-Based Precoding for Multi-RIS-Aided Multiuser Downlink Systems With Practical Phase Shift. IEEE Wirel. Commun. Lett. 14(1): 23-27 (2025)
[c278]Siyin Wang, Wenyi Yu, Xianzhao Chen, Xiaohai Tian, Jun Zhang, Lu Lu, Yu Tsao, Junichi Yamagishi, Yuxuan Wang, Chao Zhang:
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions. ACL (1) 2025: 23588-23609
[c277]Wei-Lun Chen
, Chia-Yeh Hsieh, Yu-Hsiang Kao
, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao
:
Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images. AICAS 2025: 1-5
[c276]Yu-Chien Lin, Chia-Hua Wu, Yu Tsao, Hsin-Min Wang:
Improving Speech Translation Through Data Augmentation with Data in Similar Languages. EUSIPCO 2025: 491-495
[c275]You-Jin Li, Rong Chao, Borching Su, Yu Tsao
:
Speech Enhancement with MAP-based Training for Robust ASR. ICASSP 2025: 1-5
[c274]Jie Lin, I Chiu, Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang
, Ping-Cheng Yeh, Yu Tsao
:
MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution. ICASSP 2025: 1-5
[c273]Yu-Tung Liu, Kuan-Chen Wang, Rong Chao, Sabato Marco Siniscalchi, Ping-Cheng Yeh, Yu Tsao
:
MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network. ICASSP 2025: 1-5
[c272]De-Yan Lu, Jian-Jiun Ding, Yu Tsao
:
Neural Variational Mode Decomposition and Its Application for ECG Denoising. ICASSP 2025: 1-5
[c271]Wenze Ren, Haibin Wu, Yi-Cheng Lin, Xuanjun Chen, Rong Chao, Kuo-Hsuan Hung, You-Jin Li, Wen-Yuan Ting, Hsin-Min Wang
, Yu Tsao
:
Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement. ICASSP 2025: 1-5
[c270]Ryandhimas E. Zezario
, Sabato Marco Siniscalchi, Hsin-Min Wang
, Yu Tsao
:
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models. ICASSP 2025: 1-5
[c269]Shafique Ahmed, Ryandhimas E. Zezario
, Nasir Saleem, Amir Hussain
, Hsin-Min Wang
, Yu Tsao
:
A Study on Speech Assessment with Visual Cues. INTERSPEECH 2025
[c268]Rong Chao, Rauf Nasretdinov, Yu-Chiang Frank Wang, Ante Jukic, Szu-Wei Fu, Yu Tsao
:
Universal Speech Enhancement with Regression and Generative Mamba. INTERSPEECH 2025
[c267]Hsing-Hang Chou, Yun-Shao Lin, Ching-Chin Sung, Yu Tsao
, Chi-Chun Lee:
ZSDEVC: Zero-Shot Diffusion-based Emotional Voice Conversion with Disentangled Mechanism. INTERSPEECH 2025
[c266]Sung-Feng Huang, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Pin-Jui Ku, Ante Jukic, Huck Yang, Yu Tsao
, Yu-Chiang Frank Wang, Hung-yi Lee, Szu-Wei Fu:
VoiceNoNG: Robust High-Quality Speech Editing Model without Hallucinations. INTERSPEECH 2025
[c265]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR. INTERSPEECH 2025
[c264]Chia-Hua Wu, Wanying Ge, Xin Wang, Junichi Yamagishi, Yu Tsao
, Hsin-Min Wang
:
A Comparative Study on Proactive and Passive Detection of Deepfake Speech. INTERSPEECH 2025
[c263]Ryandhimas E. Zezario
, Sabato Marco Siniscalchi, Fei Chen, Hsin-Min Wang
, Yu Tsao
:
Feature Importance across Domains for Improving Non-Intrusive Speech Intelligibility Prediction in Hearing Aids. INTERSPEECH 2025
[i200]Sung-Feng Huang, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Chao-Han Huck Yang, Yu Tsao, Yu-Chiang Frank Wang, Hung-yi Lee, Szu-Wei Fu:
Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits. CoRR abs/2501.03805 (2025)
[i199]Jiawei Du
, Xuanjun Chen, Haibin Wu, Lin Zhang, I-Ming Lin, I-Hsiang Chiu, Wenze Ren, Yuan Tseng, Yu Tsao, Jyh-Shing Roger Jang, Hung-yi Lee:
CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset. CoRR abs/2501.08238 (2025)
[i198]Moreno La Quatra, Valerio Mario Salerno, Yu Tsao, Sabato Marco Siniscalchi:
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction. CoRR abs/2501.12979 (2025)
[i197]Meng-Ping Lin, Jen-Cheng Hou, Chia-Wei Chen, Shao-Yi Chien, Jun-Cheng Chen, Xugang Lu, Yu Tsao:
Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement. CoRR abs/2501.13375 (2025)
[i196]Wei-Lun Chen, Chia-Yeh Hsieh, Yu-Hsiang Kao, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images. CoRR abs/2501.18453 (2025)
[i195]Shafique Ahmed, Ryandhimas E. Zezario, Hui-Guan Yuan, Amir Hussain, Hsin-Min Wang
, Wei-Ho Chung, Yu Tsao:
NeuroAMP: A Novel End-to-end General Purpose Deep Neural Amplifier for Personalized Hearing Aids. CoRR abs/2502.10822 (2025)
[i194]Kuo-Hsuan Hung, Xugang Lu, Szu-Wei Fu, Huan-Hsin Tseng, Hsin-Yi Lin, Chii-Wann Lin, Yu Tsao:
Linguistic Knowledge Transfer Learning for Speech Enhancement. CoRR abs/2503.07078 (2025)
[i193]Siyin Wang, Wenyi Yu, Xianzhao Chen, Xiaohai Tian, Jun Zhang, Lu Lu, Yu Tsao, Junichi Yamagishi, Yuxuan Wang, Chao Zhang:
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions. CoRR abs/2503.20290 (2025)
[i192]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR. CoRR abs/2505.13079 (2025)
[i191]Rong Chao, Rauf Nasretdinov, Yu-Chiang Frank Wang, Ante Jukic, Szu-Wei Fu, Yu Tsao:
Universal Speech Enhancement with Regression and Generative Mamba. CoRR abs/2505.21198 (2025)
[i190]Whenty Ariyanti, Kuan-Yu Chen, Sabato Marco Siniscalchi, Hsin-Min Wang, Yu Tsao:
Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations. CoRR abs/2505.21356 (2025)
[i189]Shafique Ahmed, Ryandhimas E. Zezario, Nasir Saleem
, Amir Hussain
, Hsin-Min Wang
, Yu Tsao:
A Study on Speech Assessment with Visual Cues. CoRR abs/2506.09549 (2025)
[i188]Tzu-Quan Lin, Heng-Cheng Kuo, Tzu-Chieh Wei, Hsi-Chun Cheng, Chun-Wei Chen, Hsien-Fu Hsiao, Yu Tsao, Hung-yi Lee:
An Exploration of Mamba for Speech Self-Supervised Models. CoRR abs/2506.12606 (2025)
[i187]Chia-Hua Wu, Wanying Ge, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang
:
A Comparative Study on Proactive and Passive Detection of Deepfake Speech. CoRR abs/2506.14398 (2025)
[i186]Po-Heng Chou, Ching-Wen Chen, Wan-Jen Huang, Walid Saad, Yu Tsao, Ronald Y. Chang:
DNN-Based Precoding in RIS-Aided mmWave MIMO Systems With Practical Phase Shift. CoRR abs/2507.02824 (2025)
[i185]Hui-Guan Yuan, Ryandhimas E. Zezario, Shafique Ahmed, Hsin-Min Wang
, Kai-Lung Hua, Yu Tsao:
Neuro-MSBG: An End-to-End Neural Model for Hearing Loss Simulation. CoRR abs/2507.15396 (2025)
[i184]Ryandhimas E. Zezario, Sabato Marco Siniscalchi, Fei Chen, Hsin-Min Wang
, Yu Tsao:
Feature Importance across Domains for Improving Non-Intrusive Speech Intelligibility Prediction in Hearing Aids. CoRR abs/2507.23223 (2025)
[i183]Meng-Ping Lin, Enoch Hsin-Ho Huang, Shao-Yi Chien, Yu Tsao:
End-to-End Audio-Visual Learning for Cochlear Implant Sound Coding in Noisy Environments. CoRR abs/2508.13576 (2025)
[i182]Rong Chao, Wenze Ren, You-Jin Li, Kuo-Hsuan Hung, Sung-Feng Huang, Szu-Wei Fu, Wen-Huang Cheng, Yu Tsao:
Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement. CoRR abs/2508.13624 (2025)
[i181]Ryandhimas E. Zezario, Dyah A. M. G. Wisnu, Hsin-Min Wang
, Yu Tsao:
Speech Intelligibility Assessment with Uncertainty-Aware Whisper Embeddings and sLSTM. CoRR abs/2509.03013 (2025)
[i180]Ryandhimas E. Zezario, Dyah A. M. G. Wisnu, Hsin-Min Wang
, Yu Tsao:
A Study on Zero-Shot Non-Intrusive Speech Intelligibility for Hearing Aids Using Large Language Models. CoRR abs/2509.03021 (2025)
[i179]Dyah A. M. G. Wisnu, Ryandhimas E. Zezario, Stefano Rini, Hsin-Min Wang
, Yu Tsao:
Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings. CoRR abs/2509.03292 (2025)
[i178]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASR. CoRR abs/2509.05609 (2025)
[i177]Chun-Yuan Huang, Po-Heng Chou, Wan-Jen Huang, Ying-Ren Chien, Yu Tsao:
Capacity-Net-Based RIS Precoding Design without Channel Estimation for mmWave MIMO System. CoRR abs/2509.25660 (2025)
[i176]Po-Heng Chou, Bo-Ren Zheng, Wan-Jen Huang, Walid Saad, Yu Tsao, Ronald Y. Chang:
Deep Reinforcement Learning-Based Precoding for Multi-RIS-Aided Multiuser Downlink Systems with Practical Phase Shift. CoRR abs/2509.25661 (2025)
[i175]Kai-Wei Chang, En-Pei Hu, Chun-Yi Kuan, Wenze Ren, Wei-Chih Chen, Guan-Ting Lin, Yu Tsao, Shao-Hua Sun, Hung-yi Lee, James Glass:
Game-Time: Evaluating Temporal Dynamics in Spoken Language Models. CoRR abs/2509.26388 (2025)
[i174]Ying-Ren Chien, Po-Heng Chou, You-Jie Peng, Chun-Yuan Huang, Hen-Wai Tsao, Yu Tsao:
NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications. CoRR abs/2510.01850 (2025)
[i173]Dyah A. M. G. Wisnu, Ryandhimas E. Zezario, Stefano Rini, Fo-Rui Li, Yan-Tsung Peng, Hsin-Min Wang
, Yu Tsao:
STSM-FiLM: A FiLM-Conditioned Neural Architecture for Time-Scale Modification of Speech. CoRR abs/2510.02672 (2025)
[i172]Wei-Lung Mao, Chun-Chi Wang, Po-Heng Chou, Kai-Chun Liu, Yu Tsao:
MECKD: Deep Learning-Based Fall Detection in Multilayer Mobile Edge Computing With Knowledge Distillation. CoRR abs/2510.03601 (2025)
[i171]I Chiu, Yu-Tung Liu, Kuan-Chen Wang, Hung-Yu Wei, Yu Tsao:
Robust Photoplethysmography Signal Denoising via Mamba Networks. CoRR abs/2510.11058 (2025)
[i170]Ching-Chin Sung, Shuntaro Suzuki, Francis Pingfan Chien, Komei Sugiura, Yu Tsao:
Condition-Invariant fMRI Decoding of Speech Intelligibility with Deep State Space Model. CoRR abs/2511.01868 (2025)
[i169]Hung-Yang Sung, Chien-Chun Wang, Kuan-Tang Huang, Tien-Hong Lo, Yu Tsao, Yung-Chang Hsu, Berlin Chen:
CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition. CoRR abs/2511.06860 (2025)
[i168]Po-Heng Chou
, Da-Chih Lin
, Hung-Yu Wei, Walid Saad, Yu Tsao:
Learning-based Radio Link Failure Prediction Based on Measurement Dataset in Railway Environments. CoRR abs/2511.08851 (2025)
[i167]Shuntaro Suzuki, Chia-Chun Dan Hsu, Yu Tsao, Komei Sugiura:
MEGState: Phoneme Decoding from Magnetoencephalography Signals. CoRR abs/2512.17978 (2025)- 2024
[j74]Sheng-Yu Peng
, I-Chun Liu, Yi-Heng Wu, Ting-Ju Lin, Chun-Jui Chen, Xiu-Zhu Li, Yong-Qi Cheng, Pin-Han Lin
, Kuo-Hsuan Hung, Yu Tsao
:
An SRAM-Based Reconfigurable Cognitive Computation Matrix for Sensor Edge Applications. IEEE J. Solid State Circuits 59(2): 636-648 (2024)
[j73]Enoch Hsin-Ho Huang
, Rong Chao
, Yu Tsao
, Chao-Min Wu
:
ElectrodeNet - A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants. IEEE Trans. Cogn. Dev. Syst. 16(1): 346-357 (2024)
[j72]Syu-Siang Wang
, Jia-Yang Chen, Bo-Ren Bai
, Shih-Hau Fang
, Yu Tsao
:
Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3826-3837 (2024)
[c262]I-Chun Liu, Chun-Jui Chen, Xiu-Zhu Li, Yong-Qi Cheng, Chung-Wei Huang, Pin-Han Lin, Hsuan-Wei Pu, Sheng-Yu Peng, Yu Tsao
:
The Multilayer Neural Network Implementation Using SRAM-Based Reconfigurable Cognitive Computation Matrices. AICAS 2024: 467-471
[c261]Guojian Lin, Yu Tsao
, Fei Chen:
A Non-Intrusive Speech Quality Assessment Model using Whisper and Multi-Head Attention. APSIPA 2024: 1-6
[c260]Wenze Ren, Yi-Cheng Lin, Huang-Cheng Chou, Haibin Wu, Yi-Chiao Wu, Chi-Chun Lee, Hung-Yi Lee, Hsin-Min Wang
, Yu Tsao
:
EMO-Codec: An In-Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models with Subjective and Objective Evaluations. APSIPA 2024: 1-6
[c259]Kuo-Hsuan Hung, Kuan-Chen Wang, Kai-Chun Liu, Wei-Lun Chen, Xugang Lu, Yu Tsao
, Chii-Wann Lin:
MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal. IEEE Big Data 2024: 6469-6475
[c258]Cho-Yuan Lee, Kuan-Chen Wang, Kai-Chun Liu, Yu-Te Wang, Xugang Lu, Ping-Cheng Yeh, Yu Tsao
:
A Non-Intrusive Neural Quality Assessment Model for Surface Electromyography Signals. EMBC 2024: 1-5
[c257]Chin-Jou Li, Chien-Chen Chou, Yen-Cheng Shih, Li-Chuan Kuo, Yu-Te Wang, Aileen McGonigal
, Hsiang-Yu Yu, Jen-Cheng Hou, Yu Tsao
:
Epileptic Seizure Classification with Patient-level and Video-level Contrastive Pretraining. EMBC 2024: 1-4
[c256]Jui-Bang Lu, Yu Tsao
, Yu Te Wang:
Design and Evaluate Semi-dry Watermill-like EEG Electrodes. EMBC 2024: 1-4
[c255]Kuan-Chen Wang, You-Jin Li, Wei-Lun Chen, Yu-Wen Chen, Yi-Ching Wang, Ping-Cheng Yeh, Chao Zhang, Yu Tsao:
Bridging the Gap: Integrating Pre-Trained Speech Enhancement and Recognition Models for Robust Speech Recognition. EUSIPCO 2024: 426-430
[c254]Po-Heng Chou
, Ching-Wen Chen, Wan-Jen Huang, Walid Saad, Yu Tsao, Ronald Y. Chang:
DNN-Based Precoding in RIS-Aided mmWave MIMO Systems With Practical Phase Shift. GLOBECOM (Workshops) 2024: 1-5
[c253]Ryandhimas E. Zezario
, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model. ICASSP 2024: 831-835
[c252]Yu-Tung Liu
, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao
:
SDEMG: Score-Based Diffusion Model for Surface Electromyographic Signal Denoising. ICASSP 2024: 1736-1740
[c251]Haibin Wu, Heng-Cheng Kuo, Yu Tsao
, Hung-Yi Lee:
Scalable Ensemble-Based Detection Method Against Adversarial Attacks For Speaker Verification. ICASSP 2024: 4670-4674
[c250]Yuan Tseng, Layne Berry, Yiting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu
, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Poyao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao
, Abdelrahman Mohamed, Chi-Luen Feng, Hung-Yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. ICASSP 2024: 6890-6894
[c249]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-Based ASR. ICASSP 2024: 13116-13120
[c248]Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao
:
Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice. ICCE 2024: 1-6
[c247]Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Yu-Chiang Frank Wang:
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech. ICLR 2024
[c246]Ryandhimas E. Zezario
, Yu-Wen Chen, Szu-Wei Fu, Yu Tsao
, Hsin-Min Wang
, Chiou-Shann Fuh:
A Study On Incorporating Whisper For Robust Speech Assessment. ICME 2024: 1-6
[c245]Li-Chin Chen
, Jung-Nien Lai
, Hung-En Lin
, Hsien-Te Chen
, Kuo-Hsuan Hung
, Yu Tsao
:
Prognosticating Lumbar Spinal Surgery Outcomes for Low Back Pain and Sciatica Patients by Utilizing Preoperative Assessments from Western and Eastern Medicine and Multimodal Fusion Learning Techniques. ICMHI 2024: 262-267
[c244]Sheng-Chieh Chiu, Chia-Hua Wu, Jih-Kang Hsieh, Yu Tsao
, Hsin-Min Wang
:
Learnable Layer Selection and Model Fusion for Speech Self-Supervised Learning Models. INTERSPEECH 2024
[c243]Chun Yin, Tai-Shih Chi, Yu Tsao
, Hsin-Min Wang
:
SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models. INTERSPEECH 2024
[c242]Ryandhimas E. Zezario
, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata. INTERSPEECH 2024
[c241]Muhammad Salman Khan, Moreno La Quatra, Kuo-Hsuan Hung, Szu-Wei Fu, Sabato Marco Siniscalchi, Yu Tsao
:
Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-Based Speech Enhancement. MMSP 2024: 1-6
[c240]Pin-Yen Huang, Szu-Wei Fu, Yu Tsao:
RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier. NeurIPS 2024
[c239]Hsin-Li Chang, Enoch Hsin-Ho Huang
, Yi-Ching Wang, Yu Tsao
:
Using Automatic Speech Recognition for Speech Comprehension Evaluation in the Cochlear Implant. O-COCOSDA 2024: 1-5
[c238]Hsin-Te Hwang, Chia-Hua Wu, Ming-Chi Yen, Yu Tsao
, Hsin-Min Wang
:
Exemplar-Based Methods for Mandarin Electrolaryngeal Speech Voice Conversion. O-COCOSDA 2024: 1-6
[c237]Wenze Ren, Kuo-Hsuan Hung, Rong Chao, You-Jin Li, Hsin-Min Wang
, Yu Tsao
:
Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments With Advanced Post-Processing. O-COCOSDA 2024: 1-6
[c236]Yu Tsao, Chi-Chun Lee:
Message from the Program Chair. O-COCOSDA 2024: vii
[c235]Chun-Yuan Huang, Po-Heng Chou
, Wan-Jen Huang, Ying-Ren Chien, Yu Tsao
:
Capacity-Net-Based RIS Precoding Design Without Channel Estimation for mmWave MIMO System. PIMRC 2024: 1-6
[c234]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Temporal Order Preserved Optimal Transport-Based Cross-Modal Knowledge Transfer Learning for ASR. SLT 2024: 1-8
[c233]Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao
:
An Investigation of Incorporating Mamba For Speech Enhancement. SLT 2024: 302-308
[c232]Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao
, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe
, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines For Speech Recognition, Speaker Tagging, and Emotion Recognition. SLT 2024: 371-378
[c231]Moreno La Quatra, Valerio Mario Salerno, Yu Tsao
, Sabato Marco Siniscalchi:
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction. SLT 2024: 608-615
[c230]Sung-Feng Huang, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Chao-Han Huck Yang, Yu Tsao
, Yu-Chiang Frank Wang, Hung-Yi Lee, Szu-Wei Fu:
Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits. SLT 2024: 652-659
[c229]Wen-Chin Huang, Szu-Wei Fu, Erica Cooper, Ryandhimas E. Zezario
, Tomoki Toda, Hsin-Min Wang
, Junichi Yamagishi, Yu Tsao
:
The Voicemos Challenge 2024: Beyond Speech Quality Prediction. SLT 2024: 803-810
[c228]Jiawei Du
, I-Ming Lin, I-Hsiang Chiu, Xuanjun Chen, Haibin Wu, Wenze Ren, Yu Tsao
, Hung-Yi Lee, Jyh-Shing Roger Jang:
DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset. SLT 2024: 921-928
[i166]Dyah A. M. G. Wisnu, Epri W. Pratiwi, Stefano Rini, Ryandhimas E. Zezario, Hsin-Min Wang
, Yu Tsao:
HAAQI-Net: A non-intrusive neural music quality assessment model for hearing aids. CoRR abs/2401.01145 (2024)
[i165]Yu-Tung Liu, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao
:
SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising. CoRR abs/2402.03808 (2024)
[i164]Cho-Yuan Lee, Kuan-Chen Wang, Kai-Chun Liu, Xugang Lu, Ping-Cheng Yeh, Yu Tsao:
A Non-Intrusive Neural Quality Assessment Model for Surface Electromyography Signals. CoRR abs/2402.05482 (2024)
[i163]Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao
, Yu-Chiang Frank Wang:
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech. CoRR abs/2402.16321 (2024)
[i162]Tassadaq Hussain, Kia Dashtipour, Yu Tsao, Amir Hussain
:
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues. CoRR abs/2402.16394 (2024)
[i161]Jasper Kirton-Wingate, Shafique Ahmed, Adeel Hussain, Mandar Gogate, Kia Dashtipour, Jen-Cheng Hou, Tassadaq Hussain, Yu Tsao
, Amir Hussain
:
Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids. CoRR abs/2402.16757 (2024)
[i160]Ammarah Hashmi, Sahibzada Adil Shahzad
, Chia-Wen Lin, Yu Tsao
, Hsin-Min Wang
:
Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes. CoRR abs/2405.04097 (2024)
[i159]Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao:
An Investigation of Incorporating Mamba for Speech Enhancement. CoRR abs/2405.06573 (2024)
[i158]Whenty Ariyanti
, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao:
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer. CoRR abs/2405.08342 (2024)
[i157]Chun Yin, Tai-Shih Chi, Yu Tsao
, Hsin-Min Wang
:
SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models. CoRR abs/2406.08445 (2024)
[i156]Kuan-Chen Wang, You-Jin Li, Wei-Lun Chen
, Yu-Wen Chen, Yi-Ching Wang, Ping-Cheng Yeh, Chao Zhang, Yu Tsao:
Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition. CoRR abs/2406.12699 (2024)
[i155]Wenze Ren, Yi-Cheng Lin, Huang-Cheng Chou, Haibin Wu, Yi-Chiao Wu, Chi-Chun Lee, Hung-yi Lee, Yu Tsao:
EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations. CoRR abs/2407.15458 (2024)
[i154]Muhammad Salman Khan, Moreno La Quatra, Kuo-Hsuan Hung, Szu-Wei Fu, Sabato Marco Siniscalchi, Yu Tsao:
Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement. CoRR abs/2408.04773 (2024)
[i153]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR. CoRR abs/2409.02239 (2024)
[i152]Wen-Chin Huang, Szu-Wei Fu, Erica Cooper, Ryandhimas E. Zezario, Tomoki Toda, Hsin-Min Wang
, Junichi Yamagishi, Yu Tsao
:
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction. CoRR abs/2409.07001 (2024)
[i151]Jiawei Du
, I-Ming Lin, I-Hsiang Chiu
, Xuanjun Chen, Haibin Wu, Wenze Ren, Yu Tsao, Hung-yi Lee, Jyh-Shing Roger Jang:
DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset. CoRR abs/2409.08731 (2024)
[i150]Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe
, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition. CoRR abs/2409.09785 (2024)
[i149]Ryandhimas E. Zezario, Sabato Marco Siniscalchi, Hsin-Min Wang
, Yu Tsao:
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models. CoRR abs/2409.09914 (2024)
[i148]Wenze Ren, Haibin Wu, Yi-Cheng Lin, Xuanjun Chen, Rong Chao, Kuo-Hsuan Hung, You-Jin Li, Wen-Yuan Ting, Hsin-Min Wang
, Yu Tsao:
Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement. CoRR abs/2409.10376 (2024)
[i147]Wenze Ren, Kuo-Hsuan Hung, Rong Chao, You-Jin Li, Hsin-Min Wang
, Yu Tsao:
Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing. CoRR abs/2409.14554 (2024)
[i146]Wen-Yuan Ting, Wenze Ren, Rong Chao, Hsin-Yi Lin, Yu Tsao, Fan-Gang Zeng:
MC-SEMamba: A Simple Multi-channel Extension of SEMamba. CoRR abs/2409.17898 (2024)
[i145]Kuo-Hsuan Hung, Kuan-Chen Wang, Kai-Chun Liu, Wei-Lun Chen
, Xugang Lu, Yu Tsao, Chii-Wann Lin:
MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal. CoRR abs/2409.18828 (2024)
[i144]Kuan-Chen Wang, Kai-Chun Liu, Ping-Cheng Yeh, Sheng-Yu Peng, Yu Tsao
:
TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement. CoRR abs/2410.03843 (2024)
[i143]Pin-Yen Huang
, Szu-Wei Fu, Yu Tsao
:
RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier. CoRR abs/2410.22124 (2024)
[i142]Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao
, Hsin-Min Wang
:
Understanding Audiovisual Deepfake Detection: Techniques, Challenges, Human Factors and Perceptual Insights. CoRR abs/2411.07650 (2024)
[i141]Sahibzada Adil Shahzad, Ammarah Hashmi, Yan-Tsung Peng, Yu Tsao, Hsin-Min Wang
:
How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception. CoRR abs/2411.09266 (2024)
[i140]Yu-Tung Liu, Kuan-Chen Wang, Rong Chao, Sabato Marco Siniscalchi, Ping-Cheng Yeh, Yu Tsao:
MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network. CoRR abs/2411.18902 (2024)
[i139]Jie Lin, I Chiu, Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang
, Ping-Cheng Yeh, Yu Tsao:
MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution. CoRR abs/2412.04861 (2024)- 2023
[j71]Fei Chen, Yu Tsao
:
Advances in biomedical signal processing for communication disorders. Biomed. Signal Process. Control. 80(Part): 104346 (2023)
[j70]Yen-Ju Lu
, Xuankai Chang
, Chenda Li
, Wangyou Zhang
, Samuele Cornell
, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler
, Zhong-Qiu Wang
, Yu Tsao
, Yanmin Qian
, Shinji Watanabe
:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing. J. Open Source Softw. 8(91): 5403 (2023)
[j69]Chin-Yi Cheng
, Hung-Shin Lee
, Yu Tsao
, Hsin-Min Wang
:
Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization. IEEE Signal Process. Lett. 30: 638-642 (2023)
[j68]Ryandhimas E. Zezario
, Szu-Wei Fu
, Fei Chen
, Chiou-Shann Fuh
, Hsin-Min Wang
, Yu Tsao
:
Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features. IEEE ACM Trans. Audio Speech Lang. Process. 31: 54-70 (2023)
[j67]Yen-Ju Lu
, Chia-Yu Chang, Cheng Yu
, Ching-Feng Liu, Jeih-weih Hung
, Shinji Watanabe
, Yu Tsao
:
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2738-2750 (2023)
[j66]Heng-Cheng Kuo, Yu-Peng Hsieh
, Huan-Hsin Tseng
, Chi-Te Wang
, Shih-Hau Fang
, Yu Tsao
:
Toward Real-World Voice Disorder Classification. IEEE Trans. Biomed. Eng. 70(10): 2922-2932 (2023)
[j65]Tsai-Min Chen
, Yuan-Hong Tsai, Huan-Hsin Tseng
, Kai-Chun Liu
, Jhih-Yu Chen, Chih-Han Huang
, Guo-Yuan Li, Chun-Yen Shen, Yu Tsao
:
SRECG: ECG Signal Super-Resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification. IEEE Trans. Consumer Electron. 69(3): 250-260 (2023)
[c227]Hsin-Tien Chiang, Kuo-Hsuan Hung, Szu-Wei Fu, Heng-Cheng Kuo, Ming-Hsueh Tsai, Yu Tsao
:
Study on the Correlation Between Objective Evaluations and Subjective Speech Quality and Intelligibility. ASRU 2023: 1-7
[c226]Erica Cooper, Wen-Chin Huang, Yu Tsao
, Hsin-Min Wang
, Tomoki Toda, Junichi Yamagishi:
The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains. ASRU 2023: 1-7
[c225]Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang
, Tsung-Te Liu, Yu Tsao
:
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models. ASRU 2023: 1-8
[c224]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Cross-Modal Alignment With Optimal Transport For CTC-Based ASR. ASRU 2023: 1-7
[c223]Whenty Ariyanti
, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao:
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer. EMBC 2023: 1-4
[c222]En-Ping Chu, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao
, Chia-Tai Chan:
Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation. EMBC 2023: 1-5
[c221]I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain
, Yu Tsao
, Jen-Cheng Hou:
Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings. ICASSP Workshops 2023: 1-5
[c220]Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao
, Chia-Tai Chan:
Prefallkd: Pre-Impact Fall Detection Via CNN-ViT Knowledge Distillation. ICASSP 2023: 1-5
[c219]Chan-Jan Hsu, Ho-Lam Chung, Hung-Yi Lee, Yu Tsao
:
T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5. ICASSP 2023: 1-5
[c218]Jasper Kirton-Wingate
, Shafique Ahmed, Mandar Gogate, Yu Tsao
, Amir Hussain
:
Towards Individualised Speech Enhancement: An SNR Preference Learning System for Multi-Modal Hearing Aids. ICASSP Workshops 2023: 1-5
[c217]Hsin-Yi Lin
, Huan-Hsin Tseng, Yu Tsao
:
On the Robustness of Non-Intrusive Speech Quality Model by Adversarial Examples. ICASSP 2023: 1-5
[c216]Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao
:
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks. ICASSP 2023: 1-5
[c215]Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen:
D4AM: A General Denoising Framework for Downstream Acoustic Models. ICLR 2023
[c214]Huan-Hsin Tseng, Hsin-Yi Lin, Kuo-Hsuan Hung, Yu Tsao:
Interpretations of Domain Adaptations via Layer Variational Analysis. ICLR 2023
[c213]Li-Wei Chen
, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao
, Hsin-Min Wang
:
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech. INTERSPEECH 2023: 2473-2477
[c212]Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu
, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao
:
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition. INTERSPEECH 2023: 3317-3321
[c211]Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Tai-Shih Chi, Hsin-Min Wang
, Yu Tsao
:
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. INTERSPEECH 2023: 5018-5022
[c210]Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang
, Yu Tsao
, Tai-Shih Chi:
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion. INTERSPEECH 2023: 5023-5026
[c209]Chien-Pin Liu, Ju-Hsuan Li, En-Ping Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao
:
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks. MeMeA 2023: 1-5
[c208]I-Chun Chern, Steffi Chern, Heng-Cheng Kuo, Huan-Hsin Tseng, Kuo-Hsuan Hung, Yu Tsao
:
Voice Direction-Of-Arrival Conversion. MLSP 2023: 1-6
[c207]Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao
:
Inference and Denoise: Causal Inference-Based Neural Speech Enhancement. MLSP 2023: 1-6
[c206]Wen-Yuan Ting, Syu-Siang Wang, Yu Tsao
, Borching Su:
IANS: Intelligibility-Aware Null-Steering Beamforming for Dual-Microphone Arrays. MLSP 2023: 1-6
[c205]Chih-Hsing Chen, Kai-Chun Liu, Ting-Yang Lu
, Chih-Ya Chang, Chia-Tai Chan, Yu Tsao
:
Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning. NER 2023: 1-4
[d2]Ying-Ren Chien
, Po-Heng Chou, You-Jie Peng, Chun-Yuan Huang, Hen-Wai Tsao, Yu Tsao:
Cyclostationary Impulse Noise Dataset. IEEE DataPort, 2023
[d1]Yen-Ju Lu
, Xuankai Chang
, Chenda Li
, Wangyou Zhang
, Samuele Cornell
, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler
, Zhong-Qiu Wang
, Yu Tsao
, Yanmin Qian
, Shinji Watanabe
:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing (espnet-v.202310). Zenodo, 2023
[i138]Yu-Wen Chen, Hsin-Min Wang
, Yu Tsao
:
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm. CoRR abs/2301.04120 (2023)
[i137]Huan-Hsin Tseng, Hsin-Yi Lin
, Kuo-Hsuan Hung, Yu Tsao:
Interpretations of Domain Adaptations via Layer Variational Analysis. CoRR abs/2302.01798 (2023)
[i136]Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao
, Chia-Tai Chan:
PreFallKD: Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation. CoRR abs/2303.03634 (2023)
[i135]Li-Chin Chen, Kuo-Hsuan Hung, Yi-Ju Tseng, Hsin-Yao Wang, Tse-Min Lu, Wei-Chieh Huang, Yu Tsao
:
Self-supervised based general laboratory progress pretrained model for cardiovascular event detection. CoRR abs/2303.06980 (2023)
[i134]Li-Chin Chen, Jung-Nien Lai, Hung-En Lin, Hsien-Te Chen, Kuo-Hsuan Hung, Yu Tsao
:
Preoperative Prognosis Assessment of Lumbar Spinal Surgery for Low Back Pain and Sciatica Patients based on Multimodalities and Multimodal Learning. CoRR abs/2303.09085 (2023)
[i133]Chien-Pin Liu, Ju-Hsuan Li, En-Ping Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao
:
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks. CoRR abs/2304.06335 (2023)
[i132]Enoch Hsin-Ho Huang, Rong Chao, Yu Tsao, Chao-Min Wu:
ElectrodeNet - A Deep Learning Based Sound Coding Strategy for Cochlear Implants. CoRR abs/2305.16753 (2023)
[i131]Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi:
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion. CoRR abs/2306.06652 (2023)
[i130]Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Yu Tsao, Tai-Shih Chi, Hsin-Min Wang:
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. CoRR abs/2306.06653 (2023)
[i129]Li-Chin Chen, Yi-Heng Lin, Li-Ning Peng, Feng-Ming Wang, Yu-Hsin Chen, Po-Hsun Huang, Shang-Feng Yang, Yu Tsao:
Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula. CoRR abs/2306.06865 (2023)
[i128]Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model. CoRR abs/2308.09262 (2023)
[i127]Yu-Wen Chen, Julia Hirschberg, Yu Tsao
:
Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement. CoRR abs/2309.01164 (2023)
[i126]Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. CoRR abs/2309.09548 (2023)
[i125]Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu
, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao
, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. CoRR abs/2309.10787 (2023)
[i124]Shafique Ahmed, Chia-Wei Chen, Wenze Ren, Chin-Jou Li, Ernie Chu, Jun-Cheng Chen, Amir Hussain
, Hsin-Min Wang
, Yu Tsao
, Jen-Cheng Hou:
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement. CoRR abs/2309.11059 (2023)
[i123]Ryandhimas E. Zezario, Yu-Wen Chen, Szu-Wei Fu, Yu Tsao
, Hsin-Min Wang
, Chiou-Shann Fuh:
A Study on Incorporating Whisper for Robust Speech Assessment. CoRR abs/2309.12766 (2023)
[i122]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Cross-modal Alignment with Optimal Transport for CTC-based ASR. CoRR abs/2309.13650 (2023)
[i121]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR. CoRR abs/2309.16093 (2023)
[i120]Ammarah Hashmi, Sahibzada Adil Shahzad
, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang
:
AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection. CoRR abs/2310.13103 (2023)
[i119]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Neural domain alignment for spoken language recognition based on optimal transport. CoRR abs/2310.13471 (2023)
[i118]Sahibzada Adil Shahzad
, Ammarah Hashmi, Yan-Tsung Peng, Yu Tsao, Hsin-Min Wang
:
AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection. CoRR abs/2311.02733 (2023)
[i117]Hsin-Tien Chiang, Szu-Wei Fu, Hsin-Min Wang, Yu Tsao, John H. L. Hansen:
Multi-objective Non-intrusive Hearing-aid Speech Assessment Model. CoRR abs/2311.08878 (2023)
[i116]Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao
:
Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice. CoRR abs/2311.15582 (2023)
[i115]Chi-Chang Lee, Yu Tsao
, Hsin-Min Wang
, Chu-Song Chen:
D4AM: A General Denoising Framework for Downstream Acoustic Models. CoRR abs/2311.16595 (2023)
[i114]Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang
, Tsung-Te Liu, Yu Tsao
:
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models. CoRR abs/2311.16604 (2023)
[i113]Haibin Wu, Heng-Cheng Kuo, Yu Tsao
, Hung-yi Lee:
Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification. CoRR abs/2312.08622 (2023)- 2022
[j64]Yu-Wen Chen
, Kuo-Hsuan Hung, You-Jin Li, Alexander Chao-Fu Kang
, Ya-Hsin Lai
, Kai-Chun Liu
, Szu-Wei Fu, Syu-Siang Wang
, Yu Tsao
:
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application. IEEE Access 10: 46082-46099 (2022)
[j63]Yi Lin, Yu Tsao
, Po-Jang Hsieh:
Neural correlates of individual differences in predicting ambiguous sounds comprehension level. NeuroImage 251: 119012 (2022)
[j62]Cheng-Hung Hu
, Yu-Huai Peng, Junichi Yamagishi
, Yu Tsao
, Hsin-Min Wang
:
SVSNet: An End-to-End Speaker Voice Similarity Assessment Model. IEEE Signal Process. Lett. 29: 767-771 (2022)
[j61]Lichin Chen
, Po-Hsun Chen, Richard Tzong-Han Tsai
, Yu Tsao
:
EPG2S: Speech Generation and Speech Enhancement Based on Electropalatography and Audio Signals Using Multimodal Learning. IEEE Signal Process. Lett. 29: 2582-2586 (2022)
[j60]Tassadaq Hussain
, Wei-Chien Wang, Mandar Gogate, Kia Dashtipour
, Yu Tsao
, Xugang Lu
, Ahsan Adeel, Amir Hussain
:
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement. IEEE Trans. Artif. Intell. 3(5): 833-842 (2022)
[j59]Kai-Chun Liu
, Kuo-Hsuan Hung, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao
:
Deep-Learning-Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems. IEEE Trans. Cogn. Dev. Syst. 14(3): 1270-1281 (2022)
[j58]Yu-Chen Lin
, Cheng Yu, Yi-Te Hsu, Szu-Wei Fu
, Yu Tsao
, Tei-Wei Kuo
:
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1016-1031 (2022)
[j57]Shang-Yi Chuang
, Hsin-Min Wang
, Yu Tsao
:
Improved Lite Audio-Visual Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1345-1359 (2022)
[c204]Chan-Jan Hsu, Hung-yi Lee, Yu Tsao
:
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding. ACL (2) 2022: 479-489
[c203]Syu-Siang Wang, Yu Tsao
, Wei-Zhong Zheng, Hsiu-Wei Yeh, Pei-Chun Li, Shih-Hau Fang, Ying-Hui Lai:
Dysarthric Speech Enhancement Based on Convolution Neural Network. EMBC 2022: 60-64
[c202]Tassadaq Hussain
, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao
, Amir Hussain
:
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning. EMBC 2022: 2581-2584
[c201]Zicheng Feng, Yu Tsao, Fei Chen:
Recurrent Neural Network-based Estimation and Correction of Relative Transfer Function for Preserving Spatial Cues in Speech Separation. EUSIPCO 2022: 155-159
[c200]Bo-Rong Chen
, Hsin-Tien Chiang, Heng-Cheng Kuo, Yu Tsao
, Yih-Chun Hu:
Key Generation with Ambient Audio. GLOBECOM 2022: 5510-5515
[c199]Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao
, Tei-Wei Kuo
:
Speech Recovery For Real-World Self-Powered Intermittent Devices. ICASSP 2022: 26-30
[c198]Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang
, Yu Tsao
:
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement. ICASSP 2022: 1116-1120
[c197]Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe
, Alexander Richard, Cheng Yu, Yu Tsao
:
Conditional Diffusion Probabilistic Model for Speech Enhancement. ICASSP 2022: 7402-7406
[c196]Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao
:
MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation Based Only on Noisy/ Reverberated Speech. ICASSP 2022: 7412-7416
[c195]Guan-Ting Lin, Chan-Jan Hsu, Da-Rong Liu, Hung-Yi Lee, Yu Tsao
:
Analyzing The Robustness of Unsupervised Speech Recognition. ICASSP 2022: 8202-8206
[c194]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao
, Pin-Yu Chen:
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing. ICASSP 2022: 8602-8606
[c193]Haibin Wu, Heng-Cheng Kuo, Naijun Zheng
, Kuo-Hsuan Hung, Hung-Yi Lee, Yu Tsao
, Hsin-Min Wang
, Helen Meng:
Partially Fake Audio Detection by Self-Attention-Based Fake Span Discovery. ICASSP 2022: 9236-9240
[c192]Kuo-Hsuan Hung, Szu-Wei Fu, Huan-Hsin Tseng, Hsin-Tien Chiang, Yu Tsao
, Chii-Wann Lin:
Boosting Self-Supervised Embeddings for Speech Enhancement. INTERSPEECH 2022: 186-190
[c191]Chiang-Jen Peng, Yun-Ju Chan, Yih-Liang Shen, Cheng Yu, Yu Tsao
, Tai-Shih Chi:
Perceptual Characteristics Based Multi-objective Model for Speech Enhancement. INTERSPEECH 2022: 211-215
[c190]Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao
, Mirco Ravanelli:
OSSEM: one-shot speaker adaptive speech enhancement using meta learning. INTERSPEECH 2022: 981-985
[c189]Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang
, Yu Tsao
:
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling. INTERSPEECH 2022: 1183-1187
[c188]Yu-Wen Chen, Yu Tsao
:
InQSS: a speech intelligibility and quality assessment model using a multi-task learning network. INTERSPEECH 2022: 3088-3092
[c187]Ryandhimas Edo Zezario
, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. INTERSPEECH 2022: 3944-3948
[c186]Wen-Chin Huang, Erica Cooper, Yu Tsao
, Hsin-Min Wang
, Tomoki Toda, Junichi Yamagishi:
The VoiceMOS Challenge 2022. INTERSPEECH 2022: 4536-4540
[c185]Fan-Lin Wang, Hung-Shin Lee, Yu Tsao
, Hsin-Min Wang
:
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks. INTERSPEECH 2022: 5343-5347
[c184]Rong Chao, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao
:
Perceptual Contrast Stretching on Target Feature for Speech Enhancement. INTERSPEECH 2022: 5448-5452
[c183]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell
, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao
, Yanmin Qian, Shinji Watanabe
:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. INTERSPEECH 2022: 5458-5462
[c182]Ryandhimas Edo Zezario
, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model. INTERSPEECH 2022: 5463-5467
[c181]Hung-Shin Lee, Pin-Yuan Chen, Yao-Fei Cheng, Yu Tsao
, Hsin-Min Wang
:
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition. ISCSLP 2022: 145-149
[c180]Wen-Yuan Ting, Syu-Siang Wang
, Hsin-Li Chang, Borching Su, Yu Tsao
:
Speech Enhancement Based on CycleGAN with Noise-informed Training. ISCSLP 2022: 155-159
[c179]Zicheng Feng, Yu Tsao
, Fei Chen:
Preservation Of Interaural Level Difference Cue In A Deep Learning-Based Speech Separation System For Bilateral And Bimodal Cochlear Implants Users. IWAENC 2022: 1-5
[c178]Shang-Bao Luo, Cheng-Chung Fan, Kuan-Yu Chen, Yu Tsao, Hsin-Min Wang, Keh-Yih Su:
Chinese Movie Dialogue Question Answering Dataset. ROCLING 2022: 7-14
[i112]Tassadaq Hussain, Wei-Chien Wang, Mandar Gogate, Kia Dashtipour, Yu Tsao, Xugang Lu, Ahsan Adeel, Amir Hussain:
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement. CoRR abs/2201.09913 (2022)
[i111]Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao:
Conditional Diffusion Probabilistic Model for Speech Enhancement. CoRR abs/2202.05256 (2022)
[i110]Tassadaq Hussain, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao, Amir Hussain:
A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning. CoRR abs/2202.05756 (2022)
[i109]Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang, Yu Tsao:
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement. CoRR abs/2202.06507 (2022)
[i108]Haibin Wu, Heng-Cheng Kuo, Naijun Zheng, Kuo-Hsuan Hung, Hung-Yi Lee, Yu Tsao, Hsin-Min Wang, Helen Meng:
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery. CoRR abs/2202.06684 (2022)
[i107]Syu-Siang Wang, Chi-Te Wang, Chih-Chung Lai, Yu Tsao, Shih-Hau Fang:
Continuous Speech for Improved Learning Pathological Voice Disorders. CoRR abs/2202.10777 (2022)
[i106]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao
, Pin-Yu Chen:
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing. CoRR abs/2203.03550 (2022)
[i105]Wen-Chin Huang, Erica Cooper, Yu Tsao
, Hsin-Min Wang
, Tomoki Toda, Junichi Yamagishi:
The VoiceMOS Challenge 2022. CoRR abs/2203.11389 (2022)
[i104]Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao
, Hsin-Min Wang
:
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition. CoRR abs/2203.13696 (2022)
[i103]Hung-Shin Lee, Yu Tsao
, Shyh-Kang Jeng, Hsin-Min Wang
:
Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition. CoRR abs/2203.15576 (2022)
[i102]Chin-Yi Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang
:
Multi-Target Filter and Detector for Speaker Diarization. CoRR abs/2203.16007 (2022)
[i101]Fan-Lin Wang, Hung-Shin Lee, Yu Tsao
, Hsin-Min Wang
:
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks. CoRR abs/2203.16040 (2022)
[i100]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Partial Coupling of Optimal Transport for Spoken Language Identification. CoRR abs/2203.17036 (2022)
[i99]Rong Chao, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao
:
Perceptual Contrast Stretching on Target Feature for Speech Enhancement. CoRR abs/2203.17152 (2022)
[i98]Chiang-Lin Tai, Hung-Shin Lee, Yu Tsao
, Hsin-Min Wang
:
Filter-based Discriminative Autoencoders for Children Speech Recognition. CoRR abs/2204.00164 (2022)
[i97]Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. CoRR abs/2204.03305 (2022)
[i96]Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model. CoRR abs/2204.03310 (2022)
[i95]Shih-Kuang Lee, Yu Tsao
, Hsin-Min Wang
:
A Study of Using Cepstrogram for Countermeasure Against Replay Attacks. CoRR abs/2204.04333 (2022)
[i94]Chan-Jan Hsu, Hung-yi Lee, Yu Tsao
:
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding. CoRR abs/2204.07316 (2022)
[i93]Li-Chin Chen, Po-Hsun Chen, Richard Tzong-Han Tsai, Yu Tsao
:
EPG2S: Speech Generation and Speech Enhancement based on Electropalatography and Audio Signals using Multimodal Learning. CoRR abs/2206.07860 (2022)
[i92]Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang
, Yu Tsao
:
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling. CoRR abs/2206.09058 (2022)
[i91]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell
, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao
, Yanmin Qian, Shinji Watanabe
:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. CoRR abs/2207.09514 (2022)
[i90]Yin-Ping Cho, Yu Tsao
, Hsin-Min Wang
, Yi-Wen Liu:
Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN. CoRR abs/2209.10446 (2022)
[i89]Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao
:
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks. CoRR abs/2210.13271 (2022)
[i88]Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
A Teacher-student Framework for Unsupervised Speech Enhancement Using Noise Remixing Training and Two-stage Inference. CoRR abs/2210.15368 (2022)
[i87]Fan-Lin Wang, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao
, Hsin-Min Wang
:
CasNet: Investigating Channel Robustness for Speech Separation. CoRR abs/2210.15370 (2022)
[i86]I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain, Yu Tsao, Jen-Cheng Hou:
Audio-Visual Speech Enhancement and Separation by Leveraging Multi-Modal Self-Supervised Embeddings. CoRR abs/2210.17456 (2022)
[i85]Chan-Jan Hsu, Ho-Lam Chung, Hung-yi Lee, Yu Tsao
:
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5. CoRR abs/2211.00586 (2022)
[i84]Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao
:
Inference and Denoise: Causal Inference-based Neural Speech Enhancement. CoRR abs/2211.01189 (2022)
[i83]Hsin-Yi Lin, Huan-Hsin Tseng, Yu Tsao
:
On the robustness of non-intrusive speech quality model by adversarial examples. CoRR abs/2211.06508 (2022)- 2021
[j56]Tzu-Hao Lin
, Tomonari Akamatsu
, Yu Tsao
:
Sensing ecosystem dynamics via audio source separation: A case study of marine soundscapes off northeastern Taiwan. PLoS Comput. Biol. 17(2) (2021)
[j55]Fatma S. Abousaleh
, Wen-Huang Cheng
, Neng-Hao Yu
, Yu Tsao
:
Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media. IEEE Trans. Cogn. Dev. Syst. 13(3): 679-692 (2021)
[j54]Rung-Yu Tseng
, Taowei Wang, Szu-Wei Fu
, Chia-Ying Lee, Yu Tsao
:
A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation. IEEE Trans. Cogn. Dev. Syst. 13(4): 984-994 (2021)
[j53]Xugang Lu
, Peng Shen
, Yu Tsao
, Hisashi Kawai:
Coupling a Generative Model With a Discriminative Learning Framework for Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3631-3641 (2021)
[j52]Shintami Chusnul Hidayati, Ting Wei Goh, Ji-Sheng Gary Chan
, Cheng-Chun Hsu, John See
, Lai-Kuan Wong
, Kai-Lung Hua
, Yu Tsao
, Wen-Huang Cheng
:
Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes. IEEE Trans. Multim. 23: 365-377 (2021)
[c177]Yen-Ju Lu, Yu Tsao, Shinji Watanabe:
A Study on Speech Enhancement Based on Diffusion Probabilistic Model. APSIPA ASC 2021: 659-666
[c176]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification. APSIPA ASC 2021: 769-774
[c175]Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion. APSIPA ASC 2021: 1234-1238
[c174]Zicheng Feng, Yu Tsao, Fei Chen:
Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues. APSIPA ASC 2021: 1239-1244
[c173]You-Jin Li, Syu-Siang Wang, Yu Tsao, Borching Su:
MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder. APSIPA ASC 2021: 1245-1250
[c172]Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-Wen Yang, Yu Tsao
, Hung-yi Lee, Shinji Watanabe
:
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition. ASRU 2021: 228-235
[c171]Ming-Chi Yen, Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng, Shu-Wei Tsai, Yu Tsao
, Tomoki Toda, Jyh-Shing Roger Jang, Hsin-Min Wang
:
Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling. ASRU 2021: 650-657
[c170]Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao:
HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network. ASRU 2021: 907-913
[c169]Ting-Yang Lu
, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao
, Chia-Tai Chan:
Instrumented shoulder functional assessment using inertial measurement units for frozen shoulder. BHI 2021: 1-4
[c168]Ryandhimas E. Zezario
, Chiou-Shann Fuh, Hsin-Min Wang
, Yu Tsao
:
Speech Enhancement with Zero-Shot Model Selection. EUSIPCO 2021: 491-495
[c167]Yu-Wen Chen, Kuo-Hsuan Hung, Shang-Yi Chuang, Jonathan Sherman, Xugang Lu, Yu Tsao
:
A Study of Incorporating Articulatory Movement Information in Speech Enhancement. EUSIPCO 2021: 496-500
[c166]Yuan-Kuei Wu
, Kuan-Po Huang, Yu Tsao
, Hung-yi Lee:
One Shot Learning for Speech Separation. ICASSP 2021: 5769-5773
[c165]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Unsupervised Neural Adaptation Model Based on Optimal Transport for Spoken Language Identification. ICASSP 2021: 7213-7217
[c164]Tsun-An Hsieh, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao
:
Improving Perceptual Quality by Phone-Fortified Perceptual Loss Using Wasserstein Distance for Speech Enhancement. Interspeech 2021: 196-200
[c163]Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao
:
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement. Interspeech 2021: 201-205
[c162]Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng, Ching-Feng Liu, Yu Tsao
, Hsin-Min Wang
, Tomoki Toda:
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion. Interspeech 2021: 1329-1333
[c161]Gang-Xuan Lin, Shih-Wei Hu, Yen-Ju Lu, Yu Tsao
, Chun-Shien Lu:
QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization. Interspeech 2021: 1639-1643
[c160]Yi-Chiao Wu, Cheng-Hung Hu, Hung-Shin Lee, Yu-Huai Peng, Wen-Chin Huang, Yu Tsao
, Hsin-Min Wang
, Tomoki Toda:
Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder. Interspeech 2021: 3630-3634
[c159]Yu-Wen Chen, Kuo-Hsuan Hung, Shang-Yi Chuang, Jonathan Sherman, Wen-Chin Huang, Xugang Lu, Yu Tsao
:
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System. ISCAS 2021: 1-5
[c158]Chiang-Jen Peng, Yun-Ju Chan, Cheng Yu, Syu-Siang Wang
, Yu Tsao
, Tai-Shih Chi:
Attention-Based Multi-Task Learning for Speech-Enhancement and Speaker-Identification in Multi-Speaker Dialogue Scenario. ISCAS 2021: 1-5
[c157]Yu-Tao Chang, Yuan-Hong Yang, Yu-Huai Peng, Syu-Siang Wang
, Tai-Shih Chi, Yu Tsao
, Hsin-Min Wang
:
MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration. ISCSLP 2021: 1-5
[c156]Lichin Chen, Ji-Tian Sheu, Yu Tsao
, Yuh-Jue Chuang:
Deep Learning and Explainable Artificial Intelligence to Predict Patients' Choice of Hospital Levels in Urban and Rural Areas. MedInfo 2021: 734-738
[c155]Hsin-Yi Lin, Huan-Hsin Tseng, Xugang Lu, Yu Tsao:
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport. NeurIPS 2021: 19935-19946
[c154]Md Mahbub E. Noor
, Yen-Ju Lu, Syu-Siang Wang
, Supratip Ghose, Chia-Yu Chang, Ryandhimas E. Zezario
, Shafique Ahmed, Wei-Ho Chung
, Yu Tsao
, Hsin-Min Wang
:
Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions. O-COCOSDA 2021: 7-12
[c153]Cheng-Chung Fan, Chia-Chih Kuo, Shang-Bao Luo, Pei-Jun Liao, Kuang-Yu Chang, Chiao-Wei Hsu, Meng-Tse Wu, Shih-Hong Tsai, Tzu-Man Wu, Aleksandra Smolka, Chao-Chun Liang, Hsin-Min Wang, Kuan-Yu Chen, Yu Tsao, Keh-Yih Su:
A Flexible and Extensible Framework for Multiple Answer Modes Question Answering. ROCLING 2021: 33-42
[i82]Chiang-Jen Peng, Yun-Ju Chan, Cheng Yu, Syu-Siang Wang, Yu Tsao, Tai-Shih Chi:
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario. CoRR abs/2101.02550 (2021)
[i81]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification. CoRR abs/2101.03329 (2021)
[i80]Yu-Wen Chen, Kuo-Hsuan Hung, Shang-Yi Chuang, Jonathan Sherman, Wen-Chin Huang, Xugang Lu, Yu Tsao:
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System. CoRR abs/2102.03786 (2021)
[i79]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification. CoRR abs/2104.03004 (2021)
[i78]Cheng-Hung Hu, Yi-Chiao Wu, Wen-Chin Huang, Yu-Huai Peng, Yu-Wen Chen, Pin-Jui Ku, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
The AS-NU System for the M2VoC Challenge. CoRR abs/2104.03009 (2021)
[i77]Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao:
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement. CoRR abs/2104.03538 (2021)
[i76]Fatma S. Abousaleh, Wen-Huang Cheng, Neng-Hao Yu, Yu Tsao:
Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media. CoRR abs/2105.08809 (2021)
[i75]Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng, Ching-Feng Liu, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion. CoRR abs/2106.01415 (2021)
[i74]Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao, Tei-Wei Kuo:
Intermittent Speech Recovery. CoRR abs/2106.05229 (2021)
[i73]Cheng-Hung Hu, Yu-Huai Peng, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model. CoRR abs/2107.09392 (2021)
[i72]Yen-Ju Lu, Yu Tsao, Shinji Watanabe:
A Study on Speech Enhancement Based on Diffusion Probabilistic Model. CoRR abs/2107.11876 (2021)
[i71]Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion. CoRR abs/2109.03551 (2021)
[i70]Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao:
A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming. CoRR abs/2110.03894 (2021)
[i69]Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-Wen Yang, Yu Tsao, Hung-yi Lee, Shinji Watanabe:
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition. CoRR abs/2110.04590 (2021)
[i68]Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao:
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech. CoRR abs/2110.05866 (2021)
[i67]Yun-Ju Chan, Chiang-Jen Peng, Syu-Siang Wang, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi:
Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments. CoRR abs/2110.09923 (2021)
[i66]Wen-Yuan Ting, Syu-Siang Wang, Hsin-Li Chang, Borching Su, Yu Tsao:
Speech Enhancement Based on Cyclegan with Noise-informed Training. CoRR abs/2110.09924 (2021)
[i65]Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features. CoRR abs/2111.02363 (2021)
[i64]Yu-Wen Chen, Yu Tsao:
InQSS: a speech intelligibility assessment model using a multi-task learning network. CoRR abs/2111.02585 (2021)
[i63]Yu-Chen Lin, Cheng Yu, Yi-Te Hsu, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points. CoRR abs/2111.04436 (2021)
[i62]Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao:
HASA-net: A non-intrusive hearing-aid speech assessment network. CoRR abs/2111.05691 (2021)
[i61]Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao, Mirco Ravanelli:
OSSEM: one-shot speaker adaptive speech enhancement using meta learning. CoRR abs/2111.05703 (2021)
[i60]Hsin-Yi Lin, Huan-Hsin Tseng, Xugang Lu, Yu Tsao:
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport. CoRR abs/2111.06316 (2021)
[i59]Heng-Cheng Kuo, Yu-Peng Hsieh, Huan-Hsin Tseng, Chi-Tei Wang, Shih-Hau Fang, Yu Tsao:
Toward Real-World Pathological Voice Detection. CoRR abs/2112.02538 (2021)
[i58]Lichin Chen, Ji-Tian Sheu, Yuh-Jue Chuang, Yu Tsao:
Predicting the Travel Distance of Patients to Access Healthcare using Deep Neural Networks. CoRR abs/2112.03541 (2021)- 2020
[j51]Francesco Potortì
, Sangjoon Park, Antonino Crivello
, Filippo Palumbo
, Michele Girolami
, Paolo Barsocchi
, Soyeon Lee
, Joaquín Torres-Sospedra
, Antonio Ramón Jiménez Ruiz
, Antoni Pérez-Navarro
, Germán Martín Mendoza-Silva, Fernando Seco
, Miguel Ortiz
, Johan Perul, Valérie Renaudin
, Hyunwoong Kang
, Soyoung Park
, Jae Hong Lee
, Chan Gook Park
, Jisu Ha, Jaeseung Han, Changjun Park, Keunhye Kim, Yonghyun Lee, Seunghun Gye, Keumryeol Lee
, Eun-Jee Kim, Jeongsik Choi
, Yang-Seok Choi
, Shilpa Talwar
, Seong Yun Cho
, Boaz Ben-Moshe, Alex Scherbakov
, Leonid Antsfeld, Emilio Sansano-Sansano
, Boris Chidlovskii
, Nikolai Kronenwett, Silvia Prophet, Yael Landay, Revital Marbel
, Lingxiang Zheng, Ao Peng
, Zhichao Lin, Bang Wu
, Chengqi Ma
, Stefan Poslad
, David R. Selviah
, Wei Wu, Zixiang Ma, Wenchao Zhang
, Dongyan Wei, Hong Yuan
, Jun-Bang Jiang, Shao-Yung Huang, Jing-Wen Liu, Kuan-Wu Su, Jenq-Shiou Leu
, Kazuki Nishiguchi, Walid Bousselham, Hideaki Uchiyama
, Diego Thomas
, Atsushi Shimada
, Rin-Ichiro Taniguchi
, Vicente Cortés Puschel, Tomás Lungenstrass Poulsen
, Imran Ashraf
, Chanseok Lee, Muhammad Usman Ali
, Yeongjun Im, Gunzung Kim
, Jeongsook Eom, Soojung Hur, Yongwan Park, Miroslav Opiela
, Adriano J. C. Moreira
, Maria João Nicolau
, Cristiano G. Pendão
, Ivo Silva
, Filipe Meneses
, António Costa
, Jens Trogh
, David Plets
, Ying-Ren Chien
, Tzu-Yu Chang, Shih-Hau Fang
, Yu Tsao
:
The IPIN 2019 Indoor Localisation Competition - Description and Results. IEEE Access 8: 206674-206718 (2020)
[j50]Xin Wang
, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado
, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah
, Ville Vestman, Tomi Kinnunen, Kong Aik Lee
, Lauri Juvela
, Paavo Alku
, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao
, Hsin-Min Wang
, Sébastien Le Maguer
, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020)
[j49]Szu-Wei Fu
, Chien-Feng Liao, Yu Tsao
:
Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality. IEEE Signal Process. Lett. 27: 26-30 (2020)
[j48]Cheng Yu, Kuo-Hsuan Hung, Syu-Siang Wang
, Yu Tsao
, Jeih-weih Hung:
Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement. IEEE Signal Process. Lett. 27: 1035-1039 (2020)
[j47]Tsun-An Hsieh, Hsin-Min Wang
, Xugang Lu, Yu Tsao
:
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-End Speech Enhancement. IEEE Signal Process. Lett. 27: 2149-2153 (2020)
[j46]Tassadaq Hussain
, Sabato Marco Siniscalchi
, Hsiao-Lan Sharon Wang
, Yu Tsao
, Valerio Mario Salerno
, Wen-Hung Liao
:
Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation. IEEE Trans. Cogn. Dev. Syst. 12(4): 744-758 (2020)
[j45]Chang-Le Liu, Sze-Wei Fu, You-Jin Li, Jen-Wei Huang
, Hsin-Min Wang
, Yu Tsao
:
Multichannel Speech Enhancement by Raw Waveform-Mapping Using Fully Convolutional Networks. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1888-1900 (2020)
[j44]Cheng Yu, Ryandhimas E. Zezario
, Syu-Siang Wang
, Jonathan Sherman, Yi-Yen Hsieh
, Xugang Lu, Hsin-Min Wang
, Yu Tsao
:
Speech Enhancement Based on Denoising Autoencoder With Multi-Branched Encoders. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2756-2769 (2020)
[j43]Hung-Shin Lee
, Yu Tsao
, Shyh-Kang Jeng
, Hsin-Min Wang
:
Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 3065-3079 (2020)
[j42]Wen-Chin Huang
, Hao Luo
, Hsin-Te Hwang, Chen-Chou Lo
, Yu-Huai Peng, Yu Tsao
, Hsin-Min Wang
:
Unsupervised Representation Disentanglement Using Cross Domain Features and Adversarial Learning in Variational Autoencoder Based Voice Conversion. IEEE Trans. Emerg. Top. Comput. Intell. 4(4): 468-479 (2020)
[j41]Kun-Hsi Tsai, Wei-Chien Wang, Chui-Hsuan Cheng, Chan-Yen Tsai
, Jou-Kou Wang
, Tzu-Hao Lin
, Shih-Hau Fang
, Lichin Chen
, Yu Tsao
:
Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder. IEEE J. Biomed. Health Informatics 24(11): 3203-3214 (2020)
[c152]Szu-Wei Fu, Chien-Feng Liao, Tsun-An Hsieh, Kuo-Hsuan Hung, Syu-Siang Wang, Cheng Yu, Heng-Cheng Kuo, Ryandhimas E. Zezario, You-Jin Li, Shang-Yi Chuang, Yen-Ju Lu, Yu-Chen Lin, Yu Tsao:
Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing. APSIPA 2020: 455-459
[c151]Ryandhimas E. Zezario, Szu-Wei Fu, Chiou-Shann Fuh, Yu Tsao, Hsin-Min Wang:
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model. APSIPA 2020: 482-486
[c150]Yu-Huai Peng, Cheng-Hung Hu, Alexander Chao-Fu Kang, Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
The Academia Sinica Systems of Voice Conversion for VCC2020. Blizzard Challenge / Voice Conversion Challenge 2020
[c149]Chi-Lun Lin, Kate Ching-Ju Lin
, Chi-Cheng Lee, Yu Tsao
:
Cross-Technology Interference Mitigation Using Fully Convolutional Denoising Autoencoders. GLOBECOM 2020: 1-6
[c148]Ryandhimas E. Zezario
, Tassadaq Hussain
, Xugang Lu, Hsin-Min Wang
, Yu Tsao
:
Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement. ICASSP 2020: 6669-6673
[c147]Chen-Li Lin, Zi-Qiang Lin, Syu-Siang Wang
, Yu Tsao
, Jeih-Weih Hung:
Exponentiated magnitude spectrogram-based relative-to-maximum masking for speech enhancement in adverse environments. ICCE-TW 2020: 1-2
[c146]Chih-Wei Wu, Chih-Ting Liu, Wei-Chih Tu, Yu Tsao
, Yu-Chiang Frank Wang, Shao-Yi Chien:
Space-Time Guided Association Learning For Unsupervised Person Re-Identification. ICIP 2020: 2261-2265
[c145]Shang-Yi Chuang, Yu Tsao
, Chen-Chou Lo
, Hsin-Min Wang
:
Lite Audio-Visual Speech Enhancement. INTERSPEECH 2020: 1131-1135
[c144]Haoyu Li, Szu-Wei Fu, Yu Tsao
, Junichi Yamagishi:
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning. INTERSPEECH 2020: 1336-1340
[c143]Yen-Ju Lu, Chien-Feng Liao, Xugang Lu, Jeih-weih Hung, Yu Tsao
:
Incorporating Broad Phonetic Information for Speech Enhancement. INTERSPEECH 2020: 2417-2421
[c142]Chi-Chang Lee, Yu-Chen Lin, Hsuan-Tien Lin, Hsin-Min Wang
, Yu Tsao
:
SERIL: Noise Adaptive Speech Enhancement Using Regularization-Based Incremental Learning. INTERSPEECH 2020: 2432-2436
[c141]Chen-Yu Chen, Wei-Zhong Zheng, Syu-Siang Wang
, Yu Tsao
, Pei-Chun Li, Ying-Hui Lai:
Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-Based Voice Conversion System. INTERSPEECH 2020: 4686-4690
[i57]Cheng Yu, Ryandhimas E. Zezario, Jonathan Sherman, Yi-Yen Hsieh, Xugang Lu, Hsin-Min Wang, Yu Tsao:
Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders. CoRR abs/2001.01538 (2020)
[i56]Wen-Chin Huang, Hao Luo, Hsin-Te Hwang, Chen-Chou Lo, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang:
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion. CoRR abs/2001.07849 (2020)
[i55]Haoyu Li, Szu-Wei Fu, Yu Tsao, Junichi Yamagishi:
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning. CoRR abs/2004.00932 (2020)
[i54]Tsun-An Hsieh, Hsin-Min Wang, Xugang Lu, Yu Tsao:
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement. CoRR abs/2004.04098 (2020)
[i53]Yuan-Kuei Wu, Chao-I Tuan, Hung-yi Lee, Yu Tsao:
SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning. CoRR abs/2005.09966 (2020)
[i52]Chi-Chang Lee, Yu-Chen Lin, Hsuan-Tien Lin, Hsin-Min Wang, Yu Tsao:
SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning. CoRR abs/2005.11760 (2020)
[i51]Shang-Yi Chuang
, Yu Tsao, Chen-Chou Lo, Hsin-Min Wang:
Lite Audio-Visual Speech Enhancement. CoRR abs/2005.11769 (2020)
[i50]Szu-Wei Fu, Chien-Feng Liao, Tsun-An Hsieh, Kuo-Hsuan Hung, Syu-Siang Wang, Cheng Yu, Heng-Cheng Kuo, Ryandhimas E. Zezario, You-Jin Li, Shang-Yi Chuang, Yen-Ju Lu, Yu Tsao:
Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing. CoRR abs/2006.10296 (2020)
[i49]Lichin Chen
, Yu Tsao, Ji-Tian Sheu:
Using Deep Learning and Explainable Artificial Intelligence in Patients' Choices of Hospital Levels. CoRR abs/2006.13427 (2020)
[i48]Yen-Ju Lu, Chien-Feng Liao, Xugang Lu, Jeih-weih Hung, Yu Tsao:
Incorporating Broad Phonetic Information for Speech Enhancement. CoRR abs/2008.07618 (2020)
[i47]Alexander Chao-Fu Kang, Kuo-Hsuan Hung, Yu-Wen Chen, You-Jin Li, Ya-Hsin Lai, Kai-Chun Liu, Sze-Wei Fu, Syu-Siang Wang, Yu Tsao:
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application. CoRR abs/2008.09264 (2020)
[i46]Shang-Yi Chuang, Hsin-Min Wang, Yu Tsao:
Improved Lite Audio-Visual Speech Enhancement. CoRR abs/2008.13222 (2020)
[i45]Yu-Huai Peng, Cheng-Hung Hu, Alexander Chao-Fu Kang, Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
The Academia Sinica Systems of Voice Conversion for VCC2020. CoRR abs/2010.02669 (2020)
[i44]Tsun-An Hsieh, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao:
Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement. CoRR abs/2010.15174 (2020)
[i43]Ryandhimas E. Zezario, Szu-Wei Fu, Chiou-Shann Fuh, Yu Tsao, Hsin-Min Wang:
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model. CoRR abs/2011.04292 (2020)
[i42]Yen-Ju Lu, Chia-Yu Chang, Yu Tsao, Jeih-weih Hung:
Speech enhancement guided by contextual articulatory information. CoRR abs/2011.07442 (2020)
[i41]Yuan-Kuei Wu, Kuan-Po Huang
, Yu Tsao, Hung-yi Lee:
One Shot Learning for Speech Separation. CoRR abs/2011.10233 (2020)
[i40]Kai-Chun Liu, Kuo-Hsuan Hung, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao:
Deep Learning Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems. CoRR abs/2012.03426 (2020)
[i39]Tsai-Min Chen, Yuan-Hong Tsai, Huan-Hsin Tseng, Jhih-Yu Chen, Chih-Han Huang, Guo-Yuan Li, Chun-Yen Shen, Yu Tsao:
ECG Signal Super-resolution by Considering Reconstruction and Cardiac Arrhythmias Classification Loss. CoRR abs/2012.03803 (2020)
[i38]Ryandhimas E. Zezario, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Speech Enhancement with Zero-Shot Model Selection. CoRR abs/2012.09359 (2020)
[i37]Kai-Chun Liu, Michael Chan, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao:
Domain-adaptive Fall Detection Using Deep Adversarial Training. CoRR abs/2012.10911 (2020)
[i36]Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Unsupervised neural adaptation model based on optimal transport for spoken language identification. CoRR abs/2012.13152 (2020)
2010 – 2019
- 2019
[j40]Hsin-Tien Chiang
, Yi-Yen Hsieh
, Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao
, Shao-Yi Chien:
Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders. IEEE Access 7: 60806-60813 (2019)
[j39]Valérie Renaudin
, Miguel Ortiz
, Johan Perul, Joaquín Torres-Sospedra
, Antonio Ramón Jiménez
, Antoni Pérez-Navarro
, Germán Martín Mendoza-Silva, Fernando Seco, Yael Landau, Revital Marbel
, Boaz Ben-Moshe, Xingyu Zheng, Feng Ye
, Jian Kuang, Yu Li, Xiaoji Niu, Vlad Landa, Shlomi Hacohen
, Nir Shvalb
, Chuanhua Lu
, Hideaki Uchiyama, Diego Thomas, Atsushi Shimada
, Rin-Ichiro Taniguchi, Zhenxing Ding, Feng Xu, Nikolai Kronenwett, Blagovest Vladimirov, Soyeon Lee, Eunyoung Cho, Sungwoo Jun, Chang-Eun Lee, Sangjoon Park, Yonghyun Lee, Jehyeok Rew, Changjun Park, Hyeongyo Jeong, Jaeseung Han, Keumryeol Lee, Wenchao Zhang
, Xianghong Li, Dongyan Wei, Ying Zhang, So Young Park, Chan Gook Park
, Stefan Knauth, Georgios Pipelidis, Nikolaos Tsiamitros, Tomás Lungenstrass, Juan Pablo Morales, Jens Trogh, David Plets, Miroslav Opiela
, Shih-Hau Fang, Yu Tsao
, Ying-Ren Chien
, Shi-Shen Yang, Shih-Jyun Ye, Muhammad Usman Ali, Soojung Hur, Yongwan Park:
Evaluating Indoor Positioning Systems in a Shopping Mall: The Lessons Learned From the IPIN 2018 Competition. IEEE Access 7: 148594-148628 (2019)
[j38]Yu Tsao
, Tzu-Hao Lin
, Fei Chen
, Yun-Fan Chang, Chui-Hsuan Cheng, Kun-Hsi Tsai:
Robust S1 and S2 heart sound recognition based on spectral restoration and multi-style training. Biomed. Signal Process. Control. 49: 173-180 (2019)
[j37]Jyun-Yi Wu, Cheng Yu, Szu-Wei Fu
, Chih-Ting Liu
, Shao-Yi Chien
, Yu Tsao
:
Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques. IEEE Signal Process. Lett. 26(12): 1887-1891 (2019)
[j36]Shan-Wen Hsiao, Hung-Ching Sun, Ming-Chuan Hsieh, Ming-Hsueh Tsai, Yu Tsao
, Chi-Chun Lee
:
Toward Automating Oral Presentation Scoring During Principal Certification Program Using Audio-Video Low-Level Behavior Profiles. IEEE Trans. Affect. Comput. 10(4): 552-567 (2019)
[j35]Chih-Ting Liu
, Tung-Wei Lin
, Yi-Heng Wu, Yu-Sheng Lin, Heng Lee, Yu Tsao
, Shao-Yi Chien
:
Computation-Performance Optimization of Convolutional Neural Networks With Redundant Filter Removal. IEEE Trans. Circuits Syst. I Regul. Pap. 66-I(5): 1908-1921 (2019)
[c140]Yu-Ting Lo, Syu-Siang Wang
, Yu Tsao
, Sheng-Yu Peng:
A Pruned-CELP Speech Codec Using Denoising Autoencoder with Spectral Compensation for Quality and Intelligibility Enhancement. AICAS 2019: 150-151
[c139]Fuqiang Ye, Yu Tsao
, Fei Chen:
Subjective Feedback-based Neural Network Pruning for Speech Enhancement. APSIPA 2019: 673-677
[c138]Tassadaq Hussain
, Yu Tsao
, Hsin-Min Wang
, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao
:
Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement. APSIPA 2019: 678-683
[c137]Wei-Cheng Lin, Yu Tsao
, Fei Chen, Hsin-Min Wang
:
Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement. APSIPA 2019: 1179-1184
[c136]Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing
, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
, Yu Tsao
, Hsin-Min Wang
:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. EUSIPCO 2019: 1-5
[c135]Tassadaq Hussain
, Yu Tsao
, Hsin-Min Wang
, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao
:
Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine. EUSIPCO 2019: 1-5
[c134]Yih-Liang Shen, Chao-Yuan Huang, Syu-Siang Wang
, Yu Tsao
, Hsin-Min Wang
, Tai-Shih Chi:
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition. ICASSP 2019: 6750-6754
[c133]Szu-Wei Fu, Chien-Feng Liao, Yu Tsao, Shou-De Lin:
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement. ICML 2019: 2031-2041
[c132]Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing
, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
, Yu Tsao
, Hsin-Min Wang
:
Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion. INTERSPEECH 2019: 709-713
[c131]Li-Wei Chen, Hung-yi Lee, Yu Tsao
:
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech. INTERSPEECH 2019: 719-723
[c130]Chen-Chou Lo
, Szu-Wei Fu, Wen-Chin Huang, Xin Wang
, Junichi Yamagishi, Yu Tsao
, Hsin-Min Wang
:
MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion. INTERSPEECH 2019: 1541-1545
[c129]Pin-Tuan Huang, Hung-Shin Lee, Syu-Siang Wang
, Kuan-Yu Chen, Yu Tsao
, Hsin-Min Wang
:
Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR. INTERSPEECH 2019: 1631-1635
[c128]Yu-Chen Lin, Yi-Te Hsu, Szu-Wei Fu, Yu Tsao
, Tei-Wei Kuo
:
IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network. INTERSPEECH 2019: 1801-1805
[c127]Chien-Feng Liao, Yu Tsao
, Xugang Lu, Hisashi Kawai:
Incorporating Symbolic Sequential Modeling for Speech Enhancement. INTERSPEECH 2019: 2733-2737
[c126]Chien-Feng Liao, Yu Tsao
, Hung-yi Lee, Hsin-Min Wang
:
Noise Adaptive Speech Enhancement Using Domain Adversarial Training. INTERSPEECH 2019: 3148-3152
[c125]Ryandhimas E. Zezario
, Szu-Wei Fu, Xugang Lu, Hsin-Min Wang
, Yu Tsao
:
Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric. INTERSPEECH 2019: 3168-3172
[c124]Fu-Kai Chuang, Syu-Siang Wang
, Jeih-weih Hung, Yu Tsao
, Shih-Hau Fang:
Speaker-Aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement. INTERSPEECH 2019: 3173-3177
[c123]Xugang Lu, Peng Shen, Sheng Li
, Yu Tsao
, Hisashi Kawai:
Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection. INTERSPEECH 2019: 3614-3618
[c122]Ryandhimas E. Zezario
, Join W. C. Sigalingging, Tassadaq Hussain
, Jia-Ching Wang, Yu Tsao
:
Comparative Study of Masking and Mapping Based on Hierarchical Extreme Learning Machine for Speech Enhancement. ISPACS 2019: 1-2
[c121]Tassadaq Hussain
, Yu Tsao
, Sabato Marco Siniscalchi, Jia-Ching Wang, Hsin-Min Wang
, Wen-Hung Liao
:
Bone-Conducted Speech Enhancement Using Hierarchical Extreme Learning Machine. IWSDS 2019: 153-162
[c120]Shintami Chusnul Hidayati, Kai-Lung Hua
, Yu Tsao
, Hong-Han Shuai, Jiaying Liu
, Wen-Huang Cheng:
Garment Detectives: Discovering Clothes and Its Genre in Consumer Photos. MIPR 2019: 471-474
[c119]Kuan-Yi Liu, Syu-Siang Wang, Yu Tsao, Jeih-Weih Hung:
Speech enhancement based on the integration of fully convolutional network, temporal lowpass filtering and spectrogram masking. ROCLING 2019: 226-240
[c118]Wen-Chin Huang, Yi-Chiao Wu, Kazuhiro Kobayashi, Yu-Huai Peng, Hsin-Te Hwang, Patrick Lumban Tobing, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion. SSW 2019: 57-62
[i35]Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
MOSNet: Deep Learning based Objective Assessment for Voice Conversion. CoRR abs/1904.08352 (2019)
[i34]Chien-Feng Liao, Yu Tsao, Xugang Lu, Hisashi Kawai:
Incorporating Symbolic Sequential Modeling for Speech Enhancement. CoRR abs/1904.13142 (2019)
[i33]Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion. CoRR abs/1905.00615 (2019)
[i32]Szu-Wei Fu, Chien-Feng Liao, Yu Tsao:
Learning with Learned Loss Function: Speech Enhancement with Quality-Net to Improve Perceptual Evaluation of Speech Quality. CoRR abs/1905.01898 (2019)
[i31]Szu-Wei Fu, Chien-Feng Liao, Yu Tsao, Shou-De Lin:
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement. CoRR abs/1905.04874 (2019)
[i30]Jyun-Yi Wu, Cheng Yu, Szu-Wei Fu, Chih-Ting Liu, Shao-Yi Chien, Yu Tsao:
Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques. CoRR abs/1906.01078 (2019)
[i29]Chang-Le Liu, Szu-Wei Fu, You-Jin Lee, Yu Tsao, Jen-Wei Huang, Hsin-Min Wang:
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks. CoRR abs/1909.11909 (2019)
[i28]Natalie Yu-Hsien Wang, Hsiao-Lan Sharon Wang, Taowei Wang, Szu-Wei Fu, Xugang Lu, Yu Tsao, Hsin-Min Wang:
Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement. CoRR abs/1909.11912 (2019)
[i27]Rung-Yu Tseng, Taowei Wang, Szu-Wei Fu, Yu Tsao, Chia-Ying Lee:
Seeing Voices in Noise: A Study of Audiovisual-Enhanced Vocoded Speech Intelligibility in Cochlear Implant Simulation. CoRR abs/1909.11919 (2019)
[i26]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019)
[i25]Syu-Siang Wang, Yu-You Liang, Jeih-weih Hung, Yu Tsao, Hsin-Min Wang, Shih-Hau Fang:
Distributed Microphone Speech Enhancement based on Deep Learning. CoRR abs/1911.08153 (2019)
[i24]Cheng Yu, Yan-Ting Lin, Kuo-Hsuan Hung, Syu-Siang Wang, Szu-Wei Fu, Yu Tsao, Jeih-weih Hung:
Time-Domain Multi-modal Bone/air Conducted Speech Enhancement. CoRR abs/1911.09847 (2019)
[i23]Chao-I Tuan, Yuan-Kuei Wu, Hung-yi Lee, Yu Tsao:
MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing. CoRR abs/1912.03884 (2019)
[i22]Yu-Tao Chang, Yuan-Hong Yang, Yu-Huai Peng, Syu-Siang Wang, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang:
MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation. CoRR abs/1912.11984 (2019)
[i21]Xugang Lu, Peng Shen, Sheng Li
, Yu Tsao, Hisashi Kawai:
Deep progressive multi-scale attention for acoustic event classification. CoRR abs/1912.12011 (2019)- 2018
[j34]Yu Tsao
, Hao-Chun Chu, Shih-Hau Fang
, Junghsi Lee, Chih-Min Lin:
Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller. IEEE Access 6: 37395-37402 (2018)
[j33]Hsin-Te Hwang, Yi-Chiao Wu, Syu-Siang Wang, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Locally Linear Embedding Based Post-Filtering for Speech Enhancement. J. Inf. Sci. Eng. 34(6): 1469-1491 (2018)
[j32]Hsin-Te Hwang, Yi-Chiao Wu, Yu-Huai Peng, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Voice Conversion Based on Locally Linear Embedding. J. Inf. Sci. Eng. 34(6): 1493-1516 (2018)
[j31]Joaquín Torres-Sospedra
, Antonio Ramón Jiménez
, Adriano J. C. Moreira
, Tomás Lungenstrass, Wei-Chung Lu, Stefan Knauth
, Germán M. Mendoza-Silva
, Fernando Seco, Antoni Pérez-Navarro
, Maria João Nicolau
, António Costa
, Filipe Meneses
, Joaquín Farina, Juan Pablo Morales, Wen-Chen Lu, Ho-Ti Cheng, Shi-Shen Yang, Shih-Hau Fang
, Ying-Ren Chien
, Yu Tsao
:
Off-Line Evaluation of Mobile-Centric Indoor Positioning Systems: The Experiences from the 2017 IPIN Competition. Sensors 18(2): 487 (2018)
[j30]Yu-Cheng Lin, Ying-Hui Lai, Hsiu-Wen Chang, Yu Tsao
, Yi-ping Chang
, Ronald Y. Chang
:
SmartHear: A Smartphone-Based Remote Microphone Hearing Assistive System Using Wireless Technologies. IEEE Syst. J. 12(1): 20-29 (2018)
[j29]Hung-Ping Liu
, Yu Tsao
, Chiou-Shann Fuh:
Bone-conducted speech enhancement using deep denoising autoencoder. Speech Commun. 104: 106-112 (2018)
[j28]Syu-Siang Wang
, Payton Lin
, Yu Tsao
, Jeih-Weih Hung, Borching Su
:
Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(3): 564-579 (2018)
[j27]Szu-Wei Fu
, Taowei Wang, Yu Tsao
, Xugang Lu, Hisashi Kawai:
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1570-1584 (2018)
[j26]Jen-Cheng Hou
, Syu-Siang Wang
, Ying-Hui Lai, Yu Tsao
, Hsiu-Wen Chang, Hsin-Min Wang
:
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks. IEEE Trans. Emerg. Top. Comput. Intell. 2(2): 117-128 (2018)
[c117]Ryandhimas E. Zezario
, Jen-Wei Huang, Xugang Lu, Yu Tsao
, Hsin-Te Hwang, Hsin-Min Wang
:
Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement. APSIPA 2018: 373-377
[c116]Ying-Hui Lai, Wei-Zhong Zheng, Shih-Tsang Tang, Shih-Hau Fang, Wen-Huei Liao, Yu Tsao
:
Improving the performance of hearing aids in noisy environments based on deep learning technology. EMBC 2018: 404-408
[c115]Lei Wang, Yu Tsao
, Fei Chen
:
Congruent Visual Stimulation Facilitates Auditory Frequency Change Detection: An ERP Study. EMBC 2018: 2446-2449
[c114]Neville Ryant, Elika Bergelson, Kenneth Church
, Alejandrina Cristià, Jun Du, Sriram Ganapathy, Sanjeev Khudanpur, Diana Kowalski, Mahesh Krishnamoorthy, Rajat Kulshreshta, Mark Y. Liberman
, Yu-Ding Lu, Matthew Maciejewski, Florian Metze, Ján Profant, Lei Sun, Yu Tsao
, Zhou Yu:
Enhancement and Analysis of Conversational Speech: JSALT 2017. ICASSP 2018: 5154-5158
[c113]Lei Sun, Jun Du, Tian Gao, Yu-Ding Lu, Yu Tsao
, Chin-Hui Lee, Neville Ryant:
A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions. ICASSP 2018: 5234-5238
[c112]Wei-Jen Lee, Syu-Siang Wang
, Fei Chen
, Xugang Lu, Shao-Yi Chien, Yu Tsao
:
Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm. ICASSP 2018: 5454-5458
[c111]Shang-Chih Lin, Yu Tsao
, Shun-Feng Su, Yennun Huang
:
An Industrial IoT Analysis System Based on Machining Data of Metal Materials. iFUZZY 2018: 225-230
[c110]Yu-Huai Peng, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Exemplar-Based Spectral Detail Compensation for Voice Conversion. INTERSPEECH 2018: 486-490
[c109]Xugang Lu, Peng Shen, Sheng Li
, Yu Tsao
, Hisashi Kawai:
Temporal Attentive Pooling for Acoustic Event Detection. INTERSPEECH 2018: 1354-1357
[c108]Szu-Wei Fu, Yu Tsao
, Hsin-Te Hwang, Hsin-Min Wang
:
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM. INTERSPEECH 2018: 1873-1877
[c107]Shih-Kuang Lee
, Syu-Siang Wang, Yu Tsao
, Jeih-weih Hung:
Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via DiscreteWavelet Transform. ISCSLP 2018: 16-20
[c106]Wen-Chin Huang, Hsin-Te Hwang, Yu-Huai Peng, Yu Tsao
, Hsin-Min Wang
:
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders. ISCSLP 2018: 51-55
[c105]Ji-Yan Han, Wei-Zhong Zheng, Ren-Jie Huang, Yu Tsao
, Ying-Hui Lai:
Hearing aids APP design based on deep learning technology. ISCSLP 2018: 495-496
[c104]Wen-Huei Liao, Pei-Chun Li, Shuenn-Tsong Young, Ying-Hui Lai, Yu Tsao
:
IOS-based Ear Scale application for Clinical Audiology and Otology Usage. ISCSLP 2018: 497-498
[c103]Yi-Ying Kao, Hsiang-Ping Hsu, Chien-Feng Liao, Yu Tsao
, Hao-Chun Yang
, Jeng-Lin Li, Chi-Chun Lee
, Hung-Shin Lee, Hsin-Min Wang
:
Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation. IWAENC 2018: 416-420
[c102]Wen-Chin Huang, Chen-Chou Lo, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang:
WaveNet 聲碼器及其於語音轉換之應用 (WaveNet Vocoder and its Applications in Voice Conversion) [In Chinese]. ROCLING 2018: 96-110
[c101]Bin-Syh Yu, Yu Tsao
, Shao-Wen Yang, Yen-Kuang Chen
, Shao-Yi Chien:
Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform. SiPS 2018: 88-93
[c100]Yi-Te Hsu, Yu-Chen Lin, Szu-Wei Fu, Yu Tsao
, Tei-Wei Kuo
:
A Study on Speech Enhancement Using Exponent-Only Floating Point Quantized Neural Network (EOFP-QNN). SLT 2018: 566-573
[i20]Wei-Jen Lee, Syu-Siang Wang, Fei Chen, Xugang Lu, Shao-Yi Chien, Yu Tsao:
Speech Dereverberation Based on Integrated Deep and Ensemble Learning. CoRR abs/1801.04052 (2018)
[i19]Chien-Feng Liao, Yu Tsao, Hung-yi Lee, Hsin-Min Wang:
Noise Adaptive Speech Enhancement using Domain Adversarial Training. CoRR abs/1807.07501 (2018)
[i18]Szu-Wei Fu, Yu Tsao, Hsin-Te Hwang, Hsin-Min Wang:
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. CoRR abs/1808.05344 (2018)
[i17]Yi-Te Hsu, Yu-Chen Lin, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN). CoRR abs/1808.06474 (2018)
[i16]Wen-Chin Huang, Hsin-Te Hwang, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang:
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders. CoRR abs/1808.09634 (2018)
[i15]Li-Wei Chen, Hung-yi Lee, Yu Tsao:
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech. CoRR abs/1810.12656 (2018)
[i14]Shih-Kuang Lee, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung:
Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform. CoRR abs/1811.03486 (2018)
[i13]Yih-Liang Shen, Chao-Yuan Huang, Syu-Siang Wang, Yu Tsao, Hsin-Min Wang, Tai-Shih Chi:
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition. CoRR abs/1811.04224 (2018)
[i12]Yi-Te Hsu, Zining Zhu, Chi-Te Wang, Shih-Hau Fang, Frank Rudzicz, Yu Tsao:
Robustness against the channel effect in pathological voice detection. CoRR abs/1811.10376 (2018)
[i11]Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. CoRR abs/1811.11078 (2018)- 2017
[j25]Alan Chern, Ying-Hui Lai, Yi-ping Chang
, Yu Tsao
, Ronald Y. Chang
, Hsiu-Wen Chang:
A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom. IEEE Access 5: 10339-10351 (2017)
[j24]Tassadaq Hussain
, Sabato Marco Siniscalchi, Chi-Chun Lee
, Syu-Siang Wang
, Yu Tsao
, Wen-Hung Liao
:
Experimental Study on Extreme Learning Machine Applications for Speech Enhancement. IEEE Access 5: 25542-25554 (2017)
[j23]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Regularization of neural network model with distance metric learning for i-vector based spoken language identification. Comput. Speech Lang. 44: 48-60 (2017)
[j22]Payton Lin, Dau-Cheng Lyu, Fei Chen
, Syu-Siang Wang
, Yu Tsao
:
Multi-style learning with denoising autoencoders for acoustic modeling in the internet of things (IoT). Comput. Speech Lang. 46: 481-495 (2017)
[j21]Jin Li-You, Yu Tsao, Ying-Ren Chien:
Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm. Int. J. Comput. Linguistics Chin. Lang. Process. 22(2) (2017)
[j20]Chia-Lung Wu, Hsiang-Ping Hsu, Yu-Ding Lu, Yu Tsao, Hung-Shin Lee, Hsin-Min Wang:
A Replay Spoofing Detection System Based on Discriminative Autoencoders. Int. J. Comput. Linguistics Chin. Lang. Process. 22(2) (2017)
[j19]Hung-yi Lee, Bo-Hsiang Tseng, Tsung-Hsien Wen, Yu Tsao
:
Personalizing Recurrent-Neural-Network-Based Language Model by Social Network. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 519-530 (2017)
[j18]Tien-En Chen
, Shih-I Yang, Li-Ting Ho, Kun-Hsi Tsai, Yu-Hsuan Chen, Yun-Fan Chang, Ying-Hui Lai, Syu-Siang Wang
, Yu Tsao
, Chau-Chung Wu:
S1 and S2 Heart Sound Recognition Using Deep Neural Networks. IEEE Trans. Biomed. Eng. 64(2): 372-380 (2017)
[j17]Ying-Hui Lai, Fei Chen
, Syu-Siang Wang
, Xugang Lu, Yu Tsao
, Chin-Hui Lee:
A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation. IEEE Trans. Biomed. Eng. 64(7): 1568-1578 (2017)
[j16]Szu-Wei Fu, Pei-Chun Li, Ying-Hui Lai
, Cheng-Chien Yang, Li-Chun Hsieh, Yu Tsao
:
Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery. IEEE Trans. Biomed. Eng. 64(11): 2584-2594 (2017)
[c99]Shih-Wei Lan, Yu Tsao
, Junghsi Lee:
Acoustic echo cancellation using deep cerebellar model articulation controller. ACSSC 2017: 808-811
[c98]Szu-Wei Fu, Yu Tsao
, Xugang Lu, Hisashi Kawai:
Raw waveform-based speech enhancement by fully convolutional networks. APSIPA 2017: 6-12
[c97]Yu-Huai Peng, Chin-Cheng Hsu, Yi-Chiao Wu, Hsin-Te Hwang, Yi-Wen Liu, Yu Tsao
, Hsin-Min Wang
:
Fast locally linear embedding algorithm for exemplar-based voice conversion. APSIPA 2017: 591-595
[c96]Syu-Siang Wang
, Yu Tsao
, Hsiao-Lan Sharon Wang, Ying-Hui Lai, Lieber Po-Hung Li:
A deep learning based noise reduction approach to improve speech intelligibility for cochlear implant recipients in the presence of competing speech noise. APSIPA 2017: 808-812
[c95]Chih-Wei Wu, Meng-Ting Zhong, Yu Tsao
, Shao-Wen Yang, Yen-Kuang Chen
, Shao-Yi Chien:
Track-Clustering Error Evaluation for Track-Based Multi-camera Tracking System Employing Human Re-identification. CVPR Workshops 2017: 1416-1424
[c94]Hung-Shin Lee, Yu-Ding Lu, Chin-Cheng Hsu, Yu Tsao
, Hsin-Min Wang
, Shyh-Kang Jeng:
Discriminative autoencoders for speaker verification. ICASSP 2017: 5375-5379
[c93]Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang
, Chin-Cheng Hsu, Ying-Hui Lai, Yu Tsao
, Hsin-Min Wang
:
A locally linear embbeding based postfiltering approach for speech enhancement. ICASSP 2017: 5555-5559
[c92]Chia-Lung Wu, Hsiang-Ping Hsu, Syu-Siang Wang
, Jeih-Weih Hung, Ying-Hui Lai, Hsin-Min Wang
, Yu Tsao
:
Wavelet Speech Enhancement Based on Robust Principal Component Analysis. INTERSPEECH 2017: 439-443
[c91]Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang
, Chin-Cheng Hsu, Yu Tsao
, Hsin-Min Wang
:
A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement. INTERSPEECH 2017: 1953-1957
[c90]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks. INTERSPEECH 2017: 3364-3368
[c89]Ming-Han Yang, Hung-Shin Lee, Yu-Ding Lu, Kuan-Yu Chen, Yu Tsao
, Berlin Chen, Hsin-Min Wang
:
Discriminative Autoencoders for Acoustic Modeling. INTERSPEECH 2017: 3557-3561
[c88]Shih-Ting Lin, Yuan-Hsin Liao, Yu Tsao
, Shao-Yi Chien:
Object-based on-line video summarization for internet of video things. ISCAS 2017: 1-4
[c87]Szu-Wei Fu, Ting-Yao Hu, Yu Tsao
, Xugang Lu:
Complex spectrogram enhancement by convolutional neural network with multi-metrics learning. MLSP 2017: 1-6
[c86]Shih-Kuang Lee, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung:
多樣訊雜比之訓練語料於降噪自動編碼器其語音強化功能之初步研究 (A Preliminary Study of Various SNR-level Training Data in the Denoising Auto-encoder (DAE) Technique for Speech Enhancement) [In Chinese]. ROCLING 2017: 101-113
[c85]Yu-Ding Lu, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese]. ROCLING 2017: 114-115
[c84]Jin Li-You, Yu Tsao, Ying-Ren Chien:
改進的向量空間可適性濾波器用於聲學回聲消除 (Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm) [In Chinese]. ROCLING 2017: 178-182
[c83]Chi-Te Wang, Feng-Chuan Lin, Wei-Zhong Zheng, Shih-Hau Fang, Yu Tsao, Ying-Hui Lai:
以語音能量特性發展即時語速偵測裝置-前導型研究 (Real-time monitoring device of phonation speed and volume based on speech energy: A pilot study) [In Chinese]. ROCLING 2017: 287-294
[c82]Taowei Wang, Yu Tsao, Ying-Hui Lai, Hsiang-Ping Hsu, Chia-Lung Wu:
以軟體為基礎建構語音增強系統使用者介面 (Development of a software-based User-Interface of Speech Enhancement System) [In Chinese]. ROCLING 2017: 323-331
[e2]Lun-Wei Ku, Yu Tsao:
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, ROCLING 2017, Taipei, Taiwan, November 27-28, 2017. The Association for Computational Linguistics and Chinese Language Processing (ACLCLP) 2017, ISBN 978-986-95769-0-1 [contents]
[i10]Szu-Wei Fu, Yu Tsao, Xugang Lu, Hisashi Kawai:
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks. CoRR abs/1703.02205 (2017)
[i9]Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Jen-Chun Lin, Yu Tsao, Hsiu-Wen Chang, Hsin-Min Wang:
Audio-Visual Speech Enhancement based on Multimodal Deep Convolutional Neural Network. CoRR abs/1703.10893 (2017)
[i8]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks. CoRR abs/1704.00849 (2017)
[i7]Szu-Wei Fu, Ting-Yao Hu, Yu Tsao, Xugang Lu:
Multi-Metrics Learning for Speech Enhancement. CoRR abs/1704.08504 (2017)
[i6]Yu Tsao, Hao-Chun Chu, Shih-Wei Lan, Shih-Hau Fang, Junghsi Lee, Chih-Min Lin:
Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller. CoRR abs/1705.00945 (2017)
[i5]Szu-Wei Fu, Yu Tsao, Xugang Lu, Hisashi Kawai:
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks. CoRR abs/1709.03658 (2017)- 2016
[j15]Payton Lin, Szu-Wei Fu, Syu-Siang Wang
, Ying-Hui Lai, Yu Tsao
:
Maximum Entropy Learning with Deep Belief Networks. Entropy 18(7): 251 (2016)
[j14]Shih-Hau Fang
, Hao-Hsiang Liao, Yu-Xiang Fei, Kai-Hsiang Chen, Jen-Wei Huang
, Yu-Ding Lu, Yu Tsao
:
Transportation Modes Classification Using Sensors on Smartphones. Sensors 16(8): 1324 (2016)
[j13]Yu Tsao
, Ying-Hui Lai:
Generalized maximum a posteriori spectral amplitude estimation for speech enhancement. Speech Commun. 76: 112-126 (2016)
[j12]Fei Chen
, Yu Tsao
, Ying-Hui Lai:
Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus. Speech Commun. 81: 120-128 (2016)
[j11]Syu-Siang Wang
, Alan Chern, Yu Tsao
, Jeih-weih Hung, Xugang Lu, Ying-Hui Lai, Borching Su
:
Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization. IEEE Signal Process. Lett. 23(8): 1101-1105 (2016)
[c81]Jen-Cheng Hou, Syu-Siang Wang
, Ying-Hui Lai, Jen-Chun Lin
, Yu Tsao
, Hsiu-Wen Chang, Hsin-Min Wang
:
Audio-visual speech enhancement using deep neural networks. APSIPA 2016: 1-6
[c80]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Voice conversion from non-parallel corpora using variational auto-encoder. APSIPA 2016: 1-6
[c79]Yueh-Ting Tsai, Borching Su, Yu Tsao
, Syu-Siang Wang
:
Adaptive subspace-constrained diagonal loading. APSIPA 2016: 1-4
[c78]Syu-Siang Wang, Yu Tsao
:
Temporal Modulation Spectral Restoration for Robust Speech Recognition. BigMM 2016: 481-486
[c77]Yi-Yen Hsieh
, Ching-Da Wu, Shey-Shi Lu, Yu Tsao
:
A linear regression model with dynamic pulse transit time features for noninvasive blood pressure prediction. BioCAS 2016: 604-607
[c76]Ting-Jia Wu, Shih-Hau Fang, Yong-Bin Wu, Cheng-Tse Wu, Jen-Wei Huang
, Yu Tsao
:
A study of mobile advertisement recommendation using real big data from AdLocus. GCCE 2016: 1-2
[c75]Yen-Teh Liu, Yu Tsao
, Ronald Y. Chang
:
Nonnegative matrix factorization-based frequency lowering technology for Mandarin-speaking hearing aid users. ICASSP 2016: 5905-5909
[c74]Syu-Siang Wang
, Jeremy Chiaming Yang, Yu Tsao
, Jeih-weih Hung:
Leveraging nonnegative matrix factorization in processing the temporal modulation spectrum for speech enhancement. ICCE-TW 2016: 1-2
[c73]Jeremy Chiaming Yang, Syu-Siang Wang
, Yu Tsao
, Jeih-Weih Hung:
Speech enhancement via ensemble modeling NMF adaptation. ICCE-TW 2016: 1-2
[c72]Yi-Chiao Wu, Hsin-Te Hwang, Chin-Cheng Hsu, Yu Tsao
, Hsin-Min Wang
:
Locally Linear Embedding for Exemplar-Based Spectral Conversion. INTERSPEECH 2016: 1652-1656
[c71]Hung-Shin Lee, Yu Tsao
, Chi-Chun Lee
, Hsin-Min Wang
, Wei-Cheng Lin, Wei-Chen Chen, Shan-Wen Hsiao, Shyh-Kang Jeng:
Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation. INTERSPEECH 2016: 2031-2035
[c70]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification. INTERSPEECH 2016: 3216-3220
[c69]Szu-Wei Fu, Yu Tsao
, Xugang Lu:
SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement. INTERSPEECH 2016: 3768-3772
[c68]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Dictionary update for NMF-based voice conversion using an encoder-decoder network. ISCSLP 2016: 1-5
[c67]Chia-Yung Hsu, Ryandhimas E. Zezario
, Jia-Ching Wang, Chin-Wen Ho, Xugang Lu, Yu Tsao
:
Incorporating local environment information with ensemble neural networks to robust automatic speech recognition. ISCSLP 2016: 1-5
[c66]Ying-Hui Lai, Syu-Siang Wang
, Yu-Ting Su, Cheng Han-Che, Fan Kang Fu, Yu Tsao
:
Improving the performance of speech perception in noisy environment based on an FAME strategy. ISCSLP 2016: 1-5
[c65]Xugang Lu, Peng Shen, Yu Tsao
, Hisashi Kawai:
A pseudo-task design in multi-task learning deep neural network for speaker recognition. ISCSLP 2016: 1-5
[c64]Shih-Yu Ku, Kai-Hsiang Chen, Jen-Wei Huang
, Yu Tsao
:
Image Retrieval Using Color-Aware Tag on Progressive Image Search and Recommendation System. MMM (2) 2016: 162-173
[e1]Chung-Hsien Wu, Yuen-Hsien Tseng, Hung-Yu Kao, Lun-Wei Ku, Yu Tsao, Shih-Hung Wu:
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, ROCLING 2016, National Cheng Kung University, Tainan, Taiwan, October 6-7, 2015. Association for Computational Linguistics and Chinese Language Processing (ACLCLP), Taiwan 2016, ISBN 978-957-30792-9-3 [contents]
[i4]Syu-Siang Wang, Alan Chern, Yu Tsao, Jeih-Weih Hung, Xugang Lu, Ying-Hui Lai, Borching Su:
Wavelet speech enhancement based on nonnegative matrix factorization. CoRR abs/1601.02309 (2016)
[i3]Yueh-Ting Tsai, Borching Su, Yu Tsao, Syu-Siang Wang:
Robust Beamforming Against DoA Mismatch Using Subspace-Constrained Diagonal Loading. CoRR abs/1602.02690 (2016)
[i2]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network. CoRR abs/1610.03988 (2016)
[i1]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder. CoRR abs/1610.04019 (2016)- 2015
[j10]Chung-Chien Hsu, Kah-Meng Cheong, Tai-Shih Chi, Yu Tsao
:
Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation. IEICE Trans. Inf. Syst. 98-D(10): 1808-1817 (2015)
[j9]Jin Li-You, Ying-Ren Chien
, Yu Tsao
:
Rapid Converging M-Max Partial Update Least Mean Square Algorithms with New Variable Step-Size Methods. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 98-A(12): 2650-2657 (2015)
[j8]Yu Tsao
, Payton Lin, Ting-Yao Hu, Xugang Lu:
Ensemble environment modeling using affine transform group. Speech Commun. 68: 55-68 (2015)
[j7]Yu Tsao
, Shih-Hau Fang, Yao Shiao:
Acoustic Echo Cancellation Using a Vector-Space-Based Adaptive Filtering Algorithm. IEEE Signal Process. Lett. 22(3): 351-355 (2015)
[j6]Shih-Hau Fang, Chu-Hsuan Wang, Yu Tsao
:
Compensating for Orientation Mismatch in Robust Wi-Fi Localization Using Histogram Equalization. IEEE Trans. Veh. Technol. 64(11): 5210-5220 (2015)
[c63]Syu-Siang Wang
, Hsin-Te Hwang, Ying-Hui Lai, Yu Tsao
, Xugang Lu, Hsin-Min Wang
, Borching Su:
Improving denoising auto-encoder based speech enhancement with the speech parameter generation algorithm. APSIPA 2015: 365-369
[c62]Hsin-Te Hwang, Yu Tsao
, Hsin-Min Wang
, Yih-Ru Wang, Sin-Horng Chen:
A probabilistic interpretation for artificial neural network-based voice conversion. APSIPA 2015: 552-558
[c61]Payton Lin, Dau-Cheng Lyu, Yun-Fan Chang, Yu Tsao
:
Temporal alignment for deep neural networks. GlobalSIP 2015: 108-112
[c60]Yen-Teh Liu, Ronald Y. Chang
, Yu Tsao
, Yi-ping Chang:
A new frequency lowering technique for Mandarin-speaking hearing aid users. GlobalSIP 2015: 722-726
[c59]Wei-Chen Chen, Po-Tsun Lai, Yu Tsao
, Chi-Chun Lee
:
Multimodal arousal rating using unsupervised fusion technique. ICASSP 2015: 5296-5300
[c58]Ying-Hui Lai, Syu-Siang Wang
, Pei-Chun Li, Yu Tsao
:
A discriminative post-filter for speech enhancement in hearing aids. ICASSP 2015: 5868-5872
[c57]Yen-Teh Liu, Yu Tsao
, Ronald Y. Chang
:
A deep neural network based approach to mandarin consonant/vowel separation. ICCE-TW 2015: 324-325
[c56]Payton Lin, Syu-Siang Wang
, Yu Tsao
:
Temporal information in tone recognition. ICCE-TW 2015: 326-327
[c55]Payton Lin, Dau-Cheng Lyu, Yun-Fan Chang, Yu Tsao:
Speech recognition with temporal neural networks. INTERSPEECH 2015: 21-25
[c54]Xugang Lu, Peng Shen, Yu Tsao, Chiori Hori, Hisashi Kawai:
Sparse representation with temporal max-smoothing for acoustic event detection. INTERSPEECH 2015: 1176-1180
[c53]Chia-Yung Hsu, Jia-Ching Wang, Yu Tsao:
類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識(Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese]. ROCLING 2015- 2014
[j5]Yu Tsao
, Xugang Lu, Paul R. Dixon, Ting-Yao Hu, Shigeki Matsuda, Chiori Hori:
Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation. Comput. Speech Lang. 28(3): 709-726 (2014)
[j4]Yu Tsao
, Ting-Yao Hu, Sakriani Sakti, Satoshi Nakamura, Lin-Shan Lee:
Variable Selection Linear Regression for Robust Speech Recognition. IEICE Trans. Inf. Syst. 97-D(6): 1477-1487 (2014)
[j3]Yu Tsao
, Shigeki Matsuda, Chiori Hori, Hideki Kashioka, Chin-Hui Lee:
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 22(2): 403-416 (2014)
[c52]Yun-Fan Chang, Payton Lin, Shao-Hua Cheng, Kai-Hsuan Chan, Yi-Chong Zeng, Chia-Wei Liao, Wen-Tsung Chang, Yu-Chiang Wang, Yu Tsao
:
Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system. APSIPA 2014: 1-4
[c51]Hao-Teng Fan, Jeih-weih Hung, Xugang Lu, Syu-Siang Wang
, Yu Tsao
:
Speech enhancement using segmental nonnegative matrix factorization. ICASSP 2014: 4483-4487
[c50]Xugang Lu, Yu Tsao
, Shigeki Matsuda, Chiori Hori:
Sparse representation based on a bag of spectral exemplars for acoustic event detection. ICASSP 2014: 6255-6259
[c49]How Jing, An-Chun Liang, Shou-De Lin
, Yu Tsao
:
A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering. ICDM 2014: 250-259
[c48]How Jing, Ting-Yao Hu, Hung-Shin Lee, Wei-Chen Chen, Chi-Chun Lee, Yu Tsao, Hsin-Min Wang:
Ensemble of machine learning algorithms for cognitive and physical speaker load detection. INTERSPEECH 2014: 447-451
[c47]Payton Lin, Fei Chen, Syu-Siang Wang
, Ying-Hui Lai, Yu Tsao:
Automatic speech recognition with primarily temporal envelope information. INTERSPEECH 2014: 476-480
[c46]Ying-Hui Lai, Fei Chen, Yu Tsao:
An adaptive envelope compression strategy for speech processing in cochlear implants. INTERSPEECH 2014: 481-484
[c45]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Ensemble modeling of denoising autoencoder for speech spectrum restoration. INTERSPEECH 2014: 885-889
[c44]Hung-Shin Lee, Yu Tsao, Hsin-Min Wang, Shyh-Kang Jeng:
Clustering-based i-vector formulation for speaker recognition. INTERSPEECH 2014: 1101-1105
[c43]Xugang Lu, Yu Tsao
, Peng Shen, Chiori Hori:
Spectral patch based sparse coding for acoustic event detection. ISCSLP 2014: 317-320
[c42]Syu-Siang Wang
, Payton Lin, Dau-Cheng Lyu, Yu Tsao
, Hsin-Te Hwang, Borching Su
:
Acoustic feature conversion using a polynomial based feature transferring algorithm. ISCSLP 2014: 454-458
[c41]Ying-Hui Lai, Fei Chen
, Yu Tsao
:
Effect of adaptive envelope compression in simulated electric hearing in reverberation. ISIC 2014: 204-207- 2013
[c40]Hsin-Te Hwang, Yu Tsao
, Hsin-Min Wang
, Yih-Ru Wang, Sin-Horng Chen:
Incorporating global variance in the training phase of GMM-based voice conversion. APSIPA 2013: 1-6
[c39]Chu-Hsuan Wang, Tai-Wei Kao, Shih-Hau Fang, Yu Tsao
, Lun-Chia Kuo, Kao Shih-Wei, Nien-Chen Lin:
Robust Wi-Fi location fingerprinting against device diversity based on spatial mean normalization. APSIPA 2013: 1-4
[c38]Syu-Siang Wang
, Yu Tsao
, Jeih-Weih Hung:
Filtering on the temporal probability sequence in histogram equalization for robust speech recognition. ICASSP 2013: 7112-7116
[c37]Yu-Cheng Su, Yu Tsao
, Jung-En Wu, Fu-Rong Jean:
Speech enhancement using generalized maximum a posteriori spectral amplitude estimator. ICASSP 2013: 7467-7471
[c36]How Jing, Yu Tsao, Kuan-Yu Chen, Hsin-Min Wang:
Semantic Naïve Bayes Classifier for Document Classification. IJCNLP 2013: 1117-1123
[c35]How Jing, Yu Tsao
:
Sparse maximum entropy deep belief nets. IJCNN 2013: 1-6
[c34]Hung-yi Lee, Ting-Yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao, Tsang-Long Pao:
Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition. INTERSPEECH 2013: 215-219
[c33]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Speech enhancement based on deep denoising autoencoder. INTERSPEECH 2013: 436-440
[c32]Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao, Lin-Shan Lee:
Recurrent neural network based language model personalization by social network crowdsourcing. INTERSPEECH 2013: 2703-2707
[c31]Bo Li, Yu Tsao, Khe Chai Sim:
An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition. INTERSPEECH 2013: 3002-3006
[c30]Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training. INTERSPEECH 2013: 3062-3066
[c29]Ying-Hui Lai, Yu-Cheng Su, Yu Tsao
, Shuenn-Tsong Young:
Evaluation of generalized maximum a posteriori spectral amplitude (GMAPA) speech enhancement algorithm in hearing aids. ISCE 2013: 245-246
[c28]Yun-Fan Chang, Yu Tsao, Shao-Hua Cheng, Kai-Hsuan Chan, Chia-Wei Liao, Wen-Tsung Chang:
結合I-Vector 及深層神經網路之語者驗證系統 (Text-independent Speaker Verification using a Hybrid I-Vector/DNN Approach) [In Chinese]. ROCLING 2013- 2012
[c27]Yu Tsao
, Chien-Lin Huang, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
A linear projection approach to environment modeling for robust speech recognition. ICASSP 2012: 4329-4332
[c26]Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
A Study of Mutual Information for GMM-Based Spectral Conversion. INTERSPEECH 2012: 78-81
[c25]Ting-Yao Hu, Yu Tsao, Lin-Shan Lee:
Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation. INTERSPEECH 2012: 567-570
[c24]Hsin-Te Hwang, Yu Tsao
, Hsin-Min Wang
, Yih-Ru Wang, Sin-Horng Chen:
Exploring mutual information for GMM-based spectral conversion. ISCSLP 2012: 50-54
[c23]Syu-Siang Wang
, Jeih-Weih Hung, Yu Tsao
:
A study on cepstral sub-band normalization for robust ASR. ISCSLP 2012: 141-145
[c22]Xugang Lu, Yu Tsao
, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Acoustic space partition based on broad phonetic class for ensemble acoustic modeling. ISCSLP 2012: 311-314- 2011
[c21]Yu Tsao
, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation. ICASSP 2011: 5320-5323
[c20]Yu Tsao
, Shigeki Matsuda, Shinsuke Sakai, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
A sampling-based environment population projection approach for rapid acoustic model adaptation. ICASSP 2011: 5504-5507
[c19]Yu Tsao, Paul R. Dixon, Chiori Hori, Hisashi Kawai:
Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition. INTERSPEECH 2011: 2585-2588- 2010
[c18]Yu Tsao
, Hanwu Sun, Haizhou Li
, Chin-Hui Lee:
An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition. ICASSP 2010: 4422-4425
[c17]Jinyu Li, Yu Tsao, Chin-Hui Lee:
Shrinkage model adaptation in automatic speech recognition. INTERSPEECH 2010: 1656-1659
[c16]Aleem Mushtaq, Yu Tsao, Chin-Hui Lee:
A particle filter feature compensation approach to robust speech recognition. INTERSPEECH 2010: 2054-2057
[c15]Yu Tsao
, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
An environment structuring framework to facilitating suitable prior density estimation for MAPLR on robust speech recognition. ISCSLP 2010: 29-32
2000 – 2009
- 2009
[j2]Yu Tsao
, Chin-Hui Lee:
An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition. IEEE Trans. Speech Audio Process. 17(5): 1025-1037 (2009)
[c14]Yu Tsao
, Shigeki Matsuda, Satoshi Nakamura, Chin-Hui Lee:
MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling. ASRU 2009: 271-275
[c13]Yu Tsao
, Jinyu Li
, Chin-Hui Lee:
Ensemble speaker and speaking environment modeling approach with advanced online estimation process. ICASSP 2009: 3833-3836
[c12]Shigeki Matsuda, Yu Tsao, Jinyu Li, Satoshi Nakamura, Chin-Hui Lee:
A study on soft margin estimation of linear regression parameters for speaker adaptation. INTERSPEECH 2009: 1603-1606
[c11]Yu Tsao
, Jinyu Li
, Chin-Hui Lee, Satoshi Nakamura:
Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling. IUCS 2009: 404-408- 2008
[b1]Yu Tsao:
An ensemble speaker and speaking environment modeling approach to robust speech recognition. Georgia Institute of Technology, Atlanta, GA, USA, 2008
[c10]Sheng-Yu Peng, Yu Tsao
, Paul E. Hasler, David V. Anderson:
A programmable analog radial-basis-function based classifier. ICASSP 2008: 1425-1428
[c9]


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID