Yu Tsao 0001

Person information

affiliation: Academia Sinica, Research Center for Information Technology Innovation, Taipei, Taiwan

2020 – today

2024
[j74]
  - https://dblp.org/rec/journals/jssc/PengLWLCLCLHT24
Sheng-Yu Peng, I-Chun Liu, Yi-Heng Wu, Ting-Ju Lin, Chun-Jui Chen, Xiu-Zhu Li, Yong-Qi Cheng, Pin-Han Lin, Kuo-Hsuan Hung, Yu Tsao:
An SRAM-Based Reconfigurable Cognitive Computation Matrix for Sensor Edge Applications. IEEE J. Solid State Circuits 59(2): 636-648 (2024)
[j73]
  - https://dblp.org/rec/journals/tamd/HuangCTW24
Enoch Hsin-Ho Huang, Rong Chao, Yu Tsao, Chao-Min Wu:
ElectrodeNet - A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants. IEEE Trans. Cogn. Dev. Syst. 16(1): 346-357 (2024)
[j72]
  - https://dblp.org/rec/journals/taslp/WangCBFT24
Syu-Siang Wang, Jia-Yang Chen, Bo-Ren Bai, Shih-Hau Fang, Yu Tsao:
Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3826-3837 (2024)
[c235]
  - https://dblp.org/rec/conf/aicas/LiuCLCHLPP024
I-Chun Liu, Chun-Jui Chen, Xiu-Zhu Li, Yong-Qi Cheng, Chung-Wei Huang, Pin-Han Lin, Hsuan-Wei Pu, Sheng-Yu Peng, Yu Tsao:
The Multilayer Neural Network Implementation Using SRAM-Based Reconfigurable Cognitive Computation Matrices. AICAS 2024: 467-471
[c234]
  - https://dblp.org/rec/conf/icassp/ZezarioBFW024
Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model. ICASSP 2024: 831-835
[c233]
  - https://dblp.org/rec/conf/icassp/LiuWLPT24
Yu-Tung Liu, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
SDEMG: Score-Based Diffusion Model for Surface Electromyographic Signal Denoising. ICASSP 2024: 1736-1740
[c232]
  - https://dblp.org/rec/conf/icassp/WuK0L24
Haibin Wu, Heng-Cheng Kuo, Yu Tsao, Hung-Yi Lee:
Scalable Ensemble-Based Detection Method Against Adversarial Attacks For Speaker Verification. ICASSP 2024: 4670-4674
[c231]
  - https://dblp.org/rec/conf/icassp/TsengBCCLLPSWW024
Yuan Tseng, Layne Berry, Yiting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Poyao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Abdelrahman Mohamed, Chi-Luen Feng, Hung-Yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. ICASSP 2024: 6890-6894
[c230]
  - https://dblp.org/rec/conf/icassp/LuS0K24
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-Based ASR. ICASSP 2024: 13116-13120
[c229]
  - https://dblp.org/rec/conf/iccel/LinTCTT24
Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao:
Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice. ICCE 2024: 1-6
[c228]
  - https://dblp.org/rec/conf/iclr/FuH0W24
Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Yu-Chiang Frank Wang:
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech. ICLR 2024
[i151]
  - https://dblp.org/rec/journals/corr/abs-2401-01145
Dyah A. M. G. Wisnu, Epri W. Pratiwi, Stefano Rini, Ryandhimas E. Zezario, Hsin-Min Wang, Yu Tsao:
HAAQI-Net: A non-intrusive neural music quality assessment model for hearing aids. CoRR abs/2401.01145 (2024)
[i150]
  - https://dblp.org/rec/journals/corr/abs-2402-03808
Yu-Tung Liu, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising. CoRR abs/2402.03808 (2024)
[i149]
  - https://dblp.org/rec/journals/corr/abs-2402-05482
Cho-Yuan Lee, Kuan-Chen Wang, Kai-Chun Liu, Xugang Lu, Ping-Cheng Yeh, Yu Tsao:
A Non-Intrusive Neural Quality Assessment Model for Surface Electromyography Signals. CoRR abs/2402.05482 (2024)
[i148]
  - https://dblp.org/rec/journals/corr/abs-2402-16321
Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Yu-Chiang Frank Wang:
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech. CoRR abs/2402.16321 (2024)
[i147]
  - https://dblp.org/rec/journals/corr/abs-2402-16394
Tassadaq Hussain, Kia Dashtipour, Yu Tsao, Amir Hussain:
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues. CoRR abs/2402.16394 (2024)
[i146]
  - https://dblp.org/rec/journals/corr/abs-2402-16757
Jasper Kirton-Wingate, Shafique Ahmed, Adeel Hussain, Mandar Gogate, Kia Dashtipour, Jen-Cheng Hou, Tassadaq Hussain, Yu Tsao, Amir Hussain:
Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids. CoRR abs/2402.16757 (2024)
[i145]
  - https://dblp.org/rec/journals/corr/abs-2405-04097
Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang:
Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes. CoRR abs/2405.04097 (2024)
[i144]
  - https://dblp.org/rec/journals/corr/abs-2405-06573
Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao:
An Investigation of Incorporating Mamba for Speech Enhancement. CoRR abs/2405.06573 (2024)
[i143]
  - https://dblp.org/rec/journals/corr/abs-2405-08342
Whenty Ariyanti, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao:
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer. CoRR abs/2405.08342 (2024)
[i142]
  - https://dblp.org/rec/journals/corr/abs-2406-08445
Chun Yin, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang:
SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models. CoRR abs/2406.08445 (2024)
[i141]
  - https://dblp.org/rec/journals/corr/abs-2406-12699
Kuan-Chen Wang, You-Jin Li, Wei-Lun Chen, Yu-Wen Chen, Yi-Ching Wang, Ping-Cheng Yeh, Chao Zhang, Yu Tsao:
Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition. CoRR abs/2406.12699 (2024)
[i140]
  - https://dblp.org/rec/journals/corr/abs-2407-15458
Wenze Ren, Yi-Cheng Lin, Huang-Cheng Chou, Haibin Wu, Yi-Chiao Wu, Chi-Chun Lee, Hung-yi Lee, Yu Tsao:
EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations. CoRR abs/2407.15458 (2024)
[i139]
  - https://dblp.org/rec/journals/corr/abs-2408-04773
Muhammad Salman Khan, Moreno La Quatra, Kuo-Hsuan Hung, Szu-Wei Fu, Sabato Marco Siniscalchi, Yu Tsao:
Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement. CoRR abs/2408.04773 (2024)
2023
[j71]
  - https://dblp.org/rec/journals/bspc/ChenT23
Fei Chen, Yu Tsao:
Advances in biomedical signal processing for communication disorders. Biomed. Signal Process. Control. 80(Part): 104346 (2023)
[j70]
  - https://dblp.org/rec/journals/jossw/LuCLZCNMYSWTQW23
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing. J. Open Source Softw. 8(91): 5403 (2023)
[j69]
  - https://dblp.org/rec/journals/spl/ChengLTW23
Chin-Yi Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization. IEEE Signal Process. Lett. 30: 638-642 (2023)
[j68]
  - https://dblp.org/rec/journals/taslp/ZezarioFCFWT23
Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features. IEEE ACM Trans. Audio Speech Lang. Process. 31: 54-70 (2023)
[j67]
  - https://dblp.org/rec/journals/taslp/LuCYLHWT23
Yen-Ju Lu, Chia-Yu Chang, Cheng Yu, Ching-Feng Liu, Jeih-weih Hung, Shinji Watanabe, Yu Tsao:
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2738-2750 (2023)
[j66]
  - https://dblp.org/rec/journals/tbe/KuoHTWFT23
Heng-Cheng Kuo, Yu-Peng Hsieh, Huan-Hsin Tseng, Chi-Te Wang, Shih-Hau Fang, Yu Tsao:
Toward Real-World Voice Disorder Classification. IEEE Trans. Biomed. Eng. 70(10): 2922-2932 (2023)
[j65]
  - https://dblp.org/rec/journals/tce/ChenTTLCHLST23
Tsai-Min Chen, Yuan-Hong Tsai, Huan-Hsin Tseng, Kai-Chun Liu, Jhih-Yu Chen, Chih-Han Huang, Guo-Yuan Li, Chun-Yen Shen, Yu Tsao:
SRECG: ECG Signal Super-Resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification. IEEE Trans. Consumer Electron. 69(3): 250-260 (2023)
[c227]
  - https://dblp.org/rec/conf/asru/ChiangHFKTT23
Hsin-Tien Chiang, Kuo-Hsuan Hung, Szu-Wei Fu, Heng-Cheng Kuo, Ming-Hsueh Tsai, Yu Tsao:
Study on the Correlation Between Objective Evaluations and Subjective Speech Quality and Intelligibility. ASRU 2023: 1-7
[c226]
  - https://dblp.org/rec/conf/asru/CooperHTWTY23
Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi:
The VoiceMOS Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains. ASRU 2023: 1-7
[c225]
  - https://dblp.org/rec/conf/asru/LeeCCWLT23
Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao:
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models. ASRU 2023: 1-8
[c224]
  - https://dblp.org/rec/conf/asru/LuSTK23
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Cross-Modal Alignment With Optimal Transport For CTC-Based ASR. ASRU 2023: 1-7
[c223]
  - https://dblp.org/rec/conf/embc/AriyantiLC023
Whenty Ariyanti, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao:
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer. EMBC 2023: 1-4
[c222]
  - https://dblp.org/rec/conf/embc/ChuLHC0C23
En-Ping Chu, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao, Chia-Tai Chan:
Multi-Task Learning U-Net for Functional Shoulder Sub-Task Segmentation. EMBC 2023: 1-5
[c221]
  - https://dblp.org/rec/conf/icassp/ChernHCHGHTH23
I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain, Yu Tsao, Jen-Cheng Hou:
Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings. ICASSP Workshops 2023: 1-5
[c220]
  - https://dblp.org/rec/conf/icassp/ChiLHTC23
Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao, Chia-Tai Chan:
PreFallKD: Pre-Impact Fall Detection Via CNN-ViT Knowledge Distillation. ICASSP 2023: 1-5
[c219]
  - https://dblp.org/rec/conf/icassp/HsuCLT23
Chan-Jan Hsu, Ho-Lam Chung, Hung-Yi Lee, Yu Tsao:
T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5. ICASSP 2023: 1-5
[c218]
  - https://dblp.org/rec/conf/icassp/KirtonWingateAGTH23
Jasper Kirton-Wingate, Shafique Ahmed, Mandar Gogate, Yu Tsao, Amir Hussain:
Towards Individualised Speech Enhancement: An SNR Preference Learning System for Multi-Modal Hearing Aids. ICASSP Workshops 2023: 1-5
[c217]
  - https://dblp.org/rec/conf/icassp/LinTT23
Hsin-Yi Lin, Huan-Hsin Tseng, Yu Tsao:
On the Robustness of Non-Intrusive Speech Quality Model by Adversarial Examples. ICASSP 2023: 1-5
[c216]
  - https://dblp.org/rec/conf/icassp/WangLPT23
Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks. ICASSP 2023: 1-5
[c215]
  - https://dblp.org/rec/conf/iclr/Lee0WC23
Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen:
D4AM: A General Denoising Framework for Downstream Acoustic Models. ICLR 2023
[c214]
  - https://dblp.org/rec/conf/iclr/TsengLH023
Huan-Hsin Tseng, Hsin-Yi Lin, Kuo-Hsuan Hung, Yu Tsao:
Interpretations of Domain Adaptations via Layer Variational Analysis. ICLR 2023
[c213]
  - https://dblp.org/rec/conf/interspeech/ChenCL0W23
Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech. INTERSPEECH 2023: 2473-2477
[c212]
  - https://dblp.org/rec/conf/interspeech/YenKYHSC023
Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao:
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition. INTERSPEECH 2023: 3317-3321
[c211]
  - https://dblp.org/rec/conf/interspeech/0006CYTCW023
Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Tai-Shih Chi, Hsin-Min Wang, Yu Tsao:
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. INTERSPEECH 2023: 5018-5022
[c210]
  - https://dblp.org/rec/conf/interspeech/Chien0YTW0C23
Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi:
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion. INTERSPEECH 2023: 5023-5026
[c209]
  - https://dblp.org/rec/conf/memea/LiuLCHLCT23
Chien-Pin Liu, Ju-Hsuan Li, En-Ping Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao:
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks. MeMeA 2023: 1-5
[c208]
  - https://dblp.org/rec/conf/mlsp/ChernCKTHT23
I-Chun Chern, Steffi Chern, Heng-Cheng Kuo, Huan-Hsin Tseng, Kuo-Hsuan Hung, Yu Tsao:
Voice Direction-Of-Arrival Conversion. MLSP 2023: 1-6
[c207]
  - https://dblp.org/rec/conf/mlsp/HsiehYCST23
Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao:
Inference and Denoise: Causal Inference-Based Neural Speech Enhancement. MLSP 2023: 1-6
[c206]
  - https://dblp.org/rec/conf/mlsp/TingWTS23
Wen-Yuan Ting, Syu-Siang Wang, Yu Tsao, Borching Su:
IANS: Intelligibility-Aware Null-Steering Beamforming for Dual-Microphone Arrays. MLSP 2023: 1-6
[c205]
  - https://dblp.org/rec/conf/ner/ChenLLCCT23
Chih-Hsing Chen, Kai-Chun Liu, Ting-Yang Lu, Chih-Ya Chang, Chia-Tai Chan, Yu Tsao:
Wearable-based Pain Assessment in Patients with Adhesive Capsulitis Using Machine Learning. NER 2023: 1-4
[d2]
  - https://dblp.org/rec/data/10/ChienCPHTT23
Ying-Ren Chien, Po-Heng Chou, You-Jie Peng, Chun-Yuan Huang, Hen-Wai Tsao, Yu Tsao:
Cyclostationary Impulse Noise Dataset. IEEE DataPort, 2023
[d1]
  - https://dblp.org/rec/data/10/LuCLZCNMYSWTQW23
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing (espnet-v.202310). Zenodo, 2023
[i138]
  - https://dblp.org/rec/journals/corr/abs-2301-04120
Yu-Wen Chen, Hsin-Min Wang, Yu Tsao:
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm. CoRR abs/2301.04120 (2023)
[i137]
  - https://dblp.org/rec/journals/corr/abs-2302-01798
Huan-Hsin Tseng, Hsin-Yi Lin, Kuo-Hsuan Hung, Yu Tsao:
Interpretations of Domain Adaptations via Layer Variational Analysis. CoRR abs/2302.01798 (2023)
[i136]
  - https://dblp.org/rec/journals/corr/abs-2303-03634
Tin-Han Chi, Kai-Chun Liu, Chia-Yeh Hsieh, Yu Tsao, Chia-Tai Chan:
PreFallKD: Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation. CoRR abs/2303.03634 (2023)
[i135]
  - https://dblp.org/rec/journals/corr/abs-2303-06980
Li-Chin Chen, Kuo-Hsuan Hung, Yi-Ju Tseng, Hsin-Yao Wang, Tse-Min Lu, Wei-Chieh Huang, Yu Tsao:
Self-supervised based general laboratory progress pretrained model for cardiovascular event detection. CoRR abs/2303.06980 (2023)
[i134]
  - https://dblp.org/rec/journals/corr/abs-2303-09085
Li-Chin Chen, Jung-Nien Lai, Hung-En Lin, Hsien-Te Chen, Kuo-Hsuan Hung, Yu Tsao:
Preoperative Prognosis Assessment of Lumbar Spinal Surgery for Low Back Pain and Sciatica Patients based on Multimodalities and Multimodal Learning. CoRR abs/2303.09085 (2023)
[i133]
  - https://dblp.org/rec/journals/corr/abs-2304-06335
Chien-Pin Liu, Ju-Hsuan Li, En-Ping Chu, Chia-Yeh Hsieh, Kai-Chun Liu, Chia-Tai Chan, Yu Tsao:
Deep Learning-based Fall Detection Algorithm Using Ensemble Model of Coarse-fine CNN and GRU Networks. CoRR abs/2304.06335 (2023)
[i132]
  - https://dblp.org/rec/journals/corr/abs-2305-16753
Enoch Hsin-Ho Huang, Rong Chao, Yu Tsao, Chao-Min Wu:
ElectrodeNet - A Deep Learning Based Sound Coding Strategy for Cochlear Implants. CoRR abs/2305.16753 (2023)
[i131]
  - https://dblp.org/rec/journals/corr/abs-2306-06652
Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi:
Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion. CoRR abs/2306.06652 (2023)
[i130]
  - https://dblp.org/rec/journals/corr/abs-2306-06653
Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Yu Tsao, Tai-Shih Chi, Hsin-Min Wang:
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. CoRR abs/2306.06653 (2023)
[i129]
  - https://dblp.org/rec/journals/corr/abs-2306-06865
Li-Chin Chen, Yi-Heng Lin, Li-Ning Peng, Feng-Ming Wang, Yu-Hsin Chen, Po-Hsun Huang, Shang-Feng Yang, Yu Tsao:
Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula. CoRR abs/2306.06865 (2023)
[i128]
  - https://dblp.org/rec/journals/corr/abs-2308-09262
Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model. CoRR abs/2308.09262 (2023)
[i127]
  - https://dblp.org/rec/journals/corr/abs-2309-01164
Yu-Wen Chen, Julia Hirschberg, Yu Tsao:
Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement. CoRR abs/2309.01164 (2023)
[i126]
  - https://dblp.org/rec/journals/corr/abs-2309-09548
Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. CoRR abs/2309.09548 (2023)
[i125]
  - https://dblp.org/rec/journals/corr/abs-2309-10787
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. CoRR abs/2309.10787 (2023)
[i124]
  - https://dblp.org/rec/journals/corr/abs-2309-11059
Shafique Ahmed, Chia-Wei Chen, Wenze Ren, Chin-Jou Li, Ernie Chu, Jun-Cheng Chen, Amir Hussain, Hsin-Min Wang, Yu Tsao, Jen-Cheng Hou:
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement. CoRR abs/2309.11059 (2023)
[i123]
  - https://dblp.org/rec/journals/corr/abs-2309-12766
Ryandhimas E. Zezario, Yu-Wen Chen, Szu-Wei Fu, Yu Tsao, Hsin-Min Wang, Chiou-Shann Fuh:
A Study on Incorporating Whisper for Robust Speech Assessment. CoRR abs/2309.12766 (2023)
[i122]
  - https://dblp.org/rec/journals/corr/abs-2309-13650
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Cross-modal Alignment with Optimal Transport for CTC-based ASR. CoRR abs/2309.13650 (2023)
[i121]
dblp key: journals/corr/abs-2309-16093
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-16093
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR. CoRR abs/2309.16093 (2023)
[i120]
dblp key: journals/corr/abs-2310-13103
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-13103
Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang:
AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection. CoRR abs/2310.13103 (2023)
[i119]
dblp key: journals/corr/abs-2310-13471
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-13471
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Neural domain alignment for spoken language recognition based on optimal transport. CoRR abs/2310.13471 (2023)
[i118]
dblp key: journals/corr/abs-2311-02733
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-02733
Sahibzada Adil Shahzad, Ammarah Hashmi, Yan-Tsung Peng, Yu Tsao, Hsin-Min Wang:
AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection. CoRR abs/2311.02733 (2023)
[i117]
dblp key: journals/corr/abs-2311-08878
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-08878
Hsin-Tien Chiang, Szu-Wei Fu, Hsin-Min Wang, Yu Tsao, John H. L. Hansen:
Multi-objective Non-intrusive Hearing-aid Speech Assessment Model. CoRR abs/2311.08878 (2023)
[i116]
dblp key: journals/corr/abs-2311-15582
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-15582
Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao:
Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice. CoRR abs/2311.15582 (2023)
[i115]
dblp key: journals/corr/abs-2311-16595
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-16595
Chi-Chang Lee, Yu Tsao, Hsin-Min Wang, Chu-Song Chen:
D4AM: A General Denoising Framework for Downstream Acoustic Models. CoRR abs/2311.16595 (2023)
[i114]
dblp key: journals/corr/abs-2311-16604
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-16604
Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao:
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models. CoRR abs/2311.16604 (2023)
[i113]
dblp key: journals/corr/abs-2312-08622
persistent URL: https://dblp.org/rec/journals/corr/abs-2312-08622
Haibin Wu, Heng-Cheng Kuo, Yu Tsao, Hung-yi Lee:
Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification. CoRR abs/2312.08622 (2023)
2022
[j64]
dblp key: journals/access/ChenHLKLLFWT22
persistent URL: https://dblp.org/rec/journals/access/ChenHLKLLFWT22
Yu-Wen Chen, Kuo-Hsuan Hung, You-Jin Li, Alexander Chao-Fu Kang, Ya-Hsin Lai, Kai-Chun Liu, Szu-Wei Fu, Syu-Siang Wang, Yu Tsao:
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application. IEEE Access 10: 46082-46099 (2022)
[j63]
dblp key: journals/neuroimage/LinTH22
persistent URL: https://dblp.org/rec/journals/neuroimage/LinTH22
Yi Lin, Yu Tsao, Po-Jang Hsieh:
Neural correlates of individual differences in predicting ambiguous sounds comprehension level. NeuroImage 251: 119012 (2022)
[j62]
dblp key: journals/spl/HuPYTW22
persistent URL: https://dblp.org/rec/journals/spl/HuPYTW22
Cheng-Hung Hu, Yu-Huai Peng, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
SVSNet: An End-to-End Speaker Voice Similarity Assessment Model. IEEE Signal Process. Lett. 29: 767-771 (2022)
[j61]
dblp key: journals/spl/ChenCTT22
persistent URL: https://dblp.org/rec/journals/spl/ChenCTT22
Lichin Chen, Po-Hsun Chen, Richard Tzong-Han Tsai, Yu Tsao:
EPG2S: Speech Generation and Speech Enhancement Based on Electropalatography and Audio Signals Using Multimodal Learning. IEEE Signal Process. Lett. 29: 2582-2586 (2022)
[j60]
dblp key: journals/tai/HussainWGDTLAH22
persistent URL: https://dblp.org/rec/journals/tai/HussainWGDTLAH22
Tassadaq Hussain, Wei-Chien Wang, Mandar Gogate, Kia Dashtipour, Yu Tsao, Xugang Lu, Ahsan Adeel, Amir Hussain:
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement. IEEE Trans. Artif. Intell. 3(5): 833-842 (2022)
[j59]
dblp key: journals/tamd/LiuHHHCT22
persistent URL: https://dblp.org/rec/journals/tamd/LiuHHHCT22
Kai-Chun Liu, Kuo-Hsuan Hung, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao:
Deep-Learning-Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems. IEEE Trans. Cogn. Dev. Syst. 14(3): 1270-1281 (2022)
[j58]
dblp key: journals/taslp/LinYHFTK22
persistent URL: https://dblp.org/rec/journals/taslp/LinYHFTK22
Yu-Chen Lin, Cheng Yu, Yi-Te Hsu, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1016-1031 (2022)
[j57]
dblp key: journals/taslp/ChuangWT22
persistent URL: https://dblp.org/rec/journals/taslp/ChuangWT22
Shang-Yi Chuang, Hsin-Min Wang, Yu Tsao:
Improved Lite Audio-Visual Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1345-1359 (2022)
[c204]
dblp key: conf/acl/HsuL022
persistent URL: https://dblp.org/rec/conf/acl/HsuL022
Chan-Jan Hsu, Hung-yi Lee, Yu Tsao:
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding. ACL (2) 2022: 479-489
[c203]
dblp key: conf/embc/WangTZYLFL22
persistent URL: https://dblp.org/rec/conf/embc/WangTZYLFL22
Syu-Siang Wang, Yu Tsao, Wei-Zhong Zheng, Hsiu-Wei Yeh, Pei-Chun Li, Shih-Hau Fang, Ying-Hui Lai:
Dysarthric Speech Enhancement Based on Convolution Neural Network. EMBC 2022: 60-64
[c202]
dblp key: conf/embc/HussainDGDATH22
persistent URL: https://dblp.org/rec/conf/embc/HussainDGDATH22
Tassadaq Hussain, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao, Amir Hussain:
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning. EMBC 2022: 2581-2584
[c201]
dblp key: conf/eusipco/FengTC22
persistent URL: https://dblp.org/rec/conf/eusipco/FengTC22
Zicheng Feng, Yu Tsao, Fei Chen:
Recurrent Neural Network-based Estimation and Correction of Relative Transfer Function for Preserving Spatial Cues in Speech Separation. EUSIPCO 2022: 155-159
[c200]
dblp key: conf/globecom/ChenCK0H22
persistent URL: https://dblp.org/rec/conf/globecom/ChenCK0H22
Bo-Rong Chen, Hsin-Tien Chiang, Heng-Cheng Kuo, Yu Tsao, Yih-Chun Hu:
Key Generation with Ambient Audio. GLOBECOM 2022: 5510-5515
[c199]
dblp key: conf/icassp/LinHHYGTK22
persistent URL: https://dblp.org/rec/conf/icassp/LinHHYGTK22
Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao, Tei-Wei Kuo:
Speech Recovery For Real-World Self-Powered Intermittent Devices. ICASSP 2022: 26-30
[c198]
dblp key: conf/icassp/WangLWT22
persistent URL: https://dblp.org/rec/conf/icassp/WangLWT22
Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang, Yu Tsao:
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement. ICASSP 2022: 1116-1120
[c197]
dblp key: conf/icassp/LuWWRYT22
persistent URL: https://dblp.org/rec/conf/icassp/LuWWRYT22
Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao:
Conditional Diffusion Probabilistic Model for Speech Enhancement. ICASSP 2022: 7402-7406
[c196]
dblp key: conf/icassp/FuYHRT22
persistent URL: https://dblp.org/rec/conf/icassp/FuYHRT22
Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao:
MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation Based Only on Noisy/ Reverberated Speech. ICASSP 2022: 7412-7416
[c195]
dblp key: conf/icassp/LinHLLT22
persistent URL: https://dblp.org/rec/conf/icassp/LinHLLT22
Guan-Ting Lin, Chan-Jan Hsu, Da-Rong Liu, Hung-Yi Lee, Yu Tsao:
Analyzing The Robustness of Unsupervised Speech Recognition. ICASSP 2022: 8202-8206
[c194]
dblp key: conf/icassp/YangQCTC22
persistent URL: https://dblp.org/rec/conf/icassp/YangQCTC22
Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao, Pin-Yu Chen:
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing. ICASSP 2022: 8602-8606
[c193]
dblp key: conf/icassp/WuKZHLTWM22
persistent URL: https://dblp.org/rec/conf/icassp/WuKZHLTWM22
Haibin Wu, Heng-Cheng Kuo, Naijun Zheng, Kuo-Hsuan Hung, Hung-Yi Lee, Yu Tsao, Hsin-Min Wang, Helen Meng:
Partially Fake Audio Detection by Self-Attention-Based Fake Span Discovery. ICASSP 2022: 9236-9240
[c192]
dblp key: conf/interspeech/HungFTC0L22
persistent URL: https://dblp.org/rec/conf/interspeech/HungFTC0L22
Kuo-Hsuan Hung, Szu-Wei Fu, Huan-Hsin Tseng, Hsin-Tien Chiang, Yu Tsao, Chii-Wann Lin:
Boosting Self-Supervised Embeddings for Speech Enhancement. INTERSPEECH 2022: 186-190
[c191]
dblp key: conf/interspeech/PengCSY0C22
persistent URL: https://dblp.org/rec/conf/interspeech/PengCSY0C22
Chiang-Jen Peng, Yun-Ju Chan, Yih-Liang Shen, Cheng Yu, Yu Tsao, Tai-Shih Chi:
Perceptual Characteristics Based Multi-objective Model for Speech Enhancement. INTERSPEECH 2022: 211-215
[c190]
dblp key: conf/interspeech/YuFH0R22
persistent URL: https://dblp.org/rec/conf/interspeech/YuFH0R22
Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao, Mirco Ravanelli:
OSSEM: one-shot speaker adaptive speech enhancement using meta learning. INTERSPEECH 2022: 981-985
[c189]
dblp key: conf/interspeech/LeeHLCWT22
persistent URL: https://dblp.org/rec/conf/interspeech/LeeHLCWT22
Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang, Yu Tsao:
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling. INTERSPEECH 2022: 1183-1187
[c188]
dblp key: conf/interspeech/Chen022
persistent URL: https://dblp.org/rec/conf/interspeech/Chen022
Yu-Wen Chen, Yu Tsao:
InQSS: a speech intelligibility and quality assessment model using a multi-task learning network. INTERSPEECH 2022: 3088-3092
[c187]
dblp key: conf/interspeech/Zezario0FWT22
persistent URL: https://dblp.org/rec/conf/interspeech/Zezario0FWT22
Ryandhimas Edo Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. INTERSPEECH 2022: 3944-3948
[c186]
dblp key: conf/interspeech/HuangC0WTY22
persistent URL: https://dblp.org/rec/conf/interspeech/HuangC0WTY22
Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi:
The VoiceMOS Challenge 2022. INTERSPEECH 2022: 4536-4540
[c185]
dblp key: conf/interspeech/WangLTW22
persistent URL: https://dblp.org/rec/conf/interspeech/WangLTW22
Fan-Lin Wang, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks. INTERSPEECH 2022: 5343-5347
[c184]
dblp key: conf/interspeech/ChaoYFL022
persistent URL: https://dblp.org/rec/conf/interspeech/ChaoYFL022
Rong Chao, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao:
Perceptual Contrast Stretching on Target Feature for Speech Enhancement. INTERSPEECH 2022: 5448-5452
[c183]
dblp key: conf/interspeech/LuCLZCNMYSW0Q022
persistent URL: https://dblp.org/rec/conf/interspeech/LuCLZCNMYSW0Q022
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. INTERSPEECH 2022: 5458-5462
[c182]
dblp key: conf/interspeech/ZezarioFCFW022
persistent URL: https://dblp.org/rec/conf/interspeech/ZezarioFCFW022
Ryandhimas Edo Zezario, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model. INTERSPEECH 2022: 5463-5467
[c181]
dblp key: conf/iscslp/LeeCCTW22
persistent URL: https://dblp.org/rec/conf/iscslp/LeeCCTW22
Hung-Shin Lee, Pin-Yuan Chen, Yao-Fei Cheng, Yu Tsao, Hsin-Min Wang:
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition. ISCSLP 2022: 145-149
[c180]
dblp key: conf/iscslp/TingWCST22
persistent URL: https://dblp.org/rec/conf/iscslp/TingWCST22
Wen-Yuan Ting, Syu-Siang Wang, Hsin-Li Chang, Borching Su, Yu Tsao:
Speech Enhancement Based on CycleGAN with Noise-informed Training. ISCSLP 2022: 155-159
[c179]
dblp key: conf/iwaenc/Feng0022
persistent URL: https://dblp.org/rec/conf/iwaenc/Feng0022
Zicheng Feng, Yu Tsao, Fei Chen:
Preservation Of Interaural Level Difference Cue In A Deep Learning-Based Speech Separation System For Bilateral And Bimodal Cochlear Implants Users. IWAENC 2022: 1-5
[c178]
dblp key: conf/rocling/LuoFC0WS22
persistent URL: https://dblp.org/rec/conf/rocling/LuoFC0WS22
Shang-Bao Luo, Cheng-Chung Fan, Kuan-Yu Chen, Yu Tsao, Hsin-Min Wang, Keh-Yih Su:
Chinese Movie Dialogue Question Answering Dataset. ROCLING 2022: 7-14
[i112]
dblp key: journals/corr/abs-2201-09913
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-09913
Tassadaq Hussain, Wei-Chien Wang, Mandar Gogate, Kia Dashtipour, Yu Tsao, Xugang Lu, Ahsan Adeel, Amir Hussain:
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement. CoRR abs/2201.09913 (2022)
[i111]
dblp key: journals/corr/abs-2202-05256
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-05256
Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao:
Conditional Diffusion Probabilistic Model for Speech Enhancement. CoRR abs/2202.05256 (2022)
[i110]
dblp key: journals/corr/abs-2202-05756
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-05756
Tassadaq Hussain, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao, Amir Hussain:
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning. CoRR abs/2202.05756 (2022)
[i109]
dblp key: journals/corr/abs-2202-06507
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-06507
Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang, Yu Tsao:
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement. CoRR abs/2202.06507 (2022)
[i108]
dblp key: journals/corr/abs-2202-06684
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-06684
Haibin Wu, Heng-Cheng Kuo, Naijun Zheng, Kuo-Hsuan Hung, Hung-Yi Lee, Yu Tsao, Hsin-Min Wang, Helen Meng:
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery. CoRR abs/2202.06684 (2022)
[i107]
dblp key: journals/corr/abs-2202-10777
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-10777
Syu-Siang Wang, Chi-Te Wang, Chih-Chung Lai, Yu Tsao, Shih-Hau Fang:
Continuous Speech for Improved Learning Pathological Voice Disorders. CoRR abs/2202.10777 (2022)
[i106]
dblp key: journals/corr/abs-2203-03550
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-03550
Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao, Pin-Yu Chen:
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing. CoRR abs/2203.03550 (2022)
[i105]
dblp key: journals/corr/abs-2203-11389
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-11389
Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi:
The VoiceMOS Challenge 2022. CoRR abs/2203.11389 (2022)
[i104]
dblp key: journals/corr/abs-2203-13696
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-13696
Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition. CoRR abs/2203.13696 (2022)
[i103]
dblp key: journals/corr/abs-2203-15576
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-15576
Hung-Shin Lee, Yu Tsao, Shyh-Kang Jeng, Hsin-Min Wang:
Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition. CoRR abs/2203.15576 (2022)
[i102]
dblp key: journals/corr/abs-2203-16007
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-16007
Chin-Yi Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Multi-Target Filter and Detector for Speaker Diarization. CoRR abs/2203.16007 (2022)
[i101]
dblp key: journals/corr/abs-2203-16040
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-16040
Fan-Lin Wang, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks. CoRR abs/2203.16040 (2022)
[i100]
dblp key: journals/corr/abs-2203-17036
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-17036
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Partial Coupling of Optimal Transport for Spoken Language Identification. CoRR abs/2203.17036 (2022)
[i99]
dblp key: journals/corr/abs-2203-17152
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-17152
Rong Chao, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao:
Perceptual Contrast Stretching on Target Feature for Speech Enhancement. CoRR abs/2203.17152 (2022)
[i98]
dblp key: journals/corr/abs-2204-00164
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-00164
Chiang-Lin Tai, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Filter-based Discriminative Autoencoders for Children Speech Recognition. CoRR abs/2204.00164 (2022)
[i97]
dblp key: journals/corr/abs-2204-03305
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-03305
Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. CoRR abs/2204.03305 (2022)
[i96]
dblp key: journals/corr/abs-2204-03310
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-03310
Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model. CoRR abs/2204.03310 (2022)
[i95]
dblp key: journals/corr/abs-2204-04333
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-04333
Shih-Kuang Lee, Yu Tsao, Hsin-Min Wang:
A Study of Using Cepstrogram for Countermeasure Against Replay Attacks. CoRR abs/2204.04333 (2022)
[i94]
dblp key: journals/corr/abs-2204-07316
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-07316
Chan-Jan Hsu, Hung-yi Lee, Yu Tsao:
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding. CoRR abs/2204.07316 (2022)
[i93]
dblp key: journals/corr/abs-2206-07860
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-07860
Li-Chin Chen, Po-Hsun Chen, Richard Tzong-Han Tsai, Yu Tsao:
EPG2S: Speech Generation and Speech Enhancement based on Electropalatography and Audio Signals using Multimodal Learning. CoRR abs/2206.07860 (2022)
[i92]
dblp key: journals/corr/abs-2206-09058
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-09058
Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang, Yu Tsao:
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling. CoRR abs/2206.09058 (2022)
[i91]
dblp key: journals/corr/abs-2207-09514
persistent URL: https://dblp.org/rec/journals/corr/abs-2207-09514
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. CoRR abs/2207.09514 (2022)
[i90]
dblp key: journals/corr/abs-2209-10446
persistent URL: https://dblp.org/rec/journals/corr/abs-2209-10446
Yin-Ping Cho, Yu Tsao, Hsin-Min Wang, Yi-Wen Liu:
Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN. CoRR abs/2209.10446 (2022)
[i89]
dblp key: journals/corr/abs-2210-13271
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-13271
Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
ECG Artifact Removal from Single-Channel Surface EMG Using Fully Convolutional Networks. CoRR abs/2210.13271 (2022)
[i88]
dblp key: journals/corr/abs-2210-15368
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-15368
Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
A Teacher-student Framework for Unsupervised Speech Enhancement Using Noise Remixing Training and Two-stage Inference. CoRR abs/2210.15368 (2022)
[i87]
dblp key: journals/corr/abs-2210-15370
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-15370
Fan-Lin Wang, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
CasNet: Investigating Channel Robustness for Speech Separation. CoRR abs/2210.15370 (2022)
[i86]
dblp key: journals/corr/abs-2210-17456
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-17456
I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain, Yu Tsao, Jen-Cheng Hou:
Audio-Visual Speech Enhancement and Separation by Leveraging Multi-Modal Self-Supervised Embeddings. CoRR abs/2210.17456 (2022)
[i85]
dblp key: journals/corr/abs-2211-00586
persistent URL: https://dblp.org/rec/journals/corr/abs-2211-00586
Chan-Jan Hsu, Ho-Lam Chung, Hung-yi Lee, Yu Tsao:
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5. CoRR abs/2211.00586 (2022)
[i84]
dblp key: journals/corr/abs-2211-01189
persistent URL: https://dblp.org/rec/journals/corr/abs-2211-01189
Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao:
Inference and Denoise: Causal Inference-based Neural Speech Enhancement. CoRR abs/2211.01189 (2022)
[i83]
dblp key: journals/corr/abs-2211-06508
persistent URL: https://dblp.org/rec/journals/corr/abs-2211-06508
Hsin-Yi Lin, Huan-Hsin Tseng, Yu Tsao:
On the robustness of non-intrusive speech quality model by adversarial examples. CoRR abs/2211.06508 (2022)
2021
[j56]
dblp key: journals/ploscb/LinAT21
persistent URL: https://dblp.org/rec/journals/ploscb/LinAT21
Tzu-Hao Lin, Tomonari Akamatsu, Yu Tsao:
Sensing ecosystem dynamics via audio source separation: A case study of marine soundscapes off northeastern Taiwan. PLoS Comput. Biol. 17(2) (2021)
[j55]
dblp key: journals/tamd/AbousalehCYT21
persistent URL: https://dblp.org/rec/journals/tamd/AbousalehCYT21
Fatma S. Abousaleh, Wen-Huang Cheng, Neng-Hao Yu, Yu Tsao:
Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media. IEEE Trans. Cogn. Dev. Syst. 13(3): 679-692 (2021)
[j54]
dblp key: journals/tamd/TsengWFLT21
persistent URL: https://dblp.org/rec/journals/tamd/TsengWFLT21
Rung-Yu Tseng, Taowei Wang, Szu-Wei Fu, Chia-Ying Lee, Yu Tsao:
A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation. IEEE Trans. Cogn. Dev. Syst. 13(4): 984-994 (2021)
[j53]
dblp key: journals/taslp/LuSTK21
persistent URL: https://dblp.org/rec/journals/taslp/LuSTK21
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Coupling a Generative Model With a Discriminative Learning Framework for Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3631-3641 (2021)
[j52]
dblp key: journals/tmm/HidayatiGCHSWHT21
persistent URL: https://dblp.org/rec/journals/tmm/HidayatiGCHSWHT21
Shintami Chusnul Hidayati, Ting Wei Goh, Ji-Sheng Gary Chan, Cheng-Chun Hsu, John See, Lai-Kuan Wong, Kai-Lung Hua, Yu Tsao, Wen-Huang Cheng:
Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes. IEEE Trans. Multim. 23: 365-377 (2021)
[c177]
dblp key: conf/apsipa/LuTW21
persistent URL: https://dblp.org/rec/conf/apsipa/LuTW21
Yen-Ju Lu, Yu Tsao, Shinji Watanabe:
A Study on Speech Enhancement Based on Diffusion Probabilistic Model. APSIPA ASC 2021: 659-666
[c176]
dblp key: conf/apsipa/LuSTK21
persistent URL: https://dblp.org/rec/conf/apsipa/LuSTK21
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification. APSIPA ASC 2021: 769-774
[c175]
dblp key: conf/apsipa/LiouHYTPTTW21
persistent URL: https://dblp.org/rec/conf/apsipa/LiouHYTPTTW21
Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion. APSIPA ASC 2021: 1234-1238
[c174]
dblp key: conf/apsipa/FengTC21
persistent URL: https://dblp.org/rec/conf/apsipa/FengTC21
Zicheng Feng, Yu Tsao, Fei Chen:
Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues. APSIPA ASC 2021: 1239-1244
[c173]
dblp key: conf/apsipa/LiWTS21
persistent URL: https://dblp.org/rec/conf/apsipa/LiWTS21
You-Jin Li, Syu-Siang Wang, Yu Tsao, Borching Su:
MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder. APSIPA ASC 2021: 1245-1250
[c172]
dblp key: conf/asru/ChangMGSLSWYTLW21
persistent URL: https://dblp.org/rec/conf/asru/ChangMGSLSWYTLW21
Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-Wen Yang, Yu Tsao, Hung-yi Lee, Shinji Watanabe:
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition. ASRU 2021: 228-235
[c171]
dblp key: conf/asru/YenHKPTTTJW21
persistent URL: https://dblp.org/rec/conf/asru/YenHKPTTTJW21
Ming-Chi Yen, Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng, Shu-Wei Tsai, Yu Tsao, Tomoki Toda, Jyh-Shing Roger Jang, Hsin-Min Wang:
Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling. ASRU 2021: 650-657
[c170]
dblp key: conf/asru/ChiangWYTWHT21
persistent URL: https://dblp.org/rec/conf/asru/ChiangWYTWHT21
Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao:
HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network. ASRU 2021: 907-913
[c169]
dblp key: conf/bhi/LuLHCTC21
persistent URL: https://dblp.org/rec/conf/bhi/LuLHCTC21
Ting-Yang Lu, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao, Chia-Tai Chan:
Instrumented shoulder functional assessment using inertial measurement units for frozen shoulder. BHI 2021: 1-4
[c168]
dblp key: conf/eusipco/ZezarioFW021
persistent URL: https://dblp.org/rec/conf/eusipco/ZezarioFW021
Ryandhimas E. Zezario, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Speech Enhancement with Zero-Shot Model Selection. EUSIPCO 2021: 491-495
[c167]
dblp key: conf/eusipco/ChenHCSL021
persistent URL: https://dblp.org/rec/conf/eusipco/ChenHCSL021
Yu-Wen Chen, Kuo-Hsuan Hung, Shang-Yi Chuang, Jonathan Sherman, Xugang Lu, Yu Tsao:
A Study of Incorporating Articulatory Movement Information in Speech Enhancement. EUSIPCO 2021: 496-500
[c166]
dblp key: conf/icassp/WuH0L21
persistent URL: https://dblp.org/rec/conf/icassp/WuH0L21
Yuan-Kuei Wu, Kuan-Po Huang, Yu Tsao, Hung-yi Lee:
One Shot Learning for Speech Separation. ICASSP 2021: 5769-5773
[c165]
dblp key: conf/icassp/LuS0K21
persistent URL: https://dblp.org/rec/conf/icassp/LuS0K21
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Unsupervised Neural Adaptation Model Based on Optimal Transport for Spoken Language Identification. ICASSP 2021: 7213-7217
[c164]
dblp key: conf/interspeech/HsiehYFL021
persistent URL: https://dblp.org/rec/conf/interspeech/HsiehYFL021
Tsun-An Hsieh, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao:
Improving Perceptual Quality by Phone-Fortified Perceptual Loss Using Wasserstein Distance for Speech Enhancement. Interspeech 2021: 196-200
[c163]
dblp key: conf/interspeech/FuYHPRL021
persistent URL: https://dblp.org/rec/conf/interspeech/FuYHPRL021
Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao:
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement. Interspeech 2021: 201-205
[c162]
dblp key: conf/interspeech/HuangKPLTWT21
persistent URL: https://dblp.org/rec/conf/interspeech/HuangKPLTWT21
Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng, Ching-Feng Liu, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion. Interspeech 2021: 1329-1333
[c161]
dblp key: conf/interspeech/LinHL0L21
persistent URL: https://dblp.org/rec/conf/interspeech/LinHL0L21
Gang-Xuan Lin, Shih-Wei Hu, Yen-Ju Lu, Yu Tsao, Chun-Shien Lu:
QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization. Interspeech 2021: 1639-1643
[c160]
dblp key: conf/interspeech/WuHLPHTWT21
persistent URL: https://dblp.org/rec/conf/interspeech/WuHLPHTWT21
Yi-Chiao Wu, Cheng-Hung Hu, Hung-Shin Lee, Yu-Huai Peng, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder. Interspeech 2021: 3630-3634
[c159]
dblp key: conf/iscas/ChenHCSHL021
persistent URL: https://dblp.org/rec/conf/iscas/ChenHCSHL021
Yu-Wen Chen, Kuo-Hsuan Hung, Shang-Yi Chuang, Jonathan Sherman, Wen-Chin Huang, Xugang Lu, Yu Tsao:
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System. ISCAS 2021: 1-5
[c158]
dblp key: conf/iscas/PengCYW0C21
persistent URL: https://dblp.org/rec/conf/iscas/PengCYW0C21
Chiang-Jen Peng, Yun-Ju Chan, Cheng Yu, Syu-Siang Wang, Yu Tsao, Tai-Shih Chi:
Attention-Based Multi-Task Learning for Speech-Enhancement and Speaker-Identification in Multi-Speaker Dialogue Scenario. ISCAS 2021: 1-5
[c157]
dblp key: conf/iscslp/ChangYPWC0W21
persistent URL: https://dblp.org/rec/conf/iscslp/ChangYPWC0W21
Yu-Tao Chang, Yuan-Hong Yang, Yu-Huai Peng, Syu-Siang Wang, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang:
MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration. ISCSLP 2021: 1-5
[c156]
dblp key: conf/medinfo/ChenS0C21
persistent URL: https://dblp.org/rec/conf/medinfo/ChenS0C21
Lichin Chen, Ji-Tian Sheu, Yu Tsao, Yuh-Jue Chuang:
Deep Learning and Explainable Artificial Intelligence to Predict Patients' Choice of Hospital Levels in Urban and Rural Areas. MedInfo 2021: 734-738
[c155]
dblp key: conf/nips/LinTLT21
persistent URL: https://dblp.org/rec/conf/nips/LinTLT21
Hsin-Yi Lin, Huan-Hsin Tseng, Xugang Lu, Yu Tsao:
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport. NeurIPS 2021: 19935-19946
[c154]
dblp key: conf/ococosda/NoorLWGCZAC0W21
persistent URL: https://dblp.org/rec/conf/ococosda/NoorLWGCZAC0W21
Md Mahbub E. Noor, Yen-Ju Lu, Syu-Siang Wang, Supratip Ghose, Chia-Yu Chang, Ryandhimas E. Zezario, Shafique Ahmed, Wei-Ho Chung, Yu Tsao, Hsin-Min Wang:
Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions. O-COCOSDA 2021: 7-12
[c153]
dblp key: conf/rocling/FanKLLCHWTWSLWC21
persistent URL: https://dblp.org/rec/conf/rocling/FanKLLCHWTWSLWC21
Cheng-Chung Fan, Chia-Chih Kuo, Shang-Bao Luo, Pei-Jun Liao, Kuang-Yu Chang, Chiao-Wei Hsu, Meng-Tse Wu, Shih-Hong Tsai, Tzu-Man Wu, Aleksandra Smolka, Chao-Chun Liang, Hsin-Min Wang, Kuan-Yu Chen, Yu Tsao, Keh-Yih Su:
A Flexible and Extensible Framework for Multiple Answer Modes Question Answering. ROCLING 2021: 33-42
[i82]
dblp key: journals/corr/abs-2101-02550
persistent URL: https://dblp.org/rec/journals/corr/abs-2101-02550
Chiang-Jen Peng, Yun-Ju Chan, Cheng Yu, Syu-Siang Wang, Yu Tsao, Tai-Shih Chi:
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario. CoRR abs/2101.02550 (2021)
[i81]
dblp key: journals/corr/abs-2101-03329
persistent URL: https://dblp.org/rec/journals/corr/abs-2101-03329
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification. CoRR abs/2101.03329 (2021)
[i80]
dblp key: journals/corr/abs-2102-03786
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-03786
Yu-Wen Chen, Kuo-Hsuan Hung, Shang-Yi Chuang, Jonathan Sherman, Wen-Chin Huang, Xugang Lu, Yu Tsao:
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System. CoRR abs/2102.03786 (2021)
[i79]
dblp key: journals/corr/abs-2104-03004
persistent URL: https://dblp.org/rec/journals/corr/abs-2104-03004
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification. CoRR abs/2104.03004 (2021)
[i78]
dblp key: journals/corr/abs-2104-03009
persistent URL: https://dblp.org/rec/journals/corr/abs-2104-03009
Cheng-Hung Hu, Yi-Chiao Wu, Wen-Chin Huang, Yu-Huai Peng, Yu-Wen Chen, Pin-Jui Ku, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
The AS-NU System for the M2VoC Challenge. CoRR abs/2104.03009 (2021)
[i77]
dblp key: journals/corr/abs-2104-03538
persistent URL: https://dblp.org/rec/journals/corr/abs-2104-03538
Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao:
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement. CoRR abs/2104.03538 (2021)
[i76]
dblp key: journals/corr/abs-2105-08809
persistent URL: https://dblp.org/rec/journals/corr/abs-2105-08809
Fatma S. Abousaleh, Wen-Huang Cheng, Neng-Hao Yu, Yu Tsao:
Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media. CoRR abs/2105.08809 (2021)
[i75]
dblp key: journals/corr/abs-2106-01415
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-01415
Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng, Ching-Feng Liu, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion. CoRR abs/2106.01415 (2021)
[i74]
dblp key: journals/corr/abs-2106-05229
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-05229
Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao, Tei-Wei Kuo:
Intermittent Speech Recovery. CoRR abs/2106.05229 (2021)
[i73]
dblp key: journals/corr/abs-2107-09392
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-09392
Cheng-Hung Hu, Yu-Huai Peng, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model. CoRR abs/2107.09392 (2021)
[i72]
dblp key: journals/corr/abs-2107-11876
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-11876
Yen-Ju Lu, Yu Tsao, Shinji Watanabe:
A Study on Speech Enhancement Based on Diffusion Probabilistic Model. CoRR abs/2107.11876 (2021)
[i71]
dblp key: journals/corr/abs-2109-03551
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-03551
Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion. CoRR abs/2109.03551 (2021)
[i70]
dblp key: journals/corr/abs-2110-03894
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-03894
Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao:
A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming. CoRR abs/2110.03894 (2021)
[i69]
dblp key: journals/corr/abs-2110-04590
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-04590
Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-Wen Yang, Yu Tsao, Hung-yi Lee, Shinji Watanabe:
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition. CoRR abs/2110.04590 (2021)
[i68]
dblp key: journals/corr/abs-2110-05866
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-05866
Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao:
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech. CoRR abs/2110.05866 (2021)
[i67]
dblp key: journals/corr/abs-2110-09923
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-09923
Yun-Ju Chan, Chiang-Jen Peng, Syu-Siang Wang, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi:
Speech Enhancement-assisted Stargan Voice Conversion in Noisy Environments. CoRR abs/2110.09923 (2021)
[i66]
dblp key: journals/corr/abs-2110-09924
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-09924
Wen-Yuan Ting, Syu-Siang Wang, Hsin-Li Chang, Borching Su, Yu Tsao:
Speech Enhancement Based on Cyclegan with Noise-informed Training. CoRR abs/2110.09924 (2021)
[i65]
dblp key: journals/corr/abs-2111-02363
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-02363
Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features. CoRR abs/2111.02363 (2021)
[i64]
dblp key: journals/corr/abs-2111-02585
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-02585
Yu-Wen Chen, Yu Tsao:
InQSS: a speech intelligibility assessment model using a multi-task learning network. CoRR abs/2111.02585 (2021)
[i63]
dblp key: journals/corr/abs-2111-04436
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-04436
Yu-Chen Lin, Cheng Yu, Yi-Te Hsu, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points. CoRR abs/2111.04436 (2021)
[i62]
dblp key: journals/corr/abs-2111-05691
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-05691
Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao:
HASA-net: A non-intrusive hearing-aid speech assessment network. CoRR abs/2111.05691 (2021)
[i61]
dblp key: journals/corr/abs-2111-05703
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-05703
Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao, Mirco Ravanelli:
OSSEM: one-shot speaker adaptive speech enhancement using meta learning. CoRR abs/2111.05703 (2021)
[i60]
dblp key: journals/corr/abs-2111-06316
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-06316
Hsin-Yi Lin, Huan-Hsin Tseng, Xugang Lu, Yu Tsao:
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport. CoRR abs/2111.06316 (2021)
[i59]
dblp key: journals/corr/abs-2112-02538
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-02538
Heng-Cheng Kuo, Yu-Peng Hsieh, Huan-Hsin Tseng, Chi-Tei Wang, Shih-Hau Fang, Yu Tsao:
Toward Real-World Pathological Voice Detection. CoRR abs/2112.02538 (2021)
[i58]
dblp key: journals/corr/abs-2112-03541
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-03541
Lichin Chen, Ji-Tian Sheu, Yuh-Jue Chuang, Yu Tsao:
Predicting the Travel Distance of Patients to Access Healthcare using Deep Neural Networks. CoRR abs/2112.03541 (2021)
2020
[j51]
dblp key: journals/access/PotortiPCPGBLTR20
persistent URL: https://dblp.org/rec/journals/access/PotortiPCPGBLTR20
Francesco Potorti, Sangjoon Park, Antonino Crivello, Filippo Palumbo, Michele Girolami, Paolo Barsocchi, Soyeon Lee, Joaquín Torres-Sospedra, Antonio Ramón Jiménez Ruiz, Antoni Pérez-Navarro, Germán Martín Mendoza-Silva, Fernando Seco, Miguel Ortiz, Johan Perul, Valérie Renaudin, Hyunwoong Kang, Soyoung Park, Jae Hong Lee, Chan Gook Park, Jisu Ha, Jaeseung Han, Changjun Park, Keunhye Kim, Yonghyun Lee, Seunghun Gye, Keumryeol Lee, Eun-Jee Kim, Jeongsik Choi, Yang-Seok Choi, Shilpa Talwar, Seong Yun Cho, Boaz Ben-Moshe, Alex Scherbakov, Leonid Antsfeld, Emilio Sansano-Sansano, Boris Chidlovskii, Nikolai Kronenwett, Silvia Prophet, Yael Landay, Revital Marbel, Lingxiang Zheng, Ao Peng, Zhichao Lin, Bang Wu, Chengqi Ma, Stefan Poslad, David R. Selviah, Wei Wu, Zixiang Ma, Wenchao Zhang, Dongyan Wei, Hong Yuan, Jun-Bang Jiang, Shao-Yung Huang, Jing-Wen Liu, Kuan-Wu Su, Jenq-Shiou Leu, Kazuki Nishiguchi, Walid Bousselham, Hideaki Uchiyama, Diego Thomas, Atsushi Shimada, Rin-Ichiro Taniguchi, Vicente Cortés Puschel, Tomás Lungenstrass Poulsen, Imran Ashraf, Chanseok Lee, Muhammad Usman Ali, Yeongjun Im, Gunzung Kim, Jeongsook Eom, Soojung Hur, Yongwan Park, Miroslav Opiela, Adriano J. C. Moreira, Maria João Nicolau, Cristiano G. Pendão, Ivo Silva, Filipe Meneses, António Costa, Jens Trogh, David Plets, Ying-Ren Chien, Tzu-Yu Chang, Shih-Hau Fang, Yu Tsao:
The IPIN 2019 Indoor Localisation Competition - Description and Results. IEEE Access 8: 206674-206718 (2020)
[j50]
dblp key: journals/csl/WangYTDNESVKLJA20
persistent URL: https://dblp.org/rec/journals/csl/WangYTDNESVKLJA20
Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020)
[j49]
dblp key: journals/spl/FuLT20
persistent URL: https://dblp.org/rec/journals/spl/FuLT20
Szu-Wei Fu, Chien-Feng Liao, Yu Tsao:
Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality. IEEE Signal Process. Lett. 27: 26-30 (2020)
[j48]
dblp key: journals/spl/YuHWTH20
persistent URL: https://dblp.org/rec/journals/spl/YuHWTH20
Cheng Yu, Kuo-Hsuan Hung, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung:
Time-Domain Multi-Modal Bone/Air Conducted Speech Enhancement. IEEE Signal Process. Lett. 27: 1035-1039 (2020)
[j47]
dblp key: journals/spl/HsiehWLT20
persistent URL: https://dblp.org/rec/journals/spl/HsiehWLT20
Tsun-An Hsieh, Hsin-Min Wang, Xugang Lu, Yu Tsao:
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-End Speech Enhancement. IEEE Signal Process. Lett. 27: 2149-2153 (2020)
[j46]
dblp key: journals/tamd/HussainSWTSL20
persistent URL: https://dblp.org/rec/journals/tamd/HussainSWTSL20
Tassadaq Hussain, Sabato Marco Siniscalchi, Hsiao-Lan Sharon Wang, Yu Tsao, Valerio Mario Salerno, Wen-Hung Liao:
Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation. IEEE Trans. Cogn. Dev. Syst. 12(4): 744-758 (2020)
[j45]
dblp key: journals/taslp/LiuFLHWT20
persistent URL: https://dblp.org/rec/journals/taslp/LiuFLHWT20
Chang-Le Liu, Sze-Wei Fu, You-Jin Li, Jen-Wei Huang, Hsin-Min Wang, Yu Tsao:
Multichannel Speech Enhancement by Raw Waveform-Mapping Using Fully Convolutional Networks. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1888-1900 (2020)
[j44]
dblp key: journals/taslp/YuZWSHLW020
persistent URL: https://dblp.org/rec/journals/taslp/YuZWSHLW020
Cheng Yu, Ryandhimas E. Zezario, Syu-Siang Wang, Jonathan Sherman, Yi-Yen Hsieh, Xugang Lu, Hsin-Min Wang, Yu Tsao:
Speech Enhancement Based on Denoising Autoencoder With Multi-Branched Encoders. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2756-2769 (2020)
[j43]
dblp key: journals/taslp/Lee0JW20
persistent URL: https://dblp.org/rec/journals/taslp/Lee0JW20
Hung-Shin Lee, Yu Tsao, Shyh-Kang Jeng, Hsin-Min Wang:
Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 3065-3079 (2020)
[j42]
dblp key: journals/tetci/HuangLHLPTW20
persistent URL: https://dblp.org/rec/journals/tetci/HuangLHLPTW20
Wen-Chin Huang, Hao Luo, Hsin-Te Hwang, Chen-Chou Lo, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang:
Unsupervised Representation Disentanglement Using Cross Domain Features and Adversarial Learning in Variational Autoencoder Based Voice Conversion. IEEE Trans. Emerg. Top. Comput. Intell. 4(4): 468-479 (2020)
[j41]
dblp key: journals/titb/TsaiWCTWLFCT20
persistent URL: https://dblp.org/rec/journals/titb/TsaiWCTWLFCT20
Kun-Hsi Tsai, Wei-Chien Wang, Chui-Hsuan Cheng, Chan-Yen Tsai, Jou-Kou Wang, Tzu-Hao Lin, Shih-Hau Fang, Lichin Chen, Yu Tsao:
Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder. IEEE J. Biomed. Health Informatics 24(11): 3203-3214 (2020)
[c152]
dblp key: conf/apsipa/FuLHHWYKZLCLL020
persistent URL: https://dblp.org/rec/conf/apsipa/FuLHHWYKZLCLL020
Szu-Wei Fu, Chien-Feng Liao, Tsun-An Hsieh, Kuo-Hsuan Hung, Syu-Siang Wang, Cheng Yu, Heng-Cheng Kuo, Ryandhimas E. Zezario, You-Jin Li, Shang-Yi Chuang, Yen-Ju Lu, Yu-Chen Lin, Yu Tsao:
Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing. APSIPA 2020: 455-459
[c151]
dblp key: conf/apsipa/ZezarioFFTW20
persistent URL: https://dblp.org/rec/conf/apsipa/ZezarioFFTW20
Ryandhimas E. Zezario, Szu-Wei Fu, Chiou-Shann Fuh, Yu Tsao, Hsin-Min Wang:
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model. APSIPA 2020: 482-486
[c150]
dblp key: conf/blizzard/PengHKLC0W20
persistent URL: https://dblp.org/rec/conf/blizzard/PengHKLC0W20
Yu-Huai Peng, Cheng-Hung Hu, Alexander Chao-Fu Kang, Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
The Academia Sinica Systems of Voice Conversion for VCC2020. Blizzard Challenge / Voice Conversion Challenge 2020
[c149]
  dblp key:
  - conf/globecom/LinLLT20
  persistent URL:
  - https://dblp.org/rec/conf/globecom/LinLLT20
Chi-Lun Lin, Kate Ching-Ju Lin, Chi-Cheng Lee, Yu Tsao:
Cross-Technology Interference Mitigation Using Fully Convolutional Denoising Autoencoders. GLOBECOM 2020: 1-6
[c148]
  dblp key:
  - conf/icassp/ZezarioHLWT20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZezarioHLWT20
Ryandhimas E. Zezario, Tassadaq Hussain, Xugang Lu, Hsin-Min Wang, Yu Tsao:
Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement. ICASSP 2020: 6669-6673
[c147]
  dblp key:
  - conf/icce-tw/LinLW0H20
  persistent URL:
  - https://dblp.org/rec/conf/icce-tw/LinLW0H20
Chen-Li Lin, Zi-Qiang Lin, Syu-Siang Wang, Yu Tsao, Jeih-Weih Hung:
Exponentiated magnitude spectrogram-based relative-to-maximum masking for speech enhancement in adverse environments. ICCE-TW 2020: 1-2
[c146]
  dblp key:
  - conf/icip/WuLT0WC20
  persistent URL:
  - https://dblp.org/rec/conf/icip/WuLT0WC20
Chih-Wei Wu, Chih-Ting Liu, Wei-Chih Tu, Yu Tsao, Yu-Chiang Frank Wang, Shao-Yi Chien:
Space-Time Guided Association Learning For Unsupervised Person Re-Identification. ICIP 2020: 2261-2265
[c145]
  dblp key:
  - conf/interspeech/Chuang0LW20
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chuang0LW20
Shang-Yi Chuang, Yu Tsao, Chen-Chou Lo, Hsin-Min Wang:
Lite Audio-Visual Speech Enhancement. INTERSPEECH 2020: 1131-1135
[c144]
  dblp key:
  - conf/interspeech/LiF0Y20
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiF0Y20
Haoyu Li, Szu-Wei Fu, Yu Tsao, Junichi Yamagishi:
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning. INTERSPEECH 2020: 1336-1340
[c143]
  dblp key:
  - conf/interspeech/LuLLH020
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuLLH020
Yen-Ju Lu, Chien-Feng Liao, Xugang Lu, Jeih-weih Hung, Yu Tsao:
Incorporating Broad Phonetic Information for Speech Enhancement. INTERSPEECH 2020: 2417-2421
[c142]
  dblp key:
  - conf/interspeech/LeeLLW020
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeLLW020
Chi-Chang Lee, Yu-Chen Lin, Hsuan-Tien Lin, Hsin-Min Wang, Yu Tsao:
SERIL: Noise Adaptive Speech Enhancement Using Regularization-Based Incremental Learning. INTERSPEECH 2020: 2432-2436
[c141]
  dblp key:
  - conf/interspeech/ChenZW0LL20
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenZW0LL20
Chen-Yu Chen, Wei-Zhong Zheng, Syu-Siang Wang, Yu Tsao, Pei-Chun Li, Ying-Hui Lai:
Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-Based Voice Conversion System. INTERSPEECH 2020: 4686-4690
[i57]
  dblp key:
  - journals/corr/abs-2001-01538
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-01538
Cheng Yu, Ryandhimas E. Zezario, Jonathan Sherman, Yi-Yen Hsieh, Xugang Lu, Hsin-Min Wang, Yu Tsao:
Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders. CoRR abs/2001.01538 (2020)
[i56]
  dblp key:
  - journals/corr/abs-2001-07849
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-07849
Wen-Chin Huang, Hao Luo, Hsin-Te Hwang, Chen-Chou Lo, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang:
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion. CoRR abs/2001.07849 (2020)
[i55]
  dblp key:
  - journals/corr/abs-2004-00932
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-00932
Haoyu Li, Szu-Wei Fu, Yu Tsao, Junichi Yamagishi:
iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning. CoRR abs/2004.00932 (2020)
[i54]
  dblp key:
  - journals/corr/abs-2004-04098
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-04098
Tsun-An Hsieh, Hsin-Min Wang, Xugang Lu, Yu Tsao:
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement. CoRR abs/2004.04098 (2020)
[i53]
  dblp key:
  - journals/corr/abs-2005-09966
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09966
Yuan-Kuei Wu, Chao-I Tuan, Hung-yi Lee, Yu Tsao:
SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning. CoRR abs/2005.09966 (2020)
[i52]
  dblp key:
  - journals/corr/abs-2005-11760
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-11760
Chi-Chang Lee, Yu-Chen Lin, Hsuan-Tien Lin, Hsin-Min Wang, Yu Tsao:
SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental Learning. CoRR abs/2005.11760 (2020)
[i51]
  dblp key:
  - journals/corr/abs-2005-11769
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-11769
Shang-Yi Chuang, Yu Tsao, Chen-Chou Lo, Hsin-Min Wang:
Lite Audio-Visual Speech Enhancement. CoRR abs/2005.11769 (2020)
[i50]
  dblp key:
  - journals/corr/abs-2006-10296
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-10296
Szu-Wei Fu, Chien-Feng Liao, Tsun-An Hsieh, Kuo-Hsuan Hung, Syu-Siang Wang, Cheng Yu, Heng-Cheng Kuo, Ryandhimas E. Zezario, You-Jin Li, Shang-Yi Chuang, Yen-Ju Lu, Yu Tsao:
Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing. CoRR abs/2006.10296 (2020)
[i49]
  dblp key:
  - journals/corr/abs-2006-13427
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13427
Lichin Chen, Yu Tsao, Ji-Tian Sheu:
Using Deep Learning and Explainable Artificial Intelligence in Patients' Choices of Hospital Levels. CoRR abs/2006.13427 (2020)
[i48]
  dblp key:
  - journals/corr/abs-2008-07618
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-07618
Yen-Ju Lu, Chien-Feng Liao, Xugang Lu, Jeih-weih Hung, Yu Tsao:
Incorporating Broad Phonetic Information for Speech Enhancement. CoRR abs/2008.07618 (2020)
[i47]
  dblp key:
  - journals/corr/abs-2008-09264
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-09264
Alexander Chao-Fu Kang, Kuo-Hsuan Hung, Yu-Wen Chen, You-Jin Li, Ya-Hsin Lai, Kai-Chun Liu, Sze-Wei Fu, Syu-Siang Wang, Yu Tsao:
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application. CoRR abs/2008.09264 (2020)
[i46]
  dblp key:
  - journals/corr/abs-2008-13222
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-13222
Shang-Yi Chuang, Hsin-Min Wang, Yu Tsao:
Improved Lite Audio-Visual Speech Enhancement. CoRR abs/2008.13222 (2020)
[i45]
  dblp key:
  - journals/corr/abs-2010-02669
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02669
Yu-Huai Peng, Cheng-Hung Hu, Alexander Chao-Fu Kang, Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
The Academia Sinica Systems of Voice Conversion for VCC2020. CoRR abs/2010.02669 (2020)
[i44]
  dblp key:
  - journals/corr/abs-2010-15174
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15174
Tsun-An Hsieh, Cheng Yu, Szu-Wei Fu, Xugang Lu, Yu Tsao:
Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement. CoRR abs/2010.15174 (2020)
[i43]
  dblp key:
  - journals/corr/abs-2011-04292
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-04292
Ryandhimas E. Zezario, Szu-Wei Fu, Chiou-Shann Fuh, Yu Tsao, Hsin-Min Wang:
STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model. CoRR abs/2011.04292 (2020)
[i42]
  dblp key:
  - journals/corr/abs-2011-07442
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-07442
Yen-Ju Lu, Chia-Yu Chang, Yu Tsao, Jeih-weih Hung:
Speech enhancement guided by contextual articulatory information. CoRR abs/2011.07442 (2020)
[i41]
  dblp key:
  - journals/corr/abs-2011-10233
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-10233
Yuan-Kuei Wu, Kuan-Po Huang, Yu Tsao, Hung-yi Lee:
One Shot Learning for Speech Separation. CoRR abs/2011.10233 (2020)
[i40]
  dblp key:
  - journals/corr/abs-2012-03426
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-03426
Kai-Chun Liu, Kuo-Hsuan Hung, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao:
Deep Learning Based Signal Enhancement of Low-Resolution Accelerometer for Fall Detection Systems. CoRR abs/2012.03426 (2020)
[i39]
  dblp key:
  - journals/corr/abs-2012-03803
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-03803
Tsai-Min Chen, Yuan-Hong Tsai, Huan-Hsin Tseng, Jhih-Yu Chen, Chih-Han Huang, Guo-Yuan Li, Chun-Yen Shen, Yu Tsao:
ECG Signal Super-resolution by Considering Reconstruction and Cardiac Arrhythmias Classification Loss. CoRR abs/2012.03803 (2020)
[i38]
  dblp key:
  - journals/corr/abs-2012-09359
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-09359
Ryandhimas E. Zezario, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Speech Enhancement with Zero-Shot Model Selection. CoRR abs/2012.09359 (2020)
[i37]
  dblp key:
  - journals/corr/abs-2012-10911
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-10911
Kai-Chun Liu, Michael Chan, Chia-Yeh Hsieh, Hsiang-Yun Huang, Chia-Tai Chan, Yu Tsao:
Domain-adaptive Fall Detection Using Deep Adversarial Training. CoRR abs/2012.10911 (2020)
[i36]
  dblp key:
  - journals/corr/abs-2012-13152
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-13152
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Unsupervised neural adaptation model based on optimal transport for spoken language identification. CoRR abs/2012.13152 (2020)

2010 – 2019

2019
[j40]
  dblp key:
  - journals/access/ChiangHFHTC19
  persistent URL:
  - https://dblp.org/rec/journals/access/ChiangHFHTC19
Hsin-Tien Chiang, Yi-Yen Hsieh, Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Shao-Yi Chien:
Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders. IEEE Access 7: 60806-60813 (2019)
[j39]
  dblp key:
  - journals/access/RenaudinOPTJPMS19
  persistent URL:
  - https://dblp.org/rec/journals/access/RenaudinOPTJPMS19
Valérie Renaudin, Miguel Ortiz, Johan Perul, Joaquín Torres-Sospedra, Antonio Ramón Jiménez, Antoni Pérez-Navarro, Germán Martín Mendoza-Silva, Fernando Seco, Yael Landau, Revital Marbel, Boaz Ben-Moshe, Xingyu Zheng, Feng Ye, Jian Kuang, Yu Li, Xiaoji Niu, Vlad Landa, Shlomi Hacohen, Nir Shvalb, Chuanhua Lu, Hideaki Uchiyama, Diego Thomas, Atsushi Shimada, Rin-Ichiro Taniguchi, Zhenxing Ding, Feng Xu, Nikolai Kronenwett, Blagovest Vladimirov, Soyeon Lee, Eunyoung Cho, Sungwoo Jun, Chang-Eun Lee, Sangjoon Park, Yonghyun Lee, Jehyeok Rew, Changjun Park, Hyeongyo Jeong, Jaeseung Han, Keumryeol Lee, Wenchao Zhang, Xianghong Li, Dongyan Wei, Ying Zhang, So Young Park, Chan Gook Park, Stefan Knauth, Georgios Pipelidis, Nikolaos Tsiamitros, Tomás Lungenstrass, Juan Pablo Morales, Jens Trogh, David Plets, Miroslav Opiela, Shih-Hau Fang, Yu Tsao, Ying-Ren Chien, Shi-Shen Yang, Shih-Jyun Ye, Muhammad Usman Ali, Soojung Hur, Yongwan Park:
Evaluating Indoor Positioning Systems in a Shopping Mall: The Lessons Learned From the IPIN 2018 Competition. IEEE Access 7: 148594-148628 (2019)
[j38]
  dblp key:
  - journals/bspc/TsaoLCCCT19
  persistent URL:
  - https://dblp.org/rec/journals/bspc/TsaoLCCCT19
Yu Tsao, Tzu-Hao Lin, Fei Chen, Yun-Fan Chang, Chui-Hsuan Cheng, Kun-Hsi Tsai:
Robust S1 and S2 heart sound recognition based on spectral restoration and multi-style training. Biomed. Signal Process. Control. 49: 173-180 (2019)
[j37]
  dblp key:
  - journals/spl/WuYFLCT19
  persistent URL:
  - https://dblp.org/rec/journals/spl/WuYFLCT19
Jyun-Yi Wu, Cheng Yu, Szu-Wei Fu, Chih-Ting Liu, Shao-Yi Chien, Yu Tsao:
Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques. IEEE Signal Process. Lett. 26(12): 1887-1891 (2019)
[j36]
  dblp key:
  - journals/taffco/HsiaoSHTTL19
  persistent URL:
  - https://dblp.org/rec/journals/taffco/HsiaoSHTTL19
Shan-Wen Hsiao, Hung-Ching Sun, Ming-Chuan Hsieh, Ming-Hsueh Tsai, Yu Tsao, Chi-Chun Lee:
Toward Automating Oral Presentation Scoring During Principal Certification Program Using Audio-Video Low-Level Behavior Profiles. IEEE Trans. Affect. Comput. 10(4): 552-567 (2019)
[j35]
  dblp key:
  - journals/tcas/LiuLWLLTC19
  persistent URL:
  - https://dblp.org/rec/journals/tcas/LiuLWLLTC19
Chih-Ting Liu, Tung-Wei Lin, Yi-Heng Wu, Yu-Sheng Lin, Heng Lee, Yu Tsao, Shao-Yi Chien:
Computation-Performance Optimization of Convolutional Neural Networks With Redundant Filter Removal. IEEE Trans. Circuits Syst. I Regul. Pap. 66-I(5): 1908-1921 (2019)
[c140]
  dblp key:
  - conf/aicas/LoWTP19
  persistent URL:
  - https://dblp.org/rec/conf/aicas/LoWTP19
Yu-Ting Lo, Syu-Siang Wang, Yu Tsao, Sheng-Yu Peng:
A Pruned-CELP Speech Codec Using Denoising Autoencoder with Spectral Compensation for Quality and Intelligibility Enhancement. AICAS 2019: 150-151
[c139]
  dblp key:
  - conf/apsipa/Ye0C19
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Ye0C19
Fuqiang Ye, Yu Tsao, Fei Chen:
Subjective Feedback-based Neural Network Pruning for Speech Enhancement. APSIPA 2019: 673-677
[c138]
  dblp key:
  - conf/apsipa/HussainTWWSL19
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HussainTWWSL19
Tassadaq Hussain, Yu Tsao, Hsin-Min Wang, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao:
Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement. APSIPA 2019: 678-683
[c137]
  dblp key:
  - conf/apsipa/LinTCW19
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LinTCW19
Wei-Cheng Lin, Yu Tsao, Fei Chen, Hsin-Min Wang:
Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature Enhancement. APSIPA 2019: 1179-1184
[c136]
  dblp key:
  - conf/eusipco/HuangWHTHKTTW19
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HuangWHTHKTTW19
Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. EUSIPCO 2019: 1-5
[c135]
  dblp key:
  - conf/eusipco/HussainTWWSL19
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HussainTWWSL19
Tassadaq Hussain, Yu Tsao, Hsin-Min Wang, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao:
Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine. EUSIPCO 2019: 1-5
[c134]
  dblp key:
  - conf/icassp/ShenHWTWC19
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShenHWTWC19
Yih-Liang Shen, Chao-Yuan Huang, Syu-Siang Wang, Yu Tsao, Hsin-Min Wang, Tai-Shih Chi:
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition. ICASSP 2019: 6750-6754
[c133]
  dblp key:
  - conf/icml/FuLTL19
  persistent URL:
  - https://dblp.org/rec/conf/icml/FuLTL19
Szu-Wei Fu, Chien-Feng Liao, Yu Tsao, Shou-De Lin:
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement. ICML 2019: 2031-2041
[c132]
  dblp key:
  - conf/interspeech/HuangWLTHKT0W19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangWLTHKT0W19
Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion. INTERSPEECH 2019: 709-713
[c131]
  dblp key:
  - conf/interspeech/ChenL019
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenL019
Li-Wei Chen, Hung-yi Lee, Yu Tsao:
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech. INTERSPEECH 2019: 719-723
[c130]
  dblp key:
  - conf/interspeech/LoFHWYTW19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoFHWYTW19
Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion. INTERSPEECH 2019: 1541-1545
[c129]
  dblp key:
  - conf/interspeech/HuangLWCTW19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangLWCTW19
Pin-Tuan Huang, Hung-Shin Lee, Syu-Siang Wang, Kuan-Yu Chen, Yu Tsao, Hsin-Min Wang:
Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR. INTERSPEECH 2019: 1631-1635
[c128]
  dblp key:
  - conf/interspeech/LinHF0K19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinHF0K19
Yu-Chen Lin, Yi-Te Hsu, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network. INTERSPEECH 2019: 1801-1805
[c127]
  dblp key:
  - conf/interspeech/Liao0LK19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Liao0LK19
Chien-Feng Liao, Yu Tsao, Xugang Lu, Hisashi Kawai:
Incorporating Symbolic Sequential Modeling for Speech Enhancement. INTERSPEECH 2019: 2733-2737
[c126]
  dblp key:
  - conf/interspeech/Liao0LW19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Liao0LW19
Chien-Feng Liao, Yu Tsao, Hung-yi Lee, Hsin-Min Wang:
Noise Adaptive Speech Enhancement Using Domain Adversarial Training. INTERSPEECH 2019: 3148-3152
[c125]
  dblp key:
  - conf/interspeech/ZezarioFLWT19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZezarioFLWT19
Ryandhimas E. Zezario, Szu-Wei Fu, Xugang Lu, Hsin-Min Wang, Yu Tsao:
Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric. INTERSPEECH 2019: 3168-3172
[c124]
  dblp key:
  - conf/interspeech/ChuangWHTF19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChuangWHTF19
Fu-Kai Chuang, Syu-Siang Wang, Jeih-weih Hung, Yu Tsao, Shih-Hau Fang:
Speaker-Aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement. INTERSPEECH 2019: 3173-3177
[c123]
  dblp key:
  - conf/interspeech/LuS00K19
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuS00K19
Xugang Lu, Peng Shen, Sheng Li, Yu Tsao, Hisashi Kawai:
Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection. INTERSPEECH 2019: 3614-3618
[c122]
  dblp key:
  - conf/ispacs/ZezarioSHWT19
  persistent URL:
  - https://dblp.org/rec/conf/ispacs/ZezarioSHWT19
Ryandhimas E. Zezario, Join W. C. Sigalingging, Tassadaq Hussain, Jia-Ching Wang, Yu Tsao:
Comparative Study of Masking and Mapping Based on Hierarchical Extreme Learning Machine for Speech Enhancement. ISPACS 2019: 1-2
[c121]
  dblp key:
  - conf/iwsds/Hussain0SWWL19
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/Hussain0SWWL19
Tassadaq Hussain, Yu Tsao, Sabato Marco Siniscalchi, Jia-Ching Wang, Hsin-Min Wang, Wen-Hung Liao:
Bone-Conducted Speech Enhancement Using Hierarchical Extreme Learning Machine. IWSDS 2019: 153-162
[c120]
  dblp key:
  - conf/mipr/HidayatiHTS0C19
  persistent URL:
  - https://dblp.org/rec/conf/mipr/HidayatiHTS0C19
Shintami Chusnul Hidayati, Kai-Lung Hua, Yu Tsao, Hong-Han Shuai, Jiaying Liu, Wen-Huang Cheng:
Garment Detectives: Discovering Clothes and Its Genre in Consumer Photos. MIPR 2019: 471-474
[c119]
  dblp key:
  - conf/rocling/LiuW0H19
  persistent URL:
  - https://dblp.org/rec/conf/rocling/LiuW0H19
Kuan-Yi Liu, Syu-Siang Wang, Yu Tsao, Jeih-Weih Hung:
Speech enhancement based on the integration of fully convolutional network, temporal lowpass filtering and spectrogram masking. ROCLING 2019: 226-240
[c118]
  dblp key:
  - conf/ssw/HuangWKPHT0WT19
  persistent URL:
  - https://dblp.org/rec/conf/ssw/HuangWKPHT0WT19
Wen-Chin Huang, Yi-Chiao Wu, Kazuhiro Kobayashi, Yu-Huai Peng, Hsin-Te Hwang, Patrick Lumban Tobing, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion. SSW 2019: 57-62
[i35]
  dblp key:
  - journals/corr/abs-1904-08352
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-08352
Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
MOSNet: Deep Learning based Objective Assessment for Voice Conversion. CoRR abs/1904.08352 (2019)
[i34]
  dblp key:
  - journals/corr/abs-1904-13142
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-13142
Chien-Feng Liao, Yu Tsao, Xugang Lu, Hisashi Kawai:
Incorporating Symbolic Sequential Modeling for Speech Enhancement. CoRR abs/1904.13142 (2019)
[i33]
  dblp key:
  - journals/corr/abs-1905-00615
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-00615
Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion. CoRR abs/1905.00615 (2019)
[i32]
  dblp key:
  - journals/corr/abs-1905-01898
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-01898
Szu-Wei Fu, Chien-Feng Liao, Yu Tsao:
Learning with Learned Loss Function: Speech Enhancement with Quality-Net to Improve Perceptual Evaluation of Speech Quality. CoRR abs/1905.01898 (2019)
[i31]
  dblp key:
  - journals/corr/abs-1905-04874
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-04874
Szu-Wei Fu, Chien-Feng Liao, Yu Tsao, Shou-De Lin:
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement. CoRR abs/1905.04874 (2019)
[i30]
  dblp key:
  - journals/corr/abs-1906-01078
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-01078
Jyun-Yi Wu, Cheng Yu, Szu-Wei Fu, Chih-Ting Liu, Shao-Yi Chien, Yu Tsao:
Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques. CoRR abs/1906.01078 (2019)
[i29]
  dblp key:
  - journals/corr/abs-1909-11909
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11909
Chang-Le Liu, Szu-Wei Fu, You-Jin Lee, Yu Tsao, Jen-Wei Huang, Hsin-Min Wang:
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks. CoRR abs/1909.11909 (2019)
[i28]
  dblp key:
  - journals/corr/abs-1909-11912
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11912
Natalie Yu-Hsien Wang, Hsiao-Lan Sharon Wang, Taowei Wang, Szu-Wei Fu, Xugang Lu, Yu Tsao, Hsin-Min Wang:
Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement. CoRR abs/1909.11912 (2019)
[i27]
  dblp key:
  - journals/corr/abs-1909-11919
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11919
Rung-Yu Tseng, Taowei Wang, Szu-Wei Fu, Yu Tsao, Chia-Ying Lee:
Seeing Voices in Noise: A Study of Audiovisual-Enhanced Vocoded Speech Intelligibility in Cochlear Implant Simulation. CoRR abs/1909.11919 (2019)
[i26]
  dblp key:
  - journals/corr/abs-1911-01601
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-01601
Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019)
[i25]
  dblp key:
  - journals/corr/abs-1911-08153
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-08153
Syu-Siang Wang, Yu-You Liang, Jeih-weih Hung, Yu Tsao, Hsin-Min Wang, Shih-Hau Fang:
Distributed Microphone Speech Enhancement based on Deep Learning. CoRR abs/1911.08153 (2019)
[i24]
dblp key: journals/corr/abs-1911-09847
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-09847
Cheng Yu, Yan-Ting Lin, Kuo-Hsuan Hung, Syu-Siang Wang, Szu-Wei Fu, Yu Tsao, Jeih-weih Hung:
Time-Domain Multi-modal Bone/air Conducted Speech Enhancement. CoRR abs/1911.09847 (2019)
[i23]
dblp key: journals/corr/abs-1912-03884
persistent URL: https://dblp.org/rec/journals/corr/abs-1912-03884
Chao-I Tuan, Yuan-Kuei Wu, Hung-yi Lee, Yu Tsao:
MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing. CoRR abs/1912.03884 (2019)
[i22]
dblp key: journals/corr/abs-1912-11984
persistent URL: https://dblp.org/rec/journals/corr/abs-1912-11984
Yu-Tao Chang, Yuan-Hong Yang, Yu-Huai Peng, Syu-Siang Wang, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang:
MoEVC: A Mixture-of-experts Voice Conversion System with Sparse Gating Mechanism for Accelerating Online Computation. CoRR abs/1912.11984 (2019)
[i21]
dblp key: journals/corr/abs-1912-12011
persistent URL: https://dblp.org/rec/journals/corr/abs-1912-12011
Xugang Lu, Peng Shen, Sheng Li, Yu Tsao, Hisashi Kawai:
Deep progressive multi-scale attention for acoustic event classification. CoRR abs/1912.12011 (2019)
2018
[j34]
dblp key: journals/access/TsaoCFLL18
persistent URL: https://dblp.org/rec/journals/access/TsaoCFLL18
Yu Tsao, Hao-Chun Chu, Shih-Hau Fang, Junghsi Lee, Chih-Min Lin:
Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller. IEEE Access 6: 37395-37402 (2018)
[j33]
dblp key: journals/jise/HwangWWHTWWC18
persistent URL: https://dblp.org/rec/journals/jise/HwangWWHTWWC18
Hsin-Te Hwang, Yi-Chiao Wu, Syu-Siang Wang, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Locally Linear Embedding Based Post-Filtering for Speech Enhancement. J. Inf. Sci. Eng. 34(6): 1469-1491 (2018)
[j32]
dblp key: journals/jise/HwangWPHTWWC18
persistent URL: https://dblp.org/rec/journals/jise/HwangWPHTWWC18
Hsin-Te Hwang, Yi-Chiao Wu, Yu-Huai Peng, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Voice Conversion Based on Locally Linear Embedding. J. Inf. Sci. Eng. 34(6): 1493-1516 (2018)
[j31]
dblp key: journals/sensors/Torres-Sospedra18
persistent URL: https://dblp.org/rec/journals/sensors/Torres-Sospedra18
Joaquín Torres-Sospedra, Antonio Ramón Jiménez, Adriano J. C. Moreira, Tomás Lungenstrass, Wei-Chung Lu, Stefan Knauth, Germán M. Mendoza-Silva, Fernando Seco, Antoni Pérez-Navarro, Maria João Nicolau, António Costa, Filipe Meneses, Joaquín Farina, Juan Pablo Morales, Wen-Chen Lu, Ho-Ti Cheng, Shi-Shen Yang, Shih-Hau Fang, Ying-Ren Chien, Yu Tsao:
Off-Line Evaluation of Mobile-Centric Indoor Positioning Systems: The Experiences from the 2017 IPIN Competition. Sensors 18(2): 487 (2018)
[j30]
dblp key: journals/sj/LinLCTCC18
persistent URL: https://dblp.org/rec/journals/sj/LinLCTCC18
Yu-Cheng Lin, Ying-Hui Lai, Hsiu-Wen Chang, Yu Tsao, Yi-ping Chang, Ronald Y. Chang:
SmartHear: A Smartphone-Based Remote Microphone Hearing Assistive System Using Wireless Technologies. IEEE Syst. J. 12(1): 20-29 (2018)
[j29]
dblp key: journals/speech/LiuTF18
persistent URL: https://dblp.org/rec/journals/speech/LiuTF18
Hung-Ping Liu, Yu Tsao, Chiou-Shann Fuh:
Bone-conducted speech enhancement using deep denoising autoencoder. Speech Commun. 104: 106-112 (2018)
[j28]
dblp key: journals/taslp/WangLTHS18
persistent URL: https://dblp.org/rec/journals/taslp/WangLTHS18
Syu-Siang Wang, Payton Lin, Yu Tsao, Jeih-Weih Hung, Borching Su:
Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(3): 564-579 (2018)
[j27]
dblp key: journals/taslp/FuWTLK18
persistent URL: https://dblp.org/rec/journals/taslp/FuWTLK18
Szu-Wei Fu, Taowei Wang, Yu Tsao, Xugang Lu, Hisashi Kawai:
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1570-1584 (2018)
[j26]
dblp key: journals/tetci/HouWLTCW18
persistent URL: https://dblp.org/rec/journals/tetci/HouWLTCW18
Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao, Hsiu-Wen Chang, Hsin-Min Wang:
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks. IEEE Trans. Emerg. Top. Comput. Intell. 2(2): 117-128 (2018)
[c117]
dblp key: conf/apsipa/ZezarioHLTHW18
persistent URL: https://dblp.org/rec/conf/apsipa/ZezarioHLTHW18
Ryandhimas E. Zezario, Jen-Wei Huang, Xugang Lu, Yu Tsao, Hsin-Te Hwang, Hsin-Min Wang:
Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement. APSIPA 2018: 373-377
[c116]
dblp key: conf/embc/LaiZTFLT18
persistent URL: https://dblp.org/rec/conf/embc/LaiZTFLT18
Ying-Hui Lai, Wei-Zhong Zheng, Shih-Tsang Tang, Shih-Hau Fang, Wen-Huei Liao, Yu Tsao:
Improving the performance of hearing aids in noisy environments based on deep learning technology. EMBC 2018: 404-408
[c115]
dblp key: conf/embc/WangTC18
persistent URL: https://dblp.org/rec/conf/embc/WangTC18
Lei Wang, Yu Tsao, Fei Chen:
Congruent Visual Stimulation Facilitates Auditory Frequency Change Detection: An ERP Study. EMBC 2018: 2446-2449
[c114]
dblp key: conf/icassp/RyantBCCDGKKKKL18
persistent URL: https://dblp.org/rec/conf/icassp/RyantBCCDGKKKKL18
Neville Ryant, Elika Bergelson, Kenneth Church, Alejandrina Cristià, Jun Du, Sriram Ganapathy, Sanjeev Khudanpur, Diana Kowalski, Mahesh Krishnamoorthy, Rajat Kulshreshta, Mark Liberman, Yu-Ding Lu, Matthew Maciejewski, Florian Metze, Ján Profant, Lei Sun, Yu Tsao, Zhou Yu:
Enhancement and Analysis of Conversational Speech: JSALT 2017. ICASSP 2018: 5154-5158
[c113]
dblp key: conf/icassp/SunDGLTLR18
persistent URL: https://dblp.org/rec/conf/icassp/SunDGLTLR18
Lei Sun, Jun Du, Tian Gao, Yu-Ding Lu, Yu Tsao, Chin-Hui Lee, Neville Ryant:
A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions. ICASSP 2018: 5234-5238
[c112]
dblp key: conf/icassp/LeeWCLCT18
persistent URL: https://dblp.org/rec/conf/icassp/LeeWCLCT18
Wei-Jen Lee, Syu-Siang Wang, Fei Chen, Xugang Lu, Shao-Yi Chien, Yu Tsao:
Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm. ICASSP 2018: 5454-5458
[c111]
dblp key: conf/ifuzzy/LinTSH18
persistent URL: https://dblp.org/rec/conf/ifuzzy/LinTSH18
Shang-Chih Lin, Yu Tsao, Shun-Feng Su, Yennun Huang:
An Industrial IoT Analysis System Based on Machining Data of Metal Materials. iFUZZY 2018: 225-230
[c110]
dblp key: conf/interspeech/PengHWTW18
persistent URL: https://dblp.org/rec/conf/interspeech/PengHWTW18
Yu-Huai Peng, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Exemplar-Based Spectral Detail Compensation for Voice Conversion. INTERSPEECH 2018: 486-490
[c109]
dblp key: conf/interspeech/LuSLTK18
persistent URL: https://dblp.org/rec/conf/interspeech/LuSLTK18
Xugang Lu, Peng Shen, Sheng Li, Yu Tsao, Hisashi Kawai:
Temporal Attentive Pooling for Acoustic Event Detection. INTERSPEECH 2018: 1354-1357
[c108]
dblp key: conf/interspeech/FuTHW18
persistent URL: https://dblp.org/rec/conf/interspeech/FuTHW18
Szu-Wei Fu, Yu Tsao, Hsin-Te Hwang, Hsin-Min Wang:
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM. INTERSPEECH 2018: 1873-1877
[c107]
dblp key: conf/iscslp/LeeWTH18
persistent URL: https://dblp.org/rec/conf/iscslp/LeeWTH18
Shih-Kuang Lee, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung:
Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform. ISCSLP 2018: 16-20
[c106]
dblp key: conf/iscslp/HuangHPTW18
persistent URL: https://dblp.org/rec/conf/iscslp/HuangHPTW18
Wen-Chin Huang, Hsin-Te Hwang, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang:
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders. ISCSLP 2018: 51-55
[c105]
dblp key: conf/iscslp/HanZHTL18
persistent URL: https://dblp.org/rec/conf/iscslp/HanZHTL18
Ji-Yan Han, Wei-Zhong Zheng, Ren-Jie Huang, Yu Tsao, Ying-Hui Lai:
Hearing aids APP design based on deep learning technology. ISCSLP 2018: 495-496
[c104]
dblp key: conf/iscslp/LiaoLYLT18
persistent URL: https://dblp.org/rec/conf/iscslp/LiaoLYLT18
Wen-Huei Liao, Pei-Chun Li, Shuenn-Tsong Young, Ying-Hui Lai, Yu Tsao:
IOS-based Ear Scale application for Clinical Audiology and Otology Usage. ISCSLP 2018: 497-498
[c103]
dblp key: conf/iwaenc/KaoHLTYLLLW18
persistent URL: https://dblp.org/rec/conf/iwaenc/KaoHLTYLLLW18
Yi-Ying Kao, Hsiang-Ping Hsu, Chien-Feng Liao, Yu Tsao, Hao-Chun Yang, Jeng-Lin Li, Chi-Chun Lee, Hung-Shin Lee, Hsin-Min Wang:
Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation. IWAENC 2018: 416-420
[c102]
dblp key: conf/rocling/HuangLHTW18
persistent URL: https://dblp.org/rec/conf/rocling/HuangLHTW18
Wen-Chin Huang, Chen-Chou Lo, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang:
WaveNet 聲碼器及其於語音轉換之應用 (WaveNet Vocoder and its Applications in Voice Conversion) [In Chinese]. ROCLING 2018: 96-110
[c101]
dblp key: conf/sips/YuTYCC18
persistent URL: https://dblp.org/rec/conf/sips/YuTYCC18
Bin-Syh Yu, Yu Tsao, Shao-Wen Yang, Yen-Kuang Chen, Shao-Yi Chien:
Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform. SiPS 2018: 88-93
[c100]
dblp key: conf/slt/HsuLFTK18
persistent URL: https://dblp.org/rec/conf/slt/HsuLFTK18
Yi-Te Hsu, Yu-Chen Lin, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
A Study on Speech Enhancement Using Exponent-Only Floating Point Quantized Neural Network (EOFP-QNN). SLT 2018: 566-573
[i20]
dblp key: journals/corr/abs-1801-04052
persistent URL: https://dblp.org/rec/journals/corr/abs-1801-04052
Wei-Jen Lee, Syu-Siang Wang, Fei Chen, Xugang Lu, Shao-Yi Chien, Yu Tsao:
Speech Dereverberation Based on Integrated Deep and Ensemble Learning. CoRR abs/1801.04052 (2018)
[i19]
dblp key: journals/corr/abs-1807-07501
persistent URL: https://dblp.org/rec/journals/corr/abs-1807-07501
Chien-Feng Liao, Yu Tsao, Hung-yi Lee, Hsin-Min Wang:
Noise Adaptive Speech Enhancement using Domain Adversarial Training. CoRR abs/1807.07501 (2018)
[i18]
dblp key: journals/corr/abs-1808-05344
persistent URL: https://dblp.org/rec/journals/corr/abs-1808-05344
Szu-Wei Fu, Yu Tsao, Hsin-Te Hwang, Hsin-Min Wang:
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. CoRR abs/1808.05344 (2018)
[i17]
dblp key: journals/corr/abs-1808-06474
persistent URL: https://dblp.org/rec/journals/corr/abs-1808-06474
Yi-Te Hsu, Yu-Chen Lin, Szu-Wei Fu, Yu Tsao, Tei-Wei Kuo:
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN). CoRR abs/1808.06474 (2018)
[i16]
dblp key: journals/corr/abs-1808-09634
persistent URL: https://dblp.org/rec/journals/corr/abs-1808-09634
Wen-Chin Huang, Hsin-Te Hwang, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang:
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders. CoRR abs/1808.09634 (2018)
[i15]
dblp key: journals/corr/abs-1810-12656
persistent URL: https://dblp.org/rec/journals/corr/abs-1810-12656
Li-Wei Chen, Hung-yi Lee, Yu Tsao:
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech. CoRR abs/1810.12656 (2018)
[i14]
dblp key: journals/corr/abs-1811-03486
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-03486
Shih-Kuang Lee, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung:
Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform. CoRR abs/1811.03486 (2018)
[i13]
dblp key: journals/corr/abs-1811-04224
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-04224
Yih-Liang Shen, Chao-Yuan Huang, Syu-Siang Wang, Yu Tsao, Hsin-Min Wang, Tai-Shih Chi:
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition. CoRR abs/1811.04224 (2018)
[i12]
dblp key: journals/corr/abs-1811-10376
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-10376
Yi-Te Hsu, Zining Zhu, Chi-Te Wang, Shih-Hau Fang, Frank Rudzicz, Yu Tsao:
Robustness against the channel effect in pathological voice detection. CoRR abs/1811.10376 (2018)
[i11]
dblp key: journals/corr/abs-1811-11078
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-11078
Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. CoRR abs/1811.11078 (2018)
2017
[j25]
dblp key: journals/access/ChernLCTCC17
persistent URL: https://dblp.org/rec/journals/access/ChernLCTCC17
Alan Chern, Ying-Hui Lai, Yi-ping Chang, Yu Tsao, Ronald Y. Chang, Hsiu-Wen Chang:
A Smartphone-Based Multi-Functional Hearing Assistive System to Facilitate Speech Recognition in the Classroom. IEEE Access 5: 10339-10351 (2017)
[j24]
dblp key: journals/access/HussainSLWTL17
persistent URL: https://dblp.org/rec/journals/access/HussainSLWTL17
Tassadaq Hussain, Sabato Marco Siniscalchi, Chi-Chun Lee, Syu-Siang Wang, Yu Tsao, Wen-Hung Liao:
Experimental Study on Extreme Learning Machine Applications for Speech Enhancement. IEEE Access 5: 25542-25554 (2017)
[j23]
dblp key: journals/csl/LuSTK17
persistent URL: https://dblp.org/rec/journals/csl/LuSTK17
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Regularization of neural network model with distance metric learning for i-vector based spoken language identification. Comput. Speech Lang. 44: 48-60 (2017)
[j22]
dblp key: journals/csl/LinLCWT17
persistent URL: https://dblp.org/rec/journals/csl/LinLCWT17
Payton Lin, Dau-Cheng Lyu, Fei Chen, Syu-Siang Wang, Yu Tsao:
Multi-style learning with denoising autoencoders for acoustic modeling in the internet of things (IoT). Comput. Speech Lang. 46: 481-495 (2017)
[j21]
dblp key: journals/ijclclp/Li-YouTC17
persistent URL: https://dblp.org/rec/journals/ijclclp/Li-YouTC17
Jin Li-You, Yu Tsao, Ying-Ren Chien:
Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm. Int. J. Comput. Linguistics Chin. Lang. Process. 22(2) (2017)
[j20]
dblp key: journals/ijclclp/WuHLTLW17
persistent URL: https://dblp.org/rec/journals/ijclclp/WuHLTLW17
Chia-Lung Wu, Hsiang-Ping Hsu, Yu-Ding Lu, Yu Tsao, Hung-Shin Lee, Hsin-Min Wang:
A Replay Spoofing Detection System Based on Discriminative Autoencoders. Int. J. Comput. Linguistics Chin. Lang. Process. 22(2) (2017)
[j19]
dblp key: journals/taslp/LeeTWT17
persistent URL: https://dblp.org/rec/journals/taslp/LeeTWT17
Hung-yi Lee, Bo-Hsiang Tseng, Tsung-Hsien Wen, Yu Tsao:
Personalizing Recurrent-Neural-Network-Based Language Model by Social Network. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 519-530 (2017)
[j18]
dblp key: journals/tbe/ChenYHTCCLWTW17
persistent URL: https://dblp.org/rec/journals/tbe/ChenYHTCCLWTW17
Tien-En Chen, Shih-I Yang, Li-Ting Ho, Kun-Hsi Tsai, Yu-Hsuan Chen, Yun-Fan Chang, Ying-Hui Lai, Syu-Siang Wang, Yu Tsao, Chau-Chung Wu:
S1 and S2 Heart Sound Recognition Using Deep Neural Networks. IEEE Trans. Biomed. Eng. 64(2): 372-380 (2017)
[j17]
dblp key: journals/tbe/LaiCWLTL17
persistent URL: https://dblp.org/rec/journals/tbe/LaiCWLTL17
Ying-Hui Lai, Fei Chen, Syu-Siang Wang, Xugang Lu, Yu Tsao, Chin-Hui Lee:
A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation. IEEE Trans. Biomed. Eng. 64(7): 1568-1578 (2017)
[j16]
dblp key: journals/tbe/FuLLYHT17
persistent URL: https://dblp.org/rec/journals/tbe/FuLLYHT17
Szu-Wei Fu, Pei-Chun Li, Ying-Hui Lai, Cheng-Chien Yang, Li-Chun Hsieh, Yu Tsao:
Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery. IEEE Trans. Biomed. Eng. 64(11): 2584-2594 (2017)
[c99]
dblp key: conf/acssc/LanTL17
persistent URL: https://dblp.org/rec/conf/acssc/LanTL17
Shih-Wei Lan, Yu Tsao, Junghsi Lee:
Acoustic echo cancellation using deep cerebellar model articulation controller. ACSSC 2017: 808-811
[c98]
dblp key: conf/apsipa/FuTLK17
persistent URL: https://dblp.org/rec/conf/apsipa/FuTLK17
Szu-Wei Fu, Yu Tsao, Xugang Lu, Hisashi Kawai:
Raw waveform-based speech enhancement by fully convolutional networks. APSIPA 2017: 6-12
[c97]
dblp key: conf/apsipa/PengHWHLTW17
persistent URL: https://dblp.org/rec/conf/apsipa/PengHWHLTW17
Yu-Huai Peng, Chin-Cheng Hsu, Yi-Chiao Wu, Hsin-Te Hwang, Yi-Wen Liu, Yu Tsao, Hsin-Min Wang:
Fast locally linear embedding algorithm for exemplar-based voice conversion. APSIPA 2017: 591-595
[c96]
dblp key: conf/apsipa/WangTWLL17
persistent URL: https://dblp.org/rec/conf/apsipa/WangTWLL17
Syu-Siang Wang, Yu Tsao, Hsiao-Lan Sharon Wang, Ying-Hui Lai, Lieber Po-Hung Li:
A deep learning based noise reduction approach to improve speech intelligibility for cochlear implant recipients in the presence of competing speech noise. APSIPA 2017: 808-812
[c95]
dblp key: conf/cvpr/WuZTYCC17
persistent URL: https://dblp.org/rec/conf/cvpr/WuZTYCC17
Chih-Wei Wu, Meng-Ting Zhong, Yu Tsao, Shao-Wen Yang, Yen-Kuang Chen, Shao-Yi Chien:
Track-Clustering Error Evaluation for Track-Based Multi-camera Tracking System Employing Human Re-identification. CVPR Workshops 2017: 1416-1424
[c94]
dblp key: conf/icassp/LeeLHTWJ17
persistent URL: https://dblp.org/rec/conf/icassp/LeeLHTWJ17
Hung-Shin Lee, Yu-Ding Lu, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Shyh-Kang Jeng:
Discriminative autoencoders for speaker verification. ICASSP 2017: 5375-5379
[c93]
dblp key: conf/icassp/WuHWHLTW17
persistent URL: https://dblp.org/rec/conf/icassp/WuHWHLTW17
Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang, Chin-Cheng Hsu, Ying-Hui Lai, Yu Tsao, Hsin-Min Wang:
A locally linear embedding based postfiltering approach for speech enhancement. ICASSP 2017: 5555-5559
[c92]
dblp key: conf/interspeech/WuHWHLWT17
persistent URL: https://dblp.org/rec/conf/interspeech/WuHWHLWT17
Chia-Lung Wu, Hsiang-Ping Hsu, Syu-Siang Wang, Jeih-Weih Hung, Ying-Hui Lai, Hsin-Min Wang, Yu Tsao:
Wavelet Speech Enhancement Based on Robust Principal Component Analysis. INTERSPEECH 2017: 439-443
[c91]
dblp key: conf/interspeech/WuHWHTW17
persistent URL: https://dblp.org/rec/conf/interspeech/WuHWHTW17
Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang:
A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement. INTERSPEECH 2017: 1953-1957
[c90]
dblp key: conf/interspeech/HsuHWTW17
persistent URL: https://dblp.org/rec/conf/interspeech/HsuHWTW17
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks. INTERSPEECH 2017: 3364-3368
[c89]
dblp key: conf/interspeech/YangLLCTCW17
persistent URL: https://dblp.org/rec/conf/interspeech/YangLLCTCW17
Ming-Han Yang, Hung-Shin Lee, Yu-Ding Lu, Kuan-Yu Chen, Yu Tsao, Berlin Chen, Hsin-Min Wang:
Discriminative Autoencoders for Acoustic Modeling. INTERSPEECH 2017: 3557-3561
[c88]
dblp key: conf/iscas/LinLTC17
persistent URL: https://dblp.org/rec/conf/iscas/LinLTC17
Shih-Ting Lin, Yuan-Hsin Liao, Yu Tsao, Shao-Yi Chien:
Object-based on-line video summarization for internet of video things. ISCAS 2017: 1-4
[c87]
dblp key: conf/mlsp/FuHTL17
persistent URL: https://dblp.org/rec/conf/mlsp/FuHTL17
Szu-Wei Fu, Ting-Yao Hu, Yu Tsao, Xugang Lu:
Complex spectrogram enhancement by convolutional neural network with multi-metrics learning. MLSP 2017: 1-6
[c86]
dblp key: conf/rocling/LeeWTH17
persistent URL: https://dblp.org/rec/conf/rocling/LeeWTH17
Shih-Kuang Lee, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung:
多樣訊雜比之訓練語料於降噪自動編碼器其語音強化功能之初步研究 (A Preliminary Study of Various SNR-level Training Data in the Denoising Auto-encoder (DAE) Technique for Speech Enhancement) [In Chinese]. ROCLING 2017: 101-113
[c85]
dblp key: conf/rocling/LuLTW17
persistent URL: https://dblp.org/rec/conf/rocling/LuLTW17
Yu-Ding Lu, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese]. ROCLING 2017: 114-115
[c84]
dblp key: conf/rocling/Li-YouTC17
persistent URL: https://dblp.org/rec/conf/rocling/Li-YouTC17
Jin Li-You, Yu Tsao, Ying-Ren Chien:
改進的向量空間可適性濾波器用於聲學回聲消除 (Acoustic Echo Cancellation Using an Improved Vector-Space-Based Adaptive Filtering Algorithm) [In Chinese]. ROCLING 2017: 178-182
[c83]
dblp key: conf/rocling/WangLZFTL17
persistent URL: https://dblp.org/rec/conf/rocling/WangLZFTL17
Chi-Te Wang, Feng-Chuan Lin, Wei-Zhong Zheng, Shih-Hau Fang, Yu Tsao, Ying-Hui Lai:
以語音能量特性發展即時語速偵測裝置-前導型研究 (Real-time monitoring device of phonation speed and volume based on speech energy: A pilot study) [In Chinese]. ROCLING 2017: 287-294
[c82]
dblp key: conf/rocling/WangTLHW17
persistent URL: https://dblp.org/rec/conf/rocling/WangTLHW17
Taowei Wang, Yu Tsao, Ying-Hui Lai, Hsiang-Ping Hsu, Chia-Lung Wu:
以軟體為基礎建構語音增強系統使用者介面 (Development of a software-based User-Interface of Speech Enhancement System) [In Chinese]. ROCLING 2017: 323-331
[e2]
dblp key: conf/rocling/2017
persistent URL: https://dblp.org/rec/conf/rocling/2017
Lun-Wei Ku, Yu Tsao:
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, ROCLING 2017, Taipei, Taiwan, November 27-28, 2017. The Association for Computational Linguistics and Chinese Language Processing (ACLCLP) 2017, ISBN 978-986-95769-0-1
[i10]
dblp key: journals/corr/FuTLK17
persistent URL: https://dblp.org/rec/journals/corr/FuTLK17
Szu-Wei Fu, Yu Tsao, Xugang Lu, Hisashi Kawai:
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks. CoRR abs/1703.02205 (2017)
[i9]
dblp key: journals/corr/HouWLLTCW17
persistent URL: https://dblp.org/rec/journals/corr/HouWLLTCW17
Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Jen-Chun Lin, Yu Tsao, Hsiu-Wen Chang, Hsin-Min Wang:
Audio-Visual Speech Enhancement based on Multimodal Deep Convolutional Neural Network. CoRR abs/1703.10893 (2017)
[i8]
dblp key: journals/corr/HsuHWTW17
persistent URL: https://dblp.org/rec/journals/corr/HsuHWTW17
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks. CoRR abs/1704.00849 (2017)
[i7]
dblp key: journals/corr/FuHTL17
persistent URL: https://dblp.org/rec/journals/corr/FuHTL17
Szu-Wei Fu, Ting-Yao Hu, Yu Tsao, Xugang Lu:
Multi-Metrics Learning for Speech Enhancement. CoRR abs/1704.08504 (2017)
[i6]
dblp key: journals/corr/TsaoCLFLL17
persistent URL: https://dblp.org/rec/journals/corr/TsaoCLFLL17
Yu Tsao, Hao-Chun Chu, Shih-Wei Lan, Shih-Hau Fang, Junghsi Lee, Chih-Min Lin:
Adaptive Noise Cancellation Using Deep Cerebellar Model Articulation Controller. CoRR abs/1705.00945 (2017)
[i5]
dblp key: journals/corr/abs-1709-03658
persistent URL: https://dblp.org/rec/journals/corr/abs-1709-03658
Szu-Wei Fu, Yu Tsao, Xugang Lu, Hisashi Kawai:
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks. CoRR abs/1709.03658 (2017)
2016
[j15]
- dblp key: journals/entropy/LinFWLT16
- persistent URL: https://dblp.org/rec/journals/entropy/LinFWLT16
Payton Lin, Szu-Wei Fu, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao:
Maximum Entropy Learning with Deep Belief Networks. Entropy 18(7): 251 (2016)
[j14]
- dblp key: journals/sensors/FangLFCHLT16
- persistent URL: https://dblp.org/rec/journals/sensors/FangLFCHLT16
Shih-Hau Fang, Hao-Hsiang Liao, Yu-Xiang Fei, Kai-Hsiang Chen, Jen-Wei Huang, Yu-Ding Lu, Yu Tsao:
Transportation Modes Classification Using Sensors on Smartphones. Sensors 16(8): 1324 (2016)
[j13]
- dblp key: journals/speech/TsaoL16
- persistent URL: https://dblp.org/rec/journals/speech/TsaoL16
Yu Tsao, Ying-Hui Lai:
Generalized maximum a posteriori spectral amplitude estimation for speech enhancement. Speech Commun. 76: 112-126 (2016)
[j12]
- dblp key: journals/speech/ChenTL16
- persistent URL: https://dblp.org/rec/journals/speech/ChenTL16
Fei Chen, Yu Tsao, Ying-Hui Lai:
Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus. Speech Commun. 81: 120-128 (2016)
[j11]
- dblp key: journals/spl/WangCTHLLS16
- persistent URL: https://dblp.org/rec/journals/spl/WangCTHLLS16
Syu-Siang Wang, Alan Chern, Yu Tsao, Jeih-weih Hung, Xugang Lu, Ying-Hui Lai, Borching Su:
Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization. IEEE Signal Process. Lett. 23(8): 1101-1105 (2016)
[c81]
- dblp key: conf/apsipa/HouWLLTCW16
- persistent URL: https://dblp.org/rec/conf/apsipa/HouWLLTCW16
Jen-Cheng Hou, Syu-Siang Wang, Ying-Hui Lai, Jen-Chun Lin, Yu Tsao, Hsiu-Wen Chang, Hsin-Min Wang:
Audio-visual speech enhancement using deep neural networks. APSIPA 2016: 1-6
[c80]
- dblp key: conf/apsipa/HsuHWTW16
- persistent URL: https://dblp.org/rec/conf/apsipa/HsuHWTW16
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice conversion from non-parallel corpora using variational auto-encoder. APSIPA 2016: 1-6
[c79]
- dblp key: conf/apsipa/TsaiSTW16
- persistent URL: https://dblp.org/rec/conf/apsipa/TsaiSTW16
Yueh-Ting Tsai, Borching Su, Yu Tsao, Syu-Siang Wang:
Adaptive subspace-constrained diagonal loading. APSIPA 2016: 1-4
[c78]
- dblp key: conf/bigmm/WangT16
- persistent URL: https://dblp.org/rec/conf/bigmm/WangT16
Syu-Siang Wang, Yu Tsao:
Temporal Modulation Spectral Restoration for Robust Speech Recognition. BigMM 2016: 481-486
[c77]
- dblp key: conf/biocas/HsiehWLT16
- persistent URL: https://dblp.org/rec/conf/biocas/HsiehWLT16
Yi-Yen Hsieh, Ching-Da Wu, Shey-Shi Lu, Yu Tsao:
A linear regression model with dynamic pulse transit time features for noninvasive blood pressure prediction. BioCAS 2016: 604-607
[c76]
- dblp key: conf/gcce/WuFWWHT16
- persistent URL: https://dblp.org/rec/conf/gcce/WuFWWHT16
Ting-Jia Wu, Shih-Hau Fang, Yong-Bin Wu, Cheng-Tse Wu, Jen-Wei Huang, Yu Tsao:
A study of mobile advertisement recommendation using real big data from AdLocus. GCCE 2016: 1-2
[c75]
- dblp key: conf/icassp/LiuTC16a
- persistent URL: https://dblp.org/rec/conf/icassp/LiuTC16a
Yen-Teh Liu, Yu Tsao, Ronald Y. Chang:
Nonnegative matrix factorization-based frequency lowering technology for Mandarin-speaking hearing aid users. ICASSP 2016: 5905-5909
[c74]
- dblp key: conf/icce-tw/WangY0H16
- persistent URL: https://dblp.org/rec/conf/icce-tw/WangY0H16
Syu-Siang Wang, Jeremy Chiaming Yang, Yu Tsao, Jeih-weih Hung:
Leveraging nonnegative matrix factorization in processing the temporal modulation spectrum for speech enhancement. ICCE-TW 2016: 1-2
[c73]
- dblp key: conf/icce-tw/YangW0H16
- persistent URL: https://dblp.org/rec/conf/icce-tw/YangW0H16
Jeremy Chiaming Yang, Syu-Siang Wang, Yu Tsao, Jeih-Weih Hung:
Speech enhancement via ensemble modeling NMF adaptation. ICCE-TW 2016: 1-2
[c72]
- dblp key: conf/interspeech/WuHHTW16
- persistent URL: https://dblp.org/rec/conf/interspeech/WuHHTW16
Yi-Chiao Wu, Hsin-Te Hwang, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang:
Locally Linear Embedding for Exemplar-Based Spectral Conversion. INTERSPEECH 2016: 1652-1656
[c71]
- dblp key: conf/interspeech/LeeTLWLCHJ16
- persistent URL: https://dblp.org/rec/conf/interspeech/LeeTLWLCHJ16
Hung-Shin Lee, Yu Tsao, Chi-Chun Lee, Hsin-Min Wang, Wei-Cheng Lin, Wei-Chen Chen, Shan-Wen Hsiao, Shyh-Kang Jeng:
Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation. INTERSPEECH 2016: 2031-2035
[c70]
- dblp key: conf/interspeech/LuSTK16
- persistent URL: https://dblp.org/rec/conf/interspeech/LuSTK16
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification. INTERSPEECH 2016: 3216-3220
[c69]
- dblp key: conf/interspeech/FuTL16
- persistent URL: https://dblp.org/rec/conf/interspeech/FuTL16
Szu-Wei Fu, Yu Tsao, Xugang Lu:
SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement. INTERSPEECH 2016: 3768-3772
[c68]
- dblp key: conf/iscslp/HsuHWTW16
- persistent URL: https://dblp.org/rec/conf/iscslp/HsuHWTW16
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Dictionary update for NMF-based voice conversion using an encoder-decoder network. ISCSLP 2016: 1-5
[c67]
- dblp key: conf/iscslp/HsuZWHLT16
- persistent URL: https://dblp.org/rec/conf/iscslp/HsuZWHLT16
Chia-Yung Hsu, Ryandhimas E. Zezario, Jia-Ching Wang, Chin-Wen Ho, Xugang Lu, Yu Tsao:
Incorporating local environment information with ensemble neural networks to robust automatic speech recognition. ISCSLP 2016: 1-5
[c66]
- dblp key: conf/iscslp/LaiWSHFT16
- persistent URL: https://dblp.org/rec/conf/iscslp/LaiWSHFT16
Ying-Hui Lai, Syu-Siang Wang, Yu-Ting Su, Cheng Han-Che, Fan Kang Fu, Yu Tsao:
Improving the performance of speech perception in noisy environment based on an FAME strategy. ISCSLP 2016: 1-5
[c65]
- dblp key: conf/iscslp/LuSTK16
- persistent URL: https://dblp.org/rec/conf/iscslp/LuSTK16
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
A pseudo-task design in multi-task learning deep neural network for speaker recognition. ISCSLP 2016: 1-5
[c64]
- dblp key: conf/mmm/KuCHT16
- persistent URL: https://dblp.org/rec/conf/mmm/KuCHT16
Shih-Yu Ku, Kai-Hsiang Chen, Jen-Wei Huang, Yu Tsao:
Image Retrieval Using Color-Aware Tag on Progressive Image Search and Recommendation System. MMM (2) 2016: 162-173
[e1]
- dblp key: conf/rocling/2016
- persistent URL: https://dblp.org/rec/conf/rocling/2016
Chung-Hsien Wu, Yuen-Hsien Tseng, Hung-Yu Kao, Lun-Wei Ku, Yu Tsao, Shih-Hung Wu:
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, ROCLING 2016, National Cheng Kung University, Tainan, Taiwan, October 6-7, 2016. Association for Computational Linguistics and Chinese Language Processing (ACLCLP), Taiwan 2016, ISBN 978-957-30792-9-3
[i4]
- dblp key: journals/corr/WangCTHLLS16
- persistent URL: https://dblp.org/rec/journals/corr/WangCTHLLS16
Syu-Siang Wang, Alan Chern, Yu Tsao, Jeih-Weih Hung, Xugang Lu, Ying-Hui Lai, Borching Su:
Wavelet speech enhancement based on nonnegative matrix factorization. CoRR abs/1601.02309 (2016)
[i3]
- dblp key: journals/corr/TsaiSTW16
- persistent URL: https://dblp.org/rec/journals/corr/TsaiSTW16
Yueh-Ting Tsai, Borching Su, Yu Tsao, Syu-Siang Wang:
Robust Beamforming Against DoA Mismatch Using Subspace-Constrained Diagonal Loading. CoRR abs/1602.02690 (2016)
[i2]
- dblp key: journals/corr/HsuHWTW16
- persistent URL: https://dblp.org/rec/journals/corr/HsuHWTW16
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network. CoRR abs/1610.03988 (2016)
[i1]
- dblp key: journals/corr/HsuHWTW16a
- persistent URL: https://dblp.org/rec/journals/corr/HsuHWTW16a
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder. CoRR abs/1610.04019 (2016)
2015
[j10]
- dblp key: journals/ieicet/HsuCCT15
- persistent URL: https://dblp.org/rec/journals/ieicet/HsuCCT15
Chung-Chien Hsu, Kah-Meng Cheong, Tai-Shih Chi, Yu Tsao:
Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation. IEICE Trans. Inf. Syst. 98-D(10): 1808-1817 (2015)
[j9]
- dblp key: journals/ieicet/Li-YouCT15
- persistent URL: https://dblp.org/rec/journals/ieicet/Li-YouCT15
Jin Li-You, Ying-Ren Chien, Yu Tsao:
Rapid Converging M-Max Partial Update Least Mean Square Algorithms with New Variable Step-Size Methods. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 98-A(12): 2650-2657 (2015)
[j8]
- dblp key: journals/speech/TsaoLHL15
- persistent URL: https://dblp.org/rec/journals/speech/TsaoLHL15
Yu Tsao, Payton Lin, Ting-Yao Hu, Xugang Lu:
Ensemble environment modeling using affine transform group. Speech Commun. 68: 55-68 (2015)
[j7]
- dblp key: journals/spl/TsaoFS15
- persistent URL: https://dblp.org/rec/journals/spl/TsaoFS15
Yu Tsao, Shih-Hau Fang, Yao Shiao:
Acoustic Echo Cancellation Using a Vector-Space-Based Adaptive Filtering Algorithm. IEEE Signal Process. Lett. 22(3): 351-355 (2015)
[j6]
- dblp key: journals/tvt/FangWT15
- persistent URL: https://dblp.org/rec/journals/tvt/FangWT15
Shih-Hau Fang, Chu-Hsuan Wang, Yu Tsao:
Compensating for Orientation Mismatch in Robust Wi-Fi Localization Using Histogram Equalization. IEEE Trans. Veh. Technol. 64(11): 5210-5220 (2015)
[c63]
- dblp key: conf/apsipa/WangHLTLWS15
- persistent URL: https://dblp.org/rec/conf/apsipa/WangHLTLWS15
Syu-Siang Wang, Hsin-Te Hwang, Ying-Hui Lai, Yu Tsao, Xugang Lu, Hsin-Min Wang, Borching Su:
Improving denoising auto-encoder based speech enhancement with the speech parameter generation algorithm. APSIPA 2015: 365-369
[c62]
- dblp key: conf/apsipa/HwangTWWC15
- persistent URL: https://dblp.org/rec/conf/apsipa/HwangTWWC15
Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
A probabilistic interpretation for artificial neural network-based voice conversion. APSIPA 2015: 552-558
[c61]
- dblp key: conf/globalsip/LinLCT15
- persistent URL: https://dblp.org/rec/conf/globalsip/LinLCT15
Payton Lin, Dau-Cheng Lyu, Yun-Fan Chang, Yu Tsao:
Temporal alignment for deep neural networks. GlobalSIP 2015: 108-112
[c60]
- dblp key: conf/globalsip/LiuCTC15
- persistent URL: https://dblp.org/rec/conf/globalsip/LiuCTC15
Yen-Teh Liu, Ronald Y. Chang, Yu Tsao, Yi-ping Chang:
A new frequency lowering technique for Mandarin-speaking hearing aid users. GlobalSIP 2015: 722-726
[c59]
- dblp key: conf/icassp/ChenLTL15
- persistent URL: https://dblp.org/rec/conf/icassp/ChenLTL15
Wei-Chen Chen, Po-Tsun Lai, Yu Tsao, Chi-Chun Lee:
Multimodal arousal rating using unsupervised fusion technique. ICASSP 2015: 5296-5300
[c58]
- dblp key: conf/icassp/LaiWLT15
- persistent URL: https://dblp.org/rec/conf/icassp/LaiWLT15
Ying-Hui Lai, Syu-Siang Wang, Pei-Chun Li, Yu Tsao:
A discriminative post-filter for speech enhancement in hearing aids. ICASSP 2015: 5868-5872
[c57]
- dblp key: conf/icce-tw/Liu0C15
- persistent URL: https://dblp.org/rec/conf/icce-tw/Liu0C15
Yen-Teh Liu, Yu Tsao, Ronald Y. Chang:
A deep neural network based approach to mandarin consonant/vowel separation. ICCE-TW 2015: 324-325
[c56]
- dblp key: conf/icce-tw/LinWT15
- persistent URL: https://dblp.org/rec/conf/icce-tw/LinWT15
Payton Lin, Syu-Siang Wang, Yu Tsao:
Temporal information in tone recognition. ICCE-TW 2015: 326-327
[c55]
- dblp key: conf/interspeech/LinLCT15
- persistent URL: https://dblp.org/rec/conf/interspeech/LinLCT15
Payton Lin, Dau-Cheng Lyu, Yun-Fan Chang, Yu Tsao:
Speech recognition with temporal neural networks. INTERSPEECH 2015: 21-25
[c54]
- dblp key: conf/interspeech/LuSTHK15
- persistent URL: https://dblp.org/rec/conf/interspeech/LuSTHK15
Xugang Lu, Peng Shen, Yu Tsao, Chiori Hori, Hisashi Kawai:
Sparse representation with temporal max-smoothing for acoustic event detection. INTERSPEECH 2015: 1176-1180
[c53]
- dblp key: conf/rocling/HsuWT15
- persistent URL: https://dblp.org/rec/conf/rocling/HsuWT15
Chia-Yung Hsu, Jia-Ching Wang, Yu Tsao:
類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識 (Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese]. ROCLING 2015
2014
[j5]
- dblp key: journals/csl/TsaoLDHMH14
- persistent URL: https://dblp.org/rec/journals/csl/TsaoLDHMH14
Yu Tsao, Xugang Lu, Paul R. Dixon, Ting-Yao Hu, Shigeki Matsuda, Chiori Hori:
Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation. Comput. Speech Lang. 28(3): 709-726 (2014)
[j4]
- dblp key: journals/ieicet/TsaoHSNL14
- persistent URL: https://dblp.org/rec/journals/ieicet/TsaoHSNL14
Yu Tsao, Ting-Yao Hu, Sakriani Sakti, Satoshi Nakamura, Lin-Shan Lee:
Variable Selection Linear Regression for Robust Speech Recognition. IEICE Trans. Inf. Syst. 97-D(6): 1477-1487 (2014)
[j3]
- dblp key: journals/taslp/TsaoMHKL14
- persistent URL: https://dblp.org/rec/journals/taslp/TsaoMHKL14
Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka, Chin-Hui Lee:
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 22(2): 403-416 (2014)
[c52]
- dblp key: conf/apsipa/ChangLCCZLCWT14
- persistent URL: https://dblp.org/rec/conf/apsipa/ChangLCCZLCWT14
Yun-Fan Chang, Payton Lin, Shao-Hua Cheng, Kai-Hsuan Chan, Yi-Chong Zeng, Chia-Wei Liao, Wen-Tsung Chang, Yu-Chiang Wang, Yu Tsao:
Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system. APSIPA 2014: 1-4
[c51]
- dblp key: conf/icassp/FanHLWT14
- persistent URL: https://dblp.org/rec/conf/icassp/FanHLWT14
Hao-Teng Fan, Jeih-weih Hung, Xugang Lu, Syu-Siang Wang, Yu Tsao:
Speech enhancement using segmental nonnegative matrix factorization. ICASSP 2014: 4483-4487
[c50]
- dblp key: conf/icassp/LuTMH14
- persistent URL: https://dblp.org/rec/conf/icassp/LuTMH14
Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Sparse representation based on a bag of spectral exemplars for acoustic event detection. ICASSP 2014: 6255-6259
[c49]
- dblp key: conf/icdm/JingLLT14
- persistent URL: https://dblp.org/rec/conf/icdm/JingLLT14
How Jing, An-Chun Liang, Shou-De Lin, Yu Tsao:
A Transfer Probabilistic Collective Factorization Model to Handle Sparse Data in Collaborative Filtering. ICDM 2014: 250-259
[c48]
- dblp key: conf/interspeech/JingHLCLTW14
- persistent URL: https://dblp.org/rec/conf/interspeech/JingHLCLTW14
How Jing, Ting-Yao Hu, Hung-Shin Lee, Wei-Chen Chen, Chi-Chun Lee, Yu Tsao, Hsin-Min Wang:
Ensemble of machine learning algorithms for cognitive and physical speaker load detection. INTERSPEECH 2014: 447-451
[c47]
- dblp key: conf/interspeech/LinCWLT14
- persistent URL: https://dblp.org/rec/conf/interspeech/LinCWLT14
Payton Lin, Fei Chen, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao:
Automatic speech recognition with primarily temporal envelope information. INTERSPEECH 2014: 476-480
[c46]
- dblp key: conf/interspeech/LaiCT14
- persistent URL: https://dblp.org/rec/conf/interspeech/LaiCT14
Ying-Hui Lai, Fei Chen, Yu Tsao:
An adaptive envelope compression strategy for speech processing in cochlear implants. INTERSPEECH 2014: 481-484
[c45]
- dblp key: conf/interspeech/LuTMH14
- persistent URL: https://dblp.org/rec/conf/interspeech/LuTMH14
Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Ensemble modeling of denoising autoencoder for speech spectrum restoration. INTERSPEECH 2014: 885-889
[c44]
- dblp key: conf/interspeech/LeeTWJ14
- persistent URL: https://dblp.org/rec/conf/interspeech/LeeTWJ14
Hung-Shin Lee, Yu Tsao, Hsin-Min Wang, Shyh-Kang Jeng:
Clustering-based i-vector formulation for speaker recognition. INTERSPEECH 2014: 1101-1105
[c43]
- dblp key: conf/iscslp/LuTSH14
- persistent URL: https://dblp.org/rec/conf/iscslp/LuTSH14
Xugang Lu, Yu Tsao, Peng Shen, Chiori Hori:
Spectral patch based sparse coding for acoustic event detection. ISCSLP 2014: 317-320
[c42]
- dblp key: conf/iscslp/WangLLTHS14
- persistent URL: https://dblp.org/rec/conf/iscslp/WangLLTHS14
Syu-Siang Wang, Payton Lin, Dau-Cheng Lyu, Yu Tsao, Hsin-Te Hwang, Borching Su:
Acoustic feature conversion using a polynomial based feature transferring algorithm. ISCSLP 2014: 454-458
[c41]
- dblp key: conf/isicir/LaiCT14
- persistent URL: https://dblp.org/rec/conf/isicir/LaiCT14
Ying-Hui Lai, Fei Chen, Yu Tsao:
Effect of adaptive envelope compression in simulated electric hearing in reverberation. ISIC 2014: 204-207
2013
[c40]
- dblp key: conf/apsipa/HwangTWWC13
- persistent URL: https://dblp.org/rec/conf/apsipa/HwangTWWC13
Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Incorporating global variance in the training phase of GMM-based voice conversion. APSIPA 2013: 1-6
[c39]
- dblp key: conf/apsipa/WangKFTKSL13
- persistent URL: https://dblp.org/rec/conf/apsipa/WangKFTKSL13
Chu-Hsuan Wang, Tai-Wei Kao, Shih-Hau Fang, Yu Tsao, Lun-Chia Kuo, Kao Shih-Wei, Nien-Chen Lin:
Robust Wi-Fi location fingerprinting against device diversity based on spatial mean normalization. APSIPA 2013: 1-4
[c38]
- dblp key: conf/icassp/WangTH13
- persistent URL: https://dblp.org/rec/conf/icassp/WangTH13
Syu-Siang Wang, Yu Tsao, Jeih-Weih Hung:
Filtering on the temporal probability sequence in histogram equalization for robust speech recognition. ICASSP 2013: 7112-7116
[c37]
- dblp key: conf/icassp/SuTWJ13
- persistent URL: https://dblp.org/rec/conf/icassp/SuTWJ13
Yu-Cheng Su, Yu Tsao, Jung-En Wu, Fu-Rong Jean:
Speech enhancement using generalized maximum a posteriori spectral amplitude estimator. ICASSP 2013: 7467-7471
[c36]
- dblp key: conf/ijcnlp/JingTCW13
- persistent URL: https://dblp.org/rec/conf/ijcnlp/JingTCW13
How Jing, Yu Tsao, Kuan-Yu Chen, Hsin-Min Wang:
Semantic Naïve Bayes Classifier for Document Classification. IJCNLP 2013: 1117-1123
[c35]
- dblp key: conf/ijcnn/JingT13
- persistent URL: https://dblp.org/rec/conf/ijcnn/JingT13
How Jing, Yu Tsao:
Sparse maximum entropy deep belief nets. IJCNN 2013: 1-6
[c34]
- dblp key: conf/interspeech/LeeHJCTKP13
- persistent URL: https://dblp.org/rec/conf/interspeech/LeeHJCTKP13
Hung-yi Lee, Ting-Yao Hu, How Jing, Yun-Fan Chang, Yu Tsao, Yu-Cheng Kao, Tsang-Long Pao:
Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition. INTERSPEECH 2013: 215-219
[c33]
- dblp key: conf/interspeech/LuTMH13
- persistent URL: https://dblp.org/rec/conf/interspeech/LuTMH13
Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Speech enhancement based on deep denoising autoencoder. INTERSPEECH 2013: 436-440
[c32]
- dblp key: conf/interspeech/WenHLTL13
- persistent URL: https://dblp.org/rec/conf/interspeech/WenHLTL13
Tsung-Hsien Wen, Aaron Heidel, Hung-yi Lee, Yu Tsao, Lin-Shan Lee:
Recurrent neural network based language model personalization by social network crowdsourcing. INTERSPEECH 2013: 2703-2707
[c31]
- dblp key: conf/interspeech/LiTS13
- persistent URL: https://dblp.org/rec/conf/interspeech/LiTS13
Bo Li, Yu Tsao, Khe Chai Sim:
An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition. INTERSPEECH 2013: 3002-3006
[c30]
- dblp key: conf/interspeech/HwangTWWC13
- persistent URL: https://dblp.org/rec/conf/interspeech/HwangTWWC13
Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training. INTERSPEECH 2013: 3062-3066
[c29]
- dblp key: conf/isce/LaiSTY13
- persistent URL: https://dblp.org/rec/conf/isce/LaiSTY13
Ying-Hui Lai, Yu-Cheng Su, Yu Tsao, Shuenn-Tsong Young:
Evaluation of generalized maximum a posteriori spectral amplitude (GMAPA) speech enhancement algorithm in hearing aids. ISCE 2013: 245-246
[c28]
- dblp key: conf/rocling/ChangTCCLC13
- persistent URL: https://dblp.org/rec/conf/rocling/ChangTCCLC13
Yun-Fan Chang, Yu Tsao, Shao-Hua Cheng, Kai-Hsuan Chan, Chia-Wei Liao, Wen-Tsung Chang:
結合I-Vector 及深層神經網路之語者驗證系統 (Text-independent Speaker Verification using a Hybrid I-Vector/DNN Approach) [In Chinese]. ROCLING 2013
2012
[c27]
- dblp key: conf/icassp/TsaoHMHK12
- persistent URL: https://dblp.org/rec/conf/icassp/TsaoHMHK12
Yu Tsao, Chien-Lin Huang, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
A linear projection approach to environment modeling for robust speech recognition. ICASSP 2012: 4329-4332
[c26]
- dblp key: conf/interspeech/HwangTWWC12
- persistent URL: https://dblp.org/rec/conf/interspeech/HwangTWWC12
Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
A Study of Mutual Information for GMM-Based Spectral Conversion. INTERSPEECH 2012: 78-81
[c25]
- dblp key: conf/interspeech/HuTL12
- persistent URL: https://dblp.org/rec/conf/interspeech/HuTL12
Ting-Yao Hu, Yu Tsao, Lin-Shan Lee:
Discriminative Fuzzy Clustering Maximum a Posterior Linear Regression for Speaker Adaptation. INTERSPEECH 2012: 567-570
[c24]
- dblp key: conf/iscslp/HwangTWWC12
- persistent URL: https://dblp.org/rec/conf/iscslp/HwangTWWC12
Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Exploring mutual information for GMM-based spectral conversion. ISCSLP 2012: 50-54
[c23]
- dblp key: conf/iscslp/WangHT12
- persistent URL: https://dblp.org/rec/conf/iscslp/WangHT12
Syu-Siang Wang, Jeih-Weih Hung, Yu Tsao:
A study on cepstral sub-band normalization for robust ASR. ISCSLP 2012: 141-145
[c22]
- dblp key: conf/iscslp/LuTMHK12
- persistent URL: https://dblp.org/rec/conf/iscslp/LuTMHK12
Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Acoustic space partition based on broad phonetic class for ensemble acoustic modeling. ISCSLP 2012: 311-314
2011
[c21]
- dblp key: conf/icassp/TsaoIKN11
- persistent URL: https://dblp.org/rec/conf/icassp/TsaoIKN11
Yu Tsao, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation. ICASSP 2011: 5320-5323
[c20]
- dblp key: conf/icassp/TsaoMSIKN11
- persistent URL: https://dblp.org/rec/conf/icassp/TsaoMSIKN11
Yu Tsao, Shigeki Matsuda, Shinsuke Sakai, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
A sampling-based environment population projection approach for rapid acoustic model adaptation. ICASSP 2011: 5504-5507
[c19]
- dblp key: conf/interspeech/TsaoDHK11
- persistent URL: https://dblp.org/rec/conf/interspeech/TsaoDHK11
Yu Tsao, Paul R. Dixon, Chiori Hori, Hisashi Kawai:
Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition. INTERSPEECH 2011: 2585-2588
2010
[c18]
- dblp key: conf/icassp/TsaoSLL10
- persistent URL: https://dblp.org/rec/conf/icassp/TsaoSLL10
Yu Tsao, Hanwu Sun, Haizhou Li, Chin-Hui Lee:
An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition. ICASSP 2010: 4422-4425
[c17]
- dblp key: conf/interspeech/LiTL10
- persistent URL: https://dblp.org/rec/conf/interspeech/LiTL10
Jinyu Li, Yu Tsao, Chin-Hui Lee:
Shrinkage model adaptation in automatic speech recognition. INTERSPEECH 2010: 1656-1659
[c16]
persistent URL: https://dblp.org/rec/conf/interspeech/MushtaqTH10
Aleem Mushtaq, Yu Tsao, Chin-Hui Lee:
A particle filter feature compensation approach to robust speech recognition. INTERSPEECH 2010: 2054-2057
[c15]
persistent URL: https://dblp.org/rec/conf/iscslp/TsaoIKN10
Yu Tsao, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
An environment structuring framework to facilitating suitable prior density estimation for MAPLR on robust speech recognition. ISCSLP 2010: 29-32

2000 – 2009

2009
[j2]
persistent URL: https://dblp.org/rec/journals/taslp/TsaoL09
Yu Tsao, Chin-Hui Lee:
An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition. IEEE Trans. Speech Audio Process. 17(5): 1025-1037 (2009)
[c14]
persistent URL: https://dblp.org/rec/conf/asru/TsaoMNL09
Yu Tsao, Shigeki Matsuda, Satoshi Nakamura, Chin-Hui Lee:
MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling. ASRU 2009: 271-275
[c13]
persistent URL: https://dblp.org/rec/conf/icassp/TsaoLL09
Yu Tsao, Jinyu Li, Chin-Hui Lee:
Ensemble speaker and speaking environment modeling approach with advanced online estimation process. ICASSP 2009: 3833-3836
[c12]
persistent URL: https://dblp.org/rec/conf/interspeech/MatsudaTLNL09
Shigeki Matsuda, Yu Tsao, Jinyu Li, Satoshi Nakamura, Chin-Hui Lee:
A study on soft margin estimation of linear regression parameters for speaker adaptation. INTERSPEECH 2009: 1603-1606
[c11]
persistent URL: https://dblp.org/rec/conf/iucs/TsaoLLN09
Yu Tsao, Jinyu Li, Chin-Hui Lee, Satoshi Nakamura:
Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling. IUCS 2009: 404-408
2008
[b1]
persistent URL: https://dblp.org/rec/phd/basesearch/Tsao08
Yu Tsao:
An ensemble speaker and speaking environment modeling approach to robust speech recognition. Georgia Institute of Technology, Atlanta, GA, USA, 2008
[c10]
persistent URL: https://dblp.org/rec/conf/icassp/PengTHA08
Sheng-Yu Peng, Yu Tsao, Paul E. Hasler, David V. Anderson:
A programmable analog radial-basis-function based classifier. ICASSP 2008: 1425-1428
[c9]
persistent URL: https://dblp.org/rec/conf/interspeech/TsaoL08
Yu Tsao, Chin-Hui Lee:
Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process. INTERSPEECH 2008: 1265-1268
2007
[c8]
persistent URL: https://dblp.org/rec/conf/asru/TsaoL07
Yu Tsao, Chin-Hui Lee:
Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition. ASRU 2007: 77-80
[c7]
persistent URL: https://dblp.org/rec/conf/interspeech/TsaoL07
Yu Tsao, Chin-Hui Lee:
An ensemble modeling approach to joint characterization of speaker and speaking environments. INTERSPEECH 2007: 1050-1053
[c6]
persistent URL: https://dblp.org/rec/conf/interspeech/BrombergQHLMMMMSTW07
Ilana Bromberg, Qian Qian, Jun Hou, Jinyu Li, Chengyuan Ma, Brett Matthews, Antonio Moreno-Daniel, Jeremy Morris, Sabato Marco Siniscalchi, Yu Tsao, Yu Wang:
Detection-based ASR in the automatic speech attribute transcription project. INTERSPEECH 2007: 1829-1832
2006
[c5]
persistent URL: https://dblp.org/rec/conf/interspeech/MaTL06
Chengyuan Ma, Yu Tsao, Chin-Hui Lee:
A study on detection based automatic speech recognition. INTERSPEECH 2006
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/TsaoL06
Yu Tsao, Chin-Hui Lee:
A vector space approach to environment modeling for robust speech recognition. INTERSPEECH 2006
2005
[j1]
persistent URL: https://dblp.org/rec/journals/taslp/TsaoLL05
Yu Tsao, Shang-Ming Lee, Lin-Shan Lee:
Segmental eigenvoice with delicate eigenspace for improved speaker adaptation. IEEE Trans. Speech Audio Process. 13(3): 399-411 (2005)
[c3]
persistent URL: https://dblp.org/rec/conf/icassp/LiTL05
Jinyu Li, Yu Tsao, Chin-Hui Lee:
A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition. ICASSP (1) 2005: 837-840
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/TsaoLL05
Yu Tsao, Jinyu Li, Chin-Hui Lee:
A study on separation between acoustic models and its applications. INTERSPEECH 2005: 1109-1112
2001
[c1]
persistent URL: https://dblp.org/rec/conf/interspeech/TsaoLCL01
Yu Tsao, Shang-Ming Lee, Fu-Chiang Chou, Lin-Shan Lee:
Segmental eigenvoice for rapid speaker adaptation. INTERSPEECH 2001: 1269-1272