Chao-Han Huck Yang
2020 – today
- 2024
- [b1]Chao-Han Huck Yang:
A Perturbation Approach to Differential Privacy for Deep Learning based Speech Processing. Georgia Institute of Technology, Atlanta, GA, USA, 2024 - [c61]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, EngSiong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. ACL (1) 2024: 74-90 - [c60]Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu:
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification. ICASSP Workshops 2024: 655-659 - [c59]Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-Yi Lee, Ivan Bulyko:
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue. ICASSP 2024: 10316-10320 - [c58]Pin-Jui Ku, I-Fan Chen, Chao-Han Huck Yang, Anirudh Raju, Pranav Dheram, Pegah Ghahremani, Brian King, Jing Liu, Roger Ren, Phani Sankar Nidadavolu:
Hot-Fixing Wake Word Recognition for End-to-End ASR Via Neural Model Reprogramming. ICASSP 2024: 10816-10820 - [c57]Kevin Everson, Yile Gu, Chao-Han Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-Yi Lee, Ariya Rastrow, Andreas Stolcke:
Towards ASR Robust Spoken Language Understanding Through in-Context Learning with Word Confusion Networks. ICASSP 2024: 12856-12860 - [c56]Xianyan Fu, Xiao-Lei Zhang, Chao-Han Huck Yang, Jun Qi:
Exploiting A Quantum Multiple Kernel Learning Approach For Low-Resource Spoken Command Recognition. ICASSP 2024: 12931-12935 - [c55]Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang:
Can Whisper Perform Speech-Based In-Context Learning? ICASSP 2024: 13421-13425 - [c54]Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Engsiong Chng, Chao-Han Huck Yang:
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. ICLR 2024 - [c53]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, Engsiong Chng:
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. ICLR 2024 - [i73]Kevin Everson, Yile Gu, Chao-Han Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-yi Lee, Ariya Rastrow, Andreas Stolcke:
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks. CoRR abs/2401.02921 (2024) - [i72]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, Eng Siong Chng:
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. CoRR abs/2401.10446 (2024) - [i71]Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth Gurunath Shivakumar, Ankur Gandhe, Ariya Rastrow, Jia Xu, Ivan Bulyko, Andreas Stolcke:
Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition. CoRR abs/2401.10447 (2024) - [i70]Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng, Chao-Han Huck Yang:
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. CoRR abs/2402.05457 (2024) - [i69]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. CoRR abs/2402.06894 (2024) - [i68]Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang:
Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities. CoRR abs/2404.14716 (2024) - [i67]Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao:
An Investigation of Incorporating Mamba for Speech Enhancement. CoRR abs/2405.06573 (2024) - [i66]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang:
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models. CoRR abs/2405.14161 (2024) - [i65]Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima:
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment. CoRR abs/2406.13912 (2024) - [i64]Rithik Sachdev, Zhong-Qiu Wang, Chao-Han Huck Yang:
Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction. CoRR abs/2407.16370 (2024) - [i63]Yuka Ko, Sheng Li, Chao-Han Huck Yang, Tatsuya Kawahara:
Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction. CoRR abs/2408.16180 (2024)
- 2023
- [j3]Jun Qi, Chao-Han Huck Yang, Pin-Yu Chen, Javier Tejedor:
Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing. IEEE ACM Trans. Audio Speech Lang. Process. 31: 633-642 (2023) - [c52]Chang Wang, Jun Du, Hang Chen, Ruoyu Wang, Chao-Han Huck Yang, Jiangjiang Zhao, Yuling Ren, Qinglong Li, Chin-Hui Lee:
Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition. APSIPA ASC 2023: 635-642 - [c51]Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke:
Generative Speech Recognition Error Correction With Large Language Models and Task-Activating Prompting. ASRU 2023: 1-8 - [c50]Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko:
Low-Rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. ASRU 2023: 1-8 - [c49]Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring:
Causalainer: Causal Explainer for Automatic Video Summarization. CVPR Workshops 2023: 2630-2636 - [c48]Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper Tegnér:
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition. EMNLP 2023: 10007-10016 - [c47]Jhih-Cing Huang, Yu-Lin Tsai, Chao-Han Huck Yang, Cheng-Fang Su, Chia-Mu Yu, Pin-Yu Chen, Sy-Yen Kuo:
Certified Robustness of Quantum Classifiers Against Adversarial Examples Through Quantum Noise. ICASSP 2023: 1-5 - [c46]Yun-Ning Hung, Chao-Han Huck Yang, Pin-Yu Chen, Alexander Lerch:
Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming. ICASSP 2023: 1-5 - [c45]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman:
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. ICASSP 2023: 1-5 - [c44]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. ICASSP 2023: 1-5 - [c43]Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shuo-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath:
How to Estimate Model Transferability of Pre-Trained Speech Models? INTERSPEECH 2023: 456-460 - [c42]Chun-Wei Ho, Chao-Han Huck Yang, Sabato Marco Siniscalchi:
Differentially Private Adapters for Parameter Efficient Acoustic Modeling. INTERSPEECH 2023: 839-843 - [c41]Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegnér:
A Parameter-Efficient Learning Approach to Arabic Dialect Identification with Pre-Trained General-Purpose Speech Model. INTERSPEECH 2023: 1958-1962 - [c40]Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models. INTERSPEECH 2023: 2453-2457 - [c39]Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao:
Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition. INTERSPEECH 2023: 3317-3321 - [c38]Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Modeling Approach to Efficient Speech Separation. INTERSPEECH 2023: 3784-3788 - [c37]Li-Jen Yang, Chao-Han Huck Yang, Jen-Tzung Chien:
Parameter-Efficient Learning for Text-to-Speech Accent Adaptation. INTERSPEECH 2023: 4354-4358 - [c36]Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao:
Inference and Denoise: Causal Inference-Based Neural Speech Enhancement. MLSP 2023: 1-6 - [c35]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Chng Eng Siong:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. NeurIPS 2023 - [c34]Chao-Han Huck Yang, Zhengling Qi, Yifan Cui, Pin-Yu Chen:
Pessimistic Model Selection for Offline Deep Reinforcement Learning. UAI 2023: 2379-2389 - [c33]Chao-Han Huck Yang, I-Te Danny Hung, Yi-Chieh Liu, Pin-Yu Chen:
Treatment Learning Causal Transformer for Noisy Image Classification. WACV 2023: 6128-6139 - [i62]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman:
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. CoRR abs/2301.07851 (2023) - [i61]Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring:
Causalainer: Causal Explainer for Automatic Video Summarization. CoRR abs/2305.00455 (2023) - [i60]Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Narsis Aftab Kiani, David Gomez-Cabrero, Jesper N. Tegnér:
A Parameter-Efficient Learning Approach to Arabic Dialect Identification with Pre-Trained General-Purpose Speech Model. CoRR abs/2305.11244 (2023) - [i59]Li-Jen Yang, Chao-Han Huck Yang, Jen-Tzung Chien:
Parameter-Efficient Learning for Text-to-Speech Accent Adaptation. CoRR abs/2305.11320 (2023) - [i58]Chun-Wei Ho, Chao-Han Huck Yang, Sabato Marco Siniscalchi:
Differentially Private Adapters for Parameter Efficient Acoustic Modeling. CoRR abs/2305.11360 (2023) - [i57]Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Model Approach to Efficient Speech Separation. CoRR abs/2305.16932 (2023) - [i56]Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models. CoRR abs/2306.00331 (2023) - [i55]Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shuo-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath:
How to Estimate Model Transferability of Pre-Trained Speech Models? CoRR abs/2306.01015 (2023) - [i54]Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Andrew Brown, Marcel Worring:
Causal Video Summarizer for Video Exploration. CoRR abs/2307.01947 (2023) - [i53]Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang:
Can Whisper perform speech-based in-context learning. CoRR abs/2309.07081 (2023) - [i52]Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko:
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. CoRR abs/2309.15223 (2023) - [i51]Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke:
Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting. CoRR abs/2309.15649 (2023) - [i50]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. CoRR abs/2309.15701 (2023) - [i49]Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegnér:
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition. CoRR abs/2310.06434 (2023) - [i48]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng:
Generative error correction for code-switching speech recognition using large language models. CoRR abs/2310.13013 (2023) - [i47]Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring:
Conditional Modeling Based Automatic Video Summarization. CoRR abs/2311.12159 (2023) - [i46]Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu:
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification. CoRR abs/2312.14378 (2023) - [i45]Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko:
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue. CoRR abs/2312.15316 (2023)
- 2022
- [c32]Chao-Han Huck Yang, I-Te Danny Hung, Yi Ouyang, Pin-Yu Chen:
Training a Resilient Q-network against Observational Interference. AAAI 2022: 8814-8822 - [c31]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. ICASSP 2022: 4041-4045 - [c30]Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko:
Mitigating Closed-Model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition. ICASSP 2022: 6302-6306 - [c29]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. ICASSP 2022: 7572-7576 - [c28]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao, Pin-Yu Chen:
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing. ICASSP 2022: 8602-8606 - [c27]Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Andrew Brown, Marcel Worring:
Causal Video Summarizer for Video Exploration. ICME 2022: 1-6 - [c26]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. ISCSLP 2022: 1-5 - [c25]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification. ISCSLP 2022: 453-457 - [c24]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. SLT 2022: 1074-1080 - [c23]Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, Zenglin Shi, I-Hung Lin, Jesper Tegnér, Marcel Worring:
Non-local Attention Improves Description Generation for Retinal Images. WACV 2022: 3250-3259 - [i44]Hengshun Zhou, Jun Du, Chao-Han Huck Yang, Shifu Xiong, Chin-Hui Lee:
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning. CoRR abs/2202.08509 (2022) - [i43]Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko:
Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition. CoRR abs/2202.08532 (2022) - [i42]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Yu Tsao, Pin-Yu Chen:
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing. CoRR abs/2203.03550 (2022) - [i41]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification. CoRR abs/2203.04114 (2022) - [i40]Jun Qi, Chao-Han Huck Yang, Pin-Yu Chen, Javier Tejedor:
Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing. CoRR abs/2203.06031 (2022) - [i39]Chao-Han Huck Yang, I-Te Danny Hung, Yi-Chieh Liu, Pin-Yu Chen:
Treatment Learning Transformer for Noisy Image Classification. CoRR abs/2203.15529 (2022) - [i38]Jun Qi, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hsiu Hsieh:
Theoretical Error Performance Analysis for Variational Quantum Circuit Based Functional Regression. CoRR abs/2206.04804 (2022) - [i37]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. CoRR abs/2210.05614 (2022) - [i36]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. CoRR abs/2210.06382 (2022) - [i35]Jhih-Cing Huang, Yu-Lin Tsai, Chao-Han Huck Yang, Cheng-Fang Su, Chia-Mu Yu, Pin-Yu Chen, Sy-Yen Kuo:
Certified Robustness of Quantum Classifiers against Adversarial Examples through Quantum Noise. CoRR abs/2211.00887 (2022) - [i34]Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao:
Inference and Denoise: Causal Inference-based Neural Speech Enhancement. CoRR abs/2211.01189 (2022) - [i33]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. CoRR abs/2211.01263 (2022) - [i32]Yun-Ning Hung, Chao-Han Huck Yang, Pin-Yu Chen, Alexander Lerch:
Low-Resource Music Genre Classification with Advanced Neural Model Reprogramming. CoRR abs/2211.01317 (2022)
- 2021
- [j2]Hongzhuang Wu, Xiaoli Ma, Chao-Han Huck Yang, Songyong Liu:
Attention Based Bidirectional Convolutional LSTM for High-Resolution Radio Tomographic Imaging. IEEE Trans. Circuits Syst. II Express Briefs 68(4): 1482-1486 (2021) - [c22]Chao-Han Huck Yang, Linda Liu, Ankur Gandhe, Yile Gu, Anirudh Raju, Denis Filimonov, Ivan Bulyko:
Multi-Task Language Modeling for Improving Speech Recognition of Rare Words. ASRU 2021: 1087-1093 - [c21]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. ICASSP 2021: 845-849 - [c20]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. ICASSP 2021: 6523-6527 - [c19]Chao-Han Huck Yang, Mohit Chhabra, Yi-Chieh Liu, Quan Kong, Tomoaki Yoshinaga, Tomokazu Murakami:
Robust Unsupervised Multi-Object Tracking In Noisy Environments. ICIP 2021: 2239-2243 - [c18]Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, Marcel Worring:
Deep Context-Encoding Network For Retinal Image Captioning. ICIP 2021: 3762-3766 - [c17]Chao-Han Huck Yang, Yun-Yun Tsai, Pin-Yu Chen:
Voice2Series: Reprogramming Acoustic Models for Time Series Classification. ICML 2021: 11808-11819 - [c16]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. Interspeech 2021: 881-885 - [c15]Jia-Hong Huang, Chao-Han Huck Yang, Fangyu Liu, Meng Tian, Yi-Chieh Liu, Ting-Wei Wu, I-Hung Lin, Kang Wang, Hiromasa Morikawa, Hernghua Chang, Jesper Tegnér, Marcel Worring:
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation. WACV 2021: 2441-2451 - [i31]Chao-Han Huck Yang, I-Te Danny Hung, Yi Ouyang, Pin-Yu Chen:
Causal Inference Q-Network: Toward Resilient Reinforcement Learning. CoRR abs/2102.09677 (2021) - [i30]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. CoRR abs/2104.01271 (2021) - [i29]Chao-Han Huck Yang, Mohit Chhabra, Yi-Chieh Liu, Quan Kong, Tomoaki Yoshinaga, Tomokazu Murakami:
Robust Unsupervised Multi-Object Tracking in Noisy Environments. CoRR abs/2105.10005 (2021) - [i28]Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, Marcel Worring:
Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning". CoRR abs/2105.14538 (2021) - [i27]Chao-Han Huck Yang, Yun-Yun Tsai, Pin-Yu Chen:
Voice2Series: Reprogramming Acoustic Models for Time Series Classification. CoRR abs/2106.09296 (2021) - [i26]Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Qing Wang, Yuyang Wang, Xianjun Xia, Yuanjun Zhao, Yuzhong Wu, Yannan Wang, Jun Du, Chin-Hui Lee:
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification. CoRR abs/2107.01461 (2021) - [i25]Jun Qi, Chao-Han Huck Yang, Pin-Yu Chen:
QTN-VQC: An End-to-End Learning framework for Quantum Neural Networks. CoRR abs/2110.03861 (2021) - [i24]Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao:
A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming. CoRR abs/2110.03894 (2021) - [i23]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. CoRR abs/2110.08598 (2021) - [i22]Chao-Han Huck Yang, Zhengling Qi, Yifan Cui, Pin-Yu Chen:
Pessimistic Model Selection for Offline Deep Reinforcement Learning. CoRR abs/2111.14346 (2021)
- 2020
- [j1]Samuel Yen-Chi Chen, Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Hsi-Sheng Goan:
Variational Quantum Circuits for Deep Reinforcement Learning. IEEE Access 8: 141007-141024 (2020) - [c14]Haoling Zhang, Chao-Han Huck Yang, Hector Zenil, Narsis Aftab Kiani, Yue Shen, Jesper N. Tegnér:
Evolving Neural Networks through a Reverse Encoding Tree. CEC 2020: 1-10 - [c13]Hongzhuang Wu, Xiaoli Ma, Chao-Han Huck Yang, Songyong Liu:
Convolutional Neural Network Based Radio Tomographic Imaging. CISS 2020: 1-6 - [c12]Yi-Chieh Liu, Yung-An Hsieh, Min-Hung Chen, Chao-Han Huck Yang, Jesper Tegnér, Yi-Chang James Tsai:
Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding. ICASSP 2020: 2338-2342 - [c11]Hao-Hsiang Yang, Chao-Han Huck Yang, Yi-Chang James Tsai:
Y-Net: Multi-Scale Feature Aggregation Network With Wavelet Structure Similarity Loss Function For Single Image Dehazing. ICASSP 2020: 2628-2632 - [c10]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee:
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement. ICASSP 2020: 3107-3111 - [c9]Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yi Ouyang,