


Остановите войну!
for scientists:


default search action
Sabato Marco Siniscalchi
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j31]Mohammad Adiban, Sabato Marco Siniscalchi, Giampiero Salvi:
A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity. Neurocomputing 537: 296-308 (2023) - [c79]Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg
, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition. ICASSP 2023: 1-5 - [c78]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. ICASSP 2023: 1-5 - [c77]Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao:
Inference and Denoise: Causal Inference-Based Neural Speech Enhancement. MLSP 2023: 1-6 - [i30]Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition. CoRR abs/2303.06326 (2023) - [i29]Chun-Wei Ho, Chao-Han Huck Yang, Sabato Marco Siniscalchi:
Differentially Private Adapters for Parameter Efficient Acoustic Modeling. CoRR abs/2305.11360 (2023) - [i28]Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models. CoRR abs/2306.00331 (2023) - [i27]Nicole Dalia Cilia, Claudio De Stefano, Francesco Fontanella, Sabato Marco Siniscalchi:
How word semantics and phonology affect handwriting of Alzheimer's patients: a machine learning based analysis. CoRR abs/2307.04762 (2023) - [i26]Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. CoRR abs/2307.06701 (2023) - [i25]Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao:
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction. CoRR abs/2309.08348 (2023) - [i24]Hao Yen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints. CoRR abs/2309.08828 (2023) - [i23]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. CoRR abs/2309.15701 (2023) - [i22]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng:
Generative error correction for code-switching speech recognition using large language models. CoRR abs/2310.13013 (2023) - 2022
- [j30]Abdolreza Sabzi Shahrebabaki
, Giampiero Salvi
, Torbjørn Svendsen, Sabato Marco Siniscalchi
:
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models. IEEE ACM Trans. Audio Speech Lang. Process. 30: 135-147 (2022) - [c76]Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation. BMVC 2022: 636 - [c75]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. ICASSP 2022: 4041-4045 - [c74]Hang Chen, Hengshun Zhou, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe
, Sabato Marco Siniscalchi, Odette Scharenborg
, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results. ICASSP 2022: 9266-9270 - [c73]Hengshun Zhou, Jun Du, Gongzhen Zou, Zhaoxu Nian, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe
, Odette Scharenborg, Jingdong Chen, Shifu Xiong, Jianqing Gao:
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1111-1115 - [c72]Hang Chen, Jun Du, Yusheng Dai, Chin-Hui Lee, Sabato Marco Siniscalchi, Shinji Watanabe
, Odette Scharenborg, Jingdong Chen, Baocai Yin, Jia Pan:
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis. INTERSPEECH 2022: 1766-1770 - [c71]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. ISCSLP 2022: 1-5 - [c70]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification. ISCSLP 2022: 453-457 - [c69]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. SLT 2022: 1074-1080 - [i21]Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification. CoRR abs/2203.04114 (2022) - [i20]Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation. CoRR abs/2208.04554 (2022) - [i19]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. CoRR abs/2210.05614 (2022) - [i18]Chao-Han Huck Yang, Jun Qi, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition. CoRR abs/2210.06382 (2022) - [i17]Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco Siniscalchi, Yu Tsao:
Inference and Denoise: Causal Inference-based Neural Speech Enhancement. CoRR abs/2211.01189 (2022) - [i16]Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. CoRR abs/2211.01263 (2022) - 2021
- [j29]Vincenzo Conti
, Leonardo Rundo
, Carmelo Militello
, Valerio Mario Salerno, Salvatore Vitabile, Sabato Marco Siniscalchi:
A multimodal retina-iris biometric system using the Levenshtein distance for spatial feature comparison. IET Biom. 10(1): 44-64 (2021) - [j28]Sabato Marco Siniscalchi
:
Vector-to-Vector Regression via Distributional Loss for Speech Enhancement. IEEE Signal Process. Lett. 28: 254-258 (2021) - [c68]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. ICASSP 2021: 845-849 - [c67]Abdolreza Sabzi Shahrebabaki
, Negar Olfati, Ali Shariq Imran, Magne Hallstein Johnsen, Sabato Marco Siniscalchi, Torbjørn Svendsen:
A Two-Stage Deep Modeling Approach to Articulatory Inversion. ICASSP 2021: 6453-6457 - [c66]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. ICASSP 2021: 6523-6527 - [c65]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. Interspeech 2021: 881-885 - [c64]Abdolreza Sabzi Shahrebabaki
, Sabato Marco Siniscalchi, Torbjørn Svendsen:
Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation. Interspeech 2021: 1184-1188 - [c63]Abdolreza Sabzi Shahrebabaki
, Sabato Marco Siniscalchi, Giampiero Salvi
, Torbjørn Svendsen:
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion. ISCAS 2021: 1-5 - [e1]Erik Marchi, Sabato Marco Siniscalchi, Sandro Cumani, Valerio Mario Salerno, Haizhou Li:
Increasing Naturalness and Flexibility in Spoken Dialogue Interaction - 10th International Workshop on Spoken Dialogue Systems, IWSDS 2019, Syracuse, Sicily, Italy, 24-26 April 2019. Lecture Notes in Electrical Engineering 714, Springer 2021, ISBN 978-981-15-9322-2 [contents] - [i15]Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification. CoRR abs/2104.01271 (2021) - [i14]Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Qing Wang, Yuyang Wang, Xianjun Xia, Yuanjun Zhao, Yuzhong Wu, Yannan Wang, Jun Du, Chin-Hui Lee:
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification. CoRR abs/2107.01461 (2021) - [i13]Zhen Huang, Xiaodan Zhuang, Daben Liu, Xiaoqiang Xiao, Yuchen Zhang, Sabato Marco Siniscalchi:
Exploring Retraining-Free Speech Recognition for Intra-sentential Code-Switching. CoRR abs/2109.00921 (2021) - [i12]Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Yu Tsao:
A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming. CoRR abs/2110.03894 (2021) - [i11]Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. CoRR abs/2110.08598 (2021) - 2020
- [j27]Jun Qi
, Jun Du
, Sabato Marco Siniscalchi
, Xiaoli Ma
, Chin-Hui Lee
:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. IEEE Signal Process. Lett. 27: 1485-1489 (2020) - [j26]Tassadaq Hussain
, Sabato Marco Siniscalchi
, Hsiao-Lan Sharon Wang
, Yu Tsao
, Valerio Mario Salerno
, Wen-Hung Liao
:
Ensemble Hierarchical Extreme Learning Machine for Speech Dereverberation. IEEE Trans. Cogn. Dev. Syst. 12(4): 744-758 (2020) - [j25]Ivan Kukanov
, Trung Ngo Trong, Ville Hautamäki
, Sabato Marco Siniscalchi
, Valerio Mario Salerno, Kong Aik Lee
:
Maximal Figure-of-Merit Framework to Detect Multi-Label Phonetic Features for Spoken Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 682-695 (2020) - [j24]Jun Qi
, Jun Du
, Sabato Marco Siniscalchi
, Xiaoli Ma
, Chin-Hui Lee
:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression. IEEE Trans. Signal Process. 68: 3411-3422 (2020) - [c62]Jun Qi, Xiaoli Ma, Chin-Hui Lee, Jun Du, Sabato Marco Siniscalchi:
Performance Analysis for Tensor-Train Decomposition to Deep Neural Network Based Vector-to-Vector Regression. CISS 2020: 1-6 - [c61]Sicheng Wang, Wei Li, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Cross-Task Transfer Learning Approach to Adapting Deep Speech Enhancement Models to Unseen Background Noise Using Paired Senone Classifiers. ICASSP 2020: 6219-6223 - [c60]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network. ICASSP 2020: 7504-7508 - [c59]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. INTERSPEECH 2020: 76-80 - [c58]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. INTERSPEECH 2020: 1196-1200 - [c57]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. INTERSPEECH 2020: 1201-1205 - [c56]Abdolreza Sabzi Shahrebabaki
, Negar Olfati, Sabato Marco Siniscalchi, Giampiero Salvi
, Torbjørn Svendsen:
Transfer Learning of Articulatory Information Through Phone Information. INTERSPEECH 2020: 2877-2881 - [c55]Abdolreza Sabzi Shahrebabaki
, Sabato Marco Siniscalchi, Giampiero Salvi
, Torbjørn Svendsen:
Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals. INTERSPEECH 2020: 2882-2886 - [i10]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network. CoRR abs/2002.00544 (2020) - [i9]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation. CoRR abs/2007.08389 (2020) - [i8]Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement. CoRR abs/2007.13024 (2020) - [i7]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Xue Bai, Jun Du, Chin-Hui Lee:
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances. CoRR abs/2008.00107 (2020) - [i6]Hu Hu, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification. CoRR abs/2008.00110 (2020) - [i5]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.05459 (2020) - [i4]Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression. CoRR abs/2008.07281 (2020) - [i3]Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee:
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition. CoRR abs/2010.13309 (2020) - [i2]Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. CoRR abs/2011.01447 (2020)
2010 – 2019
- 2019
- [j23]Jun Qi
, Jun Du
, Sabato Marco Siniscalchi
, Chin-Hui Lee
:
A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1932-1943 (2019) - [j22]Wei Li
, Nancy F. Chen
, Sabato Marco Siniscalchi
, Chin-Hui Lee:
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2012-2024 (2019) - [c54]Tassadaq Hussain
, Yu Tsao
, Hsin-Min Wang
, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao:
Compressed Multimodal Hierarchical Extreme Learning Machine for Speech Enhancement. APSIPA 2019: 678-683 - [c53]Tassadaq Hussain
, Yu Tsao
, Hsin-Min Wang
, Jia-Ching Wang, Sabato Marco Siniscalchi, Wen-Hung Liao:
Audio-Visual Speech Enhancement using Hierarchical Extreme Learning Machine. EUSIPCO 2019: 1-5 - [c52]Zhen Huang, Xiaodan Zhuang, Daben Liu, Xiaoqiang Xiao, Yuchen Zhang, Sabato Marco Siniscalchi:
Exploring Retraining-free Speech Recognition for Intra-sentential Code-switching. ICASSP 2019: 6066-6070 - [c51]Wei Li, Sicheng Wang, Ming Lei, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Audio-visual Speech Recognition Performance with Cross-modal Student-teacher Training. ICASSP 2019: 6560-6564 - [c50]Abdolreza Sabzi Shahrebabaki
, Negar Olfati, Ali Shariq Imran, Sabato Marco Siniscalchi, Torbjørn Svendsen:
A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion. INTERSPEECH 2019: 3775-3779 - [c49]Tassadaq Hussain
, Yu Tsao, Sabato Marco Siniscalchi, Jia-Ching Wang, Hsin-Min Wang
, Wen-Hung Liao:
Bone-Conducted Speech Enhancement Using Hierarchical Extreme Learning Machine. IWSDS 2019: 153-162 - 2018
- [j21]Ju Lin, Wei Li, Yingming Gao, Yanlu Xie, Nancy F. Chen, Sabato Marco Siniscalchi, Jinsong Zhang
, Chin-Hui Lee:
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks. J. Signal Process. Syst. 90(7): 1077-1087 (2018) - [c48]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models. ICASSP 2018: 6249-6253 - 2017
- [j20]Tassadaq Hussain
, Sabato Marco Siniscalchi, Chi-Chun Lee, Syu-Siang Wang
, Yu Tsao
, Wen-Hung Liao:
Experimental Study on Extreme Learning Machine Applications for Speech Enhancement. IEEE Access 5: 25542-25554 (2017) - [j19]Bo Wu, Minglei Yang
, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Tong Wang, Chin-Hui Lee:
A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation. EURASIP J. Adv. Signal Process. 2017: 81 (2017) - [j18]Bo Wu
, Kehuang Li, Fengpei Ge
, Zhen Huang
, Minglei Yang
, Sabato Marco Siniscalchi
, Chin-Hui Lee:
An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition. IEEE J. Sel. Top. Signal Process. 11(8): 1289-1300 (2017) - [j17]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation. Pattern Recognit. Lett. 98: 1-7 (2017) - [j16]Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 60-71 (2017) - [j15]Sabato Marco Siniscalchi
, Valerio Mario Salerno
:
Adaptation to New Microphones Using Artificial Neural Networks With Trainable Activation Functions. IEEE Trans. Neural Networks Learn. Syst. 28(8): 1959-1965 (2017) - [c47]Bo Wu, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Minglei Yang, Chin-Hui Lee:
A unified deep modeling approach to simultaneous speech dereverberation and recognition for the reverb challenge. HSCMA 2017: 36-40 - [c46]Sicheng Wang, Kehuang Li, Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
A transfer learning and progressive stacking approach to reducing deep model sizes with an application to speech enhancement. ICASSP 2017: 5575-5579 - [c45]Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee:
Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models. INTERSPEECH 2017: 2759-2763 - [c44]Fengpei Ge, Kehuang Li, Bo Wu, Sabato Marco Siniscalchi, Yonghong Yan, Chin-Hui Lee:
Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition. INTERSPEECH 2017: 3847-3851 - 2016
- [j14]Zhen Huang, Sabato Marco Siniscalchi
, Chin-Hui Lee:
A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition. Neurocomputing 218: 448-459 (2016) - [j13]Hamid Behravan
, Ville Hautamäki
, Sabato Marco Siniscalchi
, Tomi Kinnunen, Chin-Hui Lee:
i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 29-41 (2016) - [c43]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Chin-Hui Lee:
Towards a direct Bayesian adaptation framework for deep models. APSIPA 2016: 1-4 - [c42]Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations. APSIPA 2016: 1-4 - [c41]Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling. ICASSP 2016: 6135-6139 - [c40]Wei Li, Kehuang Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee:
Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees. INTERSPEECH 2016: 3127-3131 - [c39]Ivan Kukanov, Ville Hautamäki
, Sabato Marco Siniscalchi, Kehuang Li:
Deep learning with maximal figure-of-merit cost to advance multi-label speech attribute detection. SLT 2016: 489-495 - 2015
- [c38]Ville Hautamäki, Sabato Marco Siniscalchi, Hamid Behravan, Valerio Mario Salerno, Ivan Kukanov:
Boosting universal speech attributes classification with deep neural network for foreign accent characterization. INTERSPEECH 2015: 408-412 - [c37]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jinyu Li, Jiadong Wu, Chin-Hui Lee:
Maximum a posteriori adaptation of network parameters in deep models. INTERSPEECH 2015: 1076-1080 - [c36]Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Ji Wu, Chin-Hui Lee:
Rapid adaptation for deep neural networks through multi-task learning. INTERSPEECH 2015: 3625-3629 - [i1]Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jiadong Wu, Chin-Hui Lee:
Maximum a Posteriori Adaptation of Network Parameters in Deep Models. CoRR abs/1503.02108 (2015) - 2014
- [j12]Sabato Marco Siniscalchi
, Torbjørn Svendsen, Chin-Hui Lee:
An artificial neural network approach to automatic speech processing. Neurocomputing 140: 326-338 (2014) - [c35]I-Fan Chen, Sabato Marco Siniscalchi
, Chin-Hui Lee:
Attribute based lattice rescoring in spontaneous speech recognition. ICASSP 2014: 3325-3329 - [c34]Hamid Behravan
, Ville Hautamäki, Sabato Marco Siniscalchi
, Tomi Kinnunen, Chin-Hui Lee:
Introducing attribute features to foreign accent recognition. ICASSP 2014: 5332-5336 - [c33]Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, Chin-Hui Lee:
Dialect levelling in Finnish: a universal speech attribute approach. INTERSPEECH 2014: 2165-2169 - [c32]Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Chao Weng, Chin-Hui Lee:
Feature space maximum a posteriori linear regression for adaptation of deep neural networks. INTERSPEECH 2014: 2992-2996 - 2013
- [j11]Sabato Marco Siniscalchi
, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1): 209-227 (2013) - [j10]Sabato Marco Siniscalchi, Jinyu Li
, Chin-Hui Lee:
Model-based margin estimation for hidden Markov model learning and generalisation. IET Signal Process. 7(8): 704-709 (2013) - [j9]Sabato Marco Siniscalchi
, Dong Yu, Li Deng, Chin-Hui Lee:
Exploiting deep neural networks for detection-based speech recognition. Neurocomputing 106: 148-157 (2013) - [j8]Chin-Hui Lee, Sabato Marco Siniscalchi
:
An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition. Proc. IEEE 101(5): 1089-1115 (2013) - [j7]Sabato Marco Siniscalchi
, Dong Yu, Li Deng, Chin-Hui Lee:
Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model. IEEE Signal Process. Lett. 20(3): 201-204 (2013) - [j6]Sabato Marco Siniscalchi
, Torbjørn Svendsen, Chin-Hui Lee:
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition. IEEE Trans. Speech Audio Process. 21(4): 786-797 (2013) - [j5]Sabato Marco Siniscalchi
,