Junxiao Xue, Hao Zhou, Huawei Song, Bin Wu, Lei Shi: Cross-modal information fusion for voice spoofing detection. Speech Commun. 147: 41-50 (2023)
Bingyuan Huang, Sanshuai Cui, Jiwu Huang, Xiangui Kang: Discriminative Frequency Information Learning for End-to-End Speech Anti-Spoofing. IEEE Signal Process. Lett. 30: 185-189 (2023)
Weizhao Zhang, Hongwu Yang: Improving Sequence-to-sequence Tibetan Speech Synthesis with Prosodic Information. ACM Trans. Asian Low Resour. Lang. Inf. Process. 22(9): 225:1-225:13 (2023)
Qian-Bei Hong, Chung-Hsien Wu, Hsin-Min Wang: Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1745-1757 (2023)
Paul Konstantin Krug, Peter Birkholz, Branislav Gerazov, Daniel Rudolph van Niekerk, Anqi Xu, Yi Xu: Artificial Vocal Learning Guided by Phoneme Recognition and Visual Information. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1734-1744 (2023)
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Xixin Wu, Shiyin Kang, Helen Meng: MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3290-3303 (2023)
Yen-Ju Lu, Chia-Yu Chang, Cheng Yu, Ching-Feng Liu, Jeih-weih Hung, Shinji Watanabe, Yu Tsao: Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2738-2750 (2023)
Ji Won Yoon, Hyung Yong Kim, Hyeonseung Lee, Sunghwan Ahn, Nam Soo Kim: Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2974-2987 (2023)
Eunseop Yoon, Hee Suk Yoon, John B. Harvill, Mark Hasegawa-Johnson, Chang Dong Yoo: INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition. ACL (Findings) 2023: 9893-9902
Xuening Wang, Zhaopeng Qian, Chongchong Yu: Multi-stage Multi-modalities Fusion of Lip, Tongue and Acoustics Information for Speech Recognition. AICCC 2023: 226-231
Pan Deng, Jie Zhang, Xinyuan Zhou, Zhongyi Ye, Weitai Zhang, Jianwei Cui, Lirong Dai: Learning Semantic Information from Machine Translation to Improve Speech-to-Text Translation. APSIPA ASC 2023: 954-959
Aoqi Guo, Junnan Wu, Peng Gao, Wenbo Zhu, Qinwen Guo, Dazhi Gao, Yujun Wang: Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction. APSIPA ASC 2023: 107-113
Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka: Construction of Automatic Speech Recognition Model that Recognizes Linguistic Information and Verbal/Non-verbal Phenomena. APSIPA ASC 2023: 2306-2311
Pengcheng Yue, Shu-Kai Zheng, Taihao Li: Complex Feature Information Enhanced Speech Emotion Recognition. APSIPA ASC 2023: 941-946
Lloyd May, So Yeon Park, Jonathan Berger: Enhancing Non-Speech Information Communicated in Closed Captioning Through Critical Design. ASSETS 2023: 16:1-16:14
Yinggang Liu, Yuanjie Deng, Ying Wei: Cross-modal Speech Separation Without Visual Information During Testing. BioCAS 2023: 1-5
Shunbo Dong, Jun Xue, Cunhang Fan, Kang Zhu, Yujie Chen, Zhao Lv: Multi-perspective Information Fusion Res2Net with Random Specmix for Fake Speech Detection. DADA@IJCAI 2023: 82-88
Jeong-Yoon Kim, Seung-Ho Lee: CoordViT: A Novel Method of Improve Vision Transformer-Based Speech Emotion Recognition using Coordinate Information Concatenate. ICEIC 2023: 1-4
Dejan Porjazovski, Tamás Grósz, Mikko Kurimo: Topic Identification for Spontaneous Speech: Enriching Audio Features with Embedded Linguistic Information. EUSIPCO 2023: 396-400
Ella Velner: Measuring Trust in Children's Speech: Towards Responsible Robot-Supported Information Search. HRI (Companion) 2023: 748-750
Hang Chen, Shilong Wu, Yusheng Dai, Zhe Wang, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu: Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge. ICASSP 2023: 1-2
Wei-Cheng Lin, Carlos Busso: Role of Lexical Boundary Information in Chunk-Level Segmentation for Speech Emotion Recognition. ICASSP 2023: 1-5
Tilak Purohit, Sarthak Yadav, Bogdan Vlasenko, S. Pavankumar Dubagunta, Mathew Magimai-Doss: Towards Learning Emotion Information from Short Segments of Speech. ICASSP 2023: 1-5
Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu: The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition. ICASSP 2023: 1-5
Chenyue Zhang, Hang Chen, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee: Incorporating Visual Information Reconstruction into Progressive Learning for Optimizing audio-visual Speech Enhancement. ICASSP 2023: 1-5
Siddhant Bikram Shah, Aashish Bhandari, Prashant Giridhar Shambharkar: Leveraging Multimodal Information in Speech Data for the Non-Invasive Detection of Alzheimer's Disease. ICCCNT 2023: 1-6
Rodolfo L. Tonoli, Leonardo B. de M. M. Marques, Lucas H. Ueda, Paula Dornhofer Paro Costa: Gesture Generation with Diffusion Models Aided by Speech Activity Information. ICMI Companion 2023: 193-199
Yihe Deng, Zuxu Dai: A Short Text Classification Model Based on Chinese Part-of-Speech Information and Mutual Learning. ICPCSEE (2) 2023: 330-343
Masaaki Yokoyama, Kazutaka Shimada: Detecting speech recognition errors using topic information and BERT. IIAI-AAI 2023: 400-405
Damián Ariel Furman, Pablo Torres, José A. Rodríguez, Diego Letzen, Maria Vanina Martinez, Laura Alonso Alemany: An Initial Exploration of How Argumentative Information Impacts Automatic Generation of Counter-Narratives Against Hate Speech. Arg&App@KR 2023: 26-39