Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

dblp key: journals/speech/XueZSWS23
persistent URL: https://dblp.org/rec/journals/speech/XueZSWS23
Junxiao Xue, Hao Zhou, Huawei Song, Bin Wu, Lei Shi:
Cross-modal information fusion for voice spoofing detection. Speech Commun. 147: 41-50 (2023)
dblp key: journals/spl/HuangCHK23
persistent URL: https://dblp.org/rec/journals/spl/HuangCHK23
Bingyuan Huang, Sanshuai Cui, Jiwu Huang, Xiangui Kang:
Discriminative Frequency Information Learning for End-to-End Speech Anti-Spoofing. IEEE Signal Process. Lett. 30: 185-189 (2023)
dblp key: journals/talip/ZhangY23
persistent URL: https://dblp.org/rec/journals/talip/ZhangY23
Weizhao Zhang, Hongwu Yang:
Improving Sequence-to-sequence Tibetan Speech Synthesis with Prosodic Information. ACM Trans. Asian Low Resour. Lang. Inf. Process. 22(9): 225:1-225:13 (2023)
dblp key: journals/taslp/HongWW23a
persistent URL: https://dblp.org/rec/journals/taslp/HongWW23a
Qian-Bei Hong, Chung-Hsien Wu, Hsin-Min Wang:
Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1745-1757 (2023)
dblp key: journals/taslp/KrugBGNXX23
persistent URL: https://dblp.org/rec/journals/taslp/KrugBGNXX23
Paul Konstantin Krug, Peter Birkholz, Branislav Gerazov, Daniel Rudolph van Niekerk, Anqi Xu, Yi Xu:
Artificial Vocal Learning Guided by Phoneme Recognition and Visual Information. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1734-1744 (2023)
dblp key: journals/taslp/LeiZCWWKM23
persistent URL: https://dblp.org/rec/journals/taslp/LeiZCWWKM23
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Xixin Wu, Shiyin Kang, Helen Meng:
MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3290-3303 (2023)
dblp key: journals/taslp/LuCYLHWT23
persistent URL: https://dblp.org/rec/journals/taslp/LuCYLHWT23
electronic edition: via DOI (open access)
Yen-Ju Lu, Chia-Yu Chang, Cheng Yu, Ching-Feng Liu, Jeih-weih Hung, Shinji Watanabe, Yu Tsao:
Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2738-2750 (2023)
dblp key: journals/taslp/YoonKLAK23
persistent URL: https://dblp.org/rec/journals/taslp/YoonKLAK23
Ji Won Yoon, Hyung Yong Kim, Hyeonseung Lee, Sunghwan Ahn, Nam Soo Kim:
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2974-2987 (2023)
dblp key: conf/acl/YoonYHHY23
persistent URL: https://dblp.org/rec/conf/acl/YoonYHHY23
Eunseop Yoon, Hee Suk Yoon, John B. Harvill, Mark Hasegawa-Johnson, Chang Dong Yoo:
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition. ACL (Findings) 2023: 9893-9902
dblp key: conf/aiccc/WangQY23
persistent URL: https://dblp.org/rec/conf/aiccc/WangQY23
Xuening Wang, Zhaopeng Qian, Chongchong Yu:
Multi-stage Multi-modalities Fusion of Lip, Tongue and Acoustics Information for Speech Recognition. AICCC 2023: 226-231
dblp key: conf/apsipa/DengZZYZCD23
persistent URL: https://dblp.org/rec/conf/apsipa/DengZZYZCD23
Pan Deng, Jie Zhang, Xinyuan Zhou, Zhongyi Ye, Weitai Zhang, Jianwei Cui, Lirong Dai:
Learning Semantic Information from Machine Translation to Improve Speech-to-Text Translation. APSIPA ASC 2023: 954-959
dblp key: conf/apsipa/GuoWGZGGW23
persistent URL: https://dblp.org/rec/conf/apsipa/GuoWGZGGW23
Aoqi Guo, Junnan Wu, Peng Gao, Wenbo Zhu, Qinwen Guo, Dazhi Gao, Yujun Wang:
Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction. APSIPA ASC 2023: 107-113
dblp key: conf/apsipa/ShioneWK23
persistent URL: https://dblp.org/rec/conf/apsipa/ShioneWK23
Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka:
Construction of Automatic Speech Recognition Model that Recognizes Linguistic Information and Verbal/Non-verbal Phenomena. APSIPA ASC 2023: 2306-2311
dblp key: conf/apsipa/YueZL23
persistent URL: https://dblp.org/rec/conf/apsipa/YueZL23
Pengcheng Yue, Shu-Kai Zheng, Taihao Li:
Complex Feature Information Enhanced Speech Emotion Recognition. APSIPA ASC 2023: 941-946
dblp key: conf/assets/MayPB23
persistent URL: https://dblp.org/rec/conf/assets/MayPB23
Lloyd May, So Yeon Park, Jonathan Berger:
Enhancing Non-Speech Information Communicated in Closed Captioning Through Critical Design. ASSETS 2023: 16:1-16:14
dblp key: conf/biocas/LiuD023
persistent URL: https://dblp.org/rec/conf/biocas/LiuD023
Yinggang Liu, Yuanjie Deng, Ying Wei:
Cross-modal Speech Separation Without Visual Information During Testing. BioCAS 2023: 1-5
dblp key: conf/dada/DongXFZCL23
persistent URL: https://dblp.org/rec/conf/dada/DongXFZCL23
electronic edition: ceur-ws.org (open access)
Shunbo Dong, Jun Xue, Cunhang Fan, Kang Zhu, Yujie Chen, Zhao Lv:
Multi-perspective Information Fusion Res2Net with Random Specmix for Fake Speech Detection. DADA@IJCAI 2023: 82-88
dblp key: conf/elinfocom/KimL23
persistent URL: https://dblp.org/rec/conf/elinfocom/KimL23
Jeong-Yoon Kim, Seung-Ho Lee:
CoordViT: A Novel Method of Improve Vision Transformer-Based Speech Emotion Recognition using Coordinate Information Concatenate. ICEIC 2023: 1-4
dblp key: conf/eusipco/PorjazovskiGK23
persistent URL: https://dblp.org/rec/conf/eusipco/PorjazovskiGK23
Dejan Porjazovski, Tamás Grósz, Mikko Kurimo:
Topic Identification for Spontaneous Speech: Enriching Audio Features with Embedded Linguistic Information. EUSIPCO 2023: 396-400
dblp key: conf/hri/Velner23
persistent URL: https://dblp.org/rec/conf/hri/Velner23
Ella Velner:
Measuring Trust in Children's Speech: Towards Responsible Robot-Supported Information Search. HRI (Companion) 2023: 748-750
dblp key: conf/icassp/ChenWDWD0C0SSLY23
persistent URL: https://dblp.org/rec/conf/icassp/ChenWDWD0C0SSLY23
Hang Chen, Shilong Wu, Yusheng Dai, Zhe Wang, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Bao-Cai Yin, Jia Pan, Jianqing Gao, Cong Liu:
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge. ICASSP 2023: 1-2
dblp key: conf/icassp/LinB23
persistent URL: https://dblp.org/rec/conf/icassp/LinB23
Wei-Cheng Lin, Carlos Busso:
Role of Lexical Boundary Information in Chunk-Level Segmentation for Speech Emotion Recognition. ICASSP 2023: 1-5
dblp key: conf/icassp/PurohitYVDM23
persistent URL: https://dblp.org/rec/conf/icassp/PurohitYVDM23
Tilak Purohit, Sarthak Yadav, Bogdan Vlasenko, S. Pavankumar Dubagunta, Mathew Magimai-Doss:
Towards Learning Emotion Information from Short Segments of Speech. ICASSP 2023: 1-5
dblp key: conf/icassp/WangWCHDLCWSSLYPGL23
persistent URL: https://dblp.org/rec/conf/icassp/WangWCHDLCWSSLYPGL23
Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu:
The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition. ICASSP 2023: 1-5
dblp key: conf/icassp/ZhangCDYPL23
persistent URL: https://dblp.org/rec/conf/icassp/ZhangCDYPL23
Chenyue Zhang, Hang Chen, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee:
Incorporating Visual Information Reconstruction into Progressive Learning for Optimizing audio-visual Speech Enhancement. ICASSP 2023: 1-5
dblp key: conf/icccnt/ShahBS23
persistent URL: https://dblp.org/rec/conf/icccnt/ShahBS23
Siddhant Bikram Shah, Aashish Bhandari, Prashant Giridhar Shambharkar:
Leveraging Multimodal Information in Speech Data for the Non-Invasive Detection of Alzheimer's Disease. ICCCNT 2023: 1-6
dblp key: conf/icmi/TonoliMUC23
persistent URL: https://dblp.org/rec/conf/icmi/TonoliMUC23
Rodolfo L. Tonoli, Leonardo B. de M. M. Marques, Lucas H. Ueda, Paula Dornhofer Paro Costa:
Gesture Generation with Diffusion Models Aided by Speech Activity Information. ICMI Companion 2023: 193-199
dblp key: conf/icycsee/DengD23
persistent URL: https://dblp.org/rec/conf/icycsee/DengD23
Yihe Deng, Zuxu Dai:
A Short Text Classification Model Based on Chinese Part-of-Speech Information and Mutual Learning. ICPCSEE (2) 2023: 330-343
dblp key: conf/iiaiaai/YokoyamaS23
persistent URL: https://dblp.org/rec/conf/iiaiaai/YokoyamaS23
Masaaki Yokoyama, Kazutaka Shimada:
Detecting speech recognition errors using topic information and BERT. IIAI-AAI 2023: 400-405
dblp key: conf/kr/FurmanT0LMA23
persistent URL: https://dblp.org/rec/conf/kr/FurmanT0LMA23
electronic edition: ceur-ws.org (open access)
Damián Ariel Furman, Pablo Torres, José A. Rodríguez, Diego Letzen, Maria Vanina Martinez, Laura Alonso Alemany:
An Initial Exploration of How Argumentative Information Impacts Automatic Generation of Counter-Narratives Against Hate Speech. Arg&App@KR 2023: 26-39