- JongYoon Lim, Inkyu Sa, Bruce A. MacDonald, Ho Seok Ahn:
Enhancing Human-Robot Interaction: Integrating ASL Recognition and LLM-Driven Co-Speech Gestures in Pepper Robot with a Compact Neural Network. UR 2024: 663-668 - Sara Kaszuba, Sandeep Reddy Sabbella, Francesco Leotta, Pascal Serrarens, Daniele Nardi:
Testing Human-Robot Interaction in Virtual Reality: Experience from a Study on Speech Act Classification. CoRR abs/2401.04534 (2024) - Snehesh Shrestha, Yantian Zha, Saketh Banagiri, Ge Gao, Yiannis Aloimonos, Cornelia Fermüller:
NatSGD: A Dataset with Speech, Gestures, and Demonstrations for Robot Learning in Natural Human-Robot Interaction. CoRR abs/2403.02274 (2024) - Yue Li, Koen V. Hindriks, Florian Kunneman:
Single-Channel Robot Ego-Speech Filtering during Human-Robot Interaction. CoRR abs/2403.02918 (2024) - Ruben Janssens, Eva Verhelst, Giulio Antonio Abbo, Qiaoqiao Ren, María J. Pinto Bernal, Tony Belpaeme:
Child Speech Recognition in Human-Robot Interaction: Problem Solved? CoRR abs/2404.17394 (2024) - Yue Li, Florian A. Kunneman, Koen V. Hindriks:
A Near-Real-Time Processing Ego Speech Filtering Pipeline Designed for Speech Interruption During Human-Robot Interaction. CoRR abs/2405.13477 (2024) - Anfeng Xu, Kevin Huang, Tiantian Feng, Lue Shen, Helen Tager-Flusberg, Shrikanth Narayanan:
Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions. CoRR abs/2406.07890 (2024) - Rémi Uro, Marie Tahon, David Doukhan, Antoine Laurent, Albert Rilliard:
Detecting the terminality of speech-turn boundary for spoken interactions in French TV and Radio content. CoRR abs/2406.10073 (2024) - Gabriela Molina León, Anastasia Bezerianos, Olivier Gladin, Petra Isenberg:
Talk to the Wall: The Role of Speech Interaction in Collaborative Visual Analytics. CoRR abs/2408.03813 (2024) - Suyi Zhang, Ekram Alam, Jack Baber, Francesca Bianco, Edward Turner, Maysam Chamanzar, Hamid Dehghani:
MindGPT: Advancing Human-AI Interaction with Non-Invasive fNIRS-Based Imagined Speech Decoding. CoRR abs/2408.05361 (2024) - Suyi Zhang, Ekram Alam, Jack Baber, Francesca Bianco, Edward Turner, Maysam Chamanzar, Hamid Dehghani:
MindSpeech: Continuous Imagined Speech Decoding using High-Density fNIRS and Prompt Tuning for Advanced Human-AI Interaction. CoRR abs/2408.05362 (2024) - (Withdrawn) RETRACTED ARTICLE: Research on life prediction method of rolling bearing based on deep learning and voice interaction technology. Int. J. Speech Technol. 27(2): 507 (2024)
- 2023
- Razan Jaber:
Towards Designing Better Speech Agent Interaction: Using Eye Gaze for Interaction. Stockholm University, Sweden, 2023 - Martin Lebourdais:
Interactions entre locuteurs : de la détection de la parole superposée à la détection des interruptions. (Speaker interactions : from overlapped speech to interruption detection). Le Mans University, France, 2023 - Pablo Arnau-González, Miguel Arevalillo-Herráez, Romina Soledad Albornoz-De Luise, David Arnau:
A methodological approach to enable natural language interaction in an Intelligent Tutoring System. Comput. Speech Lang. 81: 101516 (2023) - Casey C. Bennett, Young-Ho Bae, Jun Hyung Yoon, Yejin Chae, Eunseo Yoon, Seeun Lee, Uijae Ryu, Say Young Kim, Benjamin Weiss:
Effects of cross-cultural language differences on social cognition during human-agent interaction in cooperative game environments. Comput. Speech Lang. 81: 101521 (2023) - Sippee Bharadwaj, Purnendu Bikash Acharjee:
Exploring human voice prosodic features and the interaction between the excitation signal and vocal tract for Assamese speech. Int. J. Speech Technol. 26(1): 77-93 (2023) - Lishuang Zhan, Tianyang Xiong, Hongwei Zhang, Shihui Guo, Xiaowei Chen, Jiangtao Gong, Juncong Lin, Yipeng Qin:
TouchEditor: Interaction Design and Evaluation of a Flexible Touchpad for Text Editing of Head-Mounted Displays in Speech-unfriendly Environments. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 7(4): 198:1-198:29 (2023) - Yongjian Dong, Qinrong Ye:
Neural network-based speech fuzzy enhancement algorithm for smart home interaction. J. Comput. Methods Sci. Eng. 23(3): 1225-1236 (2023) - Chen Li, Dimitrios Chrysostomou, Hongji Yang:
A speech-enabled virtual assistant for efficient human-robot interaction in industrial environments. J. Syst. Softw. 205: 111818 (2023) - Fateme Nazari, Shima Tabibian, Elaheh Homayounvala:
Multimodal user interaction with in-car equipment in real conditions based on touch and speech modes in the Persian language. Multim. Tools Appl. 82(9): 12995-13023 (2023) - Qisheng Yang, Weiqiu Jin, Qihang Zhang, Yuhong Wei, Zhanfeng Guo, Xiaoshi Li, Yi Yang, Qingquan Luo, He Tian, Tianling Ren:
Mixed-modality speech recognition and interaction using a wearable artificial throat. Nat. Mac. Intell. 5(2): 169-180 (2023) - Waleed Alsabhan:
Human-Computer Interaction with a Real-Time Speech Emotion Recognition with Ensembling Techniques 1D Convolution Neural Network and Attention. Sensors 23(3): 1386 (2023) - Nigel G. Ward, Jonathan E. Avila:
A dimensional model of interaction style variation in spoken dialog. Speech Commun. 149: 47-62 (2023) - Fangkai Jiao, Yangyang Guo, Minlie Huang, Liqiang Nie:
Enhanced Multi-Domain Dialogue State Tracker With Second-Order Slot Interactions. IEEE ACM Trans. Audio Speech Lang. Process. 31: 265-276 (2023) - Yaxin Liu, Yan Zhou, Ziming Li, Junlin Wang, Wei Zhou, Songlin Hu:
HIM: An End-to-End Hierarchical Interaction Model for Aspect Sentiment Triplet Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2272-2285 (2023) - Yukang Yan, Haohua Liu, Yingtian Shi, Jingying Wang, Ruici Guo, Zisu Li, Xuhai Xu, Chun Yu, Yuntao Wang, Yuanchun Shi:
ConeSpeech: Exploring Directional Speech Interaction for Multi-Person Remote Communication in Virtual Reality. IEEE Trans. Vis. Comput. Graph. 29(5): 2647-2657 (2023) - Zehua Zhang, Changjun He, Shiyun Xu, Mingjiang Wang:
Real and imaginary part interaction network for monaural speech enhancement and de-reverberation. APSIPA ASC 2023: 972-977 - Jun Rekimoto:
WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions. CHI 2023: 700:1-700:12 - Zixiong Su, Shitao Fang, Jun Rekimoto:
LipLearner: Customizable Silent Speech Interactions on Mobile Devices. CHI 2023: 696:1-696:21