


default search action
Computer Speech & Language, Volume 95
Volume 95, 2026
- Hangshou Shao, Yilin Pan, Yue Wang, Yi-Jia Zhang
:
Modality fusion using auxiliary tasks for dementia detection. 101814 - Samreen Kazi
, Shakeel A. Khoja:
Towards building Urdu language document retrieval framework. 101797 - Francisco Teixeira
, Karla Pizzi, Raphaël Olivier, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Exploring features for membership inference in ASR model auditing. 101812 - Yoshiki Masuyama
, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
An end-to-end integration of speech separation and recognition with self-supervised learning representation. 101813 - Paige Tuttösí
, Mantaj Dhillon, Luna Sang, Shane Eastwood, Poorvi Bhatia, Quang Minh Dinh, Avni Kapoor, Yewon Jin, Angelica Lim:
BERSting at the screams: A benchmark for distanced, emotional and shouted speech recognition. 101815 - Qinwen Hu
, Tianchi Sun, Xin'an Chen, Xiaobin Rong, Jing Lu:
Optimization of modular multi-speaker distant conversational speech recognition. 101816 - Chaina Santos Oliveira, Ricardo B. C. Prudêncio:
An item response theory framework to evaluate automatic speech recognition systems against speech difficulty. 101817 - Fan Yang, Tan Zhu, Jing Huang, Zhilin Huang, Guoqi Xie:
A novel graph kernel algorithm for improving the effect of text classification. 101818 - Yuxin Wu, Guofeng Deng:
Aspect-level sentiment analysis based on graph convolutional networks and interactive aggregate attention. 101819 - Naoyuki Kamo, Naohiro Tawara, Atsushi Ando, Takatomo Kano, Hiroshi Sato, Rintaro Ikeshita, Takafumi Moriya, Shota Horiguchi, Kohei Matsuura, Atsunori Ogawa, Alexis Plaquet, Takanori Ashihara, Tsubasa Ochiai, Masato Mimura, Marc Delcroix, Tomohiro Nakatani, Taichi Asami, Shoko Araki:
Microphone array geometry-independent multi-talker distant ASR: NTT system for DASR task of the CHiME-8 challenge. 101820 - Yufeng Yang
, Ashutosh Pandey, DeLiang Wang:
Towards decoupling frontend enhancement and backend recognition in monaural robust ASR. 101821 - Yuewei Wu, Jialu Wang, Xiaoli Feng, Zhaoliang Wu, Jiakai Peng, Fulian Yin
:
Multi-task unified model for Chinese aspect-based sentiment analysis. 101822 - Jule Pohlhausen
, Francesco Nespoli, Jörg Bitzer
:
Towards privacy-preserving conversation analysis in everyday life: Exploring the privacy-utility trade-off. 101823 - Joonas Kalda
, Séverin Baroudi, Martin Lebourdais, Clément Pagés, Ricard Marxer
, Tanel Alumäe, Hervé Bredin:
Design choices for PixIT-based speaker-attributed ASR: Team ToTaTo at the NOTSOFAR-1 challenge. 101824 - Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans, Kong Aik Lee, Junichi Yamagishi, Myeonghun Jeong, Ge Zhu, Yongyi Zang, You Zhang, Soumi Maiti, Florian Lux, Nicolas Müller, Wangyou Zhang, Chengzhe Sun, Shuwei Hou, Siwei Lyu, Sébastien Le Maguer, Cheng Gong, Hanjie Guo, Liping Chen, Vishwanath Pratap Singh:
ASVspoof 5: Design, collection and validation of resources for spoofing, deepfake, and adversarial attack detection using crowdsourced speech. 101825 - Jianling Li, Meishan Zhang, Jianrong Wang, Min Zhang, Yue Zhang
:
Universal constituency treebanking and parsing: A pilot study. 101826 - Hongbo Lan, Ya Jiang, Jun Du, Qing Wang:
Exploring knowledge distillation for low-resource multi-modal streaming ASR in the CHiME-8 MMCSG challenge. 101837 - Changfan Luo
, Ling Fang, Bensheng Qiu:
Sentiment analysis for live video comments with variational residual representations. 101838 - Jagabandhu Mishra, Manasi Chhibber, Hye-jin Shim, Tomi H. Kinnunen:
Towards explainable spoofed speech attribution and detection: A probabilistic approach for characterizing speech synthesizer components. 101840 - Weiwei Li, Yuzhong Chen
, Junjie Xu, Jiayuan Zhong, Chen Dong:
Multi-turn response selection with Language Style and Topic Aware enhancement. 101842 - Yuta Hirano
, Mau Nguyen, Kakeru Azuma, Jan Meyer Saragih, Sakriani Sakti
:
Toward fast meeting transcription: NAIST system for CHiME-8 NOTSOFAR-1 task and its analysis. 101836 - Zhengjun Yue
, Erfan Loweimi, Zoran Cvetkovic, Jon Barker, Heidi Christensen:
Raw acoustic-articulatory multimodal dysarthric speech recognition. 101839 - Alexander Polok
, Dominik Klement, Martin Kocour, Jiangyu Han, Federico Landini, Bolaji Yusuf, Matthew Wiesner, Sanjeev Khudanpur, Jan Cernocký, Lukás Burget:
DiCoW: Diarization-conditioned Whisper for target speaker automatic speech recognition. 101841 - Rhiannon Mogridge
, Anton Ragni:
Minerva 2 for speech and language tasks. 101843 - Matej Ulcar, Ales Zagar, Carlos Santos Armendariz, Andraz Repar, Senja Pollak, Matthew Purver, Marko Robnik-Sikonja:
Mono- and cross-lingual evaluation of representation language models on less-resourced languages. 101852 - Ashwini Dasare
, K. T. Deepak:
Performance assessment of voice conversion models using speech production-based parameters. 101853 - Apiwat Ditthapron
, Emmanuel O. Agu, Adam C. Lammert:
Privacy-preserving feature extractor using adversarial pruning for TBI assessment from speech. 101854 - Xiaoyang Wang, Wenfeng Liu:
Sentiment classification method based on BERT-CondConv multi-moment state fusion. 101855 - Carlos Mena, Ana Laura Padilla-Ortíz, Felipe Orduña-Bustamante:
Automatic speech recognition in the presence of babble noise and reverberation compared to human speech intelligibility in Spanish. 101856 - Asmaa Alrayzah
, Fawaz Alsolami, Mostafa Saleh:
AraFastQA: a transformer model for question-answering for Arabic language using few-shot learning. 101857 - Wenwei Dong, Catia Cucchiarini, Roeland van Hout, Helmer Strik:
Predicting accentedness and comprehensibility through ASR scores and acoustic features. 101858 - Vishwanath Pratap Singh, Md. Sahidullah, Tomi H. Kinnunen:
Causal analysis of ASR errors for children: Quantifying the impact of physiological, cognitive, and extrinsic factors. 101859 - Hiroaki Takatsu, Shungo Suzuki
, Masaki Eguchi, Ryuki Matsuura, Mao Saeki, Yoichi Matsuyama:
Gnowsis: Multimodal multitask learning for oral proficiency assessments. 101860

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
