default search action

combined dblp search
author search
venue search
publication search

ask others

Search dblp

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

> Home

Publication search results

found 2,903 matches

2024
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/AlkuKLK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/AlkuKLK24
Paavo Alku, Manila Kodali, Laura Laaksonen, Sudarsana Reddy Kadiri:
AVID: A speech database for machine learning studies on vocal intensity. Speech Commun. 157: 103039 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/BaghelRJCSVG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/BaghelRJCSVG24
Shikha Baghel, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, Sriram Ganapathy:
Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments. Speech Commun. 161: 103080 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/BannoM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/BannoM24
Stefano Bannò, Marco Matassoni:
Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency. Speech Commun. 157: 103025 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/BarrientosC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/BarrientosC24
Eduardo Barrientos, Edson Cataldo:
Adapted Weighted Linear Prediction with Attenuated Main Excitation for formant frequency estimation in high-pitched singing. Speech Commun. 156: 103006 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/BesdouriZB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/BesdouriZB24
Fatma Zahra Besdouri, Inès Zribi, Lamia Hadrich Belguith:
Arabic Automatic Speech Recognition: Challenges and Progress. Speech Commun. 163: 103110 (2024)
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/BoudinBROB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/BoudinBROB24
Auriane Boudin, Roxane Bertrand, Stéphane Rauzy, Magalie Ochs, Philippe Blache:
A multimodal model for predicting feedback position and type during conversation. Speech Commun. 159: 103066 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/ChenWXYP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ChenWXYP24
Zhipeng Chen, Xinheng Wang, Lun Xie, Haijie Yuan, Hang Pan:
LPIPS-AttnWav2Lip: Generic audio-driven lip synchronization for talking head generation in the wild. Speech Commun. 157: 103028 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/DengW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/DengW24
Keqi Deng, Philip C. Woodland:
Decoupled structure for improved adaptability of end-to-end models. Speech Commun. 163: 103109 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/Drgas24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/Drgas24
Szymon Drgas:
Speech intelligibility prediction using generalized ESTOI with fine-tuned parameters. Speech Commun. 159: 103068 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/ElieST24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ElieST24
Benjamin Elie, Juraj Simko, Alice Turk:
Optimization-based planning of speech articulation using general Tau Theory. Speech Commun. 160: 103083 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/EmaraS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/EmaraS24
Ingy Emara, Nabil H. Shaker:
The impact of non-native English speakers' phonological and prosodic features on automatic speech recognition accuracy. Speech Commun. 157: 103038 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/FernandezMartinCPN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/FernandezMartinCPN24
Claudio Fernandez-Martín, Adrián Colomer, Claudio Panariello, Valery Naranjo:
Choosing only the best voice imitators: Top-K many-to-many voice conversion with StarGAN. Speech Commun. 156: 103022 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/GuoWLLT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/GuoWLLT24
Na Guo, Jianguo Wei, Yongwei Li, Wenhuan Lu, Jianhua Tao:
Zero-shot voice conversion based on feature disentanglement. Speech Commun. 165: 103143 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/GuoWZZF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/GuoWZZF24
Minghao Guo, Jianguo Wei, Ruiteng Zhang, Yu Zhao, Qiang Fang:
Multi-modal co-learning for silent speech recognition based on ultrasound tongue images. Speech Commun. 165: 103140 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/Haj24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/Haj24
Abir El Haj:
Emotions recognition in audio signals using an extension of the latent block model. Speech Commun. 161: 103092 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/HuZZHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/HuZZHL24
Qing Hu, Yan Zhang, Xianlei Zhang, Zongyu Han, Xiuxia Liang:
Language fusion via adapters for low-resource speech recognition. Speech Commun. 158: 103037 (2024)
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/JavanmardiKA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/JavanmardiKA24
Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
Pre-trained models for detection and severity level classification of dysarthria from speech. Speech Commun. 158: 103047 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/KachelSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/KachelSS24
Sven Kachel, Adrian P. Simpson, Melanie C. Steffens:
Speakers' vocal expression of sexual orientation depends on experimenter gender. Speech Commun. 156: 103023 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/KangX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/KangX24
Weiyi Kang, Yi Xu:
Tone-syllable synchrony in Mandarin: New evidence and implications. Speech Commun. 163: 103121 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/KarakasidisKBG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/KarakasidisKBG24
Georgios Karakasidis, Mikko Kurimo, Peter Bell, Tamás Grósz:
Comparison and analysis of new curriculum criteria for end-to-end ASR. Speech Commun. 163: 103113 (2024)
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/KeerthanaRARM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/KeerthanaRARM24
Yagnavajjula Madhu Keerthana, Mittapalle Kiran Reddy, Paavo Alku, K. Sreenivasa Rao, Pabitra Mitra:
Automatic classification of neurological voice disorders using wavelet scattering features. Speech Commun. 157: 103040 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/KressnerJKMPBHBKM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/KressnerJKMPBHBKM24
Abigail Anne Kressner, Kirsten Maria Jensen-Rico, Johannes Kizach, Brian Kai Loong Man, Anja Kofoed Pedersen, Lars Bramsløw, Lise Bruun Hansen, Laura Winther Balling, Brent Kirkwood, Tobias May:
A corpus of audio-visual recordings of linguistically balanced, Danish sentences for speech-in-noise experiments. Speech Commun. 165: 103141 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/LiWGULD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LiWGULD24
Nan Li, Longbiao Wang, Meng Ge, Masashi Unoki, Sheng Li, Jianwu Dang:
Robust voice activity detection using an auditory-inspired masked modulation encoder based convolutional attention network. Speech Commun. 157: 103024 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/LiXKS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LiXKS24
Zimu Li, Yanyan Xu, Dengfeng Ke, Kaile Su:
PLDE: A lightweight pooling layer for spoken language recognition. Speech Commun. 158: 103055 (2024)
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/LinB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LinB24
Wei-Cheng Lin, Carlos Busso:
Deep temporal clustering features for speech emotion recognition. Speech Commun. 157: 103027 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/LiuRRMZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LiuRRMZZ24
Minying Liu, Alex Noel Joseph Raj, Vijayarajan Rajangam, Kunwu Ma, Zhemin Zhuang, Shuxin Zhuang:
Multiscale-multichannel feature extraction and classification through one-dimensional convolutional neural network for Speech emotion recognition. Speech Commun. 156: 103010 (2024)
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/speech/LuS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LuS24
JinHong Lu, Hiroshi Shimodaira:
Speech-driven head motion generation from waveforms. Speech Commun. 159: 103056 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MaWLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MaWLZ24
Fei Ma, Chengliang Wang, Xusheng Li, Zhuo Zeng:
Selective transfer subspace learning for small-footprint end-to-end cross-domain keyword spotting. Speech Commun. 156: 103019 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MaYWYTWWF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MaYWYTWWF24
Haoxin Ma, Jiangyan Yi, Chenglong Wang, Xinrui Yan, Jianhua Tao, Tao Wang, Shiming Wang, Ruibo Fu:
CFAD: A Chinese dataset for fake audio detection. Speech Commun. 164: 103122 (2024)
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MorroneCSZBS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MorroneCSZBS24
Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini:
End-to-end integration of speech separation and voice activity detection for low-latency diarization of telephone conversations. Speech Commun. 161: 103081 (2024)

skipping 2,873 more matches

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.

Search dblp

Full-text search

Please enter a search query

Author search results

Venue search results

Refine list

Publication search results