default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
no matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 2,903 matches
- 2024
- Paavo Alku, Manila Kodali, Laura Laaksonen, Sudarsana Reddy Kadiri:
AVID: A speech database for machine learning studies on vocal intensity. Speech Commun. 157: 103039 (2024) - Shikha Baghel, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, Sriram Ganapathy:
Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments. Speech Commun. 161: 103080 (2024) - Stefano Bannò, Marco Matassoni:
Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency. Speech Commun. 157: 103025 (2024) - Eduardo Barrientos, Edson Cataldo:
Adapted Weighted Linear Prediction with Attenuated Main Excitation for formant frequency estimation in high-pitched singing. Speech Commun. 156: 103006 (2024) - Fatma Zahra Besdouri, Inès Zribi, Lamia Hadrich Belguith:
Arabic Automatic Speech Recognition: Challenges and Progress. Speech Commun. 163: 103110 (2024) - Auriane Boudin, Roxane Bertrand, Stéphane Rauzy, Magalie Ochs, Philippe Blache:
A multimodal model for predicting feedback position and type during conversation. Speech Commun. 159: 103066 (2024) - Zhipeng Chen, Xinheng Wang, Lun Xie, Haijie Yuan, Hang Pan:
LPIPS-AttnWav2Lip: Generic audio-driven lip synchronization for talking head generation in the wild. Speech Commun. 157: 103028 (2024) - Keqi Deng, Philip C. Woodland:
Decoupled structure for improved adaptability of end-to-end models. Speech Commun. 163: 103109 (2024) - Szymon Drgas:
Speech intelligibility prediction using generalized ESTOI with fine-tuned parameters. Speech Commun. 159: 103068 (2024) - Benjamin Elie, Juraj Simko, Alice Turk:
Optimization-based planning of speech articulation using general Tau Theory. Speech Commun. 160: 103083 (2024) - Ingy Emara, Nabil H. Shaker:
The impact of non-native English speakers' phonological and prosodic features on automatic speech recognition accuracy. Speech Commun. 157: 103038 (2024) - Claudio Fernandez-Martín, Adrián Colomer, Claudio Panariello, Valery Naranjo:
Choosing only the best voice imitators: Top-K many-to-many voice conversion with StarGAN. Speech Commun. 156: 103022 (2024) - Na Guo, Jianguo Wei, Yongwei Li, Wenhuan Lu, Jianhua Tao:
Zero-shot voice conversion based on feature disentanglement. Speech Commun. 165: 103143 (2024) - Minghao Guo, Jianguo Wei, Ruiteng Zhang, Yu Zhao, Qiang Fang:
Multi-modal co-learning for silent speech recognition based on ultrasound tongue images. Speech Commun. 165: 103140 (2024) - Abir El Haj:
Emotions recognition in audio signals using an extension of the latent block model. Speech Commun. 161: 103092 (2024) - Qing Hu, Yan Zhang, Xianlei Zhang, Zongyu Han, Xiuxia Liang:
Language fusion via adapters for low-resource speech recognition. Speech Commun. 158: 103037 (2024) - Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
Pre-trained models for detection and severity level classification of dysarthria from speech. Speech Commun. 158: 103047 (2024) - Sven Kachel, Adrian P. Simpson, Melanie C. Steffens:
Speakers' vocal expression of sexual orientation depends on experimenter gender. Speech Commun. 156: 103023 (2024) - Weiyi Kang, Yi Xu:
Tone-syllable synchrony in Mandarin: New evidence and implications. Speech Commun. 163: 103121 (2024) - Georgios Karakasidis, Mikko Kurimo, Peter Bell, Tamás Grósz:
Comparison and analysis of new curriculum criteria for end-to-end ASR. Speech Commun. 163: 103113 (2024) - Yagnavajjula Madhu Keerthana, Mittapalle Kiran Reddy, Paavo Alku, K. Sreenivasa Rao, Pabitra Mitra:
Automatic classification of neurological voice disorders using wavelet scattering features. Speech Commun. 157: 103040 (2024) - Abigail Anne Kressner, Kirsten Maria Jensen-Rico, Johannes Kizach, Brian Kai Loong Man, Anja Kofoed Pedersen, Lars Bramsløw, Lise Bruun Hansen, Laura Winther Balling, Brent Kirkwood, Tobias May:
A corpus of audio-visual recordings of linguistically balanced, Danish sentences for speech-in-noise experiments. Speech Commun. 165: 103141 (2024) - Nan Li, Longbiao Wang, Meng Ge, Masashi Unoki, Sheng Li, Jianwu Dang:
Robust voice activity detection using an auditory-inspired masked modulation encoder based convolutional attention network. Speech Commun. 157: 103024 (2024) - Zimu Li, Yanyan Xu, Dengfeng Ke, Kaile Su:
PLDE: A lightweight pooling layer for spoken language recognition. Speech Commun. 158: 103055 (2024) - Wei-Cheng Lin, Carlos Busso:
Deep temporal clustering features for speech emotion recognition. Speech Commun. 157: 103027 (2024) - Minying Liu, Alex Noel Joseph Raj, Vijayarajan Rajangam, Kunwu Ma, Zhemin Zhuang, Shuxin Zhuang:
Multiscale-multichannel feature extraction and classification through one-dimensional convolutional neural network for Speech emotion recognition. Speech Commun. 156: 103010 (2024) - JinHong Lu, Hiroshi Shimodaira:
Speech-driven head motion generation from waveforms. Speech Commun. 159: 103056 (2024) - Fei Ma, Chengliang Wang, Xusheng Li, Zhuo Zeng:
Selective transfer subspace learning for small-footprint end-to-end cross-domain keyword spotting. Speech Commun. 156: 103019 (2024) - Haoxin Ma, Jiangyan Yi, Chenglong Wang, Xinrui Yan, Jianhua Tao, Tao Wang, Shiming Wang, Ruibo Fu:
CFAD: A Chinese dataset for fake audio detection. Speech Commun. 164: 103122 (2024) - Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini:
End-to-end integration of speech separation and voice activity detection for low-latency diarization of telephone conversations. Speech Commun. 161: 103081 (2024)
skipping 2,873 more matches
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-11-19 03:43 CET from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint