![](https://dblp.uni-trier.de/img/logo.ua.320x120.png)
![](https://dblp.uni-trier.de/img/dropdown.dark.16x16.png)
![](https://dblp.uni-trier.de/img/peace.dark.16x16.png)
Остановите войну!
for scientists:
![search dblp search dblp](https://dblp.uni-trier.de/img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de/img/search.dark.16x16.png)
default search action
EURASIP Journal on Audio, Speech, and Music Processing, Volume 2021
Volume 2021, Number 1, December 2021
- Jorge Llombart
, Dayana Ribas, Antonio Miguel, Luis Vicente
, Alfonso Ortega
, Eduardo Lleida:
Progressive loss functions for speech enhancement with deep neural networks. 1 - Yuval Dorfan, Boaz Schwartz, Sharon Gannot:
Forward-backward recursive expectation-maximization for concurrent speaker tracking. 2 - Vincent Lostanlen, Christian El-Hajj, Mathias Rossignol, Grégoire Lafay, Joakim Andén, Mathieu Lagrange
:
Time-frequency scattering accurately models auditory similarities between instrumental playing techniques. 3 - Yuki Saishu, Amir Hossein Poorjam
, Mads Græsbøll Christensen
:
A CNN-based approach to identification of degradations in speech signals. 1-10 - Chunyan Ji, Thosini Bamunu Mudiyanselage, Yutong Gao, Yi Pan
:
A review of infant cry analysis and classification. 1-17 - Rajat Hebbar
, Pavlos Papadopoulos, Ramon Reyes, Alexander F. Danvers, Angelina J. Polsinelli
, Suzanne A. Moseley, David A. Sbarra, Matthias R. Mehl, Shrikanth Narayanan:
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices. 1-8 - Sören Schulze
, Emily J. King
:
Sparse pursuit and dictionary learning for blind source separation in polyphonic music recordings. 1-25 - Bracha Laufer-Goldshtein, Ronen Talmon, Sharon Gannot
:
Audio source separation by activity probability detection with maximum correlation and simplex geometry. 1-16 - Norihide Kitaoka
, Bohan Chen, Yuya Obashi:
Dynamic out-of-vocabulary word registration to language model for speech recognition. 1-8 - Randall Ali
, Toon van Waterschoot, Marc Moonen:
An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones. 10 - Kacper Radzikowski
, Le Wang, Osamu Yoshie, Robert M. Nowak
:
Accent modification for speech recognition of non-native speakers using neural style transfer. 11 - Beatriz Martínez-González, José M. Pardo
, José A. Vallejo-Pinto, Rubén San Segundo
, Javier Ferreiros:
Analysis of transition cost and model parameters in speaker diarization for meetings. 12 - Sushmita Thakallapalli
, Suryakanth V. Gangashetty, Nilesh Madhu
:
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain. 13 - Zonglong Bai
, Liming Shi, Jesper Rindom Jensen
, Jinwei Sun, Mads Græsbøll Christensen
:
Acoustic DOA estimation using space alternating sparse Bayesian learning. 14 - Randall Ali
, Toon van Waterschoot, Marc Moonen:
Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones. 15 - Hodaya Hammer, Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot
:
Dynamically localizing multiple speakers based on the time-frequency domain. 16 - Yuxuan Ke, Andong Li, Chengshi Zheng, Renhua Peng, Xiaodong Li:
Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms. 17 - Duowei Tang
, Peter Kuppens, Luc Geurts, Toon van Waterschoot:
End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network. 18 - Sichen Liu, Feiran Yang
, Yin Cao, Jun Yang
:
Frequency-dependent auto-pooling function for weakly supervised sound event detection. 19 - Miao Liu, Jing Wang
, Weiming Yi, Fang Liu:
Neural network-based non-intrusive speech quality assessment using attention pooling function. 20 - Marco Gimm
, Philipp Bulling, Gerhard Schmidt:
Residual feedback suppression with extended model-based postfilters. 21 - Yong Lü, Han Lin
, Pingping Wu
, Yitao Chen:
Feature compensation based on independent noise estimation for robust speech recognition. 22 - Maoshen Jia
, Shang Gao, Changchun Bao:
Multi-source localization by using offset residual weight. 23 - Ziyi Xu
, Samy Elshamy, Ziyue Zhao, Tim Fingscheidt
:
Components loss for neural networks in mask-based speech enhancement. 24 - Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. 25 - Lujun Li
, Yikai Kang, Yuchen Shi, Ludwig Kürzinger
, Tobias Watzel, Gerhard Rigoll:
Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition. 26 - Gui-Xin Shi, Wei-Qiang Zhang
, Guan-Bo Wang, Jing Zhao, Shuzhou Chai, Ze-Yu Zhao:
Timestamp-aligning and keyword-biasing end-to-end ASR front-end for a KWS system. 27 - Alexandru-Lucian Georgescu
, Alessandro Pappalardo, Horia Cucu, Michaela Blott:
Performance vs. hardware requirements in state-of-the-art automatic speech recognition. 28 - Linhui Sun
, Yunyi Bu, Pingan Li, Zihao Wu:
Single-channel speech enhancement based on joint constrained dictionary learning. 29 - Mina Mounir
, Peter Karsmakers
, Toon van Waterschoot
:
Musical note onset detection based on a spectral sparsity measure. 30 - Masoud Geravanchizadeh
, Elnaz Forouhandeh, Meysam Bashirpour:
Feature compensation based on the normalization of vocal tract length for the improvement of emotion-affected speech recognition. 31 - Sujan Kumar Roy
, Kuldip K. Paliwal:
A noise PSD estimation algorithm using derivative-based high-pass filter in non-stationary noise conditions. 32 - Nili Cohen, Gershon Hazan, Boaz Schwartz, Sharon Gannot
:
An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method. 33 - Yanhua Long
, Shuang Wei, Jie Lian, Yijie Li:
Pronunciation augmentation for Mandarin-English code-switching speech recognition. 34 - Hongsheng Chen, Guoliang Chen, Kai Chen, Jing Lu:
Nonlinear residual echo suppression based on dual-stream DPRNN. 35 - David Ackermann, Fabian Brinkmann, Franz Zotter, Malte Kob, Stefan Weinzierl:
Comparative evaluation of interpolation methods for the directivity of musical instruments. 36 - Johannes M. Arend
, Tim Lübeck, Christoph Pörschmann
:
Efficient binaural rendering of spherical microphone array data by linear filtering. 37 - You-Siang Chen, Zi-Jie Lin, Mingsian R. Bai
:
A multichannel learning-based approach for sound source separation in reverberant environments. 38 - Diego Di Carlo
, Pinchas Tandeitnik, Cédric Foy, Nancy Bertin, Antoine Deleforge, Sharon Gannot:
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing. 39 - Fangkun Liu, Hui Wang, Renhua Peng, Chengshi Zheng, Xiaodong Li:
U2-VC: one-shot voice conversion using two-level nested U-structure. 40 - Yuancheng Luo:
Spherical harmonic covariance and magnitude function encodings for beamformer design. 41 - Zolzaya Byambadorj
, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka:
Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation. 42 - Ofer Schwartz, Sharon Gannot
:
A recursive expectation-maximization algorithm for speaker tracking and separation. 43 - Yuki Takashima, Ryoichi Takashima
, Ryota Tsunoda, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki, Nobuaki Motoyama:
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation. 44 - Jiacheng Yao, Jing Zhang, Jiafeng Li, Li Zhuo:
Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit. 45
![](https://dblp.uni-trier.de/img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.