default search action
31st EUSIPCO 2023: Helsinki, Finland
- 31st European Signal Processing Conference, EUSIPCO 2023, Helsinki, Finland, September 4-8, 2023. IEEE 2023, ISBN 978-9-4645-9360-0
- Nour Hobloss, Joshua Maraval, Jérôme Fournier, Nicolas Ramin, Lu Zhang:
MUSE: A Multi-view Synthesis Enhancer. 1-5 - Virginia Bordignon, Mert Kayaalp, Vincenzo Matta, Ali H. Sayed:
Social Learning with Non-Bayesian Local Updates. 1-5 - Xueqin Luo, Gongping Huang, Jingdong Chen, Jacob Benesty:
On the Design of Robust Differential Beamformers with Uniform Circular Microphone Arrays. 1-5 - Xiaoquan Li, Stephan Weiss, Yijun Yan, Yinhe Li, Jinchang Ren, John J. Soraghan, Ming Gong:
Siamese Residual Neural Network for Musical Shape Evaluation in Piano Performance Assessment. 1-5 - Emmanuel Martinez, Jorge Bacca, Tatiana Gelvez-Barrera, Henry Arguello:
ReDIP: Rethinking Deep Image Prior for Compressive Spectral Imaging Calibration. 1-5 - Arnab Neelim Mazumder, Niall Lyons, Ashutosh Pandey, Avik Santra, Tinoosh Mohsenin:
Harnessing the Power of Explanations for Incremental Training: A LIME-Based Approach. 1-5 - Fan Zhang, Chao Pan, Jacob Benesty, Jingdong Chen:
Simplified Maximum SNR Beamformers with Spatial Coherence Matrix Modeling. 6-10 - Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. 11-15 - Jinfu Wang, Feiran Yang, Jianfeng Guo, Jun Yang:
Robust Adaptation Control for Generalized Sidelobe Canceller with Time-Varying Gaussian Source Model. 16-20 - Shuming Luan, Yukoh Wakabayashi, Tomoki Toda:
Sound Field Interpolation with Unsupervised Calibration for Freely Spaced Circular Microphone Array in Rotation-Robust Beamforming. 21-25 - Shekhar Kumar Yadav, Nithin V. George:
Speech Enhancement via Maximum Likelihood Modal Beamforming with Complex Gaussian and Laplacian Priors. 26-30 - Esteban Gómez, Mohammad Hassan Vali, Tom Bäckström:
Low-Complexity Real-Time Neural Network for Blind Bandwidth Extension of Wideband Speech. 31-35 - Paul Magron, Tuomas Virtanen:
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints. 36-40 - Yichen Yang, Xianrui Wang, Andreas Brendel, Wen Zhang, Walter Kellermann, Jingdong Chen:
Geometrically Constrained Source Extraction and Dereverberation Based on Joint Optimization. 41-45 - Pedro J. Villasana T., Janusz Klejsa, Lars F. Villemoes, Per Hedelin:
Distribution Preserving Source Separation with Time Frequency Predictive Models. 46-50 - Yoshiaki Bando, Yoshiki Masuyama, Aditya Arie Nugraha, Kazuyoshi Yoshii:
Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation. 51-55 - Huaibo Zhao, Yosuke Higuchi, Yusuke Kida, Tetsuji Ogawa, Tetsunori Kobayashi:
Mask-CTC-Based Encoder Pre-Training for Streaming End-to-End Speech Recognition. 56-60 - Kishor Kayyar Lakshminarayana, Christian Dittmar, Nicola Pia, Emanuël A. P. Habets:
Low-Resource Text-to-Speech Using Specific Data and Noise Augmentation. 61-65 - Dushyant Sharma, Francesco Nespoli, Rong Gong, Patrick A. Naylor:
Canonical Voice Conversion and Dual-Channel Processing for Improved Voice Privacy of Speech Recognition Data. 66-70 - James Fosburgh, Dushyant Sharma, Patrick A. Naylor:
Room Adaptation of Training Data for Distant Speech Recognition. 71-75 - Shoko Niwa, Sayaka Shiota, Hitoshi Kiya:
A Privacy-Preserving Method Using Secret Key for Convolutional Neural Network-Based Speech Classification. 76-80 - Florian Hilgemann, Peter Jax:
Adaptive Feedback Active Noise Control by Robust Controller Interpolation. 81-85 - Felix Albu, Marius Giorgian Ionita:
Efficient Affine Projection Tanh Algorithm for Acoustic Feedback Cancellation. 86-90 - Tobias Kabzinski, Florian Hilgemann, Peter Jax:
Design of Crosstalk Cancellation Filters: Combining Inverse Filtering and Optimal Control. 91-95 - Takaaki Kojima, Kazuyuki Arikawa, Shoichi Koyama, Hiroshi Saruwatari:
Multichannel Active Noise Control with Exterior Radiation Suppression Based on Riemannian Optimization. 96-100 - Yurii Iotov, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Græsbøll Christensen:
Non-Stationary Prediction for Addressing the Non-Causality Problem in Fixed-Filter ANC Headphones for Speech Reduction. 101-105 - Christoph Weyer, Florian Hilgemann, Peter Jax:
Empirical Analysis of Waterbed-like Performance Trade-Off for a Feedforward Occlusion Effect Reduction System. 106-110 - Lars Thieling, Peter Jax:
Beyond Clean Phase: Using Silence-Generating Phase for DNN-Based Speech Enhancement. 111-115 - Louis Bahrman, Marina Krémé, Paul Magron, Antoine Deleforge:
Signal Inpainting from Fourier Magnitudes. 116-120 - Carlo Aironi, Samuele Cornell, Luca Serafini, Stefano Squartini:
A Time-Frequency Generative Adversarial Based Method for Audio Packet Loss Concealment. 121-125 - Louis Delebecque, Romain Serizel:
BinauRec: A dataset to test the influence of the use of room impulse responses on binaural speech enhancement. 126-130 - Georgiana-Elena Sfeclis, Ben Milner, Danny Websdale:
Investigating Imaginary Mask Estimation in Complex Masking for Speech Enhancement. 131-135 - Helena Cuesta, Nadine Kroher, Aggelos Pikrakis, Stojan Djordjevic:
DAACI-VoDAn: Improving Vocal Detection with New Data and Methods. 136-140 - Rikuto Ito, Natsuki Akaishi, Kohei Yatabe, Yasuhiro Oikawa:
On-Line Chord Recognition Using FifthNet with Synchrosqueezing Transform. 141-145 - Petros Vavaroutsos, Pantelis Vikatos:
HSP-TL: A Deep Metric Learning Model with Triplet Loss for Hit Song Prediction. 146-150 - Andres Fernandez:
Onsets and Velocities: Affordable Real-Time Piano Transcription Using Convolutional Neural Networks. 151-155 - Rashen Fernando, Pamudu Ranasinghe, Udula Ranasinghe, Janaka Wijayakulasooriya, Pantaleon Perera:
Hybrid Y-Net Architecture for Singing Voice Separation. 156-160 - Sascha Grollmisch, Estefanía Cano, Hanna M. Lukashevich, Jakob Abeßer:
Uncertainty in Semi-Supervised Audio Classification - A Novel Extension for FixMatch. 161-165 - Alessandro Ilic Mezza, Paolo Sani, Augusto Sarti:
Automatic TV Genre Classification Based on Visually-Conditioned Deep Audio Features. 166-170 - Zied Mnasri, Stefano Rovetta, Francesco Masulli:
Anomalous Sound Event Detection Based on One-Class Classification Using Variational Autoencoders and Interval Type-2 Fuzzy Sets. 171-175 - Tobias Morocutti, Florian Schmid, Khaled Koutini, Gerhard Widmer:
Device-Robust Acoustic Scene Classification via Impulse Response Augmentation. 176-180 - Shahed Masoudian, Khaled Koutini, Markus Schedl, Gerhard Widmer, Navid Rekabsaz:
Domain Information Control at Inference Time for Acoustic Scene Classification. 181-185 - Kevin Wilkinghoff, Fabian Fritz:
On Using Pre-Trained Embeddings for Detecting Anomalous Sounds with Limited Training Data. 186-190 - Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi, Masahiro Yasuda:
First-Shot Anomaly Sound Detection for Machine Condition Monitoring: A Domain Generalization Baseline. 191-195 - Paul M. Baggenstoss, Kevin Wilkinghoff:
Novel Generative Classifier for Acoustic Events. 196-200 - Kai Li, Dung Kim Tran, Xugang Lu, Masato Akagi, Masashi Unoki:
Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection. 201-205 - Natalia P. García-de-la-Puente, Félix Fuentes-Hurtado, Laura Fuster, Valery Naranjo, Gema Piñero:
Deep Learning Models for Gunshot Detection in the Albufera Natural Park. 206-210 - Candy Olivia Mawalim, Benita Angela Titalim, Shogo Okada, Masashi Unoki:
Auditory Model Optimization with Wavegram-CNN and Acoustic Parameter Models for Nonintrusive Speech Intelligibility Prediction in Hearing Aids. 211-215 - Yuta Ide, Naohiro Tawara, Susumu Saito, Teppei Nakano, Tetsuji Ogawa:
Voice or Content? - Exploring Impact of Speech Content on Age Estimation from Voice. 221-225 - Fredrik Cumlin, Christian Schüldt, Saikat Chatterjee:
Latent-Based Neural Net for Non-Intrusive Speech Quality Assessment. 226-230 - Ashish Panda, Rajul Acharya, Sunil Kumar Kopparapu:
Oral Fluency Classification for Speech Assessment. 231-235 - Ruchi Pandey, Shreyas Jaiswal, Huy Phan, Santosh Nannuru:
Improving audio event localization accuracy via derivative prediction. 236-240 - Reza Varzandeh, Simon Doclo, Volker Hohmann:
A Two-Stage CNN with Feature Reduction for Speech-Aware Binaural DOA Estimation. 241-245 - Rene Glitza, Luca Becker, Alexandru Nelus, Rainer Martin:
Database of Simulated Room Impulse Responses for Acoustic Sensor Networks Deployed in Complex Multi-Source Acoustic Environments. 246-250 - David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
Position Tracking of a Varying Number of Sound Sources with Sliding Permutation Invariant Training. 251-255 - Bing Zhu, Wen Zhang:
Reactive Intensity Vector Based Direct Path Detection for DoA Estimation on a Single Acoustic Vector Sensor. 256-260 - Daan Delabie, Chesney Buyle, Bert Cox, Liesbet Van der Perre, Lieven De Strycker:
An Acoustic Simulation Framework to Support Indoor Positioning and Data Driven Signal Processing Assessments. 261-265 - Aviad Eisenberg, Sharon Gannot, Shlomo E. Chazan:
A Two-Stage Speaker Extraction Algorithm Under Adverse Acoustic Conditions Using a Single-Microphone. 266-270 - Protima Nomo Sudro, Anton Ragni, Thomas Hain:
Adapting Pretrained Models for Adult to Child Voice Conversion. 271-275 - Ritujoy Biswas, Karan Nathwani, Vinayak Abrol:
Near-end Intelligibility Improvement Through Voice Transformation in Transfer Learning Framework. 276-280 - Seyun Um, Jihyun Kim, Jihyun Lee, Hong-Goo Kang:
Facetron: A Multi-Speaker Face-to-Speech Model Based on Cross-Modal Latent Representations. 281-285 - Tomoki Ariga, Yosuke Higuchi, Mitsunori Kanno, Rie Shigyo, Takato Mizuguchi, Naoki Okamoto, Tetsuji Ogawa:
Spotting Parodies: Detecting Alignment Collapse Between Lyrics and Singing Voice. 286-290 - Ünal Ege Gaznepoglu, Nils Peters:
Deep Learning-based F0 Synthesis for Speaker Anonymization. 291-295 - Shogo Seki, Kanami Imamura, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Noboru Harada:
W2N-AVSC: Audiovisual Extension For Whisper-To-Normal Speech Conversion. 296-300 - Yashish M. Siriwardena, Ahmed Adel Attia, Ganesh Sivaraman, Carol Y. Espy-Wilson:
Audio Data Augmentation for Acoustic-to-Articulatory Speech Inversion. 301-305 - Richard Füg:
Spectral Windowing for Enhanced Temporal Noise Shaping Analysis in Transform Audio Codecs. 306-310 - Jordi Pons, Joan Serrà, Santiago Pascual, Giulio Cengarle, Daniel Arteaga, Davide Scaini:
Upsampling Layers for Music Source Separation. 311-315 - Yuancheng Luo:
Active Barycentric Beamformed Stereo Upmixing. 316-320 - Yuki Nakamura, Taishi Nakashima, Nobutaka Ono, Ryoichi Miyazaki:
Unaliasing of Recorded Signals Based on Blind Source Separation. 321-325 - Kanami Imamura, Tomohiko Nakamura, Norihiro Takamune, Kohei Yatabe, Hiroshi Saruwatari:
Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides. 326-330 - William Ravenscroft, Stefan Goetze, Thomas Hain:
On Data Sampling Strategies for Training Neural Network Speech Separation Models. 331-335 - Thushara D. Abhayapala, Lachlan Birnie, Manish Kumar, Daniel Grixti-Cheng, Prasanga N. Samarasinghe:
Generalizing the Relative Transfer Function to a Matrix for Multiple Sources and Multichannel Microphones. 336-340 - Jesper Brunnström, Toon van Waterschoot, Marc Moonen:
Sound Zone Control for Arbitrary Sound Field Reproduction Methods. 341-345 - Konstantin Schmidt, Bernd Edler, Ahmed Mustafa Mahmoud, Guillaume Fuchs:
LPC-GAN for Speech Super-Resolution. 346-350 - Laura-Maria Dogariu, Constantin Paleologu, Jacob Benesty, Cristian Lucian Stanciu, Silviu Ciochina:
A Decomposition-Based Kalman Filter for the Identification of Acoustic Impulse Responses. 351-355 - Ziye Yang, Wenxing Yang, Kai Xie, Jie Chen:
Speech Dereverberation Using Weighted Prediction Error with Prior Learnt from Data. 356-360 - Sankha Subhra Bhattacharjee, Mads Græsbøll Christensen, Jacob Benesty:
Study of Sparsity Emanating from NKPD and its Utilization to Enhance NKPD based Adaptive Algorithms. 361-365 - Mohit Sharma, Marc Moonen:
Prototype filter design for weighted overlap-add filter bank based sub-band adaptive filtering applications. 366-370 - Miaomiao Wang, Hongsen He, Jingdong Chen, Jacob Benesty, Yi Yu:
A Recursive Least M-Estimate Adaptive Algorithm with Low Complexity for Active Control of Impulsive Noises. 371-375 - Huawei Zhang, Jihui Zhang, Fei Ma, Prasanga N. Samarasinghe, Huiyuan Sun:
A Time-Domain Multi-Channel Directional Active Noise Control System. 376-380 - Olympia Axelou, Pavlos Stoikos, George Floros, Nestor E. Evmorfopoulos, George I. Stamoulis:
A System Theoretic Approach for the Reduction of Large-Scale Room Acoustic Models. 381-385 - Pooja Kumawat, Aurobinda Routray:
Improving Speech Emotion Recognition with Data Expression Aware Multi-Task Learning. 386-390 - Paul Primus, Gerhard Widmer:
On Frequency-Wise Normalizations for Better Recording Device Generalization in Audio Spectrogram Transformers. 391-395 - Dejan Porjazovski, Tamás Grósz, Mikko Kurimo:
Topic Identification for Spontaneous Speech: Enriching Audio Features with Embedded Linguistic Information. 396-400 - Anna Ollerenshaw, Md Asif Jalal, Thomas Hain:
Probing Statistical Representations for End-to-End ASR. 401-405 - Yuya Yamamoto, Juhan Nam, Hiroko Terasawa:
PrimaDNN': A Characteristics-Aware DNN Customization for Singing Technique Detection. 406-410 - Mirela Gheorghe, Serban Mihalache, Dragos Burileanu:
Using Deep Neural Networks for Detecting Depression from Speech. 411-415 - Dalia Sherman, Gershon Hazan, Sharon Gannot:
Study of Speech Emotion Recognition Using Blstm with Attention. 416-420 - Rodrigo Borges, Marcelo Queiroz:
Audio-Based Sequential Music Recommendation. 421-425 - Mohammad Bokaei, Jesper Jensen, Simon Doclo, Jan Østergaard:
Deep Joint Source-Channel Analog Coding for Low-Latency Speech Transmission Over Gaussian Channels. 426-430 - Khazar Khorrami, María Andrea Cruz Blandón, Tuomas Virtanen, Okko Räsänen:
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System. 431-435 - Sahar Sadrizadeh, Clément Barbier, Ljiljana Dolamic, Pascal Frossard:
A Relaxed Optimization Approach for Adversarial Attacks Against Neural Machine Translation Models. 436-440 - Ozan Cakiroglu, Eduardo Pérez, Florian Roemer, Martin Schiffner:
Optimization of Transmission Parameters in Fast Pulse-Echo Ultrasound Imaging Using Sparse Recovery. 441-445 - Noumida Abdul Kareem, R. Mukund, N. Madhavan Nair, Rajeev Rajan:
Stacked Res2Net-CBAM with Grouped Channel Attention for Multi-Label Bird Species Classification. 446-450 - Florian Schmid, Khaled Koutini, Gerhard Widmer:
Low-Complexity Audio Embedding Extractors. 451-455 - Nour Aburaed, Mohammed Q. Alkhatib, Stephen Marshall, Jaime Zabalza, Hussain Al-Ahmad:
Attention-Infused 3D-SRCNN for Hyperspectral Image Super Resolution. 456-460 - Alvaro Lopez Paredes, Miguel Heredia Conde, Thoriq Ibrahim, Anh Ngoc Pham, Keiichiro Kagawa:
Spatio-Temporal Super-Resolution for CS-Based ToF 3D Imaging. 461-465 - Simon Mignon, Bruno Galerne, Moncef Hidane, Cécile Louchet, Julien Mille:
Semi-Unbalanced Regularized Optimal Transport for Image Restoration. 466-470 - Shuo Li, Mehrdad Yaghoobi:
Self-Supervised Hyperspectral Inpainting with the Optimisation inspired Deep Neural Network Prior. 471-475 - Faisal Ahmed, Miguel Heredia Conde, Paula López Martinez:
GIPS: Geometry-Inspired Passive ToF Sensing for 3D Depth Reconstruction. 476-480 - Irfan Manisali, Figen S. Oktem:
Deep Learning-Based Reconstruction for Near-Field MIMO Radar Imaging. 481-485 - Said Sadeg, Jean Cauzid, Cécile Fabre, Yingying Song, David Brie, El-Hadi Djermoune:
A Background Correction Algorithm for Hyperspectral Images. 486-490 - Jaelin Lee, Byeungwoo Jeon:
Motion Deblurring of RAW Mosaic Image Using Coded Exposure Photography. 491-495 - Okyanus Oral, Figen S. Oktem:
Plug-and-Play Reconstruction with 3D Deep Prior for Complex-Valued Near-Field MIMO Imaging. 496-500 - Kei Shibasaki, Masaaki Ikehara:
Pose-aware Disentangled Multiscale Transformer for Pose Guided Person Image Generation. 506-510 - Efstratios Kakaletsis, Nikos Nikolaidis:
Active Face Recognition Through View Synthesis. 511-515 - Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning. 516-520 - Yuxing Yang, Zeyu Fu, Syed Mohsen Naqvi:
A Scene-Adaptive Framework for Pose-Oriented Abnormal Event Detection. 521-525 - Alexandros Benetatos, Markos Diomataris, Vassilis Pitsikalis, Petros Maragos:
Generating Salient Scene Graphs with Weak Language Supervision. 526-530 - Elif Duygu Petenkaya, Ozge Basak Lacin, Omer Faruk Kara, Mehmet Türkan:
Underwater Image Dehazing via Red-Channel Recovery. 531-535 - Oguzhan Ulucan, Diclehan Ulucan, Marc Ebner:
Multi-Scale Block-Based Color Constancy. 536-540 - Diclehan Ulucan, Oguzhan Ulucan, Marc Ebner:
CC-NORD: A Camera-Invariant Global Color Constancy Dataset. 541-545 - Aytaç Özkan, Yi-Hsin Li, Thomas Sikora:
Steered-Mixture-of-Experts Regression for Image Denoising with Multi-Model Inference. 546-550 - Nils Mandischer, Sebastian Döbler, Burkhard Corves:
A Simple Screw-Hole Discrimination Pipeline for Deployment in Autonomous Manufacturing. 551-555 - Kevin Feghoul, Deise Santana Maia, M