


default search action
ICASSP 2024: Seoul, Korea
- IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. IEEE 2024, ISBN 979-8-3503-4485-1
- Jiwei Shen
, Hu Lu, Hao Zhang, Shujing Lyu, Yue Lu:
Enhanced Deep Reinforcement Learning for Parcel Singulation in Non-Stationary Environments. 1-5 - Yaowei Li, Yating Liu, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Bang Yang, Zhiqi Huang:
KC-Prompt: End-To-End Knowledge-Complementary Prompting for Rehearsal-Free Continual Learning. 1-5 - Miao Jiang, Min Li, Junxing Ren, Weiqing Huang:
HOICS: Zero-Shot Hoi Detection via Compatibility Self-Learning. 1-5 - Xuanhao Zhang, Hui Kou, Chenjie Xia, Hao Cai, Bo Liu:
Small-Footprint Automatic Speech Recognition System using Two-Stage Transfer Learning based Symmetrized Ternary Weight Network. 1-5 - Zhenjiao Liu, Xiao Wang, Xiaodi Huang, Guanlin Li, Ke Sun, Zhikui Chen:
Incomplete Multi-View Representation Learning Through Anchor Graph-Based GCN and Information Bottleneck. 1-5 - Samuel Fernández-Menduiña, Joshua Rapp, Hassan Mansour, M. Greiff, Kieran Parsons:
Tracking Beyond the Unambiguous Range with Modulo Single-Photon Lidar. 6-10 - Yhonatan Kvich, Yonina C. Eldar:
Modulo Sampling and Recovery in Shift-Invariant Spaces. 11-15 - Chaoqun Gong, Yuqin Dai, Ronghui Li, Achun Bao, Jun Li, Jian Yang, Yachao Zhang, Xiu Li:
Text2Avatar: Text to 3d Human Avatar Generation with Codebook-Driven Body Controllable Attribute. 16-20 - Tao Chen, Minxing Li, Ziming Liu:
The Joint Grid-Free DOA and Polarization Estimation Algorithm based on Atomic Norm Minimization. 21-25 - Shaolei Feng, Xiaoguang Lu, Deshana Kaushal Desai, Lei Guan:
A Learning-Based System for Automatic Intentional Non-Adherence Detection from Dosing Videos. 26-30 - Jingqing Ruan, Runpeng Xie, Xuantang Xiong, Shuang Xu, Bo Xu:
MaDE: Multi-Scale Decision Enhancement for Multi-Agent Reinforcement Learning. 31-35 - Yuanbo Wen, Tao Gao, Ziqi Li, Jing Zhang, Ting Chen:
Encoder-Minimal and Decoder-Minimal Framework for Remote Sensing Image Dehazing. 36-40 - Tao Chen, Qi An, Minxing Li:
An Error Self-Corrected DOA Estimation Model for Sparse Array Based on ANM. 41-45 - Yijia Zhang, Deepak Mishra, Hassan Habibi Gharakheili, Derrick Wing Kwan Ng:
UAV Operation Time Minimization for Wireless-Powered Data Collection. 46-50 - Christophe El Zeinaty
, Glenn Herrou, Wassim Hamidouche
, Daniel Ménard:
Dicetrack: Lightweight Dice Classification on Resource-Constrained Platforms with Optimized Deep Learning Models. 51-55 - Kaiyuan Hu, Hongjie Liao, Mingxiao Li, Fangxin Wang:
MMCOUNT: Stationary Crowd Counting System Based on Commodity Millimeter-Wave Radar. 56-60 - Zirui Wan, Saeid Sanei:
Crowd Modeling and Control Via Cooperative Adaptive Filtering. 61-65 - Pavlo Hilei, Marian Petruk
, Ievgen Korotkyi
, Oleg Farenyuk
:
Deep Learning AMR Model Inference Acceleration with CFU for Edge Systems. 66-70 - Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin:
Real-Time Stereo Speech Enhancement with Spatial-Cue Preservation Based on Dual-Path Structure. 71-75 - Deeksha Chandola, Enas Altarawneh, Michael Jenkin, Manos Papagelis:
SERC-GCN: Speech Emotion Recognition In Conversation Using Graph Convolutional Networks. 76-80 - Tenghao Cai, Lei Li
, Tsung-Hui Chang:
Sensing-Assisted Distributed User Scheduling and Beamforming in Muli-Cell mmWave Networks. 81-85 - Jiayuan Gao, Yingwei Zhang, Yiqiang Chen, Tengxiang Zhang, Boshi Tang, Xiaoyu Wang:
Unsupervised Human Activity Recognition Via Large Language Models and Iterative Evolution. 91-95 - Tao Chen, Ziming Liu, Lei Zhan:
ANM-Based Source Localization Under Mixed Field. 96-100 - Ran Wang, Jing Sun, Cheng Xu, Ruixue Li, Shihong Duan, Xiaotong Zhang:
Reinforcement Learning Compensated Filter for Multi-Agents Cooperative Localization. 101-105 - Entong He
, Yuxiang Yang, Chenshu Wu:
Quantum Ranging Enhanced TDoA Localization. 106-110 - Haoyu Wang, Jinbo Chen, Dongheng Zhang, Zhi Lu, Changwei Wu, Yang Hu, Qibin Sun, Yan Chen:
Contactless Radar Heart Rate Variability Monitoring Via Deep Spatio-Temporal Modeling. 111-115 - Nikolaos Palaiodimopoulos
, Vítor Fortes Rey, Matthias Tschöpe, Christina Jörg
, Paul Lukowicz, Maximilian Kiefer-Emmanouilidis:
Quantum Inspired Image Augmentation Applicable to Waveguides and Optical Image Transfer Via Anderson Localization. 116-120 - Anestis Kaimakamidis, Ioannis Pitas:
Political Tweet Sentiment Analysis for Public Opinion Polling. 121-125 - Victor R. J. Deville, C. M. Lievers, Jonathan H. Manton:
Enhanced Axle-Based Vehicle Classification Using Angle-Based Micro-Doppler Signature. 126-130 - Su Fong Chien, David Chieng
, Samuel Y. C. Chen, Charilaos C. Zarakovitis, Heng Siong Lim, Y. H. Xu:
Applying Hybrid Quantum LSTM for Indoor Localization Based on RSSI. 131-135 - Hengxi Zhang, Zhendong Shi, Yuanquan Hu, Wenbo Ding, Ercan E. Kuruoglu, Xiao-Ping Zhang:
Optimizing Trading Strategies in Quantitative Markets Using Multi-Agent Reinforcement Learning. 136-140 - Yan Zhang, Xin Liu, Zuping Zhang:
Motif-Matching Based Sub-Braingraph Level Networks for Noisy Resting-State fMRI Analysis. 141-145 - Judith Herrmann, Raphael Kunert, Ron Hachmon, Aviv Markus, Allison Gunby-Mann, Sarel Cohen, Tobias Friedrich, Peter Chin:
Detecting Continuous Gravitational Waves Using Generated Training Data. 146-150 - Titan Yuan, Filip Maksimovic, David C. Burnett, Kristofer S. J. Pister:
Hardware-Limited Time Constant Estimation Using a Weighted Linear Regression. 151-155 - Kunwar Pritiraj Rajput
, Linlong Wu, M. R. Bhavani Shankar, Pramod K. Varshney:
Joint Transmit Precoders and Passive Reflection Beamformer Design in IRS-Aided IoT Networks. 156-160 - Zhiqiang Zhou, Linxiao Yang, Qingsong Wen, Liang Sun:
RobustTSVar: A Robust Time Series Variance Estimation Algorithm. 161-165 - Xu Wang, Dongheng Zhang, Fengquan Zhan, Xuecheng Xie, Pengcheng Huang, Yang Hu, Yan Chen:
RoFi: Robust WiFi Intrusion Detection via Distribution Matching. 166-170 - Wuxia Hu, Yang Yang, Yonina C. Eldar, Chunyan Feng, Caili Guo:
Digital Task-Oriented Communication with Hardware-Limited Task-Based Quantization. 171-175 - Shuai Yang, Dongheng Zhang, Jinbo Chen, Fang Zhou, Guanzhong Wang, Qibin Sun, Yan Chen:
Automotive Radar Interference Mitigation Via SINR Maximization. 176-180 - Keshab K. Parhi
:
A Low-Latency Fft-Ifft Cascade Architecture. 181-185 - Seyed Ali Ghazi Asgar
, Kaan Sel, Anando Paul, Roderic I. Pettigrew, Roozbeh Jafari:
Cuffless Blood Pressure Estimation Using Magnetic Flux In A Ring Form Factor. 186-190 - Xuantang Xiong, Linghui Meng, Jingqing Ruan, Shuang Xu, Bo Xu:
UNeC: Unsupervised Exploring In Controllable Space. 191-195 - Jia-Yu Yang, Chih-I Ho, Pei-Yun Tsai, Hung-Ju Lin, Tzung-Dau Wang:
MAML-Based 24-Hour Personalized Blood Pressure Estimation from Wrist Photoplethysmography Signals in Free-Living Context. 196-200 - Shuyi Ren, Beichen Huang, Xiaoyang Li, Kaiming Shen:
Aerial-IRS-Assisted Load Balancing In Downlink Networks. 201-205 - Yu-Min Chiu, Ching-Te Chiu, Dao-Heng Luo:
Multi-Layer Relation Knowledge Distillation For Fingerprint Restoration. 206-210 - Toivo Henningson, Stefan Ingi Adalbjörnsson, Anders Berkeman, Carl Drougge, Xavante Erickson, Alexander Hunt:
A Concept for a Slam Back End Hardware Accelerator. 211-215 - Ganlin Zhang, Dongheng Zhang, Hongyu Deng, Yun Wu, Fengquan Zhan, Yan Chen:
Practical Challenge and Solution for IRS-Aided Indoor Localization System. 216-220 - Qu Yang, Qianhui Liu, Nan Li, Meng Ge, Zeyang Song, Haizhou Li:
SVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks. 221-225 - Zeyang Song, Jibin Wu, Malu Zhang, Mike Zheng Shou, Haizhou Li:
Spiking-Leaf: A Learnable Auditory Front-End for Spiking Neural Networks. 226-230 - Zheng Si, Chao Liu, Jianyu Liu, Yinhao Zhou:
Application of SNNS Model Based On Multi-Dimensional Attention In Drone Radio Frequency Signal Classification. 231-235 - Yize Sun, Jiarui Liu, Yunpu Ma, Volker Tresp:
Differentiable Quantum Architecture Search For Job Shop Scheduling Problem. 236-240 - Peichao Wang
, Qian He:
Low-Complexity GLRT Based Quickest Detection With Unknown Parameters. 241-245 - Irtaza Shahid, Khaldoon Al-Naimi, Ting Dang
, Yang Liu, Fahim Kawsar, Alessandro Montanari:
Towards Enabling DPOAE Estimation on Single-Speaker Earbuds. 246-250 - Bo Han, Liangjian Han:
Efficient 3D Position Estimation in Badminton Scene. 251-255 - Kevin Wilkinghoff
, Keisuke Imoto:
F1-EV score: Measuring The Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection. 256-260 - Xinlei Niu
, Jing Zhang, Christian Walder, Charles Patrick Martin:
SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation. 261-265 - Christopher Hahne, Michel Hayoz, Raphael Sznitman:
StofNet: Super-Resolution Time of Flight Network. 266-270 - Yiming Li, Xiangdong Wang, Hong Liu, Rui Tao, Long Yan, Kazushige Ouchi:
Semi-Supervised Sound Event Detection with Local and Global Consistency Regularization. 271-275 - Kevin Wilkinghoff:
Self-Supervised Learning for Anomalous Sound Detection. 276-280 - Yushu Wu, Xiao Quan, Mohammad Rasool Izadi, Chuan-Che Jeff Huang:
"It os Okay to be Uncommon": Quantizing Sound Event Detection Networks on Hardware Accelerators with Uncommon Sub-Byte Support. 281-285 - Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan:
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning. 286-290 - Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang:
CED: Consistent Ensemble Distillation for Audio Tagging. 291-295 - Ali Gökçe
, Hüseyin Hacihabiboglu:
Semi-Blind Estimation of Direct-to-Reverberant Energy Ratio Using Residual Energy Test Statistics. 296-300 - Haojie Wei, Xueke Cao, Wenbo Xu, Tangpeng Dan, Yueguo Chen:
DJCM: A Deep Joint Cascade Model for Singing Voice Separation and Vocal Pitch Estimation. 301-305 - Rhiannon Mogridge, George Close, Robert Sutherland, Thomas Hain
, Jon Barker, Stefan Goetze
, Anton Ragni:
Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models. 306-310 - Jiayi Zhang, Rita Singh:
Vocal Fold Dynamics for Automatic Detection of Amyotrophic Lateral Sclerosis from Voice. 311-315 - Shih-Lun Wu, Xuankai Chang, Gordon Wichern, Jee-Weon Jung, François G. Germain, Jonathan Le Roux, Shinji Watanabe:
Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation. 316-320 - Yoshihide Tomita, Shoichi Koyama, Hiroshi Saruwatari:
Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression. 321-325 - Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance. 326-330 - Mahesh Kumar Nandwana, Yifan He, Joseph Liu
, Xiao Yu, Charles Shang, Eloi du Bois, Morgan McGuire, Kiran Bhat:
Voice Toxicity Detection Using Multi-Task Learning. 331-335 - Benjamin Elizalde, Soham Deshmukh, Huaming Wang:
Natural Language Supervision For General-Purpose Audio Representations. 336-340 - Yao Qiu, Jinchao Zhang, Yong Shan, Jie Zhou:
Enhancing Note-Level Singing Transcription Model with Unlabeled and Weakly Labeled Data. 341-345 - Yo Sasaki, Yasushige Nakayama:
Simultaneous Interior and Exterior Sound Field Synthesis Using Cylindrical and Spherical Loudspeaker Arrays. 346-350 - George Close, William Ravenscroft, Thomas Hain
, Stefan Goetze
:
Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement. 351-355 - Johannes Zeitler
, Michael Krause
, Meinard Müller:
Soft Dynamic Time Warping with Variable Step Weights. 356-360 - Yi-Chiao Wu, Dejan Markovic, Steven Krenn, Israel D. Gebru, Alexander Richard:
ScoreDec: A Phase-Preserving High-Fidelity Audio Codec with a Generalized Score-Based Diffusion Post-Filter. 361-365 - Ali Vosoughi, Luca Bondi, Ho-Hsiang Wu, Chenliang Xu:
Learning Audio Concepts from Counterfactual Natural Language. 366-370 - Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. 371-375 - Pranay Manocha, Donald Williamson
, Adam Finkelstein:
Corn: Co-Trained Full- and No-Reference Speech Quality Assessment. 376-380 - Jozef Coldenhoff, Andrew Harper, Paul Kendrick, Tijana Stojkovic, Milos Cernak:
Multi-Channel Mosra: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and A Teacher Model. 381-385 - Idan Cohen, Sharon Gannot, Ofir Lindenbaum:
Unsupervised Acoustic Scene Mapping Based on Acoustic Features and Dimensionality Reduction. 386-390 - Manuel Milling, Andreas Triantafyllopoulos, Iosif Tsangko, Simon David Noel Rampp, Björn Wolfgang Schuller:
Bringing the Discussion of Minima Sharpness to the Audio Domain: A Filter-Normalised Evaluation for Acoustic Scene Classification. 391-395 - Chih-Cheng Chang, Li Su:
Beast: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. 396-400 - Yiqun Zhang, Xinmeng Xu, Weiping Tu:
Improving Acoustic Echo Cancellation by Exploring Speech and Echo Affinity with Multi-Head Attention. 401-405 - Pavan Seshadri, Chaeyeon Han, Bon-Woo Koo
, Noah Posner, Subhrajit Guhathakurta
, Alexander Lerch:
ASPED: An Audio Dataset for Detecting Pedestrians. 406-410 - Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita:
Environmental Sound Synthesis from Vocal Imitations and Sound Event Labels. 411-415 - Mattes Ohlenbusch
, Christian Rollwage, Simon Doclo:
Multi-Microphone Noise Data Augmentation for DNN-Based Own Voice Reconstruction for Hearables in Noisy Environments. 416-420 - Shulin He, Jinjiang Liu, Hao Li, Yang Yang, Fei Chen, Xueliang Zhang:
3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications. 421-425 - Yi Luo, Rongzhi Gu:
Improving Music Source Separation with Simo Stereo Band-Split Rnn. 426-430 - Yichi Wang, Jie Zhang, Shihao Chen, Weitai Zhang, Zhongyi Ye, Xinyuan Zhou, Lirong Dai:
A Study of Multichannel Spatiotemporal Features and Knowledge Distillation on Robust Target Speaker Extraction. 431-435 - Clara Borrelli, James Rae, Dogac Basaran, Matt McVicar, Mehrez Souden, Matthias Mauch:
Resource-Constrained Stereo Singing Voice Cancellation. 436-440 - Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Woon-Seng Gan:
Unsupervised Learning Based End-to-End Delayless Generative Fixed-Filter Active Noise Control. 441-445 - Younglo Lee, Shukjae Choi, Byeong-Yeol Kim, Zhongqiu Wang, Shinji Watanabe:
Boosting Unknown-Number Speaker Separation with Transformer Decoder-Based Attractor. 446-450 - Youqiang Zheng, Weiping Tu, Li Xiao, Xinmeng Xu:
Srcodec: Split-Residual Vector Quantization for Neural Speech Codec. 451-455 - Haocheng Guo, Xiaohuai Le, Kai Chen, Jing Lu:
A Light-Weight State Detection Model for Kalman-Filter-Based Acoustic Feedback Cancellation with Rapid Recovery from Abrupt Path Changes. 456-460 - Anbin Qi, Xiang Xie, Jing Wang:
Mtdiffusion: Multi-Task Diffusion Model With Dual-Unet for Foley Sound Generation. 461-465 - Chenglong Jiang
, Ying Gao
, Hao Jin, Linrong Pan, Wing W. Y. Ng:
Fastmandarin: Efficient Local Modeling for Natural Mandarin Speech Synthesis. 461-465 - Shrishti Saha Shetu, Soumitro Chakrabarty, Oliver Thiergart, Edwin Mabande:
Ultra Low Complexity Deep Learning Based Noise Suppression. 466-470 - Carlotta Anemüller, Oliver Thiergart, Emanuël A. P. Habets:
Binaural Rendering of Heterogeneous Sound Sources with Extent. 471-475 - Jan Büthe, Ahmed Mustafa, Jean-Marc Valin, Karim Helwani, Michael M. Goodwin:
NOLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping. 476-480 - Wei Tsung Lu, Ju-Chiang Wang, Qiuqiang Kong, Yun-Ning Hung:
Music Source Separation With Band-Split Rope Transformer. 481-485 - Yiming Li, Xiangdong Wang, Hong Liu:
Audio-Free Prompt Tuning for Language-Audio Models. 491-495 - Pengyu Wang
, Xiaofei Li:
RVAE-EM: Generative Speech Dereverberation Based On Recurrent Variational Auto-Encoder And Convolutive Transfer Function. 496-500 - Weilong Huang, Cheng Xue, Jinwei Feng, W. Bastiaan Kleijn
:
A Practical Online Multichannel Dereverberation Approach with Data-Reuse Technique. 501-505 - Yile Angela Zhang, Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Amy Bastine
:
An Active Noise Control System Based On Soundfield Interpolation Using A Physics-Informed Neural Network. 506-510 - Fan Zhang, Chao Pan, Jacob Benesty, Jingdong Chen:
Directional Gain Based Noise Covariance Matrix Estimation for MVDR Beamforming. 511-515 - Soonhyeon Choi, Jung-Woo Choi:
Noisy-Arcmix: Additive Noisy Angular Margin Loss Combined With Mixup For Anomalous Sound Detection. 516-520 - Dichucheng Li, Yinghao Ma, Weixing Wei, Qiuqiang Kong, Yulun Wu, Mingjin Che, Fan Xia, Emmanouil Benetos, Wei Li:
Mertech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model with Multi-Task Finetuning. 521-525 - Haesun Joung, Kyogu Lee:
Music Auto-Tagging with Robust Music Representation Learned via Domain Adversarial Training. 526-530 - Théo Mariotte
, Antonio Almudévar, Marie Tahon, Alfonso Ortega Giménez
:
An Explainable Proxy Model for Multilabel Audio Segmentation. 531-535 - Jae-Won Kim, Byeongho Jo, Seungkwon Beack, Hochong Park:
Pre-Echo Reduction in Transform Audio Coding via Temporal Envelope Control with Machine Learning Based Estimation. 536-540 - Wuyang Liu
, Yanzhen Ren:
Semantic Proximity Alignment: Towards Human Perception-Consistent Audio Tagging by Aligning with Label Text Description. 541-545 - Jordi Pons, Xiaoyu Liu, Santiago Pascual, Joan Serrà:
GASS: Generalizing Audio Source Separation with Large-Scale Data. 546-550 - An-Yan Chang, Jing-Tong Tzeng
, Huan-Yu Chen, Chih-Wei Sung, Chun-Hsiang Huang, Edward Pei-Chuan Huang
, Chi-Chun Lee:
GaP-Aug: Gamma Patch-Wise Correction Augmentation Method for Respiratory Sound Classification. 551-555 - Srikanth Burra, Asutosh Kar, Mads Græsbøll Christensen
:
Conjugate Gradient Based Adaptive Algorithm for Nonlinear AEC. 556-560 - Keigo Wakayama
, Tsubasa Ochiai, Marc Delcroix, Masahiro Yasuda, Shoichiro Saito, Shoko Araki, Akira Nakayama:
Online Target Sound Extraction with Knowledge Distillation from Partially Non-Causal Teacher. 561-565 - Youqiang Zheng, Weiping Tu, Li Xiao, Xinmeng Xu:
SuperCodec: A Neural Speech Codec with Selective Back-Projection Network. 566-570 - Guochen Yu, Xiguang Zheng, Nan Li, Runqiang Han, Chengshi Zheng, Chen Zhang, Chao Zhou, Qi Huang, Bing Yu:
BAE-Net: a Low Complexity and High Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution. 571-575 - Ron Moisseev
, Gal Itzhak, Israel Cohen:
Array Geometry Optimization for Region-of-Interest Near-Field Beamforming. 576-580 - Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. 581-585 - Liang Xu, Jing Wang, Jianqian Zhang, Xiang Xie:
LightCodec: A High Fidelity Neural Audio Codec with Low Computation Complexity. 586-590 - Zhihao Du, Shiliang Zhang, Kai Hu, Siqi Zheng:
FunCodec: A Fundamental, Reproducible and Integrable Open-Source Toolkit for Neural Speech Codec. 591-595 - Carlos Hernandez-Olivan, Koichi Saito, Naoki Murata, Chieh-Hsin Lai, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Yuki Mitsufuji:
VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance. 596-600 - Satoru Emura:
Permutation-Alignment Method Using Manifold Optimization for Frequency-Domain Blind Source Separation. 601-605 - Zhijian Jiang, Haoming Li, Nengheng Zheng:
Two-Stage Acoustic Echo Cancellation Network with Dual-Path Alignment. 606-610 - Satvik Venkatesh, Arthur Benilov, Philip Coleman, Frederic Roskam:
Real-Time Low-Latency Music Source Separation Using Hybrid Spectrogram-Tasnet. 611-615 - Amal Emthyas, Sebastià V. Amengual Garí, Enzo De Sena
:
Binaural Room Transfer Function Interpolation Via System Inversion. 616-620 - Hassan Taherian, Ashutosh Pandey, Daniel Wong, Buye Xu, DeLiang Wang:
Leveraging Sound Localization to Improve Continuous Speaker Separation. 621-625 - Bunlong Lay, Jean-Marie Lemercier, Julius Richter, Timo Gerkmann:
Single and Few-Step Diffusion for Generative Speech Enhancement. 626-630 - Stefano Damiano
, Luca Bondi, Shabnam Ghaffarzadegan, Andre Guntoro, Toon van Waterschoot:
Can Synthetic Data Boost the Training of Deep Acoustic Vehicle Counting Networks? 631-635 - Kazuki Shimada, Kengo Uchida, Yuichiro Koyama, Takashi Shibuya
, Shusuke Takahashi, Yuki Mitsufuji, Tatsuya Kawahara:
Zero- and Few-Shot Sound Event Localization and Detection. 636-640 - Xiaoli Tang, Jihui Aimee Zhang
, Thushara D. Abhayapala:
Active Noise Control Over A Large Region with Multiple Spherical Microphone Arrays In Wave Domain. 641-645 - Mayuka Kono, Yutaro Hirao, Monica Perusquía-Hernández, Naoya Isoyama, Hideaki Uchiyama, Nobuchika Sakata, Jun Takamatsu, Kiyoshi Kiyokawa:
U2R: Underwater Ultrasonic Reflection Wave Dataset Toward Pose-Invariant Material Recognition. 646-650 - Guendalina Milano, Oliver Thiergart, Emanuël A. P. Habets:
Sector-Based Interference Cancellation for Robust Keyword Spotting Applications Using an Informed MPDR Beamformer. 651-655 - Huaying Xue, Xiulian Peng, Yan Lu:
Low-Latency Speech Enhancement via Speech Token Generation. 661-665 - Bar Shaybet, Anurag Kumar, Vladimir Tourbabin, Boaz Rafaely:
Ambisonics Networks - The Effect of Radial Functions Regularization. 666-670 - Matthew C. McCallum, Matthew E. P. Davies, Florian Henkel, Jaehun Kim, Samuel E. Sandberg:
On The Effect Of Data-Augmentation On Local Embedding Properties In The Contrastive Learning Of Music Audio Representations. 671-675 - Jonah Casebeer, Junkai Wu, Paris Smaragdis:
Meta-AF Echo Cancellation for Improved Keyword Spotting. 676-680 - Vikas Tokala, Eric Grinstein, Mike Brookes, Simon Doclo, Jesper Jensen, Patrick A. Naylor
:
Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks. 681-685 - Matthew C. McCallum, Florian Henkel, Jaehun Kim, Samuel E. Sandberg, Matthew E. P. Davies:
Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search. 686-690 - Gyuhak Kim, Ho-Hsiang Wu, Luca Bondi, Bing Liu:
Multi-Modal Continual Pre-Training For Audio Encoders. 691-695 - Babak Naderi, Ross Cutler, Nicolae-Catalin Ristea:
Multi-Dimensional Speech Quality Assessment in Crowdsourcing. 696-700 - Mikko Heikkinen, Archontis Politis
, Tuomas Virtanen:
Neural Ambisonics Encoding For Compact Irregular Microphone Arrays. 701-705 - María Alfaro-Contreras, Antonio Ríos-Vila, Jose J. Valero-Mas, Jorge Calvo-Zaragoza:
A Transformer Approach for Polyphonic Audio-to-Score Transcription. 706-710 - Hao Zhang, Yixuan Zhang, Meng Yu, Dong Yu:
Advancing Acoustic Howling Suppression Through Recursive Training of Neural Networks. 711-715 - Yuanbo Hou, Qiaoqiao Ren, Siyang Song, Yuxin Song, Wenwu Wang, Dick Botteldooren:
Multi-Level Graph Learning For Audio Event Classification And Human-Perceived Annoyance Rating Prediction. 716-720 - Cong Han, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Unsupervised Multi-Channel Separation And Adaptation. 721-725 - Riku Arakawa, Mathieu Parvaix, Chiong Lai, Hakan Erdogan, Alex Olwal:
Quantifying The Effect Of Simulator-Based Data Augmentation For Speech Recognition On Augmented Reality Glasses. 726-730 - Daniel Fejgin, Elior Hadad, Sharon Gannot, Zbynek Koldovský, Simon Doclo:
Comparison Of Frequency-Fusion Mechanisms For Binaural Direction-Of-Arrival Estimation For Multiple Speakers. 731-735 - Jens Heitkaemper, Arun Narayanan, Turaj Zakizadeh Shabestary, Sankaran Panchapagesan, James Walker, Bhalchandra Gajare, Shlomi Regev, Ajay Dudani, Alexander Gruenstein:
Improving Acoustic Echo Cancellation for Voice Assistants Using Neural Echo Suppression and Multi-Microphone Noise Reduction. 736-740 - Ke Chen, Jiaqi Su, Zeyu Jin:
MDX-GAN: Enhancing Perceptual Quality in Multi-Class Source Separation Via Adversarial Training. 741-745 - Karn N. Watcharasupat, Alexander Lerch:
Quantifying Spatial Audio Quality Impairment. 746-750 - Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar:
A Closer Look at Wav2vec2 Embeddings for On-Device Single-Channel Speech Enhancement. 751-755 - Kunxing Lu, Xianrui Wang, Tetsuya Ueda, Shoji Makino, Jingdong Chen:
A Computationally Efficient Semi-Blind Source Separation Approach for Nonlinear Echo Cancellation Based on an Element-Wise Iterative Source Steering. 756-760 - Luca Della Libera, Cem Subakan, Mirco Ravanelli, Samuele Cornell
, Frédéric Lepoutre, François Grondin:
Resource-Efficient Separation Transformer. 761-765 - Yu Du, Xu Liu, Yansong Chua:
Spiking Structured State Space Model for Monaural Speech Enhancement. 766-770 - Jinhua Liang, Huy Phan, Emmanouil Benetos:
Learning from Taxonomy: Multi-Label Few-Shot Classification for Everyday Sound Recognition. 771-775 - Xudong Zhao
, Xueqin Luo, Gongping Huang, Jingdong Chen, Jacob Benesty:
Differential Beamforming with Null Constraints for Spherical Microphone Arrays. 776-780 - Yang Xiang, Jingguang Tian, Xinhui Hu, Xinkang Xu, Zhaohui Yin:
A Deep Representation Learning-Based Speech Enhancement Method Using Complex Convolution Recurrent Variational Autoencoder. 781-785 - Yichen Yang, Haowen Li, Xianrui Wang, Wen Zhang, Shoji Makino, Jingdong Chen:
Stereophonic Music Source Separation with Spatially-Informed Bridging Band-Split Network. 786-790 - Mahmoud Namazi, Kenneth Rose:
Ultra-Low Delay Lossless Compression of Higher Order Ambisonics. 791-795 - Gaël Le Lan, Varun Nagaraja, Ernie Chang, David Kant, Zhaoheng Ni, Yangyang Shi, Forrest N. Iandola, Vikas Chandra:
Stack-and-Delay: A New Codebook Pattern for Music Generation. 796-800 - Jayeon Yi, Junghyun Koo, Kyogu Lee:
DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper. 801-805 - Li Li, Shogo Seki:
Remixed2remixed: Domain Adaptation for Speech Enhancement by Noise2noise Learning with Remixing. 806-810 - Wei-Yang Lin, Yu-Chiang Frank Wang, Li Su:
Enhancing Violin Fingering Generation through Audio-Symbolic Fusion. 811-815 - Lior Arbel, Ishwarya Ananthabhotla, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely:
On HRTF Notch Frequency Prediction using Anthropometric Features and Neural Networks. 816-820 - Yiqiang Cai, Peihong Zhang, Shengchen Li:
TF-SepNet: An Efficient 1D Kernel Design in Cnns for Low-Complexity Acoustic Scene Classification. 821-825 - Seungheon Doh, Minhee Lee, Dasaem Jeong
, Juhan Nam:
Enriching Music Descriptions with A Finetuned-LLM and Metadata for Text-to-Music Retrieval. 826-830 - Ryandhimas E. Zezario
, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao:
Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model. 831-835 - Matteo Torcoli, Chih-Wei Wu, Sascha Dick
, Phillip A. Williams, Mhd Modar Halimeh, William Wolcott, Emanuël A. P. Habets:
Odaq: Open Dataset of Audio Quality. 836-840 - Pil Moo Byun, Joon-Hyuk Chang:
Generalized Specaugment via Multi-Rectangle Inverse Masking For Acoustic Scene Classification. 841-845 - Tal Peer, Simon Welker, Johannes Kolhoff, Timo Gerkmann:
A Flexible Online Framework for Projection-Based Stft Phase Retrieval. 846-850 - Hanyue Liu, Miao Liu, Jing Wang, Xiang Xie, Lidong Yang:
Non-Intrusive Speech Quality Assessment with Multi-Task Learning Based on Tensor Network. 851-855 - Côme Peladeau, Geoffroy Peeters:
Blind Estimation of Audio Effects Using an Auto-Encoder Approach and Differentiable Digital Signal Processing. 856-860 - Zixing Zhang, Tao Pang, Jing Han, Björn W. Schuller:
Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation. 861-865 - Zeyu Xie, Baihan Li, Xuenan Xu, Mengyue Wu, Kai Yu:
Enhancing Audio Generation Diversity with Visual Information. 866-870 - Kazuki Matsumoto, Kohei Yatabe:
Determined BSS by Combination of IVA and DNN via Proximal Average. 871-875 - Wangjin Zhou, Zhengdong Yang, Chenhui Chu, Sheng Li
, Raj Dabre, Yi Zhao, Tatsuya Kawahara:
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction. 876-880 - Huy Phan, Byeonggeun Kim, Vu Nguyen, Andrew Bydlon, Qingming Tang, Chieh-Chi Kao, Chao Wang:
Cross-Triggering Issue in Audio Event Detection and Mitigation. 881-885 - Alireza Nezamdoust, Mario Huemer, Aurelio Uncini, Danilo Comminiello:
Efficient Functional Link Adaptive Filters Based On Nearest Kronecker Product Decomposition. 886-890 - Stepan Shishkin, Danilo Hollosi, Stefan Goetze, Simon Doclo:
Active Learning for Sound Event Classification Using Bayesian Neural Networks with Gaussian Variational Posterior. 896-900 - Aolin Hu, Xueshuai Zhang, Shaoxing Zhang, Pengyuan Zhang, Yu Lu, Pengfei Ye, Qingwei Zhao, Yonghong Yan:
Snore Sound Features Based on Percussive Enhancing and Positional Encoding Combined with Multi-Task Learning for Osahs Detection. 901-905 - Nikolay D. Gaubitch, David Looney:
On The Role of Room Acoustics in Audio Presentation Attack Detection. 906-910 - Nian Shao, Xian Li, Xiaofei Li:
Fine-Tune the Pretrained ATST Model for Sound Event Detection. 911-915 - Manjunath Mulimani, Annamaria Mesaros
:
Class-Incremental Learning for Multi-Label Audio Classification. 916-920 - David Sundström
, Filip Elvander
, Andreas Jakobsson
:
Estimation of Impulse Responses for a Moving Source Using Optimal Transport Regularization. 921-925 - Yiyuan Yang, Kaichen Zhou, Niki Trigoni, Andrew Markham:
SSL-Net: A Synergistic Spectral and Learning-Based Network for Efficient Bird Sound Classification. 926-930 - Jiakun Shen, Xueshuai Zhang, Pengyuan Zhang, Yonghong Yan, Qingwei Zhao, Ta Li, Yanfen Tang, Shaoxing Zhang:
One-Epoch Training with Single Test Sample in Test Time for Better Generalization of Cough-Based Covid-19 Detection Model. 931-935 - Marco Comunità, Riccardo F. Gramaccioni, Emilian Postolache, Emanuele Rodolà, Danilo Comminiello, Joshua D. Reiss:
Syncfusion: Multimodal Onset-Synchronized Video-to-Audio Foley Synthesis. 936-940 - Zhiwei Lin, Jun Chen, Boshi Tang, Binzhu Sha, Jing Yang, Yaolong Ju, Fan Fan, Shiyin Kang, Zhiyong Wu, Helen Meng:
Multi-View Midivae: Fusing Track- and Bar-View Representations for Long Multi-Track Symbolic Music Generation. 941-945 - Gaël Richard, Pierre Chouteau, Bernardo Torres:
A Fully Differentiable Model for Unsupervised Singing Voice Separation. 946-950 - Manvi Agarwal, Changhong Wang, Gaël Richard:
Structure-Informed Positional Encoding for Music Generation. 951-955 - Antonin Gagneré, Slim Essid, Geoffroy Peeters:
Adapting Pitch-Based Self Supervised Learning Models for Tempo Estimation. 956-960 - Yuanyuan Wang, Hangting Chen, Dongchao Yang, Jianwei Yu, Chao Weng, Zhiyong Wu, Helen Meng:
Consistent and Relevant: Rethink the Query Embedding in General Sound Separation. 961-965 - Yuankun Xie, Haonan Cheng, Yutian Wang, Long Ye:
An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially Spoofed Audio Detection. 966-970 - Xiaobin Rong, Tianchi Sun, Xu Zhang, Yuxiang Hu, Changbao Zhu, Jing Lu:
GTCRN: A Speech Enhancement Model Requiring Ultralow Computational Resources. 971-975 - Aurian Quelennec
, Michel Olvera, Geoffroy Peeters, Slim Essid:
On The Choice of the Optimal Temporal Support for Audio Classification with Pre-Trained Embeddings. 976-980 - Rong Xie, Anqi Tu, Chuang Shi, Stephen Elliott, Huiyong Li, Le Zhang:
Cognitive Virtual Sensing Technique for Feedforward Active Noise Control. 981-985 - Teysir Baoueb, Haocheng Liu, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard:
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis. 986-990 - Inseon Jang, Haici Yang, Wootaek Lim, Seungkwon Beack, Minje Kim:
Personalized Neural Speech Codec. 991-995 - Yuliang Zhang, Roberto Togneri
, David Huang
:
A Unified Loss Function to Tackle Inter-Class and Intra-Class Data Imbalance in Sound Event Detection. 996-1000 - Qiquan Zhang, Meng Ge, Hongxu Zhu, Eliathamby Ambikairajah, Qi Song, Zhaoheng Ni, Haizhou Li:
An Empirical Study on the Impact of Positional Encoding in Transformer-Based Monaural Speech Enhancement. 1001-1005 - Vasudha Sathyapriyan
, Michael Syskind Pedersen, Mike Brookes, Jan Østergaard
, Patrick A. Naylor, Jesper Jensen:
Speech Enhancement in Hearing Aids Using Target Speech Presence Estimation Based on a Delayed Remote Microphone Signal. 1006-1010 - Alessandro Ragano
, Jan Skoglund, Andrew Hines
:
NOMAD: Unsupervised Learning of Perceptual Embeddings For Speech Enhancement and Non-Matching Reference Audio Quality Assessment. 1011-1015 - Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. 1016-1020 - Yadong Guan, Jiqing Han, Hongwei Song, Wenjie Song, Guibin Zheng, Tieran Zheng, Yongjun He:
Contrastive Loss Based Frame-Wise Feature Disentanglement for Polyphonic Sound Event Detection. 1021-1025 - Shiqi Zhang, Zheng Qiu, Daiki Takeuchi, Noboru Harada, Shoji Makino:
Unrestricted Global Phase Bias-Aware Single-Channel Speech Enhancement with Conformer-Based Metric Gan. 1026-1030 - Mihailo Kolundzija, Mathew Shaji Kavalekalam, Ivana Balic, Michelle Mao, Raúl Casas:
Low Bitrate Loss Resilience Scheme for a Speech Enhancing Neural Codec. 1031-1035 - Yin-Jyun Luo, Sebastian Ewert, Simon Dixon:
Unsupervised Pitch-Timbre Disentanglement of Musical Instruments Using a Jacobian Disentangled Sequential Autoencoder. 1036-1040 - Shota Okubo, Toshiharu Horiuchi:
Three-Dimensional Sound Wave Propagation Reproduction by CE-FDTD Simulation Applying Actual Radiation Characteristics. 1041-1045 - Zhiheng Wang, Hongsen He, Jingdong Chen, Jacob Benesty, Yi Yu
:
A Steered Response Power Approach with Bilinear Prediction-Based Trade-Off Prewhitening for Speaker Localization. 1046-1050 - Xavier Riley, Drew Edwards, Simon Dixon:
High Resolution Guitar Transcription Via Domain Adaptation. 1051-1055 - Tong Xiao
, Simon Doclo:
Effect of Target Signals and Delays on Spatially Selective Active Noise Control for Open-Fitting Hearables. 1056-1060 - Tony Alex, Sara Ahmed, Armin Mustafa, Muhammad Awais, Philip J. B. Jackson:
Max-AST: Combining Convolution, Local and Global Self-Attentions for Audio Event Classification. 1061-1065 - Shuhua Liu, Chunyu Zhang, Binshuai Li, Niantong Qin, Huanting Cheng, Huayu Zhang:
TIA: A Teaching Intonation Assessment Dataset in Real Teaching Situations. 1066-1070 - Shuai Yu
, Jun Liu, Yi Yu, Wei Li:
A Scalable Sparse Transformer Model for Singing Melody Extraction. 1071-1075 - Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley:
Audiosr: Versatile Audio Super-Resolution at Scale. 1076-1080 - Manos Plitsis, Theodoros Kouzelis, Georgios Paraskevopoulos, Vassilis Katsouros, Yannis Panagakis:
Investigating Personalization Methods in Text to Music Generation. 1081-1085 - Akshay Raina
, Sayeedul Islam Sheikh, Vipul Arora:
Learning Ontology Informed Representations with Constraints for Acoustic Event Detection. 1086-1090 - Xuenan Xu, Xiaohang Xu, Zeyu Xie, Pingyue Zhang, Mengyue Wu, Kai Yu:
A Detailed Audio-Text Data Simulation Pipeline Using Single-Event Sounds. 1091-1095 - Francesca Ronchini, Romain Serizel:
Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems. 1096-1100 - Anselm Lohmann, Toon van Waterschoot, Jörg Bitzer
, Simon Doclo:
Microphone Subset Selection for the Weighted Prediction Error Algorithm Using a Group Sparsity Penalty. 1101-1105 - Nils Marggraf-Turley
, Michael Lovedee-Turner, Enzo De Sena:
HRTF Recommendation Based on the Predicted Binaural Colouration Model. 1106-1110 - Xingjian Du
, Pei Zou, Mingyu Liu, Xia Liang, Minghang Chu, Bilei Zhu:
ByteHum: Fast and Accurate Query-by-Humming in the Wild. 1111-1115 - Yingxue Gao, Huan Zhao, Zixing Zhang:
Adaptive Speech Emotion Representation Learning Based On Dynamic Graph. 1116-11120 - Julian D. Parker, Janne Spijkervet, Katerina Kosta, Furkan Yesiler, Boris Kuznetsov, Ju-Chiang Wang, Matt Avent, Jitong Chen, Duc Le:
STEMGEN: A Music Generation Model That Listens. 1116-1120 - Christoph Hold
, Leo McCormack, Archontis Politis
, Ville Pulkki:
Perceptually-Motivated Spatial Audio Codec for Higher-Order Ambisonics Compression. 1121-1125 - Xingjian Du
, Zhesong Yu, Jiaju Lin, Bilei Zhu, Qiuqiang Kong:
Joint Music and Language Attention Models for Zero-Shot Music Tagging. 1126-1130 - Zhongweiyang Xu, Yong Xu, Vinay Kothapally, Heming Wang, Muqiao Yang, Dong Yu:
SPATIALCODEC: Neural Spatial Speech Coding. 1131-1135 - Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data. 1136-1140 - Gabriel Meseguer-Brocal, Dorian Desblancs, Romain Hennequin:
An Experimental Comparison of Multi-View Self-Supervised Methods for Music Tagging. 1141-1145 - Giuseppe Concialdi, Alkis Koudounas
, Eliana Pastor, Barbara Di Eugenio, Elena Baralis:
Ainur: Harmonizing Speed and Quality in Deep Music Generation Through Lyrics-Audio Embeddings. 1146-1150 - Tomoki Ariga, Yosuke Higuchi, Kazutoshi Hayasaka, Naoki Okamoto, Tetsuji Ogawa:
Parody Detection Using Source-Target Attention with Teacher-Forced Lyrics. 1151-1155 - Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. 1156-1160 - Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramani Duraiswami, Dinesh Manocha:
Recap: Retrieval-Augmented Audio Captioning. 1161-1165 - Marco Pasini, Maarten Grachten, Stefan Lattner:
Bass Accompaniment Generation Via Latent Diffusion. 1166-1170 - Harshvardhan C. Takawale, Nirupam Roy:
Learning Speaker-Listener Mutual Head Orientation by Leveraging HRTF and Voice Directivity on Headphones. 1171-1175 - Bernardo Torres, Geoffroy Peeters, Gaël Richard:
Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport. 1176-1180 - Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz:
Parameter Efficient Audio Captioning with Faithful Guidance Using Audio-Text Shared Latent Representation. 1181-1185 - Dennis Fedorishin, Livio Forte, Philip Schneider, Srirangaraj Setlur
, Venu Govindaraju:
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals. 1186-1190 - Darius Petermann, Minje Kim:
Hyperbolic Distance-Based Speech Separation. 1191-1195 - Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. 1196-1200 - Yang Yang, George Sung, Shao-Fu Shih, Hakan Erdogan, Chehung Lee, Matthias Grundmann:
Binaural Angular Separation Network. 1201-1205 - Ke Chen, Yusong Wu, Haohe Liu, Marianna Nezhurina, Taylor Berg-Kirkpatrick, Shlomo Dubnov:
MusicLDM: Enhancing Novelty in text-to-music Generation Using Beat-Synchronous mixup Strategies. 1206-1210 - Alexander Gebhard, Andreas Triantafyllopoulos, Teresa Bez, Lukas Christ, Alexander Kathan, Björn W. Schuller:
Exploring Meta Information for Audio-Based Zero-Shot Bird Classification. 1211-1215 - Minje Kim, Trausti T. Kristjansson:
Scalable and Efficient Speech Enhancement Using Modified Cold Diffusion: A Residual Learning Approach. 1216-1220 - Irán R. Román
, Christopher Ick, Sivan Ding, Adrian S. Roman
, Brian McFee, Juan Pablo Bello:
Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms. 1221-1225 - Minz Won, Yun-Ning Hung, Duc Le:
A Foundation Model for Music Informatics. 1226-1230 - Huiyuan Sun
, Howe Yuan Zhu, Minh T. D. Nguyen, Vincent Nguyen, Chin-Teng Lin
, Craig T. Jin
:
From RIR to BRIR: A Sparse Recovery Beamforming Approach for Virtual Binaural Sound Rendering. 1231-1235 - Huiyuan Sun
, Craig T. Jin
, Thushara D. Abhayapala, Prasanga N. Samarasinghe:
Active Noise Control Over 3D Space with A Dynamic Noise Source. 1236-1240 - Karan Thakkar, Jiarui Hai, Mounya Elhilali:
Investigating Self-Supervised Deep Representations for EEG-Based Auditory Attention Decoding. 1241-1245 - Seungmin Shin
, Joon Byun, Jongmo Sung, Seungkwon Beack, Youngcheol Park:
Quantization Noise Masking in Perceptual Neural Audio Coder. 1246-1250 - Haici Yang, Inseon Jang, Minje Kim:
Generative De-Quantization for Neural Speech Codec Via Latent Diffusion. 1251-1255 - Ruimin Wu, Xianke Wang, Yuqing Li, Wei Xu, Wenqing Cheng:
Piano Transcription with Harmonic Attention. 1256-1260 - Xi Liu, Szu-Jui Chen, John H. L. Hansen:
Dual-Path Minimum-Phase and All-Pass Decomposition Network for Single Channel Speech Dereverberation. 1261-1265 - Yucong Zhang
, Juan Liu, Yao Tian, Haifeng Liu, Ming Li:
A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection. 1266-1270 - Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection with Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. 1271-1275 - Weinan Tong, Jiaxu Zhu, Jun Chen, Shiyin Kang, Tao Jiang, Yang Li, Zhiyong Wu, Helen Meng:
SCNet: Sparse Compression Network for Music Source Separation. 1276-1280 - Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation. 1281-1285 - Yongyi Zang, Yi Zhong, Frank Cwitkowitz, Zhiyao Duan:
SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription. 1286-1290 - Frank Cwitkowitz, Kin Wai Cheuk, Woosung Choi
, Marco A. Martínez Ramírez, Keisuke Toyama
, Wei-Hsiang Liao, Yuki Mitsufuji:
Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription. 1291-1295 - Kahyun Choi, Minje Kim:
A Comparative Analysis of Poetry Reading Audio: Singing, Narrating, or Somewhere in Between? 1296-1300 - Donghang Wu, Xihong Wu, Tianshu Qu:
A Hybrid Deep-Online Learning Based Method for Active Noise Control in Wave Domain. 1301-1305 - Enis Berk Çoban, Megan Perra, Michael I. Mandel:
Towards High Resolution Weather Monitoring With Sound Data. 1306-1310 - Longling Zhang, Lyqi Liu, Dan Meng, Jun Wang, Shengshan Hu:
Stealthy Backdoor Attack Towards Federated Automatic Speaker Verification. 1311-1315 - David Robinson
, Adelaide Robinson, Lily Akrapongpisak:
Transferable Models for Bioacoustics with Human Language Supervision. 1316-1320 - Adrian S. Roman
, Irán R. Román
, Juan Pablo Bello:
Robust DoA Estimation from Deep Acoustic Imaging. 1321-1325 - Bing Han, Zhiqiang Lv, Anbai Jiang, Wen Huang
, Zhengyang Chen, Yufeng Deng, Jiawei Ding, Cheng Lu, Wei-Qiang Zhang, Pingyi Fan
, Jia Liu, Yanmin Qian:
Exploring Large Scale Pre-Trained Models for Robust Machine Anomalous Sound Detection. 1326-1330 - Azalea Gui, Hannes Gamper, Sebastian Braun, Dimitra Emmanouilidou:
Adapting Frechet Audio Distance for Generative Music Evaluation. 1331-1335 - Shaoheng Xu, Jihui Aimee Zhang
, Thushara D. Abhayapala, Amy Bastine
, Wei-Ting Lai, Prasanga N. Samarasinghe:
Sparse Sound Field Representation Using Complex Orthogonal Matching Pursuit. 1336-1340 - Chunxi Wang, Maoshen Jia, Meiran Li, Changchun Bao, Wenyu Jin:
Attention Is All You Need For Blind Room Volume Estimation. 1341-1345 - Chengbo Chang, Ziye Yang, Jie Chen:
Plug-and-Play MVDR Beamforming for Speech Separation. 1346-1350 - Sankha Subhra Bhattacharjee
, Srikanth Burra, Jesper Rindom Jensen
, Liming Shi
, Guoli Ping, Jingkai Weng, Mads Græsbøll Christensen
:
Broadband Personal Sound Zone Control in the Presence of Nonlinearities. 1351-1355 - Florian Henkel, Jaehun Kim, Matthew C. McCallum, Samuel E. Sandberg, Matthew E. P. Davies:
Tempo Estimation as Fully Self-Supervised Binary Classification. 1356-1360 - Yun Liang, Hai Lin, Shaojian Qiu, Yihang Zhang:
AAT: Adapting Audio Transformer for Various Acoustics Recognition Tasks. 1361-1365 - Jun-You Wang, Chung-Che Wang, Chon-In Leong, Jyh-Shing Roger Jang:
MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics and Audio. 1366-1370 - Jiyun Park, Sangeon Yong, Taegyun Kwon, Juhan Nam:
A Real-Time Lyrics Alignment System Using Chroma and Phonetic Features for Classical Vocal Performance. 1371-1375 - Yurii Iotov
, Sidsel Marie Nørholm, Peter John McCutcheon, Mads Græsbøll Christensen
:
Improving Speech Attenuation in Headphones using Harmonic Model Decomposition and Multiple-Frequency ANC. 1376-1380 - Zizheng Zhang, Chen Chen, Hsin-Hung Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-Aware Speech Separation with Contrastive Learning. 1381-1385 - Ernst Seidel
, Pejman Mowlaee, Tim Fingscheidt:
Efficient High-Performance Bark-Scale Neural Network for Residual Echo and Noise Suppression. 1386-1390 - Ryosuke Tanaka, Satoshi Tamura:
Few-Shot Anomalous Sound Detection Based on Anomaly Map Estimation Using Pseudo Abnormal Data. 1391-1395 - Dail Kim, Min-Sang Baek
, Yungyeo Kim, Joon-Hyuk Chang:
Improving Target Sound Extraction with Timestamp Knowledge Distillation. 1396-1400 - Donghyun Kim, Yungyeo Kim, Joon-Hyuk Chang:
Class: Continual Learning Approach for Speech Super-Resolution. 1401-1405 - Chenglong Wang, Jiayi He
, Jiangyan Yi, Jianhua Tao, Chu Yuan Zhang, Xiaohui Zhang:
Multi-Scale Permutation Entropy for Audio Deepfake Detection. 1406-1410 - Masahiro Yasuda, Shoichiro Saito, Akira Nakayama, Noboru Harada:
6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on Self-Motioning Human. 1411-1415 - Jiu Feng, Mehmet Hamza Erol, Joon Son Chung, Arda Senocak:
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers. 1416-1420 - Santiago Cuervo, Ricard Marxer
:
Speech Foundation Models on Intelligibility Prediction for Hearing-Impaired Listeners. 1421-1425 - June-Woo Kim, Sangmin Bae, Won-Yang Cho, Byungjo Lee, Ho-Young Jung:
Stethoscope-Guided Supervised Contrastive Learning for Cross-Domain Adaptation on Respiratory Sound Classification. 1431-1435 - Laura Lechler, Kamil Wójcicki
:
Crowdsourced Multilingual Speech Intelligibility Testing. 1441-1445 - Yuzhuo Liu, Xubo Liu, Yan Zhao, Yuanyuan Wang, Rui Xia, Pingchuan Tain, Yuxuan Wang:
Audio Prompt Tuning for Universal Sound Separation. 1446-1450 - Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
Selecting N-Lowest Scores for Training MOS Prediction Models. 1451-1455 - Tatsuya Komatsu, Yusuke Fujita, Kazuya Takeda, Tomoki Toda:
Audio Difference Learning for Audio Captioning. 1456-1460 - Yanjue Song, Nilesh Madhu
:
Phase Reconstruction in Single Channel Speech Enhancement Based on Phase Gradients and Estimated Clean-Speech Amplitudes. 1461-1465 - Vitjan Zavrtanik, Matija Marolt, Matej Kristan, Danijel Skocaj:
Anomalous Sound Detection by Feature-Level Anomaly Simulation. 1466-1470 - Xingda Li, Fan Zhuo, Dan Luo, Jun Chen, Shiyin Kang, Zhiyong Wu, Tao Jiang, Yang Li, Han Fang, Yahui Zhou:
Generating Stereophonic Music with Single-Stage Language Models. 1471-1475 - Federico Miotello
, Luca Comanducci, Mirco Pezzoli, Alberto Bernardini, Fabio Antonacci, Augusto Sarti:
Reconstruction of Sound Field Through Diffusion Models. 1476-1480 - Tianchi Sun, Tong Lei
, Xu Zhang, Yuxiang Hu, Changbao Zhu, Jing Lu:
A Lightweight Hybrid Multi-Channel Speech Extraction System with Directional Voice Activity Detection. 1486-1490 - Jin Woo Lee
, Min Jun Choi
, Kyogu Lee:
String Sound Synthesizer On Gpu-Accelerated Finite Difference Scheme. 1491-1495 - Ying Hu, Haitao Xu, Zhongcun Guo, Hao Huang, Liang He:
SMMA-Net: An Audio Clue-Based Target Speaker Extraction Network with Spectrogram Matching and Mutual Attention. 1496-1500 - Jiuqiang Li, Zheng Wang, Shilei Zhu:
Mixed Informed Transformer for Few-Shot Medical Image Segmentation. 1501-1505 - Zhihao Yu, Chaohe Zhang, Yasha Wang, Wen Tang, Jiangtao Wang
, Liantao Ma:
Predict and Interpret Health Risk Using Ehr Through Typical Patients. 1506-1510 - Yee-Fan Tan, Junn Yong Loo, Chee-Ming Ting, Fuad Noman, Raphaël C.-W. Phan, Hernando Ombao
:
BrainFC-CGAN: A Conditional Generative Adversarial Network for Brain Functional Connectivity Augmentation and Aging Synthesis. 1511-1515 - Xuechen Guo, Wenhao Hu, Chiming Ni
, Wenhao Chai, Shiyan Li, Gaoang Wang:
Blind Inpainting with Object-Aware Discrimination for Artificial Marker Removal. 1516-1520 - Minheng Chen
, Zhirun Zhang, Shuheng Gu, Youyong Kong:
Embedded Feature Similarity Optimization with Specific Parameter Initialization for 2D/3D Medical Image Registration. 1521-1525 - Xinzhe Zheng, Sijie Ji, Chenshu Wu:
Predicting Adverse Events for Patients with Type-1 Diabetes Via Self-Supervised Learning. 1526-1530 - Siteng Ma, Haochang Wu, Aonghus Lawlor, Ruihai Dong:
Breaking the Barrier: Selective Uncertainty-Based Active Learning for Medical Image Segmentation. 1531-1535 - Zhuotong Cai, Jingmin Xin, Siyuan Dong, John A. Onofrey, Nanning Zheng, James S. Duncan:
Symmetric Consistency with Cross-Domain Mixup for Cross-Modality Cardiac Segmentation. 1536-1540 - Renqi Chen, Jingjing Luo, Fan Nian, Yuhui Cen
, Yiheng Peng, Zekuan Yu:
SSHNN: Semi-Supervised Hybrid NAS Network for Echocardiographic Image Segmentation. 1541-1545 - Mengjiao Yao, Xiang Gao:
Gland Instance Segmentation by Full Resolution Multi-Scale Dilation Residual Networks. 1546-1550 - Yining Qiu, Yuxi Li, Jiafu Wu, Zhenye Gan, Mingmin Chi, Yabiao Wang, Chengjie Wang, Pei Wang:
Learning Hybrid Negative Probability Model for Weakly-Supervised Whole Slide Image Recognition. 1551-1555 - Peiji Chen, Dian Li, Yifan Tang, Shunta Togo, Hiroshi Yokoi, Yinlai Jiang:
Dynamic Label Smoothing Strategy for Biosignal Classification. 1556-1560 - Jun Liu
, Wenyi Wang, Nuo Shen, Wei Wang, Kuanquan Wang, Qince Li, Yongfeng Yuan, Henggui Zhang, Gongning Luo:
Mutualreg: Mutual Learning for Unsupervised Medical Image Registration. 1561-1565 - Yinda Chen
, Wei Huang, Xiaoyu Liu, Shiyu Deng, Qi Chen, Zhiwei Xiong:
Learning Multiscale Consistency for Self-Supervised Electron Microscopy Instance Segmentation. 1566-1570 - Xingcan Hu, Li Xiao, Yu-Ping Wang:
A Graph Neural Network Based Fusion of MRI-Derived Brain Network and Clinical Data for Glioblastoma Survival Prediction. 1571-1575 - Jiang Shang, Sifan Zhou:
LK-UNet: Large Kernel Design for 3D Medical Image Segmentation. 1576-1580 - Minghui Wu, Yangdi Xu, Yingying Xu, Guangwei Wu, Qingqing Chen, Hongxiang Lin:
Stable Optimization for Large Vision Model Based Deep Image Prior in Cone-Beam CT Reconstruction. 1581-1585 - Yan Li, Zhuoran Zheng, Wenqi Ren, Yunfeng Nie, Jingang Zhang, Xiuyi Jia:
Frequency Aware and Graph Fusion Network for Polyp Segmentation. 1586-1590 - Luyuan Xie, Cong Li, Xin Zhang, Shengfang Zhai, Yuejian Fang, Qingni Shen, Zhonghai Wu:
TRLS: A Time Series Representation Learning Framework Via Spectrogram for Medical Signal Processing. 1591-1595 - Jiacheng Hao, Junhai Xu, Mengting Liu, Jianguo Wei:
SSR-GPCsT: Deep Learning Models Based on Functional Connectivity Maps in Autism Research. 1596-1600 - Xutao Guo, Yanwu Yang, Chenfei Ye, Guoqing Cai, Ting Ma:
CALSeg: Improving Calibration of Medical Image Segmentation Via Variational Label Smoothing. 1601-1605 - Mehrab Bin Morshed, Md Mahbubur Rahman, Viswam Nathan, Li Zhu, Jungmok Bae, Christina Rosa, Wendy Berry Mendes, Jilong Kuang, Alex Gao:
Core Body Temperature and its Role in Detecting Acute Stress: A Feasibility Study. 1606-1610 - Ryo Fujii, Ryo Hachiuma, Hideo Saito:
Weakly Semi-Supervised Tool Detection in Minimally Invasive Surgery Videos. 1611-1615 - Ming Wu, Hao Qi, Wenkang Fan, Sunkui Ke, Hui-Qing Zeng, Yinran Chen, Xiongbiao Luo:
Chat: Cascade Hole-Aware Transformers with Geometric Spatial Consistency for Accurate Monocular Endoscopic Depth Estimation. 1616-1620 - Yintao Zhou, Meng Pang, Wei Huang, Binghui Wang:
Early Diagnosing Parkinson's Disease Via a Deep Learning Model Based on Augmented Facial Expression Data. 1621-1625 - Yan Zhang, Xin Liu, Zuping Zhang:
DDN-Net: Deep Residual Shrinkage Denoising Networks with Channel-Wise Adaptively Soft Thresholds for Automated Major Depressive Disorder Identification. 1626-1630 - Jiyao Wang, Ange Wang, Haolong Hu, Kaishun Wu, Dengbo He:
Multi-Source Domain Generalization for ECG-Based Cognitive Load Estimation: Adversarial Invariant and Plausible Uncertainty Learning. 1631-1635 - Chenyang Li, Zhili Zhang, Peipei Li, Zhaofeng He:
I3FDM: IRIS Inpainting Via Inverse Fusion of Diffusion Models. 1636-1640 - Wenjing Zhang, Hao Yu, Manli Zhang, Gongpeng Cao, Guixia Kang, Lixin Cai:
Matpr-Unet: A Multi Attention Two-Path Residual Unet for Focal Cortical Dysplasia Lesions Segmentation. 1641-1645 - Qijia Shao
, Li Zhu, Mohsin Y. Ahmed, Korosh Vatanparvar, Migyeong Gwak, Nafiul Rashid, Jungmok Bae, Jilong Kuang, Alex Gao:
Normalization is All You Need: Robust Full-Range Contactless SpO2 Estimation Across Users. 1646-1650 - Jiawei Jiang, Jie Wu, Yueqian Quan, Jiacheng Chen, Jianwei Zheng:
Memory-Augmented Dual-Domain Unfolding Network for MRI Reconstruction. 1651-1655 - Yuan Zhang, Yaolei Qi, Xiaoming Qi, Lotfi Senhadji, Yongyue Wei, Feng Chen, Guanyu Yang:
Fedsoda: Federated Cross-Assessment and Dynamic Aggregation for Histopathology Segmentation. 1656-1660 - Boon Peng Yap, Beng Koon Ng:
Single-Source Domain Generalization in Fundus Image Segmentation Via Moderating and Interpolating Input Space Augmentation. 1661-1665 - Lingrui Gu, Weijian Deng, Guoli Wang:
UNAD: Universal Anatomy-Initialized Noise Distribution Learning Framework Towards Low-Dose CT Denoising. 1671-1675 - Chengyu Yuan, Hao Xiong, Guoqing Shangguan, Hualei Shen
, Dong Liu, Haojie Zhang, Zhonghua Liu, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto, Shlomo Berkovsky
:
Deep Fusion of Shifted MLP and CNN for Medical Image Segmentation. 1676-1680 - Po-Chen Lin, Jeng-Lin Li, Woan-Shiuan Chien, Chi-Chun Lee:
In-The-Wild Physiological-Based Stress Detection Using Federated Strategy. 1681-1685 - Jiuming Qin, Che Liu, Sibo Cheng, Yike Guo
, Rossella Arcucci:
Freeze the Backbones: a Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-Training. 1686-1690 - Xianglong Wang, Xifeng An, Eric Rigall, Shu Zhang, Hui Yu
, Junyu Dong:
A Method for X-Ray Image Landmarks Localization using Cyclic Coordinate-Guided Strategy. 1691-1695 - Yuanzhe Peng, Jieming Bian, Jie Xu
:
Fedmm: Federated Multi-Modal Learning with Modality Heterogeneity in Computational Pathology. 1696-1700 - Guorui Liao, Jiawei Liu
, Yuxuan Liang, Shu Wang, Li Liu:
Fall Prediction by a Spatio-Temporal Multi-Channel Causal Model from Wearable Sensors Data. 1701-1705 - Jing Xia, Yi Hao Chan, Deepank Girish
, Jagath C. Rajapakse:
Brain Structure-Function Interaction Network for Fluid Cognition Prediction. 1706-1710 - Linyu Xing, Mengxi Chen, Jiangchao Yao, Ya Zhang, Yanfeng Wang:
Pre-Post Interaction Learning for Brain Tumor Segmentation with Missing MRI Modalities. 1711-1715 - Yuping Huang, Weisheng Li, Guofen Wang, Xiaoyu Qiao, Huanyu Chen:
CT and MRI Fusion with Anisotropic Guided Filtering. 1716-1720 - Yingwei Zhang, Changru Guo, Yiqiang Chen
, Zeping Lv, Qing Li:
Effective Connectivity-Based Multi-View Feature Learning Method for Dementia Diagnosis with FNIRS Signal. 1721-1725 - Jiaqi Cui
, Yan Wang, Lu Wen, Pinxian Zeng, Xi Wu, Jiliu Zhou, Dinggang Shen:
Image2Points: A 3D Point-Based Context Clusters GAN for High-Quality Pet Image Reconstruction. 1726-1730 - Zhenyu Zhang, Benlu Wang, Weijie Liang, Yizhi Li, Xuechen Guo, Guanhong Wang, Shiyan Li, Gaoang Wang:
Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning. 1731-1735 - Yu-Tung Liu
, Kuan-Chen Wang, Kai-Chun Liu, Sheng-Yu Peng, Yu Tsao:
SDEMG: Score-Based Diffusion Model for Surface Electromyographic Signal Denoising. 1736-1740 - Zhuang Xie, Jianguo Wei, Wenhuan Lu, Zhongjie Li
, Chunli Wang, Gaoyan Zhang:
EEG-Based Fast Auditory Attention Detection in Real-Life Scenarios Using Time-Frequency Attention Mechanism. 1741-1745 - Guoxin Wang, Sheng Shi, Shan An, Fengmei Fan, Wenshu Ge, Qi Wang, Feng Yu, Zhiren Wang:
A Bi-Pyramid Multimodal Fusion Method for the Diagnosis Of Bipolar Disorders. 1746-1750 - Bo Wang, Hang Zhao, Xiongfei Li, Mingjie Tian, Bo Huang, Feiyang Yang:
Multi-Task Self-Supervised Learning for Medical Image Segmentation. 1751-1755 - Yuda Bi, Anees Abrol, Jing Sui
, Vince D. Calhoun:
Cross-Modal Synthesis of Structural MRI and Functional Connectivity Networks via Conditional ViT-GANs. 1756-1760 - Fangyao Shen, Zehao Zhang, Yong Peng, Hongjie Guo, Lina Chen, Hong Gao:
Self-Supervised Learning for Sleep Stage Classification with Temporal Augmentation and False Negative Suppression. 1761-1765 - Yujie Liu, Peng Zhou, Zongmin Li:
VMCC-NET: Uncovering Challenging Regions in Semi-Supervised Medical Image Segmentation with Voxel Mask Based Cyclic-Consistency Network. 1766-1770 - Chengliang Wang, Xinrun Chen, Haojian Ning, Shiying Li:
SAM-OCTA: A Fine-Tuning Strategy for Applying Foundation Model OCTA Image Segmentation Tasks. 1771-1775 - Shitao Zheng, Dongrui Wu:
Semi-Supervised Domain Adaptation for Eeg-Based Sleep Stage Classification. 1776-1780 - Qi Bi
, Hao Zheng, Xu Sun
, Jingjun Yi
, Wentian Zhang, Yawen Huang, Yuexiang Li, Yefeng Zheng:
Self-Supervised Cross-Level Consistency Learning For Fundus Image Classification. 1781-1785 - Joao Pereira, Dimitrios Halatsis, Balint Hodossy, Dario Farina:
Tackling Electrode Shift in Gesture Recognition with HD-EMG Electrode Subsets. 1786-1790 - Zexin Feng, Na Zeng, Jiansheng Fang, Xingyue Wang, Xiaoxi Lu, Heng Meng, Jiang Liu:
Flattening Singular Values of Factorized Convolution for Medical Images. 1791-1795 - Tatsuki Seino, Naoki Saito, Takahiro Ogawa, Satoshi Asamizu, Miki Haseyama:
Confidence-Aware Spatial-Temporal Attention Graph Convolutional Network for Skeleton-Based Expert-Novice Level Classification. 1796-1800 - Bozhen Hu, Zelin Zang, Cheng Tan, Stan Z. Li:
Deep Manifold Transformation for Protein Representation Learning. 1801-1805 - Ziyi Li, Li-Ming Zhao, Wei-Long Zheng, Bao-Liang Lu:
Temporal-Spatial Prediction: Pre-Training on Diverse Datasets for EEG Classification. 1806-1810 - Johanna Wilroth, Emina Alickovic, Martin A. Skoglund, Martin Enqvist:
Nonlinearity Detection and Compensation for EEG-Based Speech Tracking. 1811-1815 - Wenlong Chen, Chuanwen Feng, Ao Ke, Xike Xie, S. Kevin Zhou:
Out-of-Distribution Detection for Learning-Based Chest X-Ray Diagnosis. 1816-1820 - He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Prompt-Based Personalized Federated Learning for Medical Visual Question Answering. 1821-1825 - Ziqiang Chen, Kang Wang, Yun Liu:
Efficient Polyp Segmentation via Integrity Learning. 1826-1830 - Trung Vu, Hanlu Yang, Francisco Laport, Ben Gabrielson, Vince D. Calhoun, Tülay Adali:
A Robust and Scalable Method with an Analytic Solution for Multi-Subject FMRI Data Analysis. 1831-1835 - Michaela Areti Zervou, Effrosyni Doutsi, Yannis Pantazis, Panagiotis Tsakalides:
Multitask Classification of Antimicrobial Peptides for Simultaneous Assessment of Antimicrobial Property and Structural Fold. 1836-1840 - Wei-Bang Jiang, Ziyi Li, Wei-Long Zheng, Bao-Liang Lu:
Functional Emotion Transformer for EEG-Assisted Cross-Modal Emotion Recognition. 1841-1845 - Erlei Zhang, Weihao Chen, Xiaowei Xu, Zhicheng Zhang, Jinglei Li:
Breast Ultrasound Computer-Aided Diagnosis Using Structure-Aware Triplet Path Networks. 1846-1850 - Ruixing Liang, Xiangyu Zhang
, Qiong Li, Lai Wei, Hexin Liu, Avisha Kumar, Kelley M. Kempski Leadingham, Joshua Punnoose, Leibny Paola García, Amir Manbachi:
Unidirectional Brain-Computer Interface: Artificial Neural Network Encoding Natural Images to FMRI Response in the Visual Cortex. 1851-1855 - Tianxiang Xia, Rong Zhang, Zhenzuo Chen, Guomin Xie, Xiping Wu, Zhongyue Lv, Lijun Guo:
Progressive Learning Based Knowledge Distillation for Low Resolution Cerebral Microbleed Segmentation. 1856-1860 - Chenglin Liu, Binquan Wang, Zhi Wu:
PN-DetX: A Dedicated Framework for Pulmonary Nodule Detection in X-Ray Images. 1861-1865 - Chiao-Yi Wang, Faranguisse Kakhi Sadrieh, Yi-Ting Shen, Giovanni Oppizzi, Li-Qun Zhang, Yang Tao:
Real-Time Privacy-Preserving Fall Risk Assessment with a Single Body-Worn Tracking Camera. 1866-1870 - Yu-Ting Lan, Wei-Bang Jiang, Wei-Long Zheng, Bao-Liang Lu:
CEMOAE: A Dynamic Autoencoder with Masked Channel Modeling for Robust EEG-Based Emotion Recognition. 1871-1875 - Lu Wen, Zhenghao Feng, Yun Hou, Peng Wang, Xi Wu, Jiliu Zhou, Yan Wang:
DCL-Net: Dual Contrastive Learning Network for Semi-Supervised Multi-Organ Segmentation. 1876-1880 - Yuexiao Liang, Zhineng Chen, Xin Chen, Caiyan Jia, Xiongjun Ye, Xieping Gao:
Dual Contrastive Learning Guided Pathological Image Re-Staining. 1881-1885 - Kun Huang, Xiao Ma, Na Su, Songtao Yuan, Qiang Chen:
Model-Based Label-to-Image Diffusion for Semi-Supervised Choroidal Vessel Segmentation. 1886-1890 - Bingzhi Chen, Jiawei Zhu, Yishu Liu, Biqing Zeng, Jiahui Pan, Meirong Ding:
Medical Vision-Language Representation Learning with Cross-Modal Multi-Teacher Contrastive Distillation. 1891-1895 - Chengxi Zhu, Yong Peng, Yinfeng Fang, Wanzeng Kong:
Label Rectified and Graph Adaptive Semi-Supervised Regression for Electrode Shifted Gesture Recognition. 1896-1900 - Yibin Tang, Jikang Ding, Aimin Jiang, Chun Wang, Yuan Gao:
High-Accuracy Anxiety Disorder Identification Through Subspace-Enhanced Hypergraph Neural Network. 1901-1905 - Wenbo Qi
, Wenyong Zhou, Ngai Wong, S. C. Chan:
Hybrid Module with Multiple Receptive Fields and Self-Attention Layers for Medical Image Segmentation. 1906-1910 - Yuan Gao, Xiaotong Wang, Aimin Jiang, Ying Chen, Yibin Tang:
ADHD Diagnosis and Biomarker Detection Based on Multimodal Graph Convolutional Neural Network. 1911-1915 - Xinrui Chen, Renao Yan, Yizhi Wang, Jiawen Li, Junru Cheng, Tian Guan, Yonghong He:
HIQ: One-Shot Network Quantization for Histopathological Image Classification. 1916-1920 - Jiwon Lee, Eunsong Kang, Junyeong Maeng, Heung-Il Suk:
Eigendecomposition-Based Spatial-Temporal Attention for Brain Cognitive States Identification. 1921-1925 - Yi Guo, Chao Tang
, Hao Wu, Badong Chen:
EEG Emotion Recognition Based on Dynamical Graph Attention Network. 1921-1925 - Pengxuan Gao
, Tianyu Liu, Jia-Wen Liu, Bao-Liang Lu, Wei-Long Zheng:
Multimodal Multi-View Spectral-Spatial-Temporal Masked Autoencoder for Self-Supervised Emotion Recognition. 1926-1930 - Xiangyu Kong, Zeyu Ren
, Lu Liu:
Semi-Supervised Volumetric Medical Image Segmentation via Class Prototype Guided Distribution-Aligned Representation Learning. 1931-1935 - Jiezhou He, Zhiming Luo, Wei Peng
, Songzhi Su, Shaozi Li:
CC-DA: Cross-Domain Consistency Data Augmentation for 3D Tumor Segmentation. 1936-1940 - Xiran Xu
, Bo Wang, Yujie Yan, Xihong Wu, Jing Chen:
A DenseNet-Based Method for Decoding Auditory Spatial Attention with EEG. 1946-1950 - Xiao Chen, Xiaokun Dai, Xueli Liu, Xinrong Chen:
SPTESleepNet: Automatic Sleep Staging Model Based On Strip Patch Embeddings And Transformer Encoder. 1951-1955 - Zongmin Li, Xuanting Li, Jiayue Fan, Zhonghao Du, Chaozhi Yang
:
Non-iterative Pyramid Network for Unsupervised Deformable Medical Image Registration. 1956-1960 - Renhe Liu, Yu Liu, Han Wang, Kai Hu, Shan Du
:
A Novel Medical Image Fusion Framework Integrating Multi-scale Encoder-Decoder with Discrete Wavelet Decomposition. 1961-1965 - Haojian Ning, Chengliang Wang, Xinrun Chen, Shiying Li:
An Accurate and Efficient Neural Network for OCTA Vessel Segmentation and a New Dataset. 1966-1970 - Srikireddy Dhanunjay Reddy
, Tharun Kumar Reddy:
GM-VRC: Semantic Topological Data Ensemble Approach for EEG Signal Classification. 1971-1975 - Yifan Song, Songpengcheng Xia, Jiarui Yang, Ling Pei:
A Learning-Based Multi-Node Fusion Positioning Method Using Wearable Inertial Sensors. 1976-1980 - Xiaochen He, Baoyao Yang, Fei Lyu:
MMS: Morphology-Mixup Stylized Data Generation for Single Domain Generalization in Medical Image Segmentation. 1981-1985 - Mei Yu, Hexin Wang, Xuzhou Fu, Jie Gao, Zhiqiang Liu, Xuewei Li:
DualGCN-MIL: Whole Slide Image Classification Based on Double Relationship Graph Learning. 1986-1990 - Zheyun Qin, Xiaoming Xi, Yilong Yin:
Distribution-Aware Contrastive Learning for Robust Medical Image Segmentation. 1991-1995 - Wenjie Song, Jiqing Han, Jianchen Li
, Guibin Zheng, Tieran Zheng, Yongjun He:
Modeling Quasi-Periodic Dependency via Self-Supervised Pre-Training for Respiratory Sound Classification. 1996-2000 - Zeming He, Gaoyan Zhang:
CEDNet: A Continuous Emotion Detection Network for Naturalistic Stimuli Using MEG Signals. 2001-2005 - Jian Chen, Xing Wu, Chengliang Wang, Zailin Yang, Xuelian Wu, Longrong Ran, Yao Liu:
Texture-Unet: A Texture-Aware Network for Bone Marrow Smear Whole-Slide Image Region of Interest Segmentation. 2006-2010 - Shang-Jui Kuo, Po-Han Huang, Chia-Ching Lin, Jeng-Lin Li, Ming-Ching Chang:
Improving Limited Supervised Foot Ulcer Segmentation Using Cross-Domain Augmentation Strategies. 2011-2015 - Ruihan Qin, Zhenxi Song, Huixia Ren, Zian Pei, Lin Zhu, Xue Shi, Yi Guo, Honghai Liu, Min Zhang, Zhiguo Zhang:
BNMTrans: A Brain Network Sequence-Driven Manifold-Based Transformer for Cognitive Impairment Detection Using EEG. 2016-2020 - Xinxu Zhou, Zhen Liang, Weishan Ye, Junqi Xue, Honghai Liu, Min Zhang, Zhiguo Zhang:
EmoTVR: A Hybrid Model to Estimate Continuous-Time and Continuous-Level Emotion from Electroencephalography. 2021-2025 - Han Chen, Wenxuan Wu, Xiaofen Xing, Xiangmin Xu:
Clinical Scores Prediction and Medication Adjustment for Course of Parkinson's Disease. 2026-2030 - Stanislas Ducotterd, Sebastian Neumayer, Michael Unser:
Learning a Convex Patch-Based Synthesis Model via Deep Equilibrium. 2031-2035 - Christine Beauchene, Michael S. Brandstein, Thomas F. Quatieri, Eric Thompson, Christopher J. Smalt:
A Neurophysiological-Auditory "Listen Receipt" for Communication Enhancement. 2036-2040 - Nastassia Vysotskaya, Noah Maul, Alessandra Fusco, Souvik Hazra, Jens Harnisch, Tomás Arias-Vergara, Andreas K. Maier:
Transforming Cardiovascular Health: a Transformer-Based Approach to Continuous, Non-Invasive Blood Pressure Estimation via Radar Sensing. 2041-2045 - Migyeong Gwak, Korosh Vatanparvar, Li Zhu, Nafiul Rashid, Mohsin Y. Ahmed, Jungmok Bae, Jilong Kuang, Alex Gao:
Multimodal Breathing Rate Estimation Using Facial Motion and RPPG From RGB Camera. 2046-2050 - Chen Zhou, Lingjing Hu:
A Neural Syntax Parser for Coronary Artery Anatomical Labeling in Coronary CT Angiography. 2051-2055 - Wei Wang, Xingcan Hu, Li Xiao, Yu-Ping Wang:
Adaptive Multiview Community-Preserved Graph Convolutional Network for Multiatlas-Based Functional Connectivity Analysis. 2056-2060 - Niki Efthymiou, George Retsinas, Panagiotis Paraskevas Filntisis, Petros Maragos:
Augmenting Transformer Autoencoders with Phenotype Classification for Robust Detection of Psychotic Relapses. 2061-2065 - Yonathan Eder
, Ravit Abel
, Avi Schroeder, Yonina C. Eldar:
Localization and Tracking of Gold Nanoparticles Using mmWave FMCW Radar. 2066-2070 - Ram Sapkota, Bishal Thapaliya, Pranav Suresh, Bhaskar Ray, Vince D. Calhoun, Jingyu Liu:
Multimodal Imaging Feature Extraction with Reference Canonical Correlation Analysis Underlying Intelligence. 2071-2075 - John Stewart Fabila-Carrasco, Avalon Campbell-Cousins, Mario A. Parra-Rodriguez, Javier Escudero:
Graph-Based Permutation Patterns for the Analysis of Task-Related FMRI Signals on DTI Networks in Mild Cognitive Impairment. 2076-2080 - Siddhant Gautam
, Angqi Li, Saiprasad Ravishankar:
Patient-Adaptive and Learned Mri Data Undersampling Using Neighborhood Clustering. 2081-2085 - Shadi Sartipi, Müjdat Çetin:
Multi-Source Domain Adaptation with Transformer-Based Feature Generation for Subject-Independent EEG-Based Emotion Recognition. 2086-2090 - Ramesh Kumar Sah, Md. Mahbubur Rahman, Viswam Nathan, Li Zhu, Jungmok Bae, Christina Rosa, Wendy Berry Mendes, Jilong Kuang, Jun Alex Gao:
Heart Rate Variability Estimation with Dynamic Fine Filtering and Global-Local Context Outlier Removal. 2091-2095 - Rabindra Khadka, Pedro G. Lind, Gustavo B. M. Mello, Michael A. Riegler, Anis Yazidi:
Inducing Inductive Bias in Vision Transformer for EEG Classification. 2096-2100 - Suhas BN, Rakshith Sharma Srinivasa, Yashas Malur Saidutta, Jaejin Cho, Ching Hua Lee, Chouchang Yang, Yilin Shen, Hongxia Jin:
End-To-End Personalized Cuff-Less Blood Pressure Monitoring Using ECG and PPG Signals. 2101-2105 - Hongyi Pan, Bin Wang, Zheyuan Zhang, Xin Zhu, Debesh Jha, Ahmet Enis Çetin
, Concetto Spampinato, Ulas Bagci:
Domain Generalization with fourier Transform and soft thresholding. 2106-2110 - David J. Lin, Md Mahbubur Rahman, Li Zhu, Viswam Nathan, Jungmok Bae, Christina Rosa, Wendy Berry Mendes, Jilong Kuang, Jun Alex Gao:
Ballistocardiogram-Based Heart Rate Variability Estimation for Stress Monitoring using Consumer Earbuds. 2111-2115 - Yuhao Zhang, Shaoming Duan, Xinyu Zha, Jinhang Su, Peiyi Han, Chuanyi Liu:
FEDKA: Federated Knowledge Augmentation for Multi-Center Medical Image Segmentation on non-IID Data. 2116-2120 - Conghao Wang
, Hiok Hian Ong, Shunsuke Chiba, Jagath C. Rajapakse:
De Novo Molecule Generation with Graph Latent Diffusion Model. 2121-2125 - Zi-Chen Fan, Di Li
, Susanto Rahardja
:
A Novel Discrete Fractional Complex Hadamard Transform for Medical Image Encryption. 2126-2130 - Taylor Lawson, John H. L. Hansen:
Situational Signal Processing with Ecological Momentary Assessment: Leveraging Environmental Context for Cochlear Implant Users. 2131-2135 - Jose Hoyos Sanchez, Batoul Taki, Waheed U. Bajwa, Anand D. Sarwate:
Federated Learning of Tensor Generalized Linear Models with low Separation Rank. 2136-2140 - Hanlu Yang, Meiby Ortiz-Bouza, Trung Vu, Francisco Laport, Vince D. Calhoun, Selin Aviyente, Tülay Adali:
Subgroup Identification Through Multiplex Community Structure Within Functional Connectivity Networks. 2141-2145 - Dingding Ye, Charan Santhirasegaran, Ryan Pai, Genevera I. Allen, Joseph Young:
Addressing Confounds in Functional Connectivity Analyses of Calcium Imaging. 2146-2150 - Yiqian Xu, Rui-Wei Zhao, Rui Feng:
Lesion-Aware Open Set Medical Image Recognition with Domain Shift. 2151-2155 - Qiqi Xian, Zhe Sage Chen:
Estimating Directed Spectral Information Flow between Multi-Resolution Time Series. 2156-2159 - Xin Zhu, Hongyi Pan, Shuaiang Rong, Ahmet Enis Çetin:
Electroencephalogram Sensor Data Compression Using an Asymmetrical Sparse Autoencoder with a Discrete Cosine Transform Layer. 2160-2164 - Yuanpin Zhou
, Huogen Wang, Yanfeng Bai, Yidong Wan, Chaohui Jin, Ming Chen, Xiaodong Teng:
Digital Pathology Image Deblurring Via Local Focus Quality Assessment. 2165-2169 - Yudong Yang, Rongfeng Su, Xiaokang Liu, Nan Yan, Lan Wang:
An Audio-Textual Diffusion Model for Converting Speech Signals into Ultrasound Tongue Imaging Data. 2170-2174 - Suizhi Huang, Shalayiding Sirejiding
, Yuxiang Lu
, Yue Ding, Leheng Liu, Hui Zhou, Hongtao Lu:
YOLO-Med : Multi-Task Interaction Network for Biomedical Images. 2175-2179 - Xipeng Pan, Feihu Hou, Zhenbing Liu, Siyang Feng, Rushi Lan:
EOFD-Net: Edge Optimization and Feature Denoising for Weakly Supervised Deep Nuclei Segmentation with Point Annotations. 2180-2184 - Yu Rong, Kawon Han, Isabella Lenz, Daniel W. Bliss:
Motion-Tolerant Radar-Based Heart Sound Detection. 2185-2189 - Bo Wang, Xiran Xu, Longxiang Zhang, Boda Xiao, Xihong Wu, Jing Chen:
Semantic Reconstruction of Continuous Language from Meg Signals. 2190-2194 - Yi Hao Chan, Jun Liang Ang, Sukrit Gupta, Yinan He, Jagath C. Rajapakse:
Subtype-Specific Biomarkers of Alzheimer's Disease from Anatomical and Functional Connectomes via Graph Neural Networks. 2195-2199 - Jiawei Li, Chunxu Guo, Li Fu, Lu Fan, Edward F. Chang, Yuanning Li:
Neural2speech: A Transfer Learning Framework for Neural-Driven Speech Reconstruction. 2200-2204 - Qichang Chen, Zhonghang Zhu, Lianxin Wang, Liansheng Wang:
Shifted-Rectangle-Window Based Transformer for non-Displaced Femoral Neck Fracture Diagnosis. 2205-2209 - Minxi Yang, Dahua Gao, Jiaxuan Li, Wenlong Xu, Xiaodan Song, Guangming Shi:
Mosic: Multimodal Semantic Integrated Communication for Health Monitoring in Iot Scenarios. 2210-2214 - Yuda Jin
, Weidong Chen, Yuanhe Tian, Yan Song, Chenggang Yan, Zhendong Mao:
Improving Radiology Report Generation with D2-Net: When Diffusion Meets Discriminator. 2215-2219 - Gang Liu, Hongyang Li, Zerui He, Shenjun Zhong:
Enhancing Generalization in Medical Visual Question Answering Tasks Via Gradient-Guided Model Perturbation. 2220-2224 - Peili Chen, Linyang He, Li Fu, Lu Fan, Edward F. Chang, Yuanning Li:
Do Self-Supervised Speech and Language Models Extract Similar Representations as Human Brain? 2225-2229 - Kazi Mahmudul Hassan
, Xuyang Zhao, Hidenori Sugano, Toshihisa Tanaka:
Detection of Epileptic Seizures in Long Eeg Recordings Using an Anomaly Detector with Artifact Rejection. 2230-2234 - Aimin Jiang, Shanshan Hou, Yibin Tang, Yanping Zhu:
Joint Spatio-Temporal Filtering of Motion Imagery EEG Signals for Data Alignment in Transfer Learning. 2235-2239 - Ruilin Wang, Xiongfei Li, Mingjie Tian, Feiyang Yang, Xiaoli Zhang:
Patch-Level Knowledge Distillation and Regularization for Missing Modality Medical Image Segmentation. 2240-2244 - Kunpeng Qiu, Zhiying Zhou, Yongxin Guo
:
Learn From Zoom: Decoupled Supervised Contrastive Learning For WCE Image Classification. 2245-2249 - Yue Hu, Huiying Xu, Xinzhong Zhu, Negalign Wake Hundera:
V-DDPM: MRI Rician Noise Removal Model Based on VST and DDPM. 2250-2254 - Veronika Ecker, Marcel Früh, Bin Yang, Sergios Gatidis, Thomas Küstner:
Deep Regression for Biological Age Estimation in Multiple Organs: Investigations on 40, 000 Subjects of the UK Biobank. 2255-2259 - Meisheng Zhang, Chenye Wang, Wenxuan Zou, Xingqun Qi, Muyi Sun, Wanting Zhou:
Contrmix: Progressive Mixed Contrastive Learning for Semi-Supervised Medical Image Segmentation. 2260-2264 - Seorim Hwang, Jaebin Cha, Junyeong Heo, Sungpil Cho, Youngcheol Park:
Multi-Label Abnormality Classification from 12-Lead ECG Using A 2D Residual U-Net. 2265-2269 - Zhiyong Jin
, Guangqi Wen, Peng Cao, Lingwen Liu, Jinzhu Yang, Xinrong Zhu, Osmar R. Zaïane, Fei Wang:
Towards Disease-Aware Self-Supervised Dynamic Brain Network Learning For Mental Diagnosis. 2270-2274 - Yiwen Ruan, Rui Jin, Zhaorui Liu, Caishan Wang, Lei Zhang, Tao Peng:
Delineation of Prostate Cancer Via Enhanced AI-Based Algorithm In Ultrasound Images. 2275-2279 - Jintong Hu, Hui Che, Zishuo Li, Wenming Yang:
Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging. 2280-2284 - Hongyu Shi, Kaizhong Zheng, Huaning Wang, Baojuan Li, Badong Chen:
Predicting RTMS Treatment Effects Using Open-Loop Control and Neural Manifold. 2285-2289 - Li Li, Jiahui He, Yunxin Tang, Youjian Zhang, Jie Wang, Guanqun Zhou, Zhicheng Zhang:
SRECT: Machine-Specific Spatial-Resolution Enhancement in Computed Tomography. 2290-2294 - Jiayu Zhang, Dexuan Xu, Yiwei Lou, Yu Huang:
A Novel Multi-Atlas Fusion Model Based On Contrastive Learning For Functional Connectivity Graph Diagnosis. 2295-2299 - Peng Du, Baijia Ni, Xiaodong Ju, Xingce Wang, Zhongke Wu, Gege Lou, Keying Hua:
3D Automated Quantitative Calculations Based on CT Images of the Hip Joint. 2300-2304 - Suvadeep Maiti, Shivam Kumar Sharma, Raju S. Bapi:
Enhancing Healthcare with EOG: A Novel Approach to Sleep Stage Classification. 2305-2309 - Xiaolong Zhong, Fei Wu, Zhong Yin, Gang Liu:
An Attention-Enhanced Retentive Broad Learning System for Subject-Generic Emotion Recognition from EEG Signals. 2310-2314 - Lang Wang, Peng Jiang, Wensi Duan, Dehua Cao, Baochuan Pang, Juan Liu:
Coupling Self-Supervised and Supervised Contrastive Learning for Multiple Classification of Cervical Cytological Whole Slide Images. 2315-2319 - Siqi Cai, Ran Zhang, Haizhou Li:
Robust Decoding of the Auditory Attention from EEG Recordings Through Graph Convolutional Networks. 2320-2324 - Xiang Li
, Jian Song, Zhigang Zhao, Chunxiao Wang, Dawei Song, Bin Hu:
A Supervised Information Enhanced Multi-Granularity Contrastive Learning Framework for EEG Based Emotion Recognition. 2325-2329 - Chenyi Zhou
, Hualiang Wang, Xiaomeng Li, Wanlu Liu, Zuozhu Liu:
Multimodal Survival Ensemble Network: Integrating Genomic and Histopathological Insights for Enhanced Cancer Prognosis. 2330-2334 - Yingxin Lai, Guoqing Yang, Yifan He, Zhiming Luo, Shaozi Li:
Selective Domain-Invariant Feature for Generalizable Deepfake Detection. 2335-2339 - Gaoxiang Li, Ying Zhang, Yanlin Luo:
Multi-Task Cascaded Attention Network for Brain Tumor Segmentation and Classification. 2340-2344 - Huadeng Wang, Jiejiang Yu, Bingbing Li, Xipeng Pan, Zhenbing Liu, Rushi Lan, Xiaonan Luo:
Gland Segmentation Via Dual Encoders and Boundary-Enhanced Attention. 2345-2349 - Joohyung Lee, Heejeong Nam, Kwanhyung Lee, Sangchul Hahn:
Compact and De-Biased Negative Instance Embedding for Multi-Instance Learning on Whole-Slide Image Classification. 2350-2354 - Zhengda He, Linjie Chen, Jiaying Xu, Hao Lv, Rui-ning Zhou, Jianhua Hu, Yadong Chen, Yang Gao:
TD-GPT: Target Protein-Specific Drug Molecule Generation GPT. 2355-2359 - Nan Ding, Florence Rossant, Hélène Urien, Jérémie Sublime, Paul Bastelica, Christophe Baudouin, Michel Pâques:
A Complete Method for the 3D Reconstruction of Axonal Pathways from 2 Orthogonal 3D OCT Images of the Lamina Cribrosa. 2360-2364 - Changsheng Ma
, Taicheng Guo, Qiang Yang
, Xiuying Chen, Xin Gao
, Shangsong Liang, Nitesh V. Chawla
, Xiangliang Zhang:
A Property-Guided Diffusion Model For Generating Molecular Graphs. 2365-2369 - Yaping Zhao, Edmund Y. Lam:
SASA: Saliency-Aware Self-Adaptive Snapshot Compressive Imaging. 2370-2374 - Mingtao Huang
, Ranhao Zhang, Xueming Li, Yuan Shen:
Fast Alignment Algorithm for Cryo-EM Particle Images Based on Harmonic Analysis. 2375-2379 - Wenbo Li, Zhipeng Mo, Yilin Shen, Hongxia Jin:
Unified Srgb Real Noise Synthesizing with Adaptive Feature Modulation. 2380-2384 - YinWei Du, Jian Wang, Xing Wu, Xian-Hua Han:
Dual Directional Complementary Gradient Fusion and Deep Refinement for Hyperspectral Image Super Resolution. 2385-2389 - Takumi Takabe, Xian-Hua Han, Yen-Wei Chen:
Deep Versatile Hyperspectral Reconstruction Model from A Snapshot Measurement with Arbitrary Masks. 2390-2394 - Jiuqiang Li, Yutong Ke:
Hybrid Convolution-Transformer for Lightweight Single Image Super-Resolution. 2395-2399 - Zean Chen
, Yeyao Chen, Mei Yu, Haiyong Xu, Gangyi Jiang:
Hybrid Domain Learning towards Light Field Spatial Super-Resolution using Heterogeneous Imaging. 2400-2404 - Quanquan Xiao, Haiyan Jin, Haonan Su, Fengyuan Zuo, Yuanlin Zhang, Zhaolin Xiao, Bin Wang:
SPGFusion: A Semantic Prior Guided Infrared and Visible Image Fusion Network. 2405-2409 - Jiazhang Zheng, Lei Li, Qiuping Liao, Cheng Li, Li Li, Yangxing Liu:
Darkshot: Lighting Dark Images with Low-Compute and High-Quality. 2410-2414 - Yadong Li, Dongheng Zhang, Ruixu Geng, Jincheng Wu, Yang Hu, Qibin Sun, Yan Chen:
IFNet: Imaging and Focusing Network for handheld mmWave Devices. 2415-2419 - Refaldi I. D. Putra, Tatsuya Ishikawa, Naomi Simumba, Michiaki Tatsubori:
Sandwiched Lo-Res Simulation for Scalable Flood Modeling. 2420-2424 - Wenwu Gong, Zhejun Huang
, Lili Yan:
Enhanced Low-Rank and Sparse Tucker Decomposition For Image Completion. 2425-2429 - Chen-Bin Feng
, Jie Zhang, Jiaxue Li, Yicong Zhou:
Seam Mask Guided Partial Reconstruction with Quantum-Inspired Local Aggregation For Deep Image Stitching. 2430-2434 - Shijie Zhang, Boyan Jiang, Keke He, Junwei Zhu, Ying Tai, Chengjie Wang, Yinda Zhang, Yanwei Fu:
T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image. 2435-2439 - Denghui Yang, Yifan Ding, Hao Zhang, Yizhou Li:
PVitNet: An Effective Approach for Android Malware Detection Using Pyramid Feature Processing and Vision Transformer. 2440-2444 - Siwei Li, Mingxuan Liu, Yating Zhang, Shu Chen, Haoxiang Li, Zifei Dou, Hong Chen:
SAM-DEBLUR: Let Segment Anything Boost Image Deblurring. 2445-2449 - Qian Li, Rao Fu, Cheng Wen:
Reference Line Network: On Simultaneous Gaussian Line Detection and Connection Graph Inference. 2450-2454 - Simon Welker, Tal Peer, Henry N. Chapman, Timo Gerkmann:
Live Iterative Ptychography with Projection-Based Algorithms. 2455-2459 - Jorge Bacca, Brayan Monroy, Henry Arguello:
Deep Plug-and-Play Algorithm for Unsaturated Imaging. 2460-2464 - Tom Tirer:
Iteratively Preconditioned Guidance of Denoising (Diffusion) Models For Image Restoration. 2465-2469 - Sreemanti Dey, Snigdha Saha
, Berthy T. Feng, Manxiu Cui, Laure Delisle, Oscar Leong, Lihong V. Wang, Katherine L. Bouman:
Score-based Diffusion Models for Photoacoustic Tomography Image Reconstruction. 2470-2474 - Yvette Y. Lin, Angela F. Gao, Katherine L. Bouman:
Imaging An Evolving Black Hole By Leveraging Shared Structure. 2475-2479 - Jixuan Liang, Yanshan Li:
A Fast Blind Deblurring Algorithm Using Local Gradient Product Prior. 2480-2484 - Jiabao Li
, Yuqi Li, Ciliang Sun, Chong Wang, Jinhui Xiang:
SPEC-NERF: Multi-Spectral Neural Radiance Fields. 2485-2489 - Ziwen Li, Bo Xu, Cheng Lu:
KD-Former: Transformer Knowledge Distillation for Image Matting. 2490-2494 - Shengli Yan, Yuan Rao, Wenhui Hou:
Detection in Complex Scenes Using Rgb and Depth Multimodal Feature Fusion. 2495-2499 - Xian-Hua Han, Huiyan Jiang, Yen-Wei Chen:
Hyperspectral Image Reconstruction Using Hierarchical Neural Architecture Search from A Snapshot Image. 2500-2504 - Jorge Bacca, Marcus Carlsson, Brayan Monroy, Henry Arguello:
Plug-And-Play Algorithm Coupled with Low-Rank Quadratic Envelope Regularization for Compressive Spectral Imaging. 2505-2509 - Jia Chen, Jinlong Qin, Saishang Zhong, Kai Yang
, Xinrong Hu, Tao Peng, Rui Li:
SGM: A Dataset for 3D Garment Reconstruction from Single Hand-Drawn Sketch. 2510-2514 - Kartheek Kumar Reddy Nareddy, Abijith Jagannath Kamath, Chandra Sekhar Seelamantula:
Image Restoration with Generalized L2 Loss and Convergent Plug-and-Play Priors. 2515-2519 - Ryosuke Isono, Shunsuke Ono:
Temporally-Guided Total Variation For Robust Spatiotemporal Fusion Of Satellite Images. 2520-2524 - Abhishek Shreekant Bhandiwad, Abijith Jagannath Kamath, Siddarth Asokan, Chandra Sekhar Seelamantula:
Variational Analysis of Adversarial Regularization for Solving Inverse Problems. 2525-2529 - Aleksei Sholokhov, Joshua Rapp, Saleh Nabi, Steven L. Brunton, J. Nathan Kutz, Hassan Mansour:
Single-Pixel Imaging Of Dynamic Flows Using Neural Ode Regularization. 2530-2534 - Robinson Czajkowski, John Murray-Bruce:
Two-Edge-Resolved 3d Non-Line-of-Sight Imaging: A Fisher Information Equalized Discretization. 2535-2539 - Zheng Zhou, Peter Gerstoft, Kim Olsen:
Fusion of Multi-Resolution Seismic Tomography Maps with Physics-Informed Probability Graphical Models. 2540-2544 - Saishang Zhong, Jiashu Wang, Xinrong Hu:
PMDI: Combining Parametric-Model and Depth-Aware Implicit Function for Single-View Human Reconstruction. 2545-2549 - Alexander Lin, Demba E. Ba:
An Efficient Algorithm For Clustered Multi-Task Compressive Sensing. 2550-2554 - Tianbo Liu, Songping Mai, Xiaoyu Wang:
Deep Learning Based Single-Shot Profilometry by Three-Channel Binary-Defocused Projection. 2555-2559 - Zhuofeng Wu, Yusuke Monno, Masatoshi Okutomi:
Self-Supervised Spatially Variant PSF Estimation for Aberration-Aware Depth-from-Defocus. 2560-2564 - Yousef Kotp
, Marwan Torki:
Flare-Free Vision: Empowering Uformer with Depth Insights. 2565-2569 - Wenjiao Bian, Yusuke Monno, Masatoshi Okutomi:
Reflection Removal Using Recurrent Polarization-to-Polarization Network. 2570-2574 - Xun Wu, Fanqing Meng, Yaqi Wu, Jiawei Zhang, Feng Zhang:
An Efficient Transformer For Demosaicing Via Compressed Multi-Branch Attention Mechanism. 2575-2576 - Yanting Wang, Feng Li, Han Zhang
:
TA2P: Task-Aware Adaptive Pruning Method for Image Classification on Edge Devices. 2580-2584 - Tingyou Li, Zixin Xu, Yong S. Chu, Xiaojing Huang, Jizhou Li:
Coordinate-Based Neural Network for Fourier Phase Retrieval. 2585-2589 - Daniele Picone, Mohamad Jouni, Mauro Dalla Mura:
Spectro-Spatial Hyperspectral Image Reconstruction From Interferometric Acquisitions. 2590-2594 - Shihui Zhang, Ziteng Xue, Yuhong Jiang, Houlin Wang:
Opnet: Deep Occlusion Perception Network with Boundary Awareness for Amodal Instance Segmentation. 2595-2599 - Ling Lin, Congcong Zhu, Lin Zhou, Jingrun Chen:
Toward Quantifiable Face age Transformation. 2600-2604 - Rao Fu, Cheng Wen, Qian Li:
IMFIT: Normal Estimation via Learning Neural Implicit Surface. 2605-2609 - Zhenhu Zhang, Xin Cao, Li Jin, Xueying Qin, Ruofeng Tong:
Semi-Decoupled 6D Pose Estimation via Multi-Modal Feature Fusion. 2610-2614 - Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin:
DAP: Domain-Aware Prompt Learning for Vision-and-Language Navigation. 2615-2619 - Shansi Zhang, Edmund Y. Lam:
Unsupervised Disparity Estimation for Light Field Videos. 2620-2624 - Chunqing Ruan, Mengzhu Wang, Shanshan Wang, Tianyi Liang, Wei Yu:
SBM: Smoothness-Based Minimization for Domain Generalization. 2625-2629 - Haoxing Chen, Yaohui Li, Zhangxuan Gu, Zhuoer Xu, Jun Lan, Huaxiong Li:
Segment Anything Model Meets Image Harmonization. 2630-2634 - Tian Yang, Cong Shen, Tiantian Yuan:
CoSLR: Contrastive Chinese Sign Language Recognition with prior knowledge And Multi-Tasks Joint Learning. 2635-2639 - Jucai Zhai, Yang Liu, Pengcheng Zeng, Chihao Ma, Xinan Wang, Yong Zhao:
Efficient Fusion of Depth Information for Defocus Deblurring. 2640-2644 - Kun Hu, Zhaoyangfan Huang, Xingjun Wang:
Highlight Removal Network Based on an Improved Dichromatic Reflection Model. 2645-2649 - Yinghui Xing, Litao Qu
, Kai Zhang, Yan Zhang, Xiuwei Zhang, Yanning Zhang:
Complementary Fusion Network Based on Frequency Hybrid Attention for Pansharpening. 2650-2654 - Chao Yang, Yong Fan, Cheng Lu:
Dropout Multi-Head Attention for Single Image Super-Resolution. 2655-2659 - Shang Gao
, Chenyang Yu, Pingping Zhang, Huchuan Lu:
Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-Identification. 2660-2664 - Wenjie Liu, Xinlong Shi, Xianzhong Liu:
Flipping Consistent and Counterfactual Attention Network for Facial Expression Recognition. 2665-2669 - Xingshuo Han, Xiao Wang, Kui Jiang, Wei Liu, Ruimin Hu, Xuefeng Pan, Xin Xu:
Mutuality Attribute Makes Better Video Anomaly Detection. 2670-2674 - Lijun Wang:
Multi-Modality Conditional Diffusion Model for Time Series Forecasting of Live Sales Volume. 2675-2679 - Chao Wang, Yubiao Yue
, Bingchun Luo, Yujie Chen, Jun Xue:
PseKD: Phase-Shift Encoded Knowledge Distillation for Oriented Object Detection in Remote Sensing Images. 2680-2684 - Jiuqiang Li, Shilei Zhu:
Channel-Spatial Transformer for Efficient Image Super-Resolution. 2685-2689 - Yueqian Quan, Honghui Xu, Yidong Yan, Hang Zheng, Jianwei Zheng:
HMNet: Hierarchical Microscale-Aware Network for Infrared Small Target Detection. 2690-2694 - Lei Zhao
, Xiao-Lei Zhang:
A Hierarchical Multi-Proxy Loss with Dynamic Main-Proxy for Deep Metric Learning. 2695-2699 - Ruofei Wang, Renjie Wan
, Zongyu Guo, Qing Guo, Rui Huang:
SPY-Watermark: Robust Invisible Watermarking for Backdoor Attack. 2700-2704 - Hui Zhang, Bingran Kuang, Yajie Zhao:
Camera Calibration using a Single View of a Symmetric Object. 2705-2709 - Soojung Hong, Kwanghee Choi:
Correcting Faulty Road Maps by Image Inpainting. 2710-2714 - Jinyu Shi, Wenjie Wu:
SRP-UOD: Multi-Branch Hybrid Network Framework Based on Structural Re-Parameterization for Underwater Small Object Detection. 2715-2719 - Taiwei Zhang, Zhenghui Hu, Weixin Li, Qingjie Liu, Yunhong Wang:
Read, Spell and Repeat: Scene Text Recognition with Vision-Language Circular Refinement. 2720-2724 - Deyi Ji, Siqi Gao, Mingyuan Tao, Hongtao Lu, Feng Zhao:
Changenet: Multi-Temporal Asymmetric Change Detection Dataset. 2725-2729 - Zhangxuan Gu, Haoxing Chen, Zhuoer Xu:
Diffusioninst: Diffusion Model for Instance Segmentation. 2730-2734 - Wang Yin
, Peng Lu, Xujun Peng:
COLORFLOW: A Conditional Normalizing Flow for Image Colorization. 2735-2739 - Xiaoyan Tian
, Ye Jin, Zhao Zhang, Peng Liu, Xianglong Tang:
MTIDNet: A Multimodal Temporal Interest Detection Network for Video Summarization. 2740-2744 - Masoud Mokhtari
, Fatemeh Taheri Dezaki, Timo Bolkart, Betty Mohler Tesch, Rahul Suresh, Amin Banitalebi-Dehkordi:
Skin Tone Disentanglement in 2D Makeup Transfer With Graph Neural Networks. 2745-2749 - Eungi Lee, Eung-Joo Lee, Syed Muhammad Anwar, Seok Bong Yoo:
Child FER: Domain-Agnostic Facial Expression Recognition in Children Using a Secondary Image Diffusion Model. 2750-2754 - Lijian Yang
, Jian-Xun Mi, Guofen Wang, Weisheng Li:
Window-Based Convolutional Sparse Coding: Towards A Unified Framework. 2755-2759 - Pengwei Yin, Jingjing Wang, Jiawu Dai, Xiaojun Wu:
NERF-GAZE: A Head-Eye Redirection Parametric Model for Gaze Estimation. 2760-2764 - Xuyang Liu
, Siteng Huang
, Yachen Kang, Honggang Chen, Donglin Wang:
VGDIFFZERO: Text-To-Image Diffusion Models Can Be Zero-Shot Visual Grounders. 2765-2769 - Wei Ji, You Qin, Long Chen, Yinwei Wei, Yiming Wu, Roger Zimmermann:
Mrtnet: Multi-Resolution Temporal Network for Video Sentence Grounding. 2770-2774 - Lei Liao, Mao Feng, Meng Yang
:
Human Guided Cross-Modal Reasoning with Semantic Attention Learning for Visual Question Answering. 2775-2779 - Yuan Cao
, Di Jiang, Guanqun Hou, Fan Deng, Xinjia Chen, Qiang Yang:
Learn to Cluster Faces with Better Subgraphs. 2780-2784 - Hengsheng Zhang, Xinning Chai, Yuhong Zhang, Rong Xie, Li Song:
Hdrtvformer: Efficient Sdrtv-to-Hdrtv via Affine Transformation and Spatial-Aware Transformer. 2785-2789 - Wenbo Zhou, Dongdong Chen, Jing Liao
, Jie Zhang, Kejiang Chen, Weiming Zhang, Nenghai Yu:
Attribute-Aware Head Swapping Guided by 3d Modeling. 2790-2794 - Elena Camuffo, Umberto Michieli, Jijoong Moon, Daehyun Kim, Mete Ozay:
FFT-Based Selection and Optimization of Statistics for Robust Recognition of Severely Corrupted Images. 2795-2799 - Manuel Lage Cañellas, Constantino Álvarez Casado
, Le Nguyen, Miguel Bordallo López
:
Estimating Exercise-Induced Fatigue from Thermal Facial Images. 2800-2804 - Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu:
Image Aesthetics Assessment Via Learnable Queries. 2805-2809 - Babak Naderi, Ross Cutler:
A Crowdsourcing Approach to Video Quality Assessment. 2810-2814 - Ting Li, Jianshu Chao, Deyu An:
Style Adaptation for Domain-Adaptive Semantic Segmentation. 2815-2819 - Zhidan Ran, Xiaobo Lu, Wei Liu:
Anomaly-Aware Semantic Self-Alignment Framework for Video-Based Person Re-Identification. 2820-2824 - Takuya Fujihashi, Sorachi Kato
, Toshiaki Koike-Akino:
Implicit Neural Representation For Low-Overhead Graph-Based Holographic-Type Communications. 2825-2829 - Masato Fujitake:
RL-LOGO: Deep Reinforcement Learning Localization for Logo Recognition. 2830-2834 - Songqi Pan, Sheng Liu, Yuan Feng, Yineng Zhang, Xiaopeng Tian, Jiantao Yang:
POSE-HMR: Heuristic Transformer with Postural Prior Constraints for 3D Human Mesh Reconstruction. 2835-2839 - Han Gao, Hao Wu, Peiwen Dong, Yixin Xu
, Fengyuan Xu, Sheng Zhong:
MuSR: Multi-Scale 3D Scenes Reconstruction based on Monocular Video. 2840-2844 - Jiaqi Su, Weiran Chen, Yi Ji, Chunping Liu:
Glocal Cascading Network for Topic Enhanced Visual Storytelling. 2845-2849 - Jia-Wei Ma
, Min Liang, Haixia Man, Shu Tian, Jingyan Qin, Xu-Cheng Yin:
Attention Decoupling for Query-Based Object Detection. 2850-2854 - Jiayu Yang, Chunhui Yang, Yongqi Zhai, Qi Wang, Xinghao Pan, Ronggang Wang:
Improving Learned Video Compression by Exploring Spatial Redundancy. 2860-2864 - Driton Salihu, Adam Misik, Yuankai Wu, Constantin Patsch, Eckehard G. Steinbach:
NPRF: Neural Painted Radiosity Fields for Neural Implicit Rendering and Surface Reconstruction. 2865-2869 - Xin Li, Feng Xu, Runliang Xia, Nan Xu
, Fan Liu, Chi Yuan, Qian Huang, Xin Lyu:
Locality-Enhanced Transformer for Semantic Segmentation of High-Resolution Remote Sensing Images. 2870-2874 - Zifan Yu, Erfan Bank Tavakoli, Meida Chen, Suya You, Raghuveer Rao, Sanjeev Agarwal, Fengbo Ren:
Tokenmotion: Motion-Guided Vision Transformer for Video Camouflaged Object Detection VIA Learnable Token Selection. 2875-2879 - Qilei Li
, Jiabo Huang, Jian Hu, Shaogang Gong:
Feature-Distribution Perturbation and Calibration for Generalized Reid. 2880-2884 - Jiexin Wang, Jiahao Chen, Bing Su:
Domain-Adaptive and Subgroup-Specific Cascaded Temperature Regression for Out-of-Distribution Calibration. 2885-2889 - Chaofei Wang, Xiangan Zhao, Kai Wang, Shuai Wu, Jiayu Xiao, Guotong Geng:
ADIFT: Zero-Shot Generative Model Adaption Via Adaptive Domain-Invariant Feature Transfer. 2890-2894 - Tianle Lv, Shuang Li, Jiaxu Leng, Xinbo Gao:
MGRL: Mutual-Guidance Representation Learning for Text-to-Image Person Retrieval. 2895-2899 - Sidun Liu, Peng Qiao, Yong Dou:
Improving Motion Deblur By Multi-Output Learning. 2900-2904 - Shanzhi Yin
, Tongda Xu, Yongsheng Liang, Yuanyuan Wang, Yanghao Li, Yan Wang, Jingjing Liu:
Bandwidth-Efficient Inference for Nerual Image Compression. 2905-2909 - Xiao Liu, Guangyi Chen, Yansong Tang, Guangrun Wang, Xiao-Ping Zhang, Ser-Nam Lim:
Language-Free Compositional Action Generation via Decoupling Refinement. 2910-2914 - Li Yao, Ao Gao, Yan Wan:
REGIR: Refined Geometry for Single-Image Implicit Clothed Human Reconstruction. 2915-2919 - Jiancheng Huang, Yifan Liu, Jiaxi Lv, Shifeng Chen:
Entwined Inversion: Tune-Free Inversion For Real Image Faithful Reconstruction and Editing. 2920-2924 - Rokia Abdein, Xuezhi Xiang, Yiming Chen, Mingliang Zhai, Abdulmotaleb El-Saddik:
Self-Supervised Multi-Scale Hierarchical Refinement Method for Joint Learning of Optical Flow and Depth. 2925-2929 - Yican Liu, Jiacheng Li
, Delu Zeng:
Low Redundant Attention Network for Efficient Image Super-Resolution. 2930-2954 - Yanhui Guo, Fangzhou Luo, Shaoyuan Xu:
Self-Supervised Face Image Restoration with a One-Shot Reference. 2930-2934 - Xukai Zhao, Yuxing Lu, Jinzhuo Wang:
Multiscale Scoring Model for Enhanced Urban Perception Evaluation. 2935-2939 - Yuxuan Zhou, Liangcai Gao, Zhi Tang, Baole Wei:
Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution. 2940-2944 - Shuo Zhang
, Jing Liu:
Feature-Constrained and Attention-Conditioned Distillation Learning for Visual Anomaly Detection. 2945-2949 - Wei Jiang
, Junru Li, Kai Zhang, Li Zhang:
LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression. 2955-2959 - Keli Deng, Peng Wang, Yuntao Qian:
RGB Images Enhancing Hyperspectral Image Denoising with Diffusion Model. 2960-2964 - Zicheng Zhang, Yingjie Zhou, Chunyi Li, Kang Fu, Wei Sun, Xiaohong Liu
, Xiongkuo Min, Guangtao Zhai:
A Reduced-Reference Quality Assessment Metric for Textured Mesh Digital Humans. 2965-2969 - Xiaolu Chen, Haote Xu, Chenghao Deng, Xiaotong Tu, Xinghao Ding, Yue Huang:
Implicit Foreground-Guided Network for Anomaly Detection and Localization. 2970-2974 - Bruno Korbar, Jaesung Huh, Andrew Zisserman:
Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling. 2975-2979 - Shaoxu Li, Ye Pan:
Instant Photorealistic Neural Radiance Fields Stylization. 2980-2984 - Xiangbo Gao, Qinliang Lin, Cheng Luo, Weicheng Xie, Linlin Shen, Keerthy Kusumam, Siyang Song:
Scale-Free And Task-Generic Attack: Generating Photo-Realistic Adversarial Patterns With Patch Quilting Generator. 2985-2989 - Jianan Wang, Zhiliang Wu, Hanyu Xuan, Yan Yan:
Text-Video Completion Networks With Motion Compensation And Attention Aggregation. 2990-2994 - Jing Zhang, Tengfei Zhao, Shiyu Hu, Xin Zhao
:
Robust Single-Particle Cryo-Em Image Denoising and Restoration. 2995-2999 - Feng Zhou, Pei Shen, Ju Dai, Na Jiang, Yong Hu, Yu-Kun Lai, Paul L. Rosin:
AHRNET: Attention and Heatmap-Based Regressor for Hand Pose Estimation and Mesh Recovery. 3000-3004 - Cunjuan Zhu, Dongdong Cui, Qi Jia, Weimin Wang, Yu Liu, Michael S. Lew:
Sketch-Based 3D Shape Retrieval With Multi-View Fusion Transformer. 3005-3009