


default search action
ICASSP 2022: Virtual and Singapore
- IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. IEEE 2022, ISBN 978-1-6654-0541-6

- Shibo Zhang, Ebrahim Nemati, Minh Dinh, Nathan Folkman, Tousif Ahmed

, Md. Mahbubur Rahman, Jilong Kuang, Nabil Alshurafa, Alex Gao:
Coughtrigger: Earbuds IMU Based Cough Detection Activator Using An Energy-Efficient Sensitivity-Prioritized Time Series Classifier. 1-5 - Hoang Truong, Alessandro Montanari, Fahim Kawsar:

Non-Invasive Blood Pressure Monitoring with Multi-Modal In-Ear Sensing. 6-10 - Xiaolu Zeng, Beibei Wang, Chenshu Wu, Sai Deepika Regani, K. J. Ray Liu:

Intelligent Wi-Fi Based Child Presence Detection System. 11-15 - Wenxuan Li, Dongheng Zhang, Yadong Li, Zhi Wu, Jinbo Chen, Dong Zhang, Yang Hu, Qibin Sun, Yan Chen:

Real-Time Fall Detection Using Mmwave Radar. 16-20 - Dae Yon Hwang

, Pai Chet Ng, Yuanhao Yu
, Yang Wang, Petros Spachos, Dimitrios Hatzinakos, Konstantinos N. Plataniotis:
Hierarchical Deep Learning Model with Inertial and Physiological Sensors Fusion for Wearable-Based Human Activity Recognition. 21-25 - Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao, Tei-Wei Kuo

:
Speech Recovery For Real-World Self-Powered Intermittent Devices. 26-30 - Ai Okano, Yoshinobu Kajikawa:

Phase Control of Parametric Array Loudspeaker by Optimizing Sideband Weights. 31-35 - Florian Scalvini

, Camille Bordeau
, Maxime Ambard, Cyrille Migniot, Julien Dubois
:
Low-Latency Human-Computer Auditory Interface Based on Real-Time Vision Analysis. 36-40 - Akihiko Sugiyama:

Robust Adaptive Noise Canceller Algorithm with Snr-Based Stepsize Control and Noise-Path Gain Compensation. 41-45 - Chao Liu, Linlin Gao, Ruobing Jiang

:
Neartracker: Acoustic 2-D Target Tracking with Nearby Reflector in Siso System. 46-50 - Harinarayanan. E. V, Sachin Ghanekar:

An Efficient Method For Generic Dsp Implementation Of Dilated Convolution. 51-55 - Yu-Shan Tai

, Chieh-Fang Teng, Cheng-Yang Chang, An-Yeu Andy Wu:
Compression-Aware Projection with Greedy Dimension Reduction for Convolutional Neural Network Activations. 56-60 - Simon Narduzzi

, Siavash Arjomand Bigdeli, Shih-Chii Liu, L. Andrea Dunbar:
Optimizing The Consumption Of Spiking Neural Networks With Activity Regularization. 61-65 - Sujan Kumar Gonugondla, Naresh R. Shanbhag:

IMPQ: Reduced Complexity Neural Networks Via Granular Precision Assignment. 66-70 - Youngeun Kim, Hyoungseob Park, Abhishek Moitra, Abhiroop Bhattacharjee, Yeshwanth Venkatesha, Priyadarshini Panda:

Rate Coding Or Direct Coding: Which One Is Better For Accurate, Robust, And Energy-Efficient Spiking Neural Networks? 71-75 - Linghao Song, Yuze Chi, Jason Cong:

PYXIS: An Open-Source Performance Dataset Of Sparse Accelerators. 76-80 - Zuozhou Pan

, Zhiping Lin, Yuanjin Zheng, Zong Meng:
Fast Fault Diagnosis Method Of Rolling Bearings In Multi-Sensor Measurement Enviroment. 81-85 - Diaa Badawi, Ishaan Bassi, Sule Ozev, Ahmet Enis Çetin

:
Detecting Anomaly in Chemical Sensors via Regularized Contrastive Learning. 86-90 - Cheng Tang

, Junkai Ji, Qiuzhen Lin
, Yan Zhou:
Evolutionary Neural Architecture Design of Liquid State Machine for Image Classification. 91-95 - Huy Phan

, Yi Xie, Jian Liu, Yingying Chen, Bo Yuan:
Invisible and Efficient Backdoor Attacks for Compressed Deep Neural Networks. 96-100 - Cheng-Hung Lo, Pei-Yun Tsai:

Tensor-Based Orthogonal Matching Pursuit with Phase Rotation for Channel Estimation In Hybrid Beamforming Mimo-Ofdm Systems. 101-105 - Darius Petermann, Minje Kim:

Spain-Net: Spatially-Informed Stereophonic Music Source Separation. 106-110 - Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy:

Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing. 111-115 - Haici Yang, Shivani Firodiya, Nicholas J. Bryan, Minje Kim:

Don't Separate, Learn To Remix: End-To-End Neural Remixing With Joint Optimization. 116-120 - Yu Wang, Daniel Stoller, Rachel M. Bittner, Juan Pablo Bello

:
Few-Shot Musical Source Separation. 121-125 - Ethan Manilow, Patrick O'Reilly, Prem Seetharaman, Bryan Pardo:

Source Separation By Steering Pretrained Music Models. 126-130 - Xuewen Yao

, Megan Micheletti
, Mckensey Johnson, Edison Thomaz, Kaya de Barbaro
:
Infant Crying Detection In Real-World Environments. 131-135 - Qin Zhang, Qingming Tang, Chieh-Chi Kao, Ming Sun, Yang Liu, Chao Wang:

Wikitag: Wikipedia-Based Knowledge Embeddings Towards Improved Acoustic Event Classification. 136-140 - Magdalena Fuentes

, Bea Steers, Pablo Zinemanas
, Martín Rocamora
, Luca Bondi
, Julia Wilkins, Qianyi Shi, Yao Hou, Samarjit Das, Xavier Serra, Juan Pablo Bello
:
Urban Sound & Sight: Dataset And Benchmark For Audio-Visual Urban Scene Understanding. 141-145 - Sai Srinadhu Katta, Kide Vuojärvi, Sivaprasad Nandyala

, Ulla-Maria Kovalainen, Lauren Baddeley:
Real-World On-Board Uav Audio Data Set For Propeller Anomalies. 146-150 - Yuan Gong

, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. 151-155 - Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe

, Shoichiro Saito, Yasuhiro Oikawa:
Wearable Seld Dataset: Dataset For Sound Event Localization And Detection Using Wearable Devices Around Head. 156-160 - Viet-Anh Nguyen, Anh H. T. Nguyen, Andy W. H. Khong:

Tunet: A Block-Online Bandwidth Extension Model Based On Transformers And Self-Supervised Pretraining. 161-165 - Jinjiang Liu, Xueliang Zhang:

DRC-NET: Densely Connected Recurrent Convolutional Neural Network for Speech Dereverberation. 166-170 - Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann

:
Customizable End-To-End Optimization Of Online Neural Network-Supported Dereverberation For Hearing Devices. 171-175 - Naoyuki Kamo, Rintaro Ikeshita, Keisuke Kinoshita

, Tomohiro Nakatani:
Importance of Switch Optimization Criterion in Switching WPE Dereverberation. 176-180 - Ziyu Wang, Dejing Xu, Gus Xia

, Ying Shan:
Audio-To-Symbolic Arrangement Via Cross-Modal Music Representation Learning. 181-185 - Shiqi Wei, Gus Xia

, Yixiao Zhang, Liwei Lin, Weiguo Gao:
Music Phrase Inpainting Using Long-Term Representation and Contrastive Loss. 186-190 - Yi Zou, Pei Zou, Yi Zhao, Kaixiang Zhang, Ran Zhang, Xiaorui Wang:

Melons: Generating Melody With Long-Term Structure Using Transformers And Structure Graph. 191-195 - Moyu Terao, Yuki Hiramatsu, Ryoto Ishizuka, Yiming Wu, Kazuyoshi Yoshii:

Difficulty-Aware Neural Band-to-Piano Score Arrangement based on Note- and Statistic-Level Criteria. 196-200 - Pedro Ramoneda, Nazif Can Tamer, Vsevolod Eremenko

, Xavier Serra, Marius Miron:
Score Difficulty Analysis for Piano Performance Education based on Fingering. 201-205 - Zhipeng Chen, Yiya Hao, Yaobin Chen, Gong Chen, Liang Ruan:

A Neural Network-based Howling Detection Method for Real-Time Communication Applications. 206-210 - Tomer Fireaizen, Saar Ron, Omer Bobrowski

:
Alarm Sound Detection Using Topological Signal Processing. 211-215 - Osamu Ichikawa, Yuuto Shima, Takahiro Nakayama, Hajime Shirouzu:

A Method For Estimating The Grouping Of Participants In Classroom Group Work Using Only Audio Information. 216-220 - Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, Yohei Kawaguchi:

Environmental Sound Extraction Using Onomatopoeic Words. 221-225 - Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito:

Echo-Aware Adaptation of Sound Event Localization and Detection in Unknown Environments. 226-230 - Juncheng B. Li, Shuhui Qu, Xinjian Li, Bernie Po-Yao Huang, Florian Metze:

On Adversarial Robustness Of Large-Scale Audio Visual Learning. 231-235 - Haibin Wu, Po-Chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-Yi Lee:

Adversarial Sample Detection for Speaker Verification by Neural Vocoders. 236-240 - Naoya Takahashi, Yuki Mitsufuji:

Amicable Examples for Informed Source Separation. 241-245 - David M. Chan

, Shalini Ghosh, Debmalya Chakrabarty, Björn Hoffmeister:
Multi-Modal Pre-Training for Automated Speech Recognition. 246-250 - Ryota Tsunoda, Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yoshie Imai:

Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss. 251-255 - Yifei Wu, Chenda Li, Jinfeng Bai, Zhongqin Wu, Yanmin Qian:

Time-Domain Audio-Visual Speech Separation on Low Quality Videos. 256-260 - Mhd Modar Halimeh

, Walter Kellermann:
Complex-Valued Spatial Autoencoders for Multichannel Speech Enhancement. 261-265 - Zhi-Wei Tan, Anh H. T. Nguyen, Yuan Liu

, Andy W. H. Khong:
Multichannel Noise Reduction Using Dilated Multichannel U-Net and Pre-Trained Single-Channel Network. 266-270 - Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang:

One Model to Enhance Them All: Array Geometry Agnostic Multi-Channel Personalized Speech Enhancement. 271-275 - Cong Han, Emine Merve Kaya, Kyle Hoefer, Malcolm Slaney, Simon Carlile:

Multi-Channel Speech Denoising for Machine Ears. 276-280 - Zhong-Qiu Wang, DeLiang Wang:

Localization based Sequential Grouping for Continuous Speech Separation. 281-285 - Mieszko Fras, Marcin Witkowski

, Konrad Kowalczyk
:
Convolutional Weighted Minimum Mean Square Error Filter for Joint Source Separation and Dereverberation. 286-290 - Ethan Manilow, Curtis Hawthorne, Cheng-Zhi Anna Huang, Bryan Pardo, Jesse H. Engel:

Improving Source Separation by Explicitly Modeling Dependencies between Sources. 291-295 - Yuichiro Koyama, Naoki Murata, Stefan Uhlich, Giorgio Fabbro, Shusuke Takahashi, Yuki Mitsufuji:

Music Source Separation With Deep Equilibrium Models. 296-300 - Natsuki Akaishi, Kohei Yatabe

, Yasuhiro Oikawa:
Harmonic and Percussive Sound Separation Based on Mixed Partial Derivative of Phase Spectrogram. 301-305 - Enric Gusó

, Jordi Pons, Santiago Pascual, Joan Serrà:
On Loss Functions and Evaluation Metrics for Music Source Separation. 306-310 - Sangwook Park, Mounya Elhilali:

Time-Balanced Focal Loss for Audio Event Detection. 311-315 - Kazuki Shimada

, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji:
Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training. 316-320 - Arman Zharmagambetov, Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun, Viktor Rozgic, Jasha Droppo, Chao Wang:

Improved Representation Learning For Acoustic Event Classification Using Tree-Structured Ontology. 321-325 - Sandeep Kothinti, Mounya Elhilali:

Temporal Contrastive-Loss for Audio Event Detection. 326-330 - Xu Wang, Xiangjinzi Zhang, Yunfei Zi

, Shengwu Xiong:
A Frame Loss of Multiple Instance Learning for Weakly Supervised Sound Event Detection. 331-335 - Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:

Pseudo Strong Labels for Large Scale Weakly Supervised Audio Tagging. 336-340 - Wenyu Jin, Tim Schoof, Henning F. Schepker:

Individualized Hear-Through For Acoustic Transparency Using PCA-Based Sound Pressure Estimation At The Eardrum. 341-345 - Benjamin Lentz

, Rainer Martin
, Kirsten Oberländer, Christiane Völter:
On Spectral and Temporal Sparsification of Speech Signals for the Improvement of Speech Perception in CI Listeners. 346-350 - Fotios Drakopoulos, Sarah Verhulst:

A Differentiable Optimisation Framework for The Design of Individualised DNN-based Hearing-Aid Strategies. 351-355 - Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Xiaofei Wang, Zhuo Chen, Xuedong Huang:

Personalized speech enhancement: new models and Comprehensive evaluation. 356-360 - Jinxu Xiang

, Yuyang Zhu, Rundi Wu
, Ruilin Xu, Yuko Ishiwaka, Changxi Zheng:
Dynamic Sliding Window for Realtime Denoising Networks. 361-365 - Sunwoo Kim, Minje Kim

:
Bloom-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement. 366-370 - Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang:

HGCN: Harmonic Gated Compensation Network for Speech Enhancement. 371-375 - Wenbin Jiang, Zhijun Liu, Kai Yu, Fei Wen:

Speech Enhancement with Neural Homomorphic Synthesis. 376-380 - Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

:
A Bayesian Permutation Training Deep Representation Learning Method for Speech Enhancement with Variational Autoencoder. 381-385 - Huajian Fang, Tal Peer, Stefan Wermter

, Timo Gerkmann
:
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement. 386-390 - Viet Anh Trinh, Sebastian Braun:

Unsupervised Speech Enhancement with Speech Recognition Embedding and Disentanglement Losses. 391-395 - Xianke Wang, Wei Xu, Weiming Yang, Wenqing Cheng:

Musicyolo: A Sight-Singing Onset/Offset Detection Framework Based on Object Detection Instead of Spectrum Frames. 396-400 - Yun-Ning Hung, Ju-Chiang Wang, Xuchen Song, Wei Tsung Lu, Minz Won:

Modeling Beats and Downbeats with a Time-Frequency Transformer. 401-405 - Michael Krause

, Meinard Müller:
Hierarchical Classification of Singing Activity, Gender, and Type in Complex Music Recordings. 406-410 - Qiqi He, Xiaoheng Sun, Yi Yu, Wei Li:

Deepchorus: A Hybrid Model of Multi-Scale Convolution And Self-Attention for Chorus Detection. 411-415 - Ju-Chiang Wang, Yun-Ning Hung, Jordan B. L. Smith:

To Catch A Chorus, Verse, Intro, or Anything Else: Analyzing a Song with Structural Functions. 416-420 - Mojtaba Heydari, Matthew C. McCallum, Andreas F. Ehmann, Zhiyao Duan:

A Novel 1D State Space for Efficient Music Rhythmic Analysis. 421-425 - Haici Yang, Sanna Wager, Spencer Russell, Mike Luo, Minje Kim, Wontak Kim:

Upmixing Via Style Transfer: A Variational Autoencoder for Disentangling Spatial Images And Musical Content. 426-430 - Ricardo Falcón Pérez

, Kazuki Shimada
, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji:
Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection. 431-435 - Tobias Kabzinski, Peter Jax:

Towards Faster Continuous Multi-Channel HRTF Measurements Based On Learning System Models. 436-440 - Bowen Zhi, Dmitry N. Zotkin, Ramani Duraiswami

:
Towards Fast And Convenient End-To-End HRTF Personalization. 441-445 - Mateusz Guzik

, Konrad Kowalczyk
:
Wishart Localization Prior On Spatial Covariance Matrix In Ambisonic Source Separation Using Non-Negative Tensor Factorization. 446-450 - Jiawen Huang, Emmanouil Benetos

, Sebastian Ewert:
Improving Lyrics Alignment Through Joint Pitch Detection. 451-455 - Ilaria Manco, Emmanouil Benetos

, Elio Quinton, György Fazekas:
Learning Music Audio Representations Via Weak Language Supervision. 456-460 - David Giuseppe Badiane, Raffaele Malvermi, Sebastian Gonzalez, Fabio Antonacci, Augusto Sarti:

On the Prediction of the Frequency Response of a Wooden Plate from Its Mechanical Parameters. 461-465 - Bo-Yu Chen, Wei-Han Hsu, Wei-Hsiang Liao, Marco A. Martínez Ramírez, Yuki Mitsufuji, Yi-Hsuan Yang:

Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks. 466-470 - Han Chen, Yan Song, Li-Rong Dai, Ian McLoughlin

, Lin Liu:
Self-Supervised Representation Learning for Unsupervised Anomalous Sound Detection Under Domain Shift. 471-475 - Vasileios Tsouvalas

, Aaqib Saeed
, Tanir Ozcelebi:
Federated Self-Training for Data-Efficient Audio Recognition. 476-480 - Meng Feng, Chieh-Chi Kao, Qingming Tang, Ming Sun, Viktor Rozgic, Spyros Matsoukas, Chao Wang:

Federated Self-Supervised Learning for Acoustic Event Classification. 481-485 - Kwanghee Choi, Martin Kersner, Jacob Morton, Buru Chang:

Temporal Knowledge Distillation for on-device Audio Classification. 486-490 - Ognjen (Oggi) Rudovic, Akanksha Bindal, Vineet Garg

, Pramod Simha, Pranay Dighe, Sachin Kajarekar:
Streaming on-Device Detection of Device Directed Speech from Voice and Touch-Based Invocation. 491-495 - Hiroshi Sawada, Rintaro Ikeshita, Keisuke Kinoshita

, Tomohiro Nakatani:
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined BSS in Reverberant Environments. 496-500 - Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii

:
Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation. 501-505 - Yudong He

, He Wang, Qifeng Chen, Richard Hau Yue So:
Harvesting Partially-Disjoint Time-Frequency Information for Improving Degenerate Unmixing Estimation Technique. 506-510 - Shogo Seki, Hirokazu Kameoka, Li Li:

Investigation And Comparison of Optimization Methods for Variational Autoencoder-Based Underdetermined Multichannel Source Separation. 511-515 - Li Li, Hirokazu Kameoka, Shogo Seki:

HBP: An Efficient Block Permutation Solver Using Hungarian Algorithm and Spectrogram Inpainting for Multichannel Audio Source Separation. 516-520 - Chenxing Li, Yang Wang, Feng Deng, Zhuo Zhang, Xiaorui Wang, Zhongyuan Wang:

EAD-Conformer: a Conformer-Based Encoder-Attention-Decoder-Network for Multi-Task Audio Source Separation. 521-525 - Darius Petermann, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:

The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks. 526-530 - Félix Mathieu, Thomas Courtat, Gaël Richard, Geoffroy Peeters:

Phase Shifted Bedrosian Filterbank: An Interpretable Audio Front-End for Time-Domain Audio Source Separation. 531-535 - Rahil Parikh, Ilya Kavalerov, Carol Y. Espy-Wilson, Shihab A. Shamma:

Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems. 536-540 - Changsheng Quan, Xiaofei Li:

Multi-Channel Narrow-Band Deep Speech Separation with Full-Band Permutation Invariant Training. 541-545 - Cunhang Fan, Zhao Lv

, Shengbing Pei, Mingyue Niu:
Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction. 546-550 - Ebrahim Nemati, Xuhai Xu, Viswam Nathan, Korosh Vatanparvar, Tousif Ahmed

, Md. Mahbubur Rahman, Dan McCaffrey, Jilong Kuang, Alex Gao:
Ubilung: Multi-Modal Passive-Based Lung Health Assessment. 551-555 - Neeraj Kumar Sharma

, Srikanth Raj Chetupalli
, Debarpan Bhattacharya, Debottam Dutta, Pravin Mote, Sriram Ganapathy:
The Second Dicova Challenge: Dataset and Performance Analysis for Diagnosis of Covid-19 Using Acoustics. 556-560 - Xing-Yu Chen, Qiu-Shi Zhu

, Jie Zhang, Li-Rong Dai:
Supervised and Self-Supervised Pretraining Based Covid-19 Detection Using Acoustic Breathing/Cough/Speech Signals. 561-565 - Madhu R. Kamble, Jose Patino, Maria A. Zuluaga, Massimiliano Todisco:

Exploring Auditory Acoustic Features for The Diagnosis of Covid-19. 566-570 - Anton Ratnarajah, Shi-Xiong Zhang, Meng Yu, Zhenyu Tang, Dinesh Manocha, Dong Yu:

Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator. 571-575 - Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari:

Region-to-Region Kernel Interpolation of Acoustic Transfer Function with Directional Weighting. 576-580 - Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets:

Blind Reverberation Time Estimation in Dynamic Acoustic Conditions. 581-585 - Maozhong Fu, Jesper Rindom Jensen

, Yuhan Li
, Mads Græsbøll Christensen
:
Sparse Modeling of The Early Part of Noisy Room Impulse Responses with Sparse Bayesian Learning. 586-590 - Jack Deadman

, Jon Barker:
Improved Simulation of Realistically-Spatialised Simultaneous Speech Using Multi-Camera Analysis in The Chime-5 Dataset. 591-595 - Mattia Papa, Clara Borrelli, Paolo Bestagini, Fabio Antonacci, Augusto Sarti, Stefano Tubaro:

A Data-Driven Approach for Acoustic Parameter Similarity Estimation of Speech Recording. 596-600 - Yudong Zhao, György Fazekas, Mark B. Sandler:

Violinist Identification Using Note-Level Timbre Feature Distributions. 601-605 - Hang Zhao, Chen Zhang, Bilei Zhu, Zejun Ma, Kejun Zhang:

S3T: Self-Supervised Pre-Training with Swin Transformer For Music Classification. 606-610 - Morgan Buisson, Pablo Alonso-Jiménez

, Dmitry Bogdanov:
Ambiguity Modelling with Label Distribution Learning for Music Classification. 611-615 - Xingjian Du

, Ke Chen, Zijie Wang, Bilei Zhu, Zejun Ma:
Bytecover2: Towards Dimensionality Reduction of Latent Embedding for Efficient Cover Song Identification. 616-620 - Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov

:
Tonet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music. 621-625 - Shuai Yu

, Xi Chen, Wei Li:
Hierarchical Graph-Based Neural Network for Singing Melody Extraction. 626-630 - Michel Olvera, Emmanuel Vincent, Gilles Gasso:

On The Impact of Normalization Strategies in Unsupervised Adversarial Domain Adaptation for Acoustic Scene Classification. 631-635 - Tom Denton, Scott Wisdom, John R. Hershey:

Improving Bird Classification with Unsupervised Sound Separation. 636-640 - Francesco Paissan, Alberto Ancilotto, Alessio Brutti

, Elisabetta Farella:
Scalable Neural Architectures for End-to-End Environmental Sound Classification. 641-645 - Ke Chen, Xingjian Du

, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov
:
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection. 646-650 - You Wang, David V. Anderson:

Hybrid Attention-Based Prototypical Networks for Few-Shot Sound Classification. 651-655 - Karn N. Watcharasupat

, Thi Ngoc Tho Nguyen, Woon-Seng Gan
, Shengkui Zhao, Bin Ma:
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression. 656-660 - Ziteng Wang, Yueyue Na, Biao Tian, Qiang Fu:

NN3A: Neural Network Supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications. 661-665 - Jan Franzen, Tim Fingscheidt

:
Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System. 666-670 - Hao Zhang, DeLiang Wang:

Neural Cascade Architecture for Joint Acoustic Echo and Noise Suppression. 671-675 - Santiago Ruiz

, Toon van Waterschoot, Marc Moonen:
Cascade Multi-Channel Noise Reduction and Acoustic Feedback Cancellation. 676-680 - Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian:

Skim: Skipping Memory Lstm for Low-Latency Real-Time Continuous Speech Separation. 681-685 - Aswin Sivaraman

, Scott Wisdom, Hakan Erdogan, John R. Hershey:
Adapting Speech Separation to Real-World Meetings using Mixture Invariant Training. 686-690 - Eisuke Konno, Daisuke Saito, Nobuaki Minematsu:

Quantifying Discriminability between NMF Bases. 691-695 - Hassan Taherian, Ke Tan, DeLiang Wang:

Location-Based Training for Multi-Channel Talker-Independent Speaker Separation. 696-700 - Robin Scheibler:

SDR - Medium Rare with Fast Computations. 701-705 - Hirokazu Kameoka, Shogo Seki, Li Li, Chihiro Watanabe:

Attentionpit: Soft Permutation Invariant Training for Audio Source Separation with Attention Mechanism. 706-710 - Olga Slizovskaia, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:

Locate This, Not that: Class-Conditioned Sound Event DOA Estimation. 711-715 - Thi Ngoc Tho Nguyen, Douglas L. Jones, Karn N. Watcharasupat

, Huy Phan, Woon-Seng Gan
:
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays. 716-720 - Bing Yang, Hong Liu, Xiaofei Li:

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization. 721-725 - Yonggang Hu

, Sharon Gannot:
Closed-Form Single Source Direction-of-Arrival Estimator Using First-Order Relative Harmonic Coefficients. 726-730 - Jianhua Geng, Sifan Wang, Xin Lou:

A Slide-Save Based Framework for Multi-Source DOA Extraction with Closely Spaced Sources. 731-735 - Yu Chen, Bowen Liu

, Zijian Zhang
, Hun-Seok Kim:
An End-to-End Deep Learning Framework For Multiple Audio Source Separation And Localization. 736-740 - Amir Ivry, Israel Cohen, Baruch Berdugo:

Deep Adaptation Control for Acoustic Echo Cancellation. 741-745 - Amir Ivry, Israel Cohen, Baruch Berdugo:

Off-the-Shelf Deep Integration For Residual-Echo Suppression. 746-750 - Chenggang Zhang, Jinjiang Liu, Xueliang Zhang:

A Complex Spectral Mapping with Inplace Convolution Recurrent Neural Networks For Acoustic Echo Cancellation. 751-755 - Hao Zhang, Srivatsan Kandadai, Harsha Rao, Minje Kim, Tarun Pruthi, Trausti T. Kristjansson:

Deep Adaptive Aec: Hybrid of Deep Learning and Adaptive Acoustic Echo Cancellation. 756-760 - Yurii Iotov

, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Dyrholm, Mads Græsbøll Christensen
:
Computationally Efficient Fixed-Filter ANC for Speech Based on Long-Term Prediction for Headphone Applications. 761-765 - Thomas Haubner, Andreas Brendel

, Walter Kellermann:
End-To-End Deep Learning-Based Adaptation Control for Frequency-Domain Adaptive System Identification. 766-770 - Grigoris Bastas, Stefanos Koutoupis, Maximos Kaliakatsos-Papakostas, Vassilis Katsouros, Petros Maragos:

A Few-Sample Strategy for Guitar Tablature Transcription Based on Inharmonicity Analysis and Playability Constraints. 771-775 - Longshen Ou, Ziyi Guo, Emmanouil Benetos

, Jiqing Han, Ye Wang
:
Exploring Transformer's Potential on Automatic Piano Transcription. 776-780 - Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert:

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation. 781-785 - Yu-Hua Chen, Wen-Yi Hsiao, Tsu-Kuang Hsieh, Jyh-Shing Roger Jang, Yi-Hsuan Yang:

Towards Automatic Transcription of Polyphonic Electric Guitar Music: A New Dataset and a Multi-Loss Transformer Model. 786-790 - Xiaoxue Gao

, Chitralekha Gupta, Haizhou Li:
Genre-Conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. 791-795 - Sangeun Kum, Jongpil Lee, Keunhyoung Luke Kim, Taehyoung Kim, Juhan Nam

:
Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music. 796-800 - Noriyuki Tonami, Keisuke Imoto, Ryotaro Nagase, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita:

Sound Event Detection Guided by Semantic Contexts of Scenes. 801-805 - Keigo Wakayama

, Shoichiro Saito:
CNN-Transformer with Self-Attention Network for Sound Event Detection. 806-810 - Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:

A Mutual Learning Framework for Few-Shot Sound Event Detection. 811-815 - Youde Liu, Jian Guan, Qiaoxi Zhu

, Wenwu Wang:
Anomalous Sound Detection Using Spectral-Temporal Information Fusion. 816-820 - Yadong Guan, Jiabin Xue, Guibin Zheng, Jiqing Han:

Sparse Self-Attention for Semi-Supervised Sound Event Detection. 821-825 - Hayato Endo, Hiromitsu Nishizaki:

Peer Collaborative Learning for Polyphonic Sound Event Detection. 826-830 - Srikanth Korse

, Nicola Pia, Kishan Gupta, Guillaume Fuchs
:
PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech. 831-835 - Kishan Gupta, Srikanth Korse

, Bernd Edler, Guillaume Fuchs
:
A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain. 836-840 - Eloi Moliner, Vesa Välimäki:

A Two-Stage U-Net for High-Fidelity Denoising of Historical Recordings. 841-845 - Marvin Borsdorf

, Kevin Scheck, Haizhou Li, Tanja Schultz
:
Experts Versus All-Rounders: Target Language Extraction for Multiple Target Languages. 846-850 - Guangwei Li, Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:

Category-Adapted Sound Event Enhancement with Weakly Labeled Data. 851-855 - Rubén M. Clavería, Simon J. Godsill:

Sequential MCMC Methods for Audio Signal Enhancement. 856-860 - Tejas Jayashankar, Thilo Köhler, Kaustubh Kalgaonkar, Zhiping Xiu, Jilong Wu, Ju Lin, Prabhav Agrawal, Qing He:

Architecture for Variable Bitrate Neural Speech Codec with Configurable Computation Complexity. 861-865 - Xue Jiang, Xiulian Peng, Chengyu Zheng, Huaying Xue, Yuan Zhang, Yan Lu:

End-to-End Neural Speech Coding for Real-Time Communications. 866-870 - Seungmin Shin

, Joon Byun, Youngcheol Park, Jongmo Sung, Seungkwon Beack:
Deep Neural Network (DNN) Audio Coder Using A Perceptually Improved Training Method. 871-875 - Chanwoo Lee, Hyungseob Lim, Jihyun Lee, Inseon Jang, Hong-Goo Kang:

Progressive Multi-Stage Neural Audio Coding with Guided References. 876-880 - Ehab A. AlBadawy, Andrew Gibiansky, Qing He, Jilong Wu, Ming-Ching Chang, Siwei Lyu:

Vocbench: A Neural Vocoder Benchmark for Speech Synthesis. 881-885 - Chandan K. A. Reddy, Vishak Gopal, Ross Cutler:

Dnsmos P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors. 886-890 - Pranay Manocha

, Zeyu Jin, Adam Finkelstein:
SQAPP: No-Reference Speech Quality Assessment Via Pairwise Preference. 891-895 - Wen-Chin Huang, Erica Cooper, Junichi Yamagishi, Tomoki Toda:

LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech. 896-900 - Marju Purin, Sten Sootla, Mateja Sponza, Ando Saabas, Ross Cutler:

AECMOS: A Speech Quality Assessment Metric for Echo Impairment. 901-905 - Miao Liu, Jing Wang, Shicong Li, Fei Xiang, Yue Yao, Lidong Yang:

MOS Predictor for Synthetic Speech with I-Vector Inputs. 906-910 - Daan Ratering, W. Bastiaan Kleijn

, Jean Gonzalez Silva, Riccardo M. G. Ferrari
:
Wave-Domain Approach for Cancelling Noise Entering Open Windows. 911-915 - Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach

:
On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. 916-920 - Takuya Yoshioka, Xiaofei Wang, Dongmei Wang:

Picknet: Real-Time Channel Selection for Ad Hoc Microphone Arrays. 921-925 - Jarred Barber, Yifeng Fan, Tao Zhang:

End-To-End Alexa Device Arbitration. 926-930 - Natsuki Ueno, Nobutaka Ono:

Instantaneous Linear Dimensionality Reduction of Multichannel Time-Series Signal for Array Signal Processing. 931-935 - Srdan Kitic, Jérôme Daniel:

Generalized Time Domain Velocity Vector. 936-940 - Masaya Kawamura, Tomohiko Nakamura

, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo:
Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds. 941-945 - Yashish M. Siriwardena, Guilhem Marion, Shihab A. Shamma:

The Mirrornet : Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction. 946-950 - Hao-Wen Dong, Cong Zhou, Taylor Berg-Kirkpatrick, Julian J. McAuley

:
Deep Performer: Score-to-Audio Music Performance Synthesis. 951-955 - Chien-Feng Liao, Jen-Yu Liu, Yi-Hsuan Yang:

KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE Using Mel-Spectrograms. 956-960 - Jihyun Lee, Hyungseob Lim, Chanwoo Lee, Inseon Jang, Hong-Goo Kang:

Adversarial Audio Synthesis Using a Harmonic-Percussive Discriminator. 961-965 - Jing Yang, Chulhong Min, Akhil Mathur, Fahim Kawsar:

SleepGAN: Towards Personalized Sleep Therapy Music. 966-970 - Xuenan Xu, Mengyue Wu, Kai Yu:

Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition. 971-975 - Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel:

Audioclip: Extending Clip to Image, Text and Audio. 976-980 - Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu:

Can Audio Captions Be Evaluated With Image Caption Metrics? 981-985 - Pablo M. Delgado, Jürgen Herre:

A Data-Driven Cognitive Salience Model for Objective Perceptual Audio Quality Assessment. 986-990 - Ryosuke Sawata, Yosuke Kashiwagi, Shusuke Takahashi:

Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models. 991-995 - Sebastian Braun, Hannes Gamper:

Effect of Noise Suppression Losses on Speech Distortion and ASR Performance. 996-1000 - Alix Jeannerot

, Niels de Koeijer, Pablo Martínez-Nuevo
, Martin Bo Møller
, Jakob Dyreby, Paolo Prandoni:
Increasing Loudness in Audio Signals: A Perceptually Motivated Approach to Preserve Audio Quality. 1001-1005 - Sebastian J. Schlecht, Leonardo Fierro, Vesa Välimäki, Juha Backman:

Audio Peak Reduction Using a Synced allpass Filter. 1006-1010 - Tomoro Tanaka, Kohei Yatabe

, Masahiro Yasuda, Yasuhiro Oikawa:
APPLADE: Adjustable Plug-and-Play Audio Declipper Combining DNN with Sparse Optimization. 1011-1015 - Daniel Tompkins, Kshitiz Kumar, Jian Wu:

Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: an Ablation Study. 1016-1020 - Janek Ebbers, Reinhold Haeb-Umbach

, Romain Serizel:
Threshold Independent Evaluation of Sound Event Detection Scores. 1021-1025 - Seyed M. R. Modaresi

, Aomar Osmani, Mohammadreza Razzazi
, Abdelghani Chibani:
Multimodal Evaluation Method for Sound Event Detection. 1026-1030 - Francesca Ronchini, Romain Serizel:

A Benchmark of State-of-the-Art Sound Event Detection Systems Evaluated on Synthetic Soundscapes. 1031-1035 - Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Ha-Jin Yu:

Attentive Max Feature Map and Joint Training for Acoustic Scene Classification. 1036-1040 - Hu Hu

, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. 4041-4045 - Christian Bergler, Manuel Schmitt, Andreas K. Maier, Rachael Xi Cheng, Volker Barth, Elmar Nöth:

ORCA-PARTY: An Automatic Killer Whale Sound Type Separation Toolkit Using Deep Learning. 1046-1050 - Mirco Pezzoli

, Maximo Cobos
, Fabio Antonacci, Augusto Sarti:
Sparsity-Based Sound Field Separation in the Spherical Harmonics Domain. 1051-1055 - Kazuyuki Arikawa, Shoichi Koyama, Hiroshi Saruwatari:

Spatial Active Noise Control Based on Individual Kernel Interpolation of Primary and Secondary Sound Fields. 1056-1060 - Sipei Zhao

, Ian S. Burnett
:
Time-Domain Acoustic Contrast Control with A Spatial Uniformity Constraint for Personal Audio Systems. 1061-1065 - Liming Shi

, Guoli Ping, Xiaoxiang Shen, Mads Græsbøll Christensen
:
Generation of Personal Sound Fields in Reverberant Environments Using Interframe Correlation. 1066-1070 - Jesper Brunnström

, Shoichi Koyama, Marc Moonen:
Variable Span Trade-Off Filter for Sound Zone Control with Kernel Interpolation Weighting. 1071-1075 - Nara Hahn, Frank Schultz, Sascha Spors:

Time Domain Radial Filter Design for Spherical Waves. 1076-1080 - Junxiao Sun, Ke Zhang, Shuyi Niu, Yan Zhang, Youyong Kong:

Feature Space Message Passing Network for Medical Image Semantic Segmentation. 1081-1085 - Yixin Wang, Zhe Xu

, Jiang Tian, Jie Luo, Zhongchao Shi, Yang Zhang, Jianping Fan, Zhiqiang He:
Cross-Domain Few-Shot Learning for Rare-Disease Skin Lesion Segmentation. 1086-1090 - Chen Li

, Wei Chen, Xin Luo, Yulin He, Yusong Tan:
Adaptive Pseudo Labeling for Source-Free Domain Adaptation in Medical Image Segmentation. 1091-1095 - Abdullah F. Al-Battal, Imanuel R. Lerman, Truong Q. Nguyen:

Object Detection and Tracking in Ultrasound Scans Using an Optical Flow and Semantic Segmentation Framework Based on Convolutional Neural Networks. 1096-1100 - Dachuan Shi, Ruiyang Liu, Linmi Tao, Chun Yuan:

Heuristic Dropout: An Efficient Regularization Method for Medical Image Segmentation Models. 1101-1105 - Paria Jeihouni, Omid Dehzangi, Annahita Amireskandari, Ali Dabouei, Ali Rezai, Nasser M. Nasrabadi:

Superresolution and Segmentation of OCT Scans Using Multi-Stage Adversarial Guided Attention Training. 1106-1110 - Yusuke Akamatsu

, Yoshifumi Onishi
, Hitoshi Imaoka:
Heart Rate and Oxygen Saturation Estimation from Facial Video with Multimodal Physiological Data Generation. 1111-1115 - Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang

, Yu Tsao:
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement. 1116-1120 - Sawon Pratiher, Apoorva Srivastava, Yedla Bindu Priyatha, Nirmalya Ghosh, Amit Patra:

A Dilated Residual Vision Transformer for Atrial Fibrillation Detection from Stacked Time-Frequency ECG Representations. 1121-1125 - Crystal T. Wei, Ming-En Hsieh, Chien-Liang Liu

, Vincent S. Tseng:
Contrastive Heartbeats: Contrastive Learning for Self-Supervised ECG Representation and Phenotyping. 1126-1130 - Omid Dehzangi, Paria Jeihouni, Jad Ramadan, Victor S. Finomore, Nasser M. Nasrabadi, Ali Rezai:

Ubiquitous Physiological Prediction of SUD Patients' Wellness State Using Memory-Based Convolutional Models. 1131-1135 - Mu Yang, Darpit Dave, Madhav Erraguntla, Gerard L. Coté, Ricardo Gutierrez-Osuna:

Joint Hypoglycemia Prediction and Glucose Forecasting via Deep Multi-Task Learning. 1136-1140 - Siddharth Subramani, Achuth Rao M. V, Anwesha Roy, Prasanna Suresh Hegde, Prasanta Kumar Ghosh:

SegNet-Based Deep Representation Learning for Dysphagia Classification. 1141-1145 - Francois Buet-Golfouse, Hans Roggeman, Islam Utyagulov:

Robust Collaborative Learning for Sequence Modelling. 1146-1150 - Jen-Cheng Hou, Aileen McGonigal

, Fabrice Bartolomei, Monique Thonnat:
A Self-Supervised Pre-Training Framework for Vision-Based Seizure Classification. 1151-1155 - Huaiwen Luo, Lu Zhang, Lianyu Zhou, Xu Lin

, Zehuai Zhang, Mingjiang Wang:
Design of Real-Time System Based on Machine Learning for Snoring and OSA Detection. 1156-1160 - Kaan Sel, Noah Huerta, Michael S. Sacks, Roozbeh Jafari:

Parametric Modeling of Human Wrist for Bioimpedance-Based Physiological Sensing. 1161-1165 - José Fernando Adrán Otero, Oscar Soláns Caballer, Pere Martí-Puig, Zhe Sun, Toshihisa Tanaka, Jordi Solé-Casals:

Preliminary Results on the Generation of Artificial Handwriting Data Using a Decomposition-Recombination Strategy. 1166-1170 - Suguru Kanoga

, Takayuki Hoshino
, Mitsunori Tada:
A Style Transfer Mapping and Fine-Tuning Subject Transfer Framework Using Convolutional Neural Networks for Surface Electromyogram Pattern Recognition. 1171-1175 - Chencheng Guo, Hui Qian, Baoling Hong:

Feature-Based Sensing Matrix Design for Analog to Information Converters. 1176-1180 - K. M. Naimul Hassan, Md. Shamiul Alam Hridoy, Naima Tasnim

, Atia Faria Chowdhury, Tanvir Alam Roni, Sheikh Tabrez, Arik Subhana, Celia Shahnaz
:
ALSNet: A Dilated 1-D CNN for Identifying ALS from Raw EMG Signal. 1181-1185 - Bilal Ahmad

, Liana Khamidullina, Alexey Alexandrovich Korobkov, Alla Manina
, Jens Haueisen, Martin Haardt:
Joint Model Order Estimation for Multiple Tensors with A Coupled Mode and Applications to the Joint Decomposition of EEG, MEG Magnetometer, and Gradiometer Tensors. 1186-1190 - Zhikang Zhang

, Jonathan Zhao, Fengbo Ren:
An Experimental Study on Transferring Data-Driven Image Compressive Sensing to Bioelectric Signals. 1191-1195 - Elahe Rahimian, Soheil Zabihi, Amir Asif

, Dario Farina, Seyed Farokh Atashzar
, Arash Mohammadi:
Hand Gesture Recognition Using Temporal Convolutions and Attention Mechanism. 1196-1200 - Bo Fang

, Junxin Chen
, Wei Wang
, Yicong Zhou:
Combining Multiple Style Transfer Networks and Transfer Learning For LGE-CMR Segmentation. 1201-1205 - Jaeyoung Huh

, Shujaat Khan
, Jong Chul Ye:
Multi-Domain Unpaired Ultrasound Image Artifact Removal Using a Single Convolutional Neural Network. 1206-1210 - Xiao Li, Huizhi Liang, Sidhartha Nagala, Jane Chen:

Improving Ultrasound Image Classification with Local Texture Quantisation. 1211-1215 - Tristan S. W. Stevens

, Nishith Chennakeshava, Frederik J. de Bruijn, Martin Pekar, Ruud J. G. van Sloun
:
Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement Learning. 1216-1220 - Nishith Chennakeshava, Tristan S. W. Stevens

, Frederik J. de Bruijn, Andrew Hancock, Martin Pekar, Yonina C. Eldar, Massimo Mischi
, Ruud J. G. van Sloun
:
Deep Proximal Unfolding For Image Recovery from Under-Sampled Channel Data in Intravascular Ultrasound. 1221-1225 - Gongpeng Cao, Yiping Wang, Manli Zhang, Jing Zhang, Guixia Kang, Xin Xu:

Multiview Long-Short Spatial Contrastive Learning For 3D Medical Image Analysis. 1226-1230 - Khuong Vo

, Manoj Vishwanath, Ramesh Srinivasan, Nikil D. Dutt, Hung Cao:
Composing Graphical Models with Generative Adversarial Networks for EEG Signal Modeling. 1231-1235 - David Bethge, Philipp Hallgarten, Tobias Grosse-Puppendahl, Mohamed Kari

, Ralf Mikut
, Albrecht Schmidt, Ozan Özdenizci
:
Domain-Invariant Representation Learning from EEG with Private Encoders. 1236-1240 - Guangyi Zhang, Ali Etemad:

Holistic Semi-Supervised Approaches for EEG Representation Learning. 1241-1245 - Pankaj Pandey, Gulshan Sharma

, Krishna P. Miyapuram, Ramanathan Subramanian
, Derek Lomas:
Music Identification Using Brain Responses to Initial Snippets. 1246-1250 - Wei Xu, Jing Wang, Ziyu Jia, Zhiqing Hong, Yunze Li, Youfang Lin

:
Multi-Level Spatial-Temporal Adaptation Network for Motor Imagery Classification. 1251-1255 - Lies Bollens

, Tom Francart, Hugo Van hamme
:
Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders. 1256-1260 - Xinru Dai, Tai Ma, Haibin Cai, Ying Wen:

Unsupervised Hierarchical Translation-Based Model for Multi-Modal Medical Image Registration. 1261-1265 - Zailiang Chen, Hailei Lan, Yongan Meng, Yuchen Xiong, Jing Luo, Hailan Shen:

FAZ-BV: A Diabetic Macular Ischemia Grading Framework Combining Faz Attention Network and Blood Vessel Enhancement Filters. 1266-1270 - Lijuan Lu, Shun Miao, Ling Ye:

Fracture Detection and Localization in Chest X-Rays Using Semi-Supervised Learning with Dynamic Sharpening. 1271-1275 - Ryan Zhang

, Jiadai Zhu, Stephen Yang
, Mahdi S. Hosseini, Angelo Genovese
, Lina Chen, Corwyn Rowsell, Savvas Damaskinos, Sonal Varma, Konstantinos N. Plataniotis:
Histokt: Cross Knowledge Transfer in Computational Pathology. 1276-1280 - Giovana Augusta Benvenuto

, Marilaine Colnago, Wallace Casaca:
Unsupervised Deep Learning Network for Deformable Fundus Image Registration. 1281-1285 - Huijuan Yang

, Aaron S. Coyner, Feri Guretno
, Ivan Ho Mien, Chuan Sheng Foo, J. Peter Campbell, Susan Ostmo, Michael F. Chiang
, Pavitra Krishnaswamy:
A Minimally Supervised Approach for Medical Image Quality Assessment in Domain Shift Settings. 1286-1290 - Yanbin He, Zhiyang Lu, Jun Wang

, Jun Shi:
A Channel Attention Based MLP-Mixer Network for Motor Imagery Decoding With EEG. 1291-1295 - Miguel Angrick

, Maarten C. Ottenhoff, Lorenz Diener, Darius Ivucic, Gabriel Ivucic, Sophocles Goulis, Albert J. Colon, G. Louis Wagner, Dean J. Krusienski
, Pieter Leonard Kubben, Tanja Schultz
, Christian Herff
:
Towards Closed-Loop Speech Synthesis from Stereotactic EEG: A Unit Selection Approach. 1296-1300 - Jaeun Phyo, Wonjun Ko

, Eunjin Jeon, Heung-Il Suk
:
Enhancing Contextual Encoding With Stage-Confusion and Stage-Transition Estimation for EEG-Based Sleep Staging. 1301-1305 - Hadi Habibzadeh, Kevin J. Long, Ally E. Atkins, Daphney-Stavroula Zois

, James J. S. Norton:
Improving BCI-based Color Vision Assessment Using Gaussian Process Regression. 1306-1310 - Shuji Komeiji, Kai Shigemi, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, Koichi Shinoda, Toshihisa Tanaka:

Transformer-Based Estimation of Spoken Sentences Using Electrocorticography. 1311-1315 - Marzieh Ajirak, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:

Boost Ensemble Learning for Classification of CTG SIGNALS. 1316-1320 - Yifan Wang, Ying Lan:

Multi-View Learning Based on Non-Redundant Fusion for Icu Patient Mortality Prediction. 1321-1325 - Tong Chen, Guanchao Feng, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:

Improving Phase-Rectified Signal Averaging for Fetal Heart Rate Analysis. 1326-1330 - Liu Yang

, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:
Unsupervised Clustering and Analysis of Contraction-Dependent Fetal Heart Rate Segments. 1331-1335 - Orestis Apostolou, Vasileios S. Charisis, Georgios K. Apostolidis, Leontios J. Hadjileontiadis:

A Method for Detecting Coronary Artery Disease using Noisy Ultrashort Electrocardiogram Recordings. 1336-1340 - Nele Sophie Brügge

, Jan Graßhoff
, Arne Weigenand, Philipp Rostalski:
Multi-Task Gaussian Process Regression for the Detection of Sleep Cycles in Premature Infants. 1341-1345 - Silpa Babu, Seyedehsara Nayer, Sajan Goud Lingala, Namrata Vaswani

:
Fast Low Rank Column-Wise Compressive Sensing For Accelerated Dynamic MRI. 1346-1350 - Sizhuo Liu, Philip Schniter, Rizwan Ahmad:

MRI Recovery with a Self-Calibrated Denoiser. 1351-1355 - Wanqi Zhang, Lulu Wang, Wei Chen, Yuanyuan Jia, Zhongshi He, Jinglong Du:

3d Cross-Scale Feature Transformer Network for Brain Mr Image Super-Resolution. 1356-1360 - Harsh Singh, Ognjen Arandjelovic

:
Data Efficient Support Vector Machine Training Using the Minimum Description Length Principle. 1361-1365 - Yuanpin Zhou

, Yao Lu:
Multiple Instance Learning with Task-Specific Multi-Level Features for Weakly Annotated Histopathological Image Classification. 1366-1370 - Guang Li, Ren Togo, Takahiro Ogawa, Miki Haseyama:

Self-Knowledge Distillation based Self-Supervised Learning for Covid-19 Detection from Chest X-Ray Images. 1371-1375 - Rui Xu, Yufeng Wang, Xinchen Ye, Pengcheng Wu, Yen-Wei Chen, Fangyi Xu, Wenchao Zhu, Chao Chen, Yong Zhou, Hongjie Hu, Xiaofeng Qu, Shoji Kido, Noriyuki Tomiyama:

Pixel-Level and Affinity-Level Knowledge Distillation for Unsupervised Segmentation of Covid-19 Lesions. 1376-1380 - Nastaran Enshaei, Moezedin Javad Rafiee, Arash Mohammadi, Farnoosh Naderkhani:

Data Shapley Value for Handling Noisy Labels: An Application in Screening Covid-19 Pneumonia from Chest CT Scans. 1381-1385 - Xiongbiao Luo:

Accurate Multiscale Selective Fusion of CT and Video Images for Real-Time Endoscopic Camera 3D Tracking in Robotic Surgery. 1386-1390 - Ruixiang Geng, Qing Liu

, Shuo Feng, Yixiong Liang:
Learning Deep Pathological Features for WSI-Level Cervical Cancer Grading. 1391-1395 - Bowen Xu, Wenqiang Zhang:

Selective Scale Cascade Attention Network for Breast Cancer Histopathology Image Classification. 1396-1400 - Archishman Biswas, Hernando C. Ombao

:
Frequency-Specific Non-Linear Granger Causality in a Network of Brain Signals. 1401-1405 - Kosuke Fukumori, Noboru Yoshida, Hidenori Sugano, Madoka Nakajima, Toshihisa Tanaka:

Epileptic Spike Detection by Recurrent Neural Networks with Self-Attention Mechanism. 1406-1410 - Jian Yin, Yuan Wang

:
Topological Correlation of Brain Signals. 1411-1415 - Bahman Abdi-Sargezeh, Antonio Valentín, Gonzalo Alarcón, Saeid Sanei:

Online Detection of Scalp-Invisible Mesial-Temporal Brain Interictal Epileptiform Discharges from EEG. 1416-1420 - Yulu Wang, Yiwen Sun, Lei Fang, Changshui Zhang:

Leveraging Sparse Coding for EEG Based Emotion Recognition in Shooting. 1421-1425 - Weilai Li, Lanfeng Zhong, Weixi Xiang, Tongzhou Kang, Dakun Lai:

A Novel Unsupervised Autoencoder-Based HFOs Detector in Intracranial EEG Signals. 1426-1430 - Fei Ye, Zhiqiang Wang, Sheng Zhu, Xuanya Li, Kai Hu:

A Novel Convolutional Neural Network Based on Adaptive Multi-Scale Aggregation and Boundary-Aware for Lateral Ventricle Segmentation on MR images. 1431-1435 - Wentao Liu, Huihua Yang, Tong Tian, Xipeng Pan, Weijin Xu

:
Multiscale Attention Aggregation Network for 2D Vessel Segmentation. 1436-1440 - Xinxin Shan, Tai Ma, Anqi Gu, Haibin Cai, Ying Wen:

TCRNet: Make Transformer, CNN and RNN Complement Each Other. 1441-1445 - Ke Zheng

, Junhai Xu, Jianguo Wei
:
Double Noise Mean Teacher Self-Ensembling Model for Semi-Supervised Tumor Segmentation. 1446-1450 - Siming Yuan, Qing Liu

, Shenghui Liao, Fuchang Han, Haitao Wei, Yingqi Zhang:
Rethinking Computer-Aided Pelvis Segmentation. 1451-1455 - Hyunwoo Yu

, Jae-hun Shim, Jaeho Kwak, Jou Won Song, Suk-Ju Kang:
Vision Transformer-Based Retina Vessel Segmentation with Deep Adaptive Gamma Correction. 1456-1460 - Yuan Wang

, Moo K. Chung, Julius Fridriksson:
Spectral Permutation Test on Persistence Diagrams. 1461-1465 - Isabell Lehmann

, Evrim Acar, Tanuj Hasija, Mohammad A. B. S. Akhonda, Vince D. Calhoun
, Peter J. Schreier, Tülay Adali:
Multi-Task fMRI Data Fusion Using IVA and PARAFAC2. 1466-1470 - Hanlu Yang, Mohammad A. B. S. Akhonda, Fateme Ghayem, Qunfang Long, Vince D. Calhoun

, Tülay Adali:
Independent Vector Analysis Based Subgroup Identification from Multisubject fMRI Data. 1471-1475 - Damian Pascual, Béni Egressy, Nicolas Affolter, Yiming Cai, Oliver Richter, Roger Wattenhofer:

Improving Brain Decoding Methods and Evaluation. 1476-1480 - Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jerry L. Prince, Jangwon Kim, Georges El Fakhri, Jonghye Woo:

Cmri2spec: Cine MRI Sequence to Spectrogram Synthesis via A Pairwise Heterogeneous Translator. 1481-1485 - Wenhan Wang, Youyong Kong, Zhenghua Hou, Chunfeng Yang, Yonggui Yuan:

Spatio-Temporal Attention Graph Convolution Network for Functional Connectome Classification. 1486-1490 - Avrajit Ghosh, Michael T. McCann, Saiprasad Ravishankar:

Bilevel Learning of ℓ1 Regularizers with Closed-Form Gradients (BLORC). 1491-1495 - V. S. Unni, Ruturaj G. Gavaskar, Kunal N. Chaudhury:

Multiband Image Fusion with Controllable Error Guarantees. 1496-1500 - Zhuojie Huang

, Shuping Zhao, Lunke Fei, Jigang Wu:
Weighted Graph Embedded Low-Rank Projection Learning for Feature Extraction. 1501-1505 - Vasiliki Kouni

, Georgios Paraskevopoulos, Holger Rauhut, George C. Alexandropoulos:
ADMM-DAD Net: A Deep Unfolding Network for Analysis Compressed Sensing. 1506-1510 - Alexander Lin, Andrew H. Song, Berkin Bilgic

, Demba E. Ba:
High-Dimensional Sparse Bayesian Learning without Covariance Matrices. 1511-1515 - Baoshun Shi

, Yuxin Wang, Qiusheng Lian:
A Trainable Bounded Denoiser Using Double Tight Frame Network for Snapshot Compressive Imaging. 1516-1520 - Seobin Park, Tae Hyun Kim:

Progressive Image Super-Resolution via Neural Differential Equation. 1521-1525 - Yuhui Quan, Xinran Qin, Mingqin Chen, Yan Huang:

High-Quality Self-Supervised Snapshot Hyperspectral Imaging. 1526-1530 - Abderrahim Halimi

, Jakeoung Koo
, Robert A. Lamb, Gerald S. Buller, Steve McLaughlin
:
Robust Bayesian Reconstruction of Multispectral Single-Photon 3D Lidar Data with Non-Uniform Background. 1531-1535 - Quentin Febvre, Ronan Fablet, Julien Le Sommer, Clément Ubelmann:

Joint Calibration and Mapping of Satellite Altimetry Data Using Trainable Variational Models. 1536-1540 - Michalis Giannopoulos, Grigorios Tsagkatakis, Panagiotis Tsakalides:

4D Convolutional Neural Networks for Multi-Spectral and Multi-Temporal Remote Sensing Data Classification. 1541-1545 - Cheick T. Cissé

, Ahed Alboody
, Matthieu Puigt
, Gilles Roussel, Vincent Vantrepotte, Cédric Jamet, Trung-Kien Tran:
A New Deep Learning Method for Multispectral Image Time Series Completion Using Hyperspectral Data. 1546-1550 - Xinyi Wei, Hans Van Gorp

, Lizeth Gonzalez-Carabarin, Daniel Freedman, Yonina C. Eldar, Ruud J. G. van Sloun
:
Image Denoising with Deep Unfolding And Normalizing Flows. 1551-1555 - Rohit Ranade, Yangwen Liang, Shuangquan Wang, Dongwoon Bai, Jungwon Lee:

3D Texture Super Resolution via the Rendering Loss. 1556-1560 - Changhun Sung, Byungdeok Kim:

Bundle ICP with Virtual Depth for Hand-Held 3d Scanner. 1561-1565 - Julián Tachella, Michael P. Sheehan, Mike E. Davies:

Sketched RT3D: How to Reconstruct Billions of Photons Per Second. 1566-1570 - Naveen Kuruba, Neel Badadare, Vikram Narayan, Satish Putta:

A Generic Method to Estimate Camera Extrinsic Parameters. 1571-1575 - Yash Sanghvi, Abhiram Gnanasambandan, Stanley H. Chan:

Photon-Limited Deblurring Using Algorithm Unrolling. 1576-1580 - Wenpeng Xing, Jie Chen:

NEX+: Novel View Synthesis with Neural Regularisation Over Multi-Plane Images. 1581-1585 - Daniel Nicholls

, Alex W. Robinson
, Jack Wells
, Amirafshar Moshtaghpour
, Mounib Bahri
, Angus I. Kirkland, Nigel D. Browning:
Compressive Scanning Transmission Electron Microscopy. 1586-1590 - Simon Welker

, Tal Peer, Henry N. Chapman
, Timo Gerkmann
:
Deep Iterative Phase Retrieval for Ptychography. 1591-1595 - Vinayak Killedar, Chandra Sekhar Seelamantula:

Compressive Phase Retrieval Based On Sparse Latent Generative Priors. 1596-1600 - Abdulrahman M. Alanazi, Singanallur V. Venkatakrishnan, Hector J. Santos-Villalobos

, Gregery T. Buzzard, Charles A. Bouman:
Model-Based Reconstruction for Collimated Beam Ultrasound Systems. 1601-1605 - Tim Straubinger, Robert Xiao, Helge Rhodin:

Learned Acoustic Reconstruction Using Synthetic Aperture Focusing. 1606-1610 - Guanze Liu, Bo Xu

, Han Huang, Cheng Lu, Yandong Guo:
SDETR: Attention-Guided Salient Object Detection with Transformer. 1611-1615 - Kristian Fischer, Markus Hofbauer, Christopher B. Kuhn, Eckehard G. Steinbach

, André Kaup:
Evaluation of Video Coding for Machines without Ground Truth. 1616-1620 - Thuc Nguyen Huu, Vinh Van Duong, Jonghoon Yim, Byeungwoo Jeon

:
Raw Plenoptic Video Coding Under Hexagonal Lattice Resolution of Motion Vectors. 1621-1624 - Kianoush Jafari, Alireza Aminlou, Miska M. Hannuksela:

Comparison of Boundary Artifact Removal Methods in Coding of Generalized Cubemap Projection Using VVC. 1625-1629 - Shen Wang, Yibing Fu, Chen Zhu, Li Song, Wenjun Zhang:

Low-Complexity Multi-Model CNN in-Loop Filter for AVS3. 1630-1634 - Junyan Huo, Yu Sun, Haixin Wang, Shuai Wan, Fuzheng Yang, Ming Li:

Unified Matrix Coding for NN Originated MIP in H.266/VVC. 1635-1639 - Yuanyuan Xu

, Taoyu Yang, Zengjie Tan, Haolun Lan:
FOV-Based Coding Optimization for 360-Degree Virtual Reality Videos. 1640-1644 - Jian Wang, Xinyue Li

, Wei Song
, Zhichao Zhang
, Weiqi Guo:
Multi-Hierarchy Proxy Structure for Deep Metric Learning. 1645-1649 - Michail Kaseris, Ioannis Mademlis

, Ioannis Pitas:
Exploiting Caption Diversity for Unsupervised Video Summarization. 1650-1654 - Wanqian Zhang, Dayan Wu, Chule Yang, Bo Li

, Weiping Wang
:
Clustering and Separating Similarities for Deep Unsupervised Hashing. 1655-1659 - Junying Huang, Fan Chen, Keze Wang, Liang Lin, Dongyu Zhang:

Enhancing Prototypical Few-Shot Learning By Leveraging The Local-Level Strategy. 1660-1664 - Chao Zhou

, Miguel R. D. Rodrigues:
Blind Unmixing Using A Double Deep Image Prior. 1665-1669 - Yi Liu, Yanjie Liang, Qiangqiang Wu, Liming Zhang, Hanzi Wang:

A New Framework for Multiple Deep Correlation Filters Based Object Tracking. 1670-1674 - Bo-Hao Chen, Hsiang-Yin Cheng, Jia-Li Yin:

Adaptive Actor-Critic Bilateral Filter. 1675-1679 - Niklas Kämper, Joachim Weickert:

Domain Decomposition Algorithms for Real-Time Homogeneous Diffusion Inpainting in 4K. 1680-1684 - Michiaki Tatsubori, Takao Moriyama, Tatsuya Ishikawa, Paolo Fraccaro, Anne Jones, Blair Edwards, Julian Kuehnert, Sekou L. Remy:

Deep Temporal Interpolation of Radar-Based Precipitation. 1685-1689 - Zikai Sun, Thierry Blu:

A Nonlinear Steerable Complex Wavelet Decomposition of Images. 1690-1694 - Xiang Cao

, Haibo Shen, Liangqi Zhang, Yihao Luo, Tianjiang Wang:
Kernel Estimation Network for Blind Super-Resolution. 1695-1699 - Yixiong Zhang, Zhipeng Su, Feng Qi, Jianyang Zhou, Xiao-Ping Zhang:

Terahertz Image Restoration Benchmarking Dataset. 1700-1704 - Xingrun Xing, Yalong Jiang, Baochang Zhang, Wenrui Ding, Yangguang Li, Hongguang Li, Huan Peng:

Binary Dense Predictors for Human Pose Estimation Based on Dynamic Thresholds and Filtering. 1705-1709 - Haidong Zhu, Zhaoheng Zheng, Mohammad Soleymani, Ram Nevatia:

Self-Supervised Learning for Sentiment Analysis via Image-Text Matching. 1710-1714 - Wei-Yu Lee

, Jheng-Yu Wang, Yu-Chiang Frank Wang:
Domain-Agnostic Meta-Learning for Cross-Domain Few-Shot Classification. 1715-1719 - Dahyun Kim, Sunjae Yoon

, Ji Woo Hong, Chang D. Yoo:
Semantic Association Network for Video Corpus Moment Retrieval. 1720-1724 - Nida Itrat Abbasi, Siyang Song, Hatice Gunes:

Statistical, Spectral and Graph Representations for Video-Based Facial Expression Recognition in Children. 1725-1729 - Nakyeong Yang, Taegwan Kang, Kyomin Jung:

Deriving Explainable Discriminative Attributes Using Confusion About Counterfactual Class. 1730-1734 - Chenghu Du

, Feng Yu
, Minghua Jiang, Yaxin Zhao, Xiong Wei, Tao Peng, Xinrong Hu:
Realistic Monocular-To-3d Virtual Try-On Via Multi-Scale Characteristics Capture. 1735-1739 - Ehsan Pajouheshgar, Tong Zhang, Sabine Süsstrunk:

Optimizing Latent Space Directions for Gan-Based Local Image Editing. 1740-1744 - Jingning Xu, Benlai Tang, Mingjie Wang, Siyuan Bian, Wenyi Guo, Xiang Yin, Zejun Ma:

Towards Using Clothes Style Transfer for Scenario-Aware Person Video Generation. 1745-1749 - Somi Jeong, Jiyoung Lee

, Kwanghoon Sohn:
Multi-Domain Unsupervised Image-to-Image Translation with Appearance Adaptive Convolution. 1750-1754 - Yifan Yuan, Siteng Ma, Junping Zhang:

VR-FAM: Variance-Reduced Encoder with Nonlinear Transformation for Facial Attribute Manipulation. 1755-1759 - George Eskandar, Mohamed Abdelsamad, Karim Armanious, Shuai Zhang, Bin Yang:

Wavelet-Based Unsupervised Label-to-Image Translation. 1760-1764 - Sadid Sahami, Gene Cheung, Chia-Wen Lin:

Fast Graph Sampling for Short Video Summarization Using Gershgorin Disc Alignment. 1765-1769 - Xiaopeng Ke, Boyu Chang

, Hao Wu, Fengyuan Xu, Sheng Zhong:
Towards Practical and Efficient Long Video Summary. 1770-1774 - Sunhee Hwang, Minsong Ki, Seung-Hyun Lee, Sanghoon Park, Byoung-Ki Jeon:

Cut And Continuous Paste Towards Real-Time Deep Fall Detection. 1775-1779 - Aditya Singh, Saheb Chhabra, Puspita Majumdar, Richa Singh, Mayank Vatsa:

Mannet: A Large-Scale Manipulated Image Detection Dataset And Baseline Evaluations. 1780-1784 - Laura Kart, Niv Cohen:

Approaches Toward Physical and General Video Anomaly Detection. 1785-1789 - Suiyi Ling, Andreas Pastor, Junle Wang, Patrick Le Callet:

Considering User Agreement in Learning to Predict the Aesthetic Quality. 1790-1794 - Qi Zheng

, Zhengzhong Tu, Yibo Fan, Xiaoyang Zeng, Alan C. Bovik:
No-Reference Quality Assessment of Variable Frame-Rate Videos Using Temporal Bandpass Statistics. 1795-1799 - Joel Jung, Alexandre Giraud, Meijia Song, Songnan Li, Xiang Li, Shan Liu:

Towards Joint Frame-Level and MOS Quality Predictions with Low-Complexity Objective Models. 1800-1804 - Satyam Mohla, Anshul Nasery, Biplab Banerjee:

Teaching CNNs to Mimic Human Visual Cognitive Process & Regularise Texture-Shape Bias. 1805-1809 - Shaoguo Wen, Suiyi Ling, Junle Wang, Ximing Chen, Yanqing Jing, Patrick Le Callet:

Subjective And Objective Quality Assessment Of Mobile Gaming Video. 1810-1814 - Yanzhe Zhong, Huadong Pan, Bangjie Tang, Zhonggeng Liu, Yiming Zhu, Jun Yin:

ER-PIQA: A Task-Guided Pedestrian Image Quality Assessment Via Embedding Reconstruction. 1815-1819 - Mohsen Zand, Haleh Damirchi, Andrew Farley, Mahdiyar Molahasani, Michael A. Greenspan, Ali Etemad:

Multiscale Crowd Counting and Localization By Multitask Point Supervision. 1820-1824 - Yu-Zhang Chen, Tsung-Jung Liu

, Kuan-Hsien Liu
:
Super-Resolution of Satellite Images by two-Dimensional RRDB and Edge-Enhancement Generative Adversarial Network. 1825-1829 - Saurabh Sahu, Palash Goyal:

Leveraging Local Temporal Information for Multimodal Scene Classification. 1830-1834 - Menghao Li, Mingtao Pei, Wei Liang:

Predicting Human Motion Using Key Subsequences. 1835-1839 - Ruxin Ding, Jianfeng Ren, Heng Yu

, Jiawei Li:
Dynamic Texture Recognition Using PDV Hashing and Dictionary Learning on Multi-Scale Volume Local Binary Pattern. 1840-1844 - Qing Gao, Mingtao Pei, Hongyu Shen:

Do You Live a Healthy Life? Analyzing Lifestyle by Visual Life Logging. 1845-1849 - Liping Huang, Taizo Suzuki:

Weighted Wavelet-Based Spectral-Spatial Transforms For CFA-Sampled Raw Camera Image Compression Considering Image Features. 1850-1854 - Dongyang Li, Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang

, Yichen Qian, Hao Li:
Jmpnet: Joint Motion Prediction for Learning-Based Video Compression. 1855-1859 - Fabian Brand, Christian Herglotz, André Kaup:

A Low-Parametric Model for Bit-Rate Estimation of VVC Residual Coding. 1860-1864 - Vignesh V. Menon

, Hadi Amirpour
, Mohammed Ghanbari, Christian Timmerer:
OPTE: Online Per-Title Encoding for Live Video Streaming. 1865-1869 - Kedeng Tong, Xin Jin, Chen Wang, Fan Jiang:

SADN: Learned Light Field Image Compression with Spatial-Angular Decorrelation. 1870-1874 - Wenfeng Li, Zongcai Du, Hao He, Jie Tang, Gangshan Wu:

Hierarchical Feature Aggregation Network for Deep Image Compression. 1875-1879 - Tianyou Chen

, Xiaoguang Hu, Jin Xiao
, Guofeng Zhang, Shaojie Wang:
Accurate Instance Segmentation Via Collaborative Learning. 1880-1884 - Jiehua Zhang, Zhuo Su, Yanghe Feng, Xin Lu

, Matti Pietikäinen, Li Liu:
Dynamic Binary Neural Network by Learning Channel-Wise Thresholds. 1885-1889 - Wanyu Wu, Wei Wang, Kui Jiang, Xin Xu, Ruimin Hu:

Self-Supervised Learning on A Lightweight Low-Light Image Enhancement Model with Curve Refinement. 1890-1894 - Jingquan Wang, Jing Xu

, Yu Pan, Zenglin Xu:
Semantically Proportional Patchmix for Few-Shot Learning. 1895-1899 - Zhikui Chen, Tiandong Ji, Suhua Zhang, Fangming Zhong:

Noise Suppression for Improved Few-Shot Learning. 1900-1904 - Cheryl Sze Yin Wong, Guo Yang, Arulmurugan Ambikapathi, Savitha Ramasamy

:
Online Continual Learning Using Enhanced Random Vector Functional Link Networks. 1905-1909 - Miaohua Zhang

, Yongsheng Gao, Jun Zhou:
A Generalized Kernel Risk Sensitive Loss for Robust Two-Dimensional Singular Value Decomposition. 1910-1914 - Xiangling Ding

, Pu Huang, Dengyong Zhang, Xianfeng Zhao
:
Video Frame Interpolation via Local Lightweight Bidirectional Encoding with Channel Attention Cascade. 1915-1919 - Yue Lv, Wenming Yang, Wangmeng Zuo, Qingmin Liao, Rui Zhu:

Sain: Similarity-Aware Video Frame Interpolation. 1920-1924 - Zejia Fan, Jiaying Liu

, Wenhan Yang, Wei Xiang, Zongming Guo:
Self-Learned Video Super-Resolution with Augmented Spatial and Temporal Context. 1925-1929 - Jiahui Liu, Mingcai Zhou, Meng Xiao:

Deformable Convolution Dense Network for Compressed Video Quality Enhancement. 1930-1934 - Siying Liu

, Roxana Alexandru, Pier Luigi Dragotti
:
Convolutional ISTA Network with Temporal Consistency Constraints for Video Reconstruction from Event Cameras. 1935-1939 - Xuezhi Tong, Rui Wang

, Chuan Wang, Sanyi Zhang, Xiaochun Cao:
PMP-NET: Rethinking Visual Context for Scene Graph Generation. 1940-1944 - Feicheng Huang, Zhixin Li:

Improve Image Captioning Via Relation Modeling. 1945-1949 - Lei Cui, Huan Peng, Yangguang Li, Chuming Li, Xingrun Xing:

Equal Loss: A Simple Loss Function for Noise Robust Learning. 1950-1954 - Boyang Wan, Wenhui Jiang, Yuming Fang:

Informative Attention Supervision for Grounded Video Description. 1955-1959 - Jialu Zhang, Qian Zhang, Jianfeng Ren, Yitian Zhao, Jiang Liu

:
Spatial-Context-Aware Deep Neural Network for Multi-Class Image Classification. 1960-1964 - Hongjun Wu

, Mengzhu Li, Yongcheng Liu, Hongzhe Liu
, Cheng Xu
, Xuewei Li:
Transtl: Spatial-Temporal Localization Transformer for Multi-Label Video Classification. 1965-1969 - Kyuyeon Kim, Junsik Jung, Woo Jae Kim, Sung-Eui Yoon:

Deep Video Inpainting Guided by Audio-Visual Self-Supervision. 1970-1974 - Guangwei Li, Xuenan Xu, Mengyue Wu, Kai Yu:

Navigating Audio-Visual Event Detection Across Mismatched Modalities. 1975-1979 - Donglai Wei, Chen-Geng Liu

, Yang Liu
, Jing Liu, Xiao-Guang Zhu, Xinhua Zeng:
Look, Listen and Pay More Attention: Fusing Multi-Modal Information for Video Violence Detection. 1980-1984 - Changsheng Xu, Zhenlong Xu, Yifan He, Shuigeng Zhou, Jihong Guan:

Multi-Modal Learning with Text Merging for TEXTVQA. 1985-1989 - Ping Wang, Yijie Cao, Lei Lu:

A Novel Part Feature Integration and Fusion Method for Fine-Grained Vehicle Recognition. 1990-1994 - Yiqiang Chen, Feng Liu, Ke Pei:

Monocular Vehicle 3D Bounding Box Estimation Using Homograhy and Geometry in Traffic Scene. 1995-1999 - Xin Yi, Bo Ma, Jiahao Wu:

FSM: Feature Sampling Module for Object Detection. 2000-2004 - Senyun Kuang, Shijin Meng, Bo Xiao, Lv Tang, Bo Li:

Rethinking Two-B-Real Net for Real-Time Salient Object Detection. 2005-2009 - Bo Cui, Hui Qu, Xuhui Huang, Shan Yu:

Balanced Ranking and Sorting For Class Incremental Object Detection. 2010-2014 - Yihao Luo, Xiang Cao

, Juntao Zhang, Leixilan Pan, Tianjiang Wang, Qi Feng:
Multi-Scale Reinforcement Learning Strategy for Object Detection. 2015-2019 - Zhihao Wu, Chengliang Liu

, Chao Huang, Jie Wen, Yong Xu:
Deep Object Detection with Example Attribute Based Prediction Modulation. 2020-2024 - Shanzhi Yin

, Chao Li, Youneng Bao, Yongsheng Liang, Fanyang Meng, Wei Liu:
Universal Efficient Variable-Rate Neural Image Compression. 2025-2029 - Bowen Li

, Xin Yao, Chao Li, Youneng Bao, Fanyang Meng, Yongsheng Liang:
AdderIC: Towards Low Computation Cost Image Compression. 2030-2034 - Saiping Zhang, Luis Herranz

, Marta Mrak, Marc Górriz Blanch, Shuai Wan, Fuzheng Yang:
DCNGAN: A Deformable Convolution-Based GAN with QP Adaptation for Perceptual Quality Enhancement of Compressed Video. 2035-2039 - Anne-Flore Perrin

, Yejing Xie, Tao Zhang, Yiting Liao, Junlin Li, Patrick Le Callet:
Specialised Video Quality Model For Enhanced User Generated Content (UGC) With Special Effects. 2040-2044 - Andreas Pastor, Lukás Krasula, Xiaoqing Zhu, Zhi Li, Patrick Le Callet:

Improving Maximum Likelihood Difference Scaling Method To Measure Inter Content Scale. 2045-2049 - Ao-Xiang Zhang, Yuan-Gen Wang:

Texture Information Boosts Video Quality Assessment. 2050-2054 - Keisuke Ozawa

:
Plug-and-Play and Relay Regularizations on Noisy Low Rank Tensor Completion for Snapshot Multispectral Image Restoration. 2055-2059 - Ashish Tiwari

, Shanmuganathan Raman:
LERPS: Lighting Estimation and Relighting for Photometric Stereo. 2060-2064 - Huiyu Duan, Xiongkuo Min

, Wei Shen, Guangtao Zhai:
A Unified Two-Stage Model for Separating Superimposed Images. 2065-2069 - Siyu Huang

, Haoyi Xiong
, Tianyang Wang, Bihan Wen
, Qingzhong Wang, Zeyu Chen, Jun Huan, Dejing Dou:
Parameter-Free Style Projection for Arbitrary Image Style Transfer. 2070-2074 - Yangfan Sun, Zhu Li, Li Li, Shizheng Wang, Wei Gao

:
Optimization of Compressive Light Field Display in Dual-Guided Learning. 2075-2079 - Yusuke Matsui, Yoshiki Imaizumi, Naoya Miyamoto, Naoki Yoshifuji

:
ARM 4-BIT PQ: SIMD-Based Acceleration for Approximate Nearest Neighbor Search on ARM. 2080-2084 - Chao Wang, Yi Gu, Jie Li, Xinlei He, Zirui Zhang, Yuting Gao, Chentao Wu:

Iterative Learning for Distorted Image Restoration. 2085-2089 - Xiaoyu Zhang

, Wei Gao
, Hui Yuan
, Ge Li:
JE2NET: Joint Exploitation and Exploration in Reinforcement Learning Based Image Restoration. 2090-2094 - Kun Yang, Juan Zhang, Xiaoqi Lang:

Multiple Patch-Aware Network for Faster Real-World Image Dehazing. 2095-2099 - Zhenyu Tang, Long Ma, Xiaoke Shang, Xin Fan:

Learning to Fuse Heterogeneous Features for Low-Light Image Enhancement. 2100-2104 - Jiachun Li, Kunkun Qin, Ruotao Xu

, Hui Ji
:
Deep Scale-Aware Image Smoothing. 2105-2109 - Yanbo Gao, Menghu Jia, Shuai Li, Xun Cai, Mao Ye, Frédéric Dufaux:

A Multiscale Gradient-Backpropagation Optimization Framework for Deformable Convolution Based Compressed Video Enhancement. 2110-2114 - Tomohiro Hayase, Suguru Yasutomi, Nakamasa Inoue:

Downstream Augmentation Generation For Contrastive Learning. 2115-2119 - Chao Dong, Qi Ye, Wenchao Meng

, Kaixiang Yang:
Few-Shot Learning with Improved Local Representations via Bias Rectify Module. 2120-2124 - Pichao Wang, Fan Wang, Hao Li:

Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer. 2125-2129 - Fangxin Liu, Wenbo Zhao, Yongbiao Chen, Zongwu Wang, Fei Dai:

DynSNN: A Dynamic Approach to Reduce Redundancy in Spiking Neural Networks. 2130-2134 - Yongsheng Zhang, Qing Liu

, Yang Zhao, Yixiong Liang:
MEJIGCLU: More Effective Jigsaw Clustering For Unsupervised Visual Representation Learning. 2135-2139 - Cheng Zhuang, Yunlian Sun:

Ganet: Unary Attention Reaches Pairwise Attention Via Implicit Group Clustering in Light-Weight CNNs. 2140-2144 - Ting-Wei Chang, Wei-Chen Chiu, Ching-Chun Huang:

Find The Way Back: Invertible Kernel Estimator For Blind Image Super-Resolution. 2145-2149 - Haoquan Wang, Gang Zhang, Zhichun Lei:

Fine-Grained Dynamic Loss for Accurate Single-Image Super-Resolution. 2150-2154 - Gongzhe Li, Linwei Qiu

, Haopeng Zhang
, Fengying Xie, Zhiguo Jiang:
Multi-Frame Super-Resolution With Raw Images Via Modified Deformable Convolution. 2155-2159 - Yan Wang, Yao Lu, Shunzhou Wang, Wenyao Zhang, Zijian Wang:

Local-Global Feature Aggregation for Light Field Image Super-Resolution. 2160-2164 - Hao He, Zongcai Du, Wenfeng Li, Jie Tang, Gangshan Wu:

Pyramid Fusion Attention Network For Single Image Super-Resolution. 2165-2169 - Xian Zhong

, Zhuo Zhou
, Wenxuan Liu, Kui Jiang, Xuemei Jia, Wenxin Huang, Zheng Wang:
VCD: View-Constraint Disentanglement for Action Recognition. 2170-2174 - Chengming Zou, Ducheng Yuan, Long Lan, Haoang Chi:

Privacy-Preserving Action Recognition. 2175-2179 - Hongcheng Zhang, Xu Zhao:

Spatio-Temporal Motion Aggregation Network for Video Action Detection. 2180-2184 - Yanhao Jing, Feng Wang:

TP-VIT: A Two-Pathway Vision Transformer for Video Action Recognition. 2185-2189 - Yang Liu

, Jing Liu
, Xiaoguang Zhu, Donglai Wei, Xiaohong Huang, Liang Song:
Learning Task-Specific Representation for Video Anomaly Detection with Spatial-Temporal Attention. 2190-2194 - Mengzhu Li, Hongjun Wu

, Yongcheng Liu, Hongzhe Liu
, Cheng Xu
, Xuewei Li:
W-ART: Action Relation Transformer for Weakly-Supervised Temporal Action Localization. 2195-2199 - Jinpeng Liu, Song Wu, Dehong He, Guoqiang Xiao:

MS-ROCANet: Multi-Scale Residual Orthogonal-Channel Attention Network for Scene Text Detection. 2200-2204 - Shan Liu, Guoqiang Xiao, Xiaohui Xu, Song Wu:

Bi-Directional Normalization and Color Attention-Guided Generative Adversarial Network for Image Enhancement. 2205-2209 - Zhikui Chen, Han Wang, Suhua Zhang, Fangming Zhong:

Dual-Attention Network for Few-Shot Segmentation. 2210-2214 - Jiapeng Li, Ge Li, Thomas H. Li:

Attention Guided Invariance Selection for Local Feature Descriptors. 2215-2219 - Jiahao Wang, Mingdeng Cao, Shuwei Shi, Baoyuan Wu, Yujiu Yang

:
Attention Probe: Vision Transformer Distillation in the Wild. 2220-2224 - Bin Jiang, Fangqiang Xu

, Jun Xia, Chao Yang, Wei Huang, Yun Huang:
Stacked Multi-Scale Attention Network for Image Colorization. 2225-2229 - Han Wang, Yali Li, Shengjin Wang:

CRPN: Distinguish Novel Categories Via Class-Relevant Region Proposal Network for Few-Shot Object Detection. 2230-2234 - Zhishan Li, Mingmu Chen, Yifan He, Lei Xie, Hongye Su:

An Efficient Framework for Detection and Recognition of Numerical Traffic Signs. 2235-2239 - Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama:

Divergence-Guided Feature Alignment for Cross-Domain Object Detection. 2240-2244 - Jun Wang

, Hefeng Zhou
, Xiaohan Yu
:
PGTRNET: Two-Phase Weakly Supervised Object Detection with Pseudo Ground Truth Refinement. 2245-2249 - Weijie Liu, Chong Wang, Shenghao Yu, Chenchen Tao

, Jun Wang, Jiafei Wu:
Novel Instance Mining with Pseudo-Margin Evaluation for Few-Shot Object Detection. 2250-2254 - Chuang Yang

, Mulin Chen, Yuan Yuan, Qi Wang:
BiP-Net: Bidirectional Perspective Strategy Based Arbitrary-Shaped Text Detection Network. 2255-2259 - Tim Heydrich, Yimin Yang, Xiangyu Ma, Yu Liu, Shan Du

:
A Novel Lightweight Network for Fast Monocular Depth Estimation. 2260-2264 - Tim Heydrich, Yimin Yang, Shan Du

:
A Lightweight Self-Supervised Training Framework for Monocular Depth Estimation. 2265-2269 - Hao Liu, Hui Yuan

, Raouf Hamzaoui, Wei Gao
, Shuai Li:
PU-Refiner: A Geometry Refiner with Adversarial Learning for Point Cloud Upsampling. 2270-2274 - Bo-Fan Chen, Yang-Ming Yeh, Yi-Chang Lu:

CF-Net: Complementary Fusion Network for Rotation Invariant Point Cloud Completion. 2275-2279 - Zihao Zhang, Nan Sang, Xupeng Wang

:
TH-Net: A Method Of Single 3d Object Tracking Based On Transformers And Hausdorff Distance. 2280-2284 - Hengxin Feng, Weifeng Liu, Yanjiang Wang, Baodi Liu:

Enrich Features for Few-Shot Point Cloud Classification. 2285-2289 - Jaewoo Lee, Daeul Park, Dongwook Lee, Daehyun Ji

:
Semi-Supervised 360° Depth Estimation from Multiple Fisheye Cameras with Pixel-Level Selective Loss. 2290-2294 - Wei Zhong, Yazhi Yuan, Xinchen Ye, Dian Zheng, Rui Xu:

Underwater Stereo Matching Via Unsupervised Appearance And Feature Adaptation Networks. 2295-2299 - Pei Tang, Liangrui Peng, Ruijie Yan, Haodong Shi, Gang Yao, Changsong Liu, Jie Li, Yuqi Zhang:

Domain Adaptation via Mutual Information Maximization for Handwriting Recognition. 2300-2304 - Ang Li, Jian Hu, Chilin Fu, Xiaolu Zhang, Jun Zhou:

Attribute-Conditioned Face Swapping Network for Low-Resolution Images. 2305-2309 - Ying Bian, Peng Zhang, Jingjing Wang, Chunmao Wang, Shiliang Pu:

Learning Multiple Explainable and Generalizable Cues for Face Anti-Spoofing. 2310-2314 - Bastien Laville

, Laure Blanc-Féraud, Gilles Aubert:
Off-The-Grid Covariance-Based Super-Resolution Fluctuation Microscopy. 2315-2319 - Zhiyuan Zha

, Bihan Wen
, Xin Yuan, Jiantao Zhou, Ce Zhu:
Simultaneous Nonlocal Low-Rank And Deep Priors For Poisson Denoising. 2320-2324 - Yiming Liu

, Yanni Zhang, Qiang Li, Jun Kong, Miao Qi, Jianzhong Wang
:
Double Closed-Loop Network for Image Deblurring. 2325-2329 - Ying Zhang, Youjun Xiang, Lei Cai, Yuli Fu, Wanliang Huo, Junjun Xia:

Single Image De-Raining with High-Low Frequency Guidance. 2330-2334 - Wu Yang, Wuzhen Shi:

Detail Generation and Fusion Networks for Image Inpainting. 2335-2339 - Hong Liu, Ying Zhu, Guoliang Hua, Weibo Huang, Runwei Ding:

Adaptive Weighted Network With Edge Enhancement Module For Monocular Self-Supervised Depth Estimation. 2340-2344 - Diclehan Karakaya

, Oguzhan Ulucan
, Mehmet Türkan
:
Pas-Mef: Multi-Exposure Image Fusion Based On Principal Component Analysis, Adaptive Well-Exposedness And Saliency Map. 2345-2349 - Miaoju Ban, Runwei Ding, Jian Zhang

, Tianyu Guo
, Tao Wang
:
PDD-Net: A Precise Defect Detection Network Based on Point Set Representation. 2350-2354 - Renhui Zhang, Tiancheng Lin, Rui Zhang, Yi Xu:

Solving The Long-Tailed Problem Via Intra- And Inter-Category Balance. 2355-2359 - Zhanchao Huang, Wei Li, Ran Tao:

Extracting and Distilling Direction-Adaptive Knowledge for Lightweight Object Detection in Remote Sensing Images. 2360-2364 - Xiaoliu Luo, Jing Luo, Zhao Duan, Jin Tan, Taiping Zhang:

Pseudo-Interacting Guided Network for Few-Shot Segmentation. 2365-2369 - Yuehui Wang, Qing Wang, Dongyu Zhang:

Few-Shot Generation By Modeling Stereoscopic Priors. 2370-2374 - Kohei Matsuzaki, Kei Kawamura:

Relative Viewpoint Estimation Based on Structured 3d Representation Alignment. 2375-2379 - Minxiang Ye, Yifei Zhang, Shiqiang Zhu, Anhuan Xie, Dan Zhang:

Deep Markov Clustering for Panoptic Segmentation. 2380-2384 - Libo Liu, Chengjian Huang, Chunsheng Cai, Xiaodong Zhang, Qingmao Hu:

Multi-Task Learning Improves the Brain Stoke Lesion Segmentation. 2385-2389 - Hongyi Wang

, Shiao Xie, Lanfen Lin, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen, Ruofeng Tong:
Mixed Transformer U-Net for Medical Image Segmentation. 2390-2394 - Wankang Zeng, Wenkang Fan, Dongfang Shen, Yinran Chen, Xiongbiao Luo:

Contrastive Translation Learning For Medical Image Segmentation. 2395-2399 - Tianfang Meng, Wenqiang Zhang:

Fast Video Object Segmentation via Dynamic YOLACT. 2400-2404 - Tiyu Fang, Zhen Liang

, Xiuli Shao, Zihao Dong, Jinping Li:
Depth Removal Distillation for RGB-D Semantic Segmentation. 2405-2409 - Lingzhao Ju, Xu Zhao:

Mask-Based Attention Parallel Network for in-the-Wild Facial Expression Recognition. 2410-2414 - Lifang Zhou, Siqin Li, Yi Wang, Junlin Liu:

SDNET: Lightweight Facial Expression Recognition For Sample Disequilibrium. 2415-2419 - Mengting Wei

, Wenming Zheng, Yuan Zong, Xingxun Jiang, Cheng Lu, Jiateng Liu:
A Novel Micro-Expression Recognition Approach Using Attention-Based Magnification-Adaptive Networks. 2420-2424 - Weidong Tian, Housen Zhang, Chen Peng, Zhong-Qiu Zhao:

Lipreading Model Based On Whole-Part Collaborative Learning. 2425-2429 - Ahmed Al-Hindawi, Marcela P. Vizcaychipi

, Yiannis Demiris
:
What Is The Patient Looking At? Robust Gaze-Scene Intersection Under Free-Viewing Conditions. 2430-2434 - Haoxian Huang, Luqian Ren, Zhuo Yang, Yinwei Zhan

, Qieshi Zhang, Jujian Lv:
GAZEATTENTIONNET: Gaze Estimation with Attentions. 2435-2439 - Yang Yang

, Yonghua Zhang, Xiaojie Guo:
Low-Light Image Enhancement via Feature Restoration. 2440-2444 - Xiaoyu Zhang

, Wei Gao
:
HIRL: Hybrid Image Restoration Based on Hierarchical Deep Reinforcement Learning via Two-Step Analysis. 2445-2449 - Chengrong Wang, Chenjie Cao, Yanwei Fu

, Xiangyang Xue:
High-Fidelity Portrait Editing Via Exploring Differentiable Guided Sketches from the Latent Space. 2450-2454 - Zhihong Pan

:
Learning Adjustable Image Rescaling with Joint Optimization of Perception and Distortion. 2455-2459 - Wenjun Chen, Chunling Yang, Xin Yang:

FSOINET: Feature-Space Optimization-Inspired Network For Image Compressive Sensing. 2460-2464 - Keuntek Lee, Yeong Il Jang, Nam Ik Cho:

Disentangled Feature-Guided Multi-Exposure High Dynamic Range Imaging. 2465-2469 - Peilun Du, Xiaolong Zheng, Liang Liu, Huadong Ma:

Defending Against Universal Attack Via Curvature-Aware Category Adversarial Training. 2470-2474 - Yunjian Zhang, Yanwei Liu, Jinxia Liu, Pengwei Zhan, Liming Wang, Zhen Xu:

SP Attack: Single-Perspective Attack for Generating Adversarial Omnidirectional Images. 2475-2479 - Yachun Li, Ying Lian, Jingjing Wang, Yuhui Chen, Chunmao Wang, Shiliang Pu:

Few-Shot One-Class Domain Adaptation Based On Frequency For Iris Presentation Attack Detection. 2480-2484 - Margarita Geleta, Cristina Punti, Kevin McGuinness, Jordi Pons, Cristian Canton, Xavier Giró-i-Nieto:

Pixinwav: Residual Steganography for Hiding Pixels in Audio. 2485-2489 - Yurui Xie, Ling Guan:

A Semi-Handcrafted Keypoint Detector with Discriminative Feature Encoding. 2490-2494 - Antonio Agudo

:
Safari from Visual Signals: Recovering Volumetric 3d Shapes. 2495-2499 - Farshad G. Veshki, Sergiy A. Vorobyov:

Coupled Feature Learning Via Structured Convolutional Sparse Coding for Multimodal Image Fusion. 2500-2504 - Rongtao Xu, Changwei Wang

, Bin Fan, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang:
DOMAINDESC: Learning Local Descriptors With Domain Adaptation. 2505-2509 - Arya Aftab, Alireza Morsali

, Shahrokh Ghaemmaghami:
Multi-Head Relu Implicit Neural Representation Networks. 2510-2514 - ZhaoJing Zhou, Yun Zhou, Zhuqing Jiang, Aidong Men, Haiying Wang:

An Efficient Method for Model Pruning Using Knowledge Distillation with Few Samples. 2515-2519 - Guangyu Ren, Tianhong Dai

, Tania Stathaki:
Adaptive Intra-Group Aggregation for Co-Saliency Detection. 2520-2524 - Tanmoy Mukherjee, Nikos Deligiannis

:
Novel Class Discovery: A Dependency Approach. 2525-2528 - Yanfeng Liu

, Qiang Li, Yuan Yuan, Qi Wang:
Single-Shot Balanced Detector for Geospatial Object Detection. 2529-2533 - Ruixin Shi, Junzheng Zhang, Yong Li, Shiming Ge:

Regularized Latent Space Exploration for Discriminative Face Super-Resolution. 2534-2538 - Yi Hou

, Chengyang Li, Yuheng Lu, Liping Zhu, Yuan Li
, Huizhu Jia, Xiaodong Xie:
Enhancing and Dissecting Crowd Counting by Synthetic Data. 2539-2543 - Chenghu Du

, Feng Yu
, Minghua Jiang, Xiong Wei, Tao Peng, Xinrong Hu:
Multi-Pose Virtual Try-On Via Self-Adaptive Feature Filtering. 2544-2548 - Jie Zhang, Yi Xiao, Guo Chen, Qingping Sun, Fangqiang Xu

, Chi-Sing Leung:
Histogram-Guided Semantic-Aware Colorization. 2549-2553 - Green Rosh K. S, Nikhil Krishnan, B. H. Pawan Prasad, Sachin Deepak Lomte:

Content Preserving Scale Space Network for Fast Image Restoration from Noisy-Blurry Pairs. 2554-2558 - Rong Bao, Yurui Ren, Ge Li, Wei Gao

, Shan Liu:
Flow-Based Point Cloud Completion Network with Adversarial Refinement. 2559-2563 - Zezeng Li

, Weimin Wang
, Na Lei, Rui Wang:
Weakly Supervised Point Cloud Upsampling VIA Optimal Transport. 2564-2568 - Ryosuke Watanabe

, Keisuke Nonaka, Haruhisa Kato, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega:
Point Cloud Denoising Using Normal Vector-Based Graph Wavelet Shrinkage. 2569-2573 - Anique Akhtar, Zhu Li, Geert Van der Auwera, Jianle Chen:

Dynamic Point Cloud Interpolation. 2574-2578 - Shashank N. Sridhara, Eduardo Pavez, Antonio Ortega, Ryosuke Watanabe, Keisuke Nonaka:

Point Cloud Attribute Compression Via Chroma Subsampling. 2579-2583 - Lili Zhao

, Xuhu Lin, Wenyi Wang
, Kai-Kuang Ma, Jianwen Chen:
Rangeinet: Fast Lidar Point Cloud Temporal Interpolation. 2584-2588 - Lianlei Shan, Weiqiang Wang:

MBNet: A Multi-Resolution Branch Network for Semantic Segmentation Of Ultra-High Resolution Images. 2589-2593 - Yuxuan Zhang, Wei Yang:

BSOLO: Boundary-Aware One-Stage Instance Segmentation SOLO. 2594-2598 - Shaoping Jiang, Xiangmin Xu, Fang Liu, Xiaofen Xing, Lin Wang:

CS-GResNet: A Simple and Highly Efficient Network for Facial Expression Recognition. 2599-2603 - Bingxu Lu, Qinghua Hu, Yu Wang, Guosheng Hu:

RCANet: Row-Column Attention Network for Semantic Segmentation. 2604-2608 - Zhaozhi Xie, Hongtao Lu:

Exploring Category Consistency for Weakly Supervised Semantic Segmentation. 2609-2613 - Hyeonbin Hwang, Soyeon Kim, Wei-Jin Park, Jiho Seo, Kyungtae Ko, Hyeon Yeo:

Vision Transformer Equipped With Neural Resizer On Facial Expression Recognition Task. 2614-2618 - Kaining Ying, Zhenhua Wang, Cong Bai, Pengfei Zhou:

ISDA: Position-Aware Instance Segmentation with Deformable Attention. 2619-2623 - Zhenfei Zhang, Ming-Ching Chang, Tien D. Bui:

Improving Class Activation Map for Weakly Supervised Object Localization. 2624-2628 - Ruizhe Chen, Zhenqi Fu

, Yue Huang, En Cheng, Xinghao Ding:
A Robust Object Segmentation Network for UnderWater Scenes. 2629-2633 - Leiping Jie

, Hui Zhang:
A Fast and Efficient Network for Single Image Shadow Detection. 2634-2638 - Arvi Jonnarth

, Michael Felsberg:
Importance Sampling Cams For Weakly-Supervised Segmentation. 2639-2643 - Qingfeng Liu, Hai Su, Mostafa El-Khamy

, Kee-Bong Song:
DeepGBASS: Deep Guided Boundary-Aware Semantic Segmentation. 2644-2648 - Talha Hanif Butt, Murtaza Taj:

Camera Calibration Through Camera Projection Loss. 2649-2653 - Christopher Walker, Yuxing Wang, Yawen Lu, Guoyu Lu

:
Inferring Camera Intrinsics Based on Surfaces of Revolution: A Single Image Geometric Network Approach for Camera Calibration. 2654-2658 - Sibo Zhang, Jiahong Yuan, Miao Liao, Liangjun Zhang:

Text2video: Text-Driven Talking-Head Video Synthesis with Personalized Phoneme - Pose Dictionary. 2659-2663 - Mohamed Afham, Udith Haputhanthri, Jathurshan Pradeepkumar

, Mithunjha Anandakumar
, Ashwin De Silva
, Chamira U. S. Edussooriya:
Towards Accurate Cross-Domain in-Bed Human Pose Estimation. 2664-2668 - Yu Sun

, Tianyu Huang, Qian Bao, Wu Liu, Wenpeng Gao, Yili Fu:
Learning Monocular Mesh Recovery of Multiple Body Parts Via Synthesis. 2669-2673 - Xiyang Liu, Peng Li, Ding Ni, Yan Wang, Hui Xue:

LightPose: A Lightweight and Efficient Model with Transformer for Human Pose Estimation. 2674-2678 - Qier An, Yuan Shen

:
On The Observability in Visual Slam Networks. 2679-2683 - Yuxiao Li

, Santiago Mazuelas, Yuan Shen
:
Variational Bayesian Framework for Advanced Image Generation with Domain-Related Variables. 2684-2688 - Marina Gardella

, Tina Nikoukhah, Yanhao Li
, Quentin Bammey
:
The Impact of JPEG Compression on Prior Image Noise. 2689-2693 - Tin Lay Nwe, Ramanpreet Singh Pahwa, Richard Chang, Oo Zaw Min, Jie Wang, Yiqun Li, Dongyun Lin, Shitala Prasad, Sheng Dong:

On the Use of Component Structural Characteristics for Voxel Segmentation in Semicon 3D Images. 2694-2698 - Zihan Zhang, Thierry Blu:

Blind Source Separation via a Weak Exclusion Principle. 2699-2703 - Yuqi Zhang, Qi Qian, Chong Liu, Weihua Chen, Fan Wang, Hao Li, Rong Jin:

Graph Convolution for Re-Ranking in Person Re-Identification. 2704-2708 - Jing Yang, Canlong Zhang, Zhixin Li, Yanping Tang:

Multi-Level Relation Aware Network for Person Re-Identification. 2709-2713 - Zhaopeng Dou, Zhongdao Wang, Yali Li, Shengjin Wang:

Progressive-Granularity Retrieval Via Hierarchical Feature Alignment for Person Re-Identification. 2714-2718 - Minjung Kim, MyeongAh Cho

, Heansung Lee, Suhwan Cho
, Sangyoun Lee:
Occluded Person Re-Identification Via Relational Adaptive Feature Correction Learning. 2719-2723 - Shiping Li, Min Cao, Min Zhang:

Learning Semantic-Aligned Feature Representation for Text-Based Person Search. 2724-2728 - Xuezhi Xiang, Ning Lv

, Yulong Qiao:
Transformer-Based Person Search Model with Symmetric Online Instance Matching. 2729-2733 - Qingye Zhao, Xin Chen, Zhuoyu Zhao, Enyi Tang, Xuandong Li:

Wassertrain: An Adversarial Training Framework Against Wasserstein Adversarial Attacks. 2734-2738 - Siao Liu, Zhaoyu Chen, Wei Li, Jiwei Zhu, Jiafeng Wang, Wenqiang Zhang, Zhongxue Gan:

Efficient Universal Shuffle Attack for Visual Object Tracking. 2739-2743 - Riran Cheng

, Nan Sang, Yinyuan Zhou, Xupeng Wang
:
Non-Rigid Transformation Based Adversarial Attack Against 3d Object Tracking. 2744-2748 - Zhengyi Wang, Xupeng Wang

, Ferdous Sohel
, Mohammed Bennamoun
, Yong Liao, Jiali Yu:
Adversary Distillation for One-Shot Attacks on 3D Target Tracking. 2749-2453 - Yin Yin Low, Angeline Tanvy, Raphaël C.-W. Phan, Xiaojun Chang

:
AdverFacial: Privacy-Preserving Universal Adversarial Perturbation Against Facial Micro-Expression Leakages. 2754-2758 - Suryabhan Singh Hada, Miguel Á. Carreira-Perpiñán:

Interpretable Image Classification Using Sparse Oblique Decision Trees. 2759-2763 - Zhenqi Fu

, Xiaopeng Lin, Wu Wang, Yue Huang, Xinghao Ding:
Underwater Image Enhancement Via Learning Water Type Desensitized Representations. 2764-2768 - Ziyin Ma, Changjae Oh:

A Wavelet-Based Dual-Stream Network for Underwater Image Enhancement. 2769-2773 - Shu Chai, Zhenqi Fu

, Yue Huang, Xiaotong Tu, Xinghao Ding:
Unsupervised and Untrained Underwater Image Restoration Based on Physical Image Formation Model. 2774-2778 - Zhenlong Wang, Weifeng Liu, Yanjiang Wang, Baodi Liu:

Agcyclegan: Attention-Guided Cyclegan for Single Underwater Image Restoration. 2779-2783 - Shuhan Qi, Jianjun Du, Mingyan Wu, Hong Yi, Linlin Tang, Tao Qian, Xuan Wang:

Underwater Small Target Detection Based on Deformable Convolutional Pyramid. 2784-2788 - Kaixin Chen, Lin Zhang, Ying Shen, Yicong Zhou:

Towards Controllable and Physical Interpretable Underwater Scene Simulation. 2789-2793 - Yongshan Zhang, Xinxin Wang, Zhenyu Wang, Xinwei Jiang, Yicong Zhou:

Graph Learning Based Autoencoder for Hyperspectral Band Selection. 2794-2798 - Fengchao Xiong, Minchao Ye, Jun Zhou, Jianfeng Lu, Yuntao Qian:

Multitask Sparse Neural Network for Hyperspectral Image Denoising. 2799-2803 - Chen Xiaoyue, Xianghai Cao:

Hyperspectral Image Classification Based on Co-Learning Through Dual-Architecture Ensemble. 2804-2808 - Zhuanfeng Li

, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Yuntao Qian:
Material-Guided Siamese Fusion Network for Hyperspectral Object Tracking. 2809-2813 - Xiuheng Wang, Jie Chen, Cédric Richard

:
Hyperspectral Image Super-Resolution with Deep Priors and Degradation Model Inversion. 2814-2818 - Na Liu, Wei Li, Ran Tao:

Geometric Low-Rank Tensor Approximation for Remotely Sensed Hyperspectral And Multispectral Imagery Fusion. 2819-2823 - Haoyue Tian, Pan Gao, Ran Wei, Manoranjan Paul

:
Dilated Convolutional Neural Network-Based Deep Reference Picture Generation for Video Compression. 2824-2828 - Yanghao Li, Xinyao Chen, Jisheng Li

, Jiangtao Wen, Yuxing Han, Shan Liu, Xiaozhong Xu:
Rate Control for Learned Video Compression. 2829-2833 - Xuekai Wei, Mingliang Zhou, Weijia Jia:

Global Optimization Solution for Dynamic Adaptive 360-Degree Streaming. 1-5 - Juliano S. Assine, José Cândido Silveira Santos Filho, Eduardo Valle:

Collaborative Object Detectors Adaptive to Bandwidth and Computation. 2839-2843 - Mu Li, Baojiang Zhong

, Kai-Kuang Ma:
MA-NET: Multi-Scale Attention-Aware Network for Optical Flow Estimation. 2844-2848 - Yizhuo Li, Cewu Lu:

Modeling Human Memory in Multi-Object Tracking with Transformers. 2849-2853 - Chang-Sheng Lin, Chia-Yi Hsu, Pin-Yu Chen, Chia-Mu Yu:

Real-World Adversarial Examples Via Makeup. 2854-2858 - Joseph Clements, Yingjie Lao:

In Pursuit of Preserving the Fidelity of Adversarial Images. 2859-2863 - Meiling Li

, Nan Zhong, Xinpeng Zhang, Zhenxing Qian
, Sheng Li:
Object-Oriented Backdoor Attack Against Image Captioning. 2864-2868 - Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich:

Towards Robust Speech-to-Text Adversarial Attack. 2869-2873 - Yixiao Xu

, Xiaolei Liu, Mingyong Yin, Teng Hu, Kangyi Ding:
Sparse Adversarial Attack For Video Via Gradient-Based Keyframe Selection. 2874-2878 - Hui Zeng

, Kang Deng, Biwei Chen, Anjie Peng:
How Secure Are The Adversarial Examples Themselves? 2879-2883 - Xiaohui Zhao, Yang Yu, Rongrong Ni, Yao Zhao:

Exploring Complementarity of Global and Local Spatiotemporal Information for Fake Face Video Detection. 2884-2888 - Edoardo Daniele Cannas

, János Horváth, Sriram Baireddy, Paolo Bestagini, Edward J. Delp, Stefano Tubaro:
Panchromatic Imagery Copy-Paste Localization Through Data-Driven Sensor Attribution. 2889-2893 - Lv Chen, Dengpan Ye, Yueyun Shang, Jiaqing Huang:

Robust Video Hashing Based on Local Fluctuation Preserving for Tracking Deep Fake Videos. 2894-2898 - Ping Wang, Kunlin Liu, Wenbo Zhou, Hang Zhou, Honggu Liu, Weiming Zhang, Nenghai Yu:

ADT: Anti-Deepfake Transformer. 2899-1903 - Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu:

Eyes Tell All: Irregular Pupil Shapes Reveal GAN-Generated Faces. 2904-2908 - Antonio Theophilo, Rafael Padilha, Fernanda A. Andaló, Anderson Rocha:

Explainable Artificial Intelligence for Authorship Attribution on Social Media. 2909-2913 - Guiping Zhu

, Mingzhu Ma, Yuwen Huang, Kuikui Wang, Gongping Yang
:
Dual-Domain Low-Rank Fusion Deep Metric Learning for Off-the-Person ECG Biometrics. 2914-2918 - Kanghao Zhang

, Shan Liang, Shuai Nie, Shulin He, Jiahui Pan, Xueliang Zhang, Haoxin Ma, Jiangyan Yi:
A Robust Deep Audio Splicing Detection Method via Singularity Detection Feature. 2919-2923 - Kuikui Wang, Gongping Yang

, Yuwen Huang, Lu Yang, Yilong Yin:
Online Ecg Biometrics Via Hadamard Code. 2924-2928 - Ziyue Xiang

, Paolo Bestagini, Stefano Tubaro, Edward J. Delp:
Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers. 2929-2933 - Chong Liu, Yuqi Zhang, Weihua Chen, Fan Wang, Hao Li, Yi-Dong Shen:

Adaptive Matching Strategy for Multi-Target Multi-Camera Tracking. 2934-2938 - Hanye Huang, Youjun Xiang, Guodong Yang, Lingling Lv, Xianfeng Li

, Zichun Weng, Yuli Fu:
Generalized Face Anti-Spoofing via Cross-Adversarial Disentanglement with Mixing Augmentation. 2939-2943 - Taoshan Zhang, Youjun Xiang, Xianfeng Li

, Zichun Weng, Zhen Chen, Yuli Fu:
Free Lunch for Cross-Domain Occluded Face Recognition without Source Data. 2944-2948 - Zijun Zhuang

, Hongtao Lu:
Coneface: Approximate Pairwise Loss for Face Recognition. 2949-2953 - Jie Jiang, Yunlian Sun:

Depth-Based Ensemble Learning Network For Face Anti-Spoofing. 2954-2958 - Eklavya Sarkar

, Pavel Korshunov, Laurent Colbois
, Sébastien Marcel:
Are GAN-based morphs threatening face recognition? 2959-2963 - Yulu Jin

, Lifeng Lai:
Privacy Protection In Learning Fair Representations. 2964-2968 - Le Feng, Sheng Li, Zhenxing Qian

, Xinpeng Zhang:
Stealthy Backdoor Attack with Adversarial Training. 2969-2973 - Dan Zhao, Hong Chen, Suyun Zhao, Ruixuan Liu, Cuiping Li, Xiaoying Zhang:

Fldp: Flexible Strategy For Local Differential Privacy. 2974-2978 - Mohammad Amin Zarrabian

, Ni Ding, Parastoo Sadeghi, Thierry Rakotoarivelo:
Enhancing Utility In The Watchdog Privacy Mechanism. 2979-2983 - Michele Cirillo, Mario Di Mauro

, Vincenzo Matta, Giuseppe Basileo:
Cyber-Threat Propagation over Network-Slicing Architectures. 2984-2988 - Ecenaz Erdemir, Pier Luigi Dragotti

, Deniz Gündüz:
Privacy-Aware Communication over a Wiretap Channel with Generative Networks. 2989-2993 - Ran Shi, Jian Xiong, Tong Qiao:

Encrypted Image Visual Security Index via Non-Local Recognizable Degree Evaluation. 2994-2998 - Lu Miao, Wei Yang, Rong Hu, Lu Li, Liusheng Huang:

Against Backdoor Attacks In Federated Learning With Differential Privacy. 2999-3003 - Xinying Liao, Jiaye Xue, Shengxing Yu, Ximeng Liu, Jiangang Shu:

SecMPNN: 3-Party Privacy-Preserving Molecular Structure Properties Inference. 3004-3008 - Behrooz Razeghi

, Shideh Rezaeifar, Sohrab Ferdowsi, Taras Holotyak
, Slava Voloshynovskiy:
Compressed Data Sharing Based On Information Bottleneck Model. 3009-3013 - Thibault Maho, Teddy Furon, Erwan Le Merrer:

Randomized Smoothing Under Attack: How Good is it in Practice? 3014-3018 - Chau Yi Li, Andrea Cavallaro:

Training Privacy-Preserving Video Analytics Pipelines by Suppressing Features That Reveal Information About Private Attributes. 3019-3023 - Yulong Wang

, Xingshu Chen, Qixu Wang, Run Yang, Bangzhou Xin:
Unsupervised Anomaly Detection for Container Cloud Via BILSTM-Based Variational Auto-Encoder. 3024-3028 - Fusen Wang, Jun Sang, Chunlin Huang, Bin Cai,



Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID