- Oleg Popov, Tatyana Chernysheva, Andrey Borisov, Pavel Sapronov, Kirill Orlov:
Changing the Properties of the Audio Broadcast Signal in Adaptive Transmission Channels. FRUCT 2023: 219-225
- W. Bastiaan Kleijn, Michael Chinen, Felicia S. C. Lim, Jan Skoglund:
Multi-Channel Audio Signal Generation. ICASSP 2023: 1-5
- Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel Audio. ICASSP 2023: 1-5
- Jialin Wang, Yunfeng Xu, Borui Miao, Shaojie Zhao:
AudioFormer: Channel Audio Encoder Based on Multi-granularity Features. ICONIP (10) 2023: 357-373
- Yijie Li, Xiatong Tong, Qianfei Ren, Qingyang Li, Lanqing Yang, Yi-Chao Chen, Guangtao Xue, Xiaoyu Ji, Jiadi Yu:
AUDIOSENSE: Leveraging Current to Acoustic Channel to Detect Appliances at Single-Point. SECON 2023: 240-248
- Peng Zhang, Hui Zhao, Meijuan Li, Yida Chen, Jianqiang Zhang, Fuqiang Wang, Xiaoming Wu:
Audio-Visual Emotion Recognition Based on Multi-Scale Channel Attention and Global Interactive Fusion. SMC 2023: 2144-2150
- Meghna Pandharipande, Sunil Kumar Kopparapu:
Candidate Speech Extraction from Multi-speaker Single-Channel Audio Interviews. SPECOM (1) 2023: 210-221
- Guinan Li, Jiajun Deng, Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Mingyu Cui, Helen Meng, Xunying Liu:
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition. CoRR abs/2307.02909 (2023)
- Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio. CoRR abs/2308.05218 (2023)
- Anton Ratnarajah, Shi-Xiong Zhang, Dong Yu:
M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec. CoRR abs/2309.07416 (2023)
- Antoni Dimitriadis, Siqi Pan, Vidhyasaharan Sethu, Beena Ahmed:
Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio. CoRR abs/2310.10922 (2023)
2022
- Antonio Gomez, Marios S. Pattichis, Sylvia Celedón-Pattichis:
Speaker Diarization and Identification From Single Channel Classroom Audio Recordings Using Virtual Microphones. IEEE Access 10: 56256-56266 (2022)
- Wenjian Ding, Zhe Sun, Xingxing Wu, Zhenglu Yang, Jordi Solé-Casals, Cesar F. Caiafa:
Tensor completion algorithms for estimating missing values in multi-channel audio signals. Comput. Electr. Eng. 97: 107561 (2022)
- Andong Li, Chengshi Zheng, Guochen Yu, Juanjuan Cai, Xiaodong Li:
Filtering and Refining: A Collaborative-Style Framework for Single-Channel Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2156-2172 (2022)
- Yi Luo:
A Time-Domain Real-Valued Generalized Wiener Filter for Multi-Channel Neural Separation Systems. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3008-3019 (2022)
- Yanjue Song, Nilesh Madhu:
Improved CEM for Speech Harmonic Enhancement in Single Channel Noise Suppression. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2492-2503 (2022)
- Hassan Taherian, Ke Tan, DeLiang Wang:
Multi-Channel Talker-Independent Speaker Separation Through Location-Based Training. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2791-2800 (2022)
- Sixing Wu, Ying Li, Dawei Zhang, Zhonghai Wu:
Generating Rational Commonsense Knowledge-Aware Dialogue Responses With Channel-Aware Knowledge Fusing Network. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3230-3239 (2022)
- Joon-Young Yang, Joon-Hyuk Chang:
VACE-WPE: Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation. IEEE ACM Trans. Audio Speech Lang. Process. 30: 174-189 (2022)
- Joon-Young Yang, Joon-Hyuk Chang:
Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3144-3159 (2022)
- Hao Zhang, DeLiang Wang:
Neural Cascade Architecture for Multi-Channel Acoustic Echo Suppression. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2326-2336 (2022)
- Tianrui Zong, Juan Zhao, Yong Xiang, Iynkaran Natgunanathan, Longxiang Gao, Wanlei Zhou:
Desynchronization-attack-resilient audio watermarking mechanism for stereo signals using the linear correlation between channels. World Wide Web 25(1): 357-379 (2022)
- Joel Rixen, Matthias Renz:
SFSRNet: Super-resolution for Single-Channel Audio Source Separation. AAAI 2022: 11220-11228
- Hao Jiang, Calvin Murdock, Vamsi Krishna Ithapu:
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization. CVPR 2022: 10534-10542
- Munmun Talukder, Jiang Xie:
Exploiting Playback Device's Effect on Multi-channel Audio to Secure Voice Assistants. GLOBECOM 2022: 6085-6090
- Guinan Li, Jianwei Yu, Jiajun Deng, Xunying Liu, Helen Meng:
Audio-Visual Multi-Channel Speech Separation, Dereverberation and Recognition. ICASSP 2022: 6042-6046
- Penghong Wang, Jiahui Li, Mengyao Ma, Xiaopeng Fan:
Distributed Audio-Visual Parsing Based On Multimodal Transformer and Deep Joint Source Channel Coding. ICASSP 2022: 4623-4627
- Gaopeng Xu, Song Yang, Wei Li, Song Wang, Guo Wei, Junfeng Yuan, Jie Gao:
Channel-Wise AV-Fusion Attention for Multi-Channel Audio-Visual Speech Recognition. ICASSP 2022: 9251-9255
- Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
Mel-Spectrogram Image-Based End-to-End Audio Deepfake Detection Under Channel-Mismatched Conditions. ICME 2022: 1-6
- Kun Chen, Jun Wang, Feng Deng, Xiaorui Wang:
iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning. INTERSPEECH 2022: 4167-4171