Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

- view
  authority control:
- export record
  dblp key:
  - conf/fruct/PopovCBSO23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/fruct/PopovCBSO23
Oleg Popov, Tatyana Chernysheva, Andrey Borisov, Pavel Sapronov, Kirill Orlov:
Changing the Properties of the Audio Broadcast Signal in Adaptive Transmission Channels. FRUCT 2023: 219-225
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KleijnCLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KleijnCLS23
W. Bastiaan Kleijn, Michael Chinen, Felicia S. C. Lim, Jan Skoglund:
Multi-Channel Audio Signal Generation. ICASSP 2023: 1-5
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangPLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangPLG23
Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel Audio. ICASSP 2023: 1-5
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/WangXMZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/WangXMZ23
Jialin Wang, Yunfeng Xu, Borui Miao, Shaojie Zhao:
AudioFormer: Channel Audio Encoder Based on Multi-granularity Features. ICONIP (10) 2023: 357-373
- view
  authority control:
- export record
  dblp key:
  - conf/secon/LiTRLYCXJY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/secon/LiTRLYCXJY23
Yijie Li, Xiatong Tong, Qianfei Ren, Qingyang Li, Lanqing Yang, Yi-Chao Chen, Guangtao Xue, Xiaoyu Ji, Jiadi Yu:
AUDIOSENSE: Leveraging Current to Acoustic Channel to Detect Appliances at Single-Point. SECON 2023: 240-248
- view
  authority control:
- export record
  dblp key:
  - conf/smc/ZhangZLCZWW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/smc/ZhangZLCZWW23
Peng Zhang, Hui Zhao, Meijuan Li, Yida Chen, Jianqiang Zhang, Fuqiang Wang, Xiaoming Wu:
Audio-Visual Emotion Recognition Based on Multi-Scale Channel Attention and Global Interactive Fusion. SMC 2023: 2144-2150
- view
  authority control:
- export record
  dblp key:
  - conf/specom/PandharipandeK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/PandharipandeK23
Meghna Pandharipande, Sunil Kumar Kopparapu:
Candidate Speech Extraction from Multi-speaker Single-Channel Audio Interviews. SPECOM (1) 2023: 210-221
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-02909
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-02909
Guinan Li, Jiajun Deng, Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Mingyu Cui, Helen Meng, Xunying Liu:
Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition. CoRR abs/2307.02909 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05218
Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio. CoRR abs/2308.05218 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07416
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07416
Anton Ratnarajah, Shi-Xiong Zhang, Dong Yu:
M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec. CoRR abs/2309.07416 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-10922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10922
Antoni Dimitriadis, Siqi Pan, Vidhyasaharan Sethu, Beena Ahmed:
Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio. CoRR abs/2310.10922 (2023)
2022
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/GomezPC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/GomezPC22
Antonio Gomez, Marios S. Pattichis, Sylvia Celedón-Pattichis:
Speaker Diarization and Identification From Single Channel Classroom Audio Recordings Using Virtual Microphones. IEEE Access 10: 56256-56266 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/cee/DingSWYSC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cee/DingSWYSC22
Wenjian Ding, Zhe Sun, Xingxing Wu, Zhenglu Yang, Jordi Solé-Casals, Cesar F. Caiafa:
Tensor completion algorithms for estimating missing values in multi-channel audio signals. Comput. Electr. Eng. 97: 107561 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiZYCL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiZYCL22
Andong Li, Chengshi Zheng, Guochen Yu, Juanjuan Cai, Xiaodong Li:
Filtering and Refining: A Collaborative-Style Framework for Single-Channel Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2156-2172 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Luo22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Luo22
Yi Luo:
A Time-Domain Real-Valued Generalized Wiener Filter for Multi-Channel Neural Separation Systems. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3008-3019 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SongM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SongM22
Yanjue Song, Nilesh Madhu:
Improved CEM for Speech Harmonic Enhancement in Single Channel Noise Suppression. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2492-2503 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TaherianTW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TaherianTW22
Hassan Taherian, Ke Tan, DeLiang Wang:
Multi-Channel Talker-Independent Speaker Separation Through Location-Based Training. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2791-2800 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuLZW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuLZW22
Sixing Wu, Ying Li, Dawei Zhang, Zhonghai Wu:
Generating Rational Commonsense Knowledge-Aware Dialogue Responses With Channel-Aware Knowledge Fusing Network. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3230-3239 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YangC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YangC22
Joon-Young Yang, Joon-Hyuk Chang:
VACE-WPE: Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation. IEEE ACM Trans. Audio Speech Lang. Process. 30: 174-189 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YangC22a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YangC22a
Joon-Young Yang, Joon-Hyuk Chang:
Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3144-3159 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangW22
Hao Zhang, DeLiang Wang:
Neural Cascade Architecture for Multi-Channel Acoustic Echo Suppression. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2326-2336 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/www/ZongZ0NGZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/www/ZongZ0NGZ22
Tianrui Zong, Juan Zhao, Yong Xiang, Iynkaran Natgunanathan, Longxiang Gao, Wanlei Zhou:
Desynchronization-attack-resilient audio watermarking mechanism for stereo signals using the linear correlation between channels. World Wide Web 25(1): 357-379 (2022)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/RixenR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/RixenR22
Joel Rixen, Matthias Renz:
SFSRNet: Super-resolution for Single-Channel Audio Source Separation. AAAI 2022: 11220-11228
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/0007MI22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/0007MI22
Hao Jiang, Calvin Murdock, Vamsi Krishna Ithapu:
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization. CVPR 2022: 10534-10542
- view
  authority control:
- export record
  dblp key:
  - conf/globecom/TalukderX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/globecom/TalukderX22
Munmun Talukder, Jiang Xie:
Exploiting Playback Device's Effect on Multi-channel Audio to Secure Voice Assistants. GLOBECOM 2022: 6085-6090
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiYDLM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiYDLM22
Guinan Li, Jianwei Yu, Jiajun Deng, Xunying Liu, Helen Meng:
Audio-Visual Multi-Channel Speech Separation, Dereverberation and Recognition. ICASSP 2022: 6042-6046
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLMF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLMF22
Penghong Wang, Jiahui Li, Mengyao Ma, Xiaopeng Fan:
Distributed Audio-Visual Parsing Based On Multimodal Transformer and Deep Joint Source Channel Coding. ICASSP 2022: 4623-4627
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuYLWWYG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuYLWWYG22
Gaopeng Xu, Song Yang, Wei Li, Song Wang, Guo Wei, Junfeng Yuan, Jie Gao:
Channel-Wise AV-Fusion Attention for Multi-Channel Audio-Visual Speech Recognition. ICASSP 2022: 9251-9255
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/FathanAK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/FathanAK22
Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
Mel-Spectrogram Image-Based End-to-End Audio Deepfake Detection Under Channel-Mismatched Conditions. ICME 2022: 1-6
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen0DW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen0DW22
Kun Chen, Jun Wang, Feng Deng, Xiaorui Wang:
iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning. INTERSPEECH 2022: 4167-4171