Xiong Xiao

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

Joint publications with Zhuo Chen 0006

Publications

2023
[c103]
persistent URL: https://dblp.org/rec/conf/icassp/WuCHXL23
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li:
Speaker Change Detection For Transformer Transducer ASR. ICASSP 2023: 1-5
[i23]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-08549
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li:
Speaker Change Detection for Transformer Transducer ASR. CoRR abs/2302.08549 (2023)
2022
[j28]
persistent URL: https://dblp.org/rec/journals/jstsp/ChenWCWLCLKYXWZ22
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1505-1518 (2022)
[c99]
persistent URL: https://dblp.org/rec/conf/icassp/KandaXGWMCY22
Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End Speaker-Attributed ASR. ICASSP 2022: 8082-8086
[c98]
persistent URL: https://dblp.org/rec/conf/interspeech/Kanda0WXMWG00Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. INTERSPEECH 2022: 521-525
[c97]
persistent URL: https://dblp.org/rec/conf/interspeech/KandaWWXMWGC0Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. INTERSPEECH 2022: 3774-3778
[c96]
persistent URL: https://dblp.org/rec/conf/interspeech/Zhang0K00EYXMQW22
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022: 5383-5387
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-00842
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. CoRR abs/2202.00842 (2022)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-16685
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. CoRR abs/2203.16685 (2022)
2021
[c93]
persistent URL: https://dblp.org/rec/conf/asru/KandaXWZGWMCY21
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. ASRU 2021: 296-303
[c91]
persistent URL: https://dblp.org/rec/conf/icassp/XiaoK0ZYC0L0W0021
Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020. ICASSP 2021: 5824-5828
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-02852
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. CoRR abs/2107.02852 (2021)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-03151
Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR. CoRR abs/2110.03151 (2021)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-13900
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. CoRR abs/2110.13900 (2021)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-14142
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-Wise Permutation Invariant Training. CoRR abs/2110.14142 (2021)
2020
[c87]
persistent URL: https://dblp.org/rec/conf/icassp/ChenYLZMLWXL20
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li:
Continuous Speech Separation: Dataset and Analysis. ICASSP 2020: 7284-7288
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-11458
Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020. CoRR abs/2010.11458 (2020)
2019
[c84]
persistent URL: https://dblp.org/rec/conf/asru/WangCXMYZLL19
Peidong Wang, Zhuo Chen, Xiong Xiao, Zhong Meng, Takuya Yoshioka, Tianyan Zhou, Liang Lu, Jinyu Li:
Speech Separation Using Speaker Inventory. ASRU 2019: 230-236
[c83]
persistent URL: https://dblp.org/rec/conf/asru/YoshiokaHHJKKLL19
Takuya Yoshioka, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Igor Abramovski, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang:
Advances in Online Audio-Visual Meeting Transcription. ASRU 2019: 276-283
[c82]
persistent URL: https://dblp.org/rec/conf/icassp/XiaoCYELDDG19
Xiong Xiao, Zhuo Chen, Takuya Yoshioka, Hakan Erdogan, Changliang Liu, Dimitrios Dimitriadis, Jasha Droppo, Yifan Gong:
Single-channel Speech Extraction Using Speaker Inventory and Attention Network. ICASSP 2019: 86-90
[c81]
persistent URL: https://dblp.org/rec/conf/icassp/YoshiokaCLXED19
Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis:
Low-latency Speaker-independent Continuous Speech Separation. ICASSP 2019: 6980-6984
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-1904-06478
Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis:
Low-Latency Speaker-Independent Continuous Speech Separation. CoRR abs/1904.06478 (2019)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-1907-05955
Liang Lu, Xiong Xiao, Zhuo Chen, Yifan Gong:
PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch. CoRR abs/1907.05955 (2019)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-1912-04979
Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou:
Advances in Online Audio-Visual Meeting Transcription. CoRR abs/1912.04979 (2019)
2018
[c78]
persistent URL: https://dblp.org/rec/conf/icassp/ChenYXLSG18
Zhuo Chen, Takuya Yoshioka, Xiong Xiao, Linyu Li, Michael L. Seltzer, Yifan Gong:
Efficient Integration of Fixed Beamformers and Speech Separation Networks for Multi-Channel Far-Field Speech Separation. ICASSP 2018: 5384-5388
[c77]
persistent URL: https://dblp.org/rec/conf/icassp/LiZCLXYG18
Jinyu Li, Rui Zhao, Zhuo Chen, Changliang Liu, Xiong Xiao, Guoli Ye, Yifan Gong:
Developing Far-Field Speaker System Via Teacher-Student Learning. ICASSP 2018: 5699-5703
[c76]
persistent URL: https://dblp.org/rec/conf/interspeech/YoshiokaECXA18
Takuya Yoshioka, Hakan Erdogan, Zhuo Chen, Xiong Xiao, Fil Alleva:
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks. INTERSPEECH 2018: 3038-3042
[c75]
persistent URL: https://dblp.org/rec/conf/slt/ChenXYELG18
Zhuo Chen, Xiong Xiao, Takuya Yoshioka, Hakan Erdogan, Jinyu Li, Yifan Gong:
Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network. SLT 2018: 558-565
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-1803-10924
Zhuo Chen, Jinyu Li, Xiong Xiao, Takuya Yoshioka, Huaming Wang, Zhenghao Wang, Yifan Gong:
Cracking the cocktail party problem by multi-beam deep attractor network. CoRR abs/1803.10924 (2018)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-1804-05166
Jinyu Li, Rui Zhao, Zhuo Chen, Changliang Liu, Xiong Xiao, Guoli Ye, Yifan Gong:
Developing Far-Field Speaker System Via Teacher-Student Learning. CoRR abs/1804.05166 (2018)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-1810-03655
Takuya Yoshioka, Hakan Erdogan, Zhuo Chen, Xiong Xiao, Fil Alleva:
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks. CoRR abs/1810.03655 (2018)
2017
[c71]
persistent URL: https://dblp.org/rec/conf/asru/ChenLXYWWG17
Zhuo Chen, Jinyu Li, Xiong Xiao, Takuya Yoshioka, Huaming Wang, Zhenghao Wang, Yifan Gong:
Cracking the cocktail party problem by multi-beam deep attractor network. ASRU 2017: 437-444
