Zhifu Gao

2020 – today

2024
[c11]
- persistent URL: https://dblp.org/rec/conf/acl/MaZYLGZ024
Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen:
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation. ACL (Findings) 2024: 15747-15760
[c10]
- persistent URL: https://dblp.org/rec/conf/icassp/ShiYLCGZ24
Xian Shi, Yexin Yang, Zerui Li, Yanni Chen, Zhifu Gao, Shiliang Zhang:
SeACo-Paraformer: A Non-Autoregressive ASR System with Flexible and Effective Hotword Customization Ability. ICASSP 2024: 10346-10350
[i14]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2402-08846
Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen:
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity. CoRR abs/2402.08846 (2024)
[i13]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2406-05839
Guanrou Yang, Ziyang Ma, Fan Yu, Zhifu Gao, Shiliang Zhang, Xie Chen:
MaLa-ASR: Multimedia-Assisted LLM-Based ASR. CoRR abs/2406.05839 (2024)
[i12]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2407-04051
Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang, Zhangyu Xiao, Zhijie Yan, Yexin Yang, Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng:
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs. CoRR abs/2407.04051 (2024)
[i11]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2407-05407
Zhihao Du, Qian Chen, Shiliang Zhang, Kai Hu, Heng Lu, Yexin Yang, Hangrui Hu, Siqi Zheng, Yue Gu, Ziyang Ma, Zhifu Gao, Zhijie Yan:
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens. CoRR abs/2407.05407 (2024)
[i10]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2409-17746
Keyu An, Zerui Li, Zhifu Gao, Shiliang Zhang:
Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition. CoRR abs/2409.17746 (2024)
2023
[c9]
- persistent URL: https://dblp.org/rec/conf/interspeech/GaoLWLSCLZDZ23
Zhifu Gao, Zerui Li, Jiaming Wang, Haoneng Luo, Xian Shi, Mengzhe Chen, Yabin Li, Lingyun Zuo, Zhihao Du, Shiliang Zhang:
FunASR: A Fundamental End-to-End Speech Recognition Toolkit. INTERSPEECH 2023: 1593-1597
[c8]
- persistent URL: https://dblp.org/rec/conf/interspeech/ShiLGZY23
Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan:
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System. INTERSPEECH 2023: 3247-3251
[i9]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2305-10680
Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan:
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System. CoRR abs/2305.10680 (2023)
[i8]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2305-11013
Zhifu Gao, Zerui Li, Jiaming Wang, Haoneng Luo, Xian Shi, Mengzhe Chen, Yabin Li, Lingyun Zuo, Zhihao Du, Zhangyu Xiao, Shiliang Zhang:
FunASR: A Fundamental End-to-End Speech Recognition Toolkit. CoRR abs/2305.11013 (2023)
[i7]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2310-04673
Jiaming Wang, Zhihao Du, Qian Chen, Yunfei Chu, Zhifu Gao, Zerui Li, Kai Hu, Xiaohuan Zhou, Jin Xu, Ziyang Ma, Wen Wang, Siqi Zheng, Chang Zhou, Zhijie Yan, Shiliang Zhang:
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT. CoRR abs/2310.04673 (2023)
[i6]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2312-15185
Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen:
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation. CoRR abs/2312.15185 (2023)
2022
[c7]
- persistent URL: https://dblp.org/rec/conf/interspeech/GaoZ0Y22
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan:
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition. INTERSPEECH 2022: 2063-2067
[i5]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2206-08317
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan:
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition. CoRR abs/2206.08317 (2022)
2021
[c6]
- persistent URL: https://dblp.org/rec/conf/interspeech/GaoYZYL021
Zhifu Gao, Yiwu Yao, Shiliang Zhang, Jun Yang, Ming Lei, Ian McLoughlin:
Extremely Low Footprint End-to-End ASR System for Smart Device. Interspeech 2021: 4548-4552
[i4]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2104-05784
Zhifu Gao, Yiwu Yao, Shiliang Zhang, Jun Yang, Ming Lei, Ian McLoughlin:
Extremely Low Footprint End-to-End ASR System for Smart Device. CoRR abs/2104.05784 (2021)
2020
[c5]
- persistent URL: https://dblp.org/rec/conf/interspeech/GaoZLM20
Zhifu Gao, Shiliang Zhang, Ming Lei, Ian McLoughlin:
SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition. INTERSPEECH 2020: 6-10
[c4]
- persistent URL: https://dblp.org/rec/conf/interspeech/ZhangGLLGYX20
Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie:
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition. INTERSPEECH 2020: 2142-2146
[i3]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2006-01712
Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie:
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition. CoRR abs/2006.01712 (2020)
[i2]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2006-01713
Zhifu Gao, Shiliang Zhang, Ming Lei, Ian McLoughlin:
SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition. CoRR abs/2006.01713 (2020)
[i1]
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-14099
Zhifu Gao, Shiliang Zhang, Ming Lei, Ian McLoughlin:
Universal ASR: Unifying Streaming and Non-Streaming ASR Using a Single Encoder-Decoder Model. CoRR abs/2010.14099 (2020)

2010 – 2019

2019
[c3]
- persistent URL: https://dblp.org/rec/conf/interspeech/GaoSMLJD19
Zhifu Gao, Yan Song, Ian McLoughlin, Pengcheng Li, Yiheng Jiang, Li-Rong Dai:
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System. INTERSPEECH 2019: 361-365
[c2]
- persistent URL: https://dblp.org/rec/conf/interspeech/JiangSMGD19
Yiheng Jiang, Yan Song, Ian McLoughlin, Zhifu Gao, Li-Rong Dai:
An Effective Deep Embedding Learning Architecture for Speaker Verification. INTERSPEECH 2019: 4040-4044
2018
[c1]
- persistent URL: https://dblp.org/rec/conf/interspeech/GaoSMGD18
Zhifu Gao, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai:
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification. INTERSPEECH 2018: 3578-3582
