Chng Eng Siong

Joint publications with Yuchen Hu

Publications

2024
[j38]
persistent URL: https://dblp.org/rec/journals/taslp/HuCZC24
Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1145-1156 (2024)
[i82]
persistent URL: https://dblp.org/rec/journals/corr/abs-2401-05746
Heqing Zou, Meng Shen, Yuchen Hu, Chen Chen, Eng Siong Chng, Deepu Rajan:
Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection. CoRR abs/2401.05746 (2024)
[i81]
persistent URL: https://dblp.org/rec/journals/corr/abs-2401-10446
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, Eng Siong Chng:
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. CoRR abs/2401.10446 (2024)
[i80]
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-05457
Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng, Chao-Han Huck Yang:
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. CoRR abs/2402.05457 (2024)
[i79]
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-06894
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. CoRR abs/2402.06894 (2024)
2023
[c248]
persistent URL: https://dblp.org/rec/conf/aaai/ChenHZZZC23
Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning. AAAI 2023: 12607-12615
[c245]
persistent URL: https://dblp.org/rec/conf/acl/ZouSCHRC23
Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. ACL (Findings) 2023: 659-672
[c244]
persistent URL: https://dblp.org/rec/conf/acl/HuCLZC23
Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. ACL (1) 2023: 11610-11625
[c243]
persistent URL: https://dblp.org/rec/conf/acl/HuLCQZC23
Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiu-Shi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. ACL (1) 2023: 15213-15232
[c236]
persistent URL: https://dblp.org/rec/conf/icassp/ChenHWC23
Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-Oriented Speech Enhancement Using Diffusion Probabilistic Model. ICASSP 2023: 1-5
[c235]
persistent URL: https://dblp.org/rec/conf/icassp/ChenHZSC23
Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise Adaptation Using Data Simulation. ICASSP 2023: 1-5
[c234]
persistent URL: https://dblp.org/rec/conf/icassp/HuCLZC23
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. ICASSP 2023: 1-5
[c233]
persistent URL: https://dblp.org/rec/conf/icassp/HuCZZC23
Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. ICASSP 2023: 1-5
[c227]
persistent URL: https://dblp.org/rec/conf/ijcai/HuLCZZC23
Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. IJCAI 2023: 5076-5084
[c224]
persistent URL: https://dblp.org/rec/conf/nips/0075HYSCS23
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Chng Eng Siong:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. NeurIPS 2023
[i75]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-11131
Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. CoRR abs/2302.11131 (2023)
[i74]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-11362
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. CoRR abs/2302.11362 (2023)
[i73]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-11981
Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise adaptation using Data Simulation. CoRR abs/2302.11981 (2023)
[i72]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-11989
Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2302.11989 (2023)
[i70]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-04974
Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. CoRR abs/2304.04974 (2023)
[i68]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-09212
Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. CoRR abs/2305.09212 (2023)
[i67]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-09299
Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. CoRR abs/2305.09299 (2023)
[i66]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-10761
Zizheng Zhang, Chen Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-aware Speech Separation with Contrastive Learning. CoRR abs/2305.10761 (2023)
[i63]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-16932
Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Model Approach to Efficient Speech Separation. CoRR abs/2305.16932 (2023)
[i62]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-10563
Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. CoRR abs/2306.10563 (2023)
[i61]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-10567
Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. CoRR abs/2306.10567 (2023)
[i60]
persistent URL: https://dblp.org/rec/journals/corr/abs-2307-08029
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Noise-aware Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2307.08029 (2023)
[i54]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-15701
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. CoRR abs/2309.15701 (2023)
[i53]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-13013
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng:
Generative error correction for code-switching speech recognition using large language models. CoRR abs/2310.13013 (2023)
2022
[c220]
persistent URL: https://dblp.org/rec/conf/icassp/ChenHHQZC22
Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-Critical Sequence Training for Automatic Speech Recognition. ICASSP 2022: 3688-3692
[c219]
persistent URL: https://dblp.org/rec/conf/icassp/ChenHHSC22
Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data. ICASSP 2022: 4298-4302
[c218]
persistent URL: https://dblp.org/rec/conf/icassp/HuHCC22
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. ICASSP 2022: 6292-6296
[c211]
persistent URL: https://dblp.org/rec/conf/interspeech/ChenHHZQC22
Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning. INTERSPEECH 2022: 2773-2777
[i48]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-14838
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. CoRR abs/2203.14838 (2022)
[i47]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-15321
Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data. CoRR abs/2203.15321 (2022)
[i45]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-15526
Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning. CoRR abs/2203.15526 (2022)
[i42]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-06260
Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-critical Sequence Training for Automatic Speech Recognition. CoRR abs/2204.06260 (2022)
[i32]
persistent URL: https://dblp.org/rec/journals/corr/abs-2212-05301
Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning. CoRR abs/2212.05301 (2022)
2021
[i27]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-05267
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. CoRR abs/2110.05267 (2021)