default search action

combined dblp search
author search
venue search
publication search

ask others

Yuan Gong 0001

> Home > Persons

Person information

affiliation: Massachusetts Institute of Technology, Cambridge, MA, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c20]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0001LLKG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0001LLKG24
Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James R. Glass:
Listen, Think, and Understand. ICLR 2024
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/ZhangGLCGGKWMG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/ZhangGLCGGKWMG24
Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Yoon Kim, Xixin Wu, Helen Meng, Jim Glass:
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning. NAACL-HLT (Findings) 2024: 4131-4155
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10082
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10082
Andrew Rouditchenko, Yuan Gong, Samuel Thomas, Leonid Karlinsky, Hilde Kuehne, Rogério Feris, James R. Glass:
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation. CoRR abs/2406.10082 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18625
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18625
Liming Wang, Yuan Gong, Nauman Dawalatabad, Marco Vilela, Katerina Placek, Brian Tracey, Yishu Gong, Alan Premasiri, Fernando Vieira, James R. Glass:
Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer. CoRR abs/2406.18625 (2024)
2023
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/GongLLKG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/GongLLKG23
Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James R. Glass:
Joint Audio and Speech Understanding. ASRU 2023: 1-8
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/LuoZCGKWMG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LuoZCGKWMG23
Hongyin Luo, Tianhua Zhang, Yung-Sung Chuang, Yuan Gong, Yoon Kim, Xixin Wu, Helen Meng, James R. Glass:
Search Augmented Instruction Learning. EMNLP (Findings) 2023: 3717-3729
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/GongRLHKKG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GongRLHKKG23
Yuan Gong, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass:
Contrastive Audio-Visual Masked Autoencoder. ICLR 2023
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001KKG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001KKG23
Yuan Gong, Sameer Khurana, Leonid Karlinsky, James R. Glass:
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers. INTERSPEECH 2023: 2798-2802
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10790
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10790
Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James R. Glass:
Listen, Think, and Understand. CoRR abs/2305.10790 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15225
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15225
Hongyin Luo, Yung-Sung Chuang, Yuan Gong, Tianhua Zhang, Yoon Kim, Xixin Wu, Danny Fox, Helen Meng, James R. Glass:
SAIL: Search-Augmented Instruction Learning. CoRR abs/2305.15225 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-03183
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-03183
Yuan Gong, Sameer Khurana, Leonid Karlinsky, James R. Glass:
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers. CoRR abs/2307.03183 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10814
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10814
Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Xixin Wu, Yoon Kim, Helen Meng, James R. Glass:
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning. CoRR abs/2309.10814 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-14405
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-14405
Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James R. Glass:
Joint Audio and Speech Understanding. CoRR abs/2309.14405 (2023)
2022
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/GongLRG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/GongLRG22
Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James R. Glass:
UAVM: Towards Unifying Audio and Visual Models. IEEE Signal Process. Lett. 29: 2437-2441 (2022)
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/GongLCG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/GongLCG22
Yuan Gong, Cheng-I Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. AAAI 2022: 10699-10709
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/DawalatabadGKAG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/DawalatabadGKAG22
Nauman Dawalatabad, Yuan Gong, Sameer Khurana, Rhoda Au, James R. Glass:
Detecting Dementia from Long Neuropsychological Interviews. EMNLP (Findings) 2022: 5270-5283
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GongYG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GongYG22
Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. ICASSP 2022: 151-155
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GongCCCG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GongCCCG22
Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James R. Glass:
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment. ICASSP 2022: 7262-7266
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-06760
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-06760
Yuan Gong, Sameer Khurana, Andrew Rouditchenko, James R. Glass:
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification. CoRR abs/2203.06760 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-03432
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-03432
Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James R. Glass:
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment. CoRR abs/2205.03432 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-03433
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-03433
Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. CoRR abs/2205.03433 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-00061
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-00061
Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James R. Glass:
UAVM: A Unified Model for Audio-Visual Learning. CoRR abs/2208.00061 (2022)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-07839
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-07839
Yuan Gong, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass:
Contrastive Audio-Visual Masked Autoencoder. CoRR abs/2210.07839 (2022)
2021
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GongCG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GongCG21
Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3292-3306 (2021)
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GongCG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GongCG21
Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. Interspeech 2021: 571-575
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-01243
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-01243
Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation. CoRR abs/2102.01243 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-01778
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-01778
Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. CoRR abs/2104.01778 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-09784
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-09784
Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. CoRR abs/2110.09784 (2021)
2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/GongYP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/GongYP20
Yuan Gong, Jian Yang, Christian Poellabauer:
Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method. IEEE Signal Process. Lett. 27: 920-924 (2020)
[d1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/GongYHMP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/GongYHMP20
Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer:
ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems. IEEE DataPort, 2020
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-08225
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-08225
Yuan Gong, Jian Yang, Christian Poellabauer:
Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method. CoRR abs/2003.08225 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/BryanGZP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/BryanGZP19
Bryan Bryan, Yuan Gong, Yizhe Zhang, Christian Poellabauer:
Second-Order Non-Local Attention Networks for Person Re-Identification. ICCV 2019: 3759-3768
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/GongLPS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/GongLPS19
Yuan Gong, Boyang Li, Christian Poellabauer, Yiyu Shi:
Real-Time Adversarial Attacks. IJCAI 2019: 4672-4680
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GongYHMP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GongYHMP19
Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer:
ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems. INTERSPEECH 2019: 2355-2359
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-03365
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-03365
Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer:
ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems. CoRR abs/1904.03365 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-13399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-13399
Yuan Gong, Boyang Li, Christian Poellabauer, Yiyu Shi:
Real-Time Adversarial Attacks. CoRR abs/1905.13399 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-00295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-00295
Bryan (Ning) Xia, Yuan Gong, Yizhe Zhang, Christian Poellabauer:
Second-order Non-local Attention Networks for Person Re-identification. CoRR abs/1909.00295 (2019)
2018
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/bcb/GongYPSL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bcb/GongYPSL18
Yuan Gong, Hasini Yatawatte, Christian Poellabauer, Sandra L. Schneider, Susan Latham:
Automatic Autism Spectrum Disorder Detection Using Everyday Vocalizations Captured by Smart Devices. BCB 2018: 465-473
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/bcb/GongSP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bcb/GongSP18
Yuan Gong, Kevin Shin, Christian Poellabauer:
Improving LIWC Using Soft Word Matching. BCB 2018: 523
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icccn/GongP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icccn/GongP18
Yuan Gong, Christian Poellabauer:
Protecting Voice Controlled Systems Using Sound Source Identification Based on Acoustic Cues. ICCCN 2018: 1-9
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GongP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GongP18
Yuan Gong, Christian Poellabauer:
Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models. INTERSPEECH 2018: 2698-2702
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-09156
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-09156
Yuan Gong, Christian Poellabauer:
An Overview of Vulnerabilities of Voice Controlled Systems. CoRR abs/1803.09156 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-10384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-10384
Yuan Gong, Christian Poellabauer:
Topic Modeling Based Multi-modal Depression Detection. CoRR abs/1803.10384 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-02939
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-02939
Yuan Gong, Christian Poellabauer:
Towards Learning Fine-Grained Disentangled Representations from Speech. CoRR abs/1808.02939 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-07018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-07018
Yuan Gong, Christian Poellabauer:
Protecting Voice Controlled Systems Using Sound Source Identification Based on Acoustic Cues. CoRR abs/1811.07018 (2018)
2017
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/ichi/GongP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ichi/GongP17
Yuan Gong, Christian Poellabauer:
Continuous Assessment of Children's Emotional States Using Acoustic Analysis. ICHI 2017: 171-178
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GongP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GongP17
Yuan Gong, Christian Poellabauer:
Topic Modeling Based Multi-modal Depression Detection. AVEC@ACM Multimedia 2017: 69-76
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-03280
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-03280
Yuan Gong, Christian Poellabauer:
Crafting Adversarial Examples For Speech Paralinguistics Applications. CoRR abs/1711.03280 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.