default search action
Jaesung Huh
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j2]Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Brown, Jee-weon Jung, Daniel Garcia-Romero, Andrew Zisserman:
The VoxCeleb Speaker Recognition Challenge: A Retrospective. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3850-3866 (2024) - [c15]Jacob Chalk, Jaesung Huh, Evangelos Kazakos, Andrew Zisserman, Dima Damen:
TIM: A Time Interval Machine for Audio-Visual Action Recognition. CVPR 2024: 18153-18163 - [c14]Bruno Korbar, Jaesung Huh, Andrew Zisserman:
Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling. ICASSP 2024: 2975-2979 - [i21]Bruno Korbar, Jaesung Huh, Andrew Zisserman:
Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling. CoRR abs/2401.12039 (2024) - [i20]Jacob Chalk, Jaesung Huh, Evangelos Kazakos, Andrew Zisserman, Dima Damen:
TIM: A Time Interval Machine for Audio-Visual Action Recognition. CoRR abs/2404.05559 (2024) - [i19]Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Brown, Jee-weon Jung, Daniel Garcia-Romero, Andrew Zisserman:
The VoxCeleb Speaker Recognition Challenge: A Retrospective. CoRR abs/2408.14886 (2024) - 2023
- [c13]Jaesung Huh, Jacob Chalk, Evangelos Kazakos, Dima Damen, Andrew Zisserman:
Epic-Sounds: A Large-Scale Dataset of Actions that Sound. ICASSP 2023: 1-5 - [c12]Jee-Weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesung Huh, Andrew Brown, Youngki Kwon, Shinji Watanabe, Joon Son Chung:
In Search of Strong Embedding Extractors for Speaker Diarisation. ICASSP 2023: 1-5 - [c11]Max Bain, Jaesung Huh, Tengda Han, Andrew Zisserman:
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio. INTERSPEECH 2023: 4489-4493 - [c10]Kihyun Nam, Youkyum Kim, Jaesung Huh, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung:
Disentangled Representation Learning for Multilingual Speaker Recognition. INTERSPEECH 2023: 5316-5320 - [i18]Jaesung Huh, Jacob Chalk, Evangelos Kazakos, Dima Damen, Andrew Zisserman:
Epic-Sounds: A Large-scale Dataset of Actions That Sound. CoRR abs/2302.00646 (2023) - [i17]Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman:
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge. CoRR abs/2302.10248 (2023) - [i16]Max Bain, Jaesung Huh, Tengda Han, Andrew Zisserman:
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio. CoRR abs/2303.00747 (2023) - [i15]Jaesung Huh, Max Bain, Andrew Zisserman:
OxfordVGG Submission to the EGO4D AV Transcription Challenge. CoRR abs/2307.09006 (2023) - 2022
- [j1]Jingu Kang, Jaesung Huh, Hee Soo Heo, Joon Son Chung:
Augmentation Adversarial Training for Self-Supervised Speaker Representation Learning. IEEE J. Sel. Top. Signal Process. 16(6): 1253-1262 (2022) - [i14]Andrew Brown, Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Zisserman:
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge. CoRR abs/2201.04583 (2022) - [i13]Jee-weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesung Huh, Andrew Brown, Youngki Kwon, Shinji Watanabe, Joon Son Chung:
In search of strong embedding extractors for speaker diarisation. CoRR abs/2210.14682 (2022) - 2021
- [c9]Evangelos Kazakos, Jaesung Huh, Arsha Nagrani, Andrew Zisserman, Dima Damen:
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition. BMVC 2021: 268 - [c8]Andrew Brown, Jaesung Huh, Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
Playing a Part: Speaker Verification at the movies. ICASSP 2021: 6174-6178 - [c7]Jaesung Huh, Minjae Lee, Heesoo Heo, Seongkyu Mun, Joon Son Chung:
Metric Learning for Keyword Spotting. SLT 2021: 133-140 - [c6]Youngki Kwon, Hee Soo Heo, Jaesung Huh, Bong-Jin Lee, Joon Son Chung:
Look Who's Not Talking. SLT 2021: 567-573 - [i12]Evangelos Kazakos, Jaesung Huh, Arsha Nagrani, Andrew Zisserman, Dima Damen:
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition. CoRR abs/2111.01024 (2021) - 2020
- [c5]Seongkyu Mun, Soyeon Choe, Jaesung Huh, Joon Son Chung:
The Sound of My Voice: Speaker Representation Loss for Target Voice Separation. ICASSP 2020: 7289-7293 - [c4]Joon Son Chung, Jaesung Huh, Arsha Nagrani, Triantafyllos Afouras, Andrew Zisserman:
Spot the Conversation: Speaker Diarisation in the Wild. INTERSPEECH 2020: 299-303 - [c3]Joon Son Chung, Jaesung Huh, Seongkyu Mun, Minjae Lee, Hee-Soo Heo, Soyeon Choe, Chiheon Ham, Sunghwan Jung, Bong-Jin Lee, Icksang Han:
In Defence of Metric Learning for Speaker Recognition. INTERSPEECH 2020: 2977-2981 - [c2]Joon Son Chung, Jaesung Huh, Seongkyu Mun:
Delving into VoxCeleb: Environment Invariant Speaker Recognition. Odyssey 2020: 349-356 - [i11]Jaesung Huh, Egil Martinsson, Adrian Kim, Jung-Woo Ha:
Modeling Musical Onset Probabilities via Neural Distribution Learning. CoRR abs/2002.03559 (2020) - [i10]Joon Son Chung, Jaesung Huh, Seongkyu Mun, Minjae Lee, Hee Soo Heo, Soyeon Choe, Chiheon Ham, Sunghwan Jung, Bong-Jin Lee, Icksang Han:
In defence of metric learning for speaker recognition. CoRR abs/2003.11982 (2020) - [i9]Jaesung Huh, Minjae Lee, Heesoo Heo, Seongkyu Mun, Joon Son Chung:
Metric Learning for Keyword Spotting. CoRR abs/2005.08776 (2020) - [i8]Joon Son Chung, Jaesung Huh, Arsha Nagrani, Triantafyllos Afouras, Andrew Zisserman:
Spot the conversation: speaker diarisation in the wild. CoRR abs/2007.01216 (2020) - [i7]Jaesung Huh, Hee Soo Heo, Jingu Kang, Shinji Watanabe, Joon Son Chung:
Augmentation adversarial training for unsupervised speaker recognition. CoRR abs/2007.12085 (2020) - [i6]Hee Soo Heo, Bong-Jin Lee, Jaesung Huh, Joon Son Chung:
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020. CoRR abs/2009.14153 (2020) - [i5]Andrew Brown, Jaesung Huh, Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
Playing a Part: Speaker Verification at the Movies. CoRR abs/2010.15716 (2020) - [i4]Youngki Kwon, Hee Soo Heo, Jaesung Huh, Bong-Jin Lee, Joon Son Chung:
Look who's not talking. CoRR abs/2011.14885 (2020) - [i3]Arsha Nagrani, Joon Son Chung, Jaesung Huh, Andrew Brown, Ernesto Coto, Weidi Xie, Mitchell McLaren, Douglas A. Reynolds, Andrew Zisserman:
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge. CoRR abs/2012.06867 (2020)
2010 – 2019
- 2019
- [c1]Hyeong-Seok Choi, Jang-Hyun Kim, Jaesung Huh, Adrian Kim, Jung-Woo Ha, Kyogu Lee:
Phase-Aware Speech Enhancement with Deep Complex U-Net. ICLR (Poster) 2019 - [i2]Joon Son Chung, Jaesung Huh, Seongkyu Mun:
Delving into VoxCeleb: environment invariant speaker recognition. CoRR abs/1910.11238 (2019) - [i1]Seongkyu Mun, Soyeon Choe, Jaesung Huh, Joon Son Chung:
The sound of my voice: speaker representation loss for target voice separation. CoRR abs/1911.02411 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 02:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint