default search action

combined dblp search
author search
venue search
publication search

ask others

Seung-Bin Kim

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taffco/ChoOKL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taffco/ChoOKL25
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee:
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech Via Emotion-Adaptive Spherical Vector. IEEE Trans. Affect. Comput. 16(3): 2365-2380 (2025)
[j2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tnn/LeeCKL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/LeeCKL25
Sang-Hoon Lee, Ha-Yeong Choi, Seung-Bin Kim, Seong-Whan Lee:
HierSpeech++: Bridging the Gap Between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-Shot Speech Synthesis. IEEE Trans. Neural Networks Learn. Syst. 36(10): 18422-18436 (2025)
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/KimCOCL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KimCOCL25
Seung-Bin Kim, Junhyeok Cha, Hyung-Seok Oh, Heejin Choi, Seong-Whan Lee:
FillerSpeech: Towards Human-Like Text-to-Speech Synthesis with Filler Insertion and Filler Style Control. EMNLP 2025: 34108-34125
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChaKOL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChaKOL25
Junhyeok Cha, Seung-Bin Kim, Hyung-Seok Oh, Seong-Whan Lee:
JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis. ICASSP 2025: 1-5
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YunKL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YunKL25
Jun-Hak Yun, Seung-Bin Kim, Seong-Whan Lee:
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching. ICASSP 2025: 1-5
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChoOKL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChoOKL25
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee:
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech. INTERSPEECH 2025
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChoOKL25a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChoOKL25a
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee:
EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification. INTERSPEECH 2025
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimCKL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimCKL25
Nam-Gyu Kim, Deok-Hyeon Cho, Seung-Bin Kim, Seong-Whan Lee:
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech. INTERSPEECH 2025
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-04904
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-04904
Junhyeok Cha, Seung-Bin Kim, Hyung-Seok Oh, Seong-Whan Lee:
JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis. CoRR abs/2501.04904 (2025)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-04926
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-04926
Jun-Hak Yun, Seung-Bin Kim, Seong-Whan Lee:
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching. CoRR abs/2501.04926 (2025)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-19687
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-19687
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee:
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech. CoRR abs/2505.19687 (2025)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-19693
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-19693
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee:
EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification. CoRR abs/2505.19693 (2025)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-20868
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-20868
Nam-Gyu Kim, Deok-Hyeon Cho, Seung-Bin Kim, Seong-Whan Lee:
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech. CoRR abs/2505.20868 (2025)
2024
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/KimLCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KimLCL24
Seung-Bin Kim, Sang-Hoon Lee, Ha-Yeong Choi, Seong-Whan Lee:
Audio Super-Resolution With Robust Speech Representation Learning of Masked Autoencoder. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1012-1022 (2024)
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimLL24
Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee:
TranSentence: speech-to-speech Translation via Language-Agnostic Sentence-Level Speech Encoding without Language-Parallel Data. ICASSP 2024: 12722-12726
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChoOKLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChoOKLL24
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee:
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech. INTERSPEECH 2024
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/smc/LeeKCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/smc/LeeKCL24
Ji-Eun Lee, Seung-Bin Kim, Deok-Hyeon Cho, Seong-Whan Lee:
PromotiCon: Prompt-based Emotion Controllable Text-to-Speech via Prompt Generation and Matching. SMC 2024: 1151-1156
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-12992
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-12992
Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee:
TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data. CoRR abs/2401.12992 (2024)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07803
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07803
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee:
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech. CoRR abs/2406.07803 (2024)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-02625
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-02625
Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee:
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector. CoRR abs/2411.02625 (2024)
2023
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-12454
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-12454
Sang-Hoon Lee, Ha-Yeong Choi, Seung-Bin Kim, Seong-Whan Lee:
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis. CoRR abs/2311.12454 (2023)
2022
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ImLKL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ImLKL22
Chae-Bin Im, Sang-Hoon Lee, Seung-Bin Kim, Seong-Whan Lee:
EMOQ-TTS: Emotion Intensity Quantization for Fine-Grained Controllable Emotional Text-to-Speech. ICASSP 2022: 6317-6321
[c1]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LeeKLSHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeeKLSHL22
Sang-Hoon Lee, Seung-Bin Kim, Ji-Hyun Lee, Eunwoo Song, Min-Jae Hwang, Seong-Whan Lee:
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis. NeurIPS 2022

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.