Kunal Dhawan

2020 – today

2024
[c4]
persistent URL: https://dblp.org/rec/conf/icassp/ParkDKB24
Taejin Park, Kunal Dhawan, Nithin Rao Koluguri, Jagadeesh Balam:
Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach. ICASSP 2024: 10861-10865
[c3]
persistent URL: https://dblp.org/rec/conf/icassp/PuvvadaKDBG24
Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg:
Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition. ICASSP 2024: 12111-12115
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-19674
Krishna C. Puvvada, Piotr Zelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg:
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data. CoRR abs/2406.19674 (2024)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-03495
Kunal Dhawan, Nithin Rao Koluguri, Ante Jukic, Ryan Langman, Jagadeesh Balam, Boris Ginsburg:
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations. CoRR abs/2407.03495 (2024)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-13106
He Huang, Taejin Park, Kunal Dhawan, Ivan Medennikov, Krishna C. Puvvada, Nithin Rao Koluguri, Weiqing Wang, Jagadeesh Balam, Boris Ginsburg:
NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks. CoRR abs/2408.13106 (2024)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-01438
Weiqing Wang, Kunal Dhawan, Taejin Park, Krishna C. Puvvada, Ivan Medennikov, Somshubra Majumdar, He Huang, Jagadeesh Balam, Boris Ginsburg:
Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR. CoRR abs/2409.01438 (2024)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-06656
Taejin Park, Ivan Medennikov, Kunal Dhawan, Weiqing Wang, He Huang, Nithin Rao Koluguri, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg:
Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens. CoRR abs/2409.06656 (2024)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-09785
Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition. CoRR abs/2409.09785 (2024)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-12352
Jinhan Wang, Weiqing Wang, Kunal Dhawan, Taejin Park, Myungjong Kim, Ivan Medennikov, He Huang, Nithin Rao Koluguri, Jagadeesh Balam, Boris Ginsburg:
META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR. CoRR abs/2409.12352 (2024)
2023
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-08753
Kunal Dhawan, Dima Rekesh, Boris Ginsburg:
Towards training Bilingual and Code-Switched Speech Recognition models from Monolingual data sources. CoRR abs/2306.08753 (2023)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-05248
Taejin Park, Kunal Dhawan, Nithin Rao Koluguri, Jagadeesh Balam:
Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach. CoRR abs/2309.05248 (2023)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-10922
Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg:
Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition. CoRR abs/2309.10922 (2023)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-12371
Taejin Park, He Huang, Coleman Hooper, Nithin Rao Koluguri, Kunal Dhawan, Ante Jukic, Jagadeesh Balam, Boris Ginsburg:
Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation. CoRR abs/2310.12371 (2023)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-12378
Taejin Park, He Huang, Ante Jukic, Kunal Dhawan, Krishna C. Puvvada, Nithin Rao Koluguri, Nikolay Karpov, Aleksandr Laptev, Jagadeesh Balam, Boris Ginsburg:
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System. CoRR abs/2310.12378 (2023)
2021
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-14796
Rahul Sharma, Kunal Dhawan, Balakrishna Pailla:
Phonetic Word Embeddings. CoRR abs/2109.14796 (2021)
2020
[j2]
persistent URL: https://dblp.org/rec/journals/csl/GanjiDS20
Sreeram Ganji, Kunal Dhawan, Rohit Sinha:
Novel textual features for language modeling of intra-sentential code-switching data. Comput. Speech Lang. 64: 101099 (2020)
[c2]
persistent URL: https://dblp.org/rec/conf/ncc/DhawanSP020
Kunal Dhawan, Ganji Sreeram, Kumar Priyadarshi, Rohit Sinha:
Investigating Target Set Reduction for End-to-End Speech Recognition of Hindi-English Code-Switching Data. NCC 2020: 1-5
[c1]
persistent URL: https://dblp.org/rec/conf/spcom/SreeramDP020
Ganji Sreeram, Kunal Dhawan, Kumar Priyadarshi, Rohit Sinha:
Joint Language Identification of Code-Switching Speech using Attention-based E2E Network. SPCOM 2020: 1-5

2010 – 2019

2019
[j1]
persistent URL: https://dblp.org/rec/journals/speech/SreeramDS19
Ganji Sreeram, Kunal Dhawan, Rohit Sinha:
IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition. Speech Commun. 110: 76-89 (2019)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-1907-06342
Sreeram Ganji, Kunal Dhawan, Kumar Priyadarshi, Rohit Sinha:
Joint Language Identification of Code-Switching Speech using Attention based E2E Network. CoRR abs/1907.06342 (2019)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-1907-06859
Kunal Dhawan, Colin Vaz, Ruchir Travadi, Shrikanth S. Narayanan:
Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features. CoRR abs/1907.06859 (2019)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-1907-08293
Kunal Dhawan, Ganji Sreeram, Kumar Priyadarshi, Rohit Sinha:
Investigating Target Set Reduction for End-to-End Speech Recognition of Hindi-English Code-Switching Data. CoRR abs/1907.08293 (2019)
2018
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-1810-00662
Ganji Sreeram, Kunal Dhawan, Rohit Sinha:
Hindi-English Code-Switching Speech Corpus. CoRR abs/1810.00662 (2018)
