default search action
Prashanth Gurunath Shivakumar
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c15]Aditya Gourav, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Grant P. Strimel, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Multi-Modal Retrieval For Large Language Model Based Speech Recognition. ACL (Findings) 2024: 4435-4446 - [c14]Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-Yi Lee, Ivan Bulyko:
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue. ICASSP 2024: 10316-10320 - [c13]Kevin Everson, Yile Gu, Chao-Han Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-Yi Lee, Ariya Rastrow, Andreas Stolcke:
Towards ASR Robust Spoken Language Understanding Through in-Context Learning with Word Confusion Networks. ICASSP 2024: 12856-12860 - [i17]Kevin Everson, Yile Gu, Chao-Han Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-yi Lee, Ariya Rastrow, Andreas Stolcke:
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks. CoRR abs/2401.02921 (2024) - [i16]Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth Gurunath Shivakumar, Ankur Gandhe, Ariya Rastrow, Jia Xu, Ivan Bulyko, Andreas Stolcke:
Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition. CoRR abs/2401.10447 (2024) - [i15]Jari Kolehmainen, Aditya Gourav, Prashanth Gurunath Shivakumar, Yile Gu, Ankur Gandhe, Ariya Rastrow, Grant P. Strimel, Ivan Bulyko:
Multi-Modal Retrieval For Large Language Model Based Speech Recognition. CoRR abs/2406.09618 (2024) - [i14]Prashanth Gurunath Shivakumar, Jari Kolehmainen, Aditya Gourav, Yi Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Speech Recognition Rescoring with Large Speech-Text Foundation Models. CoRR abs/2409.16654 (2024) - [i13]Guan-Ting Lin, Prashanth Gurunath Shivakumar, Aditya Gourav, Yile Gu, Ankur Gandhe, Hung-yi Lee, Ivan Bulyko:
Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback. CoRR abs/2411.01834 (2024) - 2023
- [c12]Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Discriminative Speech Recognition Rescoring With Pre-Trained Language Models. ASRU 2023: 1-7 - [c11]Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko:
Low-Rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. ASRU 2023: 1-8 - [c10]Jari Kolehmainen, Yile Gu, Aditya Gourav, Prashanth Gurunath Shivakumar, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Personalization for BERT-based Discriminative Speech Recognition Rescoring. INTERSPEECH 2023: 366-370 - [c9]Yile Gu, Prashanth Gurunath Shivakumar, Jari Kolehmainen, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Scaling Laws for Discriminative Speech Recognition Rescoring Models. INTERSPEECH 2023: 471-475 - [c8]Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Distillation Strategies for Discriminative Speech Recognition Rescoring. INTERSPEECH 2023: 4084-4088 - [i12]Jari Kolehmainen, Yile Gu, Aditya Gourav, Prashanth Gurunath Shivakumar, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko:
Personalization for BERT-based Discriminative Speech Recognition Rescoring. CoRR abs/2307.06832 (2023) - [i11]Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko:
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. CoRR abs/2309.15223 (2023) - [i10]Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko:
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue. CoRR abs/2312.15316 (2023) - 2022
- [j3]Prashanth Gurunath Shivakumar, Shrikanth Narayanan:
End-to-end neural systems for automatic children speech recognition: An empirical study. Comput. Speech Lang. 72: 101289 (2022) - 2021
- [c7]Prashanth Gurunath Shivakumar, Naveen Kumar, Panayiotis G. Georgiou, Shrikanth Narayanan:
RNN Based Incremental Online Spoken Language Understanding. SLT 2021: 989-996 - [i9]Prashanth Gurunath Shivakumar, Panayiotis G. Georgiou, Shrikanth Narayanan:
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords. CoRR abs/2102.02270 (2021) - [i8]Prashanth Gurunath Shivakumar, Shrikanth Narayanan:
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study. CoRR abs/2102.09918 (2021) - [i7]Prashanth Gurunath Shivakumar, Somer Bishop, Catherine Lord, Shrikanth Narayanan:
Phone Duration Modeling for Speaker Age Estimation in Children. CoRR abs/2109.01568 (2021) - 2020
- [j2]Prashanth Gurunath Shivakumar, Panayiotis G. Georgiou:
Transfer learning from adult to children for speech recognition: Evaluation, analysis and recommendations. Comput. Speech Lang. 63: 101077 (2020)
2010 – 2019
- 2019
- [j1]Prashanth Gurunath Shivakumar, Panayiotis G. Georgiou:
Confusion2Vec: towards enriching vector space word representations with representational ambiguities. PeerJ Comput. Sci. 5: e195 (2019) - [c6]Prashanth Gurunath Shivakumar, Mu Yang, Panayiotis G. Georgiou:
Spoken Language Intent Detection Using Confusion2Vec. INTERSPEECH 2019: 819-823 - [i6]Prashanth Gurunath Shivakumar, Mu Yang, Panayiotis G. Georgiou:
Spoken Language Intent Detection using Confusion2Vec. CoRR abs/1904.03576 (2019) - [i5]Prashanth Gurunath Shivakumar, Shao-Yen Tseng, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Behavior Gated Language Models. CoRR abs/1909.00107 (2019) - [i4]Prashanth Gurunath Shivakumar, Naveen Kumar, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Incremental Online Spoken Language Understanding. CoRR abs/1910.10287 (2019) - 2018
- [i3]Prashanth Gurunath Shivakumar, Haoqi Li, Kevin Knight, Panayiotis G. Georgiou:
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling. CoRR abs/1802.02607 (2018) - [i2]Prashanth Gurunath Shivakumar, Panayiotis G. Georgiou:
Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations. CoRR abs/1805.03322 (2018) - [i1]Prashanth Gurunath Shivakumar, Panayiotis G. Georgiou:
Confusion2Vec: Towards Enriching Vector Space Word Representations with Representational Ambiguities. CoRR abs/1811.03199 (2018) - 2016
- [c5]Prashanth Gurunath Shivakumar, Sandeep Nallan Chakravarthula, Panayiotis G. Georgiou:
Multimodal Fusion of Multirate Acoustic, Prosodic, and Lexical Speaker Characteristics for Native Language Identification. INTERSPEECH 2016: 2408-2412 - [c4]Prashanth Gurunath Shivakumar, Panayiotis G. Georgiou:
Perception Optimized Deep Denoising AutoEncoders for Speech Enhancement. INTERSPEECH 2016: 3743-3747 - [c3]Md. Nasir, Arindam Jati, Prashanth Gurunath Shivakumar, Sandeep Nallan Chakravarthula, Panayiotis G. Georgiou:
Multimodal and Multiresolution Depression Detection from Speech and Facial Landmark Features. AVEC@ACM Multimedia 2016: 43-50 - 2014
- [c2]Prashanth Gurunath Shivakumar, Ming Li, Vedant Dhandhania, Shrikanth S. Narayanan:
Simplified and supervised i-vector modeling for speaker age regression. ICASSP 2014: 4833-4837 - [c1]Prashanth Gurunath Shivakumar, Alexandros Potamianos, Sungbok Lee, Shrikanth S. Narayanan:
Improving speech recognition for children using acoustic adaptation and pronunciation modeling. WOCCI 2014: 15-19
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-09 13:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint