


default search action
Rithesh Kumar
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c8]Stephen Brade
, Sam Anderson
, Rithesh Kumar
, Zeyu Jin
, Anh Truong
:
SpeakEasy: Enhancing Text-to-Speech Interactions for Expressive Content Creation. CHI 2025: 756:1-756:19
[c7]Yinghao Aaron Li, Rithesh Kumar, Zeyu Jin:
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis. ICML 2025
[c6]Yunyun Wang, Jiaqi Su, Adam Finkelstein, Rithesh Kumar, Ke Chen, Zeyu Jin:
DiTVC: One-Shot Voice Conversion via Diffusion Transformer with Environment and Speaking Rate Cloning. WASPAA 2025: 1-5
[i14]Stephen Brade, Sam Anderson, Rithesh Kumar, Zeyu Jin, Anh Truong:
SpeakEasy: Enhancing Text-to-Speech Interactions for Expressive Content Creation. CoRR abs/2504.05106 (2025)
[i13]Heitor R. Guimarães, Jiaqi Su, Rithesh Kumar, Tiago H. Falk, Zeyu Jin:
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers. CoRR abs/2504.09381 (2025)
[i12]Justin Lovelace, Rithesh Kumar, Jiaqi Su, Ke Chen, Kilian Q. Weinberger, Zeyu Jin:
SpeechOp: Inference-Time Task Composition for Generative Speech Processing. CoRR abs/2509.14298 (2025)
[i11]Yutong Wen, Ke Chen, Prem Seetharaman, Oriol Nieto, Jiaqi Su, Rithesh Kumar, Minje Kim, Paris Smaragdis, Zeyu Jin, Justin Salamon:
PromptSep: Generative Audio Separation via Multimodal Prompting. CoRR abs/2511.04623 (2025)- 2024
[i10]Yingahao Aaron Li, Rithesh Kumar, Zeyu Jin:
DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization. CoRR abs/2410.11097 (2024)- 2023
[c5]Hugo Flores García, Prem Seetharaman, Rithesh Kumar, Bryan Pardo:
VampNet: Music Generation via Masked Acoustic Token Modeling. ISMIR 2023: 359-366
[c4]Rithesh Kumar, Prem Seetharaman, Alejandro Luebs, Ishaan Kumar, Kundan Kumar:
High-Fidelity Audio Compression with Improved RVQGAN. NeurIPS 2023
[i9]Rithesh Kumar, Prem Seetharaman, Alejandro Luebs, Ishaan Kumar, Kundan Kumar:
High-Fidelity Audio Compression with Improved RVQGAN. CoRR abs/2306.06546 (2023)
[i8]Hugo Flores García, Prem Seetharaman, Rithesh Kumar, Bryan Pardo:
VampNet: Music Generation via Masked Acoustic Token Modeling. CoRR abs/2307.04686 (2023)- 2022
[c3]Max Morrison, Rithesh Kumar, Kundan Kumar, Prem Seetharaman, Aaron C. Courville, Yoshua Bengio:
Chunked Autoregressive GAN for Conditional Waveform Synthesis. ICLR 2022- 2021
[i7]Max Morrison, Rithesh Kumar, Kundan Kumar, Prem Seetharaman, Aaron C. Courville, Yoshua Bengio:
Chunked Autoregressive GAN for Conditional Waveform Synthesis. CoRR abs/2110.10139 (2021)- 2020
[i6]Rithesh Kumar, Kundan Kumar, Vicki Anand, Yoshua Bengio, Aaron C. Courville:
NU-GAN: High resolution neural upsampling with GAN. CoRR abs/2010.11362 (2020)
2010 – 2019
- 2019
[c2]Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brébisson, Yoshua Bengio, Aaron C. Courville:
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis. NeurIPS 2019: 14881-14892
[i5]Rithesh Kumar, Anirudh Goyal, Aaron C. Courville, Yoshua Bengio:
Maximum Entropy Generators for Energy-Based Models. CoRR abs/1901.08508 (2019)
[i4]Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brébisson, Yoshua Bengio, Aaron C. Courville:
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis. CoRR abs/1910.06711 (2019)- 2018
[i3]Rithesh Kumar, Jose Sotelo, Kundan Kumar, Alexandre de Brébisson, Yoshua Bengio:
ObamaNet: Photo-realistic lip-sync from text. CoRR abs/1801.01442 (2018)
[i2]Kyle Kastner, Rithesh Kumar, Tim Cooijmans, Aaron C. Courville:
Harmonic Recomposition using Conditional Autoregressive Modeling. CoRR abs/1811.07426 (2018)- 2017
[c1]Soroush Mehri, Kundan Kumar, Ishaan Gulrajani, Rithesh Kumar, Shubham Jain, Jose Sotelo, Aaron C. Courville, Yoshua Bengio:
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model. ICLR (Poster) 2017- 2016
[i1]Soroush Mehri, Kundan Kumar, Ishaan Gulrajani, Rithesh Kumar, Shubham Jain, Jose Sotelo, Aaron C. Courville, Yoshua Bengio:
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model. CoRR abs/1612.07837 (2016)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-02-06 00:54 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







