


default search action
Siddharth Gururani
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [c14]Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, João Felipe Santos, Shuqi Dai, Siddharth Gururani, Aya Aljafari, Alexander H. Liu, Kevin J. Shih, Ryan Prenger, Wei Ping, Chao-Han Huck Yang, Bryan Catanzaro:
Fugatto 1: Foundational Generative Audio Transformer Opus 1. ICLR 2025 - [i15]Niket Agarwal, Arslan Ali, Maciej Bala, Yogesh Balaji, Erik Barker, Tiffany Cai, Prithvijit Chattopadhyay, Yongxin Chen, Yin Cui, Yifan Ding, Daniel Dworakowski, Jiaojiao Fan, Michele Fenzi, Francesco Ferroni, Sanja Fidler, Dieter Fox, Songwei Ge, Yunhao Ge, Jinwei Gu, Siddharth Gururani, Ethan He, Jiahui Huang, Jacob Samuel Huffman, Pooya Jannaty, Jingyi Jin, Seung Wook Kim, Gergely Klár, Grace Lam, Shiyi Lan, Laura Leal-Taixé, Anqi Li, Zhaoshuo Li, Chen-Hsuan Lin, Tsung-Yi Lin, Huan Ling, Ming-Yu Liu, Xian Liu, Alice Luo, Qianli Ma, Hanzi Mao, Kaichun Mo, Arsalan Mousavian, Seungjun Nah, Sriharsha Niverty, David Page, Despoina Paschalidou, Zeeshan Patel, Lindsey Pavao, Morteza Ramezanali, Fitsum Reda, Xiaowei Ren, Vasanth Rao Naik Sabavat, Ed Schmerling, Stella Shi, Bartosz Stefaniak, Shitao Tang, Lyne Tchapmi, Przemek Tredak, Wei-Cheng Tseng, Jibin Varghese, Hao Wang, Haoxiang Wang, Heng Wang, Ting-Chun Wang, Fangyin Wei, Xinyue Wei, Jay Zhangjie Wu, Jiashu Xu, Wei Yang, Lin Yen-Chen, Xiaohui Zeng, Yu Zeng, Jing Zhang, Qinsheng Zhang, Yuxuan Zhang, Qingqing Zhao, Artur Zólkowski:
Cosmos World Foundation Model Platform for Physical AI. CoRR abs/2501.03575 (2025) - [i14]Alisson G. Azzolini, Hannah Brandon, Prithvijit Chattopadhyay, Huayu Chen, Jinju Chu, Yin Cui, Jenna Diamond, Yifan Ding, Francesco Ferroni, Rama Govindaraju, Jinwei Gu, Siddharth Gururani, Imad El Hanafi, Zekun Hao, Jacob Samuel Huffman, Jingyi Jin, Brendan Johnson, Rizwan Khan, George Kurian, Elena Lantz, Nayeon Lee, Zhaoshuo Li, Xuan Li, Tsung-Yi Lin, Yen-Chen Lin, Ming-Yu Liu, Alice Luo, Andrew Mathau, Yun Ni, Lindsey Pavao, Wei Ping, David W. Romero, Misha Smelyanskiy, Shuran Song, Lyne Tchapmi, Andrew Z. Wang, Boxin Wang, Haoxiang Wang, Fangyin Wei, Jiashu Xu, Yao Xu, Xiaodong Yang, Zhuolin Yang, Xiaohui Zeng, Zhe Zhang:
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning. CoRR abs/2503.15558 (2025) - 2024
- [c13]Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli Shama Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue:
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion. ICML 2024 - [c12]Shuqi Dai
, Ming-Yu Liu
, Rafael Valle
, Siddharth Gururani
:
ExpressiveSinger: Multilingual and Multi-Style Score-based Singing Voice Synthesis with Expressive Performance Control. ACM Multimedia 2024: 3229-3238 - [i13]Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli Shama Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue:
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion. CoRR abs/2402.14285 (2024) - [i12]Yuval Atzmon, Maciej Bala, Yogesh Balaji, Tiffany Cai, Yin Cui, Jiaojiao Fan, Yunhao Ge, Siddharth Gururani, Jacob Samuel Huffman, Ronald Isaac, Pooya Jannaty, Tero Karras, Grace Lam, J. P. Lewis, Aaron Licata, Yen-Chen Lin, Ming-Yu Liu, Qianli Ma, Arun Mallya, Ashlee Martino-Tarr, Doug Mendez, Seungjun Nah, Chris Pruett, Fitsum Reda, Jiaming Song, Ting-Chun Wang, Fangyin Wei, Xiaohui Zeng, Yu Zeng, Qinsheng Zhang:
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models. CoRR abs/2411.07126 (2024) - 2023
- [c11]Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu:
SPACE: Speech-driven Portrait Animation with Controllable Expression. ICCV 2023: 20857-20866 - [c10]Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech. INTERSPEECH 2023: 626-630 - [i11]Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
Multilingual Multiaccented Multispeaker TTS with RADTTS. CoRR abs/2301.10335 (2023) - 2022
- [i10]Vinod Subramanian, Siddharth Gururani, Emmanouil Benetos
, Mark B. Sandler:
Anomalous behaviour in loss-gradient based interpretability methods. CoRR abs/2207.07769 (2022) - [i9]Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu:
SPACEx: Speech-driven Portrait Animation with Controllable Expression. CoRR abs/2211.09809 (2022) - 2021
- [b1]Siddharth Kumar Gururani:
Weakly Supervised Learning for Musical Instrument Classification. Georgia Institute of Technology, Atlanta, GA, USA, 2021 - [c9]Siddharth Gururani, Alexander Lerch
:
Semi-Supervised Audio Classification with Partially Labeled Data. ISM 2021: 111-114 - [i8]Alexander Lerch, Claire Arthur
, Ashis Pati, Siddharth Gururani:
An Interdisciplinary Review of Music Performance Analysis. CoRR abs/2104.09018 (2021) - [i7]Siddharth Gururani, Alexander Lerch:
Semi-Supervised Audio Classification with Partially Labeled Data. CoRR abs/2111.12761 (2021) - 2020
- [j1]Alexander Lerch
, Claire Arthur
, Ashis Pati
, Siddharth Gururani:
An Interdisciplinary Review of Music Performance Analysis. Trans. Int. Soc. Music. Inf. Retr. 3(1): 221-245 (2020) - [c8]Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch:
dMelodies: A Music Dataset for Disentanglement Learning. ISMIR 2020: 125-133 - [c7]Jiawen Huang, Yun-Ning Hung, Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch:
Score-informed Networks for Music Performance Assessment. ISMIR 2020: 908-915 - [i6]Karn Watcharasupat, Siddharth Gururani, Alexander Lerch:
Visual Attention for Musical Instrument Recognition. CoRR abs/2006.09640 (2020) - [i5]Ashis Pati
, Siddharth Gururani, Alexander Lerch
:
dMelodies: A Music Dataset for Disentanglement Learning. CoRR abs/2007.15067 (2020) - [i4]Jiawen Huang, Yun-Ning Hung, Ashis Pati
, Siddharth Kumar Gururani, Alexander Lerch
:
Score-informed Networks for Music Performance Assessment. CoRR abs/2008.00203 (2020)
2010 – 2019
- 2019
- [c6]Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani:
Music Performance Analysis: A Survey. ISMIR 2019: 33-43 - [c5]Siddharth Gururani, Mohit Sharma, Alexander Lerch:
An Attention Mechanism for Musical Instrument Recognition. ISMIR 2019: 83-90 - [i3]Alexander Lerch, Claire Arthur, Kumar Ashis Pati
, Siddharth Gururani:
Music Performance Analysis: A Survey. CoRR abs/1907.00178 (2019) - [i2]Siddharth Gururani, Mohit Sharma, Alexander Lerch:
An Attention Mechanism for Musical Instrument Recognition. CoRR abs/1907.04294 (2019) - [i1]Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto:
Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features. CoRR abs/1911.09645 (2019) - 2018
- [c4]Siddharth Gururani, Cameron Summers, Alexander Lerch:
Instrument Activity Detection in Polyphonic Music using Deep Neural Networks. ISMIR 2018: 569-576 - 2017
- [c3]Siddharth Gururani, Alexander Lerch:
Automatic Sample Detection in Polyphonic Music. ISMIR 2017: 264-271 - [c2]Amruta Vidwans, Siddharth Gururani, Chih-Wei Wu, Vinod Subramanian, Rupak Swaminathan, Alexander Lerch:
Objective Descriptors for the Assessment of Student Music Performances. Semantic Audio 2017 - 2016
- [c1]R. Michael Winters, Siddharth Gururani, Alexander Lerch:
Automatic Practice Logging: Introduction, Dataset & Preliminary Study. ISMIR 2016: 598-604
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-05-17 01:38 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint