default search action

combined dblp search
author search
venue search
publication search

ask others

Siddharth Gururani

Siddharth Kumar Gururani

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c14]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ValleBKLG0SDGAL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ValleBKLG0SDGAL25
Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, João Felipe Santos, Shuqi Dai, Siddharth Gururani, Aya Aljafari, Alexander H. Liu, Kevin J. Shih, Ryan Prenger, Wei Ping, Chao-Han Huck Yang, Bryan Catanzaro:
Fugatto 1: Foundational Generative Audio Transformer Opus 1. ICLR 2025
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-03575
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-03575
Niket Agarwal, Arslan Ali, Maciej Bala, Yogesh Balaji, Erik Barker, Tiffany Cai, Prithvijit Chattopadhyay, Yongxin Chen, Yin Cui, Yifan Ding, Daniel Dworakowski, Jiaojiao Fan, Michele Fenzi, Francesco Ferroni, Sanja Fidler, Dieter Fox, Songwei Ge, Yunhao Ge, Jinwei Gu, Siddharth Gururani, Ethan He, Jiahui Huang, Jacob Samuel Huffman, Pooya Jannaty, Jingyi Jin, Seung Wook Kim, Gergely Klár, Grace Lam, Shiyi Lan, Laura Leal-Taixé, Anqi Li, Zhaoshuo Li, Chen-Hsuan Lin, Tsung-Yi Lin, Huan Ling, Ming-Yu Liu, Xian Liu, Alice Luo, Qianli Ma, Hanzi Mao, Kaichun Mo, Arsalan Mousavian, Seungjun Nah, Sriharsha Niverty, David Page, Despoina Paschalidou, Zeeshan Patel, Lindsey Pavao, Morteza Ramezanali, Fitsum Reda, Xiaowei Ren, Vasanth Rao Naik Sabavat, Ed Schmerling, Stella Shi, Bartosz Stefaniak, Shitao Tang, Lyne Tchapmi, Przemek Tredak, Wei-Cheng Tseng, Jibin Varghese, Hao Wang, Haoxiang Wang, Heng Wang, Ting-Chun Wang, Fangyin Wei, Xinyue Wei, Jay Zhangjie Wu, Jiashu Xu, Wei Yang, Lin Yen-Chen, Xiaohui Zeng, Yu Zeng, Jing Zhang, Qinsheng Zhang, Yuxuan Zhang, Qingqing Zhao, Artur Zólkowski:
Cosmos World Foundation Model Platform for Physical AI. CoRR abs/2501.03575 (2025)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-15558
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-15558
Alisson G. Azzolini, Hannah Brandon, Prithvijit Chattopadhyay, Huayu Chen, Jinju Chu, Yin Cui, Jenna Diamond, Yifan Ding, Francesco Ferroni, Rama Govindaraju, Jinwei Gu, Siddharth Gururani, Imad El Hanafi, Zekun Hao, Jacob Samuel Huffman, Jingyi Jin, Brendan Johnson, Rizwan Khan, George Kurian, Elena Lantz, Nayeon Lee, Zhaoshuo Li, Xuan Li, Tsung-Yi Lin, Yen-Chen Lin, Ming-Yu Liu, Alice Luo, Andrew Mathau, Yun Ni, Lindsey Pavao, Wei Ping, David W. Romero, Misha Smelyanskiy, Shuran Song, Lyne Tchapmi, Andrew Z. Wang, Boxin Wang, Haoxiang Wang, Fangyin Wei, Jiashu Xu, Yao Xu, Xiaodong Yang, Zhuolin Yang, Xiaohui Zeng, Zhe Zhang:
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning. CoRR abs/2503.15558 (2025)
2024
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuangGLHZSGOY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangGLHZSGOY24
Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli Shama Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue:
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion. ICML 2024
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Dai0VG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Dai0VG24
Shuqi Dai, Ming-Yu Liu, Rafael Valle, Siddharth Gururani:
ExpressiveSinger: Multilingual and Multi-Style Score-based Singing Voice Synthesis with Expressive Performance Control. ACM Multimedia 2024: 3229-3238
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-14285
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-14285
Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli Shama Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue:
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion. CoRR abs/2402.14285 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-07126
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-07126
Yuval Atzmon, Maciej Bala, Yogesh Balaji, Tiffany Cai, Yin Cui, Jiaojiao Fan, Yunhao Ge, Siddharth Gururani, Jacob Samuel Huffman, Ronald Isaac, Pooya Jannaty, Tero Karras, Grace Lam, J. P. Lewis, Aaron Licata, Yen-Chen Lin, Ming-Yu Liu, Qianli Ma, Arun Mallya, Ashlee Martino-Tarr, Doug Mendez, Seungjun Nah, Chris Pruett, Fitsum Reda, Jiaming Song, Ting-Chun Wang, Fangyin Wei, Xiaohui Zeng, Yu Zeng, Qinsheng Zhang:
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models. CoRR abs/2411.07126 (2024)
2023
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/GururaniMWV023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/GururaniMWV023
Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu:
SPACE: Speech-driven Portrait Animation with Controllable Expression. ICCV 2023: 20857-20866
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BadlaniVSSGC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BadlaniVSSGC23
Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech. INTERSPEECH 2023: 626-630
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-10335
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-10335
Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
Multilingual Multiaccented Multispeaker TTS with RADTTS. CoRR abs/2301.10335 (2023)
2022
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-07769
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07769
Vinod Subramanian, Siddharth Gururani, Emmanouil Benetos, Mark B. Sandler:
Anomalous behaviour in loss-gradient based interpretability methods. CoRR abs/2207.07769 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-09809
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-09809
Siddharth Gururani, Arun Mallya, Ting-Chun Wang, Rafael Valle, Ming-Yu Liu:
SPACEx: Speech-driven Portrait Animation with Controllable Expression. CoRR abs/2211.09809 (2022)
2021
[b1]
- view
  - electronic edition via handle.net
  - details & citations
  authority control:
- export record
  dblp key:
  - phd/basesearch/Gururani21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/basesearch/Gururani21
Siddharth Kumar Gururani:
Weakly Supervised Learning for Musical Instrument Classification. Georgia Institute of Technology, Atlanta, GA, USA, 2021
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/ism/GururaniL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ism/GururaniL21
Siddharth Gururani, Alexander Lerch:
Semi-Supervised Audio Classification with Partially Labeled Data. ISM 2021: 111-114
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-09018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-09018
Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani:
An Interdisciplinary Review of Music Performance Analysis. CoRR abs/2104.09018 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-12761
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-12761
Siddharth Gururani, Alexander Lerch:
Semi-Supervised Audio Classification with Partially Labeled Data. CoRR abs/2111.12761 (2021)
2020
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tismir/LerchAPG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tismir/LerchAPG20
Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani:
An Interdisciplinary Review of Music Performance Analysis. Trans. Int. Soc. Music. Inf. Retr. 3(1): 221-245 (2020)
[c8]
- view
  - electronic edition @ ismir.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/PatiG020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/PatiG020
Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch:
dMelodies: A Music Dataset for Disentanglement Learning. ISMIR 2020: 125-133
[c7]
- view
  - electronic edition @ ismir.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/HuangHPG020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/HuangHPG020
Jiawen Huang, Yun-Ning Hung, Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch:
Score-informed Networks for Music Performance Assessment. ISMIR 2020: 908-915
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-09640
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-09640
Karn Watcharasupat, Siddharth Gururani, Alexander Lerch:
Visual Attention for Musical Instrument Recognition. CoRR abs/2006.09640 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-15067
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-15067
Ashis Pati, Siddharth Gururani, Alexander Lerch:
dMelodies: A Music Dataset for Disentanglement Learning. CoRR abs/2007.15067 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-00203
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-00203
Jiawen Huang, Yun-Ning Hung, Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch:
Score-informed Networks for Music Performance Assessment. CoRR abs/2008.00203 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c6]
- view
  - electronic edition @ ismir.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/LerchAPG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/LerchAPG19
Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani:
Music Performance Analysis: A Survey. ISMIR 2019: 33-43
[c5]
- view
  - electronic edition @ ismir.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/GururaniSL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/GururaniSL19
Siddharth Gururani, Mohit Sharma, Alexander Lerch:
An Attention Mechanism for Musical Instrument Recognition. ISMIR 2019: 83-90
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-00178
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-00178
Alexander Lerch, Claire Arthur, Kumar Ashis Pati, Siddharth Gururani:
Music Performance Analysis: A Survey. CoRR abs/1907.00178 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-04294
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-04294
Siddharth Gururani, Mohit Sharma, Alexander Lerch:
An Attention Mechanism for Musical Instrument Recognition. CoRR abs/1907.04294 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-09645
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-09645
Siddharth Gururani, Kilol Gupta, Dhaval Shah, Zahra Shakeri, Jervis Pinto:
Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features. CoRR abs/1911.09645 (2019)
2018
[c4]
- view
  - electronic edition @ ircam.fr (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/GururaniSL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/GururaniSL18
Siddharth Gururani, Cameron Summers, Alexander Lerch:
Instrument Activity Detection in Polyphonic Music using Deep Neural Networks. ISMIR 2018: 569-576
2017
[c3]
- view
- export record
  dblp key:
  - conf/ismir/GururaniL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/GururaniL17
Siddharth Gururani, Alexander Lerch:
Automatic Sample Detection in Polyphonic Music. ISMIR 2017: 264-271
[c2]
- view
  - electronic edition @ aes.org
  - details & citations
- export record
  dblp key:
  - conf/semanticaudio/VidwansGWSSL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/semanticaudio/VidwansGWSSL17
Amruta Vidwans, Siddharth Gururani, Chih-Wei Wu, Vinod Subramanian, Rupak Swaminathan, Alexander Lerch:
Objective Descriptors for the Assessment of Student Music Performances. Semantic Audio 2017
2016
[c1]
- view
  - electronic edition @ nyu.edu (open access)
  - details & citations
- export record
  dblp key:
  - conf/ismir/WintersGL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/WintersGL16
R. Michael Winters, Siddharth Gururani, Alexander Lerch:
Automatic Practice Logging: Introduction, Dataset & Preliminary Study. ISMIR 2016: 598-604

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.