default search action
Prem Seetharaman
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c28]Hugo Flores García, Prem Seetharaman, Rithesh Kumar, Bryan Pardo:
VampNet: Music Generation via Masked Acoustic Token Modeling. ISMIR 2023: 359-366 - [c27]Rithesh Kumar, Prem Seetharaman, Alejandro Luebs, Ishaan Kumar, Kundan Kumar:
High-Fidelity Audio Compression with Improved RVQGAN. NeurIPS 2023 - [i21]Rithesh Kumar, Prem Seetharaman, Alejandro Luebs, Ishaan Kumar, Kundan Kumar:
High-Fidelity Audio Compression with Improved RVQGAN. CoRR abs/2306.06546 (2023) - [i20]Hugo Flores García, Prem Seetharaman, Rithesh Kumar, Bryan Pardo:
VampNet: Music Generation via Masked Acoustic Token Modeling. CoRR abs/2307.04686 (2023) - 2022
- [c26]Ethan Manilow, Patrick O'Reilly, Prem Seetharaman, Bryan Pardo:
Source Separation By Steering Pretrained Music Models. ICASSP 2022: 126-130 - [c25]Ho-Hsiang Wu, Prem Seetharaman, Kundan Kumar, Juan Pablo Bello:
Wav2CLIP: Learning Robust Audio Representations from Clip. ICASSP 2022: 4563-4567 - [c24]Max Morrison, Rithesh Kumar, Kundan Kumar, Prem Seetharaman, Aaron C. Courville, Yoshua Bengio:
Chunked Autoregressive GAN for Conditional Waveform Synthesis. ICLR 2022 - [c23]Ho-Hsiang Wu, Magdalena Fuentes, Prem Seetharaman, Juan Pablo Bello:
How to Listen? Rethinking Visual Sound Localization. INTERSPEECH 2022: 876-880 - [c22]Noah Schaffer, Boaz Cogan, Ethan Manilow, Max Morrison, Prem Seetharaman, Bryan Pardo:
Music Separation Enhancement with Generative Modeling. ISMIR 2022: 772-780 - [i19]Ho-Hsiang Wu, Magdalena Fuentes, Prem Seetharaman, Juan Pablo Bello:
How to Listen? Rethinking Visual Sound Localization. CoRR abs/2204.05156 (2022) - [i18]Noah Schaffer, Boaz Cogan, Ethan Manilow, Max Morrison, Prem Seetharaman, Bryan Pardo:
Music Separation Enhancement with Generative Modeling. CoRR abs/2208.12387 (2022) - 2021
- [c21]Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's all the Fuss about Free Universal Sound Separation Data? ICASSP 2021: 186-190 - [c20]Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes. ICASSP 2021: 840-844 - [i17]Max Morrison, Rithesh Kumar, Kundan Kumar, Prem Seetharaman, Aaron C. Courville, Yoshua Bengio:
Chunked Autoregressive GAN for Conditional Waveform Synthesis. CoRR abs/2110.10139 (2021) - [i16]Ho-Hsiang Wu, Prem Seetharaman, Kundan Kumar, Juan Pablo Bello:
Wav2CLIP: Learning Robust Audio Representations From CLIP. CoRR abs/2110.11499 (2021) - [i15]Ethan Manilow, Patrick O'Reilly, Prem Seetharaman, Bryan Pardo:
Unsupervised Source Separation By Steering Pretrained Music Models. CoRR abs/2110.13071 (2021) - 2020
- [c19]Alisa Liu, Prem Seetharaman, Bryan Pardo:
Model Selection for Deep Audio Source Separation via Clustering Analysis. DCASE 2020: 91-95 - [c18]Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R. Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Improving Sound Event Detection in Domestic Environments using Sound Separation. DCASE 2020: 205-209 - [c17]Ethan Manilow, Prem Seetharaman, Bryan Pardo:
Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments. ICASSP 2020: 771-775 - [c16]Prem Seetharaman, Gordon Wichern, Bryan Pardo, Jonathan Le Roux:
Autoclip: Adaptive Gradient Clipping for Source Separation Networks. MLSP 2020: 1-6 - [i14]Alexander Fang, Alisa Liu, Prem Seetharaman, Bryan Pardo:
Bach or Mock? A Grading Function for Chorales in the Style of J.S. Bach. CoRR abs/2006.13329 (2020) - [i13]Alisa Liu, Alexander Fang, Gaëtan Hadjeres, Prem Seetharaman, Bryan Pardo:
Incorporating Music Knowledge in Continual Dataset Augmentation for Music Generation. CoRR abs/2006.13331 (2020) - [i12]Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R. Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Improving Sound Event Detection In Domestic Environments Using Sound Separation. CoRR abs/2007.03932 (2020) - [i11]Omkar Ranadive, Grant Gasser, David Terpay, Prem Seetharaman:
OtoWorld: Towards Learning to Separate by Learning to Move. CoRR abs/2007.06123 (2020) - [i10]Prem Seetharaman, Gordon Wichern, Bryan Pardo, Jonathan Le Roux:
AutoClip: Adaptive Gradient Clipping for Source Separation Networks. CoRR abs/2007.14469 (2020) - [i9]Andreas Bugler, Bryan Pardo, Prem Seetharaman:
A Study of Transfer Learning in Music Source Separation. CoRR abs/2010.12650 (2020) - [i8]Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes. CoRR abs/2011.00801 (2020) - [i7]Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's All the FUSS About Free Universal Sound Separation Data? CoRR abs/2011.00803 (2020)
2010 – 2019
- 2019
- [j1]Eric J. Humphrey, Sravana Reddy, Prem Seetharaman, Aparna Kumar, Rachel M. Bittner, Andrew M. Demetriou, Sankalp Gulati, Andreas Jansson, Tristan Jehan, Bernhard Lehner, Anna M. Kruspe, Luwei Yang:
An Introduction to Signal Processing for Singing-Voice Analysis: High Notes in the Effort to Automate the Understanding of Vocals in Music. IEEE Signal Process. Mag. 36(1): 82-94 (2019) - [c15]Prem Seetharaman, Gautham J. Mysore, Bryan Pardo, Paris Smaragdis, Celso Gomes:
VoiceAssist: Guiding Users to High-Quality Voice Recordings. CHI 2019: 309 - [c14]Fatemeh Pishdadian, Prem Seetharaman, Bongjun Kim, Bryan Pardo:
Classifying Non-speech Vocals: Deep vs Signal Processing Representations. DCASE 2019: 194-198 - [c13]Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux:
Class-conditional Embeddings for Music Source Separation. ICASSP 2019: 301-305 - [c12]Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo:
Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures. ICASSP 2019: 356-360 - [c11]Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux:
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity. WASPAA 2019: 45-49 - [i6]Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux:
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity. CoRR abs/1909.08494 (2019) - [i5]Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo:
Bootstrapping deep music separation from primitive auditory grouping principles. CoRR abs/1910.11133 (2019) - [i4]Ethan Manilow, Prem Seetharaman, Bryan Pardo:
Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments. CoRR abs/1910.12621 (2019) - [i3]Alisa Liu, Prem Seetharaman, Bryan Pardo:
Model selection for deep audio source separation via clustering analysis. CoRR abs/1910.12626 (2019) - 2018
- [c10]Prem Seetharaman, Gautham J. Mysore, Paris Smaragdis, Bryan Pardo:
Blind Estimation of the Speech Transmission Index for Speech Quality Prediction. ICASSP 2018: 591-595 - [c9]Ethan Manilow, Prem Seetharaman, Bryan Pardo:
The Northwestern University Source Separation Library. ISMIR 2018: 297-305 - [c8]Julia Wilkins, Prem Seetharaman, Alison Wahl, Bryan Pardo:
VocalSet: A Singing Voice Dataset. ISMIR 2018: 468-474 - [i2]Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo:
Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures. CoRR abs/1811.02130 (2018) - [i1]Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux:
Class-conditional embeddings for music source separation. CoRR abs/1811.03076 (2018) - 2017
- [c7]Prem Seetharaman, Zafar Rafii:
Cover song identification with 2D Fourier Transform sequences. ICASSP 2017: 616-620 - [c6]Prem Seetharaman, Fatemeh Pishdadian, Bryan Pardo:
Music/Voice separation using the 2D fourier transform. WASPAA 2017: 36-40 - [c5]Ethan Manilow, Prem Seetharaman, Fatemeh Pishdadian, Bryan Pardo:
Predicting algorithm efficacy for adaptive multi-cue source separation. WASPAA 2017: 274-278 - 2016
- [c4]Prem Seetharaman, Bryan Pardo:
Simultaneous Separation and Segmentation in Layered Music. ISMIR 2016: 495-501 - [c3]Taylor Zheng, Prem Seetharaman, Bryan Pardo:
SocialFX: Studying a Crowdsourced Folksonomy of Audio Effects Terms. ACM Multimedia 2016: 182-186 - 2014
- [c2]Prem Seetharaman, Bryan Pardo:
Crowdsourcing a Reverberation Descriptor Map. ACM Multimedia 2014: 587-596 - [c1]Prem Seetharaman, Bryan Pardo:
Reverbalize: A Crowdsourced Reverberation Controller. ACM Multimedia 2014: 739-740
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint