Sharath Adavanne
2020 – today
- 2023
- [c21]Detai Xin, Sharath Adavanne, Federico Ang, Ashish Kulkarni, Shinnosuke Takamichi, Hiroshi Saruwatari:
Improving Speech Prosody of Audiobook Text-To-Speech Synthesis with Acoustic and Textual Contexts. ICASSP 2023: 1-5
- [c20]Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Aleksander Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. NeurIPS 2023
- [d9]Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.0.0. Zenodo, 2023
- [d8]Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.1.0. Zenodo, 2023
- [i23]Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. CoRR abs/2306.09126 (2023)
- 2022
- [c19]Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. DCASE 2022
- [d7]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU Spatial Room Impulse Response Database (TAU-SRIR DB). Zenodo, 2022
- [d6]Archontis Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.0.0. Zenodo, 2022
- [d5]Archontis Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Aleksander Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.1.0. Zenodo, 2022
- [i22]Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events. CoRR abs/2206.01948 (2022)
- [i21]Arun Baby, Saranya Vinnaitherthan, Akhil Kerhalkar, Pranav Jawale, Sharath Adavanne, Nagaraj Adiga:
Context-based out-of-vocabulary word recovery for ASR systems in Indian languages. CoRR abs/2206.04305 (2022)
- [i20]Detai Xin, Sharath Adavanne, Federico Ang, Ashish Kulkarni, Shinnosuke Takamichi, Hiroshi Saruwatari:
Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts. CoRR abs/2211.02336 (2022)
- 2021
- [j2]Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. IEEE ACM Trans. Audio Speech Lang. Process. 29: 684-698 (2021)
- [c18]Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. DCASE 2021: 125-129
- [c17]Arun Baby, Pranav Jawale, Saranya Vinnaitherthan, Sumukh Badam, Nagaraj Adiga, Sharath Adavanne:
Non-native English lexicon creation for bilingual speech synthesis. SSW 2021: 154-159
- [c16]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. WASPAA 2021: 211-215
- [d4]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU-NIGENS Spatial Sound Events 2021. Version 1. Zenodo, 2021
- [d3]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU-NIGENS Spatial Sound Events 2021. Version 1.1.0. Zenodo, 2021
- [d2]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
TAU-NIGENS Spatial Sound Events 2021. Version 1.2.0. Zenodo, 2021
- [i19]Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. CoRR abs/2106.06999 (2021)
- [i18]Arun Baby, Pranav Jawale, Saranya Vinnaitherthan, Sumukh Badam, Nagaraj Adiga, Sharath Adavanne:
Non-native English lexicon creation for bilingual speech synthesis. CoRR abs/2106.10870 (2021)
- [i17]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. CoRR abs/2111.00030 (2021)
- 2020
- [c15]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. DCASE 2020: 165-169
- [i16]Arun Baby, Saranya Vinnaitherthan, Nagaraj Adiga, Pranav Jawale, Sumukh Badam, Sharath Adavanne, Srikanth Konjeti:
An ASR Guided Speech Intelligibility Measure for TTS Model Selection. CoRR abs/2006.01463 (2020)
- [i15]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. CoRR abs/2006.01919 (2020)
- [i14]Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. CoRR abs/2009.02792 (2020)
2010 – 2019
- 2019
- [j1]Sharath Adavanne, Archontis Politis, Joonas Nikunen, Tuomas Virtanen:
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. IEEE J. Sel. Top. Signal Process. 13(1): 34-48 (2019)
- [c14]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A Multi-room Reverberant Dataset for Sound Event Localization and Detection. DCASE 2019: 10-14
- [c13]Sharath Adavanne, Haytham M. Fayek, Vladimir Tourbabin:
Sound Event Classification and Detection with Weakly Labeled Data. DCASE 2019: 15-19
- [c12]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. DCASE 2019: 20-24
- [c11]Marc C. Green, Sharath Adavanne, Damian T. Murphy, Tuomas Virtanen:
Acoustic Scene Classification Using Higher-Order Ambisonic Features. WASPAA 2019: 328-332
- [c10]Annamaria Mesaros, Sharath Adavanne, Archontis Politis, Toni Heittola, Tuomas Virtanen:
Joint Measurement of Localization and Detection of Sound Events. WASPAA 2019: 333-337
- [d1]Sharath Adavanne, Archontis Politis, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Sound event localization and detection (SELDnet) results. Zenodo, 2019
- [i13]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. CoRR abs/1904.12769 (2019)
- [i12]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A multi-room reverberant dataset for sound event localization and detection. CoRR abs/1905.08546 (2019)
- 2018
- [c9]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network. EUSIPCO 2018: 1462-1466
- [c8]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features. IJCNN 2018: 1-7
- [i11]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features. CoRR abs/1801.09522 (2018)
- [i10]Sharath Adavanne, Archontis Politis, Joonas Nikunen, Tuomas Virtanen:
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. CoRR abs/1807.00129 (2018)
- 2017
- [c7]Sharath Adavanne, Tuomas Virtanen:
Sound Event Detection Using Weakly Labeled Dataset with Stacked Convolutional and Recurrent Neural Network. DCASE 2017: 12-16
- [c6]Jose Maria Perez-Macias, Sharath Adavanne, Jari Viik, Alpo Värri, Sari-Leena Himanen, Mirja Tenhunen:
Assessment of support vector machines and convolutional neural networks to detect snoring using Emfit mattress. EMBC 2017: 2883-2886
- [c5]Sharath Adavanne, Konstantinos Drossos, Emre Cakir, Tuomas Virtanen:
Stacked convolutional and recurrent neural networks for bird audio detection. EUSIPCO 2017: 1729-1733
- [c4]Emre Cakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, Tuomas Virtanen:
Convolutional recurrent neural networks for bird audio detection. EUSIPCO 2017: 1744-1748
- [c3]Sharath Adavanne, Pasi Pertilä, Tuomas Virtanen:
Sound event detection using spatial features and convolutional recurrent neural network. ICASSP 2017: 771-775
- [c2]Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen:
Automated audio captioning with recurrent neural networks. WASPAA 2017: 374-378
- [i9]Emre Çakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1703.02317 (2017)
- [i8]Sharath Adavanne, Konstantinos Drossos, Emre Çakir, Tuomas Virtanen:
Stacked Convolutional and Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1706.02047 (2017)
- [i7]Sharath Adavanne, Pasi Pertilä, Tuomas Virtanen:
Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network. CoRR abs/1706.02291 (2017)
- [i6]Miroslav Malik, Sharath Adavanne, Konstantinos Drossos, Tuomas Virtanen, Dasa Ticha, Roman Jarina:
Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition. CoRR abs/1706.02292 (2017)
- [i5]Sharath Adavanne, Giambattista Parascandolo, Pasi Pertilä, Toni Heittola, Tuomas Virtanen:
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features. CoRR abs/1706.02293 (2017)
- [i4]Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen:
Automated Audio Captioning with Recurrent Neural Networks. CoRR abs/1706.10006 (2017)
- [i3]Sharath Adavanne, Tuomas Virtanen:
A report on sound event detection with different binaural features. CoRR abs/1710.02997 (2017)
- [i2]Sharath Adavanne, Tuomas Virtanen:
Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network. CoRR abs/1710.02998 (2017)
- [i1]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network. CoRR abs/1710.10059 (2017)
- 2016
- [c1]Sharath Adavanne, Giambattista Parascandolo, Pasi Pertilä, Toni Heittola, Tuomas Virtanen:
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features. DCASE 2016: 6-10
last updated on 2024-10-07 22:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license