Tuomas Virtanen
Publications
- 2023
- [c168] Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen:
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning. WASPAA 2023: 1-5
- [i73] Shayan Gharib, Minh Tran, Diep Luong, Konstantinos Drossos, Tuomas Virtanen:
Adversarial Representation Learning for Robust Privacy Preservation in Audio. CoRR abs/2305.00011 (2023)
- [i66] Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen:
Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning. CoRR abs/2308.04960 (2023)
- 2022
- [c161] Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. EUSIPCO 2022: 1140-1144
- [c160] Huang Xie, Okko Räsänen, Konstantinos Drossos, Tuomas Virtanen:
Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases. ICASSP 2022: 8867-8871
- [c159] Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. MMSP 2022: 1-6
- [i63] Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. CoRR abs/2204.09634 (2022)
- [i59] Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. CoRR abs/2208.02406 (2022)
- 2021
- [c151] An Tran, Konstantinos Drossos, Tuomas Virtanen:
WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. EUSIPCO 2021: 576-580
- [c149] Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. ICASSP 2021: 596-600
- [c147] Björn W. Schuller, Tuomas Virtanen, Maria Riveiro, Georgios Rizos, Jing Han, Annamaria Mesaros, Konstantinos Drossos:
Towards Sonification in Multimodal and User-friendly Explainable Artificial Intelligence. ICMI 2021: 788-792
- 2020
- [c145] Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-Task Regularization Based on Infrequent Classes for Audio Captioning. DCASE 2020: 6-10
- [c143] Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning. DCASE 2020: 110-114
- [c141] Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters. EUSIPCO 2020: 466-470
- [c140] Yanxiong Li, Mingle Liu, Konstantinos Drossos, Tuomas Virtanen:
Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks. ICASSP 2020: 286-290
- [c139] Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho: an Audio Captioning Dataset. ICASSP 2020: 736-740
- [c138] Konstantinos Drossos, Stylianos I. Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. IJCNN 2020: 1-7
- [c136] Pyry Pyykkönen, Stylianos I. Mimilakis, Konstantinos Drossos, Tuomas Virtanen:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. MMSP 2020: 1-6
- [i51] Konstantinos Drossos, Stylianos Ioannis Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. CoRR abs/2002.00476 (2020)
- [i48] Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations. CoRR abs/2006.08386 (2020)
- [i47] Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-sampling of Audio Feature Sequences for Automated Audio Captioning. CoRR abs/2007.02676 (2020)
- [i46] Pyry Pyykkönen, Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. CoRR abs/2007.02683 (2020)
- [i45] Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-task Regularization Based on Infrequent Classes for Audio Captioning. CoRR abs/2007.04660 (2020)
- [i44] Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Conditioned Time-Dilated Convolutions for Sound Event Detection. CoRR abs/2007.05183 (2020)
- [i42] An Tran, Konstantinos Drossos, Tuomas Virtanen:
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. CoRR abs/2010.11098 (2020)
- [i39] Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. CoRR abs/2010.14171 (2020)
- 2019
- [c133] Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. DCASE 2019: 59-63
- [c132] Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. DCASE 2019: 139-143
- [c125] Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification. WASPAA 2019: 259-263
- [i37] Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based On The Wasserstein Distance For Acoustic Scene Classification. CoRR abs/1904.10678 (2019)
- [i31] Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. CoRR abs/1907.08506 (2019)
- [i30] Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. CoRR abs/1907.09238 (2019)
- [i29] Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho: An Audio Captioning Dataset. CoRR abs/1910.09387 (2019)
- [i28] Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters. CoRR abs/1911.00527 (2019)
- [i26] Shayan Gharib, Konstantinos Drossos, Eemi Fagerlund, Tuomas Virtanen:
VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation. CoRR abs/1911.07098 (2019)
- 2018
- [c120] Shayan Gharib, Konstantinos Drossos, Emre Cakir, Dmitriy Serdyuk, Tuomas Virtanen:
Unsupervised adversarial domain adaptation for acoustic scene classification. DCASE 2018: 138-142
- [c116] Stylianos Ioannis Mimilakis, Konstantinos Drossos, João Felipe Santos, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask. ICASSP 2018: 721-725
- [c113] Konstantinos Drossos, Stylianos Ioannis Mimilakis, Dmitriy Serdyuk, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation. IJCNN 2018: 1-8
- [c112] Paul Magron, Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation. INTERSPEECH 2018: 332-336
- [c104] Konstantinos Drossos, Paul Magron, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery. IWAENC 2018: 421-425
- [i24] Konstantinos Drossos, Stylianos Ioannis Mimilakis, Dmitriy Serdyuk, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation. CoRR abs/1802.00300 (2018)
- [i22] Konstantinos Drossos, Stylianos Ioannis Mimilakis, Andreas Floros, Tuomas Virtanen, Gerald Schuller:
Close Miking Empirical Practice Verification: A Source Separation Approach. CoRR abs/1802.05132 (2018)
- [i17] Konstantinos Drossos, Paul Magron, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery. CoRR abs/1807.11298 (2018)
- [i15] Shayan Gharib, Konstantinos Drossos, Emre Çakir, Dmitriy Serdyuk, Tuomas Virtanen:
Unsupervised adversarial domain adaptation for acoustic scene classification. CoRR abs/1808.05777 (2018)
- 2017
- [c96] Sharath Adavanne, Konstantinos Drossos, Emre Cakir, Tuomas Virtanen:
Stacked convolutional and recurrent neural networks for bird audio detection. EUSIPCO 2017: 1729-1733
- [c95] Emre Cakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, Tuomas Virtanen:
Convolutional recurrent neural networks for bird audio detection. EUSIPCO 2017: 1744-1748
- [c91] Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen, Gerald Schuller:
A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation. MLSP 2017: 1-6
- [c85] Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen:
Automated audio captioning with recurrent neural networks. WASPAA 2017: 374-378
- [i13] Emre Çakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1703.02317 (2017)
- [i12] Sharath Adavanne, Konstantinos Drossos, Emre Çakir, Tuomas Virtanen:
Stacked Convolutional and Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1706.02047 (2017)
- [i10] Miroslav Malik, Sharath Adavanne, Konstantinos Drossos, Tuomas Virtanen, Dasa Ticha, Roman Jarina:
Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition. CoRR abs/1706.02292 (2017)
- [i8] Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen:
Automated Audio Captioning with Recurrent Neural Networks. CoRR abs/1706.10006 (2017)
- [i7] Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen, Gerald Schuller:
A Recurrent Encoder-Decoder Approach with Skip-filtering Connections for Monaural Singing Voice Separation. CoRR abs/1709.00611 (2017)
- [i2] Stylianos Ioannis Mimilakis, Konstantinos Drossos, João Felipe Santos, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask. CoRR abs/1711.01437 (2017)
last updated on 2024-04-12 21:08 CEST by the dblp team
all metadata released as open data under CC0 1.0 license