Naoya Takahashi

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

Publications

2024
[j8]
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/SawataTUTM24
Ryosuke Sawata, Naoya Takahashi, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji:
The whole is greater than the sum of its parts: improving music source separation by bridging networks. EURASIP J. Audio Speech Music. Process. 2024(1): 39 (2024)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-03822
Mayank Kumar Singh, Naoya Takahashi, Wei-Hsiang Liao, Yuki Mitsufuji:
SilentCipher: Deep Audio Watermarking. CoRR abs/2406.03822 (2024)
2023
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CheukSUMTTHM23
Kin Wai Cheuk, Ryosuke Sawata, Toshimitsu Uesaka, Naoki Murata, Naoya Takahashi, Shusuke Takahashi, Dorien Herremans, Yuki Mitsufuji:
Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability. ICASSP 2023: 1-5
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShahSTO23
Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing. ICASSP 2023: 1-5
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TakahashiSM23
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji:
Hierarchical Diffusion Models for Singing Voice Neural Vocoder. ICASSP 2023: 1-5
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DongTMMB23
Hao-Wen Dong, Naoya Takahashi, Yuki Mitsufuji, Julian J. McAuley, Taylor Berg-Kirkpatrick:
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos. ICLR 2023
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SinghTO23
Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Iteratively Improving Speech Recognition and Voice Conversion. INTERSPEECH 2023: 206-210
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ShimadaPS0UAHKT23
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Aleksander Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. NeurIPS 2023
[d4]
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisSSHTKTAKUMV23
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.0.0. Zenodo, 2023 [all versions]
[d3]
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisSSHTKTAKUMV23a
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji, Tuomas Virtanen:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.1.0. Zenodo, 2023 [all versions]
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-10536
Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing. CoRR abs/2302.10536 (2023)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-13838
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji:
Cross-modal Face- and Voice-style Transfer. CoRR abs/2302.13838 (2023)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-07855
Ryosuke Sawata, Naoya Takahashi, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji:
The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation. CoRR abs/2305.07855 (2023)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15055
Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Iteratively Improving Speech Recognition and Voice Conversion. CoRR abs/2305.15055 (2023)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09126
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. CoRR abs/2306.09126 (2023)
2022
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/dcase/PolitisSSA0KTTM22
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. DCASE 2022
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TakahashiM22
Naoya Takahashi, Yuki Mitsufuji:
Amicable Examples for Informed Source Separation. ICASSP 2022: 241-245
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimadaKTTTM22
Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji:
Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training. ICASSP 2022: 316-320
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TakahashiM22a
Naoya Takahashi, Yuki Mitsufuji:
Amicable Examples for Informed Source Separation. ICASSP 2022: 4368-4372
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoyamaSTSTTTM22
Yuichiro Koyama, Kazuhide Shigemi, Masafumi Takahashi, Kazuki Shimada, Naoya Takahashi, Emiru Tsunoo, Shusuke Takahashi, Yuki Mitsufuji:
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection. ICASSP 2022: 8872-8876
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AgarwalTG22
Shrutina Agarwal, Naoya Takahashi, Sriram Ganapathy:
Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer. INTERSPEECH 2022: 3013-3017
[d2]
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisMSSAKKTTV22
Archontis Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.0.0. Zenodo, 2022 [all versions]
[d1]
  persistent URL:
  - https://dblp.org/rec/data/10/PolitisMSSAKKTTV22a
Archontis Politis, Yuki Mitsufuji, Parthasaarathy Sudarsanam, Kazuki Shimada, Sharath Adavanne, Yuichiro Koyama, Daniel Aleksander Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.1.0. Zenodo, 2022 [all versions]
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-01948
Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events. CoRR abs/2206.01948 (2022)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-12410
Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi:
Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer. CoRR abs/2208.12410 (2022)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05148
Kin Wai Cheuk, Ryosuke Sawata, Toshimitsu Uesaka, Naoki Murata, Naoya Takahashi, Shusuke Takahashi, Dorien Herremans, Yuki Mitsufuji:
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability. CoRR abs/2210.05148 (2022)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-07508
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji:
Hierarchical Diffusion Models for Singing Voice Neural Vocoder. CoRR abs/2210.07508 (2022)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11096
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji:
Robust One-Shot Singing Voice Conversion. CoRR abs/2210.11096 (2022)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-07065
Hao-Wen Dong, Naoya Takahashi, Yuki Mitsufuji, Julian J. McAuley, Taylor Berg-Kirkpatrick:
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos. CoRR abs/2212.07065 (2022)
2021
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/TakahashiM21
Naoya Takahashi, Yuki Mitsufuji:
Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks. CVPR 2021: 993-1002
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BasakAGT21
Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi:
End-to-End Lyrics Recognition with Voice to Singing Style Transfer. ICASSP 2021: 266-270
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TakahashiIM21
Naoya Takahashi, Shota Inoue, Yuki Mitsufuji:
Adversarial Attacks on Audio Source Separation. ICASSP 2021: 521-525
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShimadaKTTM21
Kazuki Shimada, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection. ICASSP 2021: 915-919
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/TakahashiSM21
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji:
Hierarchical disentangled representation learning for singing voice conversion. IJCNN 2021: 1-7
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-06842
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji:
Hierarchical disentangled representation learning for singing voice conversion. CoRR abs/2101.06842 (2021)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-08575
Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi:
End-to-end lyrics Recognition with Voice to Singing Style Transfer. CoRR abs/2102.08575 (2021)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-10806
Kazuki Shimada, Naoya Takahashi, Yuichiro Koyama, Shusuke Takahashi, Emiru Tsunoo, Masafumi Takahashi, Yuki Mitsufuji:
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection. CoRR abs/2106.10806 (2021)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05054
Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji:
Source Mixing and Separation Robust Audio Steganography. CoRR abs/2110.05054 (2021)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05059
Naoya Takahashi, Yuki Mitsufuji:
Amicable examples for informed source separation. CoRR abs/2110.05059 (2021)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06501
Yuichiro Koyama, Kazuhide Shigemi, Masafumi Takahashi, Kazuki Shimada, Naoya Takahashi, Emiru Tsunoo, Shusuke Takahashi, Yuki Mitsufuji:
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection. CoRR abs/2110.06501 (2021)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07124
Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji:
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training. CoRR abs/2110.07124 (2021)
2020
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TakahashiSBPGM20
Naoya Takahashi, Mayank Kumar Singh, Sakya Basak, Sudarsanam Parthasaarathy, Sriram Ganapathy, Yuki Mitsufuji:
Improving Voice Separation by Incorporating End-To-End Speech Recognition. ICASSP 2020: 41-45
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-12014
Kazuki Shimada, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net. CoRR abs/2006.12014 (2020)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-01733
Naoya Takahashi, Yuki Mitsufuji:
D3Net: Densely connected multidilated DenseNet for music source separation. CoRR abs/2010.01733 (2020)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03164
Naoya Takahashi, Shota Inoue, Yuki Mitsufuji:
Adversarial attacks on audio source separation. CoRR abs/2010.03164 (2020)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15306
Kazuki Shimada, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection. CoRR abs/2010.15306 (2020)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-11844
Naoya Takahashi, Yuki Mitsufuji:
Densely connected multidilated convolutional networks for dense prediction tasks. CoRR abs/2011.11844 (2020)
2019
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakahashiPGM19
Naoya Takahashi, Sudarsanam Parthasaarathy, Nabarun Goswami, Yuki Mitsufuji:
Recursive Speech Separation for Unknown Number of Speakers. INTERSPEECH 2019: 1348-1352
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-03065
Naoya Takahashi, Sudarsanam Parthasaarathy, Nabarun Goswami, Yuki Mitsufuji:
Recursive speech separation for unknown number of speakers. CoRR abs/1904.03065 (2019)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-12928
Naoya Takahashi, Mayank Kumar Singh, Sakya Basak, Sudarsanam Parthasaarathy, Sriram Ganapathy, Yuki Mitsufuji:
Improving Voice Separation by Incorporating End-to-end Speech Recognition. CoRR abs/1911.12928 (2019)
2018
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakahashiAGM18
Naoya Takahashi, Purvi Agrawal, Nabarun Goswami, Yuki Mitsufuji:
PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation. INTERSPEECH 2018: 2713-2717
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/TakahashiGM18
Naoya Takahashi, Nabarun Goswami, Yuki Mitsufuji:
Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation. IWAENC 2018: 106-110
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-02410
Naoya Takahashi, Nabarun Goswami, Yuki Mitsufuji:
MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation. CoRR abs/1805.02410 (2018)
2017
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/UhlichPGEKTM17
Stefan Uhlich, Marcello Porcu, Franck Giron, Michael Enenkl, Thomas Kemp, Naoya Takahashi, Yuki Mitsufuji:
Improving music source separation based on deep neural networks through data augmentation and network blending. ICASSP 2017: 261-265
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/TakahashiM17
Naoya Takahashi, Yuki Mitsufuji:
Multi-Scale multi-band densenets for audio source separation. WASPAA 2017: 21-25
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/TakahashiM17
Naoya Takahashi, Yuki Mitsufuji:
Multi-scale Multi-band DenseNets for Audio Source Separation. CoRR abs/1706.09588 (2017)
2016
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/TakahashiMH16
Naoya Takahashi, Mitsuharu Matsumoto, Shuji Hashimoto:
Noise reduction combining microphone and piezoelectric device. CoRR abs/1611.03178 (2016)
2007
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/icmc/TakahashiMH07
Naoya Takahashi, Mitsuharu Matsumoto, Shuji Hashimoto:
Electric Koto by vibrating Body. ICMC 2007
