Kazuhiro Nakadai
2020 – today
- 2024
- [c234]Zirui Lin, Katsutoshi Itoyama, Kazuhiro Nakadai, Hideharu Amano:
FPGA-based Low Power Acceleration of HARK Sound Source Localization. COOL CHIPS 2024: 1-6
- [c233]Takahiro Osaki, Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Improving Noise Robustness of Automatic Speech Recognition Based on a Parallel Adapter Model with Near-Identity Initialization. IEA/AIE 2024: 454-466
- [c232]Mert Bozkurtlar, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai:
Real Time Sound Source Localization Using von-Mises ResNet. SII 2024: 466-471
- [i10]Ragib Amin Nihal, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai:
From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution. CoRR abs/2401.14661 (2024)
- [i9]Jiang Wang, Yuanzheng He, Daobilige Su, Katsutoshi Itoyama, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong:
SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization. CoRR abs/2405.19813 (2024)
- [i8]Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai:
Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance? CoRR abs/2407.15310 (2024)
- 2023
- [c231]Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai:
Is the Ideal Ratio Mask Really the Best? - Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers. APSIPA ASC 2023: 1843-1850
- [c230]Ziquan Qin, Kaijie Wei, Hideharu Amano, Kazuhiro Nakadai:
Low power implementation of Geometric High-order Decorrelation-based Source Separation on an FPGA board. COOL CHIPS 2023: 1-6
- [c229]Yui Sudo, Kazuya Hata, Kazuhiro Nakadai:
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation. INTERSPEECH 2023: 491-495
- [c228]Haris Gulzar, Monikka Roslianna Busto, Takeharu Eda, Katsutoshi Itoyama, Kazuhiro Nakadai:
miniStreamer: Enhancing Small Conformer with Chunked-Context Masking for Streaming ASR Applications on the Edge. INTERSPEECH 2023: 3277-3281
- [c227]Takahiro Aizawa, Yoshiaki Bando, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, Masaki Onishi:
Unsupervised Domain Adaptation of Universal Source Separation Based on Neural Full-Rank Spatial Covariance Analysis. MLSP 2023: 1-6
- [c226]Tan Sihan, Khan Nabeela Khanum, Katsutoshi Itoyama, Kazuhiro Nakadai:
Improving Sign Language Understanding Introducing Label Smoothing. RO-MAN 2023: 113-118
- [c225]Yui Sudo, Masayuki Takigahira, Hideo Tsuru, Kazuhiro Nakadai, Hirofumi Nakajima:
Online Adaptation of Fourier Series Based Acoustic Transfer Function Model to Improve Sound Source Localization and Separation. RO-MAN 2023: 2058-2063
- [c224]Masahiko Fujita, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
An Ensemble Method for Multiple Speech Enhancement Using Deep Learning. SII 2023: 1-6
- [c223]Haris Gulzar, Muhammad Shakeel, Katsutoshi Itoyama, Kazuhiro Nakadai, Kenji Nishida, Hideharu Amano, Takeharu Eda:
FPGA based Power-Efficient Edge Server to Accelerate Speech Interface for Socially Assistive Robotics. SII 2023: 1-6
- [c222]Yuanzheng He, Jiang Wang, Daobilige Su, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong:
Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization. SII 2023: 1-8
- [c221]Hidehiko Kishinami, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Reconstruction of Depth Scenes Based on Echolocation. SII 2023: 1-6
- [c220]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Metric-Based Multimodal Meta-Learning for Human Movement Identification Via Footstep Recognition. SII 2023: 1-8
- [c219]Chishio Sugiyama, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Assessment of Simultaneous Calibration for Positions, Orientations, and Time Offsets in Multiple Microphone Arrays Systems. SII 2023: 1-6
- [c218]Kei Suzuki, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Audio-Visual Class Association Based on Two-stage Self-supervised Contrastive Learning towards Robust Scene Analysis. SII 2023: 1-6
- [c217]Reiji Suzuki, Shinji Sumitani, Zachary Harlow, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Extracting Bird Vocalizations from a Complex Natural Soundscape in Forests Using Robot Audition Techniques. SII 2023: 1-6
- [i7]Yui Sudo, Kazuya Hata, Kazuhiro Nakadai:
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation. CoRR abs/2305.17846 (2023)
- [i6]Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai:
Is the Ideal Ratio Mask Really the Best? - Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers. CoRR abs/2309.12065 (2023)
- 2022
- [j55]Shiho Matsubayashi, Kazuhiro Nakadai, Reiji Suzuki, Tatsuya Ura, Makoto Hasebe, Hiroshi G. Okuno:
Auditory Survey of Endangered Eurasian Bittern Using Microphone Arrays and Robot Audition. Frontiers Robotics AI 9: 854572 (2022)
- [c216]Zhongyang Hou, Kaijie Wei, Hideharu Amano, Kazuhiro Nakadai:
An FPGA off-loading of HARK sound source localization. CANDARW 2022: 236-240
- [c215]Ryu Takeda, Yui Sudo, Kazuhiro Nakadai, Kazunori Komatani:
Empirical Sampling from Latent Utterance-wise Evidence Model for Missing Data ASR based on Neural Encoder-Decoder Model. INTERSPEECH 2022: 3789-3793
- [c214]Yoshiaki Bando, Takahiro Aizawa, Katsutoshi Itoyama, Kazuhiro Nakadai:
Weakly-Supervised Neural Full-Rank Spatial Covariance Analysis for a Front-End System of Distant Speech Recognition. INTERSPEECH 2022: 3824-3828
- [c213]Yui Sudo, Muhammad Shakeel, Kazuhiro Nakadai, Jiatong Shi, Shinji Watanabe:
Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection. INTERSPEECH 2022: 4641-4645
- [c212]Yasuhiro Kagimoto, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Spotforming by NMF Using Multiple Microphone Arrays. IROS 2022: 9253-9258
- [c211]Taiki Yamada, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Outdoor evaluation of sound source localization for drone groups using microphone arrays. IROS 2022: 9296-9301
- [i5]Yuanzheng He, Jiang Wang, Daobilige Su, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong:
Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization. CoRR abs/2210.05600 (2022)
- 2021
- [j54]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multichannel environmental sound segmentation. Appl. Intell. 51(11): 8245-8259 (2021)
- [j53]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Detecting earthquakes: a novel deep learning-based approach for effective disaster response. Appl. Intell. 51(11): 8305-8315 (2021)
- [c210]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Spatial Normalization to Reduce Positional Complexity in Direction-aided Supervised Binaural Sound Source Separation. APSIPA ASC 2021: 248-253
- [c209]Katsutoshi Itoyama, Yoshiya Morimoto, Shungo Masaki, Ryosuke Kojima, Kenji Nishida, Kazuhiro Nakadai:
Assessment of von Mises-Bernoulli Deep Neural Network in Sound Source Localization. Interspeech 2021: 2152-2156
- [c208]Kazuhiro Nakadai, Masayuki Takigahira, Yusuke Kawai, Hirofumi Nakajima:
Fully-Online Always-Adaptation of Transfer Functions and Its Application to Sound Source Localization and Separation. IROS 2021: 2100-2105
- [c207]Zhi Zhong, Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Assessment of a Beamforming Implementation Developed for Surface Sound Source Separation. SII 2021: 369-374
- [c206]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multi-channel Environmental Sound Segmentation utilizing Sound Source Localization and Separation U-Net. SII 2021: 382-387
- [c205]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
EMC: Earthquake Magnitudes Classification on Seismic Signals via Convolutional Recurrent Networks. SII 2021: 388-393
- [c204]Taiki Yamada, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Sound Source Tracking Using Integrated Direction Likelihood for Drones with Microphone Arrays. SII 2021: 394-399
- [c203]Reiji Suzuki, Hao Zhao, Shinji Sumitani, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Visualizing Directional Soundscapes of Bird Vocalizations Using Robot Audition Techniques. SII 2021: 487-492
- [c202]Shiho Matsubayashi, Fumiyuki Saito, Reiji Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno:
Observing Nocturnal Birds Using Localization Techniques. SII 2021: 493-498
- [c201]Kazuhiro Nakadai, Yosuke Fukumoto, Ryu Takeda:
Investigation of Node Pruning Criteria for Neural Networks Model Compression with Non-Linear Function and Non-Uniform Network Topology. SLT 2021: 117-124
- [i4]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Metric-based multimodal meta-learning for human movement identification via footstep recognition. CoRR abs/2111.07979 (2021)
- 2020
- [j52]Kazuhiro Nakadai, Hiroshi G. Okuno:
Robot Audition and Computational Auditory Scene Analysis. Adv. Intell. Syst. 2(9): 2000050 (2020)
- [j51]Toshinori Kagawa, Fumie Ono, Lin Shan, Ryu Miura, Kazuhiro Nakadai, Kotaro Hoshiba, Makoto Kumon, Hiroshi G. Okuno, Shin Kato, Fumihide Kojima:
Multi-hop wireless command and telemetry communication system for remote operation of robots with extending operation area beyond line-of-sight using 920 MHz/169 MHz. Adv. Robotics 34(11): 756-766 (2020)
- [j50]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Sound event aware environmental sound segmentation with Mask U-Net. Adv. Robotics 34(20): 1280-1290 (2020)
- [j49]Ryosuke Hasumoto, Kazuhiro Nakadai, Michita Imai:
Reactive Chameleon: A Method to Mimic Conversation Partner's Body Sway for a Robot. Int. J. Soc. Robotics 12(1): 239-258 (2020)
- [j48]Heike Brock, Iva Farag, Kazuhiro Nakadai:
Recognition of Non-Manual Content in Continuous Japanese Sign Language. Sensors 20(19): 5621 (2020)
- [j47]Heike Brock, Felix Law, Kazuhiro Nakadai, Yuji Nagashima:
Learning Three-dimensional Skeleton Data from Sign Language Video. ACM Trans. Intell. Syst. Technol. 11(3): 30:1-30:24 (2020)
- [c200]Toru Yamashita, Futoshi Asano, Kazuhiro Nakadai:
Age Classification of Evacuees at Times of Disaster Using a Vibration Sensor. APSIPA 2020: 184-188
- [c199]Naoki Yamamoto, Kenji Nishida, Katsutoshi Itoyama, Kazuhiro Nakadai:
Detection of Ball Spin Direction using Hitting Sound in Tennis. icSPORTS 2020: 30-37
- [c198]Katsuhiro Dan, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Calibration of a Microphone Array Based on a Probabilistic Model of Microphone Positions. IEA/AIE 2020: 614-625
- [c197]Katsutoshi Itoyama, Kazuhiro Nakadai:
Synchronization of Microphones Based on Rank Minimization of Warped Spectrum for Asynchronous Distributed Recording. IROS 2020: 4842-4847
- [c196]Shinji Sumitani, Reiji Suzuki, Takemi Morimatsu, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Soundscape Analysis of Bird Songs in Forests Using Microphone Arrays. SII 2020: 634-639
- [c195]Kazuhiro Nakadai, Shungo Masaki, Ryosuke Kojima, Osamu Sugiyama, Katsutoshi Itoyama, Kenji Nishida:
Sound Source Localization Based on von-Mises-Bernoulli Deep Neural Network. SII 2020: 658-663
- [c194]Yoshiaki Asahara, Kohich Matsuda, Hirofumi Nakajima, Kazuhiro Nakadai:
A Fourier series based Data compression model for Acoustic transfer function. SII 2020: 664-668
- [c193]Taiki Yamada, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Sound Source Tracking by Drones with Microphone Arrays. SII 2020: 796-801
- [c192]Takashi Konno, Kenji Nishida, Katsutoshi Itoyama, Kazuhiro Nakadai:
Audio-Visual 3D Reconstruction Framework for Dynamic Scenes. SII 2020: 802-807
- [c191]Zhi Zhong, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Design and Assessment of a Scan-and-sum Beamformer for Surface Sound Source Separation. SII 2020: 808-813
- [c190]Mizuho Wakabayashi, Kai Washizaki, Kotaro Hoshiba, Kazuhiro Nakadai, Hiroshi G. Okuno, Makoto Kumon:
Design and Implementation of Real-Time Visualization of Sound Source Positions by Drone Audition. SII 2020: 814-819
- [c189]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multi-channel Environmental sound segmentation. SII 2020: 820-825
2010 – 2019
- 2019
- [j46]Kazuhiro Nakadai, Emilia I. Barakova, Michita Imai, Tetsunari Inamura:
Special issue on robot and human interactive communication. Adv. Robotics 33(7-8): 307-308 (2019)
- [j45]Daniel Gabriel, Ryosuke Kojima, Kotaro Hoshiba, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
2D sound source position estimation using microphone arrays and its application to a VR-based bird song analysis system. Adv. Robotics 33(7-8): 403-414 (2019)
- [j44]Kazuhiro Nakadai, Emilia I. Barakova, Michita Imai, Tetsunari Inamura:
Special issue on robot and human interactive communication. Adv. Robotics 33(15-16): 699 (2019)
- [c188]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Improvement of DOA Estimation by using Quaternion Output in Sound Event Localization and Detection. DCASE 2019: 244-247
- [c187]Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata:
CNN-based Multichannel End-to-End Speech Recognition for Everyday Home Environments*. EUSIPCO 2019: 1-5
- [c186]Zhaofeng Zhang, Kazuhiro Nakadai, Hirofumi Nakajima, Naoaki Sumida:
Acoustic Simulation in Dynamic Environments for Robot Audition. EUSIPCO 2019: 1-5
- [c185]Shinji Sumitani, Reiji Suzuki, Naoaki Chiba, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi Gitchang Okuno:
An Integrated Framework for Field Recording, Localization, Classification and Annotation of Birdsongs Using Robot Audition Techniques - Harkbird 2.0. ICASSP 2019: 8246-8250
- [c184]Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata:
Weakly-Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation. IJCNN 2019: 1-8
- [c183]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Environmental sound segmentation utilizing Mask U-Net. IROS 2019: 5340-5345
- [c182]Daniel Gabriel, Ryosuke Kojima, Kotaro Hoshiba, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Design and assessment of multiple-sound source localization using microphone arrays. SII 2019: 199-204
- [c181]Makoto Kumon, Kai Washizaki, Kazuhiro Nakadai:
Close Sound Source Localization incorporating Semi-Supervised Variational Bayesian NMF. SII 2019: 313-318
- [p1]Kenzo Nonami, Kotaro Hoshiba, Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno, Yasutada Tanabe, Koichi Yonezawa, Hiroshi Tokutake, Satoshi Suzuki, Kohei Yamaguchi, Shigeru Sunada, Takeshi Takaki, Toshiyuki Nakata, Ryusuke Noda, Hao Liu, Satoshi Tadokoro:
Recent R&D Technologies and Future Prospective of Flying Robot in Tough Robotics Challenge. Disaster Robotics 2019: 77-142
- 2018
- [j43]Kotaro Hoshiba, Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno:
Assessment of MUSIC-Based Noise-Robust Sound Source Localization with Active Frequency Range Filtering. J. Robotics Mechatronics 30(3): 426-435 (2018)
- [j42]Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Tatsuya Kawahara, Hiroshi G. Okuno:
Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 215-230 (2018)
- [c180]Shinji Sumitani, Reiji Suzuki, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Extracting the Relationship between the Spatial Distribution and Types of Bird Vocalizations Using Robot Audition System HARK. IROS 2018: 2485-2490
- [c179]Ryosuke Kojima, Osamu Sugiyama, Kotaro Hoshiba, Reiji Suzuki, Kazuhiro Nakadai:
HARK-Bird-Box: A Portable Real-time Bird Song Scene Analysis System. IROS 2018: 2497-2502
- [c178]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Multi-timescale Feature-extraction Architecture of Deep Neural Networks for Acoustic Model Training from Raw Speech Signal. IROS 2018: 2503-2510
- [c177]Heike Brock, Shigeaki Nishina, Kazuhiro Nakadai:
To animate or anime-te?: Investigating sign avatar comprehensibility. IVA 2018: 331-332
- [c176]Heike Brock, Kazuhiro Nakadai:
Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions. LREC 2018
- [c175]Agathe Balayn, Heike Brock, Kazuhiro Nakadai:
Data-driven development of Virtual Sign Language Communication Agents. RO-MAN 2018: 370-377
- [c174]Ryosuke Taniguchi, Kotaro Hoshiba, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Signal Restoration based on Bi-directional LSTM with Spectral Filtering for Robot Audition. RO-MAN 2018: 955-960
- [i3]Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata:
Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation. CoRR abs/1807.01126 (2018)
- [i2]Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata:
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments. CoRR abs/1811.02735 (2018)
- 2017
- [j41]Lana Sinapayen, Keisuke Nakamura, Kazuhiro Nakadai, Hiroki Takahashi, Tetsuo Kinoshita:
Swarm of micro-quadrocopters for consensus-based sound source localization. Adv. Robotics 31(12): 624-633 (2017)
- [j40]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Acoustic model training based on node-wise weight boundary model for fast and small-footprint deep neural networks. Comput. Speech Lang. 46: 461-480 (2017)
- [j39]Hiroshi G. Okuno, Kazuhiro Nakadai:
Editorial: Robot Audition Technologies. J. Robotics Mechatronics 29(1): 15 (2017)
- [j38]Kazuhiro Nakadai, Hiroshi G. Okuno, Takeshi Mizumoto:
Development, Deployment and Applications of Robot Audition Open Source Software HARK. J. Robotics Mechatronics 29(1): 16-25 (2017)
- [j37]Nelson Yalta, Kazuhiro Nakadai, Tetsuya Ogata:
Sound Source Localization Using Deep Learning Models. J. Robotics Mechatronics 29(1): 37-48 (2017)
- [j36]Kazuhiro Nakadai, Tomoaki Koiwa:
Psychologically-Inspired Audio-Visual Speech Recognition Using Coarse Speech Recognition and Missing Feature Theory. J. Robotics Mechatronics 29(1): 105-113 (2017)
- [j35]Kazuhiro Nakadai, Taiki Tezuka, Takami Yoshida:
Ego-Noise Suppression for Robots Based on Semi-Blind Infinite Non-Negative Matrix Factorization. J. Robotics Mechatronics 29(1): 114-124 (2017)
- [j34]Kotaro Hoshiba, Osamu Sugiyama, Akihide Nagamine, Ryosuke Kojima, Makoto Kumon, Kazuhiro Nakadai:
Design and Assessment of Sound Source Localization System with a UAV-Embedded Microphone Array. J. Robotics Mechatronics 29(1): 154-167 (2017)
- [j33]Takuma Ohata, Keisuke Nakamura, Akihide Nagamine, Takeshi Mizumoto, Takayuki Ishizaki, Ryosuke Kojima, Osamu Sugiyama, Kazuhiro Nakadai:
Outdoor Sound Source Detection Using a Quadcopter with Microphone Array. J. Robotics Mechatronics 29(1): 177-187 (2017)
- [j32]Osamu Sugiyama, Satoshi Uemura, Akihide Nagamine, Ryosuke Kojima, Keisuke Nakamura, Kazuhiro Nakadai:
Outdoor Acoustic Event Identification with DNN Using a Quadrotor-Embedded Microphone Array. J. Robotics Mechatronics 29(1): 188-197 (2017)
- [j31]Reiji Suzuki, Shiho Matsubayashi, Richard W. Hedley, Kazuhiro Nakadai, Hiroshi G. Okuno:
HARKBird: Exploring Acoustic Interactions in Bird Communities Using a Microphone Array. J. Robotics Mechatronics 29(1): 213-223 (2017)
- [j30]Shiho Matsubayashi, Reiji Suzuki, Fumiyuki Saito, Tatsuyoshi Murate, Tomohisa Masuda, Koichi Yamamoto, Ryosuke Kojima, Kazuhiro Nakadai, Hiroshi G. Okuno:
Acoustic Monitoring of the Great Reed Warbler Using Multiple Microphone Arrays and Robot Audition. J. Robotics Mechatronics 29(1): 224-235 (2017)
- [j29]Ryosuke Kojima, Osamu Sugiyama, Kotaro Hoshiba, Kazuhiro Nakadai, Reiji Suzuki, Charles E. Taylor:
Bird Song Scene Analysis Using a Spatial-Cue-Based Probabilistic Model. J. Robotics Mechatronics 29(1): 236-246 (2017)
- [j28]Kotaro Hoshiba, Kai Washizaki, Mizuho Wakabayashi, Takahiro Ishiki, Makoto Kumon, Yoshiaki Bando, Daniel Gabriel, Kazuhiro Nakadai, Hiroshi G. Okuno:
Design of UAV-Embedded Microphone Array System for Sound Source Localization in Outdoor Environments. Sensors 17(11): 2535 (2017)
- [c173]Ryosuke Kojima, Osamu Sugiyama, Kotaro Hoshiba, Reiji Suzuki, Kazuhiro Nakadai:
A Spatial-Cue-Based Probabilistic Model for Bird Song Scene Analysis. DSAA 2017: 395-404
- [c172]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Node Pruning Based on Entropy of Weights and Node Activity for Small-Footprint Acoustic Model Based on Deep Neural Networks. INTERSPEECH 2017: 1636-1640
- [c171]Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno, Kotaro Hoshiba, Mizuho Wakabayashi, Kai Washizaki, Takahiro Ishiki, Daniel Gabriel, Yoshiaki Bando, Takayuki Morito, Ryosuke Kojima, Osamu Sugiyama:
Development of microphone-array-embedded UAV for search and rescue task. IROS 2017: 5985-5990
- 2016
- [j27]Ryosuke Kojima, Osamu Sugiyama, Kazuhiro Nakadai:
Multimodal Scene Understanding Framework and Its Application to Cooking Recognition. Appl. Artif. Intell. 30(3): 181-200 (2016)
- [c170]Cosmin Munteanu, Pourang Irani, Sharon L. Oviatt, Matthew P. Aylett, Gerald Penn, Shimei Pan, Nikhil Sharma, Frank Rudzicz, Randy Gomez, Keisuke Nakamura, Kazuhiro Nakadai:
Designing Speech and Multimodal Interactions for Mobile, Wearable, and Pervasive Applications. CHI Extended Abstracts 2016: 3612-3619
- [c169]Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Hiroshi G. Okuno:
Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array. EUSIPCO 2016: 1018-1022
- [c168]Takayuki Morito, Osamu Sugiyama, Satoshi Uemura, Ryosuke Kojima, Kazuhiro Nakadai:
Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification. IEA/AIE 2016: 562-573
- [c167]Reiji Suzuki, Shiho Matsubayashi, Kazuhiro Nakadai, Hiroshi G. Okuno:
Localizing Bird Songs Using an Open Source Robot Audition System with a Microphone Array. INTERSPEECH 2016: 2626-2630
- [c166]Ryosuke Kojima, Osamu Sugiyama, Reiji Suzuki, Kazuhiro Nakadai, Charles E. Taylor:
Semi-automatic bird song analysis by spatial-cue-based integration of sound source detection, localization, separation, and identification. IROS 2016: 1287-1292
- [c165]Takayuki Morito, Osamu Sugiyama, Ryosuke Kojima