| 2013 | ||
|---|---|---|
| j24 | Keiichi Tokuda, Yoshihiko Nankaku, Tomoki Toda, Heiga Zen, Junichi Yamagishi, Keiichiro Oura: Speech Synthesis Based on Hidden Markov Models. Proceedings of the IEEE 101(5): 1234-1252 (2013) | |
| 2012 | ||
| j23 | Tomoaki Nakamura, Komei Sugiura, Takayuki Nagai, Naoto Iwahashi, Tomoki Toda, Hiroyuki Okada, Takashi Omori: Learning Novel Objects for Extended Mobile Manipulation. Journal of Intelligent and Robotic Systems 66(1-2): 187-204 (2012) | |
| j22 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. Speech Communication 54(1): 134-146 (2012) | |
| j21 | Tomoki Toda, Mikihiro Nakagiri, Kiyohiro Shikano: Statistical Voice Conversion Techniques for Body-Conducted Unvoiced Speech Enhancement. IEEE Transactions on Audio, Speech & Language Processing 20(9): 2505-2517 (2012) | |
| c56 | Kenzo Yamamoto, Tomoki Toda, Hironori Doi, Hiroshi Saruwatari, Kiyohiro Shikano: Statistical approach to voice quality control in esophageal speech enhancement. ICASSP 2012: 4497-4500 | |
| c55 | Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai, Sakriani Sakti, Satoshi Nakamura: An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis. INTERSPEECH 2012 | |
| c54 | Tomoki Toda, Takashi Muramatsu, Hideki Banno: Implementation of Computationally Efficient Real-Time Voice Conversion. INTERSPEECH 2012 | |
| 2011 | ||
| c53 | Shunta Ishii, Tomoki Toda, Hiroshi Saruwatari, Sakriani Sakti, Satoshi Nakamura: Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing. ASRU 2011: 494-499 | |
| c52 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques. ICASSP 2011: 5136-5139 | |
| c51 | Denis Babani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic model training for non-audible murmur recognition using transformed normal speech data. ICASSP 2011: 5224-5227 | |
| c50 | Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation. INTERSPEECH 2011: 2769-2772 | |
| 2010 | ||
| j20 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Adaptive Training for Voice Conversion Based on Eigenvoices. IEICE Transactions 93-D(6): 1589-1598 (2010) | |
| j19 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion. IEICE Transactions 93-D(7): 1909-1917 (2010) | |
| j18 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models. IEICE Transactions 93-D(9): 2472-2482 (2010) | |
| j17 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Improvements of the One-to-Many Eigenvoice Conversion System. IEICE Transactions 93-D(9): 2491-2499 (2010) | |
| j16 | Tatsuya Hirahara, Makoto Otani, Shota Shimizu, Tomoki Toda, Keigo Nakamura, Yoshitaka Nakajima, Kiyohiro Shikano: Silent-speech enhancement using body-conducted vocal-tract resonance signals. Speech Communication 52(4): 301-313 (2010) | |
| j15 | Viet-Anh Tran, Gérard Bailly, Hélène Loevenbruck, Tomoki Toda: Improvement to a NAM-captured whisper-to-speech system. Speech Communication 52(4): 314-326 (2010) | |
| c49 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Statistical approach to enhancing esophageal speech based on Gaussian mixture models. ICASSP 2010: 4250-4253 | |
| c48 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Non-parallel training for many-to-many eigenvoice conversion. ICASSP 2010: 4822-4825 | |
| c47 | Yoshinori Shiga, Tomoki Toda, Shinsuke Sakai, Hisashi Kawai: Improved training of excitation for HMM-based parametric speech synthesis. INTERSPEECH 2010: 809-812 | |
| c46 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2010: 1628-1631 | |
| c45 | Kumi Ohta, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano: Adaptive voice-quality control based on one-to-many eigenvoice conversion. INTERSPEECH 2010: 2158-2161 | |
| 2009 | ||
| j14 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics. Speech Communication 51(1): 42-57 (2009) | |
| j13 | Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhen-Hua Ling, Tomoki Toda, Keiichi Tokuda, Simon King, Steve Renals: Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis. IEEE Transactions on Audio, Speech & Language Processing 17(6): 1208-1230 (2009) | |
| c44 | Tomoki Toda, Keigo Nakamura, Hidehiko Sekimoto, Kiyohiro Shikano: Voice conversion for various types of body transmitted speech. ICASSP 2009: 3601-3604 | |
| c43 | Kai Yu, Tomoki Toda, Milica Gasic, Simon Keizer, François Mairesse, Blaise Thomson, Steve Young: Probablistic modelling of F0 in unvoiced regions in HMM based speech synthesis. ICASSP 2009: 3773-3776 | |
| c42 | Daisuke Miyamoto, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic compensation methods for body transmitted speech conversion. ICASSP 2009: 3901-3904 | |
| c41 | Tomoki Toda, Steve Young: Trajectory training considering global variance for HMM-based speech synthesis. ICASSP 2009: 4025-4028 | |
| c40 | Tomoki Toda, Keigo Nakamura, Takayuki Nagai, Tomomi Kaino, Yoshitaka Nakajima, Kiyohiro Shikano: Technologies for processing body-conducted speech detected with non-audible murmur microphone. INTERSPEECH 2009: 632-635 | |
| c39 | Viet-Anh Tran, Gérard Bailly, Hélène Loevenbruck, Tomoki Toda: Multimodal HMM-based NAM-to-speech conversion. INTERSPEECH 2009: 656-659 | |
| c38 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2009: 1431-1434 | |
| c37 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Many-to-many eigenvoice conversion with reference voice. INTERSPEECH 2009: 1623-1626 | |
| c36 | Malorie Charlier, Yamato Ohtani, Tomoki Toda, Alexis Moinet, Thierry Dutoit: Cross-language voice conversion based on eigenvoices. INTERSPEECH 2009: 1635-1638 | |
| c35 | Ranniery Maia, Tomoki Toda, Keiichi Tokuda, Shinsuke Sakai, Satoshi Nakamura: A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis. INTERSPEECH 2009: 1783-1786 | |
| 2008 | ||
| j12 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training. IEICE Transactions 91-D(3): 499-507 (2008) | |
| j11 | Goshu Nagino, Makoto Shozakai, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method. IEICE Transactions 91-D(3): 607-614 (2008) | |
| j10 | Heiga Zen, Tomoki Toda, Keiichi Tokuda: The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006. IEICE Transactions 91-D(6): 1764-1773 (2008) | |
| j9 | Tomoki Toda, Alan W. Black, Keiichi Tokuda: Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model. Speech Communication 50(3): 215-227 (2008) | |
| c34 | Tomoki Toda, Keiichi Tokuda: Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM. ICASSP 2008: 3925-3928 | |
| c33 | Junichi Yamagishi, Takashi Nose, Heiga Zen, Tomoki Toda, Keiichi Tokuda: Performance evaluation of the speaker-independent HMM-based speech synthesis system "HTS 2007" for the Blizzard Challenge 2007. ICASSP 2008: 3957-3960 | |
| c32 | Ranniery Maia, Tomoki Toda, Keiichi Tokuda, Shinsuke Sakai, Satoshi Nakamura: On the state definition for a trainable excitation model in HMM-based speech synthesis. ICASSP 2008: 3965-3968 | |
| c31 | Kaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Keiichi Tokuda: Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching. INTERSPEECH 2008: 1072-1075 | |
| c30 | Takashi Muramatsu, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. INTERSPEECH 2008: 1076-1079 | |
| c29 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: An improved one-to-many eigenvoice conversion system. INTERSPEECH 2008: 1080-1083 | |
| c28 | Daisuke Tani, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano: Maximum a posteriori adaptation for many-to-one eigenvoice conversion. INTERSPEECH 2008: 1461-1463 | |
| c27 | Keigo Nakamura, Tomoki Toda, Yoshitaka Nakajima, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments. INTERSPEECH 2008: 2209-2212 | |
| 2007 | ||
| j8 | Heiga Zen, Tomoki Toda, Masaru Nakamura, Keiichi Tokuda: Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005. IEICE Transactions 90-D(1): 325-333 (2007) | |
| j7 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics. IEICE Transactions 90-D(2): 554-561 (2007) | |
| j6 | Tomoki Toda, Keiichi Tokuda: A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis. IEICE Transactions 90-D(5): 816-824 (2007) | |
| j5 | Tomoki Toda, Alan W. Black, Keiichi Tokuda: Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory. IEEE Transactions on Audio, Speech & Language Processing 15(8): 2222-2235 (2007) | |
| c26 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection. INTERSPEECH 2007: 262-265 | |
| c25 | Tobias Cincarek, Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Development of preschool children subsystem for ASR and Q&A in a real-environment speech-oriented guidance task. INTERSPEECH 2007: 1469-1472 | |
| c24 | Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda: A trainable excitation model for HMM-based speech synthesis. INTERSPEECH 2007: 1909-1912 | |
| c23 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model. INTERSPEECH 2007: 1981-1984 | |
| c22 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. INTERSPEECH 2007: 2517-2520 | |
| 2006 | ||
| j4 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models. IEICE Transactions 89-D(3): 962-969 (2006) | |
| j3 | Randy Gomez, Akinobu Lee, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models. IEICE Transactions 89-D(3): 998-1005 (2006) | |
| j2 | Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano: An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis. Speech Communication 48(1): 45-56 (2006) | |
| c21 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training. INTERSPEECH 2006 | |
| c20 | Mikihiro Nakagiri, Tomoki Toda, Hideki Kashioka, Kiyohiro Shikano: Improving body transmitted unvoiced speech with statistical voice conversion. INTERSPEECH 2006 | |
| c19 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. INTERSPEECH 2006 | |
| c18 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. INTERSPEECH 2006 | |
| c17 | Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano: Eigenvoice conversion based on Gaussian mixture model. INTERSPEECH 2006 | |
| c16 | Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee, Keiichi Tokuda: Voice conversion based on mixtures of factor analyzers. INTERSPEECH 2006 | |
| 2005 | ||
| j1 | Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Designing Target Cost Function Based on Prosody of Speech Database. IEICE Transactions 88-D(3): 519-524 (2005) | |
| c15 | Heiga Zen, Tomoki Toda: An overview of Nitech HMM-based speech synthesis system for Blizzard Challenge 2005. INTERSPEECH 2005: 93-96 | |
| c14 | Tomoki Toda, Kiyohiro Shikano: NAM-to-speech conversion with Gaussian mixture models. INTERSPEECH 2005: 1957-1960 | |
| c13 | Tomoki Toda, Keiichi Tokuda: Speech parameter generation algorithm considering global variance for HMM-based speech synthesis. INTERSPEECH 2005: 2801-2804 | |
| 2004 | ||
| c12 | Tomoki Toda, Alan W. Black, Keiichi Tokuda: Acoustic-to-articulatory inversion mapping with Gaussian mixture model. INTERSPEECH 2004 | |
| c11 | Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification. LREC 2004 | |
| 2003 | ||
| c10 | Hiromichi Kawanami, Yohei Iwami, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: GMM-based voice conversion applied to emotional speech synthesis. INTERSPEECH 2003 | |
| c9 | Tatsuya Shiraishi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Simple designing methods of corpus-based visual speech synthesis. INTERSPEECH 2003 | |
| c8 | Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki: Optimizing integrated cost function for segment selection in concatenative speech synthesis based on perceptual evaluations. INTERSPEECH 2003 | |
| 2002 | ||
| c7 | Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano: Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit. ICASSP 2002: 465-468 | |
| c6 | Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano: Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer. INTERSPEECH 2002 | |
| c5 | Mikiko Mashimo, Tomoki Toda, Hiromichi Kawanami, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell: Evaluation of cross-language voice conversion using bilingual and non-bilingual databases. INTERSPEECH 2002 | |
| c4 | Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano: Designing speech database with prosodic variety for expressive TTS system. LREC 2002 | |
| 2001 | ||
| c3 | Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: High quality voice conversion based on Gaussian mixture model with dynamic frequency warping. INTERSPEECH 2001: 349-352 | |
| c2 | Mikiko Mashimo, Tomoki Toda, Kiyohiro Shikano, Nick Campbell: Evaluation of cross-language voice conversion based on GMM and STRAIGHT. INTERSPEECH 2001: 361-364 | |
| 2000 | ||
| c1 | Tomoki Toda, Jinlin Lu, Hiroshi Saruwatari, Kiyohiro Shikano: STRAIGHT-based voice conversion algorithm based on Gaussian mixture model. INTERSPEECH 2000: 279-282 | |
Data released under the ODC-BY 1.0 license.