default search action
Tomoki Koriyama
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2021
- [j7]Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Deep Gaussian process based multi-speaker speech synthesis with latent speaker representation. Speech Commun. 132: 132-145 (2021) - 2020
- [j6]Hiroki Tamaru, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Generative Moment Matching Network-Based Neural Double-Tracking for Synthesized and Natural Singing Voices. IEICE Trans. Inf. Syst. 103-D(3): 639-647 (2020) - 2019
- [j5]Tomoki Koriyama, Takao Kobayashi:
Statistical Parametric Speech Synthesis Using Deep Gaussian Processes. IEEE ACM Trans. Audio Speech Lang. Process. 27(5): 948-959 (2019) - 2018
- [j4]Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
GPR-based Thai speech synthesis using multi-level duration prediction. Speech Commun. 99: 114-123 (2018) - 2015
- [j3]Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi:
HMM-based expressive singing voice synthesis with singing style control and robust pitch modeling. Comput. Speech Lang. 34(1): 308-322 (2015) - 2014
- [j2]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Statistical Parametric Speech Synthesis Based on Gaussian Process Regression. IEEE J. Sel. Top. Signal Process. 8(2): 173-183 (2014) - [j1]Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka:
Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis. Speech Commun. 57: 144-154 (2014)
Conference and Workshop Papers
- 2023
- [c41]Koichi Miyazaki, Masato Murata, Tomoki Koriyama:
Structured State Space Decoder for Speech Recognition and Synthesis. ICASSP 2023: 1-5 - [c40]Dong Yang, Tomoki Koriyama, Yuki Saito, Takaaki Saeki, Detai Xin, Hiroshi Saruwatari:
Duration-Aware Pause Insertion Using Pre-Trained Language Model for Multi-Speaker Text-To-Speech. ICASSP 2023: 1-5 - 2022
- [c39]Takaaki Saeki, Detai Xin, Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari:
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022. INTERSPEECH 2022: 4521-4525 - [c38]Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis. INTERSPEECH 2022: 4551-4555 - 2021
- [c37]Xuan Luo, Shinnosuke Takamichi, Tomoki Koriyama, Yuki Saito, Hiroshi Saruwatari:
Emotion-Controllable Speech Synthesis Using Emotion Soft Labels and Fine-Grained Prosody Factors. APSIPA ASC 2021: 794-799 - [c36]Taiki Nakamura, Tomoki Koriyama, Hiroshi Saruwatari:
Sequence-to-Sequence Learning for Deep Gaussian Process Based Speech Synthesis Using Self-Attention GP Layer. Interspeech 2021: 121-125 - [c35]Detai Xin, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis. Interspeech 2021: 1614-1618 - [c34]Kazuki Mizuta, Tomoki Koriyama, Hiroshi Saruwatari:
Harmonic WaveGAN: GAN-Based Speech Waveform Generation Model with Harmonic Structure Discriminator. Interspeech 2021: 2192-2196 - [c33]Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari:
Accent Modeling of Low-Resourced Dialect in Pitch Accent Language Using Variational Autoencoder. SSW 2021: 189-194 - [c32]Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Naoko Tanji, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Audiobook Speech Synthesis Conditioned by Cross-Sentence Context-Aware Word Embeddings. SSW 2021: 211-215 - 2020
- [c31]Tomoki Koriyama, Hiroshi Saruwatari:
Utterance-Level Sequential Modeling for Deep Gaussian Process Based Speech Synthesis Using Simple Recurrent Unit. ICASSP 2020: 7249-7253 - [c30]Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Multi-Speaker Text-to-Speech Synthesis Using Deep Gaussian Processes. INTERSPEECH 2020: 2032-2036 - [c29]Detai Xin, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space. INTERSPEECH 2020: 2947-2951 - [c28]Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis. INTERSPEECH 2020: 3201-3205 - [c27]Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
DNN-based Speech Synthesis Using Abundant Tags of Spontaneous Speech Corpus. LREC 2020: 6438-6443 - 2019
- [c26]Tomoki Koriyama, Takao Kobayashi:
A Training Method Using DNN-guided Layerwise Pretraining for Deep Gaussian Processes. ICASSP 2019: 2787-2791 - [c25]Hiroki Tamaru, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking. ICASSP 2019: 7070-7074 - [c24]Tomoki Koriyama, Takao Kobayashi:
Semi-Supervised Prosody Modeling Using Deep Gaussian Process Latent Variable Model. INTERSPEECH 2019: 4450-4454 - [c23]Tomoki Koriyama, Shinnosuke Takamichi, Takao Kobayashi:
Sparse Approximation of Gram Matrices for GMMN-based Speech Synthesis. SSW 2019: 149-154 - 2017
- [c22]Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Enhanced F0 generation for GPR-based speech synthesis considering syllable-based prosodic features. APSIPA 2017: 1524-1527 - [c21]Nattapong Kurpukdee, Tomoki Koriyama, Takao Kobayashi, Sawit Kasuriya, Chai Wutiwiwatchai, Poonlap Lamsrichan:
Speech emotion recognition using convolutional long short-term memory neural network and support vector machines. APSIPA 2017: 1744-1749 - [c20]Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Duration prediction using multiple Gaussian process experts for GPR-based speech synthesis. ICASSP 2017: 5495-5499 - [c19]Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Sampling-Based Speech Parameter Generation Using Moment-Matching Networks. INTERSPEECH 2017: 3961-3965 - 2016
- [c18]Tomoki Koriyama, Syohei Oshio, Takao Kobayashi:
A speaker adaptation technique for Gaussian process regression based speech synthesis using feature space transform. ICASSP 2016: 5610-5614 - [c17]Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Unsupervised Stress Information Labeling Using Gaussian Process Latent Variable Model for Statistical Speech Synthesis. INTERSPEECH 2016: 1517-1521 - 2015
- [c16]Tomoki Koriyama, Takao Kobayashi:
Prosody generation using frame-based Gaussian process regression and classification for statistical parametric speech synthesis. ICASSP 2015: 4929-4933 - [c15]Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Duration prediction using multi-level model for GPR-based speech synthesis. INTERSPEECH 2015: 1591-1595 - [c14]Tomoki Koriyama, Takao Kobayashi:
A comparison of speech synthesis systems based on GPR, HMM, and DNN with a small amount of training data. INTERSPEECH 2015: 3496-3500 - 2014
- [c13]Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
HMM-based Thai speech synthesis using unsupervised stress context labeling. APSIPA 2014: 1-4 - [c12]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization. ICASSP 2014: 3834-3838 - [c11]Daiki Nagahama, Takashi Nose, Tomoki Koriyama, Takao Kobayashi:
Transform mapping using shared decision tree context clustering for HMM-based cross-lingual speech synthesis. INTERSPEECH 2014: 770-774 - [c10]Tomoki Koriyama, Hiroshi Suzuki, Takashi Nose, Takahiro Shinozaki, Takao Kobayashi:
Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling. INTERSPEECH 2014: 2337-2341 - [c9]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Parametric speech synthesis using local and global sparse Gaussian processes. MLSP 2014: 1-6 - 2013
- [c8]Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka:
HMM-based expressive speech synthesis based on phrase-level F0 context labeling. ICASSP 2013: 7859-7863 - [c7]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis. ICASSP 2013: 8007-8011 - [c6]Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi:
A style control technique for singing voice synthesis based on multiple-regression HSMM. INTERSPEECH 2013: 378-382 - [c5]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Statistical nonparametric speech synthesis using sparse Gaussian processes. INTERSPEECH 2013: 1072-1076 - 2012
- [c4]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
An F0 modeling technique based on prosodic events for spontaneous speech synthesis. ICASSP 2012: 4589-4592 - [c3]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Discontinuous Observation HMM for Prosodic-Event-Based F0 Generation. INTERSPEECH 2012: 462-465 - 2011
- [c2]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
On the Use of Extended Context for HMM-Based Spontaneous Conversational Speech Synthesis. INTERSPEECH 2011: 2657-2660 - 2010
- [c1]Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Conversational spontaneous speech synthesis using average voice model. INTERSPEECH 2010: 853-856
Informal and Other Publications
- 2024
- [i11]Dong Yang, Tomoki Koriyama, Yuki Saito:
Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech. CoRR abs/2402.00288 (2024) - [i10]Masato Murata, Koichi Miyazaki, Tomoki Koriyama:
An Attribute Interpolation Method in Speech Synthesis by Model Merging. CoRR abs/2407.00766 (2024) - [i9]Tomoki Koriyama:
VAE-based Phoneme Alignment Using Gradient Annealing and SSL Acoustic Features. CoRR abs/2407.02749 (2024) - 2023
- [i8]Dong Yang, Tomoki Koriyama, Yuki Saito, Takaaki Saeki, Detai Xin, Hiroshi Saruwatari:
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech. CoRR abs/2302.13652 (2023) - 2022
- [i7]Takaaki Saeki, Detai Xin, Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari:
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022. CoRR abs/2204.02152 (2022) - [i6]Koichi Miyazaki, Masato Murata, Tomoki Koriyama:
Structured State Space Decoder for Speech Recognition and Synthesis. CoRR abs/2210.17098 (2022) - 2020
- [i5]Tomoki Koriyama, Hiroshi Saruwatari:
Utterance-level Sequential Modeling For Deep Gaussian Process Based Speech Synthesis Using Simple Recurrent Unit. CoRR abs/2004.10823 (2020) - [i4]Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes. CoRR abs/2008.02950 (2020) - 2019
- [i3]Hiroki Tamaru, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking. CoRR abs/1902.03389 (2019) - [i2]Shinnosuke Takamichi, Kentaro Mitsui, Yuki Saito, Tomoki Koriyama, Naoko Tanji, Hiroshi Saruwatari:
JVS corpus: free Japanese multi-speaker voice corpus. CoRR abs/1908.06248 (2019) - 2017
- [i1]Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Sampling-based speech parameter generation using moment-matching networks. CoRR abs/1704.03626 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:08 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint