default search action

combined dblp search
author search
venue search
publication search

ask others

Tomoki Koriyama

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2021
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MitsuiKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MitsuiKS21
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Deep Gaussian process based multi-speaker speech synthesis with latent speaker representation. Speech Commun. 132: 132-145 (2021)
2020
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/TamaruSTKS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/TamaruSTKS20
Hiroki Tamaru, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Generative Moment Matching Network-Based Neural Double-Tracking for Synthesized and Natural Singing Voices. IEICE Trans. Inf. Syst. 103-D(3): 639-647 (2020)
2019
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/KoriyamaK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KoriyamaK19
Tomoki Koriyama, Takao Kobayashi:
Statistical Parametric Speech Synthesis Using Deep Gaussian Processes. IEEE ACM Trans. Audio Speech Lang. Process. 27(5): 948-959 (2019)
2018
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MoungsriKK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MoungsriKK18
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
GPR-based Thai speech synthesis using multi-level duration prediction. Speech Commun. 99: 114-123 (2018)
2015
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/NoseKKK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/NoseKKK15
Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi:
HMM-based expressive singing voice synthesis with singing style control and robust pitch modeling. Comput. Speech Lang. 34(1): 308-322 (2015)
2014
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/KoriyamaNK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/KoriyamaNK14
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Statistical Parametric Speech Synthesis Based on Gaussian Process Regression. IEEE J. Sel. Top. Signal Process. 8(2): 173-183 (2014)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MaenoNKKINMY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MaenoNKKINMY14
Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka:
Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis. Speech Commun. 57: 144-154 (2014)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2023
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MiyazakiMK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MiyazakiMK23
Koichi Miyazaki, Masato Murata, Tomoki Koriyama:
Structured State Space Decoder for Speech Recognition and Synthesis. ICASSP 2023: 1-5
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangKSSXS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangKSSXS23
Dong Yang, Tomoki Koriyama, Yuki Saito, Takaaki Saeki, Detai Xin, Hiroshi Saruwatari:
Duration-Aware Pause Insertion Using Pre-Trained Language Model for Multi-Speaker Text-To-Speech. ICASSP 2023: 1-5
2022
[c39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaekiXNKTS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaekiXNKTS22
Takaaki Saeki, Detai Xin, Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari:
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022. INTERSPEECH 2022: 4521-4525
[c38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakataKTSIMS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakataKTSIMS22
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis. INTERSPEECH 2022: 4551-4555
2021
[c37]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/LuoTKSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LuoTKSS21
Xuan Luo, Shinnosuke Takamichi, Tomoki Koriyama, Yuki Saito, Hiroshi Saruwatari:
Emotion-Controllable Speech Synthesis Using Emotion Soft Labels and Fine-Grained Prosody Factors. APSIPA ASC 2021: 794-799
[c36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakamuraKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakamuraKS21
Taiki Nakamura, Tomoki Koriyama, Hiroshi Saruwatari:
Sequence-to-Sequence Learning for Deep Gaussian Process Based Speech Synthesis Using Self-Attention GP Layer. Interspeech 2021: 121-125
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XinSTKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XinSTKS21
Detai Xin, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis. Interspeech 2021: 1614-1618
[c34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MizutaKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MizutaKS21
Kazuki Mizuta, Tomoki Koriyama, Hiroshi Saruwatari:
Harmonic WaveGAN: GAN-Based Speech Waveform Generation Model with Harmonic Structure Discriminator. Interspeech 2021: 2192-2196
[c33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/YufuneKTS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/YufuneKTS21
Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari:
Accent Modeling of Low-Resourced Dialect in Pitch Accent Language Using Variational Autoencoder. SSW 2021: 189-194
[c32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/NakataKTTIMS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/NakataKTTIMS21
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Naoko Tanji, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Audiobook Speech Synthesis Conditioned by Cross-Sentence Context-Aware Word Embeddings. SSW 2021: 211-215
2020
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoriyamaS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoriyamaS20
Tomoki Koriyama, Hiroshi Saruwatari:
Utterance-Level Sequential Modeling for Deep Gaussian Process Based Speech Synthesis Using Simple Recurrent Unit. ICASSP 2020: 7249-7253
[c30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MitsuiKS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MitsuiKS20
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Multi-Speaker Text-to-Speech Synthesis Using Deep Gaussian Processes. INTERSPEECH 2020: 2032-2036
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XinSTKS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XinSTKS20
Detai Xin, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space. INTERSPEECH 2020: 2947-2951
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamashitaKSTIMS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamashitaKSTIMS20
Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis. INTERSPEECH 2020: 3201-3205
[c27]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/lrec/YamashitaKSTIMS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lrec/YamashitaKSTIMS20
Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
DNN-based Speech Synthesis Using Abundant Tags of Spontaneous Speech Corpus. LREC 2020: 6438-6443
2019
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoriyamaK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoriyamaK19
Tomoki Koriyama, Takao Kobayashi:
A Training Method Using DNN-guided Layerwise Pretraining for Deep Gaussian Processes. ICASSP 2019: 2787-2791
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TamaruSTKS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TamaruSTKS19
Hiroki Tamaru, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking. ICASSP 2019: 7070-7074
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoriyamaK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoriyamaK19
Tomoki Koriyama, Takao Kobayashi:
Semi-Supervised Prosody Modeling Using Deep Gaussian Process Latent Variable Model. INTERSPEECH 2019: 4450-4454
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/KoriyamaTK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/KoriyamaTK19
Tomoki Koriyama, Shinnosuke Takamichi, Takao Kobayashi:
Sparse Approximation of Gram Matrices for GMMN-based Speech Synthesis. SSW 2019: 149-154
2017
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/MoungsriKK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MoungsriKK17
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Enhanced F0 generation for GPR-based speech synthesis considering syllable-based prosodic features. APSIPA 2017: 1524-1527
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/KurpukdeeKKKWL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/KurpukdeeKKKWL17
Nattapong Kurpukdee, Tomoki Koriyama, Takao Kobayashi, Sawit Kasuriya, Chai Wutiwiwatchai, Poonlap Lamsrichan:
Speech emotion recognition using convolutional long short-term memory neural network and support vector machines. APSIPA 2017: 1744-1749
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MoungsriKK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MoungsriKK17
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Duration prediction using multiple Gaussian process experts for GPR-based speech synthesis. ICASSP 2017: 5495-5499
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TakamichiKS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakamichiKS17
Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Sampling-Based Speech Parameter Generation Using Moment-Matching Networks. INTERSPEECH 2017: 3961-3965
2016
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoriyamaOK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoriyamaOK16
Tomoki Koriyama, Syohei Oshio, Takao Kobayashi:
A speaker adaptation technique for Gaussian process regression based speech synthesis using feature space transform. ICASSP 2016: 5610-5614
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoungsriKK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoungsriKK16
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Unsupervised Stress Information Labeling Using Gaussian Process Latent Variable Model for Statistical Speech Synthesis. INTERSPEECH 2016: 1517-1521
2015
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoriyamaK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoriyamaK15
Tomoki Koriyama, Takao Kobayashi:
Prosody generation using frame-based Gaussian process regression and classification for statistical parametric speech synthesis. ICASSP 2015: 4929-4933
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoungsriKK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoungsriKK15
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Duration prediction using multi-level model for GPR-based speech synthesis. INTERSPEECH 2015: 1591-1595
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoriyamaK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoriyamaK15
Tomoki Koriyama, Takao Kobayashi:
A comparison of speech synthesis systems based on GPR, HMM, and DNN with a small amount of training data. INTERSPEECH 2015: 3496-3500
2014
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/MoungsriKK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MoungsriKK14
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
HMM-based Thai speech synthesis using unsupervised stress context labeling. APSIPA 2014: 1-4
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoriyamaNK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoriyamaNK14
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization. ICASSP 2014: 3834-3838
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NagahamaNKK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NagahamaNKK14
Daiki Nagahama, Takashi Nose, Tomoki Koriyama, Takao Kobayashi:
Transform mapping using shared decision tree context clustering for HMM-based cross-lingual speech synthesis. INTERSPEECH 2014: 770-774
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoriyamaSNSK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoriyamaSNSK14
Tomoki Koriyama, Hiroshi Suzuki, Takashi Nose, Takahiro Shinozaki, Takao Kobayashi:
Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling. INTERSPEECH 2014: 2337-2341
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/KoriyamaNK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/KoriyamaNK14
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Parametric speech synthesis using local and global sparse Gaussian processes. MLSP 2014: 1-6
2013
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MaenoNKKINMY13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaenoNKKINMY13
Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka:
HMM-based expressive speech synthesis based on phrase-level F0 context labeling. ICASSP 2013: 7859-7863
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoriyamaNK13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoriyamaNK13
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis. ICASSP 2013: 8007-8011
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NoseKKK13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NoseKKK13
Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi:
A style control technique for singing voice synthesis based on multiple-regression HSMM. INTERSPEECH 2013: 378-382
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoriyamaNK13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoriyamaNK13
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Statistical nonparametric speech synthesis using sparse Gaussian processes. INTERSPEECH 2013: 1072-1076
2012
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoriyamaNK12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoriyamaNK12
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
An F0 modeling technique based on prosodic events for spontaneous speech synthesis. ICASSP 2012: 4589-4592
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoriyamaNK12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoriyamaNK12
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Discontinuous Observation HMM for Prosodic-Event-Based F0 Generation. INTERSPEECH 2012: 462-465
2011
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoriyamaNK11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoriyamaNK11
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
On the Use of Extended Context for HMM-Based Spontaneous Conversational Speech Synthesis. INTERSPEECH 2011: 2657-2660
2010
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoriyamaNK10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoriyamaNK10
Tomoki Koriyama, Takashi Nose, Takao Kobayashi:
Conversational spontaneous speech synthesis using average voice model. INTERSPEECH 2010: 853-856

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-00288
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-00288
Dong Yang, Tomoki Koriyama, Yuki Saito:
Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech. CoRR abs/2402.00288 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-00766
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-00766
Masato Murata, Koichi Miyazaki, Tomoki Koriyama:
An Attribute Interpolation Method in Speech Synthesis by Model Merging. CoRR abs/2407.00766 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02749
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02749
Tomoki Koriyama:
VAE-based Phoneme Alignment Using Gradient Annealing and SSL Acoustic Features. CoRR abs/2407.02749 (2024)
2023
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-13652
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-13652
Dong Yang, Tomoki Koriyama, Yuki Saito, Takaaki Saeki, Detai Xin, Hiroshi Saruwatari:
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech. CoRR abs/2302.13652 (2023)
2022
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-02152
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-02152
Takaaki Saeki, Detai Xin, Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari:
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022. CoRR abs/2204.02152 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17098
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17098
Koichi Miyazaki, Masato Murata, Tomoki Koriyama:
Structured State Space Decoder for Speech Recognition and Synthesis. CoRR abs/2210.17098 (2022)
2020
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-10823
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-10823
Tomoki Koriyama, Hiroshi Saruwatari:
Utterance-level Sequential Modeling For Deep Gaussian Process Based Speech Synthesis Using Simple Recurrent Unit. CoRR abs/2004.10823 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-02950
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-02950
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes. CoRR abs/2008.02950 (2020)
2019
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-03389
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-03389
Hiroki Tamaru, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking. CoRR abs/1902.03389 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-06248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-06248
Shinnosuke Takamichi, Kentaro Mitsui, Yuki Saito, Tomoki Koriyama, Naoko Tanji, Hiroshi Saruwatari:
JVS corpus: free Japanese multi-speaker voice corpus. CoRR abs/1908.06248 (2019)
2017
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/TakamichiKS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/TakamichiKS17
Shinnosuke Takamichi, Tomoki Koriyama, Hiroshi Saruwatari:
Sampling-based speech parameter generation using moment-matching networks. CoRR abs/1704.03626 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.