default search action
Masayuki Suzuki
- > Home > Persons > Masayuki Suzuki
Publications
- 2024
- [c58]Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon:
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems. ICASSP 2024: 10176-10180 - [i5]Takuma Udagawa, Masayuki Suzuki, Masayasu Muraoka, Gakuto Kurata:
Robust ASR Error Correction with Conservative Data Filtering. CoRR abs/2407.13300 (2024) - 2023
- [i4]Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon:
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems. CoRR abs/2309.04031 (2023) - 2022
- [c57]Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata:
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. INTERSPEECH 2022: 2638-2642 - [c56]Takashi Fukuda, Samuel Thomas, Masayuki Suzuki, Gakuto Kurata, George Saon, Brian Kingsbury:
Global RNN Transducer Models For Multi-dialect Speech Recognition. INTERSPEECH 2022: 3138-3142 - [c55]Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon:
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems. INTERSPEECH 2022: 3919-3923 - [i3]Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata:
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. CoRR abs/2203.15176 (2022) - [i2]Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon:
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems. CoRR abs/2204.00212 (2022) - 2020
- [c53]Yosuke Higuchi, Masayuki Suzuki, Gakuto Kurata:
Speaker Embeddings Incorporating Acoustic Conditions for Diarization. ICASSP 2020: 7129-7133 - [c52]Shintaro Ando, Masayuki Suzuki, Nobuyasu Itoh, Gakuto Kurata, Nobuaki Minematsu:
Converting Written Language to Spoken Language with Neural Machine Translation for Language Modeling. ICASSP 2020: 8124-8128 - [c51]Hagai Aronowitz, Weizhong Zhu, Masayuki Suzuki, Gakuto Kurata, Ron Hoory:
New Advances in Speaker Diarization. INTERSPEECH 2020: 279-283 - 2019
- [c50]Tohru Nagano, Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata:
Data Augmentation Based on Vowel Stretch for Improving Children's Speech Recognition. ASRU 2019: 502-508 - [c49]Yinghui Huang, Samuel Thomas, Masayuki Suzuki, Zoltán Tüske, Larry Sansone, Michael Picheny:
Semi-Supervised Training and Data Augmentation for Adaptation of Automatic Broadcast News Captioning Systems. ASRU 2019: 867-874 - [c48]Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltán Tüske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko:
English Broadcast News Speech Recognition by Humans and Machines. ICASSP 2019: 6455-6459 - [c47]Masayuki Suzuki, Nobuyasu Itoh, Tohru Nagano, Gakuto Kurata, Samuel Thomas:
Improvements to N-gram Language Model Using Text Generated from Neural Language Model. ICASSP 2019: 7245-7249 - [c46]Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata:
Direct Neuron-Wise Fusion of Cognate Neural Networks. INTERSPEECH 2019: 1621-1625 - [i1]Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltán Tüske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko:
English Broadcast News Speech Recognition by Humans and Machines. CoRR abs/1904.13258 (2019) - 2018
- [c45]Masayuki Suzuki, Tohru Nagano, Gakuto Kurata, Samuel Thomas:
Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models. INTERSPEECH 2018: 2893-2897 - 2017
- [j15]Masayuki Suzuki, Ryo Kuroiwa, Keisuke Innami, Shumpei Kobayashi, Shinya Shimizu, Nobuaki Minematsu, Keikichi Hirose:
Accent Sandhi Estimation of Tokyo Dialect of Japanese Using Conditional Random Fields. IEICE Trans. Inf. Syst. 100-D(4): 655-661 (2017) - [j14]Nobuaki Minematsu, Ibuki Nakamura, Masayuki Suzuki, Hiroko Hirano, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
Development and Evaluation of Online Infrastructure to Aid Teaching and Learning of Japanese Prosody. IEICE Trans. Inf. Syst. 100-D(4): 662-669 (2017) - [c44]Osamu Ichikawa, Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata, Bhuvana Ramabhadran:
Harmonic feature fusion for robust neural network-based acoustic modeling. ICASSP 2017: 5195-5199 - [c43]Michael Heck, Masayuki Suzuki, Takashi Fukuda, Gakuto Kurata, Satoshi Nakamura:
Ensembles of Multi-Scale VGG Acoustic Models. INTERSPEECH 2017: 1616-1620 - [c42]Masayuki Suzuki, Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Kenneth Ward Church, Mark Drake:
Symbol Sequence Search from Telephone Conversation. INTERSPEECH 2017: 3612-3616 - [c41]Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata, Samuel Thomas, Jia Cui, Bhuvana Ramabhadran:
Efficient Knowledge Distillation from an Ensemble of Teachers. INTERSPEECH 2017: 3697-3701 - 2016
- [c40]Masayuki Suzuki, Gakuto Kurata, Tohru Nagano, Ryuki Tachibana:
Speech recognition robust against speech overlapping in monaural recordings of telephone conversations. ICASSP 2016: 5685-5689 - [c39]Masayuki Suzuki, Ryuki Tachibana, Samuel Thomas, Bhuvana Ramabhadran, George Saon:
Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings. INTERSPEECH 2016: 1588-1592 - 2015
- [j13]Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Discriminative re-ranking for automatic speech recognition by leveraging invariant structures. Speech Commun. 72: 208-217 (2015) - 2014
- [c36]Congying Zhang, Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Leveraging phonetic context dependent invariant structure for continuous speech recognition. ChinaSIP 2014: 52-56 - 2013
- [j12]Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe, Nobuaki Minematsu, Keikichi Hirose:
Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting. IEEE Trans. Speech Audio Process. 21(10): 2172-2181 (2013) - [c34]Chengshuo Wang, Masayuki Suzuki, Nobuaki Minematsu, Kyoko Sakuraba, Keikichi Hirose:
Improved estimation of femininity using GMM supervectors and SVR for voice therapy of Gender Identity Disorder Clients. ICASSP 2013: 7751-7754 - [c33]Hiroko Hirano, Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
A free online accent and intonation dictionary for teachers and learners of Japanese. INTERSPEECH 2013: 1875-1876 - [c32]Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Hiroko Hirano, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
Development of a web framework for teaching and learning Japanese prosody: OJAD (online Japanese accent dictionary). INTERSPEECH 2013: 2554-2558 - [c31]Nguyen Duc Duy, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Artificial bandwidth extension based on regularized piecewise linear mapping with discriminative region weighting and long-Span features. INTERSPEECH 2013: 3453-3457 - [c30]Hiroko Hirano, Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
OJAD: a free online accent and intonation dictionary for teachers and learners of Japanese. SLaTE 2013: 94 - 2012
- [c29]Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe, Nobuaki Minematsu, Keikichi Hirose:
MFCC enhancement using joint corrupted and noise feature space for highly non-stationary noise environments. ICASSP 2012: 4109-4112 - [c28]Keigo Chijiiwa, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Unseen noise robust speech recognition using adaptive piecewise linear transformation. ICASSP 2012: 4289-4292 - [c27]Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Discriminative Reranking for LVCSR Leveraging Invariant Structure. INTERSPEECH 2012: 563-566 - [c26]Yosuke Kashiwagi, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition. SLT 2012: 149-152 - [c25]Yi Luan, Masayuki Suzuki, Yutaka Yamauchi, Nobuaki Minematsu, Shuhei Kato, Keikichi Hirose:
Performance improvement of automatic pronunciation assessment in a noisy classroom. SLT 2012: 428-431 - [c24]Tongmu Zhao, Akemi Hoshino, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Automatic Chinese pronunciation error detection using SVM trained with structural features. SLT 2012: 473-478 - 2011
- [c23]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Structure-constrained distribution matching using quadratic programming and its application to pronunciation evaluation. ACPR 2011: 350-354 - [c21]Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Continuous Digits Recognition Leveraging Invariant Structure. INTERSPEECH 2011: 993-996 - 2010
- [j10]Nobuaki Minematsu, Satoshi Asakawa, Masayuki Suzuki, Yu Qiao:
Speech Structure and Its Application to Robust Speech Processing. New Gener. Comput. 28(3): 299-319 (2010) - [c19]Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Integration of multilayer regression analysis with structure-based pronunciation assessment. INTERSPEECH 2010: 586-589 - 2009
- [c16]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu:
A study on Hidden Structural Model and its application to labeling sequences. ASRU 2009: 118-123 - [c15]Masayuki Suzuki, Nobuaki Minematsu, Dean Luo, Keikichi Hirose:
Sub-structure-based estimation of pronunciation proficiency and classification of learners. ASRU 2009: 574-579 - [c14]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu:
Affine invariant features and their application to speech recognition. ICASSP 2009: 4629-4632 - [c13]Nobuaki Minematsu, Masayuki Suzuki:
Structure-based pronunciation assessment. SLaTE 2009 - [c12]Masayuki Suzuki, Dean Luo, Nobuaki Minematsu, Keikichi Hirose:
Improved structure-based automatic estimation of pronunciation proficiency. SLaTE 2009: 137-140
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-23 19:27 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint