Please note: This is a beta version of the new dblp website.
You can find the classic dblp view of this page here.
You can find the classic dblp view of this page here.
Atsushi Nakamura
2010 – today
- 2013
[j28]Marc Delcroix, Shinji Watanabe, Tomohiro Nakatani, Atsushi Nakamura: Cluster-based dynamic variance adaptation for interconnecting speech enhancement pre-processor and speech recognizer. Computer Speech & Language 27(1): 350-368 (2013)
[j27]Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Atsunori Ogawa, Takaaki Hori, Shinji Watanabe, Masakiyo Fujimoto, Takuya Yoshioka, Takanobu Oba, Yotaro Kubo, Mehrez Souden, Seong-Jun Hahm, Atsushi Nakamura: Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds. Computer Speech & Language 27(3): 851-873 (2013)
[j26]Seong-Jun Hahm, Shinji Watanabe, Atsunori Ogawa, Masakiyo Fujimoto, Takaaki Hori, Atsushi Nakamura: Prior-shared feature and model space speaker adaptation by consistently employing map estimation. Speech Communication 55(3): 415-431 (2013)- 2012
[j25]Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito: Model Shrinkage for Discriminative Language Models. IEICE Transactions 95-D(5): 1465-1474 (2012)
[j24]Takanori Suzuki, Hideo Arimoto, Takeshi Kitatani, Aki Takei, Takafumi Taniguchi, Kazunori Shinoda, Shigehisa Tanaka, Shinji Tsuji, Tatemi Ido, Jun Igrashi, Atsushi Nakamura, Kazuhiko Naoe, Kenji Uchida: Wide-Tuning-Wavelength-Range LGLC Laser with Low-Loss Dual-Core Spot Size Converter. IEICE Transactions 95-C(7): 1272-1275 (2012)
[j23]Takanobu Oba, Takaaki Hori, Atsushi Nakamura: Efficient training of discriminative language models by sample selection. Speech Communication 54(6): 791-800 (2012)
[j22]Atsunori Ogawa, Atsushi Nakamura: Joint estimation of confidence and error causes in speech recognition. Speech Communication 54(9): 1014-1028 (2012)
[j21]Takaaki Hori, Shoko Araki, Takuya Yoshioka, Masakiyo Fujimoto, Shinji Watanabe, Takanobu Oba, Atsunori Ogawa, Kazuhiro Otsuka, Dan Mikami, Keisuke Kinoshita, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato: Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera. IEEE Transactions on Audio, Speech & Language Processing 20(2): 499-513 (2012)
[j20]Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito: Round-Robin Duel Discriminative Language Models. IEEE Transactions on Audio, Speech & Language Processing 20(4): 1244-1255 (2012)
[j19]Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu: Statistical Voice Conversion Based on Noisy Channel Model. IEEE Transactions on Audio, Speech & Language Processing 20(6): 1784-1794 (2012)
[j18]Yotaro Kubo, Shinji Watanabe, Takaaki Hori, Atsushi Nakamura: Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition. IEEE Transactions on Audio, Speech & Language Processing 20(8): 2240-2251 (2012)
[c64]Takuro Maruyama, Shoko Araki, Tomohiro Nakatani, Shigeki Miyabe, Takeshi Yamada, Shoji Makino, Atsushi Nakamura: New analytical update rule for TDOA inference for underdetermined BSS in noisy environments. ICASSP 2012: 269-272
[c63]Yotaro Kubo, Shinji Watanabe, Atsushi Nakamura, Simon Wiesler, Ralf Schlüter, Hermann Ney: Basis vector orthogonalization for an improved kernel gradient matching pursuit method. ICASSP 2012: 1909-1912
[c62]Yotaro Kubo, Shinji Watanabe, Atsushi Nakamura: Decoding network optimization using minimum transition error training. ICASSP 2012: 4197-4200
[c61]Shinji Watanabe, Yotaro Kubo, Takanobu Oba, Takaaki Hori, Atsushi Nakamura: Bag Of ARCS: New representation of speech segment features based on finite state machines. ICASSP 2012: 4201-4204
[c60]Marc Delcroix, Atsunori Ogawa, Shinji Watanabe, Tomohiro Nakatani, Atsushi Nakamura: Discriminative feature transforms using differenced maximum mutual information. ICASSP 2012: 4753-4756
[c59]Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura: Error type classification and word accuracy estimation using alignment features from word confusion network. ICASSP 2012: 4925-4928
[c58]Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito: Spoken document retrieval by discriminative modeling in a high dimensional feature space. ICASSP 2012: 5153-5156
[c57]Seong-Jun Hahm, Atsunori Ogawa, Masakiyo Fujimoto, Takaaki Hori, Atsushi Nakamura: Speaker Adaptation Using Variational Bayesian Linear Regression in Normalized Feature Space. INTERSPEECH 2012
[c56]Yotaro Kubo, Takaaki Hori, Atsushi Nakamura: Integrating Deep Neural Networks into Structural Classification Approach based on Weighted Finite-State Transducers. INTERSPEECH 2012
[c55]Naohiro Tawara, Tetsuji Ogawa, Shinji Watanabe, Atsushi Nakamura, Tetsunori Kobayashi: Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model. INTERSPEECH 2012
[c54]Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura: Recognition rate estimation based on word alignment network and discriminative error type classification. SLT 2012: 113-118- 2011
[j17]Atsunori Ogawa, Satoshi Takahashi, Atsushi Nakamura: Efficient Combination of Likelihood Recycling and Batch Calculation for Fast Acoustic Likelihood Calculation. IEICE Transactions 94-D(3): 648-658 (2011)
[c53]Yotaro Kubo, Simon Wiesler, Ralf Schlüter, Hermann Ney, Shinji Watanabe, Atsushi Nakamura, Tetsunori Kobayashi: Subspace pursuit method for kernel-log-linear models. ICASSP 2011: 4500-4503
[c52]Shinji Watanabe, Daichi Mochihashi, Takaaki Hori, Atsushi Nakamura: Gibbs sampling based Multi-scale Mixture Model for speaker clustering. ICASSP 2011: 4524-4527
[c51]Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu: High accurate model-integration-based voice conversion using dynamic features and model structure optimization. ICASSP 2011: 4576-4579
[c50]Atsunori Ogawa, Satoshi Takahashi, Atsushi Nakamura: Machine and acoustical condition dependency analyses for fast acoustic likelihood calculation techniques. ICASSP 2011: 5156-5159
[c49]Takanobu Oba, Takaaki Hori, Akinori Ito, Atsushi Nakamura: Round-robin duel discriminative language models in one-pass decoding with on-the-fly error correction. ICASSP 2011: 5588-5591
[c48]Shinji Watanabe, Atsushi Nakamura, Biing-Hwang Juang: Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution. INTERSPEECH 2011: 1081-1084
[c47]Shinji Motoki, Atsushi Nakamura: Lattice gauge theory on a multi-core processor, Cell/B.E. ICCS 2011: 860-868- 2010
[j16]Takanobu Oba, Takaaki Hori, Atsushi Nakamura: Improved Sequential Dependency Analysis Integrating Labeling-Based Sentence Boundary Detection. IEICE Transactions 93-D(5): 1272-1281 (2010)
[j15]Takanori Uno, Kouji Ichikawa, Yuichi Mabuchi, Atsushi Nakamura, Yuji Okazaki, Hideki Asai: An Approach for Practical Use of Common-Mode Noise Reduction Technique for In-Vehicle Electronic Equipment. IEICE Transactions 93-B(7): 1788-1796 (2010)
[j14]Shinji Motoki, Atsushi Nakamura, Koichi Hashimoto, Kiyoshi Mizumaru: Problem Solving Environment for Lattice QCD on Cell/B, E, . JCIT 5(4): 187-194 (2010)
[j13]Yotaro Kubo, Shinji Watanabe, Atsushi Nakamura, Erik McDermott, Tetsunori Kobayashi: A Sequential Pattern Classifier Based on Hidden Markov Kernel Machine and Its Application to Phoneme Classification. J. Sel. Topics Signal Processing 4(6): 974-984 (2010)
[j12]David Cournapeau, Shinji Watanabe, Atsushi Nakamura, Tatsuya Kawahara: Online Unsupervised Classification With Model Comparison in the Variational Bayes Framework for Voice Activity Detection. J. Sel. Topics Signal Processing 4(6): 1071-1083 (2010)
[j11]Shinji Watanabe, Atsushi Nakamura: Predictor-Corrector Adaptation by Using Time Evolution System With Macroscopic Time Scale. IEEE Transactions on Audio, Speech & Language Processing 18(2): 395-406 (2010)
[c46]Hiroyuki Hamasaki, Yasuhiko Hoshi, Atsushi Nakamura, Akihiro Yamamoto, Hideaki Kido, Shoji Muramatsu: SOC for car navigation system with a 55.3GOPS image recognition engine. ASP-DAC 2010: 464-465
[c45]Naoki Yasuraoka, Takuya Yoshioka, Tomohiro Nakatani, Atsushi Nakamura, Hiroshi G. Okuno: Music dereverberation using harmonic structure source model and Wiener filter. ICASSP 2010: 53-56
[c44]Hideyuki Watanabe, Shigeru Katagiri, Kouta Yamada, Erik McDermott, Atsushi Nakamura, Shinji Watanabe, Miho Ohsaki: Minimum Error Classification with geometric margin control. ICASSP 2010: 2170-2173
[c43]Atsunori Ogawa, Atsushi Nakamura: Discriminative confidence and error cause estimation for extended speech recognition function. ICASSP 2010: 4454-4457
[c42]David Cournapeau, Shinji Watanabe, Atsushi Nakamura, Tatsuya Kawahara: Using online model comparison in the Variational Bayes framework for online unsupervised Voice Activity Detection. ICASSP 2010: 4462-4465
[c41]Erik McDermott, Shinji Watanabe, Atsushi Nakamura: Discriminative training based on an integrated view of MPE and MMI in margin and error space. ICASSP 2010: 4894-4897
[c40]Shinji Watanabe, Takaaki Hori, Erik McDermott, Atsushi Nakamura: A discriminative model for continuous speech recognition based on Weighted Finite State Transducers. ICASSP 2010: 4922-4925
[c39]Takaaki Hori, Shinji Watanabe, Atsushi Nakamura: Search error risk minimization in Viterbi beam search for speech recognition. ICASSP 2010: 4934-4937
[c38]Takanobu Oba, Takaaki Hori, Atsushi Nakamura: A comparative study on methods of Weighted language model training for reranking lvcsr N-best hypotheses. ICASSP 2010: 5126-5129
[c37]Atsunori Ogawa, Atsushi Nakamura: A novel confidence measure based on marginalization of jointly estimated error cause probabilities. INTERSPEECH 2010: 242-245
[c36]Shinji Watanabe, Takaaki Hori, Atsushi Nakamura: Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data. INTERSPEECH 2010: 346-349
[c35]Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu: Probabilistic integration of joint density model and speaker model for voice conversion. INTERSPEECH 2010: 1728-1731
[c34]Takaaki Hori, Shinji Watanabe, Atsushi Nakamura: Improvements of search error risk minimization in viterbi beam search for speech recognition. INTERSPEECH 2010: 1962-1965
[c33]Takanobu Oba, Takaaki Hori, Atsushi Nakamura: Round-robin discrimination model for reranking ASR hypotheses. INTERSPEECH 2010: 2446-2449
[c32]Yotaro Kubo, Shinji Watanabe, Atsushi Nakamura, Tetsunori Kobayashi: A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination. INTERSPEECH 2010: 2954-2957
[c31]Yumi Ansa, Shoko Araki, Shoji Makino, Tomohiro Nakatani, Takeshi Yamada, Atsushi Nakamura, Nobuhiko Kitawaki: Cepstral smoothing of separated signals for underdetermined speech separation. ISCAS 2010: 2506-2509
[c30]Takaaki Hori, Shoko Araki, Takuya Yoshioka, Masakiyo Fujimoto, Shinji Watanabe, Takanobu Oba, Atsunori Ogawa, Kazuhiro Otsuka, Dan Mikami, Keisuke Kinoshita, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato: Real-time meeting recognition and understanding using distant microphones and omni-directional camera. SLT 2010: 424-429
2000 – 2009
- 2009
[c29]Atsushi Nakamura, Erik McDermott, Shinji Watanabe, Shigeru Katagiri: A unified view for discriminative objective functions based on negative exponential of difference measure between strings. ICASSP 2009: 1633-1636
[c28]Atsunori Ogawa, Satoshi Takahashi, Atsushi Nakamura: Efficient combination of likelihood recycling and batch calculation based on conditional fast processing and acoustic back-off. ICASSP 2009: 4161-4164
[c27]Shinji Watanabe, Atsushi Nakamura: On-line adaptation and Bayesian detection of environmental changes based on a macroscopic time evolution system. ICASSP 2009: 4373-4376
[c26]Erik McDermott, Shinji Watanabe, Atsushi Nakamura: Margin-space integration of MPE loss via differencing of MMI functionals for generalized error-weighted discriminative training. INTERSPEECH 2009: 224-227
[c25]Atsunori Ogawa, Atsushi Nakamura: Simultaneous estimation of confidence and error cause in speech recognition using discriminative model. INTERSPEECH 2009: 1199-1202- 2008
[j10]Takanobu Oba, Takaaki Hori, Atsushi Nakamura: Sequential dependency analysis for online spontaneous speech processing. Speech Communication 50(7): 616-625 (2008)
[c24]Shinji Watanabe, Atsushi Nakamura: A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches. ICASSP 2008: 4285-4288
[c23]Erik McDermott, Atsushi Nakamura: Flexible discriminative training based on equal error group scores obtained from an error-indexed forward-backward algorithm. INTERSPEECH 2008: 2398-2401
[c22]Izumi Fuse, Shigeto Okabe, Takashi Yamanoue, Atsushi Nakamura, Michio Nakanishi, Shozo Fukada, Takahiro Tagawa, Tatsumi Takeo, Ikuya Murata, Tetsutaro Uehara, Tsuneo Yamada: Improving computer ethics video clips for higher education. SIGUCCS 2008: 235-242- 2007
[j9]Erik McDermott, Timothy J. Hazen, Jonathan Le Roux, Atsushi Nakamura, Shigeru Katagiri: Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error. IEEE Transactions on Audio, Speech & Language Processing 15(1): 203-223 (2007)
[j8]Takaaki Hori, Chiori Hori, Yasuhiro Minami, Atsushi Nakamura: Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition. IEEE Transactions on Audio, Speech & Language Processing 15(4): 1352-1365 (2007)
[c21]Yasuhiro Minami, Minako Sawaki, Kohji Dohsaka, Ryuichiro Higashinaka, Kentaro Ishizuka, Hideki Isozaki, Tatsushi Matsubayashi, Masato Miyoshi, Atsushi Nakamura, Takanobu Oba, Hiroshi Sawada, Takeshi Yamada, Eisaku Maeda: The world of mushrooms: human-computer interaction prototype systems for ambient intelligence. ICMI 2007: 366-373
[c20]Takanobu Oba, Takaaki Hori, Atsushi Nakamura: An approach to efficient generation of high-accuracy and compact error-corrective models for speech recognition. INTERSPEECH 2007: 1753-1756
[c19]Erik McDermott, Atsushi Nakamura: String and lattice based discriminative training for the corpus of spontaneous Japanese lecture transcription task. INTERSPEECH 2007: 2081-2084- 2006
[j7]Shinji Watanabe, Atsushi Nakamura: Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework. IEICE Transactions 89-D(3): 970-980 (2006)
[j6]Erik McDermott, Atsushi Nakamura: Production-Oriented Models for Speech Recognition. IEICE Transactions 89-D(3): 1006-1014 (2006)
[j5]Shinji Watanabe, Atsushi Sako, Atsushi Nakamura: Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition. IEEE Transactions on Audio, Speech & Language Processing 14(3): 855-872 (2006)
[j4]Parham Zolfaghari, Hiroko Kato, Yasuhiro Minami, Atsushi Nakamura, Shigeru Katagiri, Roy Patterson: Dynamic Assignment of Gaussian Components in Modelling Speech Spectra. VLSI Signal Processing 45(1-2): 7-19 (2006)
[c18]Takanobu Oba, Takaaki Hori, Atsushi Nakamura: Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking. INTERSPEECH 2006- 2005
[j3]Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda: Selection of Shared-State Hidden Markov Model Structure Using Bayesian Criterion. IEICE Transactions 88-D(1): 1-9 (2005)
[c17]Takaaki Hori, Atsushi Nakamura: Generalized fast on-the-fly composition algorithm for WFST-based speech recognition. INTERSPEECH 2005: 557-560
[c16]Shinji Watanabe, Atsushi Nakamura: Effects of Bayesian predictive classification using variational Bayesian posteriors for sparse training data in speech recognition. INTERSPEECH 2005: 1105-1108
[c15]Mike Schuster, Takaaki Hori, Atsushi Nakamura: Experiments with probabilistic principal component analysis in LVCSR. INTERSPEECH 2005: 1685-1688
[c14]
[c13]Takashi Yamanoue, Michio Nakanishi, Atsushi Nakamura, Izumi Fuse, Ikuya Murata, Shozo Fukada, Takahiro Tagawa, Tatsumi Takeo, Shigeto Okabe, Tsuneo Yamada: Digital video clips covering computer ethics in higher education. SIGUCCS 2005: 456-461- 2004
[j2]Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda: Variational bayesian estimation and clustering for speech recognition. IEEE Transactions on Speech and Audio Processing 12(4): 365-381 (2004)
[c12]Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri: A theoretical analysis of speech recognition based on feature trajectory models. INTERSPEECH 2004- 2002
[j1]Atsushi Nakamura: Restructuring Gaussian mixture density functions in speaker-independent acoustic models. Speech Communication 36(3-4): 277-289 (2002)
[c11]Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri: A recognition method with parametric trajectory synthesized using direct relations between static and dynamic feature vector time series. ICASSP 2002: 957-960
[c10]Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda: Constructing shared-state hidden Markov models based on a Bayesian approach. INTERSPEECH 2002
[c9]Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda: Application of Variational Bayesian Approach to Speech Recognition. NIPS 2002: 1237-1244- 2000
[c8]Rainer Gruhn, Harald Singer, Hajime Tsukada, Masaki Naito, Atsushi Nishino, Atsushi Nakamura, Yoshinori Sagisaka, Satoshi Nakamura: Cellular-phone based speech-to-speech translation system ATR-MATRIX. INTERSPEECH 2000: 448-451
1990 – 1999
- 1999
[c7]Tomoko Matsui, Masaki Naito, Harald Singer, Atsushi Nakamura, Yoshinori Sagisaka: Japanese spontaneous speech database with wide regional and age distribution. EUROSPEECH 1999
[c6]Atsushi Nakamura, Tomoko Matsui: Acoustic modeling based on a generalized laplacian distribution. EUROSPEECH 1999
[c5]Harald Singer, Atsushi Nakamura: Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees. EUROSPEECH 1999- 1997
[c4]- 1996
[c3]Atsushi Nakamura, Shoichi Matsunaga, Tohru Shimizu, Masahiro Tonomura, Yoshinori Sagisaka: Japanese speech databases for robust speech recognition. ICSLP 1996- 1995
[c2]Atsushi Nakamura: A minimum error training of garbage model for keyword spotter with artificially generated training data. EUROSPEECH 1995- 1994
[c1]Tsuyoshi Morimoto, Noriyoshi Uratani, Toshiyuki Takezawa, Osamu Furuse, Yasuhiro Sobashima, Hitoshi Iida, Atsushi Nakamura, Yoshinori Sagisaka, Norio Higuchi, Yasuhiro Yamazaki: A speech and language database for speech translation research. ICSLP 1994
Coauthor Index
data released under the ODC-BY 1.0 license. See also our legal information page
last updated on 2013-06-07 00:47 CEST by the dblp team



