 | 2010 |
| 25 |  | Kentaro Ishizuka,
Shoko Araki,
Tatsuya Kawahara:
Speech Activity Detection for Multi-Party Conversation Analyses Based on Likelihood Ratio Test on Spatial Magnitude.
IEEE Transactions on Audio, Speech & Language Processing 18(6): 1354-1365 (2010) |
| 24 |  | Kentaro Ishizuka,
Tomohiro Nakatani,
Masakiyo Fujimoto,
Noboru Miyazaki:
Noise robust voice activity detection based on periodic to aperiodic component ratio.
Speech Communication 52(1): 41-60 (2010) |
| 2009 |
| 23 |  | Kazuhiro Otsuka,
Shoko Araki,
Dan Mikami,
Kentaro Ishizuka,
Masakiyo Fujimoto,
Junji Yamato:
Realtime meeting analysis and 3D meeting viewer based on omnidirectional multimodal sensors.
ICMI 2009: 219-220 |
| 22 |  | Kentaro Ishizuka,
Shoko Araki,
Kazuhiro Otsuka,
Tomohiro Nakatani,
Masakiyo Fujimoto:
A speaker diarization method based on the probabilistic fusion of audio-visual location information.
ICMI 2009: 55-62 |
| 21 |  | Masakiyo Fujimoto,
Kentaro Ishizuka,
Tomohiro Nakatani:
A study of mutual front-end processing method based on statistical model for noise robust speech recognition.
INTERSPEECH 2009: 1235-1238 |
| 2008 |
| 20 |  | Masakiyo Fujimoto,
Kentaro Ishizuka,
Tomohiro Nakatani:
A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme.
ICASSP 2008: 4441-4444 |
| 19 |  | Shoko Araki,
Masakiyo Fujimoto,
Kentaro Ishizuka,
Hiroshi Sawada,
Shoji Makino:
Speaker indexing and speech enhancement in real meetings / conversations.
ICASSP 2008: 93-96 |
| 18 |  | Kazuhiro Otsuka,
Shoko Araki,
Kentaro Ishizuka,
Masakiyo Fujimoto,
Martin Heinrich,
Junji Yamato:
A realtime multimodal system for analyzing group meetings by combining face pose tracking and speaker diarization.
ICMI 2008: 257-264 |
| 17 |  | Tatsuya Kawahara,
Hisao Setoguchi,
Katsuya Takanashi,
Kentaro Ishizuka,
Shoko Araki:
Multi-modal recording, analysis and indexing of poster sessions.
INTERSPEECH 2008: 1622-1625 |
| 16 |  | Masakiyo Fujimoto,
Kentaro Ishizuka,
Tomohiro Nakatani:
Study of integration of statistical model-based voice activity detection and noise suppression.
INTERSPEECH 2008: 2008-2011 |
| 15 |  | Kentaro Ishizuka,
Shoko Araki,
Tatsuya Kawahara:
Statistical speech activity detection based on spatial power distribution for analyses of poster presentations.
INTERSPEECH 2008: 99-102 |
| 14 |  | Masakiyo Fujimoto,
Kentaro Ishizuka:
Noise Robust Voice Activity Detection Based on Switching Kalman Filter.
IEICE Transactions 91-D(3): 467-477 (2008) |
| 13 |  | Tomohiro Nakatani,
Shigeaki Amano,
Toshio Irino,
Kentaro Ishizuka,
Tadahisa Kondo:
A method for fundamental frequency estimation and voicing decision: Application to infant utterances recorded in real acoustical environments.
Speech Communication 50(3): 203-214 (2008) |
| 12 |  | Hiroko Kato Solvang,
Kentaro Ishizuka,
Masakiyo Fujimoto:
Voice activity detection based on adjustable linear prediction and GARCH models.
Speech Communication 50(6): 476-486 (2008) |
| 2007 |
| 11 |  | Yasuhiro Minami,
Minako Sawaki,
Kohji Dohsaka,
Ryuichiro Higashinaka,
Kentaro Ishizuka,
Hideki Isozaki,
Tatsushi Matsubayashi,
Masato Miyoshi,
Atsushi Nakamura,
Takanobu Oba,
Hiroshi Sawada,
Takeshi Yamada,
Eisaku Maeda:
The world of mushrooms: human-computer interaction prototype systems for ambient intelligence.
ICMI 2007: 366-373 |
| 10 |  | Kentaro Ishizuka,
Tomohiro Nakatani,
Masakiyo Fujimoto,
Noboru Miyazaki:
Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio.
INTERSPEECH 2007: 230-233 |
| 9 |  | Masakiyo Fujimoto,
Kentaro Ishizuka:
Noise robust voice activity detection based on switching kalman filter.
INTERSPEECH 2007: 2933-2936 |
| 2006 |
| 8 |  | Kentaro Ishizuka,
Tomohiro Nakatani:
A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition.
Speech Communication 48(11): 1447-1457 (2006) |
| 2005 |
| 7 |  | Kentaro Ishizuka,
Ryoko Mugitani,
Hiroko Kato Solvang,
Shigeaki Amano:
A longitudinal analysis of the spectral peaks of vowels for a Japanese infant.
INTERSPEECH 2005: 1169-1172 |
| 2004 |
| 6 |  | Kentaro Ishizuka,
Noboru Miyazaki,
Tomohiro Nakatani,
Yasuhiro Minami:
Improvement in robustness of speech feature extraction method using sub-band based periodicity and aperiodicity decomposition.
INTERSPEECH 2004 |
| 5 |  | Yuichi Ishimoto,
Kentaro Ishizuka,
Kiyoaki Aikawa,
Masato Akagi:
Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features.
IEICE Transactions 87-D(1): 205-214 (2004) |
| 2002 |
| 4 |  | Kiyoaki Aikawa,
Kentaro Ishizuka:
Noise-robust speech recognition using a new spectral estimation method "PHASOR".
ICASSP 2002: 397-400 |
| 3 |  | Kentaro Ishizuka,
Kiyoaki Aikawa:
Effect of F0 fluctuation and amplitude modulation of natural vowels on vowel identification in noisy environments.
INTERSPEECH 2002 |
| 2000 |
| 2 |  | Yoshinari Kameda,
Kentaro Ishizuka,
Michihiko Minoh:
A Live Video Imaging Method for Capturing Presentation Information in Distance Learning.
IEEE International Conference on Multimedia and Expo (III) 2000: 1237-1240 |
| 1998 |
| 1 |  | Tatsuya Kawahara,
Kentaro Ishizuka,
Shuji Doshita,
Chin-Hui Lee:
Speaking-style dependent lexicalized filler model for key-phrase detection and verification.
ICSLP 1998 |