 | 2010 |
| 9 |  | Lae-Hoon Kim,
Mark Hasegawa-Johnson,
Gerasimos Potamianos,
Vit Libal:
Joint estimation of DOA and speech based on EM beamforming.
ICASSP 2010: 121-124 |
| 2009 |
| 8 |  | Jing Huang,
Xiaodan Zhuang,
Vit Libal,
Gerasimos Potamianos:
Long-time span acoustic activity analysis from far-field sensors in smart homes.
ICASSP 2009: 4173-4176 |
| 7 |  | Kshitiz Kumar,
Jiri Navratil,
Etienne Marcheret,
Vit Libal,
Gerasimos Potamianos:
Robust audio-visual speech synchrony detection by generalized bimodal linear prediction.
INTERSPEECH 2009: 2251-2254 |
| 6 |  | Vit Libal,
Bhuvana Ramabhadran,
Nadia Mana,
Fabio Pianesi,
Paul Chippendale,
Oswald Lanz,
Gerasimos Potamianos:
Multimodal Classification of Activities of Daily Living Inside Smart Homes.
IWANN (2) 2009: 687-694 |
| 2007 |
| 5 |  | Jing Huang,
Etienne Marcheret,
Karthik Visweswariah,
Vit Libal,
Gerasimos Potamianos:
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings.
CLEAR 2007: 429-441 |
| 4 |  | Jing Huang,
Etienne Marcheret,
Karthik Visweswariah,
Vit Libal,
Gerasimos Potamianos:
Detection, diarization, and transcription of far-field lecture speech.
INTERSPEECH 2007: 2161-2164 |
| 2006 |
| 3 |  | Jing Huang,
Martin Westphal,
Stanley F. Chen,
Olivier Siohan,
Daniel Povey,
Vit Libal,
Alvaro Soneiro,
Henrik Schulz,
Thomas Ross,
Gerasimos Potamianos:
The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings.
MLMI 2006: 432-443 |
| 2004 |
| 2 |  | Stephen M. Chu,
Vit Libal,
Etienne Marcheret,
Chalapathy Neti,
Gerasimos Potamianos:
Multistage information fusion for audio-visual speech recognition.
ICME 2004: 1651-1654 |
| 1 |  | Patricia Scanlon,
Gerasimos Potamianos,
Vit Libal,
Stephen M. Chu:
Mutual information based visual feature selection for lipreading.
INTERSPEECH 2004 |