| 2012 | ||
|---|---|---|
| c44 | Panagiotis Giannoulis, Gerasimos Potamianos: A hierarchical approach with feature selection for emotion recognition from speech. LREC 2012: 1203-1206 | |
| c43 | Georgios Galatas, Gerasimos Potamianos, Fillia Makedon: Audio-visual speech recognition using depth information from the Kinect in noisy video conditions. PETRA 2012: 2 | |
| 2011 | ||
| j8 | S.-H. Gary Chan, J. Li, Pascal Frossard, Gerasimos Potamianos: Special Section on Interactive Multimedia. IEEE Transactions on Multimedia 13(5): 841-843 (2011) | |
| c42 | Georgios Galatas, Gerasimos Potamianos, Alexandros Papangelis, Fillia Makedon: Audio visual speech recognition in noisy visual environments. PETRA 2011: 19 | |
| 2010 | ||
| c41 | Lae-Hoon Kim, Mark Hasegawa-Johnson, Gerasimos Potamianos, Vit Libal: Joint estimation of DOA and speech based on EM beamforming. ICASSP 2010: 121-124 | |
| 2009 | ||
| c40 | Gerasimos Potamianos: Audio-visual automatic speech recognition and related bimodal speech technologies: A review of the state-of-the-art and open problems. ASRU 2009: 22 | |
| c39 | Xiaodan Zhuang, Jing Huang, Gerasimos Potamianos, Mark Hasegawa-Johnson: Acoustic fall detection using Gaussian mixture models and GMM supervectors. ICASSP 2009: 69-72 | |
| c38 | Jing Huang, Xiaodan Zhuang, Vit Libal, Gerasimos Potamianos: Long-time span acoustic activity analysis from far-field sensors in smart homes. ICASSP 2009: 4173-4176 | |
| c37 | Kshitiz Kumar, Jiri Navratil, Etienne Marcheret, Vit Libal, Gerasimos Potamianos: Robust audio-visual speech synchrony detection by generalized bimodal linear prediction. INTERSPEECH 2009: 2251-2254 | |
| c36 | Vit Libal, Bhuvana Ramabhadran, Nadia Mana, Fabio Pianesi, Paul Chippendale, Oswald Lanz, Gerasimos Potamianos: Multimodal Classification of Activities of Daily Living Inside Smart Homes. IWANN (2) 2009: 687-694 | |
| p2 | Keni Bernardin, Rainer Stiefelhagen, Aristodemos Pnevmatikakis, Oswald Lanz, Alessio Brutti, Josep R. Casas, Gerasimos Potamianos: Person Tracking. Computers in the Human Interaction Loop 2009: 11-22 | |
| p1 | Gerasimos Potamianos, Lori Lamel, Matthias Wölfel, Jing Huang, Etienne Marcheret, Claude Barras, Xuan Zhu, John W. McDonough, Javier Hernando, Dusan Macho, Climent Nadeu: Automatic Speech Recognition. Computers in the Human Interaction Loop 2009: 43-59 | |
| 2008 | ||
| c35 | Rajesh Balchandran, Mark E. Epstein, Gerasimos Potamianos, Ladislav Serédi: A multi-modal spoken dialog system for interactive TV. ICMI 2008: 191-192 | |
| 2007 | ||
| j7 | Djamel Mostefa, Nicolas Moreau, Khalid Choukri, Gerasimos Potamianos, Stephen M. Chu, Ambrish Tyagi, Josep R. Casas, Jordi Turmo, Luca Cristoforetti, Francesco Tobia, Aristodemos Pnevmatikakis, Vasileios Mylonakis, Fotios Talantzis, Susanne Burger, Rainer Stiefelhagen, Keni Bernardin, Cedrick Rochet: The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms. Language Resources and Evaluation 41(3-4): 389-407 (2007) | |
| j6 | ZhenQiu Zhang, Gerasimos Potamianos, Andrew W. Senior, Thomas S. Huang: Joint face and head tracking inside multi-camera smart rooms. Signal, Image and Video Processing 1(2): 163-178 (2007) | |
| c34 | Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos: The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings. CLEAR 2007: 429-441 | |
| c33 | Jing Huang, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos: The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings. CLEAR 2007: 497-508 | |
| c32 | Ambrish Tyagi, Mark A. Keck, James W. Davis, Gerasimos Potamianos: Kernel-Based 3D Tracking. CVPR 2007 | |
| c31 | Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan: A unified approach to multi-pose audio-visual ASR. INTERSPEECH 2007: 650-653 | |
| c30 | Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos: Detection, diarization, and transcription of far-field lecture speech. INTERSPEECH 2007: 2161-2164 | |
| 2006 | ||
| c29 | Gerasimos Potamianos, ZhenQiu Zhang: A Joint System for Single-Person 2D-Face and 3D-Head Tracking in CHIL Seminars. CLEAR 2006: 105-118 | |
| c28 | ZhenQiu Zhang, Gerasimos Potamianos, Ming Liu, Thomas S. Huang: Robust Multi-View Multi-Camera Face Detection inside Smart Rooms Using Spatio-Temporal Dynamic Programming. FG 2006: 407-412 | |
| c27 | ZhenQiu Zhang, Gerasimos Potamianos, Stephen M. Chu, Jilin Tu, Thomas S. Huang: Person Tracking in Smart Rooms using Dynamic Programming and Adaptive Subspace Learning. ICME 2006: 2061-2064 | |
| c26 | Etienne Marcheret, Gerasimos Potamianos, Karthik Visweswariah, Jing Huang: The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars. MLMI 2006: 323-335 | |
| c25 | Jing Huang, Martin Westphal, Stanley F. Chen, Olivier Siohan, Daniel Povey, Vit Libal, Alvaro Soneiro, Henrik Schulz, Thomas Ross, Gerasimos Potamianos: The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings. MLMI 2006: 432-443 | |
| 2005 | ||
| c24 | ZhenQiu Zhang, Gerasimos Potamianos, Andrew W. Senior, Stephen M. Chu, Thomas S. Huang: A Joint System for Person Tracking and Face Detection. ICCV-HCI 2005: 47-59 | |
| c23 | Dusan Macho, Jaume Padrell, Alberto Abad, Climent Nadeu, Javier Hernando, John W. McDonough, Matthias Wölfel, Ulrich Klee, Maurizio Omologo, Alessio Brutti, Piergiorgio Svaizer, Gerasimos Potamianos, Stephen M. Chu: Automatic Speech Activity Detection, Source Localization, and Speech Recognition on the Chil Seminar Corpus. ICME 2005: 876-879 | |
| c22 | Jintao Jiang, Gerasimos Potamianos, Giridharan Iyengar: Improved face finding in visually challenging environments. ICME 2005: 1078-1081 | |
| c21 | Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos: Speech activity detection fusing acoustic phonetic and energy features. INTERSPEECH 2005: 241-244 | |
| c20 | Stephen M. Chu, Etienne Marcheret, Gerasimos Potamianos: Automatic Speech Recognition and Speech Activity Detection in the CHIL Smart Room. MLMI 2005: 332-343 | |
| 2004 | ||
| j5 | Jing Huang, Gerasimos Potamianos, Jonathan Connell, Chalapathy Neti: Audio-visual speech recognition using an infrared headset. Speech Communication 44(1-4): 83-96 (2004) | |
| c19 | Stephen M. Chu, Vit Libal, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos: Multistage information fusion for audio-visual speech recognition. ICME 2004: 1651-1654 | |
| c18 | Etienne Marcheret, Stephen M. Chu, Vaibhava Goel, Gerasimos Potamianos: Efficient likelihood computation in multi-stream HMM based audio-visual speech recognition. INTERSPEECH 2004 | |
| c17 | Patricia Scanlon, Gerasimos Potamianos, Vit Libal, Stephen M. Chu: Mutual information based visual feature selection for lipreading. INTERSPEECH 2004 | |
| 2003 | ||
| c16 | Upendra V. Chaudhari, Ganesh N. Ramaswamy, Gerasimos Potamianos, Chalapathy Neti: Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction. ICME 2003: 9-12 | |
| c15 | Jonathan H. Connell, Norman Haas, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos, Senem Velipasalar: A real-time prototype for small-vocabulary audio-visual ASR. ICME 2003: 469-472 | |
| c14 | Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang: Frame-dependent multi-stream reliability indicators for audio-visual speech recognition. ICME 2003: 605-608 | |
| c13 | Gerasimos Potamianos, Chalapathy Neti: Audio-visual speech recognition in challenging environments. INTERSPEECH 2003 | |
| 2002 | ||
| j4 | Chalapathy Neti, Gerasimos Potamianos, Juergen Luettin, Eric Vatikiotis-Bateson: Editorial. EURASIP J. Adv. Sig. Proc. 2002(11): 1151-1153 (2002) | |
| c12 | Guillaume Gravier, Scott Axelrod, Gerasimos Potamianos, Chalapathy Neti: Maximum entropy and MCE based HMM stream weight estimation for audio-visual ASR. ICASSP 2002: 853-856 | |
| c11 | Roland Goecke, Gerasimos Potamianos, Chalapathy Neti: Noisy audio feature enhancement using audio-visual speech data. ICASSP 2002: 2025-2028 | |
| c10 | Sabine Deligne, Gerasimos Potamianos, Chalapathy Neti: Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization). INTERSPEECH 2002 | |
| 2001 | ||
| c9 | Gerasimos Potamianos, Chalapathy Neti: Improved ROI and within frame discriminant features for lipreading. ICIP (3) 2001: 250-253 | |
| c8 | Iain Matthews, Gerasimos Potamianos, Chalapathy Neti, Juergen Luettin: A Comparison Of Model And Transform-Based Visual Features For Audio-Visual LVCSR. ICME 2001 | |
| c7 | Gerasimos Potamianos, Chalapathy Neti, Giridharan Iyengar, Eric Helmuth: Large-vocabulary audio-visual speech recognition by machines and humans. INTERSPEECH 2001: 1027-1030 | |
| 2000 | ||
| c6 | Eric Cosatto, Gerasimos Potamianos, Hans Peter Graf: Audio-Visual Unit Selection for the Synthesis of Photo-Realistic Talking-Heads. IEEE International Conference on Multimedia and Expo (II) 2000: 619-622 | |
| c5 | Gerasimos Potamianos, Ashish Verma, Chalapathy Neti, Giridharan Iyengar, Sankar Basu: A Cascade Image Transform for Speaker Independent Automatic Speech Reading. IEEE International Conference on Multimedia and Expo (II) 2000: 1097- | |
| c4 | Chalapathy Neti, Giridharan Iyengar, Gerasimos Potamianos, Andrew W. Senior, Benoît Maison: Perceptual interfaces for information interaction: joint processing of audio and visual information for human-computer interaction. INTERSPEECH 2000: 11-14 | |
| c3 | Gerasimos Potamianos, Chalapathy Neti: Stream confidence estimation for audio-visual speech recognition. INTERSPEECH 2000: 746-749 | |
| 1999 | ||
| c2 | Gerasimos Potamianos, Alexandros Potamianos: Speaker adaptation for audio-visual speech recognition. EUROSPEECH 1999 | |
| 1998 | ||
| j3 | Gerasimos Potamianos, Frederick Jelinek: A study of n-gram and decision tree letter language modeling methods. Speech Communication 24(3): 171-192 (1998) | |
| c1 | Gerasimos Potamianos, Hans Peter Graf, Eric Cosatto: An Image Transform Approach for HMM based Automatic Lipreading. ICIP (3) 1998: 173-177 | |
| 1997 | ||
| j2 | Gerasimos Potamianos, John K. Goutsias: Stochastic approximation algorithms for partition function estimation of Gibbs random fields. IEEE Transactions on Information Theory 43(6): 1948-1965 (1997) | |
| 1993 | ||
| j1 | Gerasimos Potamianos, John K. Goutsias: Partition function estimation of Gibbs random field images using Monte Carlo simulations. IEEE Transactions on Information Theory 39(4): 1322-1332 (1993) | |
Colors in the list of coauthors
Last update Wed May 22 10:53:58 2013 CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page