1. MLMI 2004: Martigny, Switzerland
Samy Bengio, Hervé Bourlard (Eds.): Machine Learning for Multimodal Interaction, First International Workshop,MLMI 2004, Martigny, Switzerland, June 21-23, 2004, Revised Selected Papers. Springer 2004 Lecture Notes in Computer Science ISBN 3-540-24509-X
HCI and Applications
Simon Tucker, Steve Whittaker: Accessing Multimodal Meeting Data: Systems, Problems and Possibilities. 1-11
Dennis Reidsma, Rutger Rienks, Natasa Jovanovic: Meeting Modelling in the Context of Multimodal Research. 22-35
Yorick Wilks: Artificial Companions. 36-45
Max Froumentin: Zakim - A Multimodal Software System for Large-Scale Teleconferencing. 46-55
Structuring and Interaction
Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard: Towards Computer Understanding of Human Interactions. 56-75
Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis: Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives. 87-100
Nicolas Moënne-Loccoz, Bruno Janvier, Stéphane Marchand-Maillet, Eric Bruno: An Integrated Framework for the Management of Video Collection. 101-110
Jean Carletta, Jonathan Kilgour: The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing. 111-121
Multimodal Processing
Nuria Oliver, Eric Horvitz: S-SEER: Selective Perception in a Multimodal Office Activity Recognition System. 122-135
Tue Lehn-Schiøler, Lars Kai Hansen, Jan Larsen: Mapping from Speech to Images Using Continuous State Space Models. 136-145
Ofer Dekel, Joseph Keshet, Yoram Singer: An Online Algorithm for Hierarchical Phoneme Classification. 146-158
Norman Poh, Samy Bengio: Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks. 159-172
Julien Meynet, Vlad Popovici, Jean-Philippe Thiran: Mixture of SVMs for Face Class Modeling. 173-181
Guillaume Lathoud, Jean-Marc Odobez, Daniel Gatica-Perez: AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking. 182-195
Speech Processing
Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, Tuomo W. Pirinen, Ivan Bulyko, David Gelbart, Martin Graciarena, Scott Otterson, Barbara Peskin, Mari Ostendorf: The 2004 ICSI-SRI-UW Meeting Recognition System. 196-208
Mathew Magimai-Doss, Hervé Bourlard: On the Adequacy of Baseform Pronunciations and Pronunciation Variants. 209-222
Qifeng Zhu, Barry Y. Chen, Nelson Morgan, Andreas Stolcke: Tandem Connectionist Feature Extraction for Conversational Speech Recognition. 223-231
Barry Y. Chen, Qifeng Zhu, Nelson Morgan: Long-Term Temporal Features for Conversational Speech Recognition. 232-242
Hagai Aronowitz, David Burshtein, Amihood Amir: Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation. 243-252
Mikko Kurimo, Ville T. Turunen, Inger Ekman: Speech Transcription and Spoken Document Retrieval in Finnish. 253-262
Harald Romsdorfer, Beat Pfister, René Beutler: A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System. 263-276
Dialogue Management
Andrei Popescu-Belis, Alexander Clark, Maria Georgescul, Denis Lalanne, Sandrine Zufferey: Shallow Dialogue Processing Using Machine Learning Algorithms (or Not). 277-290
Agnes Lisowska, Martin Rajman, Trung H. Bui: ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings. 291-304
Vision and Emotion

T. Balomenos, Amaryllis Raouzaiou, Spiros Ioannou, Athanasios I. Drosopoulos, Kostas Karpouzis, Stefanos D. Kollias: Emotion Analysis in Man-Machine Interaction Systems. 318-328
Philipp Zehnder, Esther Koller-Meier, Luc J. Van Gool: A Hierarchical System for Recognition, Tracking and Pose Estimation. 329-340
Santiago Venegas-Martinez, Gianluca Antonini, Jean-Philippe Thiran, Michel Bierlaire: Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques. 341-348
Mihai Osian, Tinne Tuytelaars, Luc J. Van Gool: A Shape Based, Viewpoint Invariant Local Descriptor. 349-359



