CLEF 2007: Budapest, Hungary
Carol Peters, Valentin Jijkoun, Thomas Mandl, Henning Müller, Douglas W. Oard, Anselmo Peñas, Vivien Petras, Diana Santos (Eds.): Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers. Springer 2008 Lecture Notes in Computer Science ISBN 978-3-540-85759-4
Introduction
Carol Peters: What Happened in CLEF 2007. 1-12
Part I: Multilingual Textual Document Retrieval (Ad Hoc)
Giorgio Maria Di Nunzio, Nicola Ferro, Thomas Mandl, Carol Peters: CLEF 2007: Ad Hoc Track Overview. 13-32
Monolingual


Elisa Noguera, Fernando Llopis: Applying Query Expansion Techniques to Ad Hoc Monolingual Tasks with the IR-n System. 45-48
Prasenjit Majumder, Mandar Mitra, Dipasree Pal: Bulgarian, Hungarian and Czech Stemming Using YASS. 49-56
Stephen Tomlinson: Sampling Precision to Depth 10000 at CLEF 2007. 57-63
Cross-Language: European
Dong Zhou, Mark Truran, Tim J. Brailsford: Disambiguation and Unknown Term Translation in Cross Language Information Retrieval. 64-71
Péter Schönhofen, András A. Benczúr, István Bíró, Károly Csalogány: Cross-Language Retrieval with Wikipedia. 72-79
Cross-Language: Non-European
Jagadeesh Jagarlamudi, A. Kumaran: Cross-Lingual Information Retrieval System for Indian Languages. 80-87
Sivaji Bandyopadhyay, Tapabrata Mondal, Sudip Kumar Naskar, Asif Ekbal, Rejwanul Haque, Srinivasa Rao Godhavarthy: Bengali, Hindi and Telugu to English Ad-Hoc Bilingual Task at CLEF 2007. 88-94
Debasis Mandal, Mayank Gupta, Sandipan Dandapat, Pratyush Banerjee, Sudeshna Sarkar: Bengali and Hindi to English CLIR Evaluation. 95-102
Prasad Pingali, Kula Kekeba Tune, Vasudeva Varma: Improving Recall for Hindi, Telugu, Oromo to English CLIR. 103-110
Manoj Kumar Chinnakotla, Sagar Ranadive, Om P. Damani, Pushpak Bhattacharyya: Hindi to English and Marathi to English Cross Language Information Retrieval Evaluation. 111-118
Atelach Alemu Argaw: Amharic-English Information Retrieval with Pseudo Relevance Feedback. 119-126
Mirna Adriani, Herika Hayurani, Syandra Sari: Indonesian-English Transitive Translation for Cross-Language Information Retrieval. 127-133
Robot

Fernando Martínez Santiago, Arturo Montejo Ráez, Miguel Angel García Cumbreras: SINAI at CLEF Ad-Hoc Robust Track 2007: Applying Google Search Engine for Robust Cross-Lingual Retrieval. 137-142
Ángel F. Zazo Rodríguez, José Luis Alonso Berrocal, Carlos G. Figuerola: Improving Robustness Using Query Expansion. 143-147
Jesús Vilares, Michael P. Oakes, Manuel Vilares Ferro: English-to-French CLIR: A Knowledge-Light Approach through Character N-Grams Alignment. 148-155
José Carlos González Cristóbal, José Miguel Goñi-Menoyo, Julio Villena-Román, Sara Lana-Serrano: MIRACLE Progress in Monolingual Information Retrieval at Ad-Hoc CLEF 2007. 156-159
Part II: Domain-Specific Information Retrieval (Domain-Specific)
Vivien Petras, Stefan Baerisch, Maximilian Stempfhuber: The Domain-Specific Track at CLEF 2007. 160-173
Jens Kürsten, Thomas Wilhelm, Maximilian Eibl: The XTRIEVAL Framework at CLEF 2007: Domain-Specific Track. 174-181
Ray R. Larson: Experiments in Classification Clustering and Thesaurus Expansion for Domain Specific Cross-Language Retrieval. 188-195
Claire Fautsch, Ljiljana Dolamic, Samir Abdou, Jacques Savoy: Domain-Specific IR for German, English and Russian Languages. 196-199
Part III: Multiple Language Question Answering (QA@CLEF)
Danilo Giampiccolo, Pamela Forner, Jesús Herrera, Anselmo Peñas, Christelle Ayache, Corina Forascu, Valentin Jijkoun, Petya Osenova, Paulo Rocha, Bogdan Sacaleanu, Richard F. E. Sutcliffe: Overview of the CLEF 2007 Multilingual Question Answering Track. 200-236
Anselmo Peñas, Álvaro Rodrigo, Felisa Verdejo: Overview of the Answer Validation Exercise 2007. 237-248
Jordi Turmo, Pere Comas, Christelle Ayache, Djamel Mostefa, Sophie Rosset, Lori Lamel: Overview of QAST 2007. 249-256
Main Task: Mono- and Bilingual QA
Gosse Bouma, Geert Kloosterman, Jori Mur, Gertjan van Noord, Lonneke van der Plas, Jörg Tiedemann: Question Answering with Joost at CLEF 2007. 257-260
Sven Hartrumpf, Ingo Glöckner, Johannes Leveling: Coreference Resolution for Questions and Answer Merging by Validation. 269-272
Mitchell Bowden, Marian Olteanu, Pasin Suriyentrakorn, Thomas D'Silva, Dan I. Moldovan: Multilingual Question Answering through Intermediate Translation: LCC's PowerAnswer at QA@CLEF 2007. 273-283
Dan Tufis, Dan Stefanescu, Radu Ion, Alexandru Ceausu: RACAI's Question Answering System at QA@CLEF2007. 284-291



Davide Buscaldi, Yassine Benajiba, Paolo Rosso, Emilio Sanchis: Web-Based Anaphora Resolution for the QUASAR Question Answering System. 324-327
Alberto Téllez-Valero, Antonio Juárez, Gustavo Hernández, Claudia Denicia-Carral, Esaú Villatoro-Tello, Manuel Montes-y-Gómez, Luis Villaseñor Pineda: A Lexical Approach for Spanish Question Answering. 328-331
Adrian Iftene, Diana Trandabat, Ionut Pistol, Mihai Alex Moruz, Alexandra Balahur, Diana Cotelea, Iustin Dornescu, Iuliana Draghici, Dan Cristea: UAIC Romanian QA System for QA@CLEF. 336-343
Valentin Jijkoun, Katja Hofmann, David Ahn, Mahboob Alam Khalid, Joris van Rantwijk, Maarten de Rijke, Erik F. Tjong Kim Sang: The University of Amsterdam's Question Answering System at QA@CLEF 2007. 344-351
César de Pablo-Sánchez, José Luis Martínez-Fernández, Ana González-Ledesma, Doaa Samy, Paloma Martínez, Antonio Moreno-Sandoval, Harith T. Al-Jumaily: Combining Wikipedia and Newswire Texts for Question Answering in Spanish. 352-355
Ana Cristina Mendes, Luísa Coheur, Nuno J. Mamede, Ricardo Ribeiro, Fernando Batista, David Martins de Matos: QA@L2F, First Steps at QA@CLEF. 356-363
Carlos Amaral, Adán Cassan, Helena Figueira, André F. T. Martins, Afonso Mendes, Pedro Mendes, Cláudia Pinto, Daniel Vidal: Priberam's Question Answering System in QA@CLEF 2007. 364-371
Answer Validation Exercise (AVE)
Ingo Glöckner: Combining Logic and Aggregation for Answer Selection. 372-376
Óscar Ferrández, Daniel Micol, Rafael Muñoz, Manuel Palomar: On the Application of Lexical-Syntactic Knowledge to the Answer Validation Exercise. 377-380
Miguel Angel García Cumbreras, José M. Perea-Ortega, Fernando Martínez Santiago, Luis Alfonso Ureña López: Combining Lexical Information with Machine Learning for Answer Validation at QA@CLEF 2007. 381-386
Rui Wang, Günter Neumann: Using Recognizing Textual Entailment as a Core Engine for Answer Validation. 387-390
Alberto Téllez-Valero, Manuel Montes-y-Gómez, Luis Villaseñor Pineda: A Supervised Learning Approach to Spanish Answer Validation. 391-394

Question Answering on Speech Transcription (QAST)

Sophie Rosset, Olivier Galibert, Gilles Adda, Eric Bilinski: The LIMSI Participation in the QAst Track. 414-423
Pere Comas, Jordi Turmo, Mihai Surdeanu: Robust Question Answering for Speech Transcripts Using Minimal Syntactic Analysis. 424-432
Part IV: Cross-Language Retrieval in Image Collections (ImageCLEF)
Michael Grubinger, Paul Clough, Allan Hanbury, Henning Müller: Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task. 433-444
Thomas Deselaers, Allan Hanbury, Ville Viitaniemi, András A. Benczúr, Mátyás Brendel, Bálint Daróczy, Hugo Jair Escalante Balderas, Theo Gevers, Carlos Arturo Hernández-Gracidas, Steven C. H. Hoi, Jorma Laaksonen, Mingjing Li, Heidy Marisol Marín Castro, Hermann Ney, Xiaoguang Rui, Nicu Sebe, Julian Stöttinger, Lei Wu: Overview of the ImageCLEF 2007 Object Retrieval Task. 445-471
Henning Müller, Thomas Deselaers, Thomas Martin Deserno, Jayashree Kalpathy-Cramer, Eugene Kim, William R. Hersh: Overview of the ImageCLEFmed 2007 Medical Retrieval and Medical Annotation Tasks. 472-491
ImageCLEFphoto
Tobias Gass, Tobias Weyand, Thomas Deselaers, Hermann Ney: FIRE in ImageCLEF 2007: Support Vector Machines and Logistic Models to Fuse Image Descriptors for Photo Retrieval. 492-499
Julio Villena-Román, Sara Lana-Serrano, José Luis Martínez-Fernández, José Carlos González Cristóbal: MIRACLE at ImageCLEFphoto 2007: Evaluation of Merging Strategies for Multilingual and Multimedia Information Retrieval. 500-503
Yih-Chen Chang, Hsin-Hsi Chen: Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval. 504-511
Miguel Angel García Cumbreras, Manuel Carlos Díaz-Galiano, Maria Teresa Martín-Valdivia, Arturo Montejo Ráez, Luis Alfonso Ureña López: SINAI System: Combining IR Systems at ImageCLEFPhoto 2007. 512-517
András A. Benczúr, István Bíró, Mátyás Brendel, Károly Csalogány, Bálint Daróczy, Dávid Siklósi: Multimodal Retrieval by Text-Segment Biclustering. 518-521
Anni Järvelin, Peter Wilkins, Tomasz Adamek, Eija Airio, Gareth J. F. Jones, Alan F. Smeaton, Eero Sormunen: DCU and UTA at ImageCLEFPhoto 2007. 530-537
Steven C. H. Hoi: Cross-Language and Cross-Media Image Retrieval: An Empirical Study at ImageCLEF2007. 538-545
Hugo Jair Escalante, Carlos A. Hernández, Aurelio López-López, Heidy Marin-Castro, Manuel Montes-y-Gómez, Eduardo F. Morales, Luis Enrique Sucar, Luis Villaseñor Pineda: Towards Annotation-Based Query and Document Expansion for Image Retrieval. 546-553
Florence Tushabe, Michael H. F. Wilkinson: Content-Based Image Retrieval Using Combined 2D Attribute Pattern Spectra. 554-561
Osama El Demerdash, Leila Kosseim, Sabine Bergler: Text-Based Clustering of the ImageCLEFphoto Collection for Augmenting the Retrieved Results. 562-568
Stéphane Clinchant, Jean-Michel Renders, Gabriela Csurka: Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval. 569-576
ImageCLEFmed
Tatiana Tommasi, Francesco Orabona, Barbara Caputo: Cue Integration for Medical Image Annotation. 577-584
Loïc Maisonnasse, Éric Gaussier, Jean-Pierre Chevallet: Multiplying Concept Sources for Graph Modeling. 585-592
Julio Villena-Román, Sara Lana-Serrano, José Carlos González Cristóbal: MIRACLE at ImageCLEFmed 2007: Merging Textual and Visual Strategies to Improve Medical Image Retrieval. 593-596
Sara Lana-Serrano, Julio Villena-Román, José Carlos González Cristóbal, José Miguel Goñi-Menoyo: MIRACLE at ImageCLEFanot 2007: Machine Learning Experiments on Medical Image Annotation. 597-600
Manuel Carlos Díaz-Galiano, Miguel Angel García Cumbreras, Maria Teresa Martín-Valdivia, Arturo Montejo Ráez, Luis Alfonso Ureña López: Integrating MeSH Ontology to Improve Medical Information Retrieval. 601-606
Michael Springmann, Heiko Schuldt: Speeding Up IDM without Degradation of Retrieval Quality. 607-614
Juan C. Caicedo, Fabio A. González, Eduardo Romero: Content-Based Medical Image Retrieval Using Low-Level Visual Features and Modality Identification. 615-622
Jayashree Kalpathy-Cramer, William R. Hersh: Medical Image Retrieval and Automatic Annotation: OHSU at ImageCLEF 2007. 623-630
Diem Thi Hoang Le, Jean-Pierre Chevallet, Joo-Hwee Lim: Using Bayesian Network for Conceptual Indexing: Application to Medical Document Indexing with UMLS Metathesaurus. 631-636
Mark Oliver Güld, Thomas Martin Deserno: Baseline Results for the ImageCLEF 2007 Medical Automatic Annotation Task Using Global Image Features. 637-640
Miguel E. Ruiz, Aurélie Névéol: Evaluation of Automatically Assigned MeSH Terms for Retrieval of Medical Images. 641-648
ImageCLEF photo and med
Xin Zhou, Julien Gobeill, Patrick Ruch, Henning Müller: University and Hospitals of Geneva Participating at ImageCLEF 2007. 649-656
Md. Mahmudur Rahman, Bipin C. Desai, Prabir Bhattacharya: An Interactive and Dynamic Fusion-Based Image Retrieval Approach by CINDI. 657-664
Mouna Torjmen, Karen Pinel-Sauvagnat, Mohand Boughanem: Using Pseudo-Relevance Feedback to Improve Image Retrieval Results. 665-673
Part V: Cross-Language Speech Retrieval (CL-SR)
Pavel Pecina, Petra Hoffmannová, Gareth J. F. Jones, Ying Zhang, Douglas W. Oard: Overview of the CLEF-2007 Cross-Language Speech Retrieval Track. 674-686
Matthew Lease, Eugene Charniak: A Dirichlet-Smoothed Bigram Model for Retrieving Spontaneous Speech. 687-694
Ying Zhang, Gareth J. F. Jones, Ke Zhang: Dublin City University at CLEF 2007: Cross-Language Speech Retrieval Experiments. 703-711
Pavel Ircing, Josef V. Psutka, Jan Vavruska: What Can and Cannot Be Found in Czech Spontaneous Speech Using Document-Oriented IR Methods - UWB at CLEF 2007 CL-SR Track. 712-718
Manuel Carlos Díaz-Galiano, Maria Teresa Martín-Valdivia, Miguel Angel García Cumbreras, Luis Alfonso Ureña López: Using Information Gain to Filter Information in CLEF CL-SR Track. 719-724
Part VI: Multilingual Web Retrieval (WebCLEF)

Carlos G. Figuerola, José Luis Alonso Berrocal, Ángel F. Zazo Rodríguez: Segmentation of Web Documents and Retrieval of Useful Passages. 732-736
Okky Hendriansyah, Tri Firgantoro, Mirna Adriani: Using Web-Content for Retrieving Snippets. 742-744
Part VII: Cross-Language Geographical Retrieval (GeoCLEF)
Thomas Mandl, Fredric C. Gey, Giorgio Maria Di Nunzio, Nicola Ferro, Ray R. Larson, Mark Sanderson, Diana Santos, Christa Womser-Hacker, Xing Xie: GeoCLEF 2007: The CLEF 2007 Cross-Language Geographic Information Retrieval Track Overview. 745-772
Johannes Leveling, Sven Hartrumpf: Inferring Location Names for Geographic Information Retrieval. 773-780
Rocio Guillén: GeoParsing Web Queries. 781-785
Sara Lana-Serrano, Julio Villena-Román, José Carlos González Cristóbal, José Miguel Goñi-Menoyo: MIRACLE at GeoCLEF Query Parsing 2007: Extraction and Classification of Geographical Information. 786-793
Nuno Cardoso, David Cruz, Marcirio Silveira Chaves, Mário J. Silva: Using Geographic Signatures as Query and Document Scopes in Geographic IR. 802-810
Ray R. Larson: Cheshire at GeoCLEF 2007: Retesting Text Retrieval Baselines. 811-814
José M. Perea-Ortega, Miguel Angel García Cumbreras, Manuel García Vega, Luis Alfonso Ureña López: Filtering for Improving the Geographic Information Search. 823-829
Daniel Ferrés, Horacio Rodríguez: TALP at GeoCLEF 2007: Results of a Geographical Knowledge Filtering Approach with Terrier. 830-833
Daniel Ferrés, Horacio Rodríguez: TALP at GeoQuery 2007: Linguistic and Geographical Analysis for Query Parsing. 834-837
Zhisheng Li, Chong Wang, Xing Xie, Xufa Wang, Wei-Ying Ma: Exploring LDA-Based Document Model for Geographic Information Retrieval. 842-849
Ralph Kölle, Ben Heuwing, Thomas Mandl, Christa Womser-Hacker: Mono-and Crosslingual Retrieval Experiments with Spatial Restrictions at GeoCLEF 2007. 850-855
Part VIII: CLEF in Other Evaluations
CLEF at MorphoChallenge
Mikko Kurimo, Mathias Creutz, Matti Varjokallio: Morpho Challenge Evaluation Using a Linguistic Gold Standard. 864-872
Delphine Bernhard: Simple Morpheme Labelling in Unsupervised Morpheme Analysis. 873-880
Stefan Bordag: Unsupervised and Knowledge-Free Morpheme Segmentation and Analysis. 881-891
Daniel Zeman: Unsupervised Acquiring of Morphological Paradigms from Tokenized Text. 892-899
Christian Monson, Jaime G. Carbonell, Alon Lavie, Lori S. Levin: ParaMor: Finding Paradigms across Morphology. 900-907
CLEF at SemEval 2007
Eneko Agirre, Oier Lopez de Lacalle, Bernardo Magnini, Arantxa Otegi, German Rigau, Piek Vossen: SemEval-2007 Task 01: Evaluating WSD on Cross-Language Information Retrieval. 908-917



