LREC 2008:
Marrakech,
Morocco
Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, 26 May - 1 June 2008, Marrakech, Morocco.
European Language Resources Association 2008
Session O1 - Information Extraction and Question Answering
Session O2 - LRs:
Infrastructures,
Projects,
Centers
- Steven Bird, Robert Dale, Bonnie J. Dorr, Bryan R. Gibson, Mark Joseph, Min-Yen Kan, Dongwon Lee, Brett Powley, Dragomir R. Radev, Yee Fan Tan:
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics.
- Marian Reed, Denise DiPersio, Christopher Cieri:
The Linguistic Data Consortium Member Survey: Purpose, Execution and Results.
- Dieter Van Uytvanck, Alex Dukers, Jacquelijn Ringersma, Paul Trilsbeek:
Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems.
- Tamás Váradi, Steven Krauwer, Peter Wittenburg, Martin Wynne, Kimmo Koskenniemi:
CLARIN: Common Language Resources and Technology Infrastructure.
Session O3 - Corpus,
Lexicon and Evaluation
- Jeroen Geertzen, Volha Petukhova, Harry Bunt:
Evaluating Dialogue Act Tagging with Naive and Expert Annotators.
- Drahomíra "johanka" Spoustová, Pavel Pecina, Jan Hajic, Miroslav Spousta:
Validating the Quality of Full Morphological Annotation.
- Kremena Ivanova, Ulrich Heid, Sabine Schulte im Walde, Adam Kilgarriff, Jan Pomikálek:
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case.
- Mark McConville, Myroslava Dzikovska:
Evaluating Complement-Modifier Distinctions in a Semantically Annotated Corpus.
Session O4 - Multiparty and non-Verbal Communication
- Petra-Maria Strauß, Holger Hoffmann, Wolfgang Minker, Heiko Neumann, Günther Palm, Stefan Scherer, Harald C. Traue, Ulrich Weidenbacher:
The PIT Corpus of German Multi-Party Dialogues.
- Martine Adda-Decker, Claude Barras, Gilles Adda, Patrick Paroubek, Philippe Boula de Mareüil, Benoit Habert:
Annotation and analysis of overlapping speech in political interviews.
- Nicolas Moreau, Djamel Mostefa, Rainer Stiefelhagen, Susanne Burger, Khalid Choukri:
Data Collection for the CHIL CLEAR 2007 Evaluation Campaign.
- Susanne Burger, Kornel Laskowski, Matthias Wölfel:
A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora.
Session O5 - Spatio-Temporal Annotation
- Inderjeet Mani, Janet Hitzeman, Justin Richer, Dave Harris, Rob Quimby, Ben Wellner:
SpatialML: Annotation Scheme, Corpora, and Tools.
- Steven Bethard, William Corvey, Sara Klingenstein, James H. Martin:
Building a Corpus of Temporal-Causal Structure.
- Alessandra Zarcone, Alessandro Lenci:
Computational Models for Event Type Classification in Context.
- Corina Forascu:
GMT to +2 or how can TimeML be used in Romanian.
- Nianwen Xue, Hua Zhong, Kai-Yun Chen:
Annotating "tense" in a Tense-less Language.
Session O6 - Syntax and Parsing
- Barbara Plank, Khalil Sima'an:
Subdomain Sensitive Statistical Parsing using Raw Corpora.
- Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert:
Developing a TT-MCTAG for German with an RCG-based Parser.
- Peter Adolphs, Stephan Oepen, Ulrich Callmeier, Berthold Crysmann, Dan Flickinger, Bernd Kiefer:
Some Fine Points of Hybrid Natural Language Parsing.
- Jeremy Nicholson, Valia Kordoni, Yi Zhang, Timothy Baldwin, Rebecca Dridan:
Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German.
- Yi Zhang, Valia Kordoni:
Robust Parsing with a Large HPSG Grammar.
Session O7 - Document Classification
Session O8 - Multimodal Annotation Tools
- Thomas Schmidt, Susan Duncan, Oliver Ehmer, Jeffrey Hoyt, Michael Kipp, Dan Loehr, Magnus Magnusson, R. Travis Rose, Han Sloetjes:
An Exchange Format for Multimodal Annotations.
- Laura Stoia, Darla Magdalena Shockley, Donna K. Byron, Eric Fosler-Lussier:
SCARE: a Situated Corpus with Annotated Referring Expressions.
- Han Sloetjes, Peter Wittenburg:
Annotation by Category: ELAN and ISO DCR.
- Hennie Brugman, Véronique Malaisé, Laura Hollink:
A Common Multimedia Annotation Framework for Cross Linking Cultural Heritage Digital Collections.
- Philippe Blache, Roxane Bertrand, Gaëlle Ferré:
Creating and Exploiting Multimodal Annotated Corpora.
Session O9 - Lexicon,
Corpus and Semantics
- Annie Zaenen, Daniel G. Bobrow, Cleo Condoravdi:
The Encoding of lexical implications in VerbNet Predicates of change of locations.
- Aljoscha Burchardt, Marco Pennacchiotti:
FATE: a FrameNet-Annotated Corpus for Textual Entailment.
- Stephen A. Boxwell, Michael White:
Projecting Propbank Roles onto the CCGbank.
- Piek T. J. M. Vossen, Isa Maks, Roxane Segers, Hennie VanderVliet:
Integrating Lexical Units, Synsets and Ontology in the Cornetto Database.
- Javier Álvez, Jordi Atserias, Jordi Carrera, Salvador Climent, Egoitz Laparra, Antoni Oliver, German Rigau:
Complete and Consistent Annotation of WordNet using the Top Concept Ontology.
Session O10 - Multimodal and Speech Data over the Web
- Adrian Popescu, Gregory Grefenstette:
A Conceptual Approach to Web Image Retrieval.
- Gwénolé Lecorvé, Guillaume Gravier, Pascale Sébillot:
On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems.
- Stanislas Oger, Georges Linares, Frédéric Béchet:
Local Methods for On-Demand Out-of-Vocabulary Word Retrieval.
- Marc Kemps-Snijders, Alexander Klassmann, Claus Zinn, Peter Berck, Albert Russel, Peter Wittenburg:
Exploring and Enriching a Language Resource Archive via the Web.
- Florian Schiel, Hannes Mögele:
Talking and Looking: the SmartWeb Multimodal Interaction Corpus.
Session O11 - Coreference and Discourse
- Erhard W. Hinrichs, Monica Lau:
In Contrast - A Complex Discourse Connective.
- Georg Rehm, Marina Santini, Alexander Mehler, Pavel Braslavski, Rüdiger Gleim, Andrea Stubbe, Svetlana Symonenko, Mirko Tavosanis, Vedrana Vidulin:
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems.
- Olga Uryupina:
Error Analysis for Learning-based Coreference Resolution.
- Lucie Mladová, Sárka Zikánová, Eva Hajicová:
From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank.
- David Day, Janet Hitzeman, Michael L. Wick, Keith Crouch, Massimo Poesio:
A Corpus for Cross-Document Co-reference.
Session O12 - Named Entity Recognition
Session O13 - Parallel and Multilingual Resources
Session O14 - Evaluation Tools and Methodologies
- Cong-phap Huynh, Christian Boitet, Hervé Blanchon:
SECTra_w.1: an Online Collaborative System for Evaluating, Post-editing and Presenting MT Translation Corpora.
- Mark Arehart, Chris Wolf, Keith J. Miller:
Adjudicator Agreement and System Rankings for Person Name Search.
- Paulo C. F. de Oliveira, Edson Wilson Torrens, Alexandre Cidral, Sidney Schossland, Evandro Bittencourt:
Evaluating Summaries Automatically - A system Proposal.
- Thierry Poibeau, Cédric Messiant:
Do we Still Need Gold Standards for Evaluation?
Session O15 - LRs:
Large Programs,
Policies,
Strategies
- Peter Spyns, Elisabeth D'Halleweyn, Catia Cucchiarini:
The Dutch-Flemish Comprehensive Approach to HLT Stimulation and Innovation: STEVIN, HLT Agency and beyond.
- Christopher Cieri, Mark Liberman:
15 Years of Language Resource Creation and Sharing: a Progress Report on LDC Activities.
- Anil Kumar Singh, Kiran Pala, Harshit Surana:
Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language.
- Valérie Mapelli, Victoria Arranz, Hélène Mazo, Khalid Choukri:
Latest Developments in ELRA's Services.
- Carol Peters, Martin Braschler, Giorgio Maria Di Nunzio, Nicola Ferro, Julio Gonzalo, Mark Sanderson:
From Research to Application in Multilingual Information Access: the Contribution of Evaluation.
Session O16 - Biomedical Resources
Session O17 - Semantics in Lexicons and Corpora
- Tony Veale, Yanfen Hao:
Acquiring Naturalistic Concept Descriptions from the Web.
- Ulrich Heid, Marion Weller:
Tools for Collocation Extraction: Preferences for Active vs. Passive.
- Francis Bond, Hitoshi Isahara, Kyoko Kanzaki, Kiyotaka Uchimoto:
Boot-Strapping a WordNet Using Multiple Existing WordNets.
- Bartosz Broda, Magdalena Derwojedowa, Maciej Piasecki, Stan Szpakowicz:
Corpus-based Semantic Relatedness for the Construction of Polish WordNet.
- Rafiya Begum, Samar Husain, Lakshmi Bai, Dipti Misra Sharma:
Developing Verb Frames for Hindi.
Session O18 - Affect and Emotion in Speech
- Katherine Forbes-Riley, Diane J. Litman, Scott Silliman, Amruta Purandare:
Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems.
- Milan Gnjatovic, Dietmar Rösner:
On the Role of the NIMITEK Corpus in Developing an Emotion Adaptive Spoken Dialogue System.
- Stefan Scherer, Hansjörg Hofmann, Malte Lampmann, Martin Pfeil, Steffen Rhinow, Friedhelm Schwenker, Günther Palm:
Emotion Recognition from Speech: Stress Experiment.
- Laure Charonnat, Gaëlle Vidal, Olivier Boëffard:
Automatic Phone Segmentation of Expressive Speech.
- Márk Fék, Nicolas Audibert, János Szabó, Albert Rilliard, Géza Németh, Véronique Aubergé:
Multimodal Spontaneous Expressive Speech Corpus for Hungarian.
Session O19 - Opinion Mining and Summarization
Session O20 - Coreference and Discourse
- Jette Viethen, Robert Dale, Emiel Krahmer, Mariët Theune, Pascal Touset:
Controlling Redundancy in Referring Expressions.
- Massimo Poesio, Ron Artstein:
Anaphoric Annotation in the ARRAU Corpus.
- Mark-Christoph Mueller, Margot Mieskes, Michael Strube:
Knowledge Sources for Bridging Resolution in Multi-Party Dialog.
- Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind K. Joshi, Bonnie L. Webber:
The Penn Discourse TreeBank 2.0.
- Iris Hendrickx, Gosse Bouma, Frederik Coppens, Walter Daelemans, Véronique Hoste, Geert Kloosterman, Anne-Marie Mineur, Joeri Van Der Vloet, Jean-Luc Verschelde:
A Coreference Corpus and Resolution System for Dutch.
Session O21 - Semantic Resources and Acquisition
Session O22 - Speaker and Dialect Identification
- Doroteo Torre Toledano, Daniel Hernández López, Cristina Esteve-Elizalde, Julian Fiérrez, Javier Ortega-Garcia, Daniel Ramos, Joaquin Gonzalez-Rodriguez:
BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition.
- Iker Luengo, Eva Navas, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez, Igor Odriozola, Inma Hernáez:
Text Independent Speaker Identification in Multilingual Environments.
- Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, Robert E. Frederking:
NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls.
- Christopher Cieri, Stephanie Strassel, Meghan Lammie Glenn, Reva Schwartz, Wade Shen, Joseph P. Campbell:
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition.
- Linda Brandschain, Christopher Cieri, David Graff, Abby Neely, Kevin Walker:
Speaker Recognition: Building the Mixer 4 and 5 Corpora.
Session O23 - Corpus Annotation and Classification
- Nancy Ide, Collin F. Baker, Christiane Fellbaum, Charles J. Fillmore, Rebecca J. Passonneau:
MASC: the Manually Annotated Sub-Corpus of American English.
- Chu-Ren Huang, Lung-Hao Lee, Jia-Fei Hong, Weiguang Qu, Shiwen Yu:
Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System.
- Claire Cardie, Cynthia Farina, Matt Rawding, Adil Aijaz:
An eRulemaking Corpus: Identifying Substantive Issues in Public Comments.
- Branimir Boguraev, Mary S. Neff:
Navigating through Dense Annotation Spaces.
- David Guthrie, Louise Guthrie, Yorick Wilks:
An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora.
Session O24 - Machine Translation and Multilinguality
Session O25 - Evaluation
- Jennifer Foster, Josef van Genabith:
Parser Evaluation and the BNC: Evaluating 4 constituency parsers with 3 metrics.
- Patrick Paroubek, Isabelle Robba, Anne Vilnat, Christelle Ayache:
EASY, Evaluation of Parsers of French: what are the Results?
- Xavier Tannier, Philippe Muller:
Evaluation Metrics for Automatic Temporal Annotation of Texts.
- Lena Grothe, Ernesto William De Luca, Andreas Nürnberger:
A Comparative Study on Language Identification Methods.
- Eric Villemonte de la Clergerie, Olivier Hamon, Djamel Mostefa, Christelle Ayache, Patrick Paroubek, Anne Vilnat:
PASSAGE: from French Parser Evaluation to Large Sized Treebank.
Session O26 - Broadcast News Processing
- Jáchym Kolár, Jan Svec:
Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations.
- Markpong Jongtaveesataporn, Chai Wutiwiwatchai, Koji Iwano, Sadaoki Furui:
Thai Broadcast News Corpus Construction and Evaluation.
- Ingunn Amdal, Ole Morten Strand, Jørn Almberg, Torbjørn Svendsen:
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus.
- Sopheap Seng, Sethserey Sam, Laurent Besacier, Brigitte Bigi, Eric Castelli:
First Broadcast News Transcription System for Khmer Language.
- Chomicha Bendahman, Meghan Lammie Glenn, Djamel Mostefa, Niklas Paulsson, Stephanie Strassel:
Quick Rich Transcriptions of Arabic Broadcast News Speech Data.
Session O27 - Ontologies
Session O28 - Machine Translation and Multilinguality
Session O29 - Information Extraction and Question Answering
Session O30 - Evaluation in Speech Processing
- Gregory A. Sanders, Sebastien Bronsart, Sherri L. Condon, Craig Schlenoff:
Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA's TRANSTAC Program.
- Lori Lamel, Sophie Rosset, Christelle Ayache, Djamel Mostefa, Jordi Turmo, Pere Comas:
Question Answering on Speech Transcriptions: the QAST evaluation in CLEF.
- Willemijn Heeren, Franciska de Jong, Laurens van der Werff, Marijn Huijbregts, Roeland Ordelman:
Evaluation of Spoken Document Retrieval for Historic Speech Collections.
- Sherri L. Condon, Jon Phillips, Christy Doran, John S. Aberdeen, Dan Parvaz, Beatrice T. Oshika, Gregory A. Sanders, Craig Schlenoff:
Applying Automated Metrics to Speech Translation Dialogs.
- Margot Mieskes, Michael Strube:
A Three-stage Disfluency Classifier for Multi Party Dialogues.
Session O31 - Evaluation and Machine Translation
Session O32 - Syntactically Annotated Corpora
Session O33 - Terminology
Session O34 - Emotions
Session 035 - Semantics and Semantic Annotation
- Markus Dickinson, Chong Min Lee:
Detecting Errors in Semantic Annotation.
- Michael Roth, Sabine Schulte im Walde:
Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information.
- Emiliano Giovannetti, Simone Marchi, Simonetta Montemagni, Roberto Bartolini:
Ontology Learning and Semantic Annotation: a Necessary Symbiosis.
- Jordi Atserias, Hugo Zaragoza, Massimiliano Ciaramita, Giuseppe Attardi:
Semantically Annotated Snapshot of the English Wikipedia.
- Rodney D. Nielsen, Wayne Ward, James H. Martin, Martha Palmer:
Annotating Students' Understanding of Science Concepts.
Session O36 - Evaluation Methodologies
Session O37 - Lexicons,
Corpora and Acquisition
- Viktor Bielický, Otakar Smrz:
Building the Valency Lexicon of Arabic Verbs.
- Angus Roberts, Robert J. Gaizauskas, Mark Hepple, Yikun Guo:
Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation.
- Rogelio Nazar, Jorge Vivaldi, M. Teresa Cabré:
A Suite to Compile and Analyze an LSP Corpus.
- Eduardo Blanco, Núria Castell, Dan I. Moldovan:
Causal Relation Extraction.
- Grzegorz Chrupala, Georgiana Dinu, Josef van Genabith:
Learning Morphology with Morfette.
Session O38 - Ontologies
- Gaoying Cui, Qin Lu, Wenjie Li, Yi-Rong Chen:
Corpus Exploitation from Wikipedia for Ontology Construction.
- Shiyan Ou, Viktor Pekar, Constantin Orasan, Christian Spurk, Matteo Negri:
Development and Alignment of a Domain-Specific Ontology for Question Answering.
- David Manzano-Macho, Asunción Gómez-Pérez, Daniel Borrajo:
Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence.
- Alessandra Potrich, Emanuele Pianta:
L-ISA: Learning Domain Specific Isa-Relations from the Web.
- Arno Hartholt, Thomas A. Russ, David R. Traum, Eduard H. Hovy, Susan Robinson:
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture.
Session O39 - Multilingual Resources
- Eneko Agirre, Aitor Soroa:
Using the Multilingual Central Repository for Graph-Based Word Sense Disambiguation.
- Fredric C. Gey, David Kirk Evans, Noriko Kando:
A Japanese-English Technical Lexicon for Translation and Language Research.
- Le An Ha, Gabriela Fernandez, Ruslan Mitkov, Gloria Corpas Pastor:
Mutual Bilingual Terminology Extraction.
- João Graça, Joana Paulo Pardal, Luísa Coheur, Diamantino Caseiro:
Building a Golden Collection of Parallel Multi-Language Word Alignment.
- Elena Cabrio, Milen Kouylekov, Bernardo Magnini, Matteo Negri, Laura Hasler, Constantin Orasan, David Tomás, José Luis Vicedo González, Guenter Neumann, Corinna Weber:
The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering.
Session O40 - Tools for Corpus Construction and Annotation
Session O41 - Speech Varieties
- Lynette Melnar, Chen Liu:
Borrowing Language Resources for Development of Automatic Speech Recognition for Low- and Middle-Density Languages.
- Sebastian Möller, Florian Gödde, Maria Wolters:
Corpus Analysis of Spoken Smart-Home Interactions with Older Users.
- Kallirroi Georgila, Maria Wolters, Vasilis Karaiskos, Melissa Kronenthal, Robert H. Logie, Neil Mayo, Johanna D. Moore, Matthew Watson:
A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users' Interactions with Spoken Dialogue Systems.
- Catia Cucchiarini, Joris Driesen, Hugo Van Hamme, Eric Sanders:
Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus.
- Christoph Draxler, Florian Schiel, Tania Ellbogen:
F0 of Adolescent Speakers - First Results for the German Ph@ttSessionz Database.
Session O42 - Multimodal Dialogue
- Yorick Wilks, David Benyon, Christopher Brewster, Pavel Ircing, Oli Mival:
Dialogue, Speech and Images: the Companions Project Data Set.
- Jade Goldstein-Stewart, Kerri A. Goodwin, Roberta Evans Sabin, Ransom K. Winder:
Creating and Using a Correlated Corpus to Glean Communicative Commonalities.
- Roberta Catizone, Alexiei Dingli, Hugo Pinto, Yorick Wilks:
Information Extraction Tools and Methods for Understanding Dialogue in a Companion.
- Carlos Gómez Gallo, T. Florian Jaeger, James F. Allen, Mary D. Swift:
Production in a Multimodal Corpus: how Speakers Communicate Complex Actions.
Session O43 - Semantic Resources
Session O44 - Corpora and Evaluation Resources
- Leen Cleuren, Jacques Duchateau, Pol Ghesquière, Hugo Van Hamme:
Children's Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement.
- Tommaso Caselli, Nancy Ide, Roberto Bartolini:
A Bilingual Corpus of Inter-linked Events.
- Stephanie Strassel, Lauren Friedman, Safa Ismael, Linda Brandschain:
New Resources for Document Classification, Analysis and Translation Technologies.
- Katrin Tomanek, Udo Hahn:
Approximating Learning Curves for Active-Learning-Driven Annotation.
Session O45 - Lexicons
- Thorsten Trippel, Michael Maxwell, Greville Corbett, Cambell Prince, Christopher D. Manning, Stephen Grimes, Steve Moran:
Lexicon Schemas and Related Data Models: when Standards Meet Users.
- Cédric Messiant, Thierry Poibeau, Anna Korhonen:
LexSchem: a Large Subcategorization Lexicon for French Verbs.
- Horacio Rodríguez, David Farwell, Javi Ferreres, Manuel Bertrán, Musa Alkhalifa, M. Antonia Martí:
Arabic WordNet: Semi-automatic Extensions using Bayesian Inference.
Session O46 - Evaluation Methodologies
- Iñaki Sainz, Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Jon Sánchez, Iker Luengo, Igor Odriozola:
Subjective Evaluation of an Emotional Speech Database for Basque.
- Sandra Kübler, Wolfgang Maier, Ines Rehbein, Yannick Versley:
How to Compare Treebanks.
- Romaric Besançon, Stéphane Chaudiron, Djamel Mostefa, Ismaïl Timimi, Khalid Choukri:
The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign.
Session O47 - Authoring Tools and Corpora
Session O48 - TV and Video Processing
Session P1 - Corpus Construction and Annotation
- Mikko Lounela:
Process Model for Composing High-quality Text Corpora.
- Mariona Taulé, Maria Antònia Martí, Marta Recasens:
AnCora: Multilevel Annotated Corpora for Catalan and Spanish.
- Stephen Purpura, John Wilkerson, Dustin Hillard:
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998.
- Jeremy Bensley, Andrew Hickl:
Unsupervised Resource Creation for Textual Inference Applications.
- Markus Dickinson, Charles Jochim:
A Simple Method for Tagset Comparision.
- Nelleke Oostdijk, Martin Reynaert, Paola Monachesi, Gertjan van Noord, Roeland Ordelman, Ineke Schuurman, Vincent Vandeghinste:
From D-Coi to SoNaR: a reference corpus for Dutch.
- Hiromi Itoh Ozaku, Akinori Abe, Kaoru Sagara, Kiyoshi Kogure:
Relationships between Nursing Converstaions and Activities.
- Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, Haejoong Lee, Shawn Medero:
Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing.
- Harald Hammarström, Christina Thornell, Malin Petzell, Torbjörn Westerlund:
Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic).
- Satoshi Sato, Suguru Matsuyoshi, Yohsuke Kondoh:
Automatic Assessment of Japanese Text Readability Based on a Textbook Corpus.
Session P2 - LRs for Specific Domains:
Bio-Medicine and Chemistry
- Paul Thompson, Philip Cotter, John McNaught, Sophia Ananiadou, Simonetta Montemagni, Andrea Trabucco, Giulia Venturi:
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora.
- C. J. Rupp, Ann A. Copestake, Peter Corbett, Peter Murray-Rust, Advaith Siddharthan, Simone Teufel, Benjamin Waldron:
Language Resources and Chemical Informatics.
- Udo Hahn, Elena Beisswanger, Ekaterina Buyko, Michael Poprat, Katrin Tomanek, Joachim Wermter:
Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab.
- Valeria Quochi, Monica Monachini, Riccardo Del Gratta, Nicoletta Calzolari:
A lexicon for biology and bioinformatics: the BOOTStrep experience.
- Fabio Rinaldi, Gerold Schneider, Kaarel Kaljurand, Michael Hess:
Dependency-Based Relation Mining for Biomedical Literature.
- Dimitrios Kokkinakis:
MeSH(c): from a Controlled Vocabulary to a Processable Resource.
- Dimitrios Kokkinakis:
A Semantically Annotated Swedish Medical Corpus.
- Mehdi Embarek, Olivier Ferret:
Learning Patterns for Building Resources about Semantic Relations in the Medical Domain.
Session P3 - Syntactically Annotated Resources and Related Tools
- Dino Ienco, Serena Villata, Cristina Bosco:
Automatic extraction of subcategorization frames for Italian.
- Jerid Francom, Mans Hulden:
Parallel Multi-Theory Annotations of Syntactic Structure.
- Meni Adler, Yael Dahan Netzer, Yoav Goldberg, David Gabay, Michael Elhadad:
Tagging a Hebrew Corpus: the Case of Participles.
- Joy Deep Nath, Monojit Choudhury, Animesh Mukherjee, Christian Biemann, Niloy Ganguly:
Unsupervised Parts-of-Speech Induction for Bengali.
- Guadalupe Aguado de Cea, Javier Puche, José Ángel Ramos:
Tagging Spanish Texts: the Problem of Problem of "SE".
- Jirí Mírovský:
Does Netgraph Fit Prague Dependency Treebank?
- Tomas By:
The Kalshnikov 691 Dependency Bank.
- Natalie Schluter, Josef van Genabith:
Treebank-Based Acquisition of LFG Parsing Resources for French.
- Svetla Koeva, Borislav Rizov, Svetlozara Leseva:
Chooser: a Multi-Task Annotation Tool.
- Pavlina Fragkou, Georgios Petasis, Aris Theodorakos, Vangelis Karkaletsis, Constantine D. Spyropoulos:
BOEMIE Ontology-Based Text Annotation Tool.
- Ralf Krestel, Sabine Bergler, René Witte:
Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles.
Session P4 - Named Entity Recognition,
Information Extraction and Document Classification
- Piek Vossen, Eneko Agirre, Nicoletta Calzolari, Christiane Fellbaum, Shu-Kai Hsieh, Chu-Ren Huang, Hitoshi Isahara, Kyoko Kanzaki, Andrea Marchetti, Monica Monachini, Federico Neri, Remo Raffaelli, German Rigau, Maurizio Tesconi, Joop VanGent:
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures.
- Ulrich Schäfer, Hans Uszkoreit, Christian Federmann, Torsten Marek, Yajing Zhang:
Extracting and Querying Relations in Scientific Papers on Language Technology.
- Adrian Iftene, Alexandra Balahur-Dobrescu:
Named Entity Relation Mining using Wikipedia.
- Claire Grover, Sharon Givon, Richard Tobin, Julian Ball:
Named Entity Recognition for Digitised Historical Texts.
- Zhiyi Song, Stephanie Strassel:
Entity Translation and Alignment in the ACE-07 ET Task.
- Yoji Kiyota, Noriyuki Tamura, Satoshi Sakai, Hiroshi Nakagawa, Hidetaka Masuda:
Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings.
- Linus Sellberg, Arne Jönsson:
Using Random Indexing to improve Singular Value Decomposition for Latent Semantic Analysis.
Session P5 - Multi-Word Expressions
- Spela Vintar, Darja Fiser:
Harvesting Multi-Word Expressions from Parallel Corpora.
- Andrea Agili, Marco Fabbri, Alessandro Panunzi, Manuel Zini:
Integration of a Multilingual Keyword Extractor in a Document Management System.
- Daiga Deksne, Raivis Skadins, Inguna Skadina:
Dictionary of Multiword Expressions for Translation into highly Inflected Languages.
- Grazyna Vetulani, Zygmunt Vetulani, Tomasz Obrêbski:
Verb-Noun Collocation SyntLex Dictionary: Corpus-Based Approach.
- Weiruo Qu, Christoph Ringlstetter, Randy Goebel:
Targeting Chinese Nominal Compounds in Corpora.
- Margarita Alonso Ramos, Owen Rambow, Leo Wanner:
Using Semantically Annotated Corpora to Build Collocation Resources.
Session P6 - Ontologies and Knowledge
- Katia Kermanidis, Aristomenis Thanopoulos, Manolis Maragoudakis, Nikos Fakotakis:
Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text.
- Francisco Alvarez Montero, Antonio Vaquero Sanchez, Fernando Sáenz-Pérez:
Conceptual Modeling of Ontology-based Linguistic Resources with a Focus on Semantic Relations.
- Paul Buitelaar, Thomas Eigner:
Ontology Search with the OntoSelect Ontology Library.
- Cássia Trojahn dos Santos, Paulo Quaresma, Renata Vieira:
A Framework for Multilingual Ontology Mapping.
- Laura Kassner, Vivi Nastase, Michael Strube:
Acquiring a Taxonomy from the German Wikipedia.
- Davide Picca, Alfio Massimiliano Gliozzo, Aldo Gangemi:
LMM: an OWL-DL MetaModel to Represent Heterogeneous Lexical Knowledge.
- Hitoshi Isahara, Francis Bond, Kiyotaka Uchimoto, Masao Utiyama, Kyoko Kanzaki:
Development of the Japanese WordNet.
- Neil Newbold, Bogdan Vrusias, Lee Gillam:
Lexical Ontology Extraction using Terminology Analysis: Automating Video Annotation.
- Mukda Suktarachan, Dussadee Thamvijit, Sachit Rajbhandari, Daoyos Noikongka, Puwarat Pavaputanont Na Mahasarakham, Panita Yongyuth, Asanee Kawtrakul, Margherita Sini:
Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance.
- Mehrnoush Shamsfard:
Towards Semi Automatic Construction of a Lexical Ontology for Persian.
- Gerard de Melo, Gerhard Weikum:
Mapping Roget's Thesaurus and WordNet to French.
- Christophe Jouis, Julien Bourdaillet:
Representation of Atypical Entities in Ontologies.
- Siaw-Fong Chung, Laurent Prévot, Mingwei Xu, Kathleen Ahrens, Shu-Kai Hsieh, Chu-Ren Huang:
Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies.
- Jun Okamoto, Kiyoko Uchiyama, Shun Ishizaki:
A Contextual Dynamic Network Model for WSD Using Associative Concept Dictionary.
- Berenike Loos, Lasse Schwarten:
A Semantic Memory for Incremental Ontology Population.
Session P7 - Term Identification/Extraction and Terminological Databases
- Jorge Vivaldi, Anna Joan, Mercè Lorente:
Turning a Term Extractor into a new Domain: first Experiences.
- Peter G. Anick, Vijay Murthi, Shaji Sebastian:
Similar Term Discovery using Web Search.
- Junko Kubo, Keita Tsuji, Shigeo Sugimoto:
Temporal Aspects of Terminology for Automatic Term Recognition: Case Study on Women's Studies Terms.
- Ziqi Zhang, José Iria, Christopher Brewster, Fabio Ciravegna:
A Comparative Evaluation of Term Recognition Algorithms.
- Véronique Hoste, Els Lefever, Klaar Vanopstal, Isabelle Delaere:
Learning-based Detection of Scientific Terms in Patient Information.
- Eli Pociello, Antton Gurrutxaga, Eneko Agirre, Izaskun Aldezabal, German Rigau:
WNTERM: Enriching the MCR with a Terminological Dictionary.
- Rita Marinelli, Melissa Tiberi, Remo Bindi:
Encoding Terms from a Scientific Domain in a Terminological Database: Methodology and Criteria.
Session P8 - Information Extraction,
Question Answering and Document Classification
- Thomas Mandl, Fredric C. Gey, Giorgio Maria Di Nunzio, Nicola Ferro, Mark Sanderson, Diana Santos, Christa Womser-Hacker:
An Evaluation Resource for Geographic Information Retrieval.
- Jorge Civera, Alfons Juan-Císcar:
Bilingual Text Classification using the IBM 1 Translation Model.
- Hiroyuki Shinnou, Minoru Sasaki:
Ping-pong Document Clustering using NMF and Linkage-Based Refinement.
- Hiroyuki Shinnou, Minoru Sasaki:
Spectral Clustering for a Large Data Set by Reducing the Similarity Matrix Size.
- Danica Damljanovic, Valentin Tablan, Kalina Bontcheva:
A Text-based Query Interface to OWL Ontologies.
- Han Ren, Dong-Hong Ji, Lei Han:
A Research on Automatic Chinese Catchword Extraction.
- Isaac G. Councill, C. Lee Giles, Min-Yen Kan:
ParsCit: an Open-source CRF Reference String Parsing Package.
- Shunsuke Kozawa, Hitomi Tohyama, Kiyotaka Uchimoto, Shigeki Matsubara:
Automatic Acquisition of Usage Information for Language Resources.
- Michael Wiegand, Jochen L. Leidner, Dietrich Klakow:
Cost-Sensitive Learning in Answer Extraction.
- Lukasz Degórski, Michal Marcinczuk, Adam Przepiórkowski:
Definition Extraction Using a Sequential Combination of Baseline Grammars and Machine Learning Classifiers.
- Francesca Fallucchi, Fabio Massimo Zanzotto:
Yet another Platform for Extracting Knowledge from Corpora.
- Milena Yankova, Horacio Saggion, Hamish Cunningham:
A Framework for Identity Resolution and Merging for Multi-source Information Extraction.
- Jussi Karlgren, Hercules Dalianis, Bart Jongejan:
Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retrieval Setting.
- Fidelia Ibekwe-Sanjuan, Chaomei Chen, Roberto Pinho:
Identifying Strategic Information from Scientific Articles through Sentence Classification.
- Susana Azeredo, Silvia Moraes, Vera Lima:
Keywords, k-NN and Neural Networks: a Support for Hierarchical Categorization of Texts in Brazilian Portuguese.
- Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany:
Automatic Extraction of Textual Elements from News Web Pages.
- Eiko Yamamoto, Hitoshi Isahara, Akira Terada, Yasunori Abe:
Extraction of Informative Expressions from Domain-specific Documents.
- Rune Sætre, Brian Kemper, Kanae Oda, Naoaki Okazaki, Yukiko Matsuoka, Norihiro Kikuchi, Hiroaki Kitano, Yoshimasa Tsuruoka, Sophia Ananiadou, Jun-ichi Tsujii:
Connecting Text Mining and Pathways using the PathText Resource.
- Jan Pomikálek, Pavel Rychlý:
Detecting Co-Derivative Documents in Large Text Collections.
- Lothar Lemnitzer, Paola Monachesi:
Extraction and Evaluation of Keywords from Learning Objects: a Multilingual Approach.
- Peng Zhang, Wenjie Li, Furu Wei, Qin Lu, Yuexian Hou:
Exploiting the Role of Position Feature in Chinese Relation Extraction.
- Ben Allison, Louise Guthrie:
Authorship Attribution of E-Mail: Comparing Classifiers over a New Corpus for Evaluation.
- Michael Kaisser, John Lowe:
Creating a Research Collection of Question Answer Sentence Pairs with Amazon's Mechanical Turk.
- Feiyu Xu, Hans Uszkoreit, Hong Li, Niko Felger:
Adaptation of Relation Extraction Rules to New Domains.
- Asuka Sumida, Naoki Yoshinaga, Kentaro Torisawa:
Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia.
- Margot Mieskes, Michael Strube:
Parameters for Topic Boundary Detection in Multi-Party Dialogues.
- Eugenio Picchi, Eva Sassolini, Sebastiana Cucurullo, Francesca Bertagna, Paola Baroni:
Semantic Press.
- Lei Xia, José Iria:
An Approach to Modeling Heterogeneous Resources for Information Extraction.
- Anca Dinu:
On Classifying Coherent/Incoherent Romanian Short Texts.
- Lorraine Goeuriot, Natalia Grabar, Béatrice Daille:
Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian.
- Jalal Maleki, Lars Ahrenberg:
Converting Romanized Persian to the Arabic Writing Systems.
- Nasser Abouzakhar, Ben Allison, Louise Guthrie:
Unsupervised Learning-based Anomalous Arabic Text Detection.
- Prokopis Prokopidis, Vassia Karra, Aggeliki Papagianopoulou, Stelios Piperidis:
Condensing Sentences for Subtitle Generation.
- Simon Mille, Leo Wanner:
Making Text Resources Accessible to the Reader: the Case of Patent Claims.
Session P9 - Authoring Tools and Related Resources
- Jack Halpern:
Exploiting Lexical Resources for Disambiguating CJK and Arabic Orthographic Variants.
- Neil Newbold, Lee Gillam:
Automatic Document Quality Control.
- Thepchai Supnithi, Suchinder Singh, Taneth Ruangrajitpakorn, Prachya Boonkwan, Monthika Boriboon:
OpenCCG Workbench and Visualization Tool.
- Matthieu Hermet, Alain Désilets, Stan Szpakowicz:
Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors.
- Davide Fossati, Barbara Di Eugenio:
I saw TREE trees in the park: How to Correct Real-Word Spelling Mistakes.
- Martí Quixal, Toni Badia, Francesc Benavent, José Roberto de Freitas Boullosa, Judith Domingo, Bernat Grau, Guillem Massó, Oriol Valentín:
User-Centred Design of Error Correction Tools.
- Wei Liu, Ben Allison, Louise Guthrie:
Professor or Screaming Beast? Detecting Anomalous Words in Chinese.
- Iñaki Alegria, Klara Ceberio, Nerea Ezeiza, Aitor Soroa, Gregorio Hernandez:
Spelling Correction: from Two-Level Morphology to Open Source.
- Catalina Hallett, David Hardcastle:
Automatic Rewriting of Patient Record Narratives.
Session P10 - Coreference and Discourse
- Yannick Versley, Simone Paolo Ponzetto, Massimo Poesio, Vladimir Eidelman, Alan Jern, Jason Smith, Xiaofeng Yang, Alessandro Moschitti:
BART: A modular toolkit for coreference resolution.
- Massimo Poesio, Udo Kruschwitz, Jon Chamberlain:
ANAWIKI: Creating Anaphorically Annotated Resources through Web Cooperation.
- Daniela Goecke, Maik Stührenberg, Andreas Witt:
Influence of Text Type and Text Length on Anaphoric Annotation.
- Costanza Navarretta, Sussi Olsen:
Annotating Abstract Pronominal Anaphora in the DAD Project.
- Sandra Williams, Richard Power:
Deriving Rhetorical Complexity Data from the RST-DT Corpus.
- Márton Miháltz:
Knowledge-based Coreference Resolution for Hungarian.
- Malvina Nissim, Sara Perboni:
The Italian Particle "ne": Corpus Construction and Analysis.
Session P11 - Tools,
Systems,
Applications
- Dawn Knight, Paul Tennent:
Introducing DRS (The Digital Replay System): a Tool for the Future of Corpus Linguistic Research and Analysis.
- Michaela Atterer, Hinrich Schütze:
An Inverted Index for Storing and Retrieving Grammatical Dependencies.
- Jens Nilsson, Joakim Nivre:
MaltEval: an Evaluation and Visualization Tool for Dependency Parsing.
- Hiroaki Sato:
New Functions of FrameSQL for Multilingual FrameNets.
- Hiroyuki Shinnou, Minoru Sasaki:
Division of Example Sentences Based on the Meaning of a Target Word Using Semi-Supervised Clustering.
- Hiroaki Saito, Shunta Kuboya, Takaaki Sone, Hayato Tagami, Kyoko Ohara:
The Japanese FrameNet Software Tools.
- Maria Teresa Pazienza, Armando Stellato, Alexandra Tudorache:
JMWNL: an Extensible Multilingual Library for Accessing Wordnets in Different Languages.
- Diana Maynard:
Benchmarking Textual Annotation Tools for the Semantic Web.
- Liviu Petrisor Dinu, Marius Popescu, Anca Dinu:
Authorship Identification of Romanian Texts with Controversial Paternity.
Session P12 - Lexical Resources and Tools
- Marc Kemps-Snijders, Claus Zinn, Jacquelijn Ringersma, Menzo Windhouwer:
Ensuring Semantic Interoperability on Lexical Resources.
- Marc Finthammer, Irene M. Cramer:
Exploring and Navigating: Tools for GermaNet.
- Marianne Santaholma, Nikos Chatzichrisafis:
A Knowledge-Modeling Approach for Multilingual Regulus Lexica.
- Michael Rosner:
ODL: an Object Description Language for Lexical Information.
- Dan Cristea, Corina Forascu, Marius Raschip, Michael Zock:
How to Evaluate and Raise the Quality in a Collaborative Lexicographic Approach.
- Bolette Sandford Pedersen, Anna Braasch, Lina Henriksen, Sussi Olsen, Claus Povlsen:
Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet.
- Borislav Rizov:
Hydra: a Modal Logic Tool for Wordnet Development, Validation and Exploration.
Session P13 - Evaluation:
Resources,
Tools,
Methodologies,
Campaigns
- Míriam Luján-Mares, Carlos D. Martínez-Hinarejos, Vicent Alabau Gonzalvo:
Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation.
- Laurianne Sitbon, Patrice Bellot, Philippe Blache:
Evaluation of Lexical Resources and Semantic Networks on a Corpus of Mental Associations.
- Heike Bieler, Stefanie Dipper:
Measures for Term and Sentence Relevances: an Evaluation for German.
- Julia Ritz, Stefanie Dipper, Michael Götze:
Annotation of Information Structure: an Evaluation across different Types of Texts.
- Quang Thang Dinh, Hong Phuong Le, Thi Minh Huyen Nguyen, Cam-Tu Nguyen, Mathias Rossignol, Xuân Luong Vu:
Word Segmentation of Vietnamese Texts: a Comparison of Approaches.
- Cristina Bosco, Alessandro Mazzei, Vincenzo Lombardo, Giuseppe Attardi, Anna Corazza, Alberto Lavelli, Leonardo Lesmo, Giorgio Satta, Maria Simi:
Comparing Italian parsers on a common Treebank: the EVALITA experience.
- Bernardo Magnini, Amedeo Cappelli, Fabio Tamburini, Cristina Bosco, Alessandro Mazzei, Vincenzo Lombardo, Francesca Bertagna, Nicoletta Calzolari, Antonio Toral, Valentina Bartalesi Lenzi, Rachele Sprugnoli, Manuela Speranza:
Evaluation of Natural Language Tools for Italian: EVALITA 2007.
- Maria Teresa Pazienza, Armando Stellato, Alexandra Tudorache:
A Bottom-up Comparative Study of EuroWordNet and WordNet 3.0 Lexical and Semantic Relations.
- Simon Scerri, Myriam Mencke, Brian Davis, Siegfried Handschuh:
Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication.
- Václav Novák, Keith Hall:
Inter-sentential Coreferences in Semantic Networks: An Evaluation of Manual Annotation.
- Mohamed Maamouri, Seth Kulick, Ann Bies:
Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation.
- Chantal Enguehard, Harouna Naroua:
Evaluation of Virtual Keyboards for West-African Languages.
- Constantin Orasan, Dan Cristea, Ruslan Mitkov, António Horta Branco:
Anaphora Resolution Exercise: an Overview.
- Diana Santos, Alberto Simões:
Portuguese-English Word Alignment: some Experiments.
- Karin Schuler, Vinod Kaggal, James J. Masanz, Philip V. Ogren, Guergana K. Savova:
System Evaluation on a Named Entity Corpus from Clinical Notes.
- Philip V. Ogren, Guergana K. Savova, Christopher G. Chute:
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition.
- Eric K. Ringger, Marc Carmen, Robbie Haertel, Kevin D. Seppi, Deryle Lonsdale, Peter McClanahan, James L. Carroll, Noel Ellison:
Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study.
- Alexandre Allauzen, Hélène Bonneau-Maynard:
Training and Evaluation of POS Taggers on the French MULTITAG Corpus.
- Marco Baroni, Francis Chantree, Adam Kilgarriff, Serge Sharoff:
Cleaneval: a Competition for Cleaning Web Pages.
- Mark Arehart, Keith J. Miller:
A Ground Truth Dataset for Matching Culturally Diverse Romanized Person Names.
- Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, Takehito Utsuro:
Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop.
- Marilisa Amoia, Claire Gardent:
A Test Suite for Inference Involving Adjectives.
- Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -.
- Olivier Hamon, Djamel Mostefa:
An Experimental Methodology for an End-to-End Evaluation in Speech-to-Speech Translation.
Session P14 - Evaluation:
Resources,
Tools,
Systems,
Methodologies
- Carlos D. Martínez-Hinarejos, Vicent Tamarit:
Evaluation of Different Segmentation Techniques for Dialogue Turns.
- David Griol, Lluís F. Hurtado, Encarna Segarra, Emilio Sanchis:
Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques.
- Susan Robinson, David R. Traum, Midhun Ittycheriah, Joe Henderer:
What would you Ask a conversational Agent? Observations of Human-Agent Dialogues in a Museum Setting.
- Dave Toney, Sophie Rosset, Aurélien Max, Olivier Galibert, Eric Bilinski:
An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System.
- Muriel Amar, Sophie David, Rachel Panckhurst, Lisa Whistlecroft:
Classification Procedures for Software Evaluation.
- Sylwia Ozdowska:
Cross-Corpus Evaluation of Word Alignment.
- Diana Maynard, Wim Peters, Yaoyong Li:
Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection.
- Diana McCarthy:
Lexical Substitution as a Framework for Multiword Evaluation.
- Martin Emms:
Tree Distance and Some Other Variants of Evalb.
- A. Cüneyd Tantug, Kemal Oflazer, Ilknur Durgar El-Kahlout:
BLEU+: a Tool for Fine-Grained BLEU Computation.
- C. Ray Graham, Deryle Lonsdale, Casey Kennington, Aaron Johnson, Jeremiah McGhee:
Elicited Imitation as an Oral Proficiency Measure with ASR Scoring.
- Pedro Concejero Cerezo, Daniel Tapias Merino, Juan José Rodríguez Soler, Juan Carlos Luengo, Sebastián Sánchez:
Methodology for Evaluating the Usability of User Interfaces in Mobile Services.
- Edouard Geoffrois:
An Economic View on Human Language Technology Evaluation.
- Beatrice Alex:
Comparing Corpus-based to Web-based Lookup Techniques for Automatic English Inclusion Detection.
- Laura Hasler:
Centering Theory for Evaluation of Coherence in Computer-Aided Summaries.
- Stephanie Strassel, Mark A. Przybocki, Kay Peterson, Zhiyi Song, Kazuaki Maeda:
Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction.
- Johan Bos:
Let's not Argue about Semantics.
- David Hardcastle, Donia Scott:
Can we Evaluate the Quality of Generated Text?
- Keith J. Miller, Mark Arehart, Catherine Ball, John Polk, Alan Rubenstein, Kenneth Samuel, Elizabeth Schroeder, Eva Vecchi, Chris Wolf:
An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems.
- Laurianne Sitbon, Patrice Bellot, Philippe Blache:
Evaluating Robustness Of A QA System Through A Corpus Of Real-Life Questions.
- Ann Devitt, Khurshid Ahmad:
Sentiment Analysis and the Use of Extrinsic Datasets in Evaluation.
- Cyril Grouin:
Certification and Cleaning up of a Text Corpus: Towards an Evaluation of the "Grammatical" Quality of a Corpus.
- Laurent Blin, Olivier Boëffard, Vincent Barreaud:
WEB-Based Listening Test System for Speech Synthesis and Speech Conversion Evaluation.
- Renata Queiroz Dividino, Massimo Romanelli, Daniel Sonntag:
Semiotic-based Ontology Evaluation Tool (S-OntoEval).
- George Demetriou, Robert J. Gaizauskas, Haotian Sun, Angus Roberts:
ANNALIST - ANNotation ALIgnment and Scoring Tool.
- Andrei Popescu-Belis, Mike Flynn, Pierre Wellner, Philippe Baudrion:
Task-Based Evaluation of Meeting Browsers: from Task Elicitation to User Behavior Analysis.
- Paula Estrella, Andrei Popescu-Belis, Maghi King:
Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback.
- Brian A. Weiss, Craig Schlenoff, Gregory A. Sanders, Michelle Potts Steves, Sherri L. Condon, Jon Phillips, Dan Parvaz:
Performance Evaluation of Speech Translation Systems.
- Arne Mauser, Sasa Hasan, Hermann Ney:
Automatic Evaluation Measures for Statistical Machine Translation System Optimization.
Session P15 - LR Infrastructures and Architectures
- Dan Tufis, Radu Ion, Alexandru Ceausu, Dan Stefanescu:
RACAI's Linguistic Web Services.
- Hanno Biber, Evelyn Breiteneder, Karlheinz Mörth:
Words in Contexts: Digital Editions of Literary Journals in the "AAC - Austrian Academy Corpus".
- Chris Biemann, Uwe Quasthoff, Gerhard Heyer, Florian Holz:
ASV Toolbox: a Modular Collection of Language Exploration Tools.
- António Branco, Francisco Costa, Pedro Martins, Filipe Nunes, João Silva, Sara Silveira:
LX-Service: Web Services of Language Technology for Portuguese.
- Emanuele Pianta, Christian Girardi, Roberto Zanoli:
The TextPro Tool Suite.
- Bayan Abu Shawar, Eric Atwell:
An AI-inspired intelligent agent/student architecture to combine Language Resources research and teaching.
- Kjell Elenius, Eva Forsbom, Beáta Megyesi:
Language Resources and Tools for Swedish: A Survey.
- Lars Nygaard, Joel Priestley, Anders Nøklestad, Janne Bondi Johannessen:
Glossa: a Multilingual, Multimodal, Configurable User Interface.
- Ekaterina Buyko, Christian Chiarcos, Antonio Pareja-Lora:
Ontology-Based Interface Specifications for a NLP Pipeline Architecture.
- Daan Broeder, Thierry Declerck, Erhard W. Hinrichs, Stelios Piperidis, Laurent Romary, Nicoletta Calzolari, Peter Wittenburg:
Foundation of a Component-based Flexible Registry for Language Resources and Technology.
- Daan Broeder, David Nathan, Sven Strömqvist, Remco van Veenendaal:
Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN.
- Paul Trilsbeek, Daan Broeder, Tobias Valkenhoef, Peter Wittenburg:
A Grid of Regional Language Archives.
- Takenobu Tokunaga, Dain Kaplan, Chu-Ren Huang, Shu-Kai Hsieh, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Kiyoaki Shirai, Virach Sornlertlamvanich, Thatsanee Charoenporn, Yingju Xia:
Adapting International Standard for Asian Language Technologies.
- Keiji Shinzato, Daisuke Kawahara, Chikara Hashimoto, Sadao Kurohashi:
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure.
- Riccardo Del Gratta, Roberto Bartolini, Tommaso Caselli, Monica Monachini, Claudia Soria, Nicoletta Calzolari:
UFRA: a UIMA-based Approach to Federated Language Resource Architecture.
- Georg Rehm, Oliver Schonefeld, Andreas Witt, Timm Lehmberg, Christian Chiarcos, Hanan Bechara, Florian Eishold, Kilian Evang, Magdalena Leshtanska, Aleksandar Savkov, Matthias Stark:
The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources.
- Piroska Lendvai, Steve Hunt:
From Field Notes towards a Knowledge Base.
- Hitomi Tohyama, Shunsuke Kozawa, Kiyotaka Uchimoto, Shigeki Matsubara, Hitoshi Isahara:
Construction of a Metadata Database for Efficient Development and Use of Language Resources.
- Bodil Nistrup Madsen, Hanne Erdman Thomsen:
A Taxonomy of Lexical Metadata Categories.
Session P16 - LR National/International Projects,
Organizational/Policy Issues
- Shuichi Itahashi, Chiu-yu Tseng:
The 2008 Oriental COCOSDA Book Project: in Commemoration of the First Decade of Sustained Activities in Asia.
- Adam Przepiórkowski, Rafal L. Górski, Barbara Lewandowska-Tomaszyk, Marek Lazinski:
Towards the National Corpus of Polish.
- Einar Meister, Jaak Vilo:
Strengthening the Estonian Language Technology.
- Bente Maegaard, Mohammed Atiyya, Khalid Choukri, Steven Krauwer, Chafic Mokbel, Mustafa Yaseen:
MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic.
- Simon Krek, Vojko Gorjanc, Spela Arhar:
Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema.
Session P17 - Standards and Best Practices for LRs
- Volha Petukhova, Harry Bunt:
LIRICS Semantic Role Annotation: Design and Evaluation of a Set of Data Categories.
- Daniel Zeman:
Reusable Tagset Conversion Using Tagset Drivers.
- Marie-Jeanne Derouin, André Le Meur:
Presentation of the New ISO-Standard for the Representation of Entries in Dictionaries: ISO 1951.
- Marc Kemps-Snijders, Menzo Windhouwer, Peter Wittenburg, Sue Ellen Wright:
ISOcat: Corralling Data Categories in the Wild.
- Isa Maks, Carole Tiberius, Remco van Veenendaal:
Standardising Bilingual Lexical Resources According to the Lexicon Markup Framework.
- Thierry Declerck:
A Framework for Standardized Syntactic Annotation.
- Victoria Arranz, Franck Gandcher, Valérie Mapelli, Khalid Choukri:
A Guide for the Production of Reusable Language Resources.
Session P18 - Lexical Resources and Tools
- Denis Maurel:
Prolexbase: a Multilingual Relational Lexical Database of Proper Names.
- Yoshihiko Hayashi, Chiharu Narawa, Monica Monachini, Claudia Soria, Nicoletta Calzolari:
Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy.
- Ana-Maria Barbu:
Romanian Lexical Data Bases: Inflected and Syllabic Forms Dictionaries.
- Atsushi Fujii:
Producing an Encyclopedic Dictionary using Patent Documents.
- Folkert de Vriend, Jan Pieter Kunst, Louis ten Bosch, Charlotte Giesbers, Roeland Van Hout:
Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization.
- Piotr Banski, Radoslaw Moszczynski:
Enhancing an English-Polish Electronic Dictionary for Multiword Expression Research.
- Claire Brierley, Eric Atwell:
ProPOSEL: A Prosody and POS English Lexicon for Language Engineering.
- Eline Westerhout, Paola Monachesi:
Creating Glossaries Using Pattern-Based and Machine Learning Techniques.
- Lynne J. Cahill:
Using Similarity Measures to Extend the LinGO Lexicon.
- Peter Adolphs:
Acquiring a Poor Man's Inflectional Lexicon for German.
- Núria Bel, Sergio Espeja, Montserrat Marimon, Marta Villegas:
COLDIC, a Lexicographic Platform for LMF compliant lexica.
Session P19 - Morphology,
Syntax and Tools
- David Bamman, Marco Passarotti, Roberto Busa, Gregory Crane:
The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin.
- Dan Tufis, Elena Irimia, Radu Ion, Alexandru Ceausu:
Unsupervised Lexical Acquisition for Part of Speech Tagging.
- Amalia Todirascu, Dan Tufis, Ulrich Heid, Christopher Gledhill, Dan Stefanescu, Marion Weller, François Rousselot:
A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions.
- Ekaterina Lapshinova-Koltunski, Ulrich Heid:
Head or Non-head? Semi-automatic Procedures for Extracting and Classifying Subcategorisation Properties of Compounds.
- Manuel Kountz, Ulrich Heid, Kerstin Eckart:
A LAF/GrAF based Encoding Scheme for underspecified Representations of syntactic Annotations.
- Tomaz Erjavec, Simon Krek:
The JOS Morphosyntactically Tagged Corpus of Slovene.
- Aleksander Buczynski, Adam Przepiórkowski:
spade Demo: An Open Source Tool for Partial Parsing and Morphosyntactic Disambiguation.
- Silke Scheible:
Annotating Superlatives.
- Steliana Ivanova, Sandra Kuebler:
POS Tagging for German: how important is the Right Context?
- Christian Hänig, Stefan Bordag, Uwe Quasthoff:
UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging.
- Sara Tonelli, Rodolfo Delmonte, Antonella Bristot:
Enriching the Venice Italian Treebank with Dependency and Grammatical Relations.
- Kristina Vuckovic, Marko Tadic, Zdravko Dovedan:
Rule-Based Chunker for Croatian.
- Valeria Quochi, Basilio Calderone:
Learning properties of Noun Phrases: from data to functions.
- Eva Banik, Alan Lee:
A Study of Parentheticals in Discourse Corpora - Implications for NLG Systems.
- Mohamed Maamouri, Ann Bies, Seth Kulick:
Enhancing the Arabic Treebank: a Collaborative Effort toward New Annotation Guidelines.
- Martha Palmer, Olga Babko-Malaya, Ann Bies, Mona T. Diab, Mohamed Maamouri, Aous Mansouri, Wajdi Zaghouani:
A Pilot Arabic Propbank.
Session P20 - Multimodal,
Multimedia and Subjective Corpus
- Mark A. Greenwood, José Iria, Fabio Ciravegna:
Saxon: an Extensible Multimedia Annotator.
- Michael Kipp:
Spatiotemporal Coding in ANVIL.
- Michelina Savino, Laura Scivetti, Mario Refice:
Integrating Audio and Visual Information for Modelling Communicative Behaviours Perceived as Different.
- Kazuaki Maeda, Haejoong Lee, Shawn Medero, Julie Medero, Robert Parker, Stephanie Strassel:
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium.
- Jana Trojanová, Marek Hrúz, Pavel Campr, Milos Zelezný:
Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition.
- David Llorens, Federico Prat, Andrés Marzal, Juan Miguel Vilar, María José Castro, Juan-Carlos Amengual, Sergio Barrachina, Antonio Castellanos, Salvador España Boquera, J. A. Gómez, Jorge Gorbe-Moya, Albert Gordo, Vicente Palazón, Guillermo Peris, Rafael Ramos-Garijo, Francisco Zamora-Martínez:
The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters.
- Emilie Chételat-Pelé, Annelies Braffort:
Sign Language Corpus Annotation: toward a new Methodology.
- Philippe Dreuw, Carol Neidle, Vassilis Athitsos, Stan Sclaroff, Hermann Ney:
Benchmark Databases for Video-Based Automatic Sign Language Recognition.
- Jan Bungeroth, Daniel Stein, Philippe Dreuw, Hermann Ney, Sara Morrissey, Andy Way, Lynette van Zijl:
The ATIS Sign Language Corpus.
- Pavel Campr, Marek Hrúz, Jana Trojanová:
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition.
- Shigeyoshi Kitazawa, Shinya Kiriyama, Tomohiko Kasami, Shogo Ishikawa, Naofumi Otani, Hiroaki Horiuchi, Yoichi Takebayashi:
A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions.
- Yoshiko Arimoto, Sumio Ohno, Hitoshi Iida:
Automatic Emotional Degree Labeling for Speakers' Anger Utterance during Natural Japanese Dialog.
- Theodoros Kostoulas, Todor Ganchev, Iosif Mporas, Nikos Fakotakis:
A Real-World Emotional Speech Corpus for Modern Greek.
- Theresa Wilson:
Annotating Subjective Content in Meetings.
Session P21 - Tools and Data for Speech Systems Development
- Henk van den Heuvel, Jean-Pierre Martens, Bart D'hoore, Kristof D'hanens, Nanneke Konings:
The AUTONOMATA Spoken Names Corpus.
- Briony Williams, Rhys James Jones:
Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language.
- Reiko Kaji, Hajime Mochizuki:
Constructing a Database of Non-Japanese Pronunciations of Different Japanese Romanizations.
- Antoine Laurent, Téva Merlin, Sylvain Meignier, Yannick Estève, Paul Deléglise:
Combined Systems for Automatic Phonetic Transcription of Proper Nouns.
- Harald Höge, Zdravko Kacic, Bojan Kotnik, Matej Rojc, Nicolas Moreau, Horst-Udo Hain:
Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework.
- Dafydd Gibbon, Jolanta Bachan:
An Automatic Close Copy Speech Synthesis Tool for Large-Scale Speech Corpus Evaluation.
- Stefan Scherer, Petra-Maria Strauß:
A Flexible Wizard of Oz Environment for Rapid Prototyping.
- Jindrich Matousek, Daniel Tihelka, Jan Romportl:
Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis.
- Luís C. Oliveira, Sérgio Paulo, Luís Figueira, Carlos Mendes, Ana Nunes, Joaquim Godinho:
Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis.
- Alexandre Patry, Philippe Langlais:
MISTRAL: a Statistical Machine Translation Decoder for Speech Recognition Lattices.
- Ute Ziegenhain, Hanne Fersoe, Henk van den Heuvel, Asunción Moreno:
LC-STAR II: Starring more Lexica.
- Matthias Eck, Stephan Vogel, Alex Waibel:
Communicating Unknown Words in Machine Translation.
- Pierrette Bouillon, Sonia Halimi, Yukie Nakao, Kyoko Kanzaki, Hitoshi Isahara, Nikos Tsourakis, Marianne Starlander, Beth Ann Hockey, Manny Rayner:
Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System.
- Nadine Perera, Michael Pitz, Manfred Pinkal:
CLIoS: Cross-lingual Induction of Speech Recognition Grammars.
- Takahiro Ono, Hitomi Tohyama, Shigeki Matsubara:
Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus.
- Marie-Jean Meurs, Frédéric Duvert, Frédéric Béchet, Fabrice Lefevre, Renato de Mori:
Semantic Frame Annotation on the French MEDIA corpus.
- Nick Webb, Ting Liu, Mark Hepple, Yorick Wilks:
Cross-Domain Dialogue Act Tagging.
- Nikos Tsourakis, Maria Georgescul, Pierrette Bouillon, Manny Rayner:
Building Mobile Spoken Dialogue Applications Using Regulus.
- Christian Raymond, Kepa Joseba Rodriguez, Giuseppe Riccardi:
Active Annotation in the LUNA Italian Corpus of Spontaneous Dialogues.
- Stefan Hahn, Patrick Lehnen, Christian Raymond, Hermann Ney:
A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding.
- Stéphane Huet, Guillaume Gravier, Pascale Sébillot:
Morphosyntactic Resources for Automatic Speech Recognition.
Session P22 - Speech Corpus in Various Environments
- Nicolás Morales, Javier Tejedor, Javier Garrido, José Colás, Doroteo Torre Toledano:
rre STC-TIMIT: Generation of a Single-channel Telephone Corpus.
- Eric Sanders, Asunción Moreno, Herbert S. Tropf, Lynette Melnar, Nurit Dekel, Breanna Gillies, Niklas Paulsson:
LILA: Cellular Telephone Speech Databases from Asia.
- Grazyna Demenko, Stefan Grocholewski, Katarzyna Klessa, Jerzy Ogórkiewicz, Agnieszka Wagner, Marek Lange, Daniel Sledzinski, Natalia Cylwik:
JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts.
- Tomas Dekens, Yorgos Patsis, Werner Verhelst, Frédéric Beaugendre, François Capman:
A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments.
- Isabel Trancoso, Rui Martins, Helena Moniz, Ana Isabel Mata, Céu Viana:
The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese.
- Florian Schiel, Christian Heinrich, Sabine Barfüßer, Thomas Gilg:
ALC: Alcohol Language Corpus.
- Rubén Fernández Pozo, Luis A. Hernández Gómez, Eduardo López Gonzalo, José Alcázar Ramírez, Guillermo Portillo, Doroteo Torre Toledano:
Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases.
- Tomoyosi Akiba, Kiyoaki Aikawa, Yoshiaki Itoh, Tatsuya Kawahara, Hiroaki Nanjo, Hiromitsu Nishizaki, Norihito Yasuda, Yoichi Yamashita, Katunobu Itou:
Test Collections for Spoken Document Retrieval from Lecture Audio Data.
- Akira Ozaki, Sunao Hara, Takashi Kusakawa, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Katunobu Itou, Kazuya Takeda:
In-car Speech Data Collection along with Various Multimodal Signals.
- Masatoshi Tsuchiya, Satoru Kogure, Hiromitsu Nishizaki, Kengo Ohta, Seiichi Nakagawa:
Developing Corpus of Japanese Classroom Lecture Speech Contents.
- Konrad Hofbauer, Stefan Petrik, Horst Hering:
The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech.
- Thomas Winkler, Theodoros Kostoulas, Richard Adderley, Christian Bonkowski, Todor Ganchev, Joachim Köhler, Nikos Fakotakis:
The MoveOn Motorcycle Speech Corpus.
- Stavros Ntalampiras, Ilyas Potamitis, Todor Ganchev, Nikos Fakotakis:
Audio Database in Support of Potentiel Threat and Crisis Situation Management.
- Martine Garnier-Rizet, Gilles Adda, Frédérik Cailliau, Jean-Luc Gauvain, Sylvie Guillemin-Lanne, Lori Lamel, Stephan Vanni, Claire Waast-Richard:
CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content.
- Djamel Mostefa, Arnaud Vallée:
New Telephone Speech Databases for French: a Children Database and an optimized Adult Corpus.
Session P23 - Speech Corpus in Various Languages
- Krzysztof Marasek, Ryszard Gubrynowicz:
Design and Data Collection for Spoken Polish Dialogs Database.
- Fabíola Santos, Tiago Freitas:
CORP-ORAL: Spontaneous Speech Corpus for European Portuguese.
- Tiit Hennoste, Olga Gerassimenko, Riina Kasterpalu, Mare Koit, Andriela Rääbis, Krista Strandson:
From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian.
- Rudolf Muhr:
The Pronouncing Dictionary of Austrian German (AGPD) and the Austrian Phonetic Database (ADABA): Report on a large Phonetic Resources Database of the three Major Varieties of German.
- Caren Brinckmann, Stefan Kleiner, Ralf Knöbl, Nina Berend:
German Today: a really extensive Corpus of Spoken Standard German.
- Antonio Bonafonte, Jordi Adell, Ignasi Esquerra, Silvia Gallego, Asunción Moreno, Javier Pérez:
Corpus and Voices for Catalan Speech Synthesis.
- Martine Adda-Decker, Thomas Pellegrini, Eric Bilinski, Gilles Adda:
Developments of "Lëtzebuergesch" Resources for Automatic Speech Processing and Linguistic Studies.
Session P24 - Speech Corpus Design Methodology and Tools
- Rena Nemoto, Ioana Vasilescu, Martine Adda-Decker:
Speech Errors on Frequently Observed Homophones in French: Perceptual Evaluation vs Automatic Classification.
- Hiroki Yamazaki, Keisuke Kitamura, Takashi Harada, Seiichi Yamamoto:
Creation of Learner Corpus and Its Application to Speech Recognition.
- Jean-Yves Antoine, Abdenour Mokrane, Nathalie Friburger:
Automatic Rich Annotation of Large Corpus of Conversational transcribed speech: the Chunking Task of the EPAC Project.
- Thierry Bazillon, Yannick Estève, Daniel Luzzati:
Manual vs Assisted Transcription of Prepared and Spontaneous Speech.
- Antonio Moreno-Sandoval, Doroteo Torre Toledano, Raùl de la Torre, Marta Garrote, José María Guirao:
Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories.
- Petr Pollák, Jan Volín, Radek Skarnitzl:
Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database.
- Victoria Bobicev, Tatiana Zidrasco:
Estimating Word Phonosemantics.
- Joachim Gasch, Caren Brinckmann, Sylvia Dickgießer:
memasysco: XML schema based metadata management system for speech corpora.
- Jonathan Chevelu, Nelly Barbot, Olivier Boëffard, Arnaud Delhay:
Comparing Set-Covering Strategies for Optimal Corpus Design.
- Pierre Lanchantin, Andrew C. Morris, Xavier Rodet, Christophe Veaux:
Automatic Phoneme Segmentation with Relaxed Textual Constraints.
- Christophe Veaux, Gregory Beller, Xavier Rodet:
IrcamCorpusTools: an Extensible Platform for Spoken Corpora Exploitation.
- Erin Fitzgerald, Frederick Jelinek:
Linguistic Resources for Reconstructing Spontaneous Speech Text.
- Maarten Janssen, Tiago Freitas:
Spock - a Spoken Corpus Client.
- Viktor Trón:
On the Durational Reduction of Repeated Mentions: Recency and Speaker Effects.
Session P25 - Morphology and Morphosyntax
- Florian Koehler, Hinrich Schütze, Michaela Atterer:
A Question Answering System for German. Experiments with Morphological Linguistic Resources.
- Bruno Cartoni:
Lexical Resources for Automatic Translation of Constructed Neologisms: the Case Study of Relational Adjectives.
- Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso, Hideki Ogura:
A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation.
- Reut Tsarfaty, Yoav Goldberg:
Word-Based or Morpheme-Based? Annotation Strategies for Modern Hebrew Clitics.
- Sonja E. Bosch, Laurette Pretorius, Kholisa Podile, Axel Fleisch:
Experimental Fast-Tracking of Morphological Analysers for Nguni Languages.
- Nikola Ljubesic, Tomislava Lauc, Damir Boras:
Generating a Morphological Lexicon of Organization Entity Names.
- Serge Sharoff, Mikhail Kopotev, Tomaz Erjavec, Anna Feldman, Dagmar Divjak:
Designing and Evaluating a Russian Tagset.
- Karel Pala, Lukás Svoboda, Pavel Smerk:
Czech MWE Database.
- Nizar Habash, Ryan Roth:
Identification of Naturally Occurring Numerical Expressions in Arabic.
- Shisanu Tongchim, Randolf Altmeyer, Virach Sornlertlamvanich, Hitoshi Isahara:
A Dependency Parser for Thai.
- Mehrnoush Shamsfard, Hakimeh Fadaei:
A Hybrid Morphology-Based POS Tagger for Persian.
- Baskaran Sankaran, Kalika Bali, Monojit Choudhury, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Girish Nath Jha, S. Rajendran, K. Saravanan, L. Sobha, K. V. Subbarao:
A Common Parts-of-Speech Tagset Framework for Indian Languages.
Session P26 - Semantics,
Semantic Resources and Semantic Annotation
- Rajat Kumar Mohanty, Pushpak Bhattacharyya:
Lexical Resources for Semantics Extraction.
- Alain Joubert, Mathieu Lafourcade:
Evolutionary Basic Notions for a Thematic Representation of General Knowledge.
- Ya-Min Chou, Chu-Ren Huang, Jia-Fei Hong:
The Extended Architecture of Hantology for Japan Kanji.
- Petya Osenova, Kiril Ivanov Simov, Eelco Mossel:
Language Resources for Semantic Document Annotation and Crosslingual Retrieval.
- Sanaz Jabbari, Ben Allison, Louise Guthrie:
Using a Probabilistic Model of Context to Detect Word Obfuscation.
- Sara Tonelli, Emanuele Pianta:
Frame Information Transfer from English to Italian.
- Jordi Carrera, Irene Castellón, Salvador Climent, Marta Coll-Florit:
Towards Spanish Verbs' Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus.
- Paula Cristina Vaz, David Martins de Matos, Nuno J. Mamede:
Using Lexical Acquisition to Enrich a Predicate Argument Reusable Database.
- Chris Reed, Raquel Mochales Palau, Glenn Rowe, Marie-Francine Moens:
Language Resources for Studying Argument.
- Cosmin Adrian Bejan, Sanda M. Harabagiu:
A Linguistic Resource for Discovering Event Structures and Resolving Event Coreference.
- Kyoko Ohara:
Lexicon, Grammar, and Multilinguality in the Japanese FrameNet.
- Nilda Ruimy, Antonio Toral:
More Semantic Links in the SIMPLE-CLIPS Database.
- Riccardo Del Gratta, Nilda Ruimy, Antonio Toral:
Simple-Clips ongoing research: more information with less data by implementing inheritance.
- Brian Davis, Siegfried Handschuh, Alexander Troussov, John Judge, Mikhail Sogrin:
Linguistically Light Lexical Extensions for Ontologies.
Session P27 - Temporal Annotation
- Stéphanie Weiser, Philippe Laublet, Jean-Luc Minel:
Automatic Identification of Temporal Information in Tourism Web Pages.
- Sebastian Gottwald, Matthias Richter, Gerhard Heyer, Gerik Scheuermann:
Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze.
- Ineke Schuurman:
Spatiotemporal Annotation Using MiniSTEx: how to deal with Alternative, Foreign, Vague and/or Obsolete Names?
- Maria Teresa Vicente-Díez, Doaa Samy, Paloma Martínez:
An Empirical Approach to a Preliminary Successful Identification and Resolution of Temporal Expressions in Spanish News Corpora.
- Georgiana Puscasu, Verginica Barbu Mititelu:
Annotation of WordNet Verbs with TimeML Event Classes.
Session P28 - Multilinguality and Machine Translation
- Vincent Claveau:
Automatic Translation of Biomedical Terms by Supervised Machine Learning.
- Toni Badia, Maite Melero, Oriol Valentín:
Rapid Deployment of a New METIS Language Pair: Catalan-English.
- Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, Olga Yannoutsou, Toni Badia, Maite Melero, Gemma Boleda, Michael Carl, Paul Schmidt:
Evaluation of a Machine Translation System for Low Resource Languages: METIS-II.
- Marta R. Costa-Jussà, José A. R. Fonollosa, Enric Monte:
Using Reordering in Statistical Machine Translation based on Alignment Block Classification.
- Janne Bondi Johannessen, Torbjørn Nordgård, Lars Nygaard:
Evaluation of Linguistics-Based Translation.
- Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing Ma, Hitoshi Isahara:
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus.
- Qing Ma, Koichi Nakao, Masaki Murata, Hitoshi Isahara:
Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data.
- Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson, Joakim Nivre:
Swedish-Turkish Parallel Treebank.
- Julia S. Trushkina, Lieve Macken, Hans Paulussen:
Sentence Alignment in DPC: Maximizing Precision, Minimizing Human Effort.
- Hiroyuki Kaji, Shin'ichi Tamamura, Dashtseren Erdenebat:
Automatic Construction of a Japanese-Chinese Dictionary via English.
- Kathrin Spreyer, Jonas Kuhn, Bettina Schrader:
Identification of Comparable Argument-Head Relations in Parallel Corpora.
- Svitlana Kurella, Serge Sharoff, Anthony Hartley:
Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages.
- Jörg Tiedemann:
Synchronizing Translated Movie Subtitles.
- Takeshi Abekawa, Kyo Kageura:
Constructing a Corpus that Indicates Patterns of Modification between Draft and Final Translations by Human Translators.
- Violaine Prince, Jacques Chauché:
Building a Bilingual Representation of the Roget Thesaurus for French to English Machine Translation.
- Luka Nerima, Eric Wehrli:
Generating Bilingual Dictionaries by Transitivity.
- Jean Tavernier, Rosa Cowan, Michelle Vanni:
Holy Moses! Leveraging Existing Tools and Resources for Entity Translation.
- Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori S. Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime G. Carbonell, Robert E. Frederking, Erik Peterson, Katharina Probst:
Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages.
- Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel:
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion.
- Hitoshi Isahara, Masao Utiyama, Eiko Yamamoto, Akira Terada, Yasunori Abe:
Application of Resource-based Machine Translation to Real Business Scenes.
- Wolodja Wentland, Johannes Knopp, Carina Silberer, Matthias Hartung:
Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration.
- Marianna Apidianaki:
Translation-oriented Word Sense Induction Based on Parallel Corpora.
- Todor Arnaudov, Ruslan Mitkov:
Smarty - Extendable Framework for Bilingual and Multilingual Comprehension Assistants.
- Péter Halácsy, András Kornai, Péter Németh, Daniel Varga:
Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report.
- Reginald Hobbs, Jamal Laoudi, Clare R. Voss:
MTriage: Web-enabled Software for the Creation, Machine Translation, and Annotation of Smart Documents.
- Clare R. Voss, Jamal Laoudi, Jeffrey Micher:
Exploitation of an Arabic Language Resource for Machine Translation Evaluation: using Buckwalter-based Lookup Tool to Augment CMU Alignment Algorithm.
- Oana Frunza:
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization.
- Karine Megerdoomian, Dan Parvaz:
Low-Density Language Bootstrapping: the Case of Tajiki Persian.
Session P29 - Semantic Resources and their Elicitation
- Lothar Lemnitzer, Holger Wunsch, Piklu Gupta:
Enriching GermaNet with verb-noun relations - a case study of lexical acquisition.
- Diana Santos, Maria do Rosário Silva, Susana Inácio:
What's in a Colour? Studying and Contrasting Colours with COMPARA.
- Beata Trawinski, Jan-Philipp Soehn:
A Multilingual Database of Polarity Items.
- Ernesto William De Luca, Birte Lönneker-Rodman:
Integrating Metaphor Information into RDF/OWL EuroWordNet.
- Richard Johansson, Pierre Nugues:
Comparing Dependency and Constituent Syntax for Frame-semantic Analysis.
- Juan Aparicio, Mariona Taulé, Maria Antònia Martí:
AnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora.
- Davide Buscaldi, Paolo Rosso:
Geo-WordNet: Automatic Georeferencing of WordNet.
- Mario Crespo Miguel, Paul Buitelaar:
Domain-Specific English-To-Spanish Translation of FrameNet.
- Hagen Fürstenau:
Enriching Frame Semantic Resources with Dependency Graphs.
- Bento Carlos Dias-da-Silva, Ariani Di Felippo, Maria das Graças Volpe Nunes:
The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database.
- Roser Morante:
Semantic Role Labeling Tools Trained on the Cast3LB-CoNNL-SemRol Corpus.
- Evi Marzelou, Maria Zourari, Voula Giouli, Stelios Piperidis:
Building a Greek corpus for Textual Entailment.
- Kyoko Kanzaki, Francis Bond, Noriko Tomuro, Hitoshi Isahara:
Extraction of Attribute Concepts from Japanese Adjectives.
- Adriana Roventini, Nilda Ruimy:
Mapping Events and Abstract Entities from PAROLE-SIMPLE-CLIPS to ItalWordNet.
- Davide Picca, Alfio Massimiliano Gliozzo, Massimiliano Ciaramita:
Supersense Tagger for Italian.
- Maria Teresa Pazienza, Armando Stellato:
Clustering of Terms from Translation Dictionaries and Synonyms Lists to Automatically Build more Structured Linguistic Resources.
- Stephan Walter:
Linguistic Description and Automatic Extraction of Definitions from German Court Decisions.
- Veronika Vincze, György Szarvas, Attila Almási, Dóra Szauter, Róbert Ormándi, Richárd Farkas, Csaba Hatvani, János Csirik:
Hungarian Word-Sense Disambiguated Corpus.
- Olga N. Lashevskaja, Olga Yu. Shemanaeva:
Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives.
- Mohamed Attia, Mohsen Rashwan, Ahmed Ragheb, Mohamed Al-Badrashiny, Husein Al-Basoumy:
A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields.
- Doaa Samy, Ana González-Ledesma:
Pragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English).
- Patcharee Varasai, Chaveevan Pechsiri, Thana Sukvaree, Vee Satayamas, Asanee Kawtrakul:
Building an Annotated Corpus for Text Summarization and Question Answering.
Session P30 -Sentiment and Opinion Analysis
- Jonas Sjöbergh, Kenji Araki:
A Multi-Lingual Dictionary of Dirty Words.
- Jonas Sjöbergh, Kenji Araki:
What is poorly Said is a Little Funny.
- Yves Bestgen:
Building Affective Lexicons from Specific Corpora for Automatic Sentiment Analysis.
- Ruifeng Xu, Yunqing Xia, Kam-Fai Wong, Wenjie Li:
Opinion Annotation in On-line Chinese Product Reviews.
- Xiwen Cheng, Feiyu Xu:
Fine-grained Opinion Topic and Polarity Identification.
- Kugatsu Sadamitsu, Satoshi Sekine, Mikio Yamamoto:
Sentiment Analysis Based on Probabilistic Models Using Inter-Sentence Information.
- Marco Guerini, Carlo Strapparava, Oliviero Stock:
Valentino: A Tool for Valence Shifting of Natural Language Texts.
Last update Fri May 25 08:25:40 2012
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page