Learning to Extract Information From Text Based on User-Provided Examples.

Scott B. Huffman: Learning to Extract Information From Text Based on User-Provided Examples. CIKM 1996: 154-163
@inproceedings{DBLP:conf/cikm/Huffman96,
  author    = {Scott B. Huffman},
  title     = {Learning to Extract Information From Text Based on User-Provided
               Examples},
  booktitle = {CIKM '96, Proceedings of the Fifth International Conference on
               Information and Knowledge Management, November 12 - 16, 1996,
               Rockville, Maryland, USA},
  publisher = {ACM},
  year      = {1996},
  pages     = {154-163},
  ee        = {db/conf/cikm/Huffman96.html, http://doi.acm.org/10.1145/238355.238477},
  crossref  = {DBLP:conf/cikm/96},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

A growing population of users want to extract a growing variety of information from on-line texts. Unfortunately, current information extraction systems typically require experts to hand-build dictionaries of extraction patterns for each new type of information to be extracted. This paper presents a system that can learn dictionaries of extraction patterns directly from user-provided examples of texts and events to be extracted from them. The system, called LIEP, learns patterns that recognize relationships between key constituents based on local syntax. Patterns take the form of paths through a finite-state machine. Sets of patterns learned by LIEP for a sample extraction task perform nearly at the level of a hand-built dictionary of patterns.

Copyright © 1996 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.


ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 4, CIKM, DOLAP, GIS, SIGFIDET, ..." and ...
DVD Version: Load "ACM SIGMOD Anthology DVD 1" and ...

Printed Edition

CIKM '96, Proceedings of the Fifth International Conference on Information and Knowledge Management, November 12 - 16, 1996, Rockville, Maryland, USA. ACM 1996


Last update Thu May 24 04:14:44 2012 CET by the DBLP Team. This material is Open Data released under the ODC-BY 1.0 license.