dblp.uni-trier.de www.dagstuhl.de www.uni-trier.de

Query Expansion Using Domain Adapted, Weighted Thesaurus in an Extended Boolean Model.

Oh-Woog Kwon, Myoung-Cheol Kim, Key-Sun Choi: Query Expansion Using Domain Adapted, Weighted Thesaurus in an Extended Boolean Model. CIKM 1994: 140-146
@inproceedings{DBLP:conf/cikm/KwonKC94,
  author    = {Oh-Woog Kwon and
               Myoung-Cheol Kim and
               Key-Sun Choi},
  title     = {Query Expansion Using Domain Adapted, Weighted Thesaurus in an
               Extended Boolean Model},
  booktitle = {Proceedings of the Third International Conference on Information
               and Knowledge Management (CIKM'94), Gaithersburg, Maryland, November
               29 - December 2, 1994},
  publisher = {ACM},
  year      = {1994},
  pages     = {140-146},
  ee        = {db/conf/cikm/KwonKC94.html, http://doi.acm.org/10.1145/191246.191270},
  crossref  = {DBLP:conf/cikm/94},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

In this paper, we address there important issues with query expansion using a thesaurus; how to give weights to the terms in expanded queries, how to select additional search terms in the thesaurus, and how to enrich the terms in the manual thesaurus (namely, thesaurus reconstruction). To weight the terms in expanded queries, we construct the weighted thesaurus that has a similarity value between the terms in the thesaurus, using statistical co-occurrence in a corpus. To enrich the terms in the manual thesaurus, domain dependent terms which occur in a corpus are inserted into the weighted thesaurus using the co-occurrence information. In this paper, the reconstructed thesaurus with weights is defined as a domain-adapted, weighted thesaurus. Then we explain query expansion using the domain-adapted, weighted thesaurus in an extended Boolean retrieval model. To select additional search terms during query expansion, our model uses semi-automatic query expansion and a restriction method. In the experiments, our system had almost twice the recall of the boolean retrieval system not using the thesaurus or the query expansion retrieval system using the original thesaurus. And also, the precision of our system was almost the same precision as the other systems.

Copyright © 1994 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.


ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 4, CIKM, DOLAP, GIS, SIGFIDET, ..." and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...

Printed Edition

Proceedings of the Third International Conference on Information and Knowledge Management (CIKM'94), Gaithersburg, Maryland, November 29 - December 2, 1994. ACM 1994
Contents CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

Online Edition

Citation Page

Last update Thu May 24 04:14:44 2012 CET by the DBLP TeamThis material is Open Data Data released under the ODC-BY 1.0 license — See also our legal information page