dblp.uni-trier.de www.dagstuhl.de www.uni-trier.de

Improved Techniques for Processing Queries in Full-Text Systems.

Yaacov Choueka, Aviezri S. Fraenkel, Shmuel T. Klein, E. Segal: Improved Techniques for Processing Queries in Full-Text Systems. SIGIR 1987: 306-315
@inproceedings{DBLP:conf/sigir/ChouekaFKS87,
  author    = {Yaacov Choueka and
               Aviezri S. Fraenkel and
               Shmuel T. Klein and
               E. Segal},
  title     = {Improved Techniques for Processing Queries in Full-Text Systems},
  booktitle = {Proceedings of the Tenth Annual International ACM SIGIR Conference
               on Research and Development in Information Retrieval, New Orleans,
               Louisiana, USA, June 3-5, 1987},
  publisher = {ACM},
  year      = {1987},
  pages     = {306-315},
  ee        = {db/conf/sigir/ChouekaFKS87.html},
  crossref  = {DBLP:conf/sigir/87},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

In static full-text retrieval systems, which accommodate metrical as well as Boolean operators, the traditional approach to query processing uses a "concordance", from which large sets of coordinates are retrieved and then merged and/or collated. Alternatively, in a system with l documents, the concordance can be replaced by a set of bit-maps of fixed length l, which are constructed for every different word of the database and serve as occurrence maps. We propose to combine the concordance and bit-map approaches, and show how this can speed up the processing of queries: fast ANDing and ORing of the maps in a preprocessing stage, lead to large I/O savings in collating coordinates of keywords needed to satisfy the metrical and Boolean constraints. Moreover, the bit-maps give partial information on the distribution of the coordinates of the keywords, which can be used when queries must be processed by stages, due to their complexity and the sizes of the involved sets of coordinates. The new techniques are partially implemented at the Responsa Retrieval Project.

Copyright © 1987 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.


ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 3, SIGIR, DASFAA'97, OODBS'86" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...

Printed Edition

Proceedings of the Tenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, Louisiana, USA, June 3-5, 1987. ACM 1987
Contents CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

Online Edition: ACM Digital Library

Citation page

Last update Fri May 25 08:37:42 2012 CET by the DBLP TeamThis material is Open Data Data released under the ODC-BY 1.0 license — See also our legal information page