Method Combination For Document Filtering.
David A. Hull, Jan O. Pedersen, Hinrich Schütze:
Method Combination For Document Filtering.
SIGIR 1996: 279-287@inproceedings{DBLP:conf/sigir/HullPS96,
author = {David A. Hull and
Jan O. Pedersen and
Hinrich Sch{\"u}tze},
title = {Method Combination For Document Filtering},
booktitle = {SIGIR},
year = {1996},
pages = {279-287},
ee = {db/conf/sigir/HullPS96.html},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
There is strong empirical and theoretic evidence that combination of retrieval
methods can improve performance. In this paper, we systematically compare
combination strategies in the context of document filtering, using queries
from the Tipster reference corpus. We find that simple averaging strategies do
indeed improve performance, but that direct averaging of probability estimates
is not the correet approach. Instead, the probability estimates must be
renormalized using logistic regression on the known relevance judgments.
We examine more complex combination strategies but find them less successful
due to the high correlations among our filtering methods which are optimized
over the same training data and employ similar document representations.
Copyright © 1996 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 2 Issue 3, SIGIR, DASFAA'97, OODBS'86" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
Printed Edition
Hans-Peter Frei, Donna Harman, Peter Schäuble, Ross Wilkinson (Eds.):
Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'96, August 18-22, 1996, Zurich, Switzerland (Special Issue of the SIGIR Forum).
ACM 1996, ISBN 0-89791-792-8
Contents
Citation page
Last update Fri May 25 08:37:48 2012
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page