Efficient and Self-Tuning Incremental Query Expansion for Top-k Query Processing

Theobald, Martin and Schenkel, Ralf and Weikum, Gerhard (2005) Efficient and Self-Tuning Incremental Query Expansion for Top-k Query Processing. In: 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

Full text not available from this repository.

Abstract

We present a novel approach for efficient and self-tuning query expansion that is embedded into a top-k query processor with candidate pruning. Traditional query expansion methods select expansion terms whose thematic similarity to the original query terms is above some specified threshold, thus generating a disjunctive query with much higher dimensionality. This poses three major problems: 1) the need for hand-tuning the expansion threshold, 2) the potential topic dilution with overly aggressive expansion, and 3) the drastically increased execution cost of a high-dimensional query. The method developed in this paper addresses all three problems by dynamically and incrementally merging the inverted lists for the potential expansion terms with the lists for the original query terms. A priority queue is used for maintaining result candidates, the pruning of candidates is based on Fagin's family of top-k algorithms, and optionally probabilistic estimators of candidate scores can be used for additional pruning. Experiments on the TREC collections for the 2004 Robust and Terabyte tracks demonstrate the increased efficiency, effectiveness, and scalability of our approach.

Item Type: Conference or Workshop Item (Paper)
Subjects: DBIS Research > Publications
Divisions: Faculty of Engineering, Electronics and Computer Science > Institute of Databases and Informations Systems > DBIS Research and Teaching > DBIS Research > Publications
Depositing User: Prof. Dr. Martin Theobald
Date Deposited: 09 Sep 2015 19:57
Last Modified: 09 Sep 2015 19:57
URI: http://dbis.eprints.uni-ulm.de/id/eprint/1276

Actions (login required)

View Item
View Item