Efficient Text Proximity Search

Schenkel, Ralf and Broschart, Andreas and Hwang, Seungwon and Theobald, Martin and Weikum, Gerhard (2007) Efficient Text Proximity Search. In: 14th String Processing and Information Retrieval Symposium.

Full text not available from this repository.

Abstract

In addition to purely occurrence-based relevance models, term proximity has been frequently used to enhance the retrieval quality of keyword-oriented retrieval systems. While effective scoring functions that incorporate proximity have been proposed, there has been little work on algorithms or access methods for evaluating them efficiently. This paper presents an efficient evaluation framework that includes a proximity scoring function integrated within a top-k query engine for text retrieval. We propose precomputed and materialized index structures that boost performance. Extensive experiments on a very large text benchmark collection demonstrate the increased retrieval effectiveness and efficiency of our framework. In combination with static index pruning for the proximity lists, our algorithm achieves an improvement of two orders of magnitude compared to a term-based top-k evaluation, with significantly improved result quality.
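
The idea of precomputing proximity scores for term pairs and reading them at query time can be illustrated with a small sketch. This is not the paper's scoring function, index layout, or top-k algorithm: the inverse-square distance contribution, the whitespace tokenizer, and the names `build_pair_index` and `top_k` are all assumptions made for illustration only.

```python
from collections import defaultdict
from itertools import combinations


def build_pair_index(docs):
    """Precompute a per-document proximity score for every unordered term pair.

    Assumption (illustrative only): each co-occurrence of two distinct terms
    at positions p1 and p2 contributes 1 / (p1 - p2)^2 to the pair's score.
    """
    index = defaultdict(dict)  # (term_a, term_b) -> {doc_id: score}
    for doc_id, text in docs.items():
        # Record the positions of every term in this document.
        positions = defaultdict(list)
        for pos, term in enumerate(text.lower().split()):
            positions[term].append(pos)
        # Pair keys are alphabetically ordered so lookups are unambiguous.
        for t1, t2 in combinations(sorted(positions), 2):
            score = sum(
                1.0 / (p1 - p2) ** 2
                for p1 in positions[t1]
                for p2 in positions[t2]
            )
            index[(t1, t2)][doc_id] = score
    return index


def top_k(index, query_terms, k):
    """Rank documents by the summed precomputed scores of all query term pairs.

    Query terms are assumed to be lowercase, matching the tokenizer above.
    """
    totals = defaultdict(float)
    for pair in combinations(sorted(set(query_terms)), 2):
        for doc_id, score in index.get(pair, {}).items():
            totals[doc_id] += score
    # Highest score first; ties are broken by document id.
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))[:k]


docs = {
    "d1": "text proximity search",
    "d2": "text about efficient search",
}
pair_index = build_pair_index(docs)
ranking = top_k(pair_index, ["text", "search"], k=2)
```

In this toy collection, "text" and "search" are two positions apart in `d1` and three apart in `d2`, so `d1` ranks first. The sketch materializes every term pair in a document, which grows quadratically with vocabulary size; a practical index would need to bound this, for example by limiting the distance window or pruning the lists.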

Item Type: Conference or Workshop Item (Paper)
Subjects: DBIS Research > Publications
Divisions: Faculty of Engineering, Electronics and Computer Science > Institute of Databases and Information Systems > DBIS Research and Teaching > DBIS Research > Publications
Depositing User: Prof. Dr. Martin Theobald
Date Deposited: 09 Sep 2015 18:57
Last Modified: 09 Sep 2015 18:57
URI: http://dbis.eprints.uni-ulm.de/id/eprint/1257
