KORE: keyphrase overlap relatedness for entity disambiguation

Hoffart, Johannes and Seufert, Stephan and Ba Nguyen, Dat and Theobald, Martin and Weikum, Gerhard (2012) KORE: keyphrase overlap relatedness for entity disambiguation. In: 21st ACM International Conference on Information and Knowledge Management (CIKM 2012).

Full text not available from this repository.


Measuring the semantic relatedness between two entities is the basis for numerous tasks in IR, NLP, and Web-based knowledge extraction. This paper focuses on disambiguating names in a Web or text document by jointly mapping all names onto semantically related entities registered in a knowledge base. To this end, we have developed a novel notion of semantic relatedness between two entities represented as sets of weighted (multi-word) keyphrases, with consideration of partially overlapping phrases. This measure improves the quality of prior link-based models, and also eliminates the need for (usually Wikipedia-centric) explicit interlinkage between entities. Thus, our method is more versatile and can cope with long-tail and newly emerging entities that have few or no links associated with them. For efficiency, we have developed approximation techniques based on min-hash sketches and locality-sensitive hashing. Our experiments on semantic relatedness and on named entity disambiguation demonstrate the superiority of our method compared to state-of-the-art baselines.

Item Type: Conference or Workshop Item (Paper)
Subjects: DBIS Research > Publications
Divisions: Faculty of Engineering, Electronics and Computer Science > Institute of Databases and Informations Systems > DBIS Research and Teaching > DBIS Research > Publications
Depositing User: Prof. Dr. Martin Theobald
Date Deposited: 09 Sep 2015 18:49
Last Modified: 09 Sep 2015 18:49
URI: http://dbis.eprints.uni-ulm.de/id/eprint/1214

Actions (login required)

View Item
View Item