Publikationsansicht

Using Wikipedia Categories and Links in Entity Ranking (2008)

Abstract
This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our approach utilises the known categories, the link structure of Wikipedia, as well as the link co-occurrences with the examples (when provided) to improve the effectiveness of entity ranking. Our experiments on the training data set demonstrate that the use of categories and the link structure of Wikipedia, together with entity examples, can significantly improve entity retrieval effectiveness. We also use our system for the ad hoc tasks by inferring target categories from the title of the query. The results were worse than when using a full-text search engine, which confirms our hypothesis that ad hoc retrieval and entity retrieval are two different tasks.

Details der Publikation
Download http://hal.inria.fr/inria-00192489/en/
Herausgeber HAL - CCSD
Archiv INRIA a CCSD electronic archive server based on P.A.O.L (France)
Keywords Computer Science/Information Retrieval, Computer Science/Document and Text Processing, Entity ranking, XML retrieval, Wikipedia, Linkrank, categories
Typ proceeding with peer review
Sprache Englisch
Verknüpfungen http://hal.inria.fr/docs/00/19/24/89/PDF/inex07.pdf