Academia.eduAcademia.edu

Outline

A Model for Ranking Entities and Its Application to Wikipedia

2008, Latin American Web …

Abstract
sparkles

AI

This paper presents a model for ranking entities, emphasizing the distinction between entity search and traditional document search. It develops a general framework applicable to various contexts, particularly focusing on the Wikipedia corpus, and demonstrates its effectiveness through algorithms that integrate Link Analysis, Natural Language Processing, and Named Entity Recognition. Empirical evaluation shows significant improvements in retrieval effectiveness for entity ranking tasks.

References (18)

  1. B.T. Adler and L. de Alfaro. A content-driven reputa- tion system for the Wikipedia. Proceedings of WWW, 7:261-270, 2007.
  2. Alias-i. LingPipe Named Entity Tagger, 2008. Avail- able at: http://www.alias-i.com/lingpipe/.
  3. Holger Bast, Alexandru Chitea, Fabian Suchanek, and Ingmar Weber. Ester: efficient search on text, en- tities, and relations. In SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pages 671-678, New York, NY, USA, 2007. ACM.
  4. Paolo Bouquet, Harry Halpin, Heiko Stoermer, and Giovanni Tummarello, editors. Proceedings of the 1st international workshop on Identity and Reference on the Semantic Web (IRSW2008) at the 5th Euro- pean Semantic Web Conference (ESWC 2008), Tener- ife, Spain, June 2nd, 2008, CEUR Workshop Proceed- ings. CEUR-WS.org, 2008.
  5. Paolo Bouquet, Heiko Stoermer, and Barbara Baz- zanella. An Entity Name System (ENS) for the Se- mantic Web. In Proceedings of ESWC, pages 258- 272, 2008.
  6. Paolo Bouquet, Heiko Stoermer, Giovanni Tum- marello, and Harry Halpin, editors. Proceedings of the WWW2007 Workshop I 3 : Identity, Identifiers, Identifi- cation, Entity-Centric Approaches to Information and Knowledge Management on the Web, Banff, Canada, May 8, 2007, volume 249 of CEUR Workshop Pro- ceedings. CEUR-WS.org, 2007.
  7. A. Broder. A taxonomy of web search. ACM SIGIR Forum, 36(2):3-10, 2002.
  8. T. Cheng and K.C.C. Chang. Entity Search En- gine: Towards Agile Best-Effort Information Integra- tion over the Web. Proceedings of CIDR2007, pages 108-113, 2007.
  9. Tao Cheng, Xifeng Yan, and Kevin Chen-Chuan Chang. Entityrank: Searching entities directly and holistically. In Proceedings of VLDB, pages 387-398, 2007.
  10. Gianluca Demartini, Claudiu S. Firan, Tereza Iofciu, and Wolfgang Nejdl. Semantically enhanced entity ranking. In Proceedings of The Ninth International Conference on Web Information Systems Engineering (WISE 2008), 2008.
  11. L. Denoyer and P. Gallinari. The Wikipedia XML cor- pus. ACM SIGIR Forum, 40(1):64-69, 2006.
  12. Christiane Fellbaum, editor. WordNet: An Electronic Lexical Database (Language, Speech, and Communi- cation). The MIT Press, May 1998.
  13. Themis Palpanas, Junaid Chaudhry, Periklis Andrit- sos, and Yannis Velegrakis. Entity data management in okkam. In 2nd International Workshop on Semantic Web Architectures For Enterprises held in conjunction with DEXA 08, 2008.
  14. Jovan Pehcevski, Anne-Marie Vercoustre, and James A. Thom. Exploiting locality of wikipedia links in entity ranking. In Proceedings of ECIR, pages 258-269, 2008.
  15. T. Rölleke, T. Tsikrika, and G. Kazai. A general ma- trix framework for modelling Information Retrieval. Information Processing and Management, 42(1):4- 30, 2006.
  16. Giovanni Semeraro, Marco Degemmis, Pasquale Lops, and Pierpaolo Basile. Combining learning and word sense disambiguation for intelligent user pro- filing. In Twentieth International Joint Conference on Artificial Intelligence, January 6-12, 2007, Hyder- abad, India, 2007.
  17. Jianhan Zhu, Arjen P. de Vries, Gianluca Demartini, and Tereza Iofciu. Relation Retrieval for Entities and Experts. In Future Challenges in Expertise Retrieval (fCHER 2008), SIGIR 2008 Workshop, 2008.
  18. Cäcilia Zirn, Vivi Nastase, and Michael Strube. Dis- tinguishing between instances and classes in the wikipedia taxonomy. In Proceedings of ESWC, pages 376-387, 2008.