Integrating Document Features for Entity Ranking

2008, Lecture Notes in Computer Science

https://doi.org/10.1007/978-3-540-85902-4_29

Abstract

The Knowledge Media Institute of the Open University participated in the entity ranking and entity list completion tasks of the Entity Ranking Track in INEX 2007. In both tasks, we considered document features in addition to a basic document-content-based relevance model. These features include the categorization of documents, the relevance of category names to the query, and hierarchical relations between categories. Furthermore, building on our TREC 2006 and 2007 expert search approach, we applied a co-occurrence-based entity association discovery model to the two tasks, on the assumption that relevant entities often co-occur with query terms or with given relevant entities in documents. Our initial experimental results show that taking the predefined category, its children, and its grandchildren into account in the document-content-based relevance model significantly improves the performance of our entity ranking approach. Considering the predefined category's parents, a category-name-based relevance model, and the co-occurrence model is not shown to be helpful in entity ranking and list completion, respectively.
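The idea of widening the predefined category to its children and grandchildren can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy category graph, the content scores, and the multiplicative `boost` are all assumptions made for illustration, since the abstract does not say how the category evidence is combined with the content-based score.

```python
from collections import deque

def descendants_within(children, root, max_depth=2):
    """Return root plus every category reachable in at most max_depth child links."""
    seen = {root}
    queue = deque([(root, 0)])
    while queue:
        category, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand below grandchildren
        for child in children.get(category, ()):
            if child not in seen:
                seen.add(child)
                queue.append((child, depth + 1))
    return seen

def rank_entities(content_scores, doc_categories, children, target, boost=2.0):
    """Boost documents filed under the target category, its children, or grandchildren.

    The multiplicative boost is a hypothetical combination rule, not taken from the paper.
    """
    allowed = descendants_within(children, target)
    ranked = []
    for doc, score in content_scores.items():
        if set(doc_categories.get(doc, ())) & allowed:
            score *= boost
        ranked.append((doc, score))
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)

# Toy data (hypothetical): a four-level category chain and three candidate pages.
children = {
    "Art museums": ["Art museums in Europe"],
    "Art museums in Europe": ["Art museums in France"],
    "Art museums in France": ["Museums in Paris"],
}
content_scores = {"Louvre": 1.0, "Paris": 1.5, "Prado": 0.8}
doc_categories = {
    "Louvre": {"Art museums in France"},
    "Paris": {"Capitals in Europe"},
    "Prado": {"Art museums in Europe"},
}
ranking = rank_entities(content_scores, doc_categories, children, "Art museums")
```

In this toy setup, "Museums in Paris" lies three levels below the target and is therefore excluded, while pages filed under a child or grandchild category move ahead of a page that scores higher on content alone.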
