Search engine driven author disambiguation

2006, Proceedings of the 6th ACM/IEEE-CS joint …

https://doi.org/10.1145/1141753.1141826

Abstract

In scholarly digital libraries, author disambiguation is the task of attributing a scholarly work to the specific individuals who wrote it. This is critical when several individuals share the same name. We present an approach to this task that analyzes the results of automatically crafted web searches. A key observation is that pages from rare web sites are a stronger source of evidence than pages from common web sites, which we model as Inverse Host Frequency (IHF). Our system achieves an average accuracy of 0.836.
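The abstract does not give the IHF formula, so the following is only a minimal sketch of the idea, assuming IHF is defined by analogy with inverse document frequency: a host that shows up in the results of many queries receives a low weight, and a host that shows up in few receives a high one. The function names (`host_of`, `inverse_host_frequency`, `shared_evidence`), the logarithmic form, and the scoring of two citations by their shared result URLs are illustrative assumptions, not the paper's stated method.

```python
import math
from collections import Counter
from urllib.parse import urlparse


def host_of(url: str) -> str:
    """Return the lowercased host part of a URL."""
    return urlparse(url).netloc.lower()


def inverse_host_frequency(results_per_query):
    """Compute an IDF-style weight for every host (assumed formula).

    results_per_query: list of URL lists, one list per web search.
    A host is counted once per query in which it appears, and its
    weight is log(number of queries / number of queries containing it).
    """
    n_queries = len(results_per_query)
    host_counts = Counter()
    for urls in results_per_query:
        for host in {host_of(u) for u in urls}:
            host_counts[host] += 1
    return {h: math.log(n_queries / c) for h, c in host_counts.items()}


def shared_evidence(urls_a, urls_b, ihf):
    """Sum the host weights of the URLs returned for both searches.

    Hypothetical scoring rule: a higher score suggests the two
    citations are more likely to belong to the same author.
    """
    shared = set(urls_a) & set(urls_b)
    return sum(ihf.get(host_of(u), 0.0) for u in shared)
```

Under these assumptions, a page shared by two searches contributes nothing if its host appears in every result list (such as a large bibliography aggregator), while a shared page on a rarely seen host (such as a personal publication page) contributes a positive weight.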
