Ranking Scientific Publications Based on Their Citation Graph

2009

Abstract

CDS Invenio is the web-based integrated digital library system developed at CERN. It is a suite of applications that provides the framework and tools for building and managing an autonomous digital library server. Within this framework, the goal of this project is to implement new ranking methods based on the bibliographic citation graph extracted from the CDS Invenio database. As a first step, we implemented the Citation Count as a baseline ranking method. The major disadvantage of this method is that all citations are treated equally, regardless of their importance and their publication date. To overcome this drawback, we consider two approaches: a link-based approach, which extends the PageRank model to the bibliographic citation graph, and a time-dependent approach, which incorporates time into the citation counts. In addition, we combine these two approaches in a hybrid model based on a time-dependent PageRank. In the present document, we describe the conceptua...
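The three basic ranking ideas named in the abstract can be illustrated on a toy citation graph. The Python sketch below is a minimal illustration under stated assumptions, not the CDS Invenio implementation: the graph, the publication years, the exponential decay with a five-year half-life, the reference year 2009, and the damping factor 0.85 are all hypothetical choices made for the example. The hybrid time-dependent PageRank is left out because the abstract does not give its form.

```python
# Toy citation graph (hypothetical data): paper -> papers it cites.
CITES = {
    "A": [],
    "B": ["A"],
    "C": ["A", "B"],
    "D": ["A", "C"],
}
# Hypothetical publication years of the papers above.
YEAR = {"A": 1990, "B": 2000, "C": 2005, "D": 2008}


def citation_count(cites):
    """Baseline: number of incoming citations, all weighted equally."""
    counts = {paper: 0 for paper in cites}
    for targets in cites.values():
        for cited in targets:
            counts[cited] += 1
    return counts


def time_weighted_count(cites, year, now=2009, half_life=5.0):
    """Citation count where each citation is discounted by the age of the
    citing paper. The exponential form and half-life are assumptions."""
    scores = {paper: 0.0 for paper in cites}
    for citing, targets in cites.items():
        weight = 0.5 ** ((now - year[citing]) / half_life)
        for cited in targets:
            scores[cited] += weight
    return scores


def pagerank(cites, damping=0.85, iters=100):
    """Plain PageRank by power iteration on the citation graph.
    Papers that cite nothing spread their rank uniformly."""
    n = len(cites)
    rank = {paper: 1.0 / n for paper in cites}
    for _ in range(iters):
        dangling = sum(rank[p] for p, targets in cites.items() if not targets)
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in cites}
        for citing, targets in cites.items():
            for cited in targets:
                new_rank[cited] += damping * rank[citing] / len(targets)
        rank = new_rank
    return rank


if __name__ == "__main__":
    print("citation count:", citation_count(CITES))
    print("time-weighted :", time_weighted_count(CITES, YEAR))
    print("pagerank      :", pagerank(CITES))
```

In this toy graph, papers B and C tie under the plain citation count (one citation each), while the time-weighted count ranks C above B because C is cited by a more recent paper. This is the kind of distinction the time-dependent approach is meant to capture.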
