Academia.eduAcademia.edu

Outline

Report on CLIR task for the NTCIR-5 evaluation campaign

2005

Abstract

This paper describes our second participation in an evaluation campaign involving the Chinese, Japa-nese, Korean and English languages (NTCIR-5). Our participation is motivated by four objectives: 1) study the retrieval performances of various IR models for these languages; 2) compare the relative retrieval effectiveness of bigram and automatic word-segmenting approaches for Chinese and Japanese languages; 3) propose a new blind-query expansion hopefully capable of improving mean average preci-sion; and 4) evaluate the relative performance of the various merging strategies used to combine separate result lists extracted from a corpus written in

References (12)

  1. Amati, G., & van Rijsbergen, C.J. Probabilistic mod- els of information retrieval based on measuring the divergence from randomness. ACM-TOIS, 20(4):357- 389, 2002.
  2. Buckley, C., Singhal, A., Mitra, M., & Salton, G. New retrieval approaches using SMART. Proceedings of TREC-4, pp. 25-48, 1996.
  3. Chen, A., & Gey, F.C. Experiments on cross-lan- guage and patent retrieval at NTCIR-3 workshop. Proceedings of NTCIR-3, 2003.
  4. Fox, E.A., & Shaw, J.A. Combination of multiple searches. Proceedings TREC-2, pp. 243-249, 1994.
  5. Kishida, K., Chen, K.-H., Lee, S., Kuriyama, K., Kando, N., Chen, H.-H., & Myaeng, S.H. Overview of Task at the Fifth NTCIR Workshop. Proceedings of NTCIR-5, Tokyo, 2005.
  6. Luk, R.W.P., & Kwok, K.L.. A comparison of Chinese document indexing strategies and retrieval models. ACM-TALIP, 1(3): 225-268, 2002.
  7. Matsumoto, Y., Kitauchi, A., Yamashita, T., Hirano, Y., Matsuda, H., & Asahara, M. Japanese morpho- logical analysis system ChaSen. Technical Report NAIST-IS-TR99009, NAIST, 1999 (available at http://chasen.aist-nara.ac.jp/).
  8. Robertson, S.E., Walker, S., & Beaulieu, M. Experi- mentation as a way of life: Okapi at TREC. IP&M, 36(1), 95-108, 2000.
  9. Savoy, J. Statistical inference in retrieval effective- ness evaluation. IP&M, 33(4):495-512, 1997.
  10. Savoy, J. Combining multiple strategies for effec- tive monolingual and cross-lingual retrieval. I R Journal, 7(1-2):121-148, 2004.
  11. Savoy, J. Comparative study of monolingual and multilingual search models for use with Asian lan- guages. ACM TALIP, 4(3), 2005.
  12. Singhal, A., Choi, J., Hindle, D., Lewis, D.D., & Pereira, F. AT&T at TREC-7. Proceedings of TREC- 7, 239-251, 1999.