Academia.eduAcademia.edu

Outline

Mining hepatitis data with temporal abstraction

2003, Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '03

Abstract

The hepatitis temporal database collected at Chiba university hospital between 1982-2001 was recently given to challenge the KDD research. The database is large where each patient corresponds to 983 tests represented as sequences of irregular timestamp points with different lengths. This paper presents a temporal abstraction approach to mining knowledge from this hepatitis database. Exploiting hepatitis background knowledge and data analysis, we introduce new notions and methods for abstracting short-term changed and long-term changed tests. The abstracted data allow us to apply different machine learning methods for finding knowledge part of which is considered as new and interesting by medical doctors.

References (18)

  1. REFERENCES
  2. Bellazzi, R., Larizza, C., Magni, P., Monntani, S., and Ste- fanelli, M., "Intelligent Analysis of Clinic Time Series: An Application in the Diabetes Mellitus Domain", Artificial In- telligence in Medicine 20 (2000), 37-57.
  3. Clementine 7.0 User's Guide, SPSS, 2002.
  4. Larizza, C., Bellazzi, R., and Riva, A., "Temporal abstrac- tions for diabetic patients management", Artificial Intelli- gence in Medicine, Keravnou, E. et al. (eds.), Proc.AIME-97, 1997, 319-30, Springer.
  5. Han, J. and Kamber, M., Data Mining: Concepts and Tech- niques, Morgan Kaufmann, 2000.
  6. Ho, T.B., Nguyen, T.D., Nguyen, D.D., and Kawasaki, S., "Visualization Support for User-Centered Model Selection in Knowledge Discovery and Data Mining", International Journal of Artificial Intelligence Tools, Vol. 10 (2001), No. 4, 691-713.
  7. Ho, T.B., Nguyen, T.D., and Nguyen, D.D., "Visualization Support for a User-Centered KDD Process", ACM Interna- tional Conference on Knowledge Discovery and Data Mining KDD-02, Edmonton, 519-524.
  8. Hirano, S. and Tsumoto, S., "Mining Similar Temporal Pat- terns in Long Time-Series Data and Its Application to Medi- cine", IEEE International Conference on Data Mining ICDM 2002, Maebashi, December 2002, 219-226.
  9. Horn, W., Miksch, S., Egghart, G., Popow, C., and Paky, F., "Effective Data Validation of High-Frequency Data: Time- Point-, Time-Interval-, and Trend-Based Methods", Com- puter in Biology and Medicine, Special Issue: Time-Oriented Systems in Medicine, 27(5), 389-409, 1997.
  10. Lavrak, N., "Selected Techniques for Data Mining in Medi- cine", Artificial Intelligence in Medicine, 16, 3-23, 1999.
  11. Miksch S., Horn W., Popow C., and Paky F., "Utilizing Temporal Data Abstraction for Data Validation and Therapy Planning for Artificially Ventilated Newborn Infants", Artifi- cial Intelligence in Medicine, 8(6) 543-576, 1996.
  12. MedicinNet.com http://www.focusoncancer.com/script/ main art.asp?articlekey=1917&rd=1.
  13. Nguyen, D.T., Ho, T.B., "An Interactive-Graphic System for Decision Tree Induction", Journal of Japanese Society for Artificial Intelligence, Vol. 14, N. 1, 1999, 131-138.
  14. Ohsaki, M., Sato, Y., Yokoi, H., Yamaguchi, T., "A Rule Discovery Support System for Sequential Medical Data. ⎯In the Case Study of a Chronic Hepatitis Dataset", Interna- tional Workshop on Active Mining, IEEE International Con- ference on Data Mining ICDM 2002, Maebashi, December 2002, 97-102.
  15. PKDD02 challenge http://www.cs.helsinki.fi/events/ eclpkdd/challenge.html.
  16. Quinlan, J.R., C4.5: Programs for Machine Learning, Mor- gan Kaufmann, 1993.
  17. Shahar, Y. and Musen, M.A., "Knowledge-Based Temporal Abstraction in Clinical Domains", Artificial Intelligence in Medicine, 8 (1996), 267-298.
  18. Shahar, Y., "A Framework for Knowledge-based Temporal Abstraction", Artificial Intelligence, 90 (1997), 79-133.