Academia.eduAcademia.edu

Outline

Attribute oriented induction with star schema

2010, International Journal of Database Management Systems

https://doi.org/10.5121/IJDMS.2010.2202

Abstract

This paper will propose a novel star schema attribute induction as a new attribute induction paradigm and as improving from current attribute oriented induction. A novel star schema attribute induction will be examined with current attribute oriented induction based on characteristic rule and using non rule based concept hierarchy by implementing both of approaches. In novel star schema attribute induction some improvements have been implemented like elimination threshold number as maximum tuples control for generalization result, there is no ANY as the most general concept, replacement the role concept hierarchy with concept tree, simplification for the generalization strategy steps and elimination attribute oriented induction algorithm. Novel star schema attribute induction is more powerful than the current attribute oriented induction since can produce small number final generalization tuples and there is no ANY in the results.

References (23)

  1. Alves, R. and Belo, O. (2007), 'Multidimensional Data Mining', SDDI'07 -Doctoral Symposium.
  2. Cai, Y. (1989), 'Attribute-oriented induction in relational databases', Master thesis, Simon Fraser University.
  3. Chen,M., Han, J. and Yu, P.S. (1996), 'Data Mining: An overview from database perspective', IEEE Trans. Knowledge and Data Eng, 8(6),866-883.
  4. Cheung, D.W., Hwang, H.Y., Fu, A.W. and Han, J. (2000), 'Efficient rule-based attribute-oriented induction for data mining', Journal of Intelligent Information Systems, 15(2), 175-200.
  5. Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D. and Venkatrao, M. (1997), 'Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross- Tab, and Sub-Totals', Data Mining and Knowledge Discovery, 1, 29-53.
  6. Han, J., Cai, Y. and Cercone, N. (1992), 'Knowledge discovery in databases: An attribute-oriented approach', in Proceedings 18th International Conference Very Large Data Bases, Vancouver, British Columbia, 547-559.
  7. Han,J., Cai, Y., and Cercone, N. (1993), 'Data-driven discovery of quantitative rules in relational databases', IEEE Trans on Knowl and Data Engin, 5(1), 29-40.
  8. Han, J. and Fu, Y.(1994),'Dynamic Generation and Refinement of Concept Hierarchies for Knowledge Discovery in Databases', in Proceedings of AAAI Workshop on Knowledge Discovery in Databases, 157-168.
  9. Han, J., Fu, Y., Huang, Y., Cai, Y., and Cercone, N. (1994), 'DBLearn: a system prototype for knowledge discovery in relational databases', ACM SIGMOD Record, 23(2), 516.
  10. Han, J., Fu, Y., and Tang, S. (1995a), 'Advances of the DBLearn system for knowledge discovery in large databases', in Proceedings of the 14th international Joint Conference on Artificial intelligence, 2049-2050.
  11. Han, J., Cai, Y., Cercone, N. and Huang, Y. (1995b), 'Discovery of Data Evolution Regularities in Large Databases', Journal of Computer and Software Engineering, 3(1), 41-69.
  12. Han,J., Fu, Y., Koperski, K., Melli,G., Wang, W. and Zaiane,O.R. (1995c), 'Knowledge Mining in Databases: An Integration of Machine Learning Methodologies with Databases Technologies', Canadian Artificial Intelligence Magazine, 38, 4-8.
  13. Han,J., Fu, Y.,Wang, W., Chiang, J., Gong, W., Koperski, K., Li,D., Lu, Y., Rajan,A., Stefanovic,N., Xia,B. and Zaiane,O.R. (1996), 'DBMiner:A system for mining knowledge in large relational databases', in Proceedings Int'l Conf. on Data Mining and Knowledge Discovery, 250-255.
  14. Han, J., Chiang, J. Y., Chee, S., Chen, J., Chen, Q., Cheng, S., Gong, W., Kamber, M., Koperski, K., Liu, G., Lu, Y., Stefanovic, N., Winstone, L., Xia, B. B., Zaiane, O. R., Zhang, S., and Zhu, H. (1997), 'DBMiner: a system for data mining in relational databases and data warehouses', In Proceedings of the 1997 Conference of the Centre For Advanced Studies on Collaborative Research, 8.
  15. Han,J., Lakshmanan, L.V.S. and Ng, R.T. (1999), 'Constraint-based, multidimensional data mining', IEEE Computer, 32(8), 46-50.
  16. Hsu, C. (2004),'Extending attribute-oriented induction algorithm for major values and numeric values', Expert Systems with Applications, 27, 187-202.
  17. Hu, X. (2003), 'DB-HReduction: A Data Preprocessing Algorithm for Data Mining Applications', Applied Mathematics Letters,16(6),889-895.
  18. Huang, Y. (1993), 'Intelligent Query Answering by Knowledge Discovery Techniques', Master thesis, Simon Fraser University.
  19. Huang, Y. and Lin, S. (1996), 'An Efficient Inductive Learning Method for Object- Oriented Database Using Attribute Entropy', IEEE Transactions on Knowledge and Data Engineering, 8(6),946-951
  20. Wu, Y., Chen, Y. and Chang, R. (2009), 'Generalized Knowledge Discovery from Relational Databases', International Journal of Computer Science and Network, 9(6),148-153.
  21. Muyeba,M.K. and Keane,J.A. (1999), 'Extending attribute-oriented induction as a key- preserving data mining method', in Proceedings 3 rd European Conference on Principles of Data Mining and Knowledge Discovery, Lecture Notes in Computer science, 1704, 448-455.
  22. Muyeba, M. (2005),'On Post-Rule Mining of Inductive Rules using a Query Operator', in Proceedings of Artificial Intelligence and Soft Computing.
  23. Muyeba, M. and Marnadapali, R. (2005),'A framework for Post-Rule Mining of Distributed Rules Bases', in Proceeding of Intelligent Systems and Control.