Mining Association Rules for Label Ranking
2011, Lecture Notes in Computer Science
Abstract
Recently, a number of learning algorithms have been adapted for label ranking, including instance-based and tree-based methods. In this paper, we continue this line of work by proposing an adaptation of association rules for label ranking based on the APRIORI algorithm. Given that the original APRIORI algorithm does not aim to obtain predictive models, two changes were needed for this achievement. The adaptation essentially consists of using variations of the support and confidence measures based on ranking similarity functions that are suitable for label ranking. Additionally we propose a simple greedy method to select the parameters of the algorithm. We also adapt the method to make a prediction from the possibly conflicting consequents of the rules that apply to an example. Despite having made our adaptation from a very simple variant of association rules for classification, partial results clearly show that the method is making valid predictions. Additionally, they show that it competes well with state-of-the-art label ranking algorithms.
References (21)
- Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proc. 20th Int. Conf. Very Large Data Bases, VLDB. vol. 1215, p. 487499. Citeseer (1994)
- Aiguzhinov, A., Soares, C., Serra, A.P.: A similarity-based adaptation of naive bayes for label ranking. In: Discovery Science (2010)
- Azevedo, P.: CAREN-A java based apriori implementation for classification pur- poses (2003)
- Brazdil, P., Soares, C., Costa, J.: Ranking Learning Algorithms: Using IBL and Meta-Learning on Accuracy and Time Results. Machine Learning 50(3), 251-277 (2003)
- Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and im- plication rules for market basket data. Proceedings of the 1997 ACM SIGMOD in- ternational conference on Management of data -SIGMOD '97 pp. 255-264 (1997), http://portal.acm.org/citation.cfm?doid=253260.253325
- Cheng, W., Hühn, J., Hüllermeier, E.: Decision tree and instance-based learning for label ranking. In: ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning. pp. 161-168. ACM, New York, NY, USA (2009)
- Dekel, O., Manning, C., Singer, Y.: Log-linear models for label ranking. Advances in Neural Information Processing Systems 16 (2003)
- Dougherty, J., Kohavi, R., Sahami, M.: Supervised and unsupervised discretization of continuous features. In: MACHINE LEARNING-INTERNATIONAL WORK- SHOP THEN CONFERENCE-. pp. 194-202. MORGAN KAUFMANN PUBLISH- ERS, INC. (1995), http://robotics.stanford.edu/users/sahami/papers-dir/disc.pdf
- Fayyad, Irani: Multi-interval discretization of continuous-valued attributes for clas- sification learning. In: International Conference on Machine Learning. pp. 1022- 1027 (1993), http://www.cs.orst.edu/˜bulatov/papers/fayyad-discretization.pdf
- Fürnkranz, J., Hüllermeier, E.: Preference learning. KI 19(1), 60-(2005)
- Har-Peled, S., Roth, D., Zimak, D.: Constraint classification: a new approach to multiclass classification. In: Proc. of the International Workshop on Algorithmic Learning Theory (ALT). pp. 135-150. Springer-Verlag (2002)
- Kemeny, J., Snell, J.: Mathematical Models in the Social Sciences. MIT Press (1972)
- Kendall, M., Gibbons, J.: Rank correlation methods (1970)
- Lebanon, G., Lafferty, J.D.: Conditional Models on the Ranking Poset. In: NIPS. pp. 415-422 (2002)
- Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. Knowledge Discovery and Data Mining pp. 80-86 (1998), http://www.aaai.org/Library/KDD/1998/kdd98-012.php
- Park, J.S., Chen, M.S., Yu, P.S.: An effective hash-based algorithm for min- ing association rules. ACM SIGMOD Record 24(2), 175-186 (May 1995), http://portal.acm.org/citation.cfm?doid=568271.223813
- Park, J., Chen, M., Yu, P.: Efficient parallel data mining for as- sociation rules. of the fourth international conference on (1995), http://portal.acm.org/citation.cfm?id=221270.221320
- Spearman, C.: The proof and measurement of association between two things. American Journal of Psychology 15, 72-101 (1904)
- Thomas, S., Sarawagi, S.: Mining generalized association rules and sequential pat- terns using SQL queries. . Conf. on Knowledge Discovery and Data Mining (1998), http://www.aaai.org/Papers/KDD/1998/KDD98-062.pdf
- Todorovski, L., Blockeel, H., Džeroski, S.: Ranking with Predictive Clustering Trees. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) Proc. of the 13th Euro- pean Conf. on Machine Learning. pp. 444-455. No. 2430 in LNAI, Springer-Verlag (2002) 21. Vembu, S., Gärtner, T.: Label ranking algorithms: A sur- vey. Preference Learning. Springer (2009), http://www-kd.iai.uni- bonn.de/pubattachments/399/VembuG09Book.pdf
- Vembu, S., Gärtner, T.: Label Ranking Algorithms: A Survey. In: Johannes Fürnkranz, E.H. (ed.) Preference Learning. Springer-Verlag (2010)