Data Mining in Education
2016, International Journal of Advanced Computer Science and Applications
https://doi.org/10.14569/IJACSA.2016.070659Abstract
Data mining techniques are used to extract useful knowledge from raw data. The extracted knowledge is valuable and significantly affects the decision maker. Educational data mining (EDM) is a method for extracting useful information that could potentially affect an organization. The increase of technology use in educational systems has led to the storage of large amounts of student data, which makes it important to use EDM to improve teaching and learning processes. EDM is useful in many different areas including identifying at-risk students, identifying priority learning needs for different groups of students, increasing graduation rates, effectively assessing institutional performance, maximizing campus resources, and optimizing subject curriculum renewal. This paper surveys the relevant studies in the EDM field and includes the data and methodologies used in those studies.
FAQs
AI
What role does educational data mining (EDM) play in improving student retention rates?
EDM methods have been shown to increase retention and graduation rates by identifying at-risk students, as illustrated in Saurabh Pal's study that utilized the Nave Bayes classification algorithm.
How do clustering techniques enhance understanding of educational datasets?
Clustering techniques categorize data into natural groups based on common features, providing insights into student learning behaviors as well as creating targeted interventions.
What specific education data mining methods have gained significant attention recently?
Key methods attracting attention include prediction, clustering, and relationship mining, with relationship mining historically being the most utilized in EDM research.
How does predictive modeling utilize historical data in educational outcomes?
Predictive modeling in EDM leverages historical data to forecast educational outcomes, with accuracy heavily reliant on the quality of labeled training data and selected variables.
What challenges do researchers face when applying EDM in educational environments?
Challenges include handling diverse data types from various educational systems and ensuring suitable adaptations of data mining techniques to address specific educational inquiries.
References (45)
- S.-T. Wu, "Knowledge discovery using pattern taxonomy model in text mining," 2007.
- J. Mostow and J. Beck, "Some useful tactics to modify, map and mine data from intelligent tutors," Natural Language Engineering, vol. 12, no. 02, pp. 195-208, 2006.
- S. K. Mohamad and Z. Tasir, "Educational data mining: A review," Procedia-Social and Behavioral Sciences, vol. 97, pp. 320-324, 2013.
- R. Baker et al., "Data mining for education," International encyclopedia of education, vol. 7, pp. 112-118, 2010.
- C. Romero, S. Ventura, and P. De Bra, "Knowledge discovery with genetic programming for providing feedback to courseware authors," User Modeling and User-Adapted Interaction, vol. 14, no. 5, pp. 425- 464, 2004.
- N. S. Raghavan, "Data mining in e-commerce: A survey," Sadhana, vol. 30, no. 2-3, pp. 275-289, 2005.
- C. Romero, S. Ventura, M. Pechenizkiy, and R. S. Baker, Handbook of educational data mining. CRC Press, 2010.
- R. S. Baker and K. Yacef, "The state of educational data mining in 2009: A review and future visions," JEDM-Journal of Educational Data Mining, vol. 1, no. 1, pp. 3-17, 2009.
- F. Castro, A. Vellido, À. Nebot, and F. Mugica, "Applying data min- ing techniques to e-learning problems," in Evolution of teaching and learning paradigms in intelligent environment, pp. 183-221, Springer, 2007.
- U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, "The kdd process for extracting useful knowledge from volumes of data," Communications of the ACM, vol. 39, no. 11, pp. 27-34, 1996.
- T. Barnes, M. Desmarais, C. Romero, and S. Ventura, in Proc. Educa- tional Data Mining 2009:2nd International Conf, 2009.
- S. L. Tanimoto, "Improving the prospects for educational data mining," in Track on Educational Data Mining, at the Workshop on Data Mining for User Modeling, at the 11th International Conference on User Modeling, pp. 1-6, 2007.
- J. E. Beck and J. Mostow, "How who should practice: Using learning decomposition to evaluate the efficacy of different types of practice for different types of students," in Intelligent tutoring systems, pp. 353-362, Springer, 2008.
- M. Cocea, A. Hershkovitz, and R. S. Baker, "The impact of off-task and gaming behaviors on learning: immediate or aggregate?," 2009.
- H. Jeong and G. Biswas, "Mining student behavior models in learning- by-teaching environments.," in EDM, pp. 127-136, Citeseer, 2008.
- V. J. Hodge and J. Austin, "A survey of outlier detection methodologies," Artificial Intelligence Review, vol. 22, no. 2, pp. 85-126, 2004.
- C. C. Chan, "A framework for assessing usage of web-based e-learning systems," in Innovative Computing, Information and Control, 2007. ICICIC '07. Second International Conference on, pp. 147-147, Sept 2007.
- M. Muehlenbrock, "Automatic action analysis in an interactive learning environment," in Proceedings of the 12 th International Conference on Artificial Intelligence in Education, pp. 73-80, 2005.
- M. Ueno, "Data mining and text mining technologies for collaborative learning in an ilms" ssamurai"," in Advanced Learning Technologies, 2004. Proceedings. IEEE International Conference on, pp. 1052-1053, IEEE, 2004.
- L. P. Dringus and T. Ellis, "Using data mining as a strategy for assessing asynchronous discussion forums," Computers & Education, vol. 45, no. 1, pp. 141-160, 2005.
- J. Chen, Q. Li, L. Wang, and W. Jia, "Automatically generating an e- textbook on the web," in Advances in Web-Based Learning-ICWL 2004, pp. 35-42, Springer, 2004.
- J. Tane, C. Schmitz, and G. Stumme, "Semantic resource management for the web: an e-learning application," in Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters, pp. 1-10, ACM, 2004.
- C. Tang, R. W. Lau, Q. Li, H. Yin, T. Li, and D. Kilis, "Personalized courseware construction based on web data mining," in Web Informa- tion Systems Engineering, 2000. Proceedings of the First International Conference on, vol. 2, pp. 204-211, IEEE, 2000.
- J. Scott, Social network analysis. Sage, 2012.
- P. Reyes and P. Tchounikine, "Mining learning groups' activities in forum-type tools," in Proceedings of th 2005 conference on Computer support for collaborative learning: learning 2005: the next 10 years!, pp. 509-513, International Society of the Learning Sciences, 2005.
- C. Romero, S. Ventura, P. G. Espejo, and C. Hervás, "Data mining algorithms to classify students.," in EDM, pp. 8-17, 2008.
- A. P. Bradley, "The use of the area under the roc curve in the evaluation of machine learning algorithms," Pattern recognition, vol. 30, no. 7, pp. 1145-1159, 1997.
- B. Liu, Web data mining: exploring hyperlinks, contents, and usage data. Springer Science & Business Media, 2007.
- C. R. Beal, L. Qu, and H. Lee, "Classifying learner engagement through integration of multiple data sources," in Proceedings of the National Conference on Artificial Intelligence, vol. 21, p. 151, Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999, 2006.
- S. Amershi and C. Conati, "Automatic recognition of learner groups in exploratory learning environments," in Intelligent Tutoring Systems, pp. 463-472, Springer, 2006.
- A. Merceron and K. Yacef, "Interestingness measures for associations rules in educational data.," EDM, vol. 8, pp. 57-66, 2008.
- C. Wallace, K. B. Korb, and H. Dai, "Causal discovery via mml," in ICML, vol. 96, pp. 516-524, Citeseer, 1996.
- A. Hershkovitz and R. Nachmias, "Developing a log-based motivation measuring tool.," in EDM, pp. 226-233, Citeseer, 2008.
- J. Kay, N. Maisonneuve, K. Yacef, and P. Reimann, "The big five and visualisations of team work activity," in Intelligent tutoring systems, pp. 197-206, Springer, 2006.
- R. S. d Baker, A. T. Corbett, and V. Aleven, "More accurate student modeling through contextual estimation of slip and guess probabilities in bayesian knowledge tracing," in Intelligent Tutoring Systems, pp. 406- 415, Springer, 2008.
- A. T. Corbett and J. R. Anderson, "Knowledge tracing: Modeling the acquisition of procedural knowledge," User modeling and user-adapted interaction, vol. 4, no. 4, pp. 253-278, 1994.
- R. Baker et al., "Data mining for education," International encyclopedia of education, vol. 7, pp. 112-118, 2010.
- C. Romero, S. Ventura, and P. De Bra, "Knowledge discovery with genetic programming for providing feedback to courseware authors," User Modeling and User-Adapted Interaction, vol. 14, no. 5, pp. 425- 464, 2004.
- N. S. Raghavan, "Data mining in e-commerce: A survey," Sadhana, vol. 30, no. 2-3, pp. 275-289, 2005.
- C. Romero, S. Ventura, M. Pechenizkiy, and R. S. Baker, Handbook of educational data mining. CRC Press, 2010.
- M. Hanna, "Data mining in the e-learning domain," Campus-wide information systems, vol. 21, no. 1, pp. 29-34, 2004.
- C. Romero and S. Ventura, "Educational data mining: a review of the state of the art," Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, vol. 40, no. 6, pp. 601-618, 2010.
- K. Alexandros and E. Georgios, "A framework for recording, monitoring and analyzing learner behavior while watching and interacting with online educational videos," in Advanced Learning Technologies (ICALT), 2013 IEEE 13th International Conference on, pp. 20-22, IEEE, 2013.
- S. Pal, "Mining educational data using classification to decrease dropout rate of students," arXiv preprint arXiv:1206.3078, 2012.
- L. Dadkhahan and M. A. Al Azmeh, "Critical appraisal of data mining as an approach to improve student retention rate," International Journal of Engineering and Innovative Technology (IJEIT) Volume, vol. 2.