Academia.eduAcademia.edu

Outline

Web Mining: An Application of Data Mining

2020, Web Mining: An Application of Data Mining

Abstract

The World Wide Web is the largest, popular and most widely used information source, which is increasing day by day. The information over the web is available in the form of web pages, the content of web pages may include texts, images, audios, videos, lists, charts, tables, hyperlinks etc. Web page structure, users' navigation on the web sites and server logs also provides useful information. To extract meaningful information from the web and discover knowledge, web mining techniques are used. Web mining uses data mining techniques to extract knowledge from web. This paper presents the study of web mining and its types.

Key takeaways
sparkles

AI

  1. Web mining employs data mining techniques to extract knowledge from the vast information available on the web.
  2. Web structure mining analyzes hyperlinks to discover knowledge and rank web pages using algorithms like PageRank and HITS.
  3. Web usage mining analyzes user interaction data to identify access patterns and behaviors on websites.
  4. Key data sources for web usage mining include server logs, user sessions, and browser interactions.
  5. This paper presents an overview of web mining and its various types, emphasizing their significance in knowledge discovery.

References (15)

  1. Claus Pahl and Dave Donnellan, "Data Mining Technology for the Evaluation of Web-based Teaching and Learning Systems", 7th Int. Conference on E- Learning in Business, Government and Higher Education, October 2002.
  2. K. Sharma, G. Shrivastava and V. Kumar, "Web Mining: Today and Tomorrow", IEEE 3rd International Conference on Electronics Computer Technology (ICECT), pp. 399-403 2011.
  3. Anurag Kumar and Ravi Kumar Singh, "Web Mining Overview, Techniques, Tools and Applications: A Survey," International Research Journal of Engineering and Technology (IRJET), vol. 03, no. 12, pp. 1543-1547, December 2016.
  4. P. M. Bharti and T. J. Raval, "Improving Web Page Access Prediction using Web Usage Mining and Web Content Mining", 3rd International conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 2019, pp. 1268-1273.
  5. Anurag kumar and Kumar Ravi Singh, "A Study on Web Content Mining," International Journal Of Engineering And Computer Science, vol. 6, no. 1, pp. 20003-20006, January 2017.
  6. J. M. Kleinberg, "Authoritative sources in a hyperlinked environment", In Proc. of ACM-SIAM Symposium on Discrete Algorithms, pages 668-677, 1998.
  7. R. Cooley, B. Mobasher, J. Srivastava, "Web mining: information and pattern discovery on the World Wide Web", In Proceedings of Ninth IEEE International Conference. pp. 558 -567, 3-8 Nov. 1997.
  8. Inamdar SA and shinde GN, "An Agent Based Intelligent Search Engine System for Web Mining", International Journal on Computer Science and Engineering, Vol. 02, No. 03, pp 759-763, 2010.
  9. Johnson, F., Gupta, S.K., "Web Content Minings Techniques: A Survey," International Journal of Computer Application. Volume 47 -No.11, p44, June (2012).
  10. Suvarn Sharma and Amit Bhagat, "Data preprocessing algorithm for Web Structure Mining", Fifth International Conference on Eco-friendly Computing and Communication Systems (ICECCS), IEEE, 2016.
  11. Yeqing Li, "Research on Technology, Algorithm and Application of Web Mining", 2017
  12. Monika Dhandi and Rajesh Kumar Chakrawarti, "A Comprehensive Study of Web Usage Mining", Symposium on Colossal Data Analysis and Networking (CDAN), IEEE, Indore, 2016, pp. 1-5, 2016.
  13. S. P. Nina, M. Rahman, K. I. Bhuiyan and K. E. U. Ahmed, "Pattern Discovery of Web Usage Mining," 2009 International Conference on Computer Technology and Development, Kota Kinabalu, IEEE, pp. 499-503, 2009.
  14. L. H. Suadaa, "A survey on web usage mining techniques and applications," 2014 International Conference on Information Technology Systems and Innovation (ICITSI), IEEE, Bandung, pp. 39-43, 2014.
  15. Facca, Federico & Lanzi, Pier Luca, "Recent Developments in Web Usage Mining Research," Vol. 2737, pp 140-150, 2003.