An Overview of Web Content Mining Tools
https://doi.org/10.9756/BIJDM.8126Abstract
Web is one of the most widespread platforms for information exchange today, as it is easier to publish documents. As the number of users and providers increases, the number of documents grows, searching for information becomes a difficult and time-consuming process. Web mining uses various data mining techniques to discover useful knowledge from Web hyperlinks, page content and usage log file. The mining tools are used to scan the HTML documents, images, and text, the results is provided for the search engines.It can assist search engines in providing productive results of each search in order of their relevance. In this paper, we brief introduction to the concepts related to data mining, web mining and then an overview of different Web mining tools. We conclude by presenting a comparative table of these tools based on some pertinent criteria.
References (11)
- Kargupta, Han, Yu, Motwani, Vipin Kumar, "Next Generation of Data Mining", Chapman & Hall/CRC Data Mining and Knowledge Discovery Series, Taylor and Francis Group LLC, 2008.
- J. Han and M. Kamber., "Data Mining: Concepts and Techniques (2nd ed.)." Morgan Kaufmann, San Francisco, CA, 2006.
- J. Srivastava et al, "Foundations and advances in data mining", 2005
- Johnson, F., Gupta, S.K., "Web Content Minings Techniques: A Survey", International Journal of Computer Application. Volume 47 - No.11, Pp:44, June, 2012.
- Bharanipriya, V., Prasad, V.K., "Web Content Mining Tools: A comparative Study", International Journal of Information Technology and Knowledge Management. Vol. 4, No 1, Pp. 211-215, 2011.
- Sharma, A.K., Gupta, P.C., "Study & Analysis of Web Content Mining Tools to Improve Techniques of Web Data mining", International Journal of Advanced Research in Computer Engineering & Technology (IJARCET). Volume 1, Issue 8, October, 2012.
- Screen-scraper, http://www.screen-scraper.com Viewed 19February 2013.
- Automation Anywhere Manual.AA, http://www.automationanywhere. com Viewed 06 February 2013.
- Zhang, Q., Segall, R.S., "Web Mining: A Survey of Current Research, Techniques, and Software", International Journal of Information Technology & Decision Making. Vol.7, No. 4, Pp.683-720. World Scientific Publishing Company (2008).
- Mozenda, http://www.mozenda.com/web-mining-software Viewed 18 February 2013.
- Web Content Extractor help. WCE, http://www.newprosoft.com/web- content-extractor.htm Viewed 18 February 2013.