SESSION IDENTIFICATION IN WEB USAGE MINING TO PERSONALIZE THE WEB
Abstract
—The World Wide Web becomes very popular and interactive for transferring of Information. Web usage mining is the area of data mining which deals with the discovery and analysis of usage patterns from Web data, specifically web logs, in order to improve web based applications like E-Commerce. Web site owners have trouble identifying customer purchasing patterns from their Web logs because the two aren't directly related. The technique described here can help E-Commerce providers to identify likely customers even with a small data set. Every request made by the users is recorded and saved as web access log file. However, the data stored in the log files does not specify accurate details of the users' accesses to the Web site. So, preprocessing of the Web log data is first and important phase before web log file can be applied for pattern analysis & pattern discovery in E-Commerce industry. The preprocessed Web Log file can then be suitable for the discovery and analysis of useful information referred to as access patterns of the potential buyers. This project gives detailed description of how pre-processing is done on E-Commerce web log file taken from different users and a Classification algorithm is implemented to find statistical information of the user's behavior to predict the potential buyer of a particular E-Commerce website.
References (13)
- Public Security Department of Jiangsu Province in China and Published by the IEEE Computer Society in 2010
- Jeffrey Xu Yu, Yuming Ou,Chengqi and Zhang "Identifying Interesting Visitors through Web Log Classification" Published by the IEEE Computer Society in May/June 2005.
- Fang Yuankang , Huang Zhiqiu" A Session Identification Algorithm Based on Frame Page and Page threshold" Published by the IEEE Computer Society in 2010.
- He Xinhua and Wang Qiong "Dynamic Timeout-Based A Session Identification Algorithm" Published by the IEEE Computer Society in 2011.
- Tasawar Hussain, Sohail Asghar and Nayyer Masood "Hierarchical Sessionization at Preprocessing Level of WUM Based on Swarm Intelligence" IEEE 6th International Conference on Emerging Technologies , 2010.
- Ramya C, Dr. Shreedhara K S and Kavitha G "Preprocessing: A Prerequisite for Discovering Patterns in Web Usage Mining Process" IEEE International Conference on Communication and Electronics Information , 2011.
- K. Sudheer Reddy, G. Partha Saradhi Varma, and M. Kantha Reddy "An Effective Preprocessing Method for Web Usage Mining" International Journal of Computer Theory and Engineering, Vol. 6, No. 5, October 2014.
- Priyanka Patel and Mitixa Parmar " A Review on User Session Identification through Web Server Log" International Journal of Computer Science and Information Technology,Volume5,Issue1,2014.
- D. Tanasa and B. Trousse, "Data preprocessing for wum," Intelligent Systems, IEEE, vol. 23, no. 3, pp. 22-25, 2004.
- L. Niu et al., "Product Hierarchy-Based Customer Profiles for Electronic Commerce Recommendation," Asian J. Information Technology, vol. 4, no. 1 pp. 18-24, 2003.
- Q. Yang and H. Zhang, "Web-Log Mining for Predictive Caching," IEEE Trans. Knowledge and Data Eng., vol. 15, no. 4, pp.1050-1053, 2003.
- M. G. Elfeky, W. G. Aref, and A. K. Elmagarmid. "Using convolution to mine obscure periodic patterns in one pass," In E. Bertino, S. Christodoulakis, D. Plexousakis, V. Christophides, M. Koubarakis, K. B¨ohm, and E. Ferrari, editors, Proceedings of the 9th International Conference on Extending Database Technology (EDBT 04), Springer pp. 605-620, 2004.
- G. Casas-Garriga. "Summarizing sequential data with closed partial orders," In H. Kargupta, J. Srivastava, C. Kamath, and A. Goodman, editors, Proceedings of the 5th SIAM International Conference on Data Mining (SDM 05), SIAM pp. 380-391, 2005.