Improved FCM Algorithm for Clustering on Web Usage Mining
2011, 2011 International Conference on Computer and Management (CAMAN)
https://doi.org/10.1109/CAMAN.2011.5778781Abstract
In this paper we present clustering method is very sensitive to the initial center values ,requirements on the data set too high, and cannot handle noisy data the proposal method is using information entropy to initialize the cluster centers and introduce weighting parameters to adjust the location of cluster centers and noise problems. The navigation datasets which are sequential in nature, Clustering web data is finding the groups which share common interests and behavior by analyzing the data collected in the web servers, this improves clustering on web data efficiently using improved fuzzy c-means(FCM) clustering. Web usage mining is the application of data mining techniques to web log data repositories. It is used in finding the user access patterns from web access log. Web data Clusters are formed using on MSNBC web navigation dataset.
References (8)
- J. Srivastava, R. Cooley, M. Deshpande, PN. Tan, Web usage mining: discovery and applications of usage patterns from web data. SIGKDD Explorations, Vol. 1, No. 2, 2000, pp.12-23.
- M. N. Garofalakis, R. Rastogi, S. Seshadri, K. Shim Data minino and the web: past, present and future, In Proc. of the second international workshop on webinformation and data management, ACM, 1999.
- A. K. Jain and R. C. Dubes, "Data clustering: A review.," ACM Computing Surveys, vol. 31, 1999.
- U. Maulik and S. Bandyopadhyay, "Genetic algorithm based clustering technique," Pattern Recognition, vol. 33, pp. 1455-1465, 2000.
- P. Zhang, X. Wang, and P. X. Song, "Clustering categorical data based on distance vectors," The Journal of the American Statistical Association, vol. 101,no. 473, pp. 355-367, 2006.
- A. Vakali, J. Pokorný and T. Dalamagas, An Overview of Web Data Clustering Practices, EDBT Workshops, 2004, pp. 597-606.
- Lin Zhu, Fu-Lai Chung, Shitong Wang.Generalized Fuzzy C-Means Clustering Algorithm With Improved Fuzzy Partitions[J].IEEE Transactions on Systerms,2009:39-3.
- Cheul Hwang,Frank Chung-Hoon Rhee.Uncertain Fuzzy Clustering:Interval Type-2 Fuzzy Approach to C-Means[J]. IEEE Transcations on Fuzzy Systerms,2007:15-1.