Academia.eduAcademia.edu

Outline

Improving Collaborative Filtering Recommendation

2018

https://doi.org/10.2991/CAAI-18.2018.25

Abstract

Collaborative filtering recommender systems traditionally recommend products to users solely based on the user-item rating matrix and are simple, convenient to use. In this paper, we focus on two main issues, data sparsity and scalability. Data sparsity can lead to inaccurate recommendations, while scalability may cause an unacceptably long delay before valuable recommendations are acquired. We propose a novel approach to deal with these two issues. Word2Vec is employed to build item vectors from the user comments. Through the user-item rating matrix, user vectors of all the users are then obtained. A clustering technique is applied to reduce the time complexity related to the large numbers of items and users. Experimental results of real data sets are shown to demonstrate the effectiveness of our proposed approach. Keywords—data sparsity, scalability, Word2Vec, selfconstructing clustering, word vectors

References (10)

  1. M. Allahbakhsh and A. Ignjatovic. An iterative method for calculating robust rating scores. IEEE Transections on Parallel and Distributed Systems, 26(2):340-350, 2015.
  2. S. Zahra, M. Ghazanfar, A. Khalid, M. Azam, U. Naeem and A. bennett. Novel centroid selection approaches for KMeans-clustering based recommender systems. Information sciences, 320:156-189, 2015.
  3. S.Debnath, N.Ganguly and P.Mitra.Feature weighting in content based recommendation system using social network analysis. Proceedings of the 17th international conference on World Wide Web, ACM, pp. 1041-1042, 2018.
  4. Y-J. Park, The adaptive clustering method for the long tail problem of recommender systems, IEEE Transections on knowledge and data engineering, 25(8):1904-1915, 2013.
  5. D. Zhang, C-H. Hsu, M. Chen, Q. Chen,N. Xiong and J. Lloret, Cold-start recommendation using bi-clustering and fusion for large-scale social recommender systems, IEEE Transections on Emerging Topics in Computing, 2(2):239-250, 2014.
  6. T.Mikolov, I.Sutskever, K.Chen, G.S.Corrado and J.Dean. Distributed representations of words and phrases and their compositionality. Pro- ceedings of the 26th International Conference on Neural Information Processing Systems, Vol. 2, pp. 3111-3119, 2013.
  7. S. Wold, K. Esbensen and P. Geladi. Principal component analysis. Chemometrics and intelligent laboratory systems, 2(1-3):37-52, 1987.
  8. S.-J. Lee and C.-S. Ouyang. A neuro-fuzzy system modeling with self- constructing rule generation and hybrid SVD-based learning. IEEE Transactions on Fuzzy Systems, 11(3):341-353, 2003.
  9. M. Wan, M. and J. McAuley. Modeling ambiguity, subjectivity, and diverging viewpoints in opinion question answering systems. Proceed- ings of the 2016 IEEE 16th international conference on Data Mining (ICDM), IEEE, pp. 489-498, 2016.
  10. C.-L. and S.-L. Lee. A clustering based approach to improving the efficiency of collaborative filtering recommendation. em Electronic Commerce Research and Applications, 18:1-9, 2016.