A Topic Modeling Approach to Ranking
2015, International Conference on Artificial Intelligence and Statistics
Abstract
We propose a topic modeling approach to the prediction of preferences in pairwise comparisons. We develop a new generative model for pairwise comparisons that accounts for multiple shared latent rankings that are prevalent in a population of users. This new model also captures inconsistent user behavior in a natural way. We show how the estimation of latent rankings in the new generative model can be formally reduced to the estimation of topics in a statistically equivalent topic modeling problem. We leverage recent advances in the topic modeling literature to develop an algorithm that can learn shared latent rankings with provable consistency as well as sample and computational complexity guarantees. We demonstrate that the new approach is empirically competitive with current state-of-the-art approaches in predicting preferences on some semi-synthetic and real-world datasets.
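The abstract does not spell out the generative model, so the following is only an illustrative sketch of one plausible reading, not the paper's specification. It assumes that each user holds mixture weights over a small set of shared latent rankings, that a latent ranking is drawn afresh for every comparison, and that the outcome follows that ranking. Under the same assumptions, a user plays the role of a document, an ordered pair (winner, loser) plays the role of a word, and a latent ranking plays the role of a topic. All names here (sample_comparisons, to_document, the example weights) are hypothetical.

```python
import random
from collections import Counter

def sample_comparisons(rankings, user_weights, pairs, rng):
    """Draw one comparison outcome per pair for a single user.

    rankings: list of K permutations (lists of item ids, most preferred first).
    user_weights: K non-negative mixture weights for this user (assumed form).
    pairs: list of item pairs (i, j) presented to the user.
    Returns a list of ordered pairs (winner, loser).
    """
    # Position lookup per ranking: a smaller position means more preferred.
    positions = [{item: pos for pos, item in enumerate(r)} for r in rankings]
    outcomes = []
    for i, j in pairs:
        # Assumption: a fresh latent ranking is drawn per comparison, so one
        # user can give conflicting answers across comparisons.
        k = rng.choices(range(len(rankings)), weights=user_weights)[0]
        pos = positions[k]
        outcomes.append((i, j) if pos[i] < pos[j] else (j, i))
    return outcomes

def to_document(outcomes):
    """Bag-of-words view: each ordered pair (winner, loser) counts as a word."""
    return Counter(outcomes)

rng = random.Random(0)
rankings = [[0, 1, 2], [2, 1, 0]]      # two latent rankings over three items
pairs = [(0, 1), (1, 2), (0, 2)] * 4   # comparisons presented to one user
doc = to_document(sample_comparisons(rankings, [0.7, 0.3], pairs, rng))
print(doc)
```

Stacking one such count vector per user gives a user-by-ordered-pair matrix, which is the shape of input that topic modeling algorithms operate on. Estimating the rankings from that matrix is the part the paper addresses and is not attempted here.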