CCRank: Parallel Learning to Rank with Cooperative Coevolution
2016
Abstract
We propose CCRank, the first parallel algorithm for learning to rank, targeting simultaneous improvement in learning accuracy and efficiency. CCRank is based on cooperative coevolution (CC), a divide-and-conquer framework that has demonstrated high promise in func-tion optimization for problems with large search space and complex structures. Moreover, CC naturally allows parallelization of sub-solutions to the decomposed sub-problems, which can substantially boost learning effi-ciency. With CCRank, we investigate parallel CC in the context of learning to rank. Extensive experiments on benchmarks in comparison with the state-of-the-art al-gorithms show that CCRank gains in both accuracy and efficiency. 1
References (22)
- Cao, Z.; Qin, T.; Liu, T.-Y.; Tsai, M.-F.; and Li, H. 2007. Learning to rank: from pairwise approach to listwise ap- proach. In Proc. of ICML'07.
- Cao, B.; Shen, D.; Wang, K.; and Yang, Q. 2010. Click- through log analysis by collaborative ranking. In Proc. of AAAI'10.
- Chu, C. T.; Kim, S. K.; Lin, Y. A.; Yu, Y.; Bradski, G. R.; Ng, A. Y.; and Olukotun, K. 2006. Map-reduce for machine learning on multicore. In Proc. of NIPS'06.
- Collobert, R.; Bengio, Y.; and Bengio, S. 2004. A parallel mixture of svms for very large scale problems. In Proc. of NIPS'04.
- de Almeida, H. M.; Gonc ¸alves, M. A.; Cristo, M.; and Cal- ado, P. 2007. A combined component approach for finding collection-adapted ranking functions based on genetic pro- gramming. In Proc. of SIGIR'07.
- Fan, W.; Gordon, M. D.; and Pathak, P. 2004. Discovery of context-specific ranking functions for effective information retrieval using genetic programming. IEEE Trans. Knowl. Data Eng. 16(4):523-527.
- Freund, Y.; Iyer, R.; Schapire, R. E.; and Singer, Y. 2003. An efficient boosting algorithm for combining preferences. J. Mach. Learning Res. 4(1):933-969.
- Graf, H. P.; Cosatto, E.; Bottou, L.; Durdanovic, I.; and Vap- nik, V. 2004. Parallel support vector machines: The cascade SVM. In Proc. of NIPS'04.
- Hoi, S. C. H., and Jin, R. 2008. Semi-supervised ensemble ranking. In Proc. of AAAI'08.
- Järvelin, K., and Kekäläinen, J. 2002. Cumulated gain- based evaluation of IR techniques. ACM Trans. Inf. Sys. 20(4):422-446.
- Joachims, T. 2002. Optimizing search engines using click- through data. In Proc. of KDD'02.
- Joachims, T. 2006. Training linear SVMs in linear time. In Proc. of KDD'06.
- Li, X., and Yao, X. 2009. Tackling high dimensional non- separable optimization problems by cooperatively coevolv- ing particle swarms. In Proc. of CEC'09.
- Liu, T.-Y. 2009. Learning to rank for information retrieval. Found. Trends Inf. Retr. 3(3):225-331.
- Musilek, P.; Lau, A.; Reformat, M.; and Wyard-Scott, L. 2006. Immune programming. Inf. Sci. 176(8):972-1002.
- Phil Husbands, F. M. 1991. Simulated co-evolution as the mechanism for emergent planning and scheduling. In Proc. of GA'91.
- Potter, M. A., and Jong, K. A. D. 1994. A cooperative coevolutionary approach to function optimization. In Proc. of PPSN'94.
- Robertson, S. E. 1997. Overview of the okapi projects. J. Doc. 53(1):3-7.
- Wang, S.; Ma, J.; and Liu, J. 2009. Learning to rank using evolutionary computation: Immune programming or genetic programming? In Proc. of CIKM'09.
- Wiegand, R. P. 2004. An Analysis of Cooperative Coevolu- tionary Algorithms. Ph.D. Dissertation, George Mason Uni- versity, Fairfax, VA, USA.
- Xu, J., and Li, H. 2007. AdaRank: a boosting algorithm for information retrieval. In Proc. of SIGIR'07.
- Yang, Z.; Zhang, J.; Tang, K.; Yao, X.; and Sanderson, A. C. 2009. An adaptive coevolutionary differential evolution al- gorithm for large-scale optimization. In Proc. of CEC'09. Zhai, C., and Lafferty, J. 2001. A study of smoothing meth- ods for language models applied to Ad Hoc information re- trieval. In Proc. of SIGIR'01.