Learning to Rank Research Papers

Visual Comparison of Images Using Multiple Kernel Learning for Ranking

2025, Procedings of the British Machine Vision Conference 2015

Ranking is the central problem for many applications such as web search, recommendation systems, and visual comparison of images. In this paper, the multiple kernel learning framework is generalized for the learning to rank problem. This... more

descriptionView Paper arrow_downwardDownload

An Intelligent Surfer Model Based on Combining Web Contents and Links

by Bouchra FRIKH

2025

The PageRank algorithm is an iterative algorithm used in the Google search engine to improve the results of requests by taking into account the link structure of the web. More interesting and intelligent surfer model combining the link... more

descriptionView Paper arrow_downwardDownload

An Intelligent Surfer Model Based on Combining Web Contents and Links

by Bouchra FRIKH

2025

The PageRank algorithm is an iterative algorithm used in the Google search engine to improve the results of requests by taking into account the link structure of the web. More interesting and intelligent surfer model combining the link... more

descriptionView Paper arrow_downwardDownload

QU-IR at SemEval 2016 Task 3: Learning to Rank on Arabic Community Question Answering Forums with Word Embedding

by Rana Malhas

2025, Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)

Resorting to community question answering (CQA) websites for finding answers has gained momentum in the past decade with the explosive rate at which social media has been proliferating. With many questions left unanswered on those... more

descriptionView Paper arrow_downwardDownload

Generalized rank-breaking: computational and statistical tradeoffs

by Sewoong Oh

2025, Journal of Machine Learning Research

For massive and heterogeneous modern datasets, it is of fundamental interest to provide guarantees on the accuracy of estimation when computational resources are limited. In the application of rank aggregation, for the Plackett-Luce... more

descriptionView Paper arrow_downwardDownload

Computational and Statistical Tradeoffs in Learning to Rank

by Sewoong Oh

2025, Neural Information Processing Systems

For massive and heterogeneous modern datasets, it is of fundamental interest to provide guarantees on the accuracy of estimation when computational resources are limited. In the application of learning to rank, we provide a hierarchy of... more

descriptionView Paper arrow_downwardDownload

Generalized Rank-Breaking: Computational and Statistical Tradeoffs

by Sewoong Oh

2025, J. Mach. Learn. Res.

For massive and heterogeneous modern datasets, it is of fundamental interest to provide guarantees on the accuracy of estimation when computational resources are limited. In the application of rank aggregation, for the Plackett-Luce... more

descriptionView Paper arrow_downwardDownload

Computational and Statistical Tradeoffs in Learning to Rank

by Sewoong Oh

2025

For massive and heterogeneous modern datasets, it is of fundamental interest to provide guarantees on the accuracy of estimation when computational resources are limited. In the application of learning to rank, we provide a hierarchy of... more

descriptionView Paper arrow_downwardDownload

Can Large Language Models Generate Effective Datasets for Emotion Recognition in Conversations?

by Stefan Wermter and

2025, Procedia Computer Science

Emotion recognition in conversations (ERC) focuses on identifying emotion shifts within interactions, representing a significant step toward advancing machine intelligence. However, ERC data remains scarce, and existing datasets face... more

descriptionView Paper arrow_downwardDownload

On equivalence relationships between classification and ranking algorithms

by Seyda Ertekin

2025, MIT Press eBooks

We demonstrate that there are machine learning algorithms that can achieve success for two separate tasks simultaneously, namely the tasks of classification and bipartite ranking. This means that advantages gained from solving one task... more

descriptionView Paper arrow_downwardDownload

Automated Collection of Evaluation Dataset for Semantic Search in Low-Resource Domain Language

by Anastasia Zhukova

2025, Proceedings of the First Workshop on Language Models for Low-Resource Languages

Domain-specific languages that use a lot of specific terminology often fall into the category of low-resource languages. Collecting test datasets in a narrow domain is time-consuming and requires skilled human resources with domain... more

descriptionView Paper arrow_downwardDownload

Online Learning with Low Rank Experts

by Elad Hazan

2025, arXiv (Cornell University)

We consider the problem of prediction with expert advice when the losses of the experts have low-dimensional structure: they are restricted to an unknown d-dimensional subspace. We devise algorithms with regret bounds that are independent... more

descriptionView Paper arrow_downwardDownload

Rank aggregation methods for the Web

by Ravi Kumar

2025, Proceedings of the 10th international conference on World Wide Web

We consider the problem of combining ranking results from various sources. In the context of the Web, the main applications include building meta-search engines, combining ranking functions, selecting documents based on multiple criteria,... more

descriptionView Paper arrow_downwardDownload

Generalized BROOF-L2R

by Marcos Goncalves

2025, Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

The task of retrieving information that really matters to the users is considered hard when taking into consideration the current and increasingly amount of available information. To improve the effectiveness of this information seeking... more

descriptionView Paper arrow_downwardDownload

Incremental learning to rank with partially-labeled data

by Seungjin Choi

2025, Proceedings of the 2009 workshop on Web Search Click Data

In this paper we present a semi-supervised learning method for a problem of learning to rank where we exploit Markov random walks and graph regularization in order to incorporate not only "labeled" web pages but also plenty of "unlabeled"... more

descriptionView Paper arrow_downwardDownload

Risk-Sensitive Deep Neural Learning to Rank

by Daniel Xavier de Sousa

2025, Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

Learning to Rank (L2R) is the core task of many Information Retrieval systems. Recently, a great effort has been put on exploring Deep Neural Networks (DNNs) for L2R, with significant results. However, risk-sensitiveness, an important and... more

Learning to Rank (L2R) is the core task of many Information Retrieval systems. Recently, a great effort has been put on exploring Deep Neural Networks (DNNs) for L2R, with significant results. However, risk-sensitiveness, an important and recent advance in the L2R arena, that reduces variability and increases trust, has not been incorporated into Deep Neural L2R yet. Risk-sensitive measures are important to assess the risk of an IR system to perform worse than a set of baseline IR systems for several queries. However, the risk-sensitive measures described in the literature have a non-smooth behavior, making them difficult, if not impossible, to be optimized by DNNs. In this work we solve this difficult problem by proposing a family of new loss functions -RiskLoss -that support a smooth risk-sensitive optimization. RiskLoss introduces two important contributions: (i) the substitution of the traditional NDCG or MAP metrics in risk-sensitive measures with smooth loss functions that evaluate the correlation between the predicted and the true relevance order of documents for a given query and (ii) the use of distinct versions of the same DNN architecture as baselines by means of a multi-dropout technique during the smooth risk-sensitive optimization, avoiding the inconvenience of assessing multiple IR systems as part of DNN training. We empirically demonstrate significant achievements of the proposed RiskLoss functions when used with recent DNN methods in the context of well-known web-search datasets such as WEB10K, YAHOO, and MQ2007. Our solutions reach improvements of 8% in effectiveness (NDCG) while improving in around 5% the risk-sensitiveness (GeoRisk measure) when applied together with a state-of-the-art Self-Attention DNN-L2R architecture. Furthermore, RiskLoss is capable of reducing by 28% the losses over the best evaluated baselines and significantly improving over the risk-sensitive state-of-the-art non-DNN method (by up to 13.3%) while keeping (or even increasing) overall effectiveness. All these results ultimately establish a new level for the state-of-the-art on risk-sensitiveness and DNN-L2R research.

descriptionView Paper arrow_downwardDownload

Risk-Sensitive Learning to Rank with Evolutionary Multi-Objective Feature Selection

by Daniel Xavier de Sousa

2025, ACM Transactions on Information Systems

Learning to Rank (L2R) is one of the main research lines in Information Retrieval. Risk-sensitive L2R is a sub-area of L2R that tries to learn models that are good on average while at the same time reducing the risk of performing poorly... more

descriptionView Paper arrow_downwardDownload

Learning in unlabeled networks – An active learning and inference approach

by Przemysław Kazienko

2025, AI Communications

The task of determining labels of all network nodes based on the knowledge about network structure and labels of some training subset of nodes is called the within-network classification. It may happen that none of the labels of the nodes... more

The task of determining labels of all network nodes based on the knowledge about network structure and labels of some training subset of nodes is called the within-network classification. It may happen that none of the labels of the nodes is known and additionally there is no information about number of classes (types of labels) to which nodes can be assigned. In such a case a subset of nodes has to be selected for initial label acquisition. The question that arises is: "labels of which nodes should be collected and used for learning in order to provide the best classification accuracy for the whole network?". Active learning and inference is a practical framework to study this problem. In this paper, set of methods for active learning and inference for within-network classification is proposed and validated. The utility score calculation for each node based on network structure is the first step in the entire process. The scores enable to rank the nodes. Based on the created ranking, a set of nodes, for which the labels are acquired, is selected (e.g. by taking top or bottom N from the ranking). The new measure-neighbour methods proposed in the paper suggest not obtaining labels of nodes from the ranking but rather acquiring labels of their neighbours. The paper examines 29 distinct formulations of utility score and selection methods reporting their impact on the results of two collective classification algorithms: Iterative Classification Algorithm (ICA) and Loopy Belief Prorogation (LBP). We advocate that the accuracy of presented methods depends on the structural properties of the examined network. We claim that measure-neighbour methods will work better than the regular methods for networks with higher clustering coefficient and worse than regular methods for networks with low clustering coefficient. According to our hypothesis, based on clustering coefficient of a network we are able to recommend appropriate active learning and inference method. Experimental studies were carried out on six real-world networks. In order to investigate our hypothesis, all analysed networks were categorized based on their structural characteristics into three groups. In addition, the representativeness of initial set of nodes for which the labels are obtained and its influence on classification accuracy was examined.

descriptionView Paper arrow_downwardDownload

Exploiting Image Content in Location-Based Shopping Recommender Systems for Mobile Users

by Sunday Ojo

2025, International Journal of Information Technology & Decision Making

This paper demonstrates how image content can be used to realize a location-based shopping recommender system for intuitively supporting mobile users in decision making. Generic Fourier Descriptors (GFD) image content of an item was... more

descriptionView Paper arrow_downwardDownload

Beyond Convexity: Online Submodular Minimization

by Elad Hazan

2025, Neural Information Processing Systems

We consider an online decision problem over a discrete space in which the loss function is submodular. We give algorithms which are computationally efficient and are Hannan-consistent in both the full information and bandit settings.

descriptionView Paper arrow_downwardDownload

Incremental Refinement of Relevance Rankings: Introducing a New Method Supported with Pennant Retrieval

by Müge Akbulut

2025, Türk Kütüphaneciliği

Relevance ranking algorithms rank retrieved documents based on the degrees of topical similarity (relevance) between search queries and documents. This paper aims to introduce a new relevance ranking method combining a probabilistic topic... more

descriptionView Paper arrow_downwardDownload

Learning to Rank with Deep Autoencoder Features

by Adriano Veloso

2025

Learning to rank in Information Retrieval is the problem of learning the full order of a set of documents from their partially observed order. Datasets used by learning to rank algorithms are growing enormously in terms of number of... more

descriptionView Paper arrow_downwardDownload

A systematic review on page ranking algorithms

by DIVAKAR YADAV

2025, International journal of information technology

Search engines are very useful tool now a days to fulfill the information need of a user. The performance of search engine mainly depends on page ranking algorithm which provides highly relevant web pages at the top of the search result.

descriptionView Paper arrow_downwardDownload

An Improved Approach to Ranking Web Documents

by DIVAKAR YADAV

2025, Journal of Information Processing Systems

Ranking thousands of web documents so that they are matched in response to a user query is really a challenging task. For this purpose, search engines use different ranking mechanisms on apparently related resultant web documents to... more

descriptionView Paper arrow_downwardDownload

Ranking the Online Documents Based on Relative Credibility Measures

by Ahmad Dahlan

2025, ITB Journal of Information and Communication Technology

Information searching is the most popular activity in Internet. Usually the search engine provides the search results ranked by the relevance. However, for a certain purpose that concerns with information credibility, particularly citing... more

descriptionView Paper arrow_downwardDownload

Cumulated gain-based indicators of IR performance

by Kal Jarvelin

2025

Modern large retrieval environments tend to overwhelm their users by their large output. Since all documents are not of equal relevance to their users, highly relevant documents should be identified and ranked first for presentation to... more

descriptionView Paper arrow_downwardDownload

Reinforcement online learning to rank with unbiased reward shaping

by Zhihao Qiao

2025, Information Retrieval Journal

Online learning to rank (OLTR) aims to learn a ranker directly from implicit feedback derived from users’ interactions, such as clicks. Clicks however are a biased signal: specifically, top-ranked documents are likely to attract more... more

descriptionView Paper arrow_downwardDownload

Competition for popularity and interventions on a Chinese microblogging site

by Janos Kertesz

2025, PLOS ONE

Microblogging sites are important vehicles for the users to obtain information and shape public opinion thus they are arenas of continuous competition for popularity. Most popular topics are usually indicated on ranking lists. In this... more

descriptionView Paper arrow_downwardDownload

Visual Comparison of Images Using Multiple Kernel Learning for Ranking

by mohamed Ismail

2025, Procedings of the British Machine Vision Conference 2015

Ranking is the central problem for many applications such as web search, recommendation systems, and visual comparison of images. In this paper, the multiple kernel learning framework is generalized for the learning to rank problem. This... more

descriptionView Paper arrow_downwardDownload

Learning to rank definitions to generate quizzes for interactive information presentation

by Kohji Dohsaka

2025, Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions - ACL '07

This paper proposes the idea of ranking definitions of a person (a set of biographical facts) to automatically generate "Who is this?" quizzes. The definitions are ordered according to how difficult they make it to name the person. Such... more

descriptionView Paper arrow_downwardDownload

The application of social recommendation algorithm integrating attention model in movie recommendation

by Pengjia Cui

2025, Scientific Reports

To improve the accuracy of recommendations, alleviate sparse data problems, and mitigate the homogenization of traditional socialized recommendations, a gated recurrent neural network is studied to construct a relevant user preference... more

descriptionView Paper arrow_downwardDownload

Learning to Rank for Active Learning: A Listwise Approach

by Joost van de Weijer

2025

Active learning emerged as an alternative to alleviate the effort to label huge amount of data for data-hungry applications (such as image/video indexing and retrieval, autonomous driving, etc.). The goal of active learning is to... more

descriptionView Paper arrow_downwardDownload

A Learning Theory of Ranking Aggregation

by Stéphan Clémençon

2025, HAL (Le Centre pour la Communication Scientifique Directe)

Originally formulated in Social Choice theory, Ranking Aggregation, also referred to as Consensus Ranking, has motivated the development of numerous statistical models since the middle of the 20th century. Recently, the analysis of... more

descriptionView Paper arrow_downwardDownload

User recommendation system based on MIND dataset

by Ahmed Obaid

2025, arXiv (Cornell University)

Nowadays, it's a very significant way for researchers and other individuals to achieve their interests because it provides short solutions to satisfy their demands. Because there are so many pieces of information on the internet, news... more

descriptionView Paper arrow_downwardDownload

Ranking and Selecting Clustering Algorithms Using a Meta-Learning Approach

by Sabino araujo

2025, Proceedings of the International Joint Conference on Neural Networks

We present a novel framework that applies a metalearning approach to clustering algorithms. Given a dataset, our meta-learning approach provides a ranking for the candidate algorithms that could be used with that dataset. This ranking... more

descriptionView Paper arrow_downwardDownload

Representation learning for entity type ranking

by Md Mostafizur Rahman

2025, Proceedings of the 35th Annual ACM Symposium on Applied Computing

The type of an entity is a key piece of information to understand what an entity is and how it relates to other entities mentioned in a document. Search engine result pages (SERPs) often surface facts and entity type information from a... more

descriptionView Paper arrow_downwardDownload

Is learning to rank worth it? A statistical analysis of learning to rank methods

by Marcos Goncalves

2024

The Learning to Rank (L2R) research field has experienced a fast paced growth over the last few years, with a wide variety of benchmark datasets and baselines available for experimentation. We here investigate the main assumption behind... more

descriptionView Paper arrow_downwardDownload

AttentiveBugLocator: A Bug Localization Model using Attention-based SemanticFeatures and Information Retrieval

by Mohamed Kholief

2024, Research Square (Research Square)

In recent years, deep learning-based algorithms such as CNN, LSTM, and auto-encoders have been proposed to rank suspicious buggy őles. Meanwhile, representational learning has served to be the best approach to extract rich semantic... more

descriptionView Paper arrow_downwardDownload

Generating Pseudo Test Collections for Learning to Rank Scientific Articles

by Manos Tsagkias

2024, Lecture Notes in Computer Science

Pseudo test collections are automatically generated to provide training material for learning to rank methods. We propose a method for generating pseudo test collections in the domain of digital libraries, where data is relatively sparse,... more

descriptionView Paper arrow_downwardDownload

FEM Analysis of High Impact Velocity on Composite Laminated Plates

by CHANDAN KUMAR

2024, International Journal For Scientific Research and Development

The World Wide Web contains the large amount of information sources and these are increasing tremendously. When the user searching the web for information retrieval, user may fetch irrelevant and redundant data causing a waste in user... more

descriptionView Paper arrow_downwardDownload

Towards Sequential Counterfactual Learning to Rank

by Tesi Xiao

2024

Counterfactual evaluation plays a crucial role in learning-to-rank problems, as it addresses the discrepancy between the data logging policy and the policy being evaluated, due to the presence of presentation bias. Existing counterfactual... more

descriptionView Paper arrow_downwardDownload

Learning temporal-dependent ranking models

by Mario Esteban Donayre Silva

2024, Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval

Web archives already hold together more than 534 billion files and this number continues to grow as new initiatives arise. Searching on all versions of these files acquired throughout time is challenging, since users expect as fast and... more

descriptionView Paper arrow_downwardDownload

Automatic multilabel categorization using learning to rank framework for complaint text on Bandung government

by dzikri ilhamy fauzan

2024, 2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)

Learning to rank is a technique in machine learning for ranking problem. This paper aims to investigate this technique to classify the responsible agencies of each complaint text of LAPOR, which is our government complaint management... more

descriptionView Paper arrow_downwardDownload

Learning to Rank Effective Paraphrases from Query Logs for Community Question Answering

by alejandro figueroa

2024, Proceedings of the AAAI Conference on Artificial Intelligence

We present a novel method for ranking query paraphrases for effective search in community question answering (cQA). The method uses query logs from Yahoo! Search and Yahoo! Answers for automatically extracting a corpus of paraphrases of... more

descriptionView Paper arrow_downwardDownload

Category-specific models for ranking effective paraphrases in community Question Answering

by alejandro figueroa

2024, Expert Systems with Applications

Platforms for community-based Question Answering (cQA) are playing an increasing role in the synergy of informationseeking and social networks. Being able to categorize user questions is very important, since these categories are good... more

descriptionView Paper arrow_downwardDownload

Multi-Field Models in Neural Recipe Ranking - An Early Exploratory Study

by Kentaro Takiguchi

2024, arXiv (Cornell University)

Explicitly modelling field interactions and correlations in complex document structures has recently gained popularity in neural document embedding and retrieval tasks. Although this requires the specification of bespoke task-dependent... more

descriptionView Paper arrow_downwardDownload

RankDNN: Learning to Rank for Few-shot Learning

by Yanwei Fu

2024, arXiv (Cornell University)

descriptionView Paper arrow_downwardDownload

Scoring anomalies : a M-estimation formulation

by Stéphan Clémençon

2024, HAL (Le Centre pour la Communication Scientifique Directe)

It is the purpose of this paper to formulate the issue of scoring multivariate observations depending on their degree of abnormality/novelty as an unsupervised learning task. Whereas in the 1-d situation, this problem can be dealt with by... more

Incidentally, we point out that the empirical MV curve of the scoring function produced by the algorithm above is always convex, just like the target MV™ (see Proposition 4). Borrowing standard concepts of the finite element method, consider the ”hat functions”: Pe(-) = PO; (@x-1,%)) — VCs (Qe, An41)), for LS k < K, with v(a,(a’,a@”)) = (a—a’)/(a” — a’) - {a € [a’,a”"|} for a’ < a”, and set WK = (-;(ax,1)). We may then write:

descriptionView Paper arrow_downwardDownload

On Bootstrapping the ROC Curve

by Stéphan Clémençon

2024, HAL (Le Centre pour la Communication Scientifique Directe)

This paper is devoted to thoroughly investigating how to bootstrap the ROC curve, a widely used visual tool for evaluating the accuracy of test/scoring statistics in the bipartite setup. The issue of confidence bands for the ROC curve is... more

descriptionView Paper arrow_downwardDownload

Ranking the best instances

by Stéphan Clémençon

2024

We formulate a local form of the bipartite ranking problem where the goal is to focus on the best instances. We propose a methodology based on the construction of real-valued scoring functions. We study empirical risk minimization of... more

descriptionView Paper arrow_downwardDownload

Learning to Rank

Related Topics