Query Reformulation

400 papers

12 followers

About this topic

Query reformulation is the process of modifying a user's search query to improve the relevance and accuracy of search results. This involves techniques such as synonym replacement, phrase restructuring, and the addition of context-specific terms to enhance information retrieval in databases and search engines.

Key research themes

1. How can query expansion and term weighting techniques improve the effectiveness of query reformulation in information retrieval systems?

This research theme investigates algorithmic and automated methods for expanding or reformulating an initial query by adding semantically related or relevant terms and determining their importance to improve recall and precision in information retrieval. These approaches address challenges such as lexical gaps, short query lengths, and user naivety in query formulation by leveraging semantic similarity metrics, lexical relations, or graph-based syntactic dependencies to generate richer queries and better represent user intent.
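As a rough illustration of the expansion-and-weighting idea described above, the sketch below expands a query from a small hand-written synonym table and gives the added terms a lower weight than the original ones. The table `SYNONYMS`, the function `expand_query`, and the weight of 0.5 are hypothetical choices made for this example; they are not taken from any of the papers listed under this theme.

```python
from collections import OrderedDict

# Hypothetical synonym table. A real system might draw related terms from a
# thesaurus, an ontology, or word embeddings instead.
SYNONYMS = {
    "car": ["automobile", "vehicle"],
    "cheap": ["inexpensive", "affordable"],
}


def expand_query(query, synonyms=SYNONYMS, expansion_weight=0.5):
    """Return an ordered mapping of term -> weight for the expanded query."""
    weights = OrderedDict()
    original_terms = query.lower().split()

    # Original terms keep full weight.
    for term in original_terms:
        weights[term] = 1.0

    # Related terms are appended with a reduced weight; setdefault ensures
    # that a term already in the query is never downgraded.
    for term in original_terms:
        for related in synonyms.get(term, []):
            weights.setdefault(related, expansion_weight)

    return weights


expanded = expand_query("cheap car")
for term, weight in expanded.items():
    print(f"{term}: {weight}")
```

Down-weighting the added terms is one simple way to raise recall while keeping the user's original wording dominant in the ranking.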

Xu: An Automated Query Expansion and Optimization Tool

by Morgan Gallant

2022

Key finding: The paper presents Xu, an automated query expansion technique that integrates multiple semantic similarity sources including Datamuse API and a Wikipedia-trained Word2Vec model to generate expanded queries. Xu demonstrates... Read more

An Algorithmic Query Refinement Model based on Query Classification

by Behin Sam

2022, Indian Journal of Science and Technology

Key finding: This study proposes a hybrid algorithmic refinement model that classifies web queries and refines them by generating candidate terms using both ontology and thesaurus sources. The classification-based approach reduces query... Read more

STRICT: Information Retrieval Based Search Term Identification for Concept Location

by Masud Rahman, PhD

2019, SANER/IEEE

Key finding: The STRICT technique automatically identifies suitable search terms for software change tasks using graph-based term weighting algorithms TextRank and POSRank, which analyze term co-occurrences and linguistic syntactic... Read more

Evaluation of semantic relations impact in query expansion-based retrieval systems

by Lorenzo Massai

2023, Knowledge-Based Systems

Key finding: This work develops automated semantic resource generation methods exploiting taxonomies with synonymy, antonymy, and other semantic relations to reformulate user queries as intents. It constructs semantic expansion corpora... Read more

Negative Relevance Feedback for Exploratory Search with Visual Interactive Intent Modeling

by Jonathan Strahl

2023

Key finding: The paper introduces a novel exploratory search system enabling both positive and negative feedback directly on keyword features of a probabilistic user intent model via an interactive visual interface. Negative relevance... Read more

2. How does adapting and optimizing query reformulation benefit specialized domains like bug localization and software maintenance?

This research theme explores query reformulation techniques tailored to domain-specific applications such as bug localization and software maintenance, where the input queries (e.g., bug reports) often lack explicit structured information or contain noisy elements like stack traces. These approaches focus on contextual query reformulation, quality-aware preprocessing, and dynamic term selection to improve localization accuracy and reduce developer effort by automatically refining suboptimal queries inherent in domain texts.

The Forgotten Role of Search Queries in IR-based Bug Localization: An Empirical Study

by Masud Rahman, PhD

2022, Springer

Key finding: Through an empirical study on 2,320 bug reports and multiple query construction methodologies, this paper exposes that many natural language-only bug reports inherently contain high-quality keywords for bug localization even... Read more

Improving IR-Based Bug Localization with Context-Aware Query Reformulation

by Masud Rahman, PhD

2019, ESEC-FSE/ACM

Key finding: BLIZZARD, the proposed technique, classifies bug reports into noisy, rich, or poor based on their structured information content and applies suitable query reformulations accordingly. Evaluations conducted on 5,139 bug... Read more

Poster: Improving Bug Localization with Report quality Dynamics and Query Reformulation

by Masud Rahman, PhD

2019, ICSE/ACM

Key finding: This empirical study replicates three existing IR-based bug localization techniques using a dataset of 5,500 bug reports clustered by structured information quality (stack traces, program entities, plain text). Results reveal... Read more

3. What roles do cognitive abilities and alternative query formulation strategies play in users' interaction with query reformulation processes?

This theme investigates the interplay between individual cognitive differences and query reformulation behaviors during information seeking. It focuses on understanding how cognitive abilities such as visualization and memory impact users' usage of query modification moves, and explores alternative query formulation interfaces and languages that go beyond traditional Boolean-based querying, aiming to reduce user difficulty and broaden effective query construction across diverse user profiles.

How does Cognitive Ability impact the use of Query Reformulation Moves?

by Erin K Moore

2017

Key finding: Secondary analysis of user study data reveals that higher visualization ability correlates with significantly more frequent use of query reformulation moves, including term manipulations, compared to users with lower... Read more

No IFs, ANDs, or ORs: A Study of Database Querying

by Louis M Gomez

2017

Key finding: This experimental study compares a conventional Boolean-based query language (SQL) with a Truth-table Exemplar-Based Interface (TEBI) that allows users to construct queries by selecting exemplar tuples, bypassing the need for... Read more

All papers in Query Reformulation

Integrating Heterogeneous Data Sources Using XML

by Yogesh Rochlani

2025

Nowadays organizations are not only increasing their data volume, but also have to work with a large variety of data sources with different types of data. The central problem of information source integration resides in their... more

@GEWEB : Agents personnels d’aide à la recherche sur le Web

by Ismaïl Biskri

2025

In this article we present a software tool that assists the user, in a personalized way, during document searches on the Web. The software's architecture is based on the integration of digital processing tools... more

Reformulating Aggregate Queries Using Views

by Michael Genesereth

2025, Symposium on Abstraction, Reformulation and Approximation

Effectiveness of keyword-based display and selection of retrieval results for interactive searches

by Claudio Carpineto

2025, International Journal on Digital Libraries

We present an approach to increasing the effectiveness of ranked-output retrieval systems that relies on graphical display and user manipulation of "views" of retrieval results, where a view is the subset of retrieved documents that... more

Realistic belief revision

by John Slaney

2025

In this paper we consider the implications for belief revision of weakening the logic under which belief sets are taken to be closed. A widely held view is that the usual belief revision functions are highly classical, especially in being... more

ENHANCING SPARQL QUERY REWRITING FOR COMPLEX ONTOLOGY ALIGNMENTS

by Anicet Lepetit Ondo

2025, International Journal of Web & Semantic Technology (IJWesT) Vol.16, No.2

SPARQL query rewriting is a fundamental mechanism for uniformly querying heterogeneous ontologies in the Linked Data Web. However, the complexity of ontology alignments, particularly rich correspondences (c : c), makes this process... more

Self-organizing schema mappings in the gridvine peer data management system

by Suchit Agarwal

2025, Proceedings of the …

GridVine is a Peer Data Management System based on a decentralized access structure. Built following the principle of data independence, it separates a logical layer--where data, schemas and mappings are managed--from a physical layer... more

Semantic Features for Classifying Referring Search Terms

by Eric Marshall

2025

Rewriting XPath queries using materialized views

by Z. Ozsoyoglu

2025, Very Large Data Bases

As a simple XML query language but with enough expressive power, XPath has become very popular. To expedite evaluation of XPath queries, we consider the problem of rewriting XPath queries using materialized XPath views. This problem is... more

Equivalence of SQL queries in presence of embedded dependencies

by Michael Genesereth

2024

We consider the problem of finding equivalent minimal-size reformulations of SQL queries in presence of embedded dependencies. Our focus is on select-project-join (SPJ) queries with equality comparisons, also known as safe conjunctive (CQ) queries, possibly with grouping and aggregation. For SPJ queries, the semantics of the SQL standard treat query answers as multisets (a.k.a. bags), whereas the stored relations may be treated either as sets, which is called bag-set semantics for query evaluation, or as bags, which is called bag semantics. (Under set semantics, both query answers and stored relations are treated as sets.) In the context of the above Query-Reformulation Problem, we develop a comprehensive framework for equivalence of CQ queries under bag and bag-set semantics in presence of embedded dependencies, and make a number of conceptual and technical contributions. Specifically, we develop equivalence tests for CQ queries in presence of arbitrary sets of embedded dependencies under bag and bag-set semantics, under the condition that chase [10] under set semantics (set-chase) on the inputs terminates. We also present equivalence tests for aggregate CQ queries in presence of embedded dependencies. We use our equivalence tests to develop sound and complete (whenever set-chase on the inputs terminates) algorithms for solving instances of the Query-Reformulation Problem with CQ queries under each of bag and bag-set semantics, as well as for instances of the problem with aggregate queries. Some of our results are of independent interest. In particular, it is known that constraints that force some relations to be sets on all instances of a given database schema arise naturally in the context of sound (i.e., correct) chase [9] under bag semantics. We develop a formal framework for defining such constraints as embedded dependencies, provided that row (tuple) IDs, commonly used in commercial database-management systems, are defined for the respective relations.
We also extend the condition of [4] for bag equivalence of CQ queries, to those cases where some relations are set valued in all instances of the given schema. Our proof of this nontrivial result includes reasoning involving bag (non)containment. In particular, we provide an original proof (adapted to our context) of the result of [4] that CQ query Q1 is bag contained in CQ query Q2 only if, for each predicate used in Q1, Q2 has at least as many subgoals with this predicate as Q1 does. Our contributions are clearly applicable beyond the Query-Reformulation Problem considered in this paper. Specifically, the results of this paper can be used in developing algorithms for rewriting CQ queries and queries in more expressive languages (e.g., including grouping and aggregation, or arithmetic comparisons) using views in presence of embedded dependencies, under bag or bag-set semantics for query evaluation. This text contains corrections to Sections 2.4 and 4 of [5].
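The subgoal-count condition quoted in this abstract can be checked mechanically. The sketch below is only an illustration of that one necessary condition, not the paper's equivalence tests or algorithms: it assumes a hypothetical encoding of a conjunctive query as a list of `(predicate, arguments)` pairs, and the function names are invented for this example. Passing the check does not establish bag containment; failing it rules containment out.

```python
from collections import Counter


def subgoal_counts(query):
    """Count how many subgoals of the query use each predicate."""
    return Counter(predicate for predicate, _args in query)


def may_be_bag_contained(q1, q2):
    """Necessary (not sufficient) condition for q1 to be bag contained in q2.

    For every predicate used in q1, q2 must have at least as many subgoals
    with that predicate as q1 does.
    """
    counts1 = subgoal_counts(q1)
    counts2 = subgoal_counts(q2)
    # Counter returns 0 for predicates that do not occur in q2.
    return all(counts2[pred] >= n for pred, n in counts1.items())


# Hypothetical queries, written as lists of (predicate, arguments) subgoals.
q1 = [("R", ("x", "y")), ("R", ("y", "z"))]
q2 = [("R", ("x", "y")), ("S", ("y",))]
q3 = [("R", ("x", "y")), ("R", ("y", "z")), ("S", ("z",))]

print(may_be_bag_contained(q1, q2))  # q2 has only one R subgoal
print(may_be_bag_contained(q1, q3))  # q3 has two R subgoals
```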

Annotated Last Unicorn Commentary translated chapter 7

by Makoto Kuroda

2024

The Hagsgate town episode which was omitted in the animated film. There are very interesting parts concerning the prophecy and curse set on the prosperous town. The existence of prince Lír aggregates “antifantasy” elements, through... more

Interoperability in peer data management systems

by Kai-Uwe Sattler

2024

Interoperability plays an important role for a variety of applications. Among them are Peer Data Management Systems, where autonomous data sources (peers) interact with each other based on semantic mappings between their schemas. The... more

Search improvement via automatic query reformulation

by John B. Smith

2024, ACM Transactions on Information Systems

Users of online retrieval systems experience many difficulties, particularly with search tactics. User studies have indicated that searchers use vocabulary incorrectly and do not take full advantage of iteration to improve their queries.... more

A schema-driven approach for knowledge-oriented retrieval and query formulation

by Hany Azzam

2024, Proceedings of the Third International Workshop on Keyword Search on Structured Data

In order to search across factual knowledge and content explicated using different data formats this paper leverages a generic data model (schema) that transforms keyword-based retrieval models and queries to knowledge-oriented models and... more

Nouvelles méthodes pour l'évaluation, l'évolution et l'interrogation des bases du Web des données. (New methods to evaluate, check and query the Web of data)

by Pierre Maillot

2024

The Web of data offers an environment for sharing and disseminating data, within a particular framework that allows the data to be exploited by both humans and machines. To this end, the RDF framework proposes formatting the... more

Consolidating User Search Histories using Query Groups

by P M Chawan

2024

With the exponential growth in web users, search history is also growing exponentially. To manage web search, search engines use different techniques. They make it easy for users to search for their interests by providing page ranking,... more

SIXTH FRAMEWORK PROGRAMME PRIORITY IST-2002-2.3. 1.12 Technology-enhanced Learning and Access to Cultural Heritage

by Vassilis Christophides

2024

Executive Summary Digital libraries can be viewed as an infrastructure for supporting both the creation of information sources and the movement of information across global networks, and moreover the effective and efficient interaction... more

Ontology based data warehouses federation management system

by Naoual Mouhni

2024

Data warehouses are nowadays an important component in every competitive system; they are one of the main components on which business intelligence is based. We can even say that many companies are climbing to the next level and use a set of... more

A Flexible Meta-Wrapper Interface for Autonomous Distributed Information Sources

by Maria Vidal

2024

We support flexible query processing with autonomous networked information sources. Flexibility allows a query to be accepted in a dynamic environment with unavailable sources. Flexibility provides the ability to identify equivalent... more

Intelligent query processing for semantic mediation of information systems

by Guy Caplat

2024, Egyptian Informatics Journal

We propose an intelligent and efficient query processing approach for semantic mediation of information systems. We also propose a generic multi-agent architecture that supports our approach. Our approach focuses on the exploitation of... more

Alignment of short length parallel corpora with an application to web search

by Hema Koppula

2024

With the evolving Web, short length parallel corpora are becoming very common and some of these include user queries, web snippets etc. This paper concerns situations where short length parallel corpora have to be analyzed in order to find... more

Alignment of short length parallel corpora with an application to web search

by Hema Koppula

2024, Proceedings of the 19th ACM international conference on Information and knowledge management - CIKM '10

Guided Composition of Tasks with Logical Information Systems - Application to Data Analysis Workflows in Bioinformatics

by Mouhamadou Djamal Bâ

2024, Lecture Notes in Computer Science

In a number of domains, particularly in bioinformatics, there is a need for complex data analysis. For that issue, elementary data analysis operations called tasks are composed as workflows. The composition of tasks is however difficult... more

FORUM: a flexible data integration system based on data semantics

by Farouk Toumani

2024, Sigmod Record

The FORUM project aims at extending existing data integration techniques in order to facilitate the development of mediation systems in large and dynamic environments. It is well known from the literature that a crucial point that hampers... more

Interprétation linguistique de requêtes pour un moteur de questions réponses grand public

by Johannes Heinecke

2024, CORIA

ABSTRACT. This article describes the use of a natural language processing platform for developing a question-answering function in an engine for... more

Analyzing the Performance of a Multiobjective GA-P Algorithm for Learning Fuzzy Queries in a Machine Learning Environment

by María Luque

2024, Lecture Notes in Computer Science

The fuzzy information retrieval model was proposed some years ago to solve several limitations of the Boolean model without a need of a complete redesign of the information retrieval system. However, the complexity of the fuzzy query... more

A New Hybrid Document Clustering for PRF-Based Automatic Query Expansion Approach for Effective IR

by Yogesh Gupta

2024, International Journal of e-Collaboration

Automatic query expansion (AQE) is an effective measure to improve information retrieval performance by including additional terms in a user query. The pseudo relevance feedback (PRF) method employed for AQE so far has suffered from a... more

Provenance-Based Retrieval: Fostering Reuse and Reproducibility Across Scientific Disciplines

by Lucas Augusto

2024, Lecture Notes in Computer Science

When computational researchers from several domains cooperate, one recurrent problem is finding tools, methods and approaches that can be used across disciplines, to enhance collaboration through reuse. The paper presents our ongoing work... more

by Charly Castillo

2024, SIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Defining a measure of similarity between queries is an interesting and difficult problem. A reliable query-similarity measure can be used in a variety of applications such as query recommendation, query expansion, and advertising. In this... more

Figure 1 shows an example. The sizes of the obtained subgraphs vary widely, with |V2(d)| ranging from 31 to 20702 queries (median: 2320 queries).

Figure 4: Example: query similarities using cosine similarity of neighbors (no projection) and projection of the full graph for the query "watch". Lines separate manually-assigned clusters for these queries.

Figure 5: Example query similarities using projection of the subgraph S2(q) for the query "watch"

(a) Method: N, i.e., cosine similarity between vectors of neighbors in the QFG

Table 6: Result of user test for assessing diversity of recommendations (significance: 0.1 *, 0.05 **, 0.01 ***)

Figure 2: Comparison among local projections, projection of the full graph and direct use of the query-flow graph

Figure 3: Summary of performance of some systems

Table 3: Average Me obtained for S2(q), S3(q) using different weighting schemes.

Table 2: Average Mg for different projections with different number of dimensions

Table 5: Mg for varying numbers of clusters; none of the pair-wise differences is statistically significant

Multimodal Image Retrieval over a Large Database

by Adrian Popescu

2024, Lecture Notes in Computer Science

We introduce a new multimodal retrieval technique which combines query reformulation and visual image reranking in order to deal with results sparsity and imprecision, respectively. Textual queries are reformulated using Wikipedia... more

Buyer agent to enhance consumer awareness: SAATHI

by Karina Hernandez

2024, Electronic Commerce Research and Applications

Personal agents have been developed that assist a user with information processing needs by generating, filtering, collecting, or transforming information. On the other hand internet stores are providing services customized by the needs... more

XViz: A Tool for Visualizing XPath Expressions

by Ben Handy

2024, Lecture Notes in Computer Science

We describe a visualization tool for XPath expressions called XViz. Starting from a workload of XQueries, the tool extracts the set of all XPath expressions, and displays them together with some relationships. XViz is intended to be used... more

Ontology Based Query Reformulation using Rhetorical Relations

by Fiaz Majeed

2024

Web searching is becoming more and more complex due to increased size of information on the web. Users have to face a lot of problems in specifying their needs in the form of query. Query Reformulation techniques are required in order to... more

Progress Report of Spoken Document Processing Working Group

by Tomoko Matsui

2024, Scientific Programming

This report describes the activities of SLP Spoken Document Processing Working Group (SDPWG). The SDPWG was organized in 2006. The working group was reorganized in 2009. This report mainly describes the activities of the second period of... more

Peer coordination through distributed triggers

by Verena Kantere

2024, Proceedings of the VLDB Endowment

This is a demonstration of data coordination in a peer data management system through the employment of distributed triggers. The latter express in a declarative manner individual security and consistency requirements of peers, that... more

Think outside the search box: A comparative study of visual and form-based query builders

by Tony Russell-Rose

2024, arXiv (Cornell University)

Knowledge workers such as healthcare information professionals, legal researchers, and librarians need to create and execute search strategies that are comprehensive, transparent, and reproducible. The traditional solution is to use... more

Does User Search Behaviour Mediate User Knowledge and Search Satisfaction?

by WAN HUSSAIN WAN ISHAK

2024, DOAJ (DOAJ: Directory of Open Access Journals)

Information searching in a web environment is habitually a tedious and challenging task. Rapid growth of web information infrastructure has led to the rapid publication of information in the web environment. Too much information published on the web... more

Efficient evaluation of n-ary conjunctive queries over trees and graphs

by Tim Furche

2024, Proceedings of the 8th …

N-ary conjunctive queries, i.e., queries with any number of answer variables, are the formal core of many Web query languages including XSLT, XQuery, SPARQL, and Xcerpt. Despite a considerable body of research on the optimization of such... more

Uncovering Task Based Behavioral Heterogeneities in Online Search Behavior

by Prasanta Bhattacharya

2024, Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

While a major share of prior work has considered search sessions as the focal unit of analysis for seeking behavioral insights, search tasks are emerging as a competing perspective in this space. In the current work, we quantify user... more

An Intelligent Interface for Accessing a Technical Data Base

by Carlo Tasso

2024, Analysis, Design and Evaluation of Man–Machine Systems

The paper deals with assisting end-users in accessing a European Reliability Data System: the design of an intelligent interface aimed at large and complex technical data base (namely, the ERDS) in a friendly, correct, and effective... more

Preserving the Original Query Semantics in Routing Processes

by Ana Salgado

2024, Proceedings of the 16th International Conference on Enterprise Information Systems

In distributed data environments, peers (data sources) are connected with each other through a set of semantic correspondences in such a way that peers directly connected are called semantic neighbours. Queries are submitted considering... more

Semantic Loss in Query Reformulation in Dynamic Distributed Environments

by Ana Salgado

2024

Dynamic environments are decentralized systems that provide users with querying capabilities over a set of heterogeneous, distributed and autonomous data sources. Data Integration Systems, Peer Data Management Systems (PDMS) and... more

Relevance feedback versus local context analysis as term suggestion devices: Rutgers’ TREC-8 interactive track experience

by Judy Jeng

2024, Text REtrieval Conference

Query formulation and reformulation is recognized as one of the most difficult tasks that users in information retrieval systems are asked to perform. This study investigated the use of two different techniques for supporting query... more

Automatic Taxonomy Extraction from Query Logs with no Additional Sources of Information

by Miguel Fernandez

2024, arXiv (Cornell University)

Search engine logs store detailed information on Web users' interactions. Thus, as more and more people use search engines on a daily basis, important trails of users' common knowledge are being recorded in those files. Previous research... more

Clinico-Radiographic spectrum of cleidocranial dysplasia: A case series

by Anita Munde

2024, Indian Journal of Case Reports

Success of query reformulation and relevant information retrieval depends on many factors, such as users' prior knowledge, age, gender, and cognitive styles. One of the important factors that affect a user's query reformulation behaviour... more

InOrder: enhancing Google via stigmergic query refinement

by Mihaela Ulieru

2023, Comput. Syst. Sci. Eng.

InOrder is a query refinement tool that works on top of Google and helps individual users to collaboratively participate in best Web query formulations. The incremental refinement works via an indirect communication process facilitated by... more

Réponses coopératives dans l'interrogation de documents RDF

by Adrian Tanasescu

2023, HAL (Le Centre pour la Communication Scientifique Directe)

Abstract. The development of the Semantic Web has led to the creation of standards for representing knowledge on the Web. RDF, as one of these standards, has become a W3C recommendation. Even though it was designed... more
