Web Search Engine

description2,037 papers

group8 followers

lightbulbAbout this topic

A web search engine is a software system designed to search for information on the World Wide Web. It indexes web pages and retrieves relevant results based on user queries, utilizing algorithms to rank the relevance and authority of the content.

lightbulbAbout this topic

Key research themes

1. What are effective user search strategies and behaviors for locating specific information on the Web?

This research area investigates how users approach Web information searching, their success rates, search patterns, duration, step counts, and strategy effectiveness. Understanding these user-centric aspects is vital because Web search involves complex cognitive and technical skills, and user behavior directly impacts search outcomes and frustration levels.

Needle in a Hyperstack: Searching Information on the World Wide Web

by Rafi Nachmias

2016

Key finding: Through an empirical study involving 54 graduate students performing search tasks, this work identified typical characteristics of Web search processes including search duration, number of steps, and identified common search... Read more

articleView Paper downloadDownload

A study of the intension of using computer as a strategic resource of web searching

by YM Chu

2023

Key finding: Using a controlled pretest-posttest design, this study showed that explicit teaching of 'Technology Strategic Usefulness (TSU)' significantly enhanced high-school students' perceived usefulness and strategic intent in using... Read more

articleView Paper downloadDownload

Evaluation and evolution of a browse and search interface: Relation browser

by gary marchionini

2025

Key finding: This work developed and empirically evaluated the Relation Browser++ (RB++), a novel interface combining faceted category overviews and dynamic filtering to tightly couple browsing and searching across large information sets.... Read more

articleView Paper downloadDownload

A Novel Architecture for Search Engine using Domain Based Web Log Data

by DIVAKAR YADAV

2025, The International Arab Journal of Information Technology

Key finding: The study proposed a proxy server architecture which caches search results in a domain-specific Web log to expedite repeated user queries. Experimental evaluations with duplicate queries across domains showed significant... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. How do search engine architectures and algorithms address scalability and efficiency in crawling and indexing vast Web content?

Research under this theme explores the design, implementation, and optimization of crawling architectures and indexing strategies that enable search engines to efficiently gather and organize vast and dynamic Web content. Scaling to billions of pages requires algorithms for distributed crawling, handling AJAX-based dynamic content, load balancing, and incremental updating while improving speed and reliability.

Efficient Distributed Web Crawler Using Hefty and Enhanced Bandwidth Algorithms for Drug Website Search

by Dr Aghila Rajagopal

2024, International Journal of Machine Learning and Networked Collaborative Engineering

Key finding: Introduced a combined approach using an enhanced Hefty algorithm and bandwidth optimization to implement a distributed Web crawler minimizing redundant crawling and maximizing crawl throughput for drug-related websites. The... Read more

articleView Paper downloadDownload

Web Crawler: Design And Implementation For Extracting Article-Like Contents

by Ngo Le Huy Hien

2023, Cybernetics and physics

Key finding: This study presented a machine learning-based web crawler designed to extract article-like content from diverse web pages by leveraging visual, trivial HTML, and text-based features. The approach specifically addresses... Read more

articleView Paper downloadDownload

Crawling Ajax-Based Web Applications: Evolution and State-Of-The-Art

by shah khalid

2023, Malaysian Journal of Computer Science

Key finding: Provided a comprehensive survey of the evolution and methodologies in crawling AJAX-based Web applications, which present unique challenges due to dynamic content and multiple states per URL. Identified key issues such as... Read more

articleView Paper downloadDownload

Needle in a Hyperstack: Searching Information on the World Wide Web

by Rafi Nachmias

2016

Key finding: Additionally, by emphasizing user difficulties in locating relevant information and highlighting the explosion of Web content, this work indirectly underscores the necessity for efficient crawling and indexing architectures... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. What are contemporary techniques in Search Engine Optimization (SEO), ranking algorithms, and semantic search that improve search relevance and page ranking?

This theme encapsulates advanced methodologies for optimizing Web page ranking and retrieval relevance, focusing on both technical page optimization (e.g., page speed, audit rules) and semantic understanding through ontology and knowledge representation. These approaches inform search engines to serve more accurate, relevant, and quality results to users, overcoming shortcomings of simple keyword matching.

Identification of an Optimized Google PageSpeed Audit-Rule-Sequence to Optimize Page Speed

by Mohsin Ashraf

2025

Key finding: The study systematically analyzed the impact of Google PageSpeed audit rules on website performance, identifying a prioritized sequence of audit rules that, when applied, yielded over 80% performance improvement after... Read more

articleView Paper downloadDownload

Semantic Search Engine

by Principal Dr Pradip M Jawandhiya

2023, Indian Journal of Science and Technology

Key finding: Proposed an ontology-based semantic search engine framework for the tourism domain leveraging WordNet to construct synonym sets enabling deeper semantic query matching beyond keywords. Experiments showed improved retrieval... Read more

articleView Paper downloadDownload

Web Page Ranking using Web Mining Techniques: A comprehensive survey

by DIVAKAR YADAV

2025

Key finding: Surveyed web mining techniques (structure, content, usage mining) and their application in developing ranking algorithms. Highlighted that combining hyperlink analysis (e.g., PageRank), content analysis, and user behavior... Read more

articleView Paper downloadDownload

Improving the Web Search Using Search Engines Operators

by Radu Cretulescu

2023

Key finding: This work evaluated how search engine query modifiers (operators) can be employed to effectively refine user queries, significantly reducing result set size and increasing precision. Empirical results with Google and Bing... Read more

articleView Paper downloadDownload

Search Engine Techniques

by Rami Hijazi

2024, Open Source Intelligence Methods and Tools

Key finding: Discusses advanced Google search operators and techniques for optimizing query specificity, such as field searches (title:, link:), truncation, exclusion, proximity operators, and leveraging services like image and flight... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Web Search Engine

Hacia la optimización de un sistema de recuperación de información

by Viviana Ledesma

2025, XXII Workshop de Investigadores en Ciencias de la Computación (WICC 2020, El Calafate, Santa Cruz)

descriptionView Paper arrow_downwardDownload

Investigating query bursts in a web search engine

by Carlos Castillo

2025, Web Intelligence and Agent Systems: An International Journal

Abstr act. The Internet has become for many the most important medium for staying informed about current news events. Some events cause heightened interest on a topic, which in turn yields a higher frequency of the search queries related... more

descriptionView Paper arrow_downwardDownload

An Introduction to Information Searching Techniques

by Chinthaka Suranjith

2025, "Studies of Social Sciences" Academic Journal

descriptionView Paper arrow_downwardDownload

Study of Indexing Techniques to Improve the Performance of Information Retrieval in Telugu Language

by Dr.Ramakrishna Kolikipogu

2025

Information Retrieval Systems (IRS) are so popular through World Wide Web. Availability of Text Information related to all types of objects like Documents, Web Pages,Images, Videos and Audio files on web are increasing day by day in an... more

descriptionView Paper arrow_downwardDownload

Malaysian Web Search Engines: A Critical Analysis

by Kiran kaur

2025, Malaysian Journal of Library and Information …

This paper reports the results of a study conducted to explore and compare the features of independently built Malaysian Web search engines, as well as evaluate their performance and search capabilities. Four Malaysian independently built... more

descriptionView Paper arrow_downwardDownload

How Will Online Affiliate Marketing Networks Impact Search Engine Rankings

by Eric Van Heck

2025, RePEc: Research Papers in Economics

In online affiliate marketing networks advertising web sites offer their affiliates revenues based on provided web site traffic and associated leads and sales. Advertising web sites can have a network of thousands of affiliates providing... more

descriptionView Paper arrow_downwardDownload

A Multi-Agent System for Information Semantic Sharing

by Agostino Poggi

2025

AOIS is a multi-agent system that supports the sharing of information among a community of users connected through the Internet. In respect to Web search engines, this system enhances the search through domain ontologies, avoids the... more

descriptionView Paper arrow_downwardDownload

Architecture of a grid-enabled Web search engine

by Cevdet Aykanat

2025, Information Processing & Management

Search Engine for South-East Europe (SE4SEE) is a socio-cultural search engine running on the grid infrastructure. It offers a personalized, on-demand, country-specific, category-based Web search facility. The main goal of SE4SEE is to... more

descriptionView Paper arrow_downwardDownload

SE4SEE: A Grid-Enabled Search Engine for South-East Europe

by Cevdet Aykanat

2025

Search Engine for South-East Europe (SE4SEE) is an application project aiming to develop a grid-enabled search engine that specifically targets the countries in the South-East Europe. It is one of the two selected regional applications... more

descriptionView Paper arrow_downwardDownload

An Efficient Workload-balancing Algorithm for a Parallel Environment Using Hybrid Spatio-temporal Indexes

by Marco A Palomino

2025, The Journal of Universal Computer Science (J.UCS)

In recent years, we have witnessed the proliferation of applications that generate thousands of terabytes of data per day, due to the explosive increase in storage capacity across various devices. As a consequence, a new concept called... more

descriptionView Paper arrow_downwardDownload

Searching the Web Through User Information Spaces

by Athanasios Papagelis

2025, Lecture Notes in Computer Science

During the last years web search engines have moved from the simple but inefficient syntactical analysis (first generation) to the more robust and usable web graph analysis (second generation). Much of the current research is focussed on the so-called third generation search engines that, in principle, inject "human characteristics" on how results are obtained and presented to the end user. Approaches exploited towards this direction include (among others): an alteration of PageRank [1] that takes into account user specific characteristics and bias the page ordering using the user preferences (an approach, though, that does not scale well with the number of users). The approach is further exploited in , where several PageRanks are computed for a given number of distinct search topics. A similar idea is used in , where the PageRank computation takes into account the content of the pages and the query terms the surfer is looking for. In , a decomposition of PageRank to basic components is suggested that may be able to scale the different PageRank computations to a bigger number of topics or even distinct users. Another approach to web search is presented in , where a rich extension of the web, called semantic web, and the application of searching over this new setting is described. In this work we depart from the above lines of research and propose a new conceptual framework for representing the web and potentially improving search results. In particular, the new framework views the web as a collection of webrelated data collected and semi-organized by individual users inside their information spaces. These data can be explicitly collected (e.g., bookmarks) or implicitly collected (e.g., web-browsing history). Our approach is based on the observation that users act as small crawlers seeking information on the web using various media (search engines, catalogs, word-of-mouth, hyperlinks, direct URL typing, etc). They tend to store and organize important-for-them pages in tree-like structures, referred to as bookmark collections, where the folder names act as tags over the collected URLs. This method of organizing data helps people to recall collected URLs faster, but can also be used as a kind of semantic tagging over the URLs (the path to the URL can be perceived as different ways to communicate the URL itself). This information constitutes part of the user's personal information space and it is indicative of his interests. One might argue that people do not collect bookmarks or that they do not organize them in any reasonable manner. However, as our experiments show, people indeed collect and organize bookmarks under certain patterns that follow power law distributions.

descriptionView Paper arrow_downwardDownload

Experience of Developing a Meta-semantic Search Engine

by Trupti Pagare

2025, 2013 International Conference on Cloud & Ubiquitous Computing & Emerging Technologies

Thinking of today's web search scenario which is mainly keyword based, leads to the need of effective and meaningful search provided by Semantic Web. Existing search engines are vulnerable to provide relevant answers to users query due to... more

descriptionView Paper arrow_downwardDownload

Predicting Response Uncertainty in Online Surveys: A Proof of Concept

by Cátia Cepeda

2025

Online questionnaire-based research is growing at a fast pace. Mouse-tracking methods provide a potentially important data source for this research by enabling the capture of respondents' online behaviour while answering questionnaire... more

descriptionView Paper arrow_downwardDownload

Mouse Tracking Measures and Movement Patterns with Application for Online Surveys

by Cátia Cepeda

2025, Lecture Notes in Computer Science

There is growing interest in the field of human-computer interaction in the use of mouse movement data to infer e.g. user's interests, preferences and personality. Previous work has defined various patterns of mouse movement behavior.... more

descriptionView Paper arrow_downwardDownload

Search Engines Evaluation

by Dr.Rakesh Kumar

2025, DESIDOC Bulletin of Information Technology

The volume of world wide web ( WWW) is increasing enormously due to a world wide move to migrate information to online sources. To search some information on WWW, search engines are used, which when presented with queries, return a list... more

descriptionView Paper arrow_downwardDownload

Aspects of Medical Information Search

by celia Boyer

2025

The Internet is increasingly used to find health information worldwide. Online health information search can be beneficial for novice users but due to the overwhelming quantity and uneven quality of online health information it may also... more

descriptionView Paper arrow_downwardDownload

Search Engines Evaluation

by Rakesh Kumar

2025, DESIDOC Bulletin of Information Technology

descriptionView Paper arrow_downwardDownload

(147) TTLS: A Grouped Display of Search Results based on Organizational Taxonomy using the LCC&K Interface

by offer drori

2025, Leibniz Center for Research in Computer Science

(147) One of the major problems in the process of Information Retrieval (IR) arises at the stage where the user reviews the results list. This paper presents the latest research in a series of research works that aims at finding the most... more

descriptionView Paper arrow_downwardDownload

(ׂ140ׁ) Grouping Search Results by Organizational Taxonomy Using LCC&K Interface

by offer drori

2025, Leibniz Center Technical Reports

(140) One of the major problems in the process of Information Retrieval (IR) arises at the stage where the user reviews the results list. This paper presents the latest research in a series of research works that aims at finding the most... more

descriptionView Paper arrow_downwardDownload

Improving the performance of personal name disambiguation using web directories

by Quang Minh Vũ

2025, Information Processing & Management

Frequent requests from users to search engines on the World Wide Web are to search for information about people using personal names. Current search engines only return sets of documents containing the name queried, but, as several people... more

descriptionView Paper arrow_downwardDownload

Incremental learning to rank with partially-labeled data

by Seungjin Choi

2025, Proceedings of the 2009 workshop on Web Search Click Data

In this paper we present a semi-supervised learning method for a problem of learning to rank where we exploit Markov random walks and graph regularization in order to incorporate not only "labeled" web pages but also plenty of "unlabeled"... more

descriptionView Paper arrow_downwardDownload

Usage of a binary integrated spell check algorithm for an upgraded search engine optimization

by jabez j

2025

Search engines have become an integral part of our lives. To augment the power of such engines-even while offline-was our goal. To accomplish this, a distinct offline search engine was created to retrieve data from archives. An updated UI... more

descriptionView Paper arrow_downwardDownload

Identification of an Optimized Google PageSpeed Audit-Rule-Sequence to Optimize Page Speed

by Mohsin Ashraf

2025

World Wide Web is a collection of online resources and websites, including e-commerce, social sites, educational content, etc. To find relevant online resources, people search these by using search engines by providing their desired... more

descriptionView Paper arrow_downwardDownload

Web Mining Based Distributed Crawling with Instant Backup Supports

by YOGESH PAWAR

2025

As the World Wide Web is growing rapidly and data in the present day scenario is stored in a distributed manner. The need to develop a search Engine based architectural model for people to search through the Web. Broad web search engines as well as many more specialized search tools rely on web crawlers to acquire large collections of pages for indexing and analysis. The crawler is an important module of a web search engine. The quality of a crawler directly affects the searching quality of such web search engines. Such a web crawler may interact with millions of hosts over a period of weeks or months, and thus issues of robustness, flexibility, and manageability are of major importance. Given some URLs, the crawler should retrieve the web pages of those URLs, parse the HTML files, add new URLs into its queue and go back to the first phase of this cycle. The crawler also can retrieve some other information from the HTML files as it is parsing them to get the new URLs. In this paper, we describe the design of a web crawler that uses Page Rank algorithm for distributed searches and can be run on a network of workstations. The crawler initially search for all the stop words (such as a, an, the, and etc). While searching the web pages for some keyword the crawler will initially remove all collected stop word. Also at the same time the crawler will search for snippets from web documents. All the matching word & collected snippet will be stored in temporary cache memory created at central server of crawlers. Where after applying page rank algorithm on the basis of no. of visit of web pages we will arrange the pages according to their ranks & display the results. Since, due to extensive search on web through web crawlers the chances of various virus attacks are more & processing capacity of system may get halt so to provide solution in such scenario we can provide backup to our system by creating web services. The web service will be designed in such manner that any valid updations to any database servers will automatically updates the backup servers. Therefore, even in failure of any server system, we can continue with crawling process.

descriptionView Paper arrow_downwardDownload

Web Mining Based Distributed Crawling with Instant Backup Supports

by YOGESH PAWAR

2025, IJCST

descriptionView Paper arrow_downwardDownload

The Compass Filter: Search Engine Result Personalization Using Web Communities

by M. Sideri

2025, Lecture Notes in Computer Science

We propose a simple approach to search engine personalization based on Web communities . User information -in particular, the Web communities whose neighborhoods the user has selected in the past-is used to change the order of the... more

descriptionView Paper arrow_downwardDownload

Federated search in the wild

by Dong Nguyen

2025, Proceedings of the 21st ACM international conference on Information and knowledge management

Federated search has the potential of improving web search: the user becomes less dependent on a single search provider and parts of the deep web become available through a unified interface, leading to a wider variety in the retrieved... more

descriptionView Paper arrow_downwardDownload

Semantic Web Search Based on Ontological Conjunctive Queries

by Georg Gottlob

2025, Lecture Notes in Computer Science

Many experts predict that the next huge step forward in Web information technology will be achieved by adding semantics to Web data, and will possibly consist of (some form of) the Semantic Web. In this paper, we present a novel approach... more

descriptionView Paper arrow_downwardDownload

Search Engines Evaluation

by RAKESH Kumar

2025, DESIDOC Bulletin of Information Technology

descriptionView Paper arrow_downwardDownload

An Approach to Developing an Ontology that Represents Knowledge Embedded in Filmed Materials

by Reyad Binzabiah

2025

This paper introduces the reader to the approach we are taking to develop an ontology that could be used to represent the knowledge inherent in filmed materials. Such an ontology could be used as the semantic basis for multimedia... more

descriptionView Paper arrow_downwardDownload

Study of "Semantic Web" For Finding Relevant Information on Web

by Jyoti Kukade

2025, Journal of emerging technologies and innovative research

descriptionView Paper arrow_downwardDownload

A Terminological Search Algorithm for Ontology Matching

by Mohammad Nematbakhsh

2025, International journal of sciences

Most of the ontology alignment tools use terminological techniques as the initial step and then apply the structural techniques to refine the results. Since each terminological similarity measure considers some features of similarity,... more

descriptionView Paper arrow_downwardDownload

Compressed multi-framed signature files

by Seyit Koçberber

2025, Proceedings of the 1999 ACM symposium on Applied computing

A new indexing method. called Compressed Multi-Framed Signature File (C-MFSF). that uses a partial query evaluation strategy with compressed signature bit slices is presented. In C-MFSF. a signature tile is divided into variable sized... more

descriptionView Paper arrow_downwardDownload

Collection and Selection Based Relevant Degrees Of Docume

by Zekri Lougmiri

2025

In this paper, we address the problem of selection collections. This is important for locating responses in digital libraries. The aim of methods, which deal with the area of information retrieval, is to reduce the amount of the exchanged... more

descriptionView Paper arrow_downwardDownload

Manageable Approaches to the Semantic Web

by ID Phil166

2025

The Semantic Web is usually envisaged as a collection of Web acces- sible RDF documents that re-use RDF schemas. These schemas are expected to be most often independently designed and hence not sharing many categories. We are unconvinced... more

descriptionView Paper arrow_downwardDownload

Asynchronous and Anticipatory Filter-Stream Based Parallel Algorithm for Frequent Itemset Mining

by Adriano Veloso

2025, Lecture Notes in Computer Science

In this paper we propose a novel parallel algorithm for frequent itemset mining. The algorithm is based on the filter-stream programming model, in which the frequent itemset mining process is represented as a data flow controlled by a... more

descriptionView Paper arrow_downwardDownload

A Novel Architecture for Search Engine using Domain Based Web Log Data

by DIVAKAR YADAV

2025, The International Arab Journal of Information Technology

Search engines, an information retrieval tool are the main source of information for users' information need now a day. For every query, the search engine explores its repository and/or indexer to find the relevant documents/URLs for that... more

descriptionView Paper arrow_downwardDownload

Web Page Ranking using Web Mining Techniques: A comprehensive survey

by DIVAKAR YADAV

2025

Purpose: Due to the exponential growth of internet users and internet traffic, information seekers are highly dependent upon search engines to extract relevant information. Due to the accessibility of a large amount of textual, audio,... more

descriptionView Paper arrow_downwardDownload

Parallel crawler architecture and web page change detection

by DIVAKAR YADAV

2025, WSEAS Transactions on …

In this paper, we put forward a technique for parallel crawling of the web. The World Wide Web today is growing at a phenomenal rate. It has enabled a publishing explosion of useful online information, which has produced the unfortunate... more

descriptionView Paper arrow_downwardDownload

An Approach to Design Incremental Parallel Webcrawler

by DIVAKAR YADAV

2025

World Wide Web (WWW) is a huge repository of interlinked hypertext documents known as web pages. Users access these hypertext documents via Internet. Since its inception in 1990, WWW has become many folds in size, and now it contains more... more

descriptionView Paper arrow_downwardDownload

Search Engines Evaluation

by RAKESH KUMAR

2025, DESIDOC Bulletin of Information Technology

descriptionView Paper arrow_downwardDownload

Retrieval of Web Documents Using a Fuzzy Hierarchical Clustering

by Komal Bhatia

2025, International Journal of Computer Applications

The World Wide Web has huge amount of information that is retrieved using information retrieval tool like Search Engine. Page repository of Search Engine contains the web documents downloaded by the crawler. This repository contains... more

descriptionView Paper arrow_downwardDownload

An Algorithm For Gaze Region Estimation On Web Pages

by simon Maina

2025, International Journal of Scientific & Technology Research

Accurate gaze region estimation on the web is important for the purpose of placing marketing advertisements in web pages and monitoring authenticity of user’s response in web forms. To identify gaze region on the web, we need cheap, less... more

descriptionView Paper arrow_downwardDownload

AROOOGA: An Audio Search Engine for theWorldWideWeb

by Ian Knopke

2025, International Computer Music Conference

Existing search engines use web crawlers to gather web pages. The extracted information is used to build indexes, which are later used to answer user queries. This approach is useful for general queries, but ignores the special properties... more

descriptionView Paper arrow_downwardDownload

Machine Learning as a New Search Engine Interface: An Overview

by Taposh Neogy

2025, Engineering international

The essence of a web page is an inherently predisposed issue, one that is built on behaviors, interests, and intelligence. There are relatively a ton of reasons web pages are critical to the new world, as the matter cannot be... more

descriptionView Paper arrow_downwardDownload

SpiderServer: the MetaSearch engine of WebNaut

by Nick Zacharis

2025, Proceedings of Hellenic Conference on …

Abstract. Search engines on the Web are valuable tools for searching information according to a user's interests whether an individual or a software agent. In the present article we describe the design and the operation mode of... more

descriptionView Paper arrow_downwardDownload

Automatic Translation in Cross-Lingual Access to Legislative Databases

by Catherine Bounsaythip

2025

This paper considers the use of controlled languages for query translation in a legislative document retrieval system. Problem statement and analysis of the approach are described. The use of controlled languages is motivated by the fact... more

descriptionView Paper arrow_downwardDownload

Economic background of the Microsoft/Yahoo! case

by Andrea Amelio

2025, Citeseer

descriptionView Paper arrow_downwardDownload

Aging effects on query flow graphs for query suggestion

by CARLOS CASTILLO

2025, Proceedings of the 18th ACM conference on Information and knowledge management

World Wide Web content continuously grows in size and importance. Furthermore, users ask Web search engines to satisfy increasingly disparate information needs. New techniques and tools are constantly developed aimed at assisting users in... more

descriptionView Paper arrow_downwardDownload

Query-log mining for detecting polysemy and spam

by CARLOS CASTILLO

2025, Proceedings of the KDD Workshop on Web Mining and Web Usage Analysis (WEBKDD)

Abstract. Through their interaction with search engines, users provide implicit feedback that can be used to extract useful knowledge and improve the quality of the search process. This feedback is encoded in the form of a query log that... more

descriptionView Paper arrow_downwardDownload

Web Search Engine

Key research themes

1. What are effective user search strategies and behaviors for locating specific information on the Web?

2. How do search engine architectures and algorithms address scalability and efficiency in crawling and indexing vast Web content?

3. What are contemporary techniques in Search Engine Optimization (SEO), ranking algorithms, and semantic search that improve search relevance and page ranking?

Related Topics

All papers in Web Search Engine