Improving Domain Searches through Customized Search Engines
2011
https://doi.org/10.4018/978-1-60960-595-7.CH001…
3 pages
Abstract
Arguably, the most important driver in the growth of the Internet and e-commerce is the existence of easy-to-use and effective search engines. This makes search engines an integral part of the world economy. Unfortunately, there is no single best search engine for all contexts. Algorithms suited for a domain such as medical research (Mao & Tian, 2009) are not effective for searching the Se
Related papers
International Journal of Information Retrieval Research, 2013
Researchers are widely using Google Scholar to find research articles and relevant experts in their domain, but a quick search cannot surface all experts in a given research area from a specific country, because a custom search technique is not available in Google Scholar's current setup. The authors combine custom search with domain-specific search, naming the result domain-specific custom search. This research introduces, for the first time, a domain-specific custom search technique using a new search methodology called the n-paged-m-items partial crawling algorithm. Thanks to its partial crawling technique, the algorithm is a faster, real-time crawling algorithm; it stores nothing in a database for later display to the user. The proposed algorithm is implemented on a new domain, scholar.google.com, to find scholars or experts quickly. Finally the authors observe the better performance of the proposed algorithm ...
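The abstract does not give the algorithm's code, but a rough sketch of what an n-paged-m-items partial crawl might look like in Python follows: fetch at most n result pages, take at most m items from each, and return the items to the caller without persisting anything to a database. The URL, query parameters, and CSS selector are purely hypothetical placeholders, and requests/BeautifulSoup are assumed tooling, not the authors' stack.

import requests
from bs4 import BeautifulSoup

def partial_crawl(base_url, query, n_pages=3, m_items=10):
    # Hypothetical n-paged-m-items partial crawl: visit at most n_pages
    # result pages and keep at most m_items entries from each page.
    results = []
    for page in range(n_pages):
        # Assumed pagination scheme; real sites paginate differently.
        resp = requests.get(base_url, params={"q": query, "start": page * m_items})
        soup = BeautifulSoup(resp.text, "html.parser")
        for entry in soup.select(".result")[:m_items]:   # assumed selector
            results.append(entry.get_text(strip=True))
    return results   # returned directly to the user, never stored

Because nothing is written to a database, the crawl's cost is bounded by n_pages * m_items, which is what makes the partial technique fast enough for real-time use.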
The traditional Web, the largest information database, lacks semantics, and as a result the information available on the Web is understandable only by humans, not by machines. With the rapid increase in the amount of information on networks, search engines have become the infrastructure through which people access Web information, and search is the second-largest Internet application after e-mail. However, a search engine returns a huge number of results, and the relevance between those results and user queries varies. There are many search engines available today, but retrieving meaningful information remains difficult. To overcome this problem and let search engines retrieve meaningful information intelligently, Semantic Web technology plays a major role. In this light, our paper proposes an algorithm and architecture for a Semantic Web-based search engine named EWEBSEARCH, powered by XML meta-tags (which ensure machine understandability) to improve web search. The EWEBSEARCH model provides a simple interface to capture users' queries (keywords); the query engine then processes the queries against a repository (database) using the search engine algorithm, interpreting the queries, retrieving results, and ranking them appropriately to satisfy the users' queries. Query answers are ranked using extended information-retrieval techniques and returned in ranked order, and an implementation of the model is described.
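As a minimal sketch of the idea (not the EWEBSEARCH implementation itself), one can imagine documents carrying XML meta-tags and a query being ranked by its overlap with those tags; the document markup and scoring below are invented for illustration.

import xml.etree.ElementTree as ET

DOCS = [
    "<doc url='a.html'><meta name='topic'>semantic web search</meta></doc>",
    "<doc url='b.html'><meta name='topic'>cooking recipes</meta></doc>",
]

def rank(query):
    # Score each document by how many query terms its meta-tags contain.
    terms = set(query.lower().split())
    scored = []
    for xml in DOCS:
        root = ET.fromstring(xml)
        tag_words = set()
        for meta in root.findall("meta"):
            tag_words |= set(meta.text.lower().split())
        overlap = len(terms & tag_words)
        if overlap:
            scored.append((overlap, root.get("url")))
    return [url for _, url in sorted(scored, reverse=True)]

print(rank("semantic search"))   # -> ['a.html']

The meta-tags stand in for the machine-understandable layer the abstract describes: the engine matches structured tags rather than raw page text.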
Information Retrieval, 2006
When searching for health information, result quality can be judged against available scientific evidence: do search engines return advice consistent with evidence-based medicine? We compared the performance of domain-specific health and depression search engines against a general-purpose engine (Google) on both relevance of results and quality of advice. Over 101 queries, to which the term 'depression' was added if not already present, Google returned more relevant results than the domain-specific engines. However, over the 50 treatment-related queries, Google returned 70 pages recommending for or against a well-studied treatment, of which 19 strongly disagreed with the scientific evidence. A domain-specific index of 4 sites selected by domain experts was wrong in only 5 of 50 recommendations. Analysis suggests a tension between relevance and quality: indexing more pages can give a greater number of relevant results, but selective inclusion can give better quality.
The Indonesian Journal of Electrical Engineering and Computer Science (IJEECS), 2023
Search engines play a vital role in information retrieval (IR), indexing and processing vast and diverse data that now encompasses an ever-expanding wealth of multimedia content. However, search engine performance relies on the efficiency and effectiveness of the underlying information retrieval systems (IRS). To enhance search engine performance, there is a need to develop more efficient and accurate IRS that retrieve relevant information quickly and accurately. To address this challenge, various approaches have been proposed for IR, including inverted indexing, query expansion, and relevance feedback. Although these approaches have shown promising results, their effectiveness and limitations require comprehensive examination. This research aims to investigate the challenges and opportunities in designing an efficient IRS for search engines and to identify key areas for improvement and future research. The study involves a comprehensive literature review of IR as it affects academia, industry, healthcare, e-commerce, and other domains: researchers rely on search engines to access relevant scientific papers, professionals use them to gather market intelligence, and consumers utilize them for product research and decision-making. The findings of this study will contribute to the development of more efficient and effective IRS, leading to improved search engine performance and user satisfaction.
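Of the approaches the abstract names, inverted indexing is the most mechanical, so a tiny sketch may help; the toy corpus and conjunctive-query semantics below are illustrative assumptions, not taken from the paper.

from collections import defaultdict

docs = {
    1: "search engines index multimedia content",
    2: "relevance feedback improves retrieval",
    3: "inverted index speeds up search",
}

# Build the inverted index: each term maps to the set of documents containing it.
index = defaultdict(set)
for doc_id, text in docs.items():
    for term in text.lower().split():
        index[term].add(doc_id)

def search(query):
    # Conjunctive query: intersect the posting lists of all query terms.
    postings = [index[term] for term in query.lower().split()]
    return set.intersection(*postings) if postings else set()

print(search("index search"))   # -> {1, 3}

Query expansion and relevance feedback would sit on top of this structure, rewriting the query or reweighting terms before the posting lists are consulted.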
In order to solve the problem of information overload on the web and in large domains, current information retrieval tools, especially search engines, need to be improved. Much more intelligence should be embedded in search tools to manage the search and filtering processes effectively and to present relevant information. As the web swells with more and more data, the predominant way of sifting through all of that data, keyword search, will one day break down in its ability to put exactly the information people want at their fingertips. Hence search engines are trying to break the shackles of keyword search, which is what most search engines typically do. This paper tries to identify the major challenges today's keyword search engines face in adapting to the fast growth of the web and supporting comprehensive user demands in quick time. It then surveys the different non-keyword paradigms proposed, developed, or implemented by researchers and by different search engines, and classifies those approaches according to the features the different search engines focus on to deliver results.
International Journal of Information Processing and Management, 2013
The explosive growth of data available on the Internet exposes a serious problem, information overflow, in which users rarely get the information they actually need and which creates blind spots in information search. A blind spot is an area that search engines cannot access, so there is no way for users to get the information inside it. These blind spots are widening, causing the loss of valuable information relevant to users' queries. The problem of blind spots stems from the way the current leading search engines, Google and Yahoo, navigate the web: they crawl web pages periodically and automatically, store them in indexed databases, and retrieve search results in response to queries. However, the rapid growth of web data imposes a limit on how many pages can be indexed, which mass-produces data areas that the search engines cannot access. Besides, because they depend on a few keywords, they still retrieve useless results, leaving users to wander again for the information they really need. The truly required way of searching is to provide valuable and accurate search results to users in a customized way and to deliver information from the viewpoint of the user, not the viewpoint of the search engine provider. Recently, fresh search engines have been developed and released, centered in Silicon Valley, whose objectives are intelligent and specialized search results as well as easy user interfaces. In this manuscript, we introduce some representatives of the newly published search engines while systematically surveying and classifying the currently existing web search engines.
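The blind-spot effect the abstract describes can be illustrated with a toy crawler whose index budget runs out before the whole link graph is visited; the graph and budget below are made up for demonstration.

from collections import deque

WEB = {  # hypothetical link graph: page -> outgoing links
    "home": ["a", "b"], "a": ["c"], "b": ["d"], "c": [], "d": ["e"], "e": [],
}

def crawl(seed, budget):
    # Breadth-first crawl that stops once the index budget is exhausted.
    indexed, frontier = set(), deque([seed])
    while frontier and len(indexed) < budget:
        page = frontier.popleft()
        if page not in indexed:
            indexed.add(page)
            frontier.extend(WEB[page])
    return indexed

indexed = crawl("home", budget=4)
print(set(WEB) - indexed)   # -> {'d', 'e'}: the blind spot no query can reach

Every page left outside the index is invisible to users no matter how they phrase their queries, which is exactly the widening blind spot the manuscript is concerned with.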
2019
The amount of information stored on the Web is huge, and without help from search engines it becomes almost impossible to find what you want. Search engines try to index as many web pages as possible: on April 3rd, 2018, Google had indexed over 45.5 billion web pages and Bing just over 4 billion. Even with the help of search engines, the number of page addresses returned for a simple query is quite large, and we, as people, must look through those pages to find what we are really interested in. Finding the right URLs in such a huge data space is challenging and requires some special techniques for refining and reducing the number of returned addresses. In our paper we present some interrogation modifiers and search operators used by the Google search engine to significantly reduce the number of search results and to increase their quality. The presented techniques and commands for search engines that can be used for finding similar we...
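To make the operator idea concrete, the snippet below composes a few widely documented Google operators (exact-phrase quotes, site:, intitle:, and the minus sign) into one query string; the helper function and the example topic are our own illustration, not taken from the paper.

def refine(topic, site=None, intitle=None, exclude=()):
    parts = ['"%s"' % topic]                   # exact-phrase match
    if site:
        parts.append("site:" + site)           # restrict results to one domain
    if intitle:
        parts.append("intitle:" + intitle)     # term must appear in the page title
    parts += ["-" + word for word in exclude]  # exclude unwanted results
    return " ".join(parts)

print(refine("information retrieval", site="arxiv.org",
             intitle="survey", exclude=["slides"]))
# "information retrieval" site:arxiv.org intitle:survey -slides

Each operator prunes the result set further, which is how a handful of modifiers can shrink billions of indexed pages down to a reviewable list.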
International Journal of Advances in Scientific Research and Engineering (ijasre), 2019
Users need more effective ways of organizing and searching for information. As information resources on the World Wide Web continue to grow, it has become increasingly difficult for users to find information that satisfies their individual needs, because the Web offers a mass of information that is generic in perspective. Although generic search engines attempt to streamline searching for a search key by traversing and combing the whole body of stored information, under these circumstances the system turns out to be inefficient, and the results often do not match what users desire. This paper presents a system to find interesting textual content among tons of documents, taking the Department of Computer Science of the University of Abuja, Nigeria as a case study. The paper proposes a vector space ranking algorithm, a content-based ranking method that allows the user to utilize the full co-occurrence matrix of all words in the corpus to bring out relevant material through a simple and structured query interface. The index structures of the system have been specifically designed to support the ranking scheme. One important aspect of the system is its flexibility and robustness, which make it adaptable to any domain rather than tailored to a particular one.
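A stripped-down version of vector space ranking is easy to sketch; the snippet below uses plain term-frequency vectors and cosine similarity, a simplification of the full co-occurrence matrix the paper describes, with an invented toy corpus.

import math
from collections import Counter

docs = [
    "database systems course materials",
    "machine learning lecture notes",
    "course timetable for computer science",
]

def cosine(a, b):
    # Cosine similarity between two sparse term-frequency vectors.
    num = sum(a[t] * b[t] for t in set(a) & set(b))
    den = (math.sqrt(sum(v * v for v in a.values()))
           * math.sqrt(sum(v * v for v in b.values())))
    return num / den if den else 0.0

def rank(query):
    # Order document indices by similarity to the query vector.
    q = Counter(query.lower().split())
    vecs = [Counter(d.lower().split()) for d in docs]
    return sorted(range(len(docs)), key=lambda i: cosine(q, vecs[i]), reverse=True)

print(rank("computer science course"))   # -> [2, 0, 1]

Because the ranking depends only on corpus statistics, the same machinery transfers to any domain, which is the flexibility the paper emphasizes.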
