Efficient Ranking on Entity Graphs with Personalized Relationships

LansA Informatics Pvt Ltd

Outline

Title

Abstract

All Topics

Computer Science

Efficient Ranking on Entity Graphs with Personalized Relationships

LansA Informatics Pvt Ltd

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

Authority flow techniques like PageRank and ObjectRank can provide personalized ranking of typed entity-relationship graphs.

Ken-ichi Kawarabayashi

Proceedings of the VLDB Endowment, 2014

We propose a new scalable algorithm that can compute Personalized PageRank (PPR) very quickly. The Power method is a state-of-the-art algorithm for computing exact PPR; however, it requires many iterations. Thus reducing the number of iterations is the main challenge. We achieve this by exploiting graph structures of web graphs and social networks. The convergence of our algorithm is very fast. In fact, it requires up to 7.5 times fewer iterations than the Power method and is up to five times faster in actual computation time. To the best of our knowledge, this is the first time to use graph structures explicitly to solve PPR quickly. Our contributions can be summarized as follows. 1. We provide an algorithm for computing a tree decomposition, which is more efficient and scalable than any previous algorithm. 2. Using the above algorithm, we can obtain a core-tree decomposition of any web graph and social network. This allows us to decompose a web graph and a social network into (1) the core, which behaves like an expander graph, and (2) a small tree-width graph, which behaves like a tree in an algorithmic sense. 3. We apply a direct method to the small tree-width graph to construct an LU decomposition. 4. Building on the LU decomposition and using it as preconditoner, we apply GMRES method (a state-of-theart advanced iterative method) to compute PPR for whole web graphs and social networks.

downloadDownload free PDF View PDFchevron_right

Entity ranking and relationship queries using an extended graph model

Prashant Jaiswal

International Conference on Management of Data, 2012

There is a large amount of textual data on the Web and in Wikipedia, where mentions of entities (such as Gandhi) are annotated with a link to the disambiguated entity (such as M. K. Gandhi). Such annotation may have been done manually (as in Wikipedia) or can be done using named entity recognition/disambiguation techniques. Such an annotated corpus allows queries to return entities, instead of documents. Entity ranking queries retrieve entities that are related to keywords in the query and belong to a given type/category specified in the query; entity ranking has been an active area of research in the past few years. More recently, there have been extensions to allow entity-relationship queries, which allow specification of multiple sets of entities as well as relationships between them. In this paper we address the problem of entity ranking ("near") queries and entity-relationship queries on the Wikipedia corpus. We first present an extended graph model which combines the power of graph models used earlier for structured/semi-structured data, with information from textual data. Based on this model, we show how to specify entity and entity-relationship queries, and defined scoring methods for ranking answers. Finally, we provide efficient algorithms for answering such queries, exploiting a space efficient in-memory graph structure. A performance comparison with the ERQ system proposed earlier shows significant improvement in answer quality for most queries, while also handling a much larger set of entity types.

downloadDownload free PDF View PDFchevron_right

AuthorRank: Ranking Improvement for the Web

Aidan Hogan

Abstract. As the wealth of data on the World Wide Web grows, and as the structuring of that data improves, more sophisticated applications can be developed to derive meaningful characteristics relating to the content and structure of that data. In particular, ranking the various elements of sets of structured information is of great utility with respect to semantic network analysis.

downloadDownload free PDF View PDFchevron_right

Generalizing PageRank

Ricardo Baeza-Yates

Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '06, 2006

This paper introduces a family of link-based ranking algorithms that propagate page importance through links. In these algorithms there is a damping function that decreases with distance, so a direct link implies more endorsement than a link through a long path. PageRank is the most widely known ranking function of this family.

downloadDownload free PDF View PDFchevron_right

Mining Neighbors ’ Topicality to Better Control Authority Flow

Brian D Davison

Web pages are often recognized by others through contexts. These contexts determine how linked pages influence and interact with each other. When differentiating such interactions, the authority of web pages can be better estimated by controlling the authority flows among pages. In this work, we determine the authority distribution by examining the topicality relationship between associated pages. In addition, we find it is not enough to quantify the influence of authority propagation from only one type of neighbor, such as parent pages in PageRank algorithm, since web pages, like people, are influenced by diverse types of neighbors within the same network. We propose a probabilistic method to model authority flows from different sources of neighbor pages. In this way, we distinguish page authority interaction by incorporating the topical context and the relationship between associated pages. Experiments on the 2003 and 2004 TREC Web Tracks demonstrate that this approach outperforms other competitive topical ranking models and produces a more than 10% improvement over PageRank on the quality of top 10 search results. When increasing the types of incorporated neighbor sources, the performance shows stable improvements.

downloadDownload free PDF View PDFchevron_right

An Application of Personalized PageRank Vectors: Personalized Search Engine

Mehmet Aktas

We introduce a tool which is an application of personalized pagerank vectors such as personalized search engines. We use pre-computed pagerank vectors to rank the search results in favor of user preferences. We describe the design and architecture of our tool. By using pre-computed personalized pagerank vectors we generate search results biased to user preferences such as top-level domain and regional preferences. We conduct a user study to evaluate search results of three different ranking methods such as similarity-based ranking, plain PageRank and weighted (personalized) PageRank ranking methods. We discuss the results of our user study and evaluate the benefits our personalized PageRank vectors in personalized search engines.

downloadDownload free PDF View PDFchevron_right

Node ranking in labeled directed graphs

Srinivas Kashyap

Proceedings of the Thirteenth ACM conference on Information and knowledge management - CIKM '04, 2004

Our work is motivated by the problem of ranking hyperlinked documents for a given query. Given an arbitrary directed graph with edge and node labels, we present a new flow-based model and an efficient method to dynamically rank the nodes of this graph with respect to any of the original labels. Ranking documents for a given query in a hyperlinked document set and ranking of authors/articles for a given topic in a citation database are some typical applications of our method. We outline the structural conditions that the graph must satisfy for our ranking to be different from the traditional PageRank. We have built a system using two indices that is capable of dynamically ranking documents for any given query. We validate our system and method using experiments on a few datasets: a crawl of the IBM Intranet (12 million pages), a crawl of the www (30 million pages) and the DBLP citation dataset. We compare our method to existing schemes for topic-biased ranking that require a classifier and the traditional PageRank. In these experiments, we demonstrate that our method is well suited for fine-grained ranking and that our method performs better than the existing schemes. We also demonstrate that our system can obtain an improved ranking with very little impact on query time.

downloadDownload free PDF View PDFchevron_right

Efficient calculation of personalized document rankings

Klaus Stein

Proceedings of the 20th international joint conference …, 2007

Social networks allow users getting personalized recommendations for interesting resources like websites or scientific papers by using reviews of users they trust. Search engines rank documents by using the reference structure to compute a visibility for each document with reference structure-based functions like PageRank. Personalized document visibilities can be computed by integrating both approaches. We present a framework for incorporating the information from both networks, and ranking algorithms using this information for personalized recommendations. Because the computation of document visibilities is costly and therefore cannot be done in runtime, i.e., when a user searches a document repository, we pay special attention to develop algorithms providing an efficient calculation of personalized visibilities at query time based on precalculated global visibilities. The presented ranking algorithms are evaluated by a simulation study.

downloadDownload free PDF View PDFchevron_right

Computing Approximate Customized Ranking

Louiqa Raschid

2009

As the amount of information grows and as users become more sophisticated, ranking techniques become important building blocks to meet user needs when answering queries. PageRank is one of the most successful link-based ranking methods, which iteratively computes the importance scores for web pages based on the importance scores of incoming pages. Due to its success, PageRank has been applied in a number of applications that require customization. We address the scalability challenges for two types of customized ranking. The first challenge is to compute the ranking of a subgraph. Various Web applications focus on identifying a subgraph, such as focused crawlers and localized search engines. The second challenge is to compute online personalized ranking. Personalized search improves the quality of search results for each user. The user needs are represented by a personalized set of pages or personalized link importance in an entity relationship graph. This requires an efficient online computation. To solve the subgraph ranking problem efficiently, we estimate the ranking scores for a subgraph. We propose a framework of an exact solution (IdealRank) and

downloadDownload free PDF View PDFchevron_right

Flexible and efficient querying and ranking on hyperlinked data sources

Louiqa Raschid

Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology - EDBT '09, 2009

There has been an explosion of hyperlinked data in many domains, e.g., the biological Web. Expressive query languages and effective ranking techniques are required to convert this data into browsable knowledge. We propose the Graph Information Discovery (GID) framework to support sophisticated user queries on a rich web of annotated and hyperlinked data entries, where query answers need to be ranked in terms of some customized ranking criteria, e.g., PageRank or ObjectRank. GID has a data model that includes a schema graph and a data graph, and an intuitive query interface. The GID framework allows users to easily formulate queries consisting of sequences of hard filters (selection predicates) and soft filters (ranking criteria); it can also be combined with other specialized graph query languages to enhance their ranking capabilities. GID queries have a well-defined semantics and are implemented by a set of physical operators, each of which produces a ranked result graph. We discuss rewriting opportunities to provide an efficient evaluation of GID queries. Soft filters are a key feature of GID and they are implemented using authority flow ranking techniques; these are query dependent rankings and are expensive to compute at runtime. We present approximate optimization techniques for GID soft filter queries based on the properties of random walks, and using novel path-length-bound and graphsampling approximation techniques. We experimentally validate our optimization techniques on large biological and bibliographic datasets. Our techniques can produce high quality (Top K) answers with a savings of up to an order of magnitude, in comparison to the evaluation time for the exact solution.

downloadDownload free PDF View PDFchevron_right

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.

IOSR Journals

Search engine is most leading and valuable tool that collects the data which is extent and it objectives to offer rising data being reachable to the user. Objective of personalization ranking is to improve the tradition data search and retrieval procedure as per the user concern. Authority flow is the technique of conveying the rating of pages for each user. Authority flow approaches like PageRank and ObjectRank can deal personalized ranking of typed entity-relationship graphs. In entity relationship graph, the authority flow mechanism adjusted with the provision of edge or relationship type. There are two chief processes to personalize authority flow ranking: Node based personalization, where authority constructs since a set of user precise nodes; Edge-based personalization, where the reputation of different edge types is user-specific. Main concentration of the paper is on Edge-based personalization where the hybridization of ScaleRank with clustering algorithm i.e., K Mean clustering and express that the Hybrid ScaleRank provides quick and precise adapted authority flow ranking.

downloadDownload free PDF View PDFchevron_right

Learning to Rank in Entity Relationship Graphs

Louiqa Raschid

Informs Journal on Computing, 2019

Many real-world data sets are modeled as entity relationship graphs or heterogeneous information networks. In these graphs, nodes represent entities and edges mimic relationships. ObjectRank extends the well-known PageRank authority flow-based ranking method to entity relationship graphs using an authority flow weight vector (W). The vector W assigns a different authority flow-based importance (weight) to each edge type based on domain knowledge or personalization. In this paper, our contribution is a framework for Learning to Rank in entity relationship graphs to learn W, in the context of authority flow. We show that the problem is similar to learning a recursive scoring function. We present a two-phase iterative solution and multiple variants of learning. In pointwise learning, we learn W, and hence the scoring function, from the scores of a sample of nodes. In pairwise learning, we learn W from given preferences for pairs of nodes. To demonstrate our contribution in a real setting, we apply our framework to learn the rank, with high accuracy, for a real-world challenge of predicting future citations in a bibliographic archive-that is, the FutureRank score. Our extensive experiments show that with a small amount of training data, and a limited number of iterations, our Learning to Rank approach learns W with high accuracy. Learning works well with pairwise training data in large graphs.

downloadDownload free PDF View PDFchevron_right

Challenges in personalized authority flow based ranking of social media

Louiqa Raschid

Proceedings of the 19th ACM international conference on Information and knowledge management - CIKM '10, 2010

As the social interaction of Internet users increases, so does the need to effectively rank social media. We study the challenges of personalized ranking of blog posts. Web search techniques are inadequate since social media lack many of the characteristics of the Web such as rich document content and an extensive hyperlink graph. Further, user behavior in social media has moved beyond keyword based search and must support users who follow a particular blog or theme. In this research, we extend a social media dataset to exploit the associations between authors, blog posts, and categories (topics) of the posts. We then apply personalized authority flow based ranking algorithms based on the random surfer model. We evaluate our personalization approaches through an extensive study on a range of virtual users whose preferences are defined based on intuitive criteria. Our evaluation shows that the accuracy of our personalized recommendations ranges from good to very good for a majority of users, and outperforms reasonable baseline approaches.

downloadDownload free PDF View PDFchevron_right

A Multi-Entity Page Rank Algorithm

Kavi Mahesh

2017

We propose a generic multi-entity page rank algorithm for ranking a set of related entities of more than one type. The algorithm takes into account not only the mutual endorsements among entities of the same type but also the influences of other types of entities on the ranks of all entities involved. A key idea of our algorithm is the separation of prime and non-prime entities to structure the iterative evolution of the ranks and matrices involved. We illustrate the working of the proposed algorithm in the domain of concurrently ranking research papers, their authors and the affiliated universities.

downloadDownload free PDF View PDFchevron_right

Entity ranking on graphs: Studies on expert finding

Henning Rode

2007

Todays web search engines try to offer services for finding various information in addition to simple web pages, like showing locations or answering simple fact queries. Understanding the association of named entities and documents is one of the key steps towards such semantic search tasks. This paper addresses the ranking of entities and models it in a graph-based relevance propagation framework. In particular we study the problem of expert finding as an example of an entity ranking task. Entity containment graphs are introduced that represent the relationship between text fragments on the one hand and their contained entities on the other hand. The paper shows how these graphs can be used to propagate relevance information from the pre-ranked text fragments to their entities. We use this propagation framework to model existing approaches to expert finding based on the entity's indegree and extend them by recursive relevance propagation based on a probabilistic random walk over the entity containment graphs. Experiments on the TREC expert search task compare the retrieval performance of the different graph and propagation models.

downloadDownload free PDF View PDFchevron_right

SemRank: ranking complex relationship search results on the semantic web

Amit Sheth

… conference on World Wide Web, 2005

While the idea that querying mechanisms for complex relationships (otherwise known as Semantic Associations) should be integral to Semantic Web search technologies has recently gained some ground, the issue of how search results will be ranked remains largely unaddressed. Since it is expected that the number of relationships between entities in a knowledge base will be much larger than the number of entities themselves, the likelihood that Semantic Association searches would result in an overwhelming number of results for users is increased, therefore elevating the need for appropriate ranking schemes. Furthermore, it is unlikely that ranking schemes for ranking entities (documents, resources, etc.) may be applied to complex structures such as Semantic Associations.

downloadDownload free PDF View PDFchevron_right

Generic Damping Functions for Propagating Importance in Link-Based Ranking

Ricardo Baeza-Yates

Internet Mathematics, 2006

This paper introduces a family of link-based ranking algorithms that propagate page importance through links. The algorithms include a damping function which decreases with distance, thus a direct link implies greater endorsement that a link via a longer path. PageRank is the most widely known ranking function of this family.

downloadDownload free PDF View PDFchevron_right

BiRank: Towards Ranking on Bipartite Graphs

Min-Yen Kan

IEEE Transactions on Knowledge and Data Engineering

The bipartite graph is a ubiquitous data structure that can model the relationship between two entity types: for instance, users and items, queries and webpages. In this paper, we study the problem of ranking vertices of a bipartite graph, based on the graph's link structure as well as prior information about vertices (which we term a query vector). We present a new solution, BiRank, which iteratively assigns scores to vertices and finally converges to a unique stationary ranking. In contrast to the traditional random walk-based methods, BiRank iterates towards optimizing a regularization function, which smooths the graph under the guidance of the query vector. Importantly, we establish how BiRank relates to the Bayesian methodology, enabling the future extension in a probabilistic way. To show the rationale and extendability of the ranking methodology, we further extend it to rank for the more generic n-partite graphs. BiRank's generic modeling of both the graph structure and vertex features enables it to model various ranking hypotheses flexibly. To illustrate its functionality, we apply the BiRank and TriRank (ranking for tripartite graphs) algorithms to two real-world applications: a general ranking scenario that predicts the future popularity of items, and a personalized ranking scenario that recommends items of interest to users. Extensive experiments on both synthetic and real-world datasets demonstrate BiRank's soundness (fast convergence), efficiency (linear in the number of graph edges) and effectiveness (achieving state-of-the-art in the two real-world tasks).

downloadDownload free PDF View PDFchevron_right

Review-based Entity-ranking Refinement

Panagiotis Gourgaris, Andreas Kanavos

Proceedings of the 11th International Conference on Web Information Systems and Technologies, 2015

In this paper, we address the problem of entity ranking using opinions expressed in users' reviews. There is an abundance of opinions on the web, which includes reviews of products and services. Specifically, we examine techniques which utilize clustering information, for coping with the obstacle of the entity ranking problem. Building on this framework, we propose a probabilistic network scheme that employs a topic identification method so as to modify ranking of results based on user personalization. The contribution lies in the construction of a probabilistic network which takes as input the belief of the user for each query (initially, all entities are equivalent) and produces a new ranking for the entities as output. We evaluated our implemented methodology with experiments with the OpinRank Dataset where we observed an improved retrieval performance to current re-ranking methods.

downloadDownload free PDF View PDFchevron_right

The PageRank Citation Ranking: Bringing Order to the Web

David Sottimano

The importance of a Web page is an inherently subjective matter, which depends on the readers interests, knowledge and attitudes. But there is still much that can be said objectively about the relative importance of Web pages. This paper describes PageRank, a method for rating Web pages objectively and mechanically, eeectively measuring the human interest and attention devoted to them. We compare PageRank to an idealized random Web surfer. We show how to eeciently compute PageRank for large numbers of pages. And, we show h o w to apply PageRank to search and to user navigation.

downloadDownload free PDF View PDFchevron_right

Efficient Ranking on Entity Graphs with Personalized Relationships

Sign up for access to the world's latest research

Abstract

Related papers

Related papers

Related topics