Relational learning via latent social dimensions

Lei Tang; Huan Liu

doi:10.1145/1557019.1557109

Outline

Artificial Intelligence

Relational learning via latent social dimensions

Huan Liu

2009, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining

https://doi.org/10.1145/1557019.1557109

visibility

…

description

1 page

link

1 file

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract
AI

The paper proposes a relational learning framework, SocDim, which leverages latent social dimensions to improve prediction outcomes in social networks. It addresses the heterogeneous nature of social connections and demonstrates that affiliations among actors significantly affect their interactions. The empirical results suggest that combining network and content features leads to better classification performance, highlighting the importance of soft community detection in utilizing social dimensions for behavioral predictions.

Related papers

Network denoising in social media

Huiji Gao

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining - ASONAM '13, 2013

Social media expands the ways people communicate with each other. On a popular social media website, a user typically has hundreds of contacts (or friends) on average. As a person's social network grows, friend management is increasingly important for effective communications. Often, one can only afford to maintain close friendship in a small scale due to limited time and other resources. In other words, the majority of one's connections are so-so friends and do not hold strong influence on the user. One approach resorts to network denoising, by which unimportant connections are removed as noise. We study the challenges of network denoising in social media and how we can leverage a variety of social media information to denoise the links. We formulate the network denoising task as an optimization problem, and show the efficacy of our network denoising approach and its scalability experimentally in the domain of behavior inference.

downloadDownload free PDF View PDFchevron_right

Followers Are Not Enough: Beyond Structural Communities in Online Social Networks

Joshua Garland, D. Darmon

Community detection in online social networks is typically based on the analysis of the explicit connections between users, such as "friends" on Facebook and "followers" on Twitter. But online users often have hundreds or even thousands of such connections, and many of these connections do not correspond to real friendships or more generally to accounts that users interact with. We claim that community detection in online social networks should be question-oriented and rely on additional information beyond the simple structure of the network. The concept of 'community' is very general, and different questions such as "whom do we interact with?" and "with whom do we share similar interests?" can lead to the discovery of different social groups. In this paper we focus on three types of communities beyond structural communities: activity-based, topic-based, and interaction-based. We analyze a Twitter dataset using three different weightings of the structural network meant to highlight these three community types, and then infer the communities associated with these weightings. We show that the communities obtained in the three weighted cases are highly different from each other, and from the communities obtained by considering only the unweighted structural network. Our results confirm that asking a precise question is an unavoidable first step in community detection in online social networks, and that different questions can lead to different insights about the network under study.

downloadDownload free PDF View PDFchevron_right

Leveraging social media networks for classification

Huan Liu

2011

Abstract Social media has reshaped the way in which people interact with each other. The rapid development of participatory web and social networking sites like YouTube, Twitter, and Facebook, also brings about many data mining opportunities and novel challenges. In particular, we focus on classification tasks with user interaction information in a social network. Networks in social media are heterogeneous, consisting of various relations.

downloadDownload free PDF View PDFchevron_right

Collective Behaviour Prediction Via Social Dimensions Extraction

KANCHAN S A T I S H RATHOD

International journal of engineering research and technology, 2013

Day by day the clicks are increasing in a particular network. Behaviour is nothing but to know the interest and requirement of users. Collective behaviour, which indicates the group of data generated on a large scale. In this framework affiliations of actors are capture by extracting social dimensions and then classify the actors using extracted dimensions. As existing approaches to extract social dimensions are not scalable and can not handle network of huge size. We solve these problem by sparsifying social dimension to make this extraction scalable by using edge-centric clustering scheme and k-means variant algorithm. In social media, multiple modes of actors can be involved in the same network, resulting in a multimode network. In this work, we attempt to harness the predictive power of social connections to determine the preferences or behaviours of individuals such as whether a user supports a certain political view, whether one likes one product, whether he/she would like to vote for a presidential candidate, etc.

downloadDownload free PDF View PDFchevron_right

Joint Label Inference in Networks

Sofus Macskássy

Companion of the The Web Conference 2018 on The Web Conference 2018 - WWW '18

We consider the problem of inferring node labels in a partially labeled graph where each node in the graph has multiple label types and each label type has a large number of possible labels. Our primary example, and the focus of this paper, is the joint inference of label types such as hometown, current city, and employers for people connected by a social network; by predicting these user profile fields, the network can provide a better experience to its users. Existing approaches such as Label Propagation (Zhu et al., 2003) fail to consider interactions between the label types. Our proposed method, called Edge-Explain, explicitly models these interactions, while still allowing scalable inference under a distributed message-passing architecture. On a large subset of the Facebook social network, collected in a previous study (Chakrabarti et al., 2014), EdgeExplain outperforms label propagation for several label types, with lifts of up to 120% for recall@1 and 60% for recall@3.

downloadDownload free PDF View PDFchevron_right

A few good predictions: selective node labeling in a social network

S. Sarawagi, G. Chaudhari, Vashist Avadhanula

Many social network applications face the following problem: given a network G = (V, E) with labels on a small subset O ⊂ V of nodes and an optional set of features on nodes and edges, predict the labels of the remaining nodes. Much research has gone into designing learning models and inference algorithms for accurate predictions in this setting. However, a core hurdle to any prediction effort is that for many nodes there is insufficient evidence for inferring a label. We propose that instead of focusing on the impossible task of providing high accuracy over all nodes, we should focus on selectively making the few node predictions which will be correct with high probability. Any selective prediction strategy will require that the scores attached to node predictions be well-calibrated. Our evaluations show that existing prediction algorithms are poorly calibrated. We propose a new method of training a graphical model using a conditional likelihood objective that provides better calibration than the existing joint likelihood objective. We augment it with a decoupled confidence model created using a novel unbiased training process. Empirical evaluation on two large social networks show that we are able to select a large number of predictions with accuracy as high as 95%, even when the best overall accuracy is only 40%.

downloadDownload free PDF View PDFchevron_right

Increasing the Predictive Power of Affiliation Networks

Lise Getoor

2007

Abstract Scale is often an issue when attempting to understand and analyze large social networks. As the size of the network increases, it is harder to make sense of the network, and it is computationally costly to manipulate and maintain. Here we investigate methods for pruning social networks by determining the most relevant relationships in a social network. We measure importance in terms of predictive accuracy on a set of target attributes of social network groups.

downloadDownload free PDF View PDFchevron_right

Social networks and statistical relational learning: a survey

S. Ferilli

International Journal of Social Network Mining, 2012

One of the most appreciated functionality of computers nowadays is their being a means for communication and information sharing among people. With the spread of the internet, several complex interactions have taken place among people, giving rise to huge information networks based on these interactions. Social networks potentially represent an invaluable source of information that can be exploited for scientific and commercial purposes. On the other hand, due to their distinguishing peculiarities (huge size and inherent relational setting) with respect to all previous information extraction tasks faced in computer science, they require new techniques to gather this information. Social network mining (SNM) is the corresponding research area, aimed at extracting information about the network objects and behaviour that cannot be obtained based on the explicit/implicit description of the objects alone, ignoring their explicit/implicit relationships. Statistical relational learning (SRL) is a very promising approach to SNM, since it combines expressive representation formalisms, able to model complex relational networks, with statistical methods able to handle uncertainty about objects and relations. This paper is a survey of some SRL formalisms and techniques adopted to solve some SNM tasks.

downloadDownload free PDF View PDFchevron_right

Scalable learning of collective behavior based on sparse social dimensions

Huan Liu

2009

Abstract The study of collective behavior is to understand how individuals behave in a social network environment. Oceans of data generated by social media like Facebook, Twitter, Flickr and YouTube present opportunities and challenges to studying collective behavior in a large scale. In this work, we aim to learn to predict collective behavior in social media. In particular, given information about some individuals, how can we infer the behavior of unobserved individuals in the same network?

downloadDownload free PDF View PDFchevron_right

Scalable learning of collective behavior

nishant biradar

Knowledge and Data Engineering, …, 2012

This study of collective behavior is to understand how individuals behave in a social networking environment. Oceans of data generated by social media like Facebook, Twitter, Flickr, and YouTube present opportunities and challenges to study collective behavior on a large scale. In this work, we aim to learn to predict collective behavior in social media. In particular, given information about some individuals, how can we infer the behavior of unobserved individuals in the same network? A social-dimension-based approach has been shown effective in addressing the heterogeneity of connections presented in social media. However, the networks in social media are normally of colossal size, involving hundreds of thousands of actors. The scale of these networks entails scalable learning of models for collective behavior prediction. To address the scalability issue, we propose an edge-centric clustering scheme to extract sparse social dimensions. With sparse social dimensions, the proposed approach can efficiently handle networks of millions of actors while demonstrating a comparable prediction performance to other non-scalable methods.

downloadDownload free PDF View PDFchevron_right

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.

prerna jadhav

IOSR Journal of Computer Engineering, 2014

Now a days a huge data is generated by social media like Facebook, Twitter, Flickr, and YouTube .This big data present opportunities and challenges to study collective behavior of data. In this work, we predict collective behavior in social media. In particular, given information about some individuals, how can we infer the behavior of unobserved individuals in the same network? A social-dimension-based approach has been shown effective in addressing the heterogeneity of connections presented in social media. However, the networks in social media are normally of colossal size, involving hundreds of thousands of actors. The scale of these networks entails scalable learning of models for collective behavior prediction. To address the scalability issue, we propose an edge-centric clustering scheme to extract sparse social dimensions. With sparse social dimensions, the proposed approach can efficiently handle networks of millions of actors while demonstrating a comparable prediction performance to other non-scalable methods.

downloadDownload free PDF View PDFchevron_right

Labeling actors in multi-view social networks by integrating information from within and across multiple views

Vasant G Honavar

2016 IEEE International Conference on Big Data (Big Data), 2016

downloadDownload free PDF View PDFchevron_right

Labeling Actors in Social Networks Using a Heterogeneous Graph Kernel

Vasant G Honavar

Lecture Notes in Computer Science , 2014

We consider the problem of labeling actors in social networks where the labels correspond to membership in specific interest groups, or other attributes of the actors. Actors in a social network are linked to not only other actors but also items (e.g., video and photo) which in turn can be linked to other items or actors. Given a social network in which only some of the actors are labeled, our goal is to predict the labels of the remaining actors. We introduce a variant of the random walk graph kernel to deal with the heterogeneous nature of the network (i.e., presence of a large number of node and link types). We show that the resulting heterogeneous graph kernel (HGK) can be used to build accurate classifiers for labeling actors in social networks. Specifically, we describe results of experiments on two real-world data sets that show HGK classifiers often significantly outperform or are competitive with the state-of-the-art methods for labeling actors in social networks.

downloadDownload free PDF View PDFchevron_right

On the Utility of Abstraction in Labeling Actors in Social Networks

Vasant G Honavar

IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining , 2013

Social networks are naturally represented as heterogeneous networks with multiple types of objects e.g., actors, items and multiple types of links e.g., links between actors that denote social ties e.g., friendship, and links that connect actors to items e.g., photo, video, articles, etc. that denote relationships between actors and items. In this paper, we consider the task of assigning labels to the unlabeled actors (individuals) in a large heterogeneous social network in which labels are available for a subset of actors. Specifically, we seek to learn a predictive model to label actors based on the attributes of the actors themselves and/or items that are linked to them in the network. Unfortunately, the number of distinct items, represented in real-world networks such as Facebook or Flickr is quite large (in the millions) although only a small subset of them are linked to specific actors. This leads to data sparsity which causes over-fitting and hence poor performance in predicting the labels of unlabeled actors. To address this problem, we induce hierarchical taxonomies over items and use the resulting taxonomies as a basis for selecting abstract and hence parsimonious representations of network data for learning the predictive models. Our experiments using three different predictors (Iterative classification Naive Bayes, Iterative classification Logistic Regression, and EdgeCluster) on two real-world data sets, Last.fm and Flickr, show that the predictive models that take advantage of abstract representations of network data are competitive with, and in some cases, outperform those that do not.

downloadDownload free PDF View PDFchevron_right

Inferring user profiles in social media by joint modeling of text and networks

zhishan zhao

Science China Information Sciences, 2019

downloadDownload free PDF View PDFchevron_right

Predicting interactions in online social networks

Christoph Trattner

Proceedings of the 4th International Workshop on Modeling Social Media - MSM '13, 2013

Although considerable amount of work has been conducted recently of how to predict links between users in online social media, studies exploiting different kinds of knowledge sources for the link prediction problem are rare. In this paper latest results of a project are presented that studies the extent to which interactions -in our case directed and bidirected message communication -between users in online social networks can be predicted by looking at features obtained from social network and position data. To that end, we conducted two experiments in the virtual world of Second Life. As our results reveal, position data features are a great source to predict interacts between users in online social networks and outperform social network features significantly. However, if we try to predict reciprocal message communication between users, social network features seem to be superior.

downloadDownload free PDF View PDFchevron_right

Mining Homophilic Groups of Users using Edge Attributed Node Embedding from Enterprise Social Networks

Lipika Dey

Companion Proceedings of the Web Conference 2022

We develop a method to identify groups of similarly behaving users with similar work contexts from their activity on enterprise social media. This would allow organizations to discover redundancies and increase efficiency. To better capture the network structure and communication characteristics, we model user communications with directed attributed edges in a graph. Communication parameters including engagement frequency, emotion words, and post lengths act as edge weights of the multiedge. Upon the resultant adjacency tensor, we develop a node embedding algorithm using higher order singular value tensor decomposition and convolutional autoencoder. We develop a peer group identification algorithm using the cluster labels obtained from the node embedding and show its results on Enron emails and StackExchange Workplace community. We observe that people of the same roles in enterprise social media are clustered together by our method. We provide a comparison with existing node embedding algorithms as a reference indicating that attributed social networks and our formulations are an efficient and scalable way to identify peer groups in an enterprise social network that aids in professional social matching. CCS CONCEPTS • Computing methodologies → Factorization methods; Unsupervised learning; • Information systems → Social networks; • Applied computing → Law, social and behavioral sciences.

downloadDownload free PDF View PDFchevron_right

The ML-model for multi-layer social networks

Matteo Magnani

2011

In this paper we introduce a new model to represent an interconnected network of networks. This model is fundamental to reason about the real organization of on-line social networks, where users belong to and interact on different networks at the same time. In addition we extend traditional SNA measures to deal with this multiplicity of networks and we apply the model to a real dataset extracted from two microblogging sites.

downloadDownload free PDF View PDFchevron_right

Learning Graph Influence from Social Interactions

Augusto Santos

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

In social learning, agents form their opinions or beliefs about certain hypotheses by exchanging local information. This work considers the recent paradigm of weak graphs, where the network is partitioned into sending and receiving components, with the former having the possibility of exerting a domineering effect on the latter. Such graph structures are prevalent over social platforms. We will not be focusing on the direct social learning problem (which examines what agents learn), but rather on the dual or reverse learning problem (which examines how agents learned). Specifically, from observations of the stream of beliefs at certain agents, we would like to examine whether it is possible to learn the strength of the connections (influences) from sending components in the network to these receiving agents.

downloadDownload free PDF View PDFchevron_right

IJSRST1841345 | Graph Theoretic Approach to Social Network Analysis

Krishnendu Dutta

In the last few years, there is a rapid growth of web and various social network sites which have enabled us to easily interconnect people all over the world in a shared platform. A social network is a social structure comprising individuals or organizations which hold dynamic ties between them. Social network can be visualized in terms of connected graph where individuals are represented by vertices or nodes and connections between individuals are represented by link or edges. The tendency of people based on their preferences, choices, likes or dislikes are associated with each other in a shared platform, which forms a virtual cluster or community. In this paper we generate a graph of communication network based on real life data collected from a social network site-Twitter. Several community detection algorithms are in place and our intention is to make a comparative study of these existing algorithms over our graph and detect the communities which cannot be viewed by mere observa...

downloadDownload free PDF View PDFchevron_right

Cited by

Multilabel user classification using the community structure of online networks

Georgios Rizos

PLOS ONE, 2017

We study the problem of semi-supervised, multi-label user classification of networked data in the online social platform setting. We propose a framework that combines unsupervised community extraction and supervised, community-based feature weighting before training a classifier. We introduce Approximate Regularized Commute-Time Embedding (ARCTE), an algorithm that projects the users of a social graph onto a latent space, but instead of packing the global structure into a matrix of predefined rank, as many spectral and neural representation learning methods do, it extracts local communities for all users in the graph in order to learn a sparse embedding. To this end, we employ an improvement of personalized PageRank algorithms for searching locally in each user's graph structure. Then, we perform supervised community feature weighting in order to boost the importance of highly predictive communities. We assess our method performance on the problem of user classification by performing an extensive comparative study among various recent methods based on graph embeddings. The comparison shows that ARCTE significantly outperforms the competition in almost all cases, achieving up to 35% relative improvement compared to the second best competing method in terms of F1-score.

downloadDownload free PDF View PDFchevron_right

Community Detection in Multi-relational Social Networks

Guandong Xu

Lecture Notes in Computer Science, 2013

Multi-relational networks are ubiquitous in many fields such as bibliography, twitter, and healthcare. There have been many studies in the literature targeting at discovering communities from social networks. However, most of them have focused on single-relational networks. A hint of methods detected communities from multi-relational networks by converting them to single-relational networks first. Nevertheless, they commonly assumed different relations were independent from each other, which is obviously unreal to real-life cases. In this paper, we attempt to address this challenge by introducing a novel co-ranking framework, named MutuRank. It makes full use of the mutual influence between relations and actors to transform the multi-relational network to the single-relational network. We then present GMM-NK (Gaussian Mixture Model with Neighbor Knowledge) based on local consistency principle to enhance the performance of spectral clustering process in discovering overlapping communities. Experimental results on both synthetic and real-world data demonstrate the effectiveness of the proposed method.

downloadDownload free PDF View PDFchevron_right

GAT2VEC: Representation Learning for Attributed Graphs

Nasrullah Sheikh

Computing, 2018

Network Representation Learning (NRL) enables the application of machine learning tasks such as classification, prediction and recommendation to networks. Apart from their graph structure, networks are often associated with diverse information in the form of attributes. Most NRL methods have focused just on structural information, and separately apply a traditional representation learning on attributes. When multiple sources of information are available, using a combination of them may be beneficial as they complement each other in generating accurate contexts; moreover, their combined use may be fundamental when the information sources are sparse. The learning methods should thus preserve both the structural and attribute aspects. In this paper, we investigate how attributes can be modeled, and subsequently used along with structural information in learning the representation. We introduce the gat2vec framework that uses structural information to generate structural contexts, attributes to generate attribute contexts, and employs a shallow neural network model to learn a joint representation from them. We evaluate our proposed method against state-of-the-art baselines, using real-world datasets on vertex classification (multi-class and multi-label), link-prediction, and visualization tasks. The experiments show that gat2vec is effective in exploiting multiple sources of information, thus learning accurate representations and outperforming the state-of-the-art in the aforementioned tasks. Finally, we perform query tasks on learned representation and show how the qualitative analysis of results has better performance as well.

downloadDownload free PDF View PDFchevron_right

ComNE: Reinforcing Network Embedding with Community Learning

Ahmed Fathy

Communications in Computer and Information Science, 2019

Learning network embedding for large-scale networks have been attracting increasing attention due to their importance in supporting numerous network analytic and data mining tasks such as node classification, clustering and visualization. In this paper, we present a novel framework for learning large-scale network embedding incorporating network topology and community structural information. Most existing network embedding methods tend to embed network topology and ignore the partially labeled community structure information that exist in realworld networks and thus are unable to efficiently learn and capture the community structure of real-world networks. Unlike existing works, our framework integrates the network topology and community structure into the learning process. We propose a deep autoencoder model to generate low-dimensional feature representations efficiently through learning network reconstruction and community classification tasks. The experimental results on several real-world networks show that our framework outperforms the state-of-the-art methods.

downloadDownload free PDF View PDFchevron_right

Hyperspherical Variational Co-embedding for Attributed Networks

Jinyuan Fang

ACM Transactions on Information Systems, 2022

Network-based information has been widely explored and exploited in the information retrieval literature. Attributed networks, consisting of nodes, edges as well as attributes describing properties of nodes, are a basic type of network-based data, and are especially useful for many applications. Examples include user profiling in social networks and item recommendation in user-item purchase networks. Learning useful and expressive representations of entities in attributed networks can provide more effective building blocks to down-stream network-based tasks such as link prediction and attribute inference. Practically, input features of attributed networks are normalized as unit directional vectors. However, most network embedding techniques ignore the spherical nature of inputs and focus on learning representations in a Gaussian or Euclidean space, which, we hypothesize, might lead to less effective representations. To obtain more effective representations of attributed networks, we...

downloadDownload free PDF View PDFchevron_right

Scalable Robust Graph Embedding with Spark

Hongzhi Yin

Proc. VLDB Endow., 2021

Graph embedding aims at learning a vector-based representation of vertices that incorporates the structure of the graph. This representation then enables inference of graph properties. Existing graph embedding techniques, however, do not scale well to large graphs. While several techniques to scale graph embedding using compute clusters have been proposed, they require continuous communication between the compute nodes and cannot handle node failure. We therefore propose a framework for scalable and robust graph embedding based on the MapReduce model, which can distribute any existing embedding technique. Our method splits a graph into subgraphs to learn their embeddings in isolation and subsequently reconciles the embedding spaces derived for the subgraphs. We realize this idea through a novel distributed graph decomposition algorithm. In addition, we show how to implement our framework in Spark to enable efficient learning of effective embeddings. Experimental results illustrate t...

downloadDownload free PDF View PDFchevron_right

Co-Embedding Attributed Networks

Hongyan Bao

Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

Existing embedding methods for attributed networks aim at learning low-dimensional vector representations for nodes only but not for both nodes and attributes, resulting in the fact that they cannot capture the affinities between nodes and attributes. However, capturing such affinities is of great importance to the success of many real-world attributed network applications, such as attribute inference and user profiling. Accordingly, in this paper, we introduce a Co-embedding model for Attributed Networks (CAN), which learns low-dimensional representations of both attributes and nodes in the same semantic space such that the affinities between them can be effectively captured and measured. To obtain high-quality embeddings, we propose a variational auto-encoder that embeds each node and attribute with means and variances of Gaussian distributions. Experimental results on real-world networks demonstrate that our model yields excellent performance in a number of applications compared with state-of-the-art techniques.

downloadDownload free PDF View PDFchevron_right

Relation Learning on Social Networks with Multi-Modal Graph Edge Variational Autoencoders

Carl Yang

Proceedings of the 13th International Conference on Web Search and Data Mining, 2020

While node semantics have been extensively explored in social networks, little research attention has been paid to profile edge semantics, i.e., social relations. Ideal edge semantics should not only show that two users are connected, but also why they know each other and what they share in common. However, relations in social networks are often hard to profile, due to noisy multi-modal signals and limited user-generated ground-truth labels. In this work, we aim to develop a unified and principled framework that can profile user relations as edge semantics in social networks by integrating multi-modal signals in the presence of noisy and incomplete data. Our framework is also flexible towards limited or missing supervision. Specifically, we assume a latent distribution of multiple relations underlying each user link, and learn them with multi-modal graph edge variational autoencoders. We encode the network data with a graph convolutional network, and decode arbitrary signals with multiple reconstruction networks. Extensive experiments and case studies on two public DBLP author networks and two internal LinkedIn member networks demonstrate the superior effectiveness and efficiency of our proposed model.

downloadDownload free PDF View PDFchevron_right

Heri-Graphs: A Dataset Creation Framework for Multi-Modal Machine Learning on Graphs of Heritage Values and Attributes with Social Media

Pirouz Nourian

ISPRS international journal of geo-information, 2022

This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY

downloadDownload free PDF View PDFchevron_right

Network representation learning: models, methods and applications

Anuraj Mohan

SN Applied Sciences, 2019

With the rise of large-scale social networks, network mining has become an important sub-domain of data mining. Generating an efficient network representation is one important challenge in applying machine learning to network data. Recently, representation learning methods are widely used in various domains to generate low dimensional latent features from complex high dimensional data. A significant amount of research effort is made in the past few years to generate node representations from graph-structured data using representation learning methods. Here, we provide a detailed study of the latest advancements in the field of network representation learning (also called network embedding). We first discuss the basic concepts and models of network embedding. Further, we build a taxonomy of network embedding methods based on the type of networks and review the major research works that come under each category. We then cover the major datasets used in network embedding research and describe the major applications of network embedding with respect to various network mining tasks. Finally, we provide various directions for future work which enhance further research.

downloadDownload free PDF View PDFchevron_right

Mining Social Interaction Data in Virtual Worlds

Gita Sukthankar

Communications in Computer and Information Science, 2015

Virtual worlds and massively multi-player online games are rich sources of information about large-scale teams and groups, offering the tantalizing possibility of harvesting data about group formation, social networks, and network evolution. However these environments lack many of the cues that facilitate natural language processing in other conversational settings and different types of social media. Public chat data often features players who speak simultaneously, use jargon and emoticons, and only erratically adhere to conversational norms. This chapter presents techniques for inferring the existence of social links from unstructured conversational data collected from groups of participants in the Second Life virtual world.

downloadDownload free PDF View PDFchevron_right

Multi-label relational neighbor classification using social context features

Gita Sukthankar

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, 2013

Networked data, extracted from social media, web pages, and bibliographic databases, can contain entities of multiple classes, interconnected through different types of links. In this paper, we focus on the problem of performing multi-label classification on networked data, where the instances in the network can be assigned multiple labels. In contrast to traditional content-only classification methods, relational learning succeeds in improving classification performance by leveraging the correlation of the labels between linked instances. However, instances in a network can be linked for various causal reasons, hence treating all links in a homogeneous way can limit the performance of relational classifiers. In this paper, we propose a multi-label iterative relational neighbor classifier that employs social context features (SCRN). Our classifier incorporates a class propagation probability distribution obtained from instances' social features, which are in turn extracted from the network topology. This class-propagation probability captures the node's intrinsic likelihood of belonging to each class, and serves as a prior weight for each class when aggregating the neighbors' class labels in the collective inference procedure. Experiments on several real-world datasets demonstrate that our proposed classifier boosts classification performance over common benchmarks on networked multi-label data.

downloadDownload free PDF View PDFchevron_right

Generative Adversarial Networks for Spatio-temporal Data: A Survey

Nan Gao

ACM Transactions on Intelligent Systems and Technology, 2022

Generative Adversarial Networks (GANs) have shown remarkable success in producing realistic-looking images in the computer vision area. Recently, GAN-based techniques are shown to be promising for spatio-temporal-based applications such as trajectory prediction, events generation, and time-series data imputation. While several reviews for GANs in computer vision have been presented, no one has considered addressing the practical applications and challenges relevant to spatio-temporal data. In this article, we have conducted a comprehensive review of the recent developments of GANs for spatio-temporal data. We summarise the application of popular GAN architectures for spatio-temporal data and the common practices for evaluating the performance of spatio-temporal applications with GANs. Finally, we point out future research directions to benefit researchers in this area.

downloadDownload free PDF View PDFchevron_right

GRAPE for fast and scalable graph processing and random-walk-based embedding

Giorgio Valentini

Nature Computational Science

Graph representation learning methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are beyond the capabilities of current methods and software implementations. We present GRAPE (Graph Representation Learning, Prediction and Evaluation), a software resource for graph processing and embedding that is able to scale with big graphs by using specialized and smart data structures, algorithms, and a fast parallel implementation of random-walk-based methods. Compared with state-of-the-art software resources, GRAPE shows an improvement of orders of magnitude in empirical space and time complexity, as well as competitive edge- and node-label prediction performance. GRAPE comprises approximately 1.7 million well-documented lines of Python and Rust code and provides 69 node-embedding methods, 25 inference models, a collection of efficient graph-processing...

downloadDownload free PDF View PDFchevron_right

Relational learning via latent social dimensions

Sign up for access to the world's latest research

AbstractAI

Related papers

Related papers

Related topics

Cited by

Abstract
AI