Scalable Learning of Collective Behavior Based on Sparse Social Dimensions

Lei Tang, Huan LiuCIKM’09

Speaker: Hsin-Lan, WangDate: 2010/02/01

Outline Introduction Collective Behavior Learning Social Dimensions Algorithm

Edge-Centric View K-means Variant

Experiment Setup Experiment Results Conclusions and Future Work

Introduction

Social media facilitate people of all walks of life to connect to each other.

We study how networks in social media can help predict some sorts of human behavior and individual preference.

Introduction

In social media, the connections of the same network are not homogeneous. However, this relation type information is not readily available in reality.

A framework based on social dimensions is proposed to address this heterogeneity.

Introduction In the initial study, modularity

maximization is exploited to extract social dimensions.

With huge number of actors, the dimensions cannot even be held in memory.

In this work, we propose an effective edge-centric approach to extract sparse social dimensions.

Collective Behavior Learning

When people are exposed in a social network environment, their behaviors can be influenced by the behaviors of their friends.

People are more likely to connect to others sharing certain similarity with them.

Collective Behavior Learning K class labels network

V is the vertex set, E is the edge set and are the class labels of a vertex

Given known values of for some subsets of vertices .

How to infer the values of for the remaining vertices

Social Dimensions

Social Dimensions To address the heterogeneity present

ed in connections, we have proposed a framework (SocDim) for collective behavior learning.

Framework SocDim is composed of two steps:1. social dimension extraction2. discriminative learning

Social Dimensions

These social dimensions can be treated as features of actors.

Since network is converted into features, typical classifier such as support vector machine can be employed.

Social Dimensions Concerns about the scalability of SocDim wi

th modularity maximization: The social dimensions extracted according to m

odularity maximization are dense. Requires the computation of the top eigenvecto

rs of a modularity matrix which is of size n*n. The dynamic nature of networks entails efficient

update of the model for collective behavior prediction.

Algorithm -Edge-Centric View

Treat each edge as one instance, and the nodes that define edges as features.

Algorithm -Edge-Centric View

Based on the features of each edge, we can cluster the edges into two sets.

One actor is considered associated with one affiliation as long as any of his connections is assigned to that affiliation.

Algorithm -Edge-Centric View In summary, to extract social

dimensions, we cluster edges rather than nodes in a network into disjoint sets.

Because the affiliations of one actor are no more than the connections he has, the social dimensions based on edge-centric clustering are guaranteed to be sparse.

Algorithm -K-means Variant

Algorithm

Experiment Setup -Social Media Data

Experiment Results -Prediction Performance

Prediction performance on all the studied social media data is around 20-30% for F1 measure. This is partly due to : large number of labels in the data only employ the network information

Experiment Results -Scalability Study

Experiment Results -Sensitivity Study

Conclusions and Future Work To address the scalability issue, we

propose an edge-centric clustering scheme to extract social dimensions and a scalable k-means variant to handle edge clustering.

The model based on the sparse social dimensions shows comparable prediction performance as earlier proposed approaches to extract social dimensions.

Conclusions and Future Work

In reality, each edge can be associated with multiple affiliations while our current model assumes only one dominant affiliation.

The proposed EdgeCluster model is sensitive to the number of social dimensions.

Scalable Learning of Collective Behavior Based on Sparse Social Dimensions

Documents

Scalable Sparse Optimization in Dense Cloud-RANshiyuanming.github.io/papers/Thesis_Yuanming.pdf · Scalable Sparse Optimization in Dense Cloud-RAN by Yuanming SHI This is to certify

HomeRun: Scalable Sparse-Spectrum Reconstruction of ...HomeRun: Scalable Sparse-Spectrum Reconstruction of Aggregated Historical Data Faisal M. Almutairi University of Minnesota Minneapolis,

Scalable Collective Communication and Data Transfer for

Similarity Learning for High Dimensional Sparse Datakuanl/papers/aistats15_hdsl_poster.pdf · Derivation of scalable algorithms for the proposed formulations, with time/memory cost

Scalable GPU graph traversalpingali/CS395T/2013fa/papers/...parallel algorithms, prefix sum, graph traversal, sparse graph 1. Introduction Algorithms for analyzing sparse relationships

Scalable Object Detection by Filter Compression with Regularized Sparse ...yenliang/paper/CVPR15.pdf · Scalable Object Detection by Filter Compression with Regularized Sparse Coding

Strider: Architectures for Scalable Memory Centric ... · Strider: Architectures for Scalable Memory Centric Reduction of Sparse Data Streams Sriseshan Srikanth, Tom Conte, Erik DeBenedictis

Face Image Retrieval of Efficient Sparse Code words and ... · Scalable Face Image Retrieval with Identity-Based Quantization and ... using attribute-enhanced sparse codewords [15]

Hornet: An Efficient Data Structure for Dynamic Sparse ...on-demand.gputechconf.com/gtc/2018/presentation/s8177-hornet... · Hornet •A scalable and dynamic data structure for –Sparse

High Performance and Scalable Communication Libraries for HPC … · 2020. 1. 14. · High-Performance and Scalable Non-Blocking All-to-All with Collective Offload on InfiniBand Clusters:

A High Performance Sparse Cholesky Factorization Algorithm For Scalable Parallel Computers 1

Scalable GPU Graph Traversal - NVIDIA...parallel algorithms, prefix sum, graph traversal, sparse graph 1. Introduction Algorithms for analyzing sparse relationships represented as

Densifying Assumed-sparse TensorsDensifying Assumed-sparse Tensors? Improving Memory E ciency and MPI Collective Performance during Tensor Accumulation for Parallelized Training of

Scalable Sparse Optimization in Dense Cloud-RANfaculty.sist.shanghaitech.edu.cn/.../Yuanming_defense.pdfComputing Issues: Scalable Optimization Two-stage large-scale convex optimization

Weisfeiler and Leman go sparse: Towards scalable …Weisfeiler and Leman go sparse: Towards scalable higher-order graph embeddings Christopher Morris1 Gaurav Rattan2 Petra Mutzel3

Scalable Kernel Correlation Filter with Sparse Feature ...Scalable Kernel Correlation Filter with Sparse Feature Integration Andr es Sol s Montero, Jochen Lang and Robert Lagani ere

A Scalable Algorithm for Sparse Portfolio Selection · 2020-08-03 · A Scalable Algorithm for Sparse Portfolio Selection Dimitris Bertsimas Sloan School of Management, Massachusetts

Scalable Sparse Optimization in Dense Cloud-RANshiyuanming.github.io/slides/Yuanming_defense.pdf · cloud radio access networks,” in Proc. IEEE Int. Conf. Commun. (ICC), Sydney,

Scalable Kernel Correlation Filter with Sparse …Scalable Kernel Correlation Filter with Sparse Feature Integration Andr es Sol s Montero, Jochen Lang and Robert Lagani ere. University

Sparse factorizations: Towards optimal complexity and ......Computing) ! FASTMath Institute (2011-2016, Frameworks, Algorithms, and Scalable Technologies for Mathematics) • Software: