1
SimRank Overview Fast Incremental SimRank on Link-Evolving Graphs Weiren Yu 1,2 , Xuemin Lin 1 , Wenjie Zhang 1 1 School of Computer Science & Engineering, University of New South Wales, Sydney, Australia 2 Department of Computing, Imperial College London, UK SimRank An appealing link-based similarity measure (KDD ’02) Basic philosophy Two vertices are similar if they are referenced by similar vertices. Two Forms Original form (KDD ’02) Matrix form (EDBT ’10) damping factor in-neighbor set of node b similarity btw. nodes a and b Existing Work Batch Computations All Pairs s(*,*) Single Pair s(a,b) Single Source s(*,q) Similarity Join s(x,y) for all x in A, and y in B. Incremental Paradigms: link-evolving: Li et. al. [EDBT 2010] needs O(r 4 n 2 ) time for approximation. node-evolving: He et al. [KDD 2010] --- GPU based Finding w Characterizing ∆S via a rank-one Sylvester Pruning “unaffected areas” of ∆S Experimental Evaluations Time & Space Efficiency Effectiveness of Pruning Intermediate Memory Exactness Motivations Li et al. [EDBT 2010] using SVD for incremental SimRank is approximate. When ∆G is small, the “affected areas” of ∆S are also small. Problem (INCREMENTAL SIMRANK COMPUTATION) Given: G, S, ∆G, and C. Compute: ∆S to S. Time complexity: O(Kn 2 ) Step 1. Find u,v s.t. Step 2. Find w s.t. Step 3. Compute ∆S as No mat-mat multiplications Can we further improve it? = Theorem There exists with s.t. is a rank-one Sylvester Equation w.r.t. M. = = As M merely tallies these paths, node-pairs without having such paths can be pruned. Three types of paths captured by M P1: P2: P3: Iteratively Pruning: Let Then Complexity: O(K(nd+|AFF|)) with .

Fast Incremental SimRank on Link-Evolving Graphswyu1/ppt/oral/icde14p.pdf · SimRank Overview Fast Incremental SimRank on Link-Evolving Graphs Weiren Yu 1,2, Xuemin Lin 1, Wenjie

  • Upload
    others

  • View
    7

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Fast Incremental SimRank on Link-Evolving Graphswyu1/ppt/oral/icde14p.pdf · SimRank Overview Fast Incremental SimRank on Link-Evolving Graphs Weiren Yu 1,2, Xuemin Lin 1, Wenjie

SimRank Overview

Fast Incremental SimRank on

Link-Evolving Graphs

Weiren Yu 1,2, Xuemin Lin 1, Wenjie Zhang 1

1 School of Computer Science & Engineering, University of New South Wales, Sydney, Australia

2 Department of Computing, Imperial College London, UK

• SimRank

• An appealing link-based similarity measure (KDD ’02)

• Basic philosophy

Two vertices are similar if they are referenced by similar vertices.

• Two Forms • Original form (KDD ’02)

• Matrix form (EDBT ’10)

damping factor

in-neighbor set of node b

similarity btw. nodes a and b

Existing Work

• Batch Computations

• All Pairs s(*,*)

• Single Pair s(a,b)

• Single Source s(*,q)

• Similarity Join s(x,y) for all x in A, and y in B.

• Incremental Paradigms:

• link-evolving:

Li et. al. [EDBT 2010] needs O(r4n2) time for approximation.

• node-evolving:

He et al. [KDD 2010] --- GPU based

Finding w

Characterizing ∆S via a rank-one Sylvester

Pruning “unaffected areas” of ∆S

Experimental Evaluations

Tim

e &

Sp

ace

Eff

icie

ncy

Effectiveness of Pruning Intermediate Memory Exactness

Motivations

• Li et al. [EDBT 2010] using SVD for incremental SimRank is approximate.

• When ∆G is small, the “affected areas” of ∆S are also small.

Problem (INCREMENTAL SIMRANK COMPUTATION) Given: G, S, ∆G, and C. Compute: ∆S to S.

• Time complexity: O(Kn2)

Step 1. Find u,v s.t.

Step 2. Find w s.t.

Step 3. Compute ∆S as

No mat-mat multiplications

Can we further improve it?

=

Theorem There exists with

s.t.

is a rank-one Sylvester Equation w.r.t. M.

= =

• As M merely tallies these paths, node-pairs without having such paths can be pruned.

• Three types of paths captured by M

• P1:

• P2:

• P3:

• Iteratively Pruning:

Let

Then

• Complexity: O(K(nd+|AFF|)) with

… …

… … .