38
Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

Embed Size (px)

Citation preview

Page 1: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

Lecture 6 - Models of Complex Networks II

Dr. Anthony BonatoRyerson University

AM8002Fall 2014

Page 2: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

Key properties of complex networks

1. Large scale.

2. Evolving over time.

3. Power law degree distributions.

4. Small world properties.

• Other properties are also important: densification power law, shrinking distances,…

2

Page 3: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

3

Geometry of the web?

• idea: web pages exist in a topic-space

– a page is more likely to link to pages close to it in topic-space

Page 4: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

4

Random geometric graphs• nodes are randomly

placed in space

• nodes are joined if their distance is less than a threshold value d; nodes each have a region of influence which is a ball of radius d

(Penrose, 03)

Page 5: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

5

Simulation with 5000 nodes

Page 6: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

6

Geometric Preferential Attachment (GPA) model(Flaxman, Frieze, Vera, 04/07)

• nodes chosen on-line u.a.r. from sphere with surface area 1

• each node has a region of influence with constant radius

• new nodes have m out-neighbours, chosena) by preferential attachment; andb) only in the region of influence

• a.a.s. model generates power law,

low diameter graphs with small separators/sparse cuts

Page 7: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

7

Spatially Preferred Attachment graphs

• regions of influence shrink over time (motivation: topic space growing with time), and are functions of in-degree

• non-constant out-degree

Page 8: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

8

Spatially Preferred Attachment (SPA) model

(Aiello,Bonato,Cooper,Janssen,Prałat, 08)• parameter: p a real number in (0,1]• nodes on a 3-dimensional sphere with surface area 1• at time 0, add a single node chosen u.a.r.• at time t, each node v has a region of influence Bv

with radius

• at time t+1, node z is chosen u.a.r. on sphere• if z is in Bv, then add vz independently with probability

p

t

vtG

1deg

Page 9: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

9

Simulation: p=1, t=5,000

Page 10: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

10

• as nodes are born, they are more likely to enter some Bv with larger radius (degree)

• over time, a power law degree distribution results

Page 11: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

11

power law exponent 1+1/p

Theorem 6.1 (ACBJP, 08)Define

Then a.a.s. for t ≤ n and i ≤ if,

Page 12: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

12

Rough sketch of proof• derive an asymptotic expression for E(Ni,t)

1-t0,t0,1t0, N1)|N(Nt

pGt E

1-ti,1-t1,-iti,1ti, N)1(

N)|N(Nt

ip

t

piGt

E :0i

Page 13: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

13

• solve the recurrence asymptotically:

)(N ti,E

Page 14: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

14

• prove that Ni,t is concentrated on E(Ni,t) via martingales

• standard approach is to use c-Lipshitz condition: change in Ni,t is bounded above by constant c

• c-Lipschitz property may fail: new nodes may appear in an unbounded number of overlapping regions of influence

• prove this happens with exponentially small probabilities using the differential equation method

Page 15: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

15

Models of OSNs

• few models for on-line social networks

• goal: find a model which simulates many of the observed properties of OSNs,– densification and shrinking distance– must evolve in a natural way…

Page 16: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

16

Geometry of OSNs?

• OSNs live in social space: proximity of nodes depends on common attributes (such as geography, gender, age, etc.)

• IDEA: embed OSN in 2-, 3-

or higher dimensional space

Page 17: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

17

Dimension of an OSN

• dimension of OSN: minimum number of attributes needed to classify nodes

• like game of “20 Questions”: each question narrows range of possibilities

• what is a credible mathematical formula for the dimension of an OSN?

Page 18: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

18

Geometric model for OSNs

• we consider a geometric model of OSNs, where– nodes are in m-

dimensional Euclidean space

– threshold value variable: a function of ranking of nodes

Page 19: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

19

Geometric Protean (GEO-P) Model(Bonato, Janssen, Prałat, 10)

• parameters: α, β in (0,1), α+β < 1; positive integer m• nodes live in m-dimensional hypercube• each node is ranked 1,2, …, n by some function r

– 1 is best, n is worst – we use random initial ranking

• at each time-step, one new node v is born, one randomly node chosen dies (and ranking is updated)

• each existing node u has a region of influence with volume

• add edge uv if v is in the region of influence of u

nr

Page 20: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

20

Notes on GEO-P model

• models uses both geometry and ranking• number of nodes is static: fixed at n

– order of OSNs at most number of people (roughly…)

• top ranked nodes have larger regions of influence

Page 21: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

21

Simulation with 5000 nodes

Page 22: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

22

Simulation with 5000 nodes

random geometric

GEO-P

Page 23: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

23

Properties of the GEO-P model

Theorem 6.2 (Bonato, Janssen, Prałat, 2010)

A.a.s. the GEO-P model generates graphs with the following properties:– power law degree distribution with exponent

b = 1+1/α– average degree d = (1+o(1))n(1-α-β)/21-α

• densification– diameter D = O(nβ/(1-α)m log2α/(1-α)m n)

• small world: constant order if m = Clog n.

Page 24: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

24

Density

• average number of edges added at each time-step

• parameter β controls density• if β < 1 – α, then density grows with n (as

in real OSNs)

n

i

nni1

1

1

1

Page 25: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

25

Dimension of OSNs

• given the order of the network n, power law exponent b, average degree d, and diameter D, we can calculate m

• gives formula for dimension of OSN:

Dd

n

mb

b

log2

log2

1

Page 26: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

26

Uncovering the hidden reality• reverse engineering approach

– given network data (n, b, d, D), dimension of an OSN gives smallest number of attributes needed to identify users

• that is, given the graph structure, we can (theoretically) recover the social space

Page 27: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

27

6 Dimensions of Separation

OSN Dimension

YouTube 6Twitter 4Flickr 4

Cyworld 7

Page 28: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

Discussion

1. Speculate as to what the feature space would be for protein interaction networks.

2. Verify that for fixed constants α, β in (0,1):

28

n

i

nni1

1

1

1

Page 29: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

29

Transitivity

Page 30: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

30

Iterated Local Transitivity (ILT) model(Bonato, Hadi, Horn, Prałat, Wang, 08)

• key paradigm is transitivity: friends of friends are more likely friends

• nodes often only have local influence

• evolves over time, but retains memory of initial graph

Page 31: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

31

ILT model

• start with a graph of order n• to form the graph Gt+1 for each node x from

time t, add a node x’, the clone of x, so that xx’ is an edge, and x’ is joined to each node joined to x

• order of Gt is n2t

Page 32: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

32

G0 = C4

Page 33: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

Degrees

Lemma 6.3

In the ILT model, let degt(z) be the degree of z at time t. If x is in V(Gt), then we have the following:

a) degt +1(x) = 2degt (x)+1.

b) degt +1(x’) = degt(x) +1.

33

Page 34: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

34

Properties of ILT model

• average degree increasing to with time

• average distance bounded by constant and converging, and in many cases decreasing with time; diameter does not change

• clustering coefficient higher than in a random generated graph with same average degree

• bad expansion: small gaps between 1st and 2nd eigenvalues in adjacency and normalized Laplacian matrices of Gt

Page 35: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

35

Densification

• nt = order of Gt, et = size of Gt

Lemma 6.4: For t > 0,

nt = 2tn0, et = 3t(e0+n0) - nt.

→ densification power law:et ≈ nt

a, where a = log(3)/log(2).

Page 36: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

36

Average distance

Theorem 6.5: If t > 0, then

• average distance bounded by a constant, and converges; for many initial graphs (large cycles) it decreases

• diameter does not change from time 0

Page 37: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

37

Clustering Coefficient

Theorem 6.6: If t > 0, then

c(Gt) = ntlog(7/8)+o(1).

• higher clustering than in a random graph G(nt,p) with same order and average degree as Gt, which satisfies

c(G(nt,p)) = ntlog(3/4)+o(1)

Page 38: Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

38

…Degree distribution–generate power

law graphs from ILT?• ILT model

gives a binomial-type distribution