47
Stefan Schmid @ T-Labs, 2011 Social Networks GIAN Course on Distributed Network Algorithms

GIAN Course on Distributed Network Algorithms Social …

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Stefan Schmid @ T-Labs, 2011

Social Networks

GIAN Course on Distributed Network Algorithms

Stefan Schmid @ T-Labs, 2011

Social Networks

GIAN Course on Distributed Network Algorithms

A short overview of whatis in the lecture notes: wecan chat live after lunch!

Stanley Milgram‘s Experiment

3

The Experiment:

- Chose random people in US Midwest

- Tell them to send a letter to random person in Boston

- Rule: forward letter only to someone known on first name basis

Stanley Milgram‘s Experiment

4

The Experiment:

- Chose random people in US Midwest

- Tell them to send a letter to random person in Boston

- Rule: forward letter only to someone known on first name basis

The Outcome:

Stanley Milgram‘s Experiment

5

The Experiment:

- Chose random people in US Midwest

- Tell them to send a letter to random person in Boston

- Rule: forward letter only to someone known on first name basis

The Outcome:

- Many letters lost

Stanley Milgram‘s Experiment

6

The Experiment:

- Chose random people in US Midwest

- Tell them to send a letter to random person in Boston

- Rule: forward letter only to someone known on first name basis

The Outcome:

- Many letters lost

- But the ones that arrived:

average path length 5.5

Stanley Milgram‘s Experiment

7

The Experiment:

- Chose random people in US Midwest

- Tell them to send a letter to random person in Boston

- Rule: forward letter only to someone known on first name basis

The Outcome:

- Many letters lost

- But the ones that arrived:

average path length 5.5

A small world!

Stanley Milgram‘s Experiment

8

The Experiment:

- Chose random people in US Midwest

- Tell them to send a letter to random person in Boston

- Rule: forward letter only to someone known on first name basis

The Outcome:

- Many letters lost

- But the ones that arrived:

average path length 5.5

A small world!

The experiment was critized a lot, but also inspired manyresearchers: how toexplain this effect?

Researchers designedpowerlaw graph models, Watts-Strogatz model, etc.

Watts-Strogatz Model

9

Social Network =

Graph with large clustering

coefficient

+

random links

Fundamental Concepts and Properties

Cluster CoefficientLikelihood that two friends of a node are also friends,

summed up over all nodes.

?

10

Fundamental Concepts and Properties

Cluster CoefficientLikelihood that two friends of a node are also friends,

summed up over all nodes.

?

11

A good metric?

Cluster Coefficient

12

Clustering Coefficient?

Cluster Coefficient

Coefficient: 013

Clustering Coefficient?

My friends arenever friends.

Cluster Coefficient

Coefficient: 3/7

14

Clustering Coefficient?

Cluster Coefficient

Coefficient: 3/7

15

Clustering Coefficient?

out of binom(8,2)=28

pairs of neighbors, 12 are neighbors

Coefficient: 3/7 16

Coefficient: 0

Are these graphs

really so different?

Is the CC a goodmetric?

vs

Jon Kleinberg: Navigable Social Networks

17

It‘s about routing!

To understand Milgram‘s

experiment, we do not only need to

understand the network topology,

but also the routing: How did the

persons find these routes?

The network topology must also be navigable!

Jon Kleinberg: Navigable Social Networks

18

It‘s about routing!

To understand Milgram‘s

experiment, we do not only need to

understand the network topology,

but also the routing: How did the

persons find these routes?

The network topology must also be navigable!

Routing was likely greedy: userscould not be sure whether thenext hop is really closer to thedestination (in terms of numberof hops).

Kleinberg: Routing on Augmented Grid

19

Square grid (small range

contacts) augmented

with long-range random

links chosen according to

a probability distribution.

Node u points to

node v with

probability

where d() is the distance

in the grid and is a

parameter.

Kleinberg: Routing on Augmented Grid

20

Square grid (small range

contacts) augmented

with long-range random

links chosen according to

a probability distribution.

Node u points to

node v with

probability

where d() is the distance

in the grid and is a

parameter.

If α=0: uniform at random. The larger

α the shorter the long-range links.

For α≤2: the diameter is polylogarithmic.

Kleinberg Routing

Greedy Routingwhile (not at destination):

go to neighbor which is closest to destination

(considering grid distance only)

21

Kleinberg proposed the following greedy routing algorithm:

Kleinberg Routing

Greedy Routingwhile (not at destination):

go to neighbor which is closest to destination

(considering grid distance only)

22

Kleinberg proposed the following greedy routing algorithm:

Note: in O(n) we will definitelyarrive at the destination: along grid!But Milgram promises more…

Kleinberg Routing

Greedy Routingwhile (not at destination):

go to neighbor which is closest to destination

(considering grid distance only)

23

Kleinberg proposed the following greedy routing algorithm:

Note: in O(n) we will definitelyarrive at the destination: along grid!But Milgram promises more…

In the lecture notes: proof thatrouting terminates in O(log2 n) time.

Propagation Studies

24

Where to place an ice-cream shop in a graph so I reach most customers?

Propagation Studies

25

Where to place an ice-cream shop in a graph so I reach most customers?

Placing the shop in the center: first mover advantage!

Propagation Studies

26

Where to place an ice-cream shop in a graph so I reach most customers?

Placing the shop in the center: first mover advantage!

Always first mover advantage?

27

Always first mover advantage?

28

1

Always first mover advantage?

29

12

If second player places here: he gets the lion’s share.

5:6

Always first mover advantage?

Where second?

30

1

Always first mover advantage?

31

12

5:6

Stefan Schmid @ T-Labs, 2011

Community

Detection

Social Network Analysis

33

Social network of a karate club: Dispute between instructor and president

led the club to split into two rival clubs. Guess where!

Social Network Analysis

Social network of a karate club: Dispute between instructor and president

led the club to split into two rival clubs. Guess where!

It’s about community detection!

34

Node Centrality

35

(Vertex) CentralityHow important is the node? Many definitions exist:

degree/betweenness/closeness/... centrality.

36

A: Degree Centrality

Simply the degree of a node. (E.g., Karate Club president)

B: Closeness Centrality

Defined as the inverse of the sum of the distances to all other nodes.

C: Betweenness Centrality

Fraction of shortest paths that include v.

D: Eigenvector Centrality

E: Katz Centrality and Page Rank

F: Alpha Centrality

blue = min, red = max

Betweenness Centrality

37

How to compute?

1. All-Pairs-Shortest-PathsE.g., Floyd-Warshall

2. Better algos exist

red = min, blue = max

How to compute community

from centralities?

1. Idea: use edge-centrality

2. Remove edges of high centrality

(likely to connect different communities?)

Betweenness Centrality

38

Intuition: two communities?

1

2 3

7 8

6

4 5

10

9 11

13

12 14

Betweenness Centrality

39

1

2 3

7 8

6

4 5

10

9 11

13

12 14

Intuition: two communities?

Betweenness Centrality

40

Betweenness?

1

2 3

7 8

6

4 5

10

9 11

13

12 14

?

Betweenness Centrality

41

Betweenness?

1

2 3

7 8

6

4 5

10

9 11

13

12 14

7*7=49

Betweenness Centrality

42

Betweenness?

1

2 3

7 8

6

4 5

10

9 11

13

12 14

4933

33

33

33

1

1

1

1

12

12

12

12

1212

12

12

The Girvan-Newman Method

43

Successively Delete Edges of High Betweenness!

1

2 3

7 8

6

4 5

10

9 11

13

12 14

4933

33

33

33

1

1

1

1

12

12

12

12

1212

12

12

Successively Delete Edges of High Betweenness!

The Girvan-Newman Method

44

1

2 3

7 8

6

4 5

10

9 11

13

12 14

4933

33

33

33

1

1

1

1

12

12

12

12

1212

12

12

Successively Delete Edges of High Betweenness!

The Girvan-Newman Method

45

1

2 3

7 8

6

4 5

10

9 11

13

12 14

24

24

24

24

1

1

1

1

10

10

10

10

1010

10

10

Successively Delete Edges of High Betweenness!

The Girvan-Newman Method

46

1

2 3

7 8

6

4 5

10

9 11

13

12 14

24

24

24

24

1

1

1

1

10

10

10

10

1010

10

10

End of lecture

47