1 Social networks Small world networks. TU/e - 0ZM05/0EM15/0A150 2 Course aim knowledge about concepts in network theory, and being able to apply that

1

Social networks

Small world networks

TU/e - 0ZM05/0EM15/0A150 2

Course aim

knowledge about concepts in network theory, and being able to apply that knowledge

TU/e - 0ZM05/0EM15/0A150 3

The setup in some more detail

Network theory and background

- Introduction: what are they, why important …- Small world networks- Four basic network arguments- Kinds of network data (collection)- Business networks

TU/e - 0ZM05/0EM15/0A150

Two approaches to network theory

Bottom up (let’s try to understand network characteristics and arguments)as in … “Four network arguments” by Matzat (lecture 3)

Top down (let’s have a look at many networks, and try to deduce what is happening from what we see)as in “small world networks” (now)

4

TU/e - 0ZM05/0EM15/0A150

What kind of structures do

networks have, empirically?

Answer: often “small-world”,

and often also scale-free

5

TU/e - 0ZM05/0EM15/0A150

3 important network properties

Average Path Length (APL) (<l>)Shortest path between two nodes i and j of a network, averaged across all (pairs of) nodes

Clustering coefficient (“cliquishness”)Number of closed triplets / Total number of triplets (or: probability that two of my ties are connected)

(Shape of the) degree distributionA distribution is “scale free” when P(k), the proportion of nodes with degree k follows this formula, for some value of gamma:

6

TU/e - 0ZM05/0EM15/0A150 7

Example 1 - Small world networks

NOTE- Edge of network theory- Not fully understood yet …- … but interesting findings

TU/e - 0ZM05/0EM15/0A150 8

Enter: Stanley Milgram (1933-1984)

Remember him?

TU/e - 0ZM05/0EM15/0A150 9

The small world phenomenon –

Milgram´s (1967) original study

Milgram sent packages to several (60? 160?) people in Nebraska and Kansas.

Aim was “get this package to <address of person in Boston>”

Rule: only send this package to someone whom you know on a first name basis. Aim: try to make the chain as short as possible.

Result: average length of a chain is only six “six degrees of separation”

TU/e - 0ZM05/0EM15/0A150 10

Milgram’s original study (2)

An urban myth?

Milgram used only part of the data, actually mainly the ones supporting his claim

Many packages did not end up at the Boston address

Follow up studies typically small scale

TU/e - 0ZM05/0EM15/0A150 11

The small world phenomenon (cont.)

“Small world project” has been testing this assertion (not anymore, see http://smallworld.columbia.edu)

Email to <address>, otherwise same rules. Addresses were American college professor, Indian technology consultant, Estonian archival inspector, …

Conclusion: Low completion rate (384 out of 24,163 = 1.5%) Succesful chains more often through professional ties Succesful chains more often through weak ties (weak ties

mentioned about 10% more often) Chain size 5, 6 or 7.

http://smallworld.columbia.edu/

TU/e - 0ZM05/0EM15/0A150

Some Milgram follow-ups…

12

6.6!

TU/e - 0ZM05/0EM15/0A150 13

TU/e - 0ZM05/0EM15/0A150 14

The Kevin Bacon experiment –

Tjaden (+/- 1996)

Actors = actors

Ties = “has played in a movie with”

TU/e - 0ZM05/0EM15/0A150 15

The Kevin Bacon game

Can be played at:http://oracleofbacon.org

Kevin Bacon number (data might have changed by now)

Jack Nicholson: 1 (A few good men)

Robert de Niro: 1 (Sleepers)

Rutger Hauer (NL): 2 [Nick Stahl]

Famke Janssen (NL): 2 [Nick Stahl]

Bruce Willis: 2 [Patrick Michael Strange]

Kl.M. Brandauer (AU): 2 [Robert Redford]

Arn. Schwarzenegger: 2 [Kevin Pollak]

TU/e - 0ZM05/0EM15/0A150 16

A search for high Kevin Bacon numbers…

32

TU/e - 0ZM05/0EM15/0A150 17

The center of the movie universe (sept 2013)

Nr 370 Nr 136

Nr 39

TU/e - 0ZM05/0EM15/0A150

The best centers… (2013 + 2011)

18

(Kevin Bacon at place 444 in 2011)(Rutger Hauer at place 39, J.Krabbé 935)

TU/e - 0ZM05/0EM15/0A150 19

Where visual analysis fails: IMDB network

TU/e - 0ZM05/0EM15/0A150 20

“Elvis has left the building …”

TU/e - 0ZM05/0EM15/0A150 21

Small world networks

=

short average distance between pairs … … but relatively high “cliquishness”

TU/e - 0ZM05/0EM15/0A150 22

We find small average path lengths in all kinds of places…

Caenorhabditis Elegans959 cellsGenome sequenced 1998Nervous system mapped low average path length

+ cliquishness = small world network

Power grid network of Western States5,000 power plants with high-voltage lines low average path length +

cliquishness = small world network

TU/e - 0ZM05/0EM15/0A150 23

How weird is that?

TU/e - 0ZM05/0EM15/0A150

Could there be a simple explanation?

Consider a random network: each pair of nodes is connected with a given probability p.

This is called an Erdos-Renyi network.

24

NB Erdos was a “Kevin Bacon” long before KevinBacon himself!|

TU/e - 0ZM05/0EM15/0A150

APL is small in random networks

25[Slide copied from Jari_Chennai2010.pdf]

TU/e - 0ZM05/0EM15/0A150 26[Slide copied from Jari_Chennai2010.pdf]

TU/e - 0ZM05/0EM15/0A150

But let’s move on to the second network characteristic …

27

TU/e - 0ZM05/0EM15/0A150 28

TU/e - 0ZM05/0EM15/0A150

This is how small-world networks

are defined:

A short Average Path Length and

A high clustering coefficient

… and a randomly “grown” network does NOT lead to these small-world properties

29

TU/e - 0ZM05/0EM15/0A150

Networks of the Real-world (1) Information networks:

World Wide Web: hyperlinks

Citation networks Blog networks

Social networks: people + interactions

Organizational networks Communication networks Collaboration networks Sexual networks Collaboration networks

Technological networks: Power grid Airline, road, river

networks Telephone networks Internet Autonomous systems

Florence families Karate club network

Collaboration networkFriendship network

Source: Leskovec & Faloutsos

TU/e - 0ZM05/0EM15/0A150

Networks of the Real-world (2)

Biological networks metabolic networks food web neural networks gene regulatory

networks Language networks

Semantic networks Software networks …

Yeast proteininteractions

Semantic network

Language network Software network

Source: Leskovec & Faloutsos

TU/e - 0ZM05/0EM15/0A150 32

Small world networks … so what?

You see it a lot around us: for instance in road maps, food chains, electric power grids, metabolite processing networks, neural networks, telephone call graphs and social influence networks may be useful to study them

They seem to be useful for a lot of things, and there are reasons to believe they might be useful for innovation purposes (and hencewe might want to create them)

TU/e - 0ZM05/0EM15/0A150

Examples of interesting

properties of

small world networks

33

TU/e - 0ZM05/0EM15/0A150

Synchronizing fireflies …

<go to NetLogo>

Synchronization speed depends on small-world properties of the network

Network characteristics important for “integrating local nodes”

34

TU/e - 0ZM05/0EM15/0A150 35

Combining game theory and networks –

Axelrod (1980), Watts & Strogatz (1998?)

1. Consider a given network.

2. All connected actors play the repeated Prisoner’s Dilemma for some rounds

3. After a given number of rounds, the strategies “reproduce” in the sense that the proportion of the more succesful strategies increases in the network, whereas the less succesful strategies decrease or die

4. Repeat 2 and 3 until a stable state is reached.

5. Conclusion: to sustain cooperation, you need a short average distance, and cliquishness (“small worlds”)

TU/e - 0ZM05/0EM15/0A150 36

And another peculiarity ...

Seems to be useful in “decentralized computing” Imagine a ring of 1,000 lightbulbs Each is on or off Each bulb looks at three neighbors left and right... ... and decides somehow whether or not to switch to on or

off.

Question: how can we design a rule so that the network can tackle a given GLOBAL (binary) question, for instance the question whether most of the lightbulbs were initially on or off.

- As yet unsolved. Best rule gives 82 % correct.- But: on small-world networks, a simple majority rule gets 88% correct.

How can local knowledge be used to solve global problems?

TU/e - 0ZM05/0EM15/0A150

If small-world networks are so

interesting and we see them

everywhere, how do they arise?

(potential answer: through random rewiring of a given structure)

37

TU/e - 0ZM05/0EM15/0A150 38

Strogatz and Watts

6 billion nodes on a circle Each connected to nearest 1,000 neighbors Start rewiring links randomly Calculate average path length and clustering as

the network starts to change Network changes from structured to random APL: starts at 3 million, decreases to 4 (!) Clustering: starts at 0.75, decreases to zero

(actually to 1 in 6 million)

Strogatz and Watts asked: what happens along the way with APL and Clustering?

TU/e - 0ZM05/0EM15/0A150 39

Strogatz and Watts (2) “We move in tight circles yet we are all bound together by remarkably short chains” (Strogatz, 2003)

Implications for, for instance, research on the spread of diseases...

The general hint: - If networks start from relatively

structured …- … and tend to progress sort of

randomly …- - … then you might get small

world networks a large part of the time

TU/e - 0ZM05/0EM15/0A150

And now the third characteristic

40

TU/e - 0ZM05/0EM15/0A150

And if we consider all three…

41

TU/e - 0ZM05/0EM15/0A150 42

… then we find this:

Wang & Chen (2003) Complex networks: Small-world, Scale-free and beyond

TU/e - 0ZM05/0EM15/0A150 43

Same thing … we see “scale-freeness” all over

TU/e - 0ZM05/0EM15/0A150

… and it can’t be based on an ER-network

44

TU/e - 0ZM05/0EM15/0A150 45

The scale-free

TU/e - 0ZM05/0EM15/0A150 46

Scale-free networks are:

Robust to random problems/mistakes ... ... but vulnerable to selectively targeted attacks

TU/e - 0ZM05/0EM15/0A150 47

Another BIG question:How do scale free networks arise?

Potential answer: Perhaps through “preferential attachment”

< show NetLogo simulation here>

(Another) critique to this approach: it ignores ties created by those in the network

TU/e - 0ZM05/0EM15/0A150

Some related issues

48

TU/e - 0ZM05/0EM15/0A150 49

“The tipping point” (Watts*)

Consider a network in which each node determines whether or not to adopt, based on what his direct connections do.

Nodes have different thresholds to adopt(randomly distributed)

Question: when do you get cascades of adoption?

Answer: two phase transitions or tipping points: in sparse networks no cascades, as networks get

more dense you get cascades suddenly as networks get more heterogenous, a sudden

jump in the likelihood of cascades as networks get even more heterogenous, the

likelihood of cascades decreases* Watts, D.J. (2002) A simple model of global cascades on random networks. Proceedings of the National Academy of Sciences USA 99, 5766-5771

TU/e - 0ZM05/0EM15/0A150 50

“Find the influentials”(or not?)

TU/e - 0ZM05/0EM15/0A150 51

Malcolm Gladwell

(journalist/writer: wrote “Blink” and “The tipping point”

Duncan Watts

(scientist, Yahoo,Microsoft Research)

http://www.youtube.com/watch?v=AtnR5H6AVVU

http://www.fastcompany.com/641124/tipping-point-toast

http://www.fastcompany.com/641124

http://www.fastcompany.com/641124

TU/e - 0ZM05/0EM15/0A150 52

"All they'll ever say," Watts insists, is that a) there are people who are more influential than others, and b) they are disproportionately important in getting a trend going.

That may be oversimplifying it a bit, but last year, Watts decided to put the whole idea to the test by building another Sims-like computer simulation. He programmed a group of 10,000 people, all governed by a few simple interpersonal rules. Each was able to communicate with anyone nearby. With every contact, each had a small probability of "infecting" another. And each person also paid attention to what was happening around him: If lots of other people were adopting a trend, he would be more likely to join, and vice versa. The "people" in the virtual society had varying amounts of sociability--some were more connected than others. Watts designated the top 10% most-connected as Influentials; they could affect four times as many people as the average Joe. In essence, it was a virtual society run--in a very crude fashion--according to the rules laid out by thinkers like Gladwell and Keller.

Watts set the test in motion by randomly picking one person as a trendsetter, then sat back to see if the trend would spread. He did so thousands of times in a row.

The results were deeply counterintuitive. The experiment did produce several hundred societywide infections. But in the large majority of cases, the cascade began with an average Joe (although in cases where an Influential touched off the trend, it spread much further). To stack the deck in favor of Influentials, Watts changed the simulation, making them 10 times more connected. Now they could infect 40 times more people than the average citizen (and again, when they kicked off a cascade, it was substantially larger). But the rank-and-file citizen was still far more likely to start a contagion.

Why didn't the Influentials wield more power? With 40 times the reach of a normal person, why couldn't they kick-start a trend every time? Watts believes this is because a trend's success depends not on the person who starts it, but on how susceptible the society is overall to the trend--not how persuasive the early adopter is, but whether everyone else is easily persuaded. And in fact, when Watts tweaked his model to increase everyone's odds of being infected, the number of trends skyrocketed.

"If society is ready to embrace a trend, almost anyone can start one--and if it isn't, then almost no one can," Watts concludes. To succeed with a new product, it's less a matter of finding the perfect hipster to infect and more a matter of gauging the public's mood. Sure, there'll always be a first mover in a trend. But since she generally stumbles into that role by chance, she is, in Watts's terminology, an "accidental Influential."

TU/e - 0ZM05/0EM15/0A150 54

The bigger picture:

Understanding macro patternsfrom micro behavior

TU/e - 0ZM05/0EM15/0A150

The general approach … understand

how STRUCTURE can arise from

underlying MICRO-DYNAMICS

Scientists are trying to connect the structural properties …

Scale-free, small-world, locally clustered, bow-tie, hubs and authorities, communities, bipartite cores, network motifs, highly optimized tolerance, …

… to processes(Erdos-Renyi) Random graphs, Exponential

random graphs, Small-world model, Preferential attachment, Edge copying model, Community guided attachment, Forest fire models, Kronecker graphs, …

TU/e - 0ZM05/0EM15/0A150 56

RECAP• Six degrees of separation: average path length is often small• Many real-world networks have properties that are similar: small-world and scale-free.• We do not really understand yet how these properties emerge.• We saw two clues: the Watts-Strogatz model for small-worlds and the preferential attachment model for scale-freeness.• Small-world and scale-free networks have some nice properties (which might explain why they exist)• Considerable controversy over what these kinds of results imply, for instance for marketing purposes• Hot scientific topic: connecting micro-behavior to macro-properties

TU/e - 0ZM05/0EM15/0A150 57

To Do:

Read and comprehend the papers on small world networks, scale-free networks (see website, there is extra material too).

Think about implications and applications of these results

Documents

1 Social networks Small world networks. TU/e - 0ZM05/0EM15/0A150 2 Course aim knowledge about concepts in network theory, and being able to apply that