46
Block, Boda, Hollway, Vörös May 2017

Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Block, Boda, Hollway, Vörös May 2017

Page 2: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Multiple methods for the statistical analysis of networks exist. Mainly: The SAOM/Siena model. The ERGM and various extensions (LERGM, TERGM, StERGM, ##ERGM).

Finding your way through this jungle is tricky, because: There are lots of models. Many modelling consequences are not spelled out explicitly. Everybody claims their model is the best. Some even claim (on questionable grounds) that other’s models are bad.

This presentation shall provide some guidance to model choice (and handling reviewer comments).

Page 3: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

General: Actor-oriented vs. tie-oriented modelling (SAOM or ERGM). Longitudinal: Continuous-time vs. discrete-time (SAOM or TERGM). Not today: Continuous-time models (SAOM and LERGM).

Page 4: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Definition of the ERGM:

Page 5: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

(See Block, P., Stadtfeld, C., & Snijders, T. A. (2016). Forms of Dependence Comparing SAOMs and ERGMs From Basic Principles. Sociological Methods & Research.)

Page 6: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

ERGM: A model for the probability of a observed network, depending on network characteristics

(network statistics). Poss. interpretation: the probability of a tie, given the rest of the network. Poss. interpretation: the outcome of a tie-wise updating process in equilibrium.

SAOM: A model for the probability of network evolution, depending on network characteristics

(network statistics) from the perspective of an actor. Dominant interpretation: the choice of an actor to change one of its outgoing ties, rather

than any other possible option.

Page 7: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Compare how the micro-models of network transitions differ and what that means.

Page 8: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Definition of the ERGM on a transition level:

Definition of the SAOM on a transition level:

ERGM vs SAOM Per Block 8

Page 9: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Only feasible for small networks (<= 5 nodes)

For ERGM: Using the basic ERGM formula.

For SAOM: Use transitions between all network states.

Page 10: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

1. How do the transition probabilities between adjacent networks differ for “equivalent” models?

2. How does this relate to different dependence assumptions?3. How does this leads to different meso- and macro-level network

features in their equilibrium state for identical target statistics?4. How do differences in stationary distribution lead to difference in

estimates for “identical” models?

ERGM vs SAOM Per Block 10

Page 11: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The SAOM naturally has a different level of dependence assumptions resulting from its actor-oriented nature.

Consider the following case:

ERGM vs SAOM Per Block 11

Page 12: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

These are all possible transitions from Triad 012.

ERGM vs SAOM Per Block 12

Page 13: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The likelihood to reach any state with 2 ties in the next step in the ERGM with only a density parameter:

ERGM vs SAOM Per Block 13

Page 14: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

These are all possible transitions from Triad 012.

ERGM vs SAOM Per Block 14

Page 15: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The same probability in the SAOM:

but:

ERGM vs SAOM Per Block 15

Page 16: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The existence of a tie influences the probability to form another tie by the same actor. The outgoing ties of each actor are dependent, even in a model that only contains a

density parameter. We can think of this as sender dependence. On the meso-level, this leads to a different outdegree distribution of the

SAOM compared to the ERGM.

ERGM vs SAOM Per Block 16

Page 17: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The outdegree distribution of the stat. dist. of an ERGM and a SAOM with a density of 0.2, 5 nodes and only a density parameter:

ERGM vs SAOM Per Block 17

Page 18: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The baseline models have a different degree of outdegree centralisation. Estimates of effects that model outdegree centralisation can differ by

model due to this difference in dependence.

ERGM vs SAOM Per Block 18

Page 19: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Dependence between ties differs not only for the density effect, but literally all effects.

The multinomial choice function introduces all sorts of dependence assumption in the model.

ERGM vs SAOM Per Block 19

Page 20: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

What dependence assumptions are “better”?

Page 21: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Not only is the level of dependence different, the form dependence can take also differs.

In the SAOM, the case that the tie k → l is dependent on j → h, but not vice versa, is possible.

We call this asymmetric dependence which has serious modelling consequences.

ERGM vs SAOM Per Block 21

Page 22: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Consider the following transitions around 030T:

ERGM vs SAOM Per Block 22

Page 23: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

In the SAOM, the change statistic is counted from the perspective of the focal actor, in the ERGM there is no focal actor.

The tie i → j depends on the tie h → j, but not vice versa. Asymmetric Dependence allows to test specific hypotheses that relate to

actor motivations!

ERGM vs SAOM Per Block 23

Page 24: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

We calculated the stat. dist. of a series of networks with 5 nodes, a density and a transitivity parameter and fixed target statistics.

We compared the stationary distribution w.r.t. the expected triad census.

ERGM vs SAOM Per Block 24

Page 25: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The expected number of Triads 021D and 021U

ERGM vs SAOM Per Block 25

Page 26: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The expected number of Triads 120D and 120U

ERGM vs SAOM Per Block 26

Page 27: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

1. Interpretation of the parameters differs.2. GoF-based model selection can lead to different model specifications. The control effects included can have a strong impact on substantive conclusions drawn. Unbiased model comparison becomes difficult.

3. Indegree and Outdegree centralisation parameter estimates can differ.

ERGM vs SAOM Per Block 27

Page 28: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

All the above was studied in isolation for illustration purposes. Does it matter in empirical analysis or all the outlined differences

marginal in consequences? We tested this by doing a forward model selection on the same dataset to

compare the final models and conclusions we can draw (friendship network, n=129).

ERGM vs SAOM Per Block 28

Page 29: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

ERGM vs SAOM Per Block 29

Page 30: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Do actors weigh ties against one another or is each tie evaluated on its own (→ actor- or tie-oriented)? E.g. costly vs. costless ties.

Are theories explicit about mechanisms of tie dependence (→ asymmetric dependence)? Do we have strong theories about tie formation and presistence?

Rarely: Which model fits better.

ERGM vs SAOM Per Block 30

Page 31: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Partly in response to the work by Cranmer, Leifeld & Desmarais

(Block, P., Koskinen, J., Hollway, J., Steglich, C., Stadtfeld, C. (2017). Change we can believe in. Somewhere, soon.)

Page 32: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Previous part was based on academic interest. To date, ERGMs are dominant in the analysis of cross-sectional data and cannot (without

adjustments) be applied to longitudinal data. SAOMs are used for longitudinal data and uncommon in the analysis of cross-sectional

data. This part is guided by practical considerations for the analysis of

longitudinal data. Especially some people claim that a specific ERGM extension is the way forward in

longitudinal network models (Cranmer, Leifeld & Desmarais).

Page 33: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

(Statistical) models can be compared on the basis of prediction or explanation.

Prediction: Intuitively, a good model should adequately extrapolate to the future. But: for networks

what do we predict (ties or dependence)? Explanation: A model should have realistic and testable assumptions. A model should have a consistent interpretation that allows inference on social

mechanisms. A model should be transparent and understood in its mechanics.

Page 34: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

In its most basic form, a TERGM is an ERGM, in which a previous realisation (time-point) of the network is a predictors (dyadic covariate).

Information of a previous time-point can be included in more elaborate ways as dyadic covariates.

In the extreme this leads to a TERGM as proposed by Hanneke et al., in which only information from the past is included, making it essentially a logistic regression (see Lerner et al.).

Page 35: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

For the SAOM time is explicitly modelled and dependence unfolds over time.

In the TERGM, dependence is modelled between ties in the last observation, but decoupled from time!

Page 36: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The SAOM can be interpreted on the level of the mini-step, showing what new ties are more/less likely to exist in the next step.

TERGM interpretation parallel to the ERGM: Network level: The probability to observe more or less of a specific pattern than expected

“by chance”, conditioning on the past.▪ But the past might contain many of the same dependencies, so any interpretation is somewhat net off these

dependencies in the past

Tie level: No interpretation similar to the ERGM possible, as equilibrium assumption is unreasonable given it includes interpretation on the past.▪ Not being able to interpret on the tie-level is problematic, as this provides the link to social mechanisms.

Not a model for change!

Page 37: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The network level interpretation of the TERGM is net off the dependencies in the past.

If little change between 2 networks has happened, many dependencies of the current network will be explained by the past network.

This means that, even if we have time-homogeneous process parameter size will depend on time in the TERGM (but not the SAOM).

Page 38: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Create a dataset based on the Glasgow data.

10 periods are simulated into the future with realistic parameter values.

One period 1-10 waves apart is analysed, not using information of the waves in between.

Page 39: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

It is possible to test the predictive power of the models with simple out-of-sample tests.

We focus here on tie-level prediction, as impartial comparison of prediction of structures is very difficult, given that different effects model different dependence structures.

First expectation: Both models are bad at predicting. This is because, like statistical models in the social sciences in general, we are not very

good at predicting rare events; creation of ties are rare events, given the number of options.

Page 40: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Modelling dependence structures will equally improve correct predictions, if other ties are predicted correctly, but also predict wrong ties, if the other tie is predicted incorrectly.

See the parallel to Multilevel modelling: E.g. outcome 𝑌𝑌 of individual 𝑖𝑖in group 𝑗𝑗 (𝑌𝑌𝑖𝑖𝑖𝑖) will not be predicted better by inclusion of random group-level intercept, unless we have information about the outcome of other members of group 𝑗𝑗, i.e. we need information about the future to predict the future.

Page 41: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

x(t0)

x(t1)

xa(t2)

Dlogit < DTERGM Dlogit ≈ DTERGM Dlogit > DTERGM

xc(t2)xb(t2)

Page 42: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

Analysing period 1 from 2 datasets (one ASSIST school and the undirected fisheries treaties).

Simulating wave 3 from the obtained parameters 100 times.

Comparing the precision and recall from the simulations (and a logistic regression) to a trivial persistence model.

Page 43: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

How come other articles came to such different conclusions?

Page 44: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

In the article by Leifeld and Cranmer on arxiv, two analysis claim that the TERGM has a better out-of-sample predictive performance. These claims have restricted validity: No comparison to a null-model. The first analysis uses simulated data with extremely unrealistic parameters (e.g. negative

tie stability) to show how model performance differs. The second analysis relies on real data (good), but estimates TERGMs using pseudo-

likelihood estimation, known to result in unreliable parameters. For simulation of out-of-sample parameters, information about the future was taken into account!

Page 45: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The

The TERGM has a murky decoupling of time and dependence, which makes interpretation of parameters difficult on the network level, or impossible on the process-level.

The TERGM is not a model of change, but a model of structure; it does not allow conclusions about mechanisms that drive network evolution.

Both models are really bad at predicting, thus it is not a good criterion for model discrimination.

Page 46: Block, Boda, Hollway, Vörös May 2017 › content › dam › ethz › special-interest › gess... · 2017-06-06 · TERGM interpretation parallel to the ERGM: Network level: The