179
Effectiveness Ubiquity Way of life Fitting and understanding multilevel (hierarchical) models Andrew Gelman Department of Statistics and Department of Political Science Columbia University 8 December 2004 Andrew Gelman Fitting and understanding multilevel models

Fitting and understanding Multilevel Models-Andrew Gelman

Embed Size (px)

DESCRIPTION

Fitting and understanding Multilevel Models-Andrew Gelman

Citation preview

Page 1: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Fitting and understanding multilevel (hierarchical)models

Andrew GelmanDepartment of Statistics and Department of Political Science

Columbia University

8 December 2004

Andrew Gelman Fitting and understanding multilevel models

Page 2: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 3: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 4: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 5: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 6: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 7: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 8: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 9: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Making more use of existing information

I The problem: not enough data to estimate effects withconfidence

I The solution: make your studies broader and deeperI Broader: extend to other countries, other years, other

outcomes, . . .I Deeper: inferences for individual states, demographic

subgroups, components of outcomes, . . .

I The solution: multilevel modelingI Regression with coefficients grouped into batchesI No such thing as “too many predictors”

Andrew Gelman Fitting and understanding multilevel models

Page 10: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Fitting and understanding multilevel models

I The effectiveness of multilevel models

I Multilevel models in unexpected places

I Multilevel models as a way of lifeI collaborators:

I Iain Pardoe, Dept of Decision Sciences, University of OregonI David Park, Dept of Political Science, Washington UniversityI Joe Bafumi, Dept of Political Science, Columbia UniversityI Boris Shor, Dept of Political Science, Columbia UniversityI Noah Kaplan, Dept of Political Science, University of HoustonI Shouhao Zhao, Dept of Statistics, Columbia UniversityI Zaiying Huang, Circulation, New York Times

Andrew Gelman Fitting and understanding multilevel models

Page 11: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Fitting and understanding multilevel models

I The effectiveness of multilevel models

I Multilevel models in unexpected places

I Multilevel models as a way of lifeI collaborators:

I Iain Pardoe, Dept of Decision Sciences, University of OregonI David Park, Dept of Political Science, Washington UniversityI Joe Bafumi, Dept of Political Science, Columbia UniversityI Boris Shor, Dept of Political Science, Columbia UniversityI Noah Kaplan, Dept of Political Science, University of HoustonI Shouhao Zhao, Dept of Statistics, Columbia UniversityI Zaiying Huang, Circulation, New York Times

Andrew Gelman Fitting and understanding multilevel models

Page 12: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Fitting and understanding multilevel models

I The effectiveness of multilevel models

I Multilevel models in unexpected places

I Multilevel models as a way of lifeI collaborators:

I Iain Pardoe, Dept of Decision Sciences, University of OregonI David Park, Dept of Political Science, Washington UniversityI Joe Bafumi, Dept of Political Science, Columbia UniversityI Boris Shor, Dept of Political Science, Columbia UniversityI Noah Kaplan, Dept of Political Science, University of HoustonI Shouhao Zhao, Dept of Statistics, Columbia UniversityI Zaiying Huang, Circulation, New York Times

Andrew Gelman Fitting and understanding multilevel models

Page 13: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Fitting and understanding multilevel models

I The effectiveness of multilevel models

I Multilevel models in unexpected places

I Multilevel models as a way of lifeI collaborators:

I Iain Pardoe, Dept of Decision Sciences, University of OregonI David Park, Dept of Political Science, Washington UniversityI Joe Bafumi, Dept of Political Science, Columbia UniversityI Boris Shor, Dept of Political Science, Columbia UniversityI Noah Kaplan, Dept of Political Science, University of HoustonI Shouhao Zhao, Dept of Statistics, Columbia UniversityI Zaiying Huang, Circulation, New York Times

Andrew Gelman Fitting and understanding multilevel models

Page 14: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Fitting and understanding multilevel models

I The effectiveness of multilevel models

I Multilevel models in unexpected places

I Multilevel models as a way of lifeI collaborators:

I Iain Pardoe, Dept of Decision Sciences, University of OregonI David Park, Dept of Political Science, Washington UniversityI Joe Bafumi, Dept of Political Science, Columbia UniversityI Boris Shor, Dept of Political Science, Columbia UniversityI Noah Kaplan, Dept of Political Science, University of HoustonI Shouhao Zhao, Dept of Statistics, Columbia UniversityI Zaiying Huang, Circulation, New York Times

Andrew Gelman Fitting and understanding multilevel models

Page 15: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 16: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 17: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 18: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 19: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 20: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 21: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 22: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 23: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Outline of talk

I The effectiveness of multilevel modelsI State-level opinions from national polls

(crossed multilevel modeling and poststratification)

I Multilevel models in unexpected placesI Estimating incumbency advantage and its variationI Before-after studies

I Multilevel models as a way of lifeI Building and fitting modelsI Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 24: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

National opinion trends

1940 1950 1960 1970 1980 1990 2000

5060

7080

Year

Per

cent

age

supp

ort f

or th

e de

ath

pena

lty

Andrew Gelman Fitting and understanding multilevel models

Page 25: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

State-level opinion trends

I Goal: estimating time series within each state

I One poll at a time: small-area estimation

I It works! Validated for pre-election polls

I Combining surveys: model for parallel time series

I Multilevel modeling + poststratification

I Poststratification cells: sex × ethnicity × age × education ×state

Andrew Gelman Fitting and understanding multilevel models

Page 26: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

State-level opinion trends

I Goal: estimating time series within each state

I One poll at a time: small-area estimation

I It works! Validated for pre-election polls

I Combining surveys: model for parallel time series

I Multilevel modeling + poststratification

I Poststratification cells: sex × ethnicity × age × education ×state

Andrew Gelman Fitting and understanding multilevel models

Page 27: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

State-level opinion trends

I Goal: estimating time series within each state

I One poll at a time: small-area estimation

I It works! Validated for pre-election polls

I Combining surveys: model for parallel time series

I Multilevel modeling + poststratification

I Poststratification cells: sex × ethnicity × age × education ×state

Andrew Gelman Fitting and understanding multilevel models

Page 28: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

State-level opinion trends

I Goal: estimating time series within each state

I One poll at a time: small-area estimation

I It works! Validated for pre-election polls

I Combining surveys: model for parallel time series

I Multilevel modeling + poststratification

I Poststratification cells: sex × ethnicity × age × education ×state

Andrew Gelman Fitting and understanding multilevel models

Page 29: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

State-level opinion trends

I Goal: estimating time series within each state

I One poll at a time: small-area estimation

I It works! Validated for pre-election polls

I Combining surveys: model for parallel time series

I Multilevel modeling + poststratification

I Poststratification cells: sex × ethnicity × age × education ×state

Andrew Gelman Fitting and understanding multilevel models

Page 30: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

State-level opinion trends

I Goal: estimating time series within each state

I One poll at a time: small-area estimation

I It works! Validated for pre-election polls

I Combining surveys: model for parallel time series

I Multilevel modeling + poststratification

I Poststratification cells: sex × ethnicity × age × education ×state

Andrew Gelman Fitting and understanding multilevel models

Page 31: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Multilevel modeling of opinions

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Bayesian inference, summarize by posterior simulations of β:Simulation θ1 · · · θ75

1 ** · · · **...

.... . .

...1000 ** · · · **

Andrew Gelman Fitting and understanding multilevel models

Page 32: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Multilevel modeling of opinions

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Bayesian inference, summarize by posterior simulations of β:Simulation θ1 · · · θ75

1 ** · · · **...

.... . .

...1000 ** · · · **

Andrew Gelman Fitting and understanding multilevel models

Page 33: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Multilevel modeling of opinions

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Bayesian inference, summarize by posterior simulations of β:Simulation θ1 · · · θ75

1 ** · · · **...

.... . .

...1000 ** · · · **

Andrew Gelman Fitting and understanding multilevel models

Page 34: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Multilevel modeling of opinions

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Bayesian inference, summarize by posterior simulations of β:Simulation θ1 · · · θ75

1 ** · · · **...

.... . .

...1000 ** · · · **

Andrew Gelman Fitting and understanding multilevel models

Page 35: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Multilevel modeling of opinions

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Bayesian inference, summarize by posterior simulations of β:Simulation θ1 · · · θ75

1 ** · · · **...

.... . .

...1000 ** · · · **

Andrew Gelman Fitting and understanding multilevel models

Page 36: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Multilevel modeling of opinions

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Bayesian inference, summarize by posterior simulations of β:Simulation θ1 · · · θ75

1 ** · · · **...

.... . .

...1000 ** · · · **

Andrew Gelman Fitting and understanding multilevel models

Page 37: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Interlude: why “multilevel” 6= “hierarchical”

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Crossed (nonnested) structure of age, education, state

I Several overlapping “hierarchies”

Andrew Gelman Fitting and understanding multilevel models

Page 38: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Interlude: why “multilevel” 6= “hierarchical”

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Crossed (nonnested) structure of age, education, state

I Several overlapping “hierarchies”

Andrew Gelman Fitting and understanding multilevel models

Page 39: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Interlude: why “multilevel” 6= “hierarchical”

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Crossed (nonnested) structure of age, education, state

I Several overlapping “hierarchies”

Andrew Gelman Fitting and understanding multilevel models

Page 40: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Interlude: why “multilevel” 6= “hierarchical”

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Crossed (nonnested) structure of age, education, state

I Several overlapping “hierarchies”

Andrew Gelman Fitting and understanding multilevel models

Page 41: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Interlude: why “multilevel” 6= “hierarchical”

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Crossed (nonnested) structure of age, education, state

I Several overlapping “hierarchies”

Andrew Gelman Fitting and understanding multilevel models

Page 42: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Interlude: why “multilevel” 6= “hierarchical”

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Crossed (nonnested) structure of age, education, state

I Several overlapping “hierarchies”

Andrew Gelman Fitting and understanding multilevel models

Page 43: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Interlude: why “multilevel” 6= “hierarchical”

I Logistic regression: Pr(yi = 1) = logit−1((Xβ)i )

I X includes demographic and geographic predictors

I Group-level model for the 16 age × education predictors

I Group-level model for the 50 state predictors

I Crossed (nonnested) structure of age, education, state

I Several overlapping “hierarchies”

Andrew Gelman Fitting and understanding multilevel models

Page 44: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Poststratification to estimate state opinions

I Implied inference for θj = logit−1(Xβ) in each of 3264 cells j(e.g., black female, age 18–29, college graduate,Massachusetts)

I PoststratificationI Within each state s, average over 64 cells:∑

j∈s Njθj

/ ∑j∈s Nj

I Nj = population in cell j (from Census)I 1000 simulation draws propagate to uncertainty for each θj

Andrew Gelman Fitting and understanding multilevel models

Page 45: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Poststratification to estimate state opinions

I Implied inference for θj = logit−1(Xβ) in each of 3264 cells j(e.g., black female, age 18–29, college graduate,Massachusetts)

I PoststratificationI Within each state s, average over 64 cells:∑

j∈s Njθj

/ ∑j∈s Nj

I Nj = population in cell j (from Census)I 1000 simulation draws propagate to uncertainty for each θj

Andrew Gelman Fitting and understanding multilevel models

Page 46: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Poststratification to estimate state opinions

I Implied inference for θj = logit−1(Xβ) in each of 3264 cells j(e.g., black female, age 18–29, college graduate,Massachusetts)

I PoststratificationI Within each state s, average over 64 cells:∑

j∈s Njθj

/ ∑j∈s Nj

I Nj = population in cell j (from Census)I 1000 simulation draws propagate to uncertainty for each θj

Andrew Gelman Fitting and understanding multilevel models

Page 47: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Poststratification to estimate state opinions

I Implied inference for θj = logit−1(Xβ) in each of 3264 cells j(e.g., black female, age 18–29, college graduate,Massachusetts)

I PoststratificationI Within each state s, average over 64 cells:∑

j∈s Njθj

/ ∑j∈s Nj

I Nj = population in cell j (from Census)I 1000 simulation draws propagate to uncertainty for each θj

Andrew Gelman Fitting and understanding multilevel models

Page 48: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Poststratification to estimate state opinions

I Implied inference for θj = logit−1(Xβ) in each of 3264 cells j(e.g., black female, age 18–29, college graduate,Massachusetts)

I PoststratificationI Within each state s, average over 64 cells:∑

j∈s Njθj

/ ∑j∈s Nj

I Nj = population in cell j (from Census)I 1000 simulation draws propagate to uncertainty for each θj

Andrew Gelman Fitting and understanding multilevel models

Page 49: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Poststratification to estimate state opinions

I Implied inference for θj = logit−1(Xβ) in each of 3264 cells j(e.g., black female, age 18–29, college graduate,Massachusetts)

I PoststratificationI Within each state s, average over 64 cells:∑

j∈s Njθj

/ ∑j∈s Nj

I Nj = population in cell j (from Census)I 1000 simulation draws propagate to uncertainty for each θj

Andrew Gelman Fitting and understanding multilevel models

Page 50: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 51: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 52: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 53: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 54: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 55: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 56: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 57: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 58: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 59: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

CBS/New York Times pre-election polls from 1988

I Validation study: fit model on poll data and compare toelection results

I Competing estimates:I No pooling: separate estimate within each stateI Complete pooling: no state predictorsI Hierarchical model and poststratify

I Mean absolute state errors:I No pooling: 10.4%I Complete pooling: 5.4%I Hierarchical model with poststratification: 4.5%

Andrew Gelman Fitting and understanding multilevel models

Page 60: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

State-level opinions from national pollsPoststratificationValidation

Validation study: comparison of state errors

1988 election outcome vs. poll estimate

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

no pooling of state effects

Estimated Bush support

Act

ual e

lect

ion

outc

ome

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

complete pooling (no state effects)

Estimated Bush support

Act

ual e

lect

ion

outc

ome

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

multilevel model

Estimated Bush support

Act

ual e

lect

ion

outc

ome

Andrew Gelman Fitting and understanding multilevel models

Page 61: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel models always

I Anything worth doing is worth doing repeatedly

I A “method” is any procedure applied more than onceI City planning

I Outward expansion: fitting a model to other countries, otheryears, other outcomes, . . .

I Infilling: inferences for individual states, demographicsubgroups, components of data, . . .

I “Frequentist” statistical theory of repeated inferences

Andrew Gelman Fitting and understanding multilevel models

Page 62: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel models always

I Anything worth doing is worth doing repeatedly

I A “method” is any procedure applied more than onceI City planning

I Outward expansion: fitting a model to other countries, otheryears, other outcomes, . . .

I Infilling: inferences for individual states, demographicsubgroups, components of data, . . .

I “Frequentist” statistical theory of repeated inferences

Andrew Gelman Fitting and understanding multilevel models

Page 63: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel models always

I Anything worth doing is worth doing repeatedly

I A “method” is any procedure applied more than onceI City planning

I Outward expansion: fitting a model to other countries, otheryears, other outcomes, . . .

I Infilling: inferences for individual states, demographicsubgroups, components of data, . . .

I “Frequentist” statistical theory of repeated inferences

Andrew Gelman Fitting and understanding multilevel models

Page 64: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel models always

I Anything worth doing is worth doing repeatedly

I A “method” is any procedure applied more than onceI City planning

I Outward expansion: fitting a model to other countries, otheryears, other outcomes, . . .

I Infilling: inferences for individual states, demographicsubgroups, components of data, . . .

I “Frequentist” statistical theory of repeated inferences

Andrew Gelman Fitting and understanding multilevel models

Page 65: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel models always

I Anything worth doing is worth doing repeatedly

I A “method” is any procedure applied more than onceI City planning

I Outward expansion: fitting a model to other countries, otheryears, other outcomes, . . .

I Infilling: inferences for individual states, demographicsubgroups, components of data, . . .

I “Frequentist” statistical theory of repeated inferences

Andrew Gelman Fitting and understanding multilevel models

Page 66: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel models always

I Anything worth doing is worth doing repeatedly

I A “method” is any procedure applied more than onceI City planning

I Outward expansion: fitting a model to other countries, otheryears, other outcomes, . . .

I Infilling: inferences for individual states, demographicsubgroups, components of data, . . .

I “Frequentist” statistical theory of repeated inferences

Andrew Gelman Fitting and understanding multilevel models

Page 67: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Incumbency advantage in U.S. House elections

I Regression approach (Gelman and King, 1990):I For any year, compare districts with and without incs runningI Control for vote in previous electionI Control for incumbent partyI vit = β0 + β1vi,t−1 + β2Pit + ψIit + εit

I Other estimates (sophomore surge, etc.) have selection bias

Andrew Gelman Fitting and understanding multilevel models

Page 68: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Incumbency advantage in U.S. House elections

I Regression approach (Gelman and King, 1990):I For any year, compare districts with and without incs runningI Control for vote in previous electionI Control for incumbent partyI vit = β0 + β1vi,t−1 + β2Pit + ψIit + εit

I Other estimates (sophomore surge, etc.) have selection bias

Andrew Gelman Fitting and understanding multilevel models

Page 69: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Incumbency advantage in U.S. House elections

I Regression approach (Gelman and King, 1990):I For any year, compare districts with and without incs runningI Control for vote in previous electionI Control for incumbent partyI vit = β0 + β1vi,t−1 + β2Pit + ψIit + εit

I Other estimates (sophomore surge, etc.) have selection bias

Andrew Gelman Fitting and understanding multilevel models

Page 70: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Incumbency advantage in U.S. House elections

I Regression approach (Gelman and King, 1990):I For any year, compare districts with and without incs runningI Control for vote in previous electionI Control for incumbent partyI vit = β0 + β1vi,t−1 + β2Pit + ψIit + εit

I Other estimates (sophomore surge, etc.) have selection bias

Andrew Gelman Fitting and understanding multilevel models

Page 71: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Incumbency advantage in U.S. House elections

I Regression approach (Gelman and King, 1990):I For any year, compare districts with and without incs runningI Control for vote in previous electionI Control for incumbent partyI vit = β0 + β1vi,t−1 + β2Pit + ψIit + εit

I Other estimates (sophomore surge, etc.) have selection bias

Andrew Gelman Fitting and understanding multilevel models

Page 72: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Incumbency advantage in U.S. House elections

I Regression approach (Gelman and King, 1990):I For any year, compare districts with and without incs runningI Control for vote in previous electionI Control for incumbent partyI vit = β0 + β1vi,t−1 + β2Pit + ψIit + εit

I Other estimates (sophomore surge, etc.) have selection bias

Andrew Gelman Fitting and understanding multilevel models

Page 73: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Estimated incumbency advantage from lagged regressions

Year

Est

inc

adv

from

lagg

ed r

egre

ssio

n

1900 1920 1940 1960 1980 2000

0.0

0.05

0.10

0.15

Andrew Gelman Fitting and understanding multilevel models

Page 74: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Can we do better?

I Regression estimate: vit = β0 + β1vi ,t−1 + β2Pit + ψIit + εitI “Political science” problem: ψ is assumed to be same in all

districts

I “Statistics” problem: the model doesn’t fit the data

I We’ll show pictures of the model not fitting

I We’ll set up a model allowing inc advantage to vary

Andrew Gelman Fitting and understanding multilevel models

Page 75: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Can we do better?

I Regression estimate: vit = β0 + β1vi ,t−1 + β2Pit + ψIit + εitI “Political science” problem: ψ is assumed to be same in all

districts

I “Statistics” problem: the model doesn’t fit the data

I We’ll show pictures of the model not fitting

I We’ll set up a model allowing inc advantage to vary

Andrew Gelman Fitting and understanding multilevel models

Page 76: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Can we do better?

I Regression estimate: vit = β0 + β1vi ,t−1 + β2Pit + ψIit + εitI “Political science” problem: ψ is assumed to be same in all

districts

I “Statistics” problem: the model doesn’t fit the data

I We’ll show pictures of the model not fitting

I We’ll set up a model allowing inc advantage to vary

Andrew Gelman Fitting and understanding multilevel models

Page 77: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Can we do better?

I Regression estimate: vit = β0 + β1vi ,t−1 + β2Pit + ψIit + εitI “Political science” problem: ψ is assumed to be same in all

districts

I “Statistics” problem: the model doesn’t fit the data

I We’ll show pictures of the model not fitting

I We’ll set up a model allowing inc advantage to vary

Andrew Gelman Fitting and understanding multilevel models

Page 78: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Can we do better?

I Regression estimate: vit = β0 + β1vi ,t−1 + β2Pit + ψIit + εitI “Political science” problem: ψ is assumed to be same in all

districts

I “Statistics” problem: the model doesn’t fit the data

I We’ll show pictures of the model not fitting

I We’ll set up a model allowing inc advantage to vary

Andrew Gelman Fitting and understanding multilevel models

Page 79: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Model misfit

Under the model, parallel lines are fitted to the circles (open seats)and dots (incs running for reelection)

Democratic vote in 1986

Dem

ocra

tic v

ote

in 1

988

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

o

oo

o

o

o

o

o

o oo

o

oo

oo

oo

o

o o

o

Year

Coe

ffici

ents

for

lagg

ed v

ote

1900 1920 1940 1960 1980 2000

0.2

0.4

0.6

0.8

1.0

1.2

incumbents runningopen seats

Andrew Gelman Fitting and understanding multilevel models

Page 80: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 81: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 82: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 83: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 84: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 85: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 86: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 87: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Multilevel model

I for t = 1, 2: vit = 0.5 + δt + αi + φit Iit + εitI δ2 − δ1 is the national vote swingI αi is the “normal vote” for district i : mean 0, sd σα.I φit is the inc advantage in district i at time t: mean ψ, sd σφ

I εit ’s are independent errors: mean 0 and sd σε.

I Candidate-level incumbency effects:I If the same incumbent is running in years 1 and 2, thenφi2 ≡ φi1

I Otherwise, φi1 and φi2 are independent

Andrew Gelman Fitting and understanding multilevel models

Page 88: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Fitting the multilevel model

I Bayesian inference

I Linear parameters: national vote swings, district effects,incumbency effects

I 3 variance parameters: district effects, incumbency effects,residual errors

I Need to model a selection effect: information provided by theincumbent party at time 1

I Solve analytically for Pr(inclusion), include factor in thelikelihood

I Gibbs-Metropolis sampling, program in Splus

Andrew Gelman Fitting and understanding multilevel models

Page 89: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Fitting the multilevel model

I Bayesian inference

I Linear parameters: national vote swings, district effects,incumbency effects

I 3 variance parameters: district effects, incumbency effects,residual errors

I Need to model a selection effect: information provided by theincumbent party at time 1

I Solve analytically for Pr(inclusion), include factor in thelikelihood

I Gibbs-Metropolis sampling, program in Splus

Andrew Gelman Fitting and understanding multilevel models

Page 90: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Fitting the multilevel model

I Bayesian inference

I Linear parameters: national vote swings, district effects,incumbency effects

I 3 variance parameters: district effects, incumbency effects,residual errors

I Need to model a selection effect: information provided by theincumbent party at time 1

I Solve analytically for Pr(inclusion), include factor in thelikelihood

I Gibbs-Metropolis sampling, program in Splus

Andrew Gelman Fitting and understanding multilevel models

Page 91: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Fitting the multilevel model

I Bayesian inference

I Linear parameters: national vote swings, district effects,incumbency effects

I 3 variance parameters: district effects, incumbency effects,residual errors

I Need to model a selection effect: information provided by theincumbent party at time 1

I Solve analytically for Pr(inclusion), include factor in thelikelihood

I Gibbs-Metropolis sampling, program in Splus

Andrew Gelman Fitting and understanding multilevel models

Page 92: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Fitting the multilevel model

I Bayesian inference

I Linear parameters: national vote swings, district effects,incumbency effects

I 3 variance parameters: district effects, incumbency effects,residual errors

I Need to model a selection effect: information provided by theincumbent party at time 1

I Solve analytically for Pr(inclusion), include factor in thelikelihood

I Gibbs-Metropolis sampling, program in Splus

Andrew Gelman Fitting and understanding multilevel models

Page 93: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Fitting the multilevel model

I Bayesian inference

I Linear parameters: national vote swings, district effects,incumbency effects

I 3 variance parameters: district effects, incumbency effects,residual errors

I Need to model a selection effect: information provided by theincumbent party at time 1

I Solve analytically for Pr(inclusion), include factor in thelikelihood

I Gibbs-Metropolis sampling, program in Splus

Andrew Gelman Fitting and understanding multilevel models

Page 94: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Estimated incumbency advantage and its variation

Year

Ave

rage

Incu

mbe

ncy

Adv

anta

ge

1900 1920 1940 1960 1980 2000

0.0

0.04

0.08

0.12

Year

SD

of D

istr

ict E

ffect

s

1900 1920 1940 1960 1980 2000

0.0

0.05

0.10

0.15

Year

SD

of I

ncum

benc

y A

dvan

tage

1900 1920 1940 1960 1980 2000

0.0

0.02

0.04

0.06

Year

Res

idua

l SD

of E

lect

ion

Res

ults

1900 1920 1940 1960 1980 2000

0.0

0.02

0.04

0.06

Andrew Gelman Fitting and understanding multilevel models

Page 95: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Compare oldand new estimates

Year

Est

inc

adv

from

lagg

ed r

egre

ssio

n

1900 1920 1940 1960 1980 2000

0.0

0.05

0.10

0.15

YearA

vera

ge In

cum

benc

y A

dvan

tage

1900 1920 1940 1960 1980 2000

0.0

0.04

0.08

0.12

Year

SD

of D

istr

ict E

ffect

s

1900 1920 1940 1960 1980 2000

0.0

0.05

0.10

0.15

Year

SD

of I

ncum

benc

y A

dvan

tage

1900 1920 1940 1960 1980 2000

0.0

0.02

0.04

0.06

Year

Res

idua

l SD

of E

lect

ion

Res

ults

1900 1920 1940 1960 1980 2000

0.0

0.02

0.04

0.06Andrew Gelman Fitting and understanding multilevel models

Page 96: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

No-interaction modelI Before-after data with treatment and control groupsI Default model: constant treatment effects

I Fisher’s classical null hyp: effect is zero for all casesI Regression model: yi = Tiθ + Xiβ + εi

control

treatment

"before" measurement, x

"afte

r" m

easu

rem

ent,

y

Andrew Gelman Fitting and understanding multilevel models

Page 97: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

No-interaction modelI Before-after data with treatment and control groupsI Default model: constant treatment effects

I Fisher’s classical null hyp: effect is zero for all casesI Regression model: yi = Tiθ + Xiβ + εi

control

treatment

"before" measurement, x

"afte

r" m

easu

rem

ent,

y

Andrew Gelman Fitting and understanding multilevel models

Page 98: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

No-interaction modelI Before-after data with treatment and control groupsI Default model: constant treatment effects

I Fisher’s classical null hyp: effect is zero for all casesI Regression model: yi = Tiθ + Xiβ + εi

control

treatment

"before" measurement, x

"afte

r" m

easu

rem

ent,

y

Andrew Gelman Fitting and understanding multilevel models

Page 99: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

No-interaction modelI Before-after data with treatment and control groupsI Default model: constant treatment effects

I Fisher’s classical null hyp: effect is zero for all casesI Regression model: yi = Tiθ + Xiβ + εi

control

treatment

"before" measurement, x

"afte

r" m

easu

rem

ent,

y

Andrew Gelman Fitting and understanding multilevel models

Page 100: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Actual data show interactions

I Treatment interacts with “before” measurement

I Before-after correlation is higher for controls than for treatedunits

I ExamplesI An observational study of legislative redistrictingI An experiment with pre-test, post-test dataI Congressional elections with incumbents and open seats

Andrew Gelman Fitting and understanding multilevel models

Page 101: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Actual data show interactions

I Treatment interacts with “before” measurement

I Before-after correlation is higher for controls than for treatedunits

I ExamplesI An observational study of legislative redistrictingI An experiment with pre-test, post-test dataI Congressional elections with incumbents and open seats

Andrew Gelman Fitting and understanding multilevel models

Page 102: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Actual data show interactions

I Treatment interacts with “before” measurement

I Before-after correlation is higher for controls than for treatedunits

I ExamplesI An observational study of legislative redistrictingI An experiment with pre-test, post-test dataI Congressional elections with incumbents and open seats

Andrew Gelman Fitting and understanding multilevel models

Page 103: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Actual data show interactions

I Treatment interacts with “before” measurement

I Before-after correlation is higher for controls than for treatedunits

I ExamplesI An observational study of legislative redistrictingI An experiment with pre-test, post-test dataI Congressional elections with incumbents and open seats

Andrew Gelman Fitting and understanding multilevel models

Page 104: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Actual data show interactions

I Treatment interacts with “before” measurement

I Before-after correlation is higher for controls than for treatedunits

I ExamplesI An observational study of legislative redistrictingI An experiment with pre-test, post-test dataI Congressional elections with incumbents and open seats

Andrew Gelman Fitting and understanding multilevel models

Page 105: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Actual data show interactions

I Treatment interacts with “before” measurement

I Before-after correlation is higher for controls than for treatedunits

I ExamplesI An observational study of legislative redistrictingI An experiment with pre-test, post-test dataI Congressional elections with incumbents and open seats

Andrew Gelman Fitting and understanding multilevel models

Page 106: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Observational study of legislative redistrictingbefore-after data

Estimated partisan bias in previous election

Est

imat

ed p

artis

an b

ias

(adj

uste

d fo

r st

ate)

-0.05 0.0 0.05

-0.0

50.

00.

05

no redistricting

bipartisan redistrict

Dem. redistrict

Rep. redistrict.

. .. .

.

..

.....

...

.

.

.

...

.

.

..

..

.

.

.. .

. .

.

.

.

.

.

..

..

.

. .

.

.

. .

..

..

.

.. ..

.

.

.

..

.

..

..

.

.

. .

.

.

.

..

..

.

.

.

.

.

.. .

.

.

..

.

.

.

. .

..

..

.

..

.

. .

.

.. o

o

o

o

o o

ox

x

x

x

x

xx

x

x

x

•• •

•••

(favors Democrats)

(favors Republicans)

Andrew Gelman Fitting and understanding multilevel models

Page 107: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Experiment: correlation between pre-test and post-testdata for controls and for treated units

grade

corr

elat

ion

1 2 3 4

0.8

0.9

1.0

controls

treated

Andrew Gelman Fitting and understanding multilevel models

Page 108: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Correlation between two successive Congressional electionsfor incumbents running (controls) and open seats (treated)

1900 1920 1940 1960 1980 2000

0.0

0.2

0.4

0.6

0.8

year

corr

elat

ion

incumbents

open seats

Andrew Gelman Fitting and understanding multilevel models

Page 109: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Interactions as variance components

Unit-level “error term” ηi

I For control units, ηi persists from time 1 to time 2I For treatment units, ηi changes:

I Subtractive treatment error (ηi only at time 1)I Additive treatment error (ηi only at time 2)I Replacement treatment error

I Under all these models, the before-after correlation is higherfor controls than treated units

Andrew Gelman Fitting and understanding multilevel models

Page 110: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Interactions as variance components

Unit-level “error term” ηi

I For control units, ηi persists from time 1 to time 2I For treatment units, ηi changes:

I Subtractive treatment error (ηi only at time 1)I Additive treatment error (ηi only at time 2)I Replacement treatment error

I Under all these models, the before-after correlation is higherfor controls than treated units

Andrew Gelman Fitting and understanding multilevel models

Page 111: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Interactions as variance components

Unit-level “error term” ηi

I For control units, ηi persists from time 1 to time 2I For treatment units, ηi changes:

I Subtractive treatment error (ηi only at time 1)I Additive treatment error (ηi only at time 2)I Replacement treatment error

I Under all these models, the before-after correlation is higherfor controls than treated units

Andrew Gelman Fitting and understanding multilevel models

Page 112: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Interactions as variance components

Unit-level “error term” ηi

I For control units, ηi persists from time 1 to time 2I For treatment units, ηi changes:

I Subtractive treatment error (ηi only at time 1)I Additive treatment error (ηi only at time 2)I Replacement treatment error

I Under all these models, the before-after correlation is higherfor controls than treated units

Andrew Gelman Fitting and understanding multilevel models

Page 113: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Interactions as variance components

Unit-level “error term” ηi

I For control units, ηi persists from time 1 to time 2I For treatment units, ηi changes:

I Subtractive treatment error (ηi only at time 1)I Additive treatment error (ηi only at time 2)I Replacement treatment error

I Under all these models, the before-after correlation is higherfor controls than treated units

Andrew Gelman Fitting and understanding multilevel models

Page 114: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Interactions as variance components

Unit-level “error term” ηi

I For control units, ηi persists from time 1 to time 2I For treatment units, ηi changes:

I Subtractive treatment error (ηi only at time 1)I Additive treatment error (ηi only at time 2)I Replacement treatment error

I Under all these models, the before-after correlation is higherfor controls than treated units

Andrew Gelman Fitting and understanding multilevel models

Page 115: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

General frameworkEstimating incumbency advantage and its variationInteractions in before-after studies

Interactions as variance components

Unit-level “error term” ηi

I For control units, ηi persists from time 1 to time 2I For treatment units, ηi changes:

I Subtractive treatment error (ηi only at time 1)I Additive treatment error (ηi only at time 2)I Replacement treatment error

I Under all these models, the before-after correlation is higherfor controls than treated units

Andrew Gelman Fitting and understanding multilevel models

Page 116: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Some new tools

I Building and fitting multilevel models

I Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 117: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Some new tools

I Building and fitting multilevel models

I Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 118: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Some new tools

I Building and fitting multilevel models

I Displaying and summarizing inferences

Andrew Gelman Fitting and understanding multilevel models

Page 119: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Building and fitting multilevel models

I A reparameterization can change a model(even if it leaves the likelihood unchanged)

I Redundant additive parameterization

I Redundant multiplicative parameterization

I Weakly-informative prior distribution for group-level varianceparameters

Andrew Gelman Fitting and understanding multilevel models

Page 120: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Building and fitting multilevel models

I A reparameterization can change a model(even if it leaves the likelihood unchanged)

I Redundant additive parameterization

I Redundant multiplicative parameterization

I Weakly-informative prior distribution for group-level varianceparameters

Andrew Gelman Fitting and understanding multilevel models

Page 121: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Building and fitting multilevel models

I A reparameterization can change a model(even if it leaves the likelihood unchanged)

I Redundant additive parameterization

I Redundant multiplicative parameterization

I Weakly-informative prior distribution for group-level varianceparameters

Andrew Gelman Fitting and understanding multilevel models

Page 122: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Building and fitting multilevel models

I A reparameterization can change a model(even if it leaves the likelihood unchanged)

I Redundant additive parameterization

I Redundant multiplicative parameterization

I Weakly-informative prior distribution for group-level varianceparameters

Andrew Gelman Fitting and understanding multilevel models

Page 123: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Building and fitting multilevel models

I A reparameterization can change a model(even if it leaves the likelihood unchanged)

I Redundant additive parameterization

I Redundant multiplicative parameterization

I Weakly-informative prior distribution for group-level varianceparameters

Andrew Gelman Fitting and understanding multilevel models

Page 124: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant parameterization

I Data model: Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)I Usual model for the coefficients:

βagej ∼ N(0, σ2

age), for j = 1, . . . , 4

βstatej ∼ N(0, σ2

state), for j = 1, . . . , 50

I Additively redundant model:

βagej ∼ N(µage, σ

2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Why add the redundant µage, µstate?I Iterative algorithm moves more smoothly

Andrew Gelman Fitting and understanding multilevel models

Page 125: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant parameterization

I Data model: Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)I Usual model for the coefficients:

βagej ∼ N(0, σ2

age), for j = 1, . . . , 4

βstatej ∼ N(0, σ2

state), for j = 1, . . . , 50

I Additively redundant model:

βagej ∼ N(µage, σ

2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Why add the redundant µage, µstate?I Iterative algorithm moves more smoothly

Andrew Gelman Fitting and understanding multilevel models

Page 126: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant parameterization

I Data model: Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)I Usual model for the coefficients:

βagej ∼ N(0, σ2

age), for j = 1, . . . , 4

βstatej ∼ N(0, σ2

state), for j = 1, . . . , 50

I Additively redundant model:

βagej ∼ N(µage, σ

2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Why add the redundant µage, µstate?I Iterative algorithm moves more smoothly

Andrew Gelman Fitting and understanding multilevel models

Page 127: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant parameterization

I Data model: Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)I Usual model for the coefficients:

βagej ∼ N(0, σ2

age), for j = 1, . . . , 4

βstatej ∼ N(0, σ2

state), for j = 1, . . . , 50

I Additively redundant model:

βagej ∼ N(µage, σ

2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Why add the redundant µage, µstate?I Iterative algorithm moves more smoothly

Andrew Gelman Fitting and understanding multilevel models

Page 128: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant parameterization

I Data model: Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)I Usual model for the coefficients:

βagej ∼ N(0, σ2

age), for j = 1, . . . , 4

βstatej ∼ N(0, σ2

state), for j = 1, . . . , 50

I Additively redundant model:

βagej ∼ N(µage, σ

2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Why add the redundant µage, µstate?I Iterative algorithm moves more smoothly

Andrew Gelman Fitting and understanding multilevel models

Page 129: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant parameterization

I Data model: Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)I Usual model for the coefficients:

βagej ∼ N(0, σ2

age), for j = 1, . . . , 4

βstatej ∼ N(0, σ2

state), for j = 1, . . . , 50

I Additively redundant model:

βagej ∼ N(µage, σ

2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Why add the redundant µage, µstate?I Iterative algorithm moves more smoothly

Andrew Gelman Fitting and understanding multilevel models

Page 130: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant additive parameterization

I Model

Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Identify using centered parameters:

β̃agej = βage

j − β̄age, for j = 1, . . . , 4

β̃statej = βstate

j − β̄state, for j = 1, . . . , 50

I Redefine the constant term:

β̃0 = β0 + β̄age + β̄age

Andrew Gelman Fitting and understanding multilevel models

Page 131: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant additive parameterization

I Model

Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Identify using centered parameters:

β̃agej = βage

j − β̄age, for j = 1, . . . , 4

β̃statej = βstate

j − β̄state, for j = 1, . . . , 50

I Redefine the constant term:

β̃0 = β0 + β̄age + β̄age

Andrew Gelman Fitting and understanding multilevel models

Page 132: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant additive parameterization

I Model

Pr(yi = 1) = logit−1(β0 + βage

age(i) + βstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Identify using centered parameters:

β̃agej = βage

j − β̄age, for j = 1, . . . , 4

β̃statej = βstate

j − β̄state, for j = 1, . . . , 50

I Redefine the constant term:

β̃0 = β0 + β̄age + β̄age

Andrew Gelman Fitting and understanding multilevel models

Page 133: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant multiplicative parameterization

I New model

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Identify using centered and scaled parameters:

β̃agej = ξage(βage

j − β̄age), for j = 1, . . . , 4

β̃statej = ξstate

(βstate

j − β̄state), for j = 1, . . . , 50

I Faster convergence

I More general model, connections to factor analysis

Andrew Gelman Fitting and understanding multilevel models

Page 134: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant multiplicative parameterization

I New model

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Identify using centered and scaled parameters:

β̃agej = ξage(βage

j − β̄age), for j = 1, . . . , 4

β̃statej = ξstate

(βstate

j − β̄state), for j = 1, . . . , 50

I Faster convergence

I More general model, connections to factor analysis

Andrew Gelman Fitting and understanding multilevel models

Page 135: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant multiplicative parameterization

I New model

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Identify using centered and scaled parameters:

β̃agej = ξage(βage

j − β̄age), for j = 1, . . . , 4

β̃statej = ξstate

(βstate

j − β̄state), for j = 1, . . . , 50

I Faster convergence

I More general model, connections to factor analysis

Andrew Gelman Fitting and understanding multilevel models

Page 136: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Redundant multiplicative parameterization

I New model

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Identify using centered and scaled parameters:

β̃agej = ξage(βage

j − β̄age), for j = 1, . . . , 4

β̃statej = ξstate

(βstate

j − β̄state), for j = 1, . . . , 50

I Faster convergence

I More general model, connections to factor analysis

Andrew Gelman Fitting and understanding multilevel models

Page 137: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Weakly informative prior distribution for the multilevelvariance parameter

I Redundant multiplicative parameterization:

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Separate prior distributions on the ξ and σ parameters:I Normal on ξI Inverse-gamma on σ2

I Generalizes and fixes problems with the standard choices ofprior distributions

Andrew Gelman Fitting and understanding multilevel models

Page 138: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Weakly informative prior distribution for the multilevelvariance parameter

I Redundant multiplicative parameterization:

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Separate prior distributions on the ξ and σ parameters:I Normal on ξI Inverse-gamma on σ2

I Generalizes and fixes problems with the standard choices ofprior distributions

Andrew Gelman Fitting and understanding multilevel models

Page 139: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Weakly informative prior distribution for the multilevelvariance parameter

I Redundant multiplicative parameterization:

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Separate prior distributions on the ξ and σ parameters:I Normal on ξI Inverse-gamma on σ2

I Generalizes and fixes problems with the standard choices ofprior distributions

Andrew Gelman Fitting and understanding multilevel models

Page 140: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Weakly informative prior distribution for the multilevelvariance parameter

I Redundant multiplicative parameterization:

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Separate prior distributions on the ξ and σ parameters:I Normal on ξI Inverse-gamma on σ2

I Generalizes and fixes problems with the standard choices ofprior distributions

Andrew Gelman Fitting and understanding multilevel models

Page 141: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Weakly informative prior distribution for the multilevelvariance parameter

I Redundant multiplicative parameterization:

Pr(yi = 1) = logit−1(β0 + ξageβage

age(i) + ξstateβstatestate(i)

)βage

j ∼ N(µage, σ2age), for j = 1, . . . , 4

βstatej ∼ N(µstate, σ

2state), for j = 1, . . . , 50

I Separate prior distributions on the ξ and σ parameters:I Normal on ξI Inverse-gamma on σ2

I Generalizes and fixes problems with the standard choices ofprior distributions

Andrew Gelman Fitting and understanding multilevel models

Page 142: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Displaying and summarizing inferences

I Displaying parameters in groups rather than as a long list

I Average predictive effects

I R2 and partial pooling factors

I Analysis of variance

Andrew Gelman Fitting and understanding multilevel models

Page 143: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Displaying and summarizing inferences

I Displaying parameters in groups rather than as a long list

I Average predictive effects

I R2 and partial pooling factors

I Analysis of variance

Andrew Gelman Fitting and understanding multilevel models

Page 144: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Displaying and summarizing inferences

I Displaying parameters in groups rather than as a long list

I Average predictive effects

I R2 and partial pooling factors

I Analysis of variance

Andrew Gelman Fitting and understanding multilevel models

Page 145: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Displaying and summarizing inferences

I Displaying parameters in groups rather than as a long list

I Average predictive effects

I R2 and partial pooling factors

I Analysis of variance

Andrew Gelman Fitting and understanding multilevel models

Page 146: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Displaying and summarizing inferences

I Displaying parameters in groups rather than as a long list

I Average predictive effects

I R2 and partial pooling factors

I Analysis of variance

Andrew Gelman Fitting and understanding multilevel models

Page 147: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Raw display of inference

mean sd 2.5% 25% 50% 75% 97.5% Rhat n.eff

B.0 0.402 0.147 0.044 0.326 0.413 0.499 0.652 1.024 110

b.female -0.094 0.102 -0.283 -0.162 -0.095 -0.034 0.107 1.001 1000

b.black -1.701 0.305 -2.323 -1.910 -1.691 -1.486 -1.152 1.014 500

b.female.black -0.143 0.393 -0.834 -0.383 -0.155 0.104 0.620 1.007 1000

B.age[1] 0.084 0.088 -0.053 0.012 0.075 0.140 0.277 1.062 45

B.age[2] -0.072 0.087 -0.260 -0.121 -0.054 -0.006 0.052 1.017 190

B.age[3] 0.044 0.077 -0.105 -0.007 0.038 0.095 0.203 1.029 130

B.age[4] -0.057 0.096 -0.265 -0.115 -0.052 0.001 0.133 1.076 32

B.edu[1] -0.148 0.131 -0.436 -0.241 -0.137 -0.044 0.053 1.074 31

B.edu[2] -0.022 0.082 -0.182 -0.069 -0.021 0.025 0.152 1.028 160

B.edu[3] 0.148 0.112 -0.032 0.065 0.142 0.228 0.370 1.049 45

B.edu[4] 0.023 0.090 -0.170 -0.030 0.015 0.074 0.224 1.061 37

B.age.edu[1,1] -0.044 0.133 -0.363 -0.104 -0.019 0.025 0.170 1.018 1000

B.age.edu[1,2] 0.059 0.123 -0.153 -0.011 0.032 0.118 0.353 1.016 580

B.age.edu[1,3] 0.049 0.124 -0.146 -0.023 0.022 0.104 0.349 1.015 280

B.age.edu[1,4] 0.001 0.116 -0.237 -0.061 0.000 0.052 0.280 1.010 1000

B.age.edu[2,1] 0.066 0.152 -0.208 -0.008 0.032 0.124 0.449 1.022 140

B.age.edu[2,2] -0.081 0.127 -0.407 -0.137 -0.055 0.001 0.094 1.057 120

B.age.edu[2,3] -0.004 0.102 -0.226 -0.048 0.000 0.041 0.215 1.008 940

B.age.edu[2,4] -0.042 0.108 -0.282 -0.100 -0.026 0.014 0.157 1.017 170

B.age.edu[3,1] 0.034 0.135 -0.215 -0.030 0.009 0.091 0.361 1.021 230

B.age.edu[3,2] 0.007 0.102 -0.213 -0.039 0.003 0.052 0.220 1.019 610

B.age.edu[3,3] 0.033 0.130 -0.215 -0.029 0.009 0.076 0.410 1.080 61

B.age.edu[3,4] -0.009 0.109 -0.236 -0.064 -0.005 0.043 0.214 1.024 150

B.age.edu[4,1] -0.141 0.190 -0.672 -0.224 -0.086 -0.003 0.100 1.036 270

B.age.edu[4,2] -0.014 0.119 -0.280 -0.059 -0.008 0.033 0.239 1.017 240

B.age.edu[4,3] 0.046 0.132 -0.192 -0.024 0.019 0.108 0.332 1.010 210

B.age.edu[4,4] 0.042 0.142 -0.193 -0.022 0.016 0.095 0.377 1.015 160

B.state[1] 0.201 0.211 -0.131 0.047 0.172 0.326 0.646 1.003 960

B.state[2] 0.466 0.252 0.008 0.310 0.440 0.603 1.047 1.001 1000

B.state[3] 0.393 0.196 0.023 0.268 0.380 0.518 0.814 1.002 1000

B.state[4] -0.164 0.209 -0.607 -0.290 -0.149 -0.041 0.228 1.003 590

B.state[5] -0.054 0.141 -0.322 -0.143 -0.061 0.035 0.229 1.001 1000

B.state[6] 0.126 0.206 -0.313 0.010 0.126 0.256 0.512 1.011 1000

B.state[7] 0.095 0.183 -0.263 -0.023 0.087 0.207 0.466 1.004 490

B.state[8] -0.210 0.207 -0.666 -0.322 -0.194 -0.080 0.155 1.001 1000

B.state[9] -2.648 0.728 -4.291 -3.067 -2.602 -2.187 -1.385 1.007 290

B.state[10] 0.097 0.173 -0.296 -0.010 0.115 0.214 0.402 1.014 270

B.state[11] -0.138 0.173 -0.467 -0.253 -0.148 -0.034 0.240 1.005 1000

Andrew Gelman Fitting and understanding multilevel models

Page 148: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Raw graphical display

80% interval for each chain R−hat

−4

−4

−2

−2

0

0

2

2

1 1.5 2+

1 1.5 2+

1 1.5 2+

1 1.5 2+

1 1.5 2+

1 1.5 2+

B.0 ●●●

b.female ●●●

b.black ●●●

b.female.black ●●●

B.age[1] ●●●

[2] ●●●

[3] ●●●

[4] ●●●

B.edu[1] ●●●

[2] ●●●

[3] ●●●

[4] ●●●

B.age.edu[1,1] ●●●

[1,2] ●●●

[1,3] ●●●

[1,4] ●●●

[2,1] ●●●

[2,2] ●●●

[2,3] ●●●

[2,4] ●●●

[3,1] ●●●

[3,2] ●●●

B.state[1] ●●●

[2] ●●●

[3] ●●●

[4] ●●●

[5] ●●●

[6] ●●●

[7] ●●●

[8] ●●●

[9] ●●●

[10] ●●●

B.region[1] ●●●

[2] ●●●

[3] ●●●

[4] ●●●

[5] ●●●

Sigma.age ●●●

Sigma.edu ●●●

Sigma.age.edu ●●●

Sigma.state ●●●

Sigma.region ●●●

*

*

* array truncated for lack of space

medians and 80% intervals

B.00

0.20.40.6

●●●

b.female−0.3−0.2−0.1

00.1

●●●

b.black−2.5−2

−1.5−1

●●●

b.female.black−1

−0.50

0.5●●●

B.age−0.4−0.2

00.20.4

●●●

111111111

●●●

222222222

●●●

333333333

●●●

444444444

B.edu−0.5

00.5

●●●

111111111

●●●

222222222

●●●

333333333

●●●

444444444

B.age.edu−1

−0.50

0.5●●●

111111111111111111

●●●

222222222

●●●

333333333

●●●

444444444

●●●

222222222111111111

●●●

222222222

●●●

333333333

●●●

444444444

●●●

333333333111111111

●●●

222222222

●●●

333333333

●●●

444444444

●●●

444444444111111111

●●●

222222222

●●●

333333333

●●●

444444444

B.state−4−202

●●●

111111111

●●●

222222222

●●●

333333333

●●●

444444444

●●●

555555555

●●●

666666666

●●●

777777777

●●●

888888888●●●

999999999

●●●

101010101010101010

●●● ●●●

121212121212121212

●●●●●●

141414141414141414

●●● ●●●

161616161616161616

●●● ●●●

181818181818181818

●●● ●●●

202020202020202020

●●● ●●●

222222222222222222

●●● ●●●

242424242424242424

●●● ●●●

262626262626262626

●●●●●●

282828282828282828

●●● ●●●

303030303030303030

●●● ●●●

323232323232323232

●●● ●●●

343434343434343434

●●● ●●●

363636363636363636

●●● ●●●

383838383838383838

●●● ●●●

404040404040404040

*

B.region−1

−0.50

0.51

●●●

111111111

●●●

222222222

●●●

333333333

●●●

444444444

●●●

555555555

Sigma.age0

0.20.40.6

●●●

Sigma.edu0

0.51

●●●

Sigma.age.edu0

0.10.20.3

●●●

Sigma.state0

0.10.20.30.4

●●●

Sigma.region0

0.51

●●●

deviance2580260026202640

●●●

Bugs model at "C:/books/multilevel/election88/model4.bug", 3 chains, each with 2001 iterations

Andrew Gelman Fitting and understanding multilevel models

Page 149: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Better graphical display 1: demographics

femaleblackfemale x black

18−2930−4445−6465+

no h.s.high schoolsome collegecollege grad

18−29 x no h.s.18−29 x high school18−29 x some college18−29 x college grad

30−44 x no h.s.30−44 x high school30−44 x some college30−44 x college grad

45−64 x no h.s.45−64 x high school45−64 x some college45−64 x college grad

65+ x no h.s.65+ x high school65+ x some college65+ x college grad

−2.5

−2.5

−2

−2

−1.5

−1.5

−1

−1

−0.5

−0.5

0

0

0.5

0.5

1

1

Andrew Gelman Fitting and understanding multilevel models

Page 150: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Better graphical display 2: within states

−2.0 −1.0 0.00.0

0.4

0.8

Alaska

linear predictor

Pr

(sup

port

Bus

h)

−2.0 −1.0 0.00.0

0.4

0.8

Arizona

linear predictor

Pr

(sup

port

Bus

h)

−2.0 −1.0 0.00.0

0.4

0.8

Arkansas

linear predictor

Pr

(sup

port

Bus

h)

−2.0 −1.0 0.00.0

0.4

0.8

California

linear predictor

Pr

(sup

port

Bus

h)

−2.0 −1.0 0.00.0

0.4

0.8

Colorado

linear predictor

Pr

(sup

port

Bus

h)

−2.0 −1.0 0.00.0

0.4

0.8

Connecticut

linear predictor

Pr

(sup

port

Bus

h)

−2.0 −1.0 0.00.0

0.4

0.8

Delaware

linear predictor

Pr

(sup

port

Bus

h)−2.0 −1.0 0.00.

00.

40.

8

District of Columbia

linear predictor

Pr

(sup

port

Bus

h)Andrew Gelman Fitting and understanding multilevel models

Page 151: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Better graphical display 3: between states

Northeast

R vote in prev elections

regr

essi

on in

terc

ept

0.5 0.6 0.7

−0.

50.

00.

5

CTDEME

MDMA

NHNJ

NYPA

RI

VTWV

Midwest

R vote in prev elections

regr

essi

on in

terc

ept

0.5 0.6 0.7

−0.

50.

00.

5

IL

IN

IA

KS

MI

MN

MONEND

OH

SDWI

South

R vote in prev elections

regr

essi

on in

terc

ept

0.5 0.6 0.7

−0.

50.

00.

5 AL

AR FLGAKY LAMSNC

OKSCTN

TX

VA

West

R vote in prev elections

regr

essi

on in

terc

ept

0.5 0.6 0.7

−0.

50.

00.

5

AKAZ

CA COHIID

MTNV

NMOR

UT

WAWY

Andrew Gelman Fitting and understanding multilevel models

Page 152: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Average predictive effects

I What is E (y | x1 = high) − E(y | x1 = low), with all other x ’sheld constant?

u

v

y

Andrew Gelman Fitting and understanding multilevel models

Page 153: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Average predictive effects

I What is E (y | x1 = high) − E(y | x1 = low), with all other x ’sheld constant?

I In general, difference can depend on xI Average over distribution of x in the data

I You can’t just use a central value of x

I Compute APE for each input variable x

I Multilevel factors are categorical input variables

Andrew Gelman Fitting and understanding multilevel models

Page 154: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Average predictive effects

I What is E (y | x1 = high) − E(y | x1 = low), with all other x ’sheld constant?

I In general, difference can depend on xI Average over distribution of x in the data

I You can’t just use a central value of x

I Compute APE for each input variable x

I Multilevel factors are categorical input variables

Andrew Gelman Fitting and understanding multilevel models

Page 155: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Average predictive effects

I What is E (y | x1 = high) − E(y | x1 = low), with all other x ’sheld constant?

I In general, difference can depend on xI Average over distribution of x in the data

I You can’t just use a central value of x

I Compute APE for each input variable x

I Multilevel factors are categorical input variables

Andrew Gelman Fitting and understanding multilevel models

Page 156: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Average predictive effects

I What is E (y | x1 = high) − E(y | x1 = low), with all other x ’sheld constant?

I In general, difference can depend on xI Average over distribution of x in the data

I You can’t just use a central value of x

I Compute APE for each input variable x

I Multilevel factors are categorical input variables

Andrew Gelman Fitting and understanding multilevel models

Page 157: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Average predictive effects

I What is E (y | x1 = high) − E(y | x1 = low), with all other x ’sheld constant?

I In general, difference can depend on xI Average over distribution of x in the data

I You can’t just use a central value of x

I Compute APE for each input variable x

I Multilevel factors are categorical input variables

Andrew Gelman Fitting and understanding multilevel models

Page 158: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Average predictive effects

I What is E (y | x1 = high) − E(y | x1 = low), with all other x ’sheld constant?

I In general, difference can depend on xI Average over distribution of x in the data

I You can’t just use a central value of x

I Compute APE for each input variable x

I Multilevel factors are categorical input variables

Andrew Gelman Fitting and understanding multilevel models

Page 159: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

APE: why you can’t just use a central value of x

−6 −4 −2 0 2 4 6 80.0

0.2

0.4

0.6

0.8

1.0

v

E(y

|u, v

)

u = 1 u = 0

| || || ||| | |||| | ||| ||| | || || || || ||| || ||| | | |

avg pred effect = 0.03

pred effect at E(v) = 0.24

−6 −4 −2 0 2 4 6 80.0

0.2

0.4

0.6

0.8

1.0

v

E(y

|u, v

)

u = 1 u = 0

||| | | || ||| ||| ||| | | |||| || | |||| ||| |||||| ||

avg pred effect = 0.11

pred effect at E(v) = 0.03

Andrew Gelman Fitting and understanding multilevel models

Page 160: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Understanding sources of variation

I Generalization of R2 (explained variance), defined at eachlevel of the model

I Partial pooling factor, defined at each levelI Analysis of variance

I Summarize the scale of each batch of predictorsI Go beyond classical null-hypothesis-testing framework

I Open question: how to construct models with deepinteraction structures?

Andrew Gelman Fitting and understanding multilevel models

Page 161: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Understanding sources of variation

I Generalization of R2 (explained variance), defined at eachlevel of the model

I Partial pooling factor, defined at each levelI Analysis of variance

I Summarize the scale of each batch of predictorsI Go beyond classical null-hypothesis-testing framework

I Open question: how to construct models with deepinteraction structures?

Andrew Gelman Fitting and understanding multilevel models

Page 162: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Understanding sources of variation

I Generalization of R2 (explained variance), defined at eachlevel of the model

I Partial pooling factor, defined at each levelI Analysis of variance

I Summarize the scale of each batch of predictorsI Go beyond classical null-hypothesis-testing framework

I Open question: how to construct models with deepinteraction structures?

Andrew Gelman Fitting and understanding multilevel models

Page 163: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Understanding sources of variation

I Generalization of R2 (explained variance), defined at eachlevel of the model

I Partial pooling factor, defined at each levelI Analysis of variance

I Summarize the scale of each batch of predictorsI Go beyond classical null-hypothesis-testing framework

I Open question: how to construct models with deepinteraction structures?

Andrew Gelman Fitting and understanding multilevel models

Page 164: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Understanding sources of variation

I Generalization of R2 (explained variance), defined at eachlevel of the model

I Partial pooling factor, defined at each levelI Analysis of variance

I Summarize the scale of each batch of predictorsI Go beyond classical null-hypothesis-testing framework

I Open question: how to construct models with deepinteraction structures?

Andrew Gelman Fitting and understanding multilevel models

Page 165: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Understanding sources of variation

I Generalization of R2 (explained variance), defined at eachlevel of the model

I Partial pooling factor, defined at each levelI Analysis of variance

I Summarize the scale of each batch of predictorsI Go beyond classical null-hypothesis-testing framework

I Open question: how to construct models with deepinteraction structures?

Andrew Gelman Fitting and understanding multilevel models

Page 166: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Understanding sources of variation

I Generalization of R2 (explained variance), defined at eachlevel of the model

I Partial pooling factor, defined at each levelI Analysis of variance

I Summarize the scale of each batch of predictorsI Go beyond classical null-hypothesis-testing framework

I Open question: how to construct models with deepinteraction structures?

Andrew Gelman Fitting and understanding multilevel models

Page 167: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Conclusions

I Multilevel modeling is not just for grouped data

I New ideas needed to fit, understand, display, and summarizeeach level of the model

I General framework for modeling treatment effects that vary

I It’s not just about “data fitting” or “getting the rightstandard errors”

Andrew Gelman Fitting and understanding multilevel models

Page 168: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Conclusions

I Multilevel modeling is not just for grouped data

I New ideas needed to fit, understand, display, and summarizeeach level of the model

I General framework for modeling treatment effects that vary

I It’s not just about “data fitting” or “getting the rightstandard errors”

Andrew Gelman Fitting and understanding multilevel models

Page 169: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Conclusions

I Multilevel modeling is not just for grouped data

I New ideas needed to fit, understand, display, and summarizeeach level of the model

I General framework for modeling treatment effects that vary

I It’s not just about “data fitting” or “getting the rightstandard errors”

Andrew Gelman Fitting and understanding multilevel models

Page 170: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Conclusions

I Multilevel modeling is not just for grouped data

I New ideas needed to fit, understand, display, and summarizeeach level of the model

I General framework for modeling treatment effects that vary

I It’s not just about “data fitting” or “getting the rightstandard errors”

Andrew Gelman Fitting and understanding multilevel models

Page 171: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Conclusions

I Multilevel modeling is not just for grouped data

I New ideas needed to fit, understand, display, and summarizeeach level of the model

I General framework for modeling treatment effects that vary

I It’s not just about “data fitting” or “getting the rightstandard errors”

Andrew Gelman Fitting and understanding multilevel models

Page 172: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models

Page 173: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models

Page 174: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models

Page 175: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models

Page 176: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models

Page 177: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models

Page 178: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models

Page 179: Fitting and understanding Multilevel Models-Andrew Gelman

EffectivenessUbiquity

Way of life

Building and fitting modelsDisplaying and summarizing inferencesConclusions

Multilevel modeling

I Why?I Make use of lots of information out there that’s already

collectedI Use MLM to adjust for time effects, state effects,

survey-organization effectsI Don’t freak out because you have “too many predictors”

I How?I Make your studies broader and deeperI Make plots and compute summaries to understand the

estimated model

Andrew Gelman Fitting and understanding multilevel models