31
IBC 2014 Florence, Italy July 7 1/ 31 ngoeyvae @its.jnj.com Exponential-family random graph models for within-household contact networks Nele Goeyvaerts 1;2;3 , Gail Potter 4 , Kim Van Kerckhove 1;2 , Lander Willem 2;1 , Philippe Beutels 2 , Niel Hens 1;2 1 I-BioStat, Hasselt University, Belgium 2 CHERMID, Vaccine & Infectious Disease Institute, University of Antwerp, Belgium 3 Non-Clinical Statistics, Janssen Pharmaceutical Companies of Johnson & Johnson, Belgium 4 Statistics Department, California Polytechnic State University, USA

Exponential-family random graph models for within ......Final size (individuals) Frequency 0 200 600 1000 0 20 40 60 80 100 120. IBC 2014 ... Household and community transmission parameters

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

  • IBC 2014

    Florence, Italy

    July 7

    1/ 31

    ngoeyvae

    @its.jnj.com

    Exponential-family random graph models

    for within-household contact networks

    Nele Goeyvaerts1,2,3, Gail Potter4, Kim Van Kerckhove1,2,

    Lander Willem2,1, Philippe Beutels2, Niel Hens1,2

    1 I-BioStat, Hasselt University, Belgium

    2 CHERMID, Vaccine & Infectious Disease Institute, University of Antwerp, Belgium

    3 Non-Clinical Statistics, Janssen Pharmaceutical Companies of Johnson & Johnson, Belgium

    4 Statistics Department, California Polytechnic State University, USA

    Interuniversity Institute for Biostatistics and statistical Bioinformatics

  • IBC 2014

    Florence, Italy

    July 7

    2/ 31

    ngoeyvae

    @its.jnj.com

    Transmission within households

    • Transmission of airborne infectious diseases driven bysocial contacts

    • Households are important units: clusters characterized byintense contact

    • Epidemic models often assume random mixing withinhouseholds:

    • Households model: 2 levels of mixing (Ball et al., 1997)

    • IBMs: random mixing in households, schools, workplaces(Halloran et al., 2002; Ferguson et al., 2006)

    • No direct empirical evidence to support the assumption ofrandom mixing within households

  • IBC 2014

    Florence, Italy

    July 7

    3/ 31

    ngoeyvae

    @its.jnj.com

    Flemish household survey

    2010-2011

    • POLYMOD-like diary-based contact survey in HHs

    • Flemish geographical region including Brussels

    • HHs with young children: at least 1 child aged ≤ 12 years

    • Stratified sampling by age/gender youngest child, HHcomposition (1 or 2 parents), weekday-weekend, province

    • HH member = living > 50% of the time in the same HH

    • Contact = 2-way conversation (< 3m) or physical contact

  • IBC 2014

    Florence, Italy

    July 7

    4/ 31

    ngoeyvae

    @its.jnj.com

    Sample sizes

    • 1266 participants from 318 HHs who recorded 19685contacts in total

    • HH sizes range from 2 to 7

    • 3821 identified within-HH contacts with 98% reciprocity

    • Assuming reciprocity of recorded within-HH contacts:

    • 1946 distinct within-household contacts

    • 96% involve skin-to-skin touching

  • IBC 2014

    Florence, Italy

    July 7

    5/ 31

    ngoeyvae

    @its.jnj.com

    Reciprocity

    Households represented by undirected contact networks

    • Nodes: household members

    • Edges: within-household contacts

    Household Size 2 Household Size 2

  • IBC 2014

    Florence, Italy

    July 7

    6/ 31

    ngoeyvae

    @its.jnj.com

    Within-HH contact networks

    Focus on physical contacts:

  • IBC 2014

    Florence, Italy

    July 7

    7/ 31

    ngoeyvae

    @its.jnj.com

    ERGMs to model within-HH

    contact networks

    • Exponential-family Random Graph Models (ERGMs)provide statistical framework to infer on processes driving

    social interactions

    • Y random adjacency matrix across all households:

    Yij = Yji =

    {1 if HH members i and j make physical contact

    0 otherwise

    • Ω support of Y i.e. the set of all obtainable networks

  • IBC 2014

    Florence, Italy

    July 7

    8/ 31

    ngoeyvae

    @its.jnj.com

    Definition ERGM

    • Exponential-family Random Graph Model (ERGM):

    Pθ,Ω(Y = y) =exp{θTg(y,X)}

    κ(θ,Ω), y ∈ Ω

    • g(y,X) vector of network statistics

    • θ corresponding vector of coefficients

    • X additional covariate information

    • κ(θ,Ω) normalizing factor

  • IBC 2014

    Florence, Italy

    July 7

    9/ 31

    ngoeyvae

    @its.jnj.com

    Inference for ERGMs

    • Approximate MLE using Markov Chain Monte Carlo in aNewton-Raphson type algorithm (Geyer and Thompson,JRSS-B 1992):

    1 MCMC simulation of a distribution of random networks

    from starting set of parameter values

    2 Parameter values refined by comparing distribution of

    networks against observed network

    3 Process repeated until parameter estimates stabilize

    • Software: ergm package in R, part of the statnet suite ofpackages for statistical network analysis (Hunter et al., J.

    Stat. Soft. 2008)

  • IBC 2014

    Florence, Italy

    July 7

    10/ 31

    ngoeyvae

    @its.jnj.com

    Goodness-of-fit

    • Simulate within-HH contact networks from fitted ERGMusing MCMC

    • Compare specific contact network characteristics to theobserved ones

    • Proportion of complete networks: complete = every HHmember makes contact with every other HH member

    • Household network density = # observed edges# potential edges

    • Measure of clustering = # observed triangles# potential triangles

  • IBC 2014

    Florence, Italy

    July 7

    11/ 31

    ngoeyvae

    @its.jnj.com

    Goodness-of-fit

    Hunter et al. (J. Stat. Soft., 2008)

  • IBC 2014

    Florence, Italy

    July 7

    12/ 31

    ngoeyvae

    @its.jnj.com

    Network statistics g(y,X)

    • Dyad independent statistics:

    Network statistic Legend

    Edges Total no of edges

    Within-household edges Total no of edges within households

    Child-father mixing Total no of edges between children and fathers

    Child-mother mixing Total no of edges between children and mothers

    Father-mother mixing Total no of edges between partners

    Boy-boy mixing Total no of edges between boys

    Girl-girl mixing Total no of edges between girls

    Age effect children Sum age(i) + age(j), ∀ (i, j) between siblings

    Small households (≤ 3) Total no of edges within HHs of size ≤ 3Large households (≥ 5) Total no of edges within HHs of size ≥ 5

    • Dyad dependent statistics: total number of isolates,2-stars, triangles, triangles in households of size ≥ 6

  • IBC 2014

    Florence, Italy

    July 7

    13/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekday

    Network statistic Estimate s.e. p-value

    Edges -29.48 0.26 0.00

    Within-household edges 30.40 0.26

  • IBC 2014

    Florence, Italy

    July 7

    14/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekday

    2 3 4 5 6−7 total

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Simulated networks n = 500

    Household size

    Pro

    port

    ion

    com

    plet

    e ne

    twor

    ks

  • IBC 2014

    Florence, Italy

    July 7

    15/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekday

    2 3 4 5 6−7 total

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Simulated networks n = 500

    Household size

    Mea

    n ne

    twor

    k de

    nsity

  • IBC 2014

    Florence, Italy

    July 7

    16/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekday

    3 4 5 6−7 total

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Simulated networks n = 500

    Household size

    Obs

    erve

    d vs

    . pot

    entia

    l tria

    ngle

    s

  • IBC 2014

    Florence, Italy

    July 7

    17/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekend day

    Network statistic Estimate s.e. p-value

    Edges -21.23 0.76

  • IBC 2014

    Florence, Italy

    July 7

    18/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekend day

    3 4 6−7 total

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Simulated networks n = 500

    Household size

    Pro

    port

    ion

    com

    plet

    e ne

    twor

    ks

  • IBC 2014

    Florence, Italy

    July 7

    19/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekend day

    3 4 6−7 total

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Simulated networks n = 500

    Household size

    Mea

    n ne

    twor

    k de

    nsity

  • IBC 2014

    Florence, Italy

    July 7

    20/ 31

    ngoeyvae

    @its.jnj.com

    ERGM: weekend day

    3 4 5−7 total

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Simulated networks n = 500

    Household size

    Obs

    erve

    d vs

    . pot

    entia

    l tria

    ngle

    s

  • IBC 2014

    Florence, Italy

    July 7

    21/ 31

    ngoeyvae

    @its.jnj.com

    Epidemic simulation model

    • Discrete-time chain binomial SIR model

    • Closed, fully susceptible population of households

    • Households model, 2 levels of mixing (Ball et al., 1997):• High-intensity mixing within households withβh = P(transmission) per physical contact, per time step

    • Low-intensity random mixing in the community: socialcontact hypothesis βc(a, a

    ′) = qc · c(a, a′) with c(a, a′)physical contact rates (POLYMOD Belgium)

    • 2 assumptions for within-household mixing:• Realistic mixing: contact networks simulated from ERGM

    • Random mixing

  • IBC 2014

    Florence, Italy

    July 7

    22/ 31

    ngoeyvae

    @its.jnj.com

    Probability of infection

    • At each time step, each susceptible i acquires infectionwith probability:

    pi1 = 1 − (1 − βh · δh)∑

    j 6=i∈hi yijIj · (1 − βc,11)∑

    j /∈hiIj1 · (1 − βc,12)

    ∑j /∈hi

    Ij2

    pi2 = 1 − (1 − βh · δh)∑

    j 6=i∈hi yijIj · (1 − βc,21)∑

    j /∈hiIj1 · (1 − βc,22)

    ∑j /∈hi

    Ij2

    • Index 1 = children ≤ 18 y, index 2 = adults > 18 y• δh = 1, for realistic mixingδh = network density, for random mixing

    • hi: household of node i

    • For random mixing: yij = 1,∀i 6= j ∈ same HH,• Ij : indicates whether node j is infected (1) or not (0)

  • IBC 2014

    Florence, Italy

    July 7

    23/ 31

    ngoeyvae

    @its.jnj.com

    Epidemic model assumptions

    • Susceptibility/infectiousness independent of age

    • No latent period, thus infected = infectious

    • Constant recovery probability for infectious individuals,mean infectious period ≈ 3.5 days

    • βh and qc → explore several values based on literatureestimates

    • Results from 500 stochastic epidemic simulations

  • IBC 2014

    Florence, Italy

    July 7

    24/ 31

    ngoeyvae

    @its.jnj.com

    Results: incidence

    0 20 40 60 80 100

    010

    2030

    4050

    Random mixing

    Time

    Tota

    l inc

    iden

    ce

    0 20 40 60 80 100

    010

    2030

    4050

    ERGM

    TimeTo

    tal i

    ncid

    ence

  • IBC 2014

    Florence, Italy

    July 7

    25/ 31

    ngoeyvae

    @its.jnj.com

    Results: final size

    Random mixing

    Final size (individuals)

    Fre

    quen

    cy

    0 200 600 1000

    020

    4060

    8010

    0

    ERGM

    Final size (individuals)F

    requ

    ency

    0 200 600 1000

    020

    4060

    8010

    012

    0

  • IBC 2014

    Florence, Italy

    July 7

    26/ 31

    ngoeyvae

    @its.jnj.com

    Results: final fraction

    Random ERGM

    0.2

    0.4

    0.6

    0.8

    Pro

    port

    ion

    indi

    vidu

    als

    affe

    cted

    Random ERGM

    0.2

    0.4

    0.6

    0.8

    Pro

    port

    ion

    hous

    ehol

    ds a

    ffect

    ed

  • IBC 2014

    Florence, Italy

    July 7

    27/ 31

    ngoeyvae

    @its.jnj.com

    Results: HH attack rate

    Random ERGM

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Household size 2

    Hou

    seho

    ld a

    ttack

    rat

    e

    Random ERGM

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Household size 3

    Hou

    seho

    ld a

    ttack

    rat

    e

    Random ERGM

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Household size 4

    Hou

    seho

    ld a

    ttack

    rat

    e

  • IBC 2014

    Florence, Italy

    July 7

    28/ 31

    ngoeyvae

    @its.jnj.com

    Results: HH attack rate

    Random ERGM

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Household size 5

    Hou

    seho

    ld a

    ttack

    rat

    e

    Random ERGM

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Household size >=6

    Hou

    seho

    ld a

    ttack

    rat

    e

    Random ERGM

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Total

    Hou

    seho

    ld a

    ttack

    rat

    e

  • IBC 2014

    Florence, Italy

    July 7

    29/ 31

    ngoeyvae

    @its.jnj.com

    Summary

    • First contact survey designed to measure socialinteractions within households

    • ERGMs show high degree of clustering and decreasingconnectedness with increasing HH size on weekdays

    • Epidemic simulation results seem to support assumptionof random mixing within households

    • Further research: impact of control strategies,heterogeneity in duration of contact

    • Assumption: physical contacts = good proxy

  • IBC 2014

    Florence, Italy

    July 7

    30/ 31

    ngoeyvae

    @its.jnj.com

    Acknowledgements

    • AXA Research Fund

    • Gail Potter (California Polytechnic State University)

    • Kim Van Kerckhove (Hasselt University and UAntwerp)

    • Lander Willem (UAntwerp and Hasselt University)

    • Philippe Beutels (UAntwerp)

    • Niel Hens (Hasselt University and UAntwerp)

  • IBC 2014

    Florence, Italy

    July 7

    31/ 31

    ngoeyvae

    @its.jnj.com

    References

    Ball, F., D. Mollison, and G. Scalia-Tomba (1997).

    Epidemics with two levels of mixing.

    The Annals of Applied Probability 7, 46–89.

    Ferguson, N. M., D. A. T. Cummings, C. Fraser, J. C. Cajka, P. C. Cooley, and D. S. Burke (2006).

    Strategies for mitigating an influenza pandemic.

    Nature Letters 442, 448–452.

    Geyer, C. J. and E. A. Thompson (1992).

    Constrained monte carlo maximum likelihood calculations.

    Journal of the Royal Statistical Society B 54, 657–699.

    Hunter, D. R., M. S. Handcock, C. T. Butts, S. M. Goodreau, and M. Morris (2008).

    ergm: A package to fit, simulate and diagnose exponential-family models for networks.

    Journal of Statistical Software 24, 1–29.

    Keeling, M. J. and K. T. D. Eames (2005).

    Networks and epidemic models.

    Journal of the Royal Society Interface 2, 295–307.

    Kolaczyk, E. D. (2009).

    Statistical Analysis of Network Data: Methods and Models.

    Springer, New York.

    Longini, Jr., I. M. and J. S. Koopman (1982).

    Household and community transmission parameters from final distributions of infections in

    households.

    Biometrics 38, 115–126.