30
Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Embed Size (px)

Citation preview

Page 1: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants

Journal club (Nov/13)SH Lee

Page 2: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Introduction

• Sequence data– Rare and unidentified variants

• Groupwise association tests– Omnibus tests– Burden test, CMC test, SKAT test• Up-weighting for rare, • down-weighting for common• Rare/common variants tested separately

Page 3: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Introduction• This study develops a joint test of rare/common– Combining burden/SKAT test for rare/common

• Can be applied to – whole exome sequencing + GWAS – Deep resequencing of GWAS loci

• Basically can analyse all variants including rare, low-frequency and common variants

• Simulation (type 1 error, power)• Real data, CD and Autism

Page 4: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Materials and Methods

Definition of rare/common• <0.01 rare• 0.01-0.05 low frequency• >0.05 common

Or• <1/sqrt(2*n) rare • >1/sqrt(2*n) common– n = 500, rare MAF < 0.031– n = 10000, rare MAF < 0.007

Page 5: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Materials and Methods

• Testing for the overall effect of rare and common variants– Rare for Burden test– Common for SKAT test

Weighted-sum statisticsFishers method of combining the p values

Page 6: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Weighted-sum statistics

• Within a region (e.g. a gene) having m variants– g(*) is a linear or logistic link function – Alpha is for covariates– X is n x m matrix– Beta is regression coefficient and random variable

Page 7: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Weighted sum score test(Variance component score test)

Taking the first derivative of log-likelihood respect with the variance τ

P-value from κχ2ν

κ is scale parameter, v is degree of freedom

Page 8: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Weighted sum score test(Variance component score test)

Wu et al (2010) AJHG 86: 929; Liu et al (2008) BMC Bioinformatics 8: 292;

Lin (1997) Biometrika 84: 309; White (1982) Econometrica 50: 1

Page 9: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Weighted sum score test(Variance component score test)

• ρ : the correlation between regression coefficients • If perfectly correlated (ρ = 1), they will be all the same after

weighting, and one should collapse the variants first before running regression, i.e., the burden test

• If the regression coefficients are unrelated to each other, one should use SKAT

Lee et al. (2012) AJHG 91: 224

Page 10: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Burden-C, SKAT-C

• Combined test statistic for rare and common– Weighting beta(p,1,25) for rare, – beta(p,0.5,0.5) for common

• Partitioning rare and common variants

Page 11: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Other methods

• Burden-A, SKAT-A– Adaptive combining rare/common– Searching φ for the minimum p-value

• Burden-F, SKAT-F– Fisher’s combination method

Page 12: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Simulation

• Sequence data on 10,000 haplotypes on 1 Mb region

• Calibrated model for the European pop• Random sample of a region of 5 or 25 kb and

simulated data with 1000-5000 individuals • Proportion of cases in the sample is 0.5

Page 13: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Disease model

Page 14: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Methods

Page 15: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Type I error

• The proposed methods agrees with the expectation

Page 16: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power (separation cut-off)

• Using burden-C test• Power with different separation cut-offs• 1/sqrt(2n) will be used further

Page 17: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power (proposed methods)

• Power for 8 different tests• The proposed combination tests outperform

Page 18: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power

• Rare/common causal variants (model 1, 2, 3, 6)– The combination methods perform better

Page 19: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power

• Common causal variants (model 5)– The combination methods perform better

• Rare causal variants (model 4)– The combination methods perform similarly

Page 20: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power (proposed methods)

•The proposed combination methods outperform CMC for all 6 disease models•The proposed combination methods outperform the original SKAT for all 6 disease models

Page 21: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power

•For model 1-4 which include only risk variants• SKAT better than Burden when prop. risk variants is small (10%)• Burden better than SKAT when prop. risk variants is large (30%)

Page 22: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power

• Model 1-3 which include both rare/common• SKAT-F better than burden-F regardless of prop. risk variants

• Model 5 which include only common risk variants• SKAT better than burden regardless of prop. risk variants

Page 23: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Power

• Adaptive test (SKAT-A, Burden-A)– Perform worse than SKAT-C and Burden-C

• Results for a region of size 5 kb were similar

Page 24: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Real data

• CD NOD2 sequence data – 453 cases, 103 controls– 60 single nucleotide variations (9 of them have >

MAF 0.05)– Because only pooled frequency counts available

for each variants, sequencing data were simulated.

• Autism LRP2 sequencing data– 430 cases, 379 controls

Page 25: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Real data

• The combination methods powerful than others

Page 26: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Discussion

• The proposed combination methods– Partitioning rare/common– Powerful approach– Better than CMC (rare/common partitioning)– Better than original Burden and SKAT test – Extend to family-based designs

Page 27: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Discussion

• T1D HLA region – SKAT (2.7e-43)– Wald test (6.7e-49)– Likelihood ratio test (8.9e-221)

• LD between regions • Multiple different components within a region

Page 28: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

• Thanks

Page 29: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

Linear SKAT vs individual variant test statistics

• Linear SKAT (lower) and individual variant test (upper) is equivalent

Page 30: Sequential Kernel Association Tests for the Combined Effect of Rare and Common Variants Journal club (Nov/13) SH Lee

• Three disease model for power comparison