25
Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Embed Size (px)

DESCRIPTION

False positive problem

Citation preview

Page 1: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Assessment of genomewide association studies

Tuan V. NguyenGarvan Institute of Medical Research

Sydney, Australia

Page 2: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

WHICH GENES ?

Gene variants ?

Page 3: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

False positive problem

Page 4: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Candidate gene studies: reproducibility problem

600 positive associations between common gene variants and disease reported 1986-2000

J N Hirschhorn et al. Genetics in Medicine 2002

166 were studied 3+ times

6 have been consistently replicated

Page 5: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Introduction to genomewide association studies

Page 6: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Genomewide association studies (GWA)

• Revolution in gene search• Hypothesis-free driven approach• Scan 100,000-500,000 gene variants (SNPs)• Case – control design (>1000 individuals)

Massive number of tests of hypothesis

Page 7: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Recent GWA studies in osteoporosis

• Styrkarsdottir U, et al (2008) Multiple genetic loci for bone mineral density and fractures. N Engl J Med 358:2355-2365.

• van Meurs JB, et al (2008) Large-scale analysis of association between LRP5 and LRP6 variants and osteoporosis. JAMA 299:1277-1290.

• Richards JB, et al (2008) Bone mineral density, osteoporosis, and osteoporotic fractures: a genome-wide association study. Lancet 371:1505-1512.

Page 8: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Some gene variants from GWAGene variant (SNP) Gene or location Trait and P-valuers3736228 11q13 (LRP5) BMD (p = 2.6 × 10-9)

Fracture (p = 0.02)rs3736228 11q13 (LRP5) BMD (p = 6.3 × 10-12)

Fracture (p = 0.002)rs4355801 LRP5rs4988321 11q13 (LRP5) BMD (p = 3.3 × 10-8)

Fracture (p = 0.002)rs2302685 12p12 (LRP6) BMD (p = 0.97)

Fracture (p = 0.95)rs4355801 8q24 (TNFRSF11B) BMD (p = 7.6 × 10-10)rs7524102 1p36 (ZBTB40) BMD (p = 9.2 × 10-19)

Fracture (p = 8.4 × 10-4)rs6696981 1p36 (close to ZBTB40) BMD (p = 1.7 × 10-7)

Fracture (p = 2.4 × 10-4)rs3130340 6p21 () BMD (p = 1.2 × 10-7)

Fracture (p = 0.008)rs9479055 6q25 (1) BMD (p = 6.2 × 10-7)rs4870044 6q25 (1) BMD (p = 1.6 × 10-11)rs1038304 6q25 (1) BMD (p = 4.0 × 10-11)rs6929137 6q25 (1) BMD (p = 2.5 × 10-10)rs1999805 6q25 (1) BMD (p = 2.2 × 10-8)rs6993813 8q24 (OPG) BMD (p = 1.8 × 10-14)

Fracture (p = 0.04)rs6469804 8q24 (OPG) BMD (p = 7.4 × 10-15)

Fracture (p = 0.052)rs9594738 13q14 (RANKL) BMD (p = 2.0 × 10-21)rs9594759 13q14 (RANKL) BMD (p = 1.1 × 10-16)rs11898505 2p16 (SPTBN1) Fracture (p = 1.8 × 10-4)rs3018362 18q21 (RANK) Fracture (p = 0.005)rs2306033 11p11 (LRP4) Fracture (p = 0.007)rs7935346 11p11 (LRP4) Fracture (p = 0.02)

Page 9: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

What is the credibility of a GWA finding ?

Page 10: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

An observed association with p<0.05 does not necessarily mean

that the association exists.

In 100,000 tests, 5000 positive findings could be false positive

Page 11: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Diagnostic test and association test

Diseased

YES

+ve -ve

NO

+ve -ve

Association

True

+ve -ve

False

Sensitivity

P(+ve | D)

False +ve Power P-value

P(+ve | False)

+ve -ve

Page 12: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

What do want we to know?

• Probability of association given observed data (eg posterior probability of association)

or

• Probability of observing data if there is no association (P-value)

Page 13: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Posterior probability of association

• Prior probability of association ()• Power = Pr(significance | association)

Sample size• P-value = Pr(significance | no association)

Effect size

is a function of

Page 14: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

What is the prior probability of association for a gene variant ?

Page 15: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Gene search = finding small needles in a VERY large haystack

• Human genome ~3 billion base pairs longBUT: Most are vanishingly rare

• 99.9% identical between any two individuals

• ~90% differences between any two individuals is due to common variants

Page 16: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Hypotheses• Common disease / common variants (CD/CV)

(Reich & Lander 2001, Pritchard et al 2005)

• ~90% differences between any two individuals is due to common variants

Page 17: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Prior probability of association ()

• Common variants in the human population: 10 million (Kruglyak and Nickerson Net Gent 2001)

• No. of genetic variants associated with a common disease ~100 or less (Yang et al, Int J Epidemiol 2005)

Prior probability of association

= 0.000001

Page 18: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

A Bayesian interpretation of association

10,000,000 common variants

True association (100) No association (9,999,900)

Significant (95)

Non-significant (5)

Significant (100)

Non-significant (9,999,800)

P(True association given a significant result) = 95 / (95+195) = 48%

Power = 95%; P-value=0.00001

Page 19: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

A Bayesian interpretation of association

10,000,000 common variants

True association (100) No association (9,999,900)

Significant (95)

Non-significant (5)

Significant (1)

Non-significant (9,999,800)

P(True association given a significant result) = 95 / (95+1) = 99%

Power = 95%; P-value=0.00000001

Page 20: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

P-value and “true” association

P-value in the range of 5% - 0.1% will virtually be false positives even in large scale studies

Page 21: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

P-value for a reliable association

P < 5 x 10-5

or P < 5 x 10-8

For 1000 cases and 1000 controls,

p< 10-8 are more likely to be true than false

Page 22: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Some gene variants from GWAGene variant (SNP) Gene or location Trait and P-valuers3736228 11q13 (LRP5) BMD (p = 2.6 × 10-9)

Fracture (p = 0.02)rs3736228 11q13 (LRP5) BMD (p = 6.3 × 10-12)

Fracture (p = 0.002)rs4355801 LRP5rs4988321 11q13 (LRP5) BMD (p = 3.3 × 10-8)

Fracture (p = 0.002)rs4355801 8q24 (TNFRSF11B) BMD (p = 7.6 × 10-10)rs7524102 1p36 (ZBTB40) BMD (p = 9.2 × 10-19)

Fracture (p = 8.4 × 10-4)rs9479055 6q25 (1) BMD (p = 6.2 × 10-7)rs4870044 6q25 (1) BMD (p = 1.6 × 10-11)rs1038304 6q25 (1) BMD (p = 4.0 × 10-11)rs6929137 6q25 (1) BMD (p = 2.5 × 10-10)rs1999805 6q25 (1) BMD (p = 2.2 × 10-8)rs6993813 8q24 (OPG) BMD (p = 1.8 × 10-14)

Fracture (p = 0.04)rs6469804 8q24 (OPG) BMD (p = 7.4 × 10-15)

Fracture (p = 0.052)rs9594738 13q14 (RANKL) BMD (p = 2.0 × 10-21)rs9594759 13q14 (RANKL) BMD (p = 1.1 × 10-16)rs11898505 2p16 (SPTBN1) Fracture (p = 1.8 × 10-4)rs3018362 18q21 (RANK) Fracture (p = 0.005)rs2306033 11p11 (LRP4) Fracture (p = 0.007)rs7935346 11p11 (LRP4) Fracture (p = 0.02)

Page 23: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Number of individuals needed to screen in population and family

Hypothetical gene Fracture risk in

Population Family

Relative risk 5 10

Cumulative risk 40% 80%

Cumulative risk after Rx 20% 40%

Number needed to treat 5 2.5

Frequency of risk “genotype”

0.2% 50%

Number needed to screen 2500 5

Page 24: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

How many genes are required for a “good” fracture prognosis ?

Odds ratio

Genotype frequency

Number of genes needed for AUC of

0.70 0.80 0.90 0.95

1.1 5% >400 >400 >400 >400

10% 330 >400 >400 >400

30% 150 >400 >400 >400

1.5 5% 33 100 280 >400

10% 19 50 150 330

30% 9 23 70 160

Page 25: Assessment of genomewide association studies Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia

Assessment of GWA finding

• Genetic components of BMD and fracture

• Finding genes of osteoporosis: a challenge

• Genes can help improve the prognosis of fracture