13
1 Genomic risk scores for juvenile idiopathic arthritis and its subtypes Supplementary Material Results Figure S1: Outline of the study design. UK n: 7,505 SNPs: ~6M CLARITY n: 940 SNPs: ~6M CHOP n: 3,513 SNPs: ~6M Datasets Testing Covariables (PCs & Sex) GRS Model Compute Model (SparSNP) Test Sets Training Set Test Measurements Getting SNPs weights AUC OR JIA Subtypes Study

Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

1

Genomic risk scores for juvenile idiopathic arthritis and its subtypes Supplementary Material Results Figure S1: Outline of the study design.

UKn: 7,505

SNPs: ~6M

CLARITYn: 940

SNPs: ~6M

CHOPn: 3,513

SNPs: ~6M

Datasets

TestingCovariables(PCs & Sex)

GRS Model

Compute Model

(SparSNP)

Test Sets

Training Set

Test Measurements

Getting SNPs

weights

AUCOR

JIA Subtypes Study

Page 2: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

2

Figure S2: Cross-validation results within the UK dataset. The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs assigned a non-zero weight in the model. The best model was selected at 26 SNPs with predictive measure 0.671 with (0.668, 0.674) 95% CI.

0.58

0.60

0.62

0.65

0.68

8 64 512 4096Number of SNPs in model

AUC

Page 3: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

3

Training the genomic risk scores Figure S3: A) Genome-wide association study (GWAS) of the UK dataset adjusted by its first ten principal components. The highlighted points correspond to the SNPs with non-zero weight selected by the GRS model. B) Weights associated for each of the SNPs selected for the model.

A)

B)

Page 4: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

4

Extensions to the model MetaGRS We explored alternative approach to enhance the predictive power of our current JIA model. Given the strong pleiotropy across autoimmune diseases, we hypothesised that it may be possible to extract more predictive signal from GRSs generated for other autoimmune diseases via a metaGRS approach (cite our JACC paper), particularly given the limited sample sizes available for JIA GWAS. Figure S4 illustrates the pipeline followed to compute the metaGRS model. Table S1 shows 16 autoimmune disease summary statistic (GWAS) sets that were considered. We generated GRSs by LD-clumping (plink --clump) each summary statistic using the CLARITY dataset. Figure S5 shows the correlations between the GRSs including our JIA GRS model. Next, using CLARITY as a training set, we applied elastic-net regression in ten-fold cross-validation (using the glmnet R package1) to combine all the GRSs into a single weighted 'meta' GRS. Figure S6 shows the model performance (binomial deviance) across a range of values of the elastic net penalty, in cross-validation on the training set, where we selected the model with the lowest binomial deviance (the parameter s="lambda.min" in the glmnet package). Finally, the weights for this model were used to construct the JIA metaGRS model. Figure S7 compares the effect size (OR) of each of the GRSs individually (logistic regression) and their effect size as part of the JIA metaGRS (lasso logistic regression), in CLARITY. Finally, Table S2 presents the OR and AUC obtained by our JIA GRS and metaGRS model GRSs. The results showed improvement of >1% in AUC in compare with JIA GRS, however the uncertainty in estimates was large due to limited sample sizes.

Page 5: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

5

Figure S4: Workflow of computing the MetaGRS.

SummaryStatistics

1

SummaryStatistics

16

Preparing the Data

GRS1

GRS 16

LD-Clumping (plink –clump)

Combining the GRSs

TestingAUCOR

CLARITY

Compute GRS (plink –score)

. . . Our JIA GRS

Phenotype(CLARITY)

Deriving the MetaGRS(R glmnet)

CHOP

JIA MetaGRS

Page 6: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

6

Table S1: List of external summary statistics (GWAS) used to compute the JIA MetaGRS. Additionally, this table provides the final number of SNPs with non-zero weight from each GRS.

GRS label

Trait name

Number of SNPs after clumping

Study reference

SLE

Systemic lupus erythematosus

1,022

2

MS Multiple Sclerosis

546 3

MS (ic) Multiple Sclerosis

964 4

PSO Psoriasis 615 5 NAR Narcolepsy 44 6 CEL Celiac 1,141 7 T1D (cc) Type 1

Diabetes 380 8

T1D (meta) Type 1 Diabetes

457 8

RA (Okada) Rheumatoid Arthritis

1,872 9

RA (Stahl) Rheumatoid Arthritis

816 10

RA (Eyre) Rheumatoid Arthritis

658 11

PBC (Liu) Primary Biliary Cirrhosis

449 12

PBC (Cordell) Primary Biliary Cirrhosis

524 13

AS Ankylosing Spondylitis

378 14

UC Ulcerative Colitis

827 15

JIA (Hinks)

Juvenile Idiopathic Arthriris

256

16

Page 7: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

7

Figure S5: Correlation matrix between the JIA GRS model and all the external GRSs generated to create the JIA MetaGRS model. All the GRSs were computed using the CLARITY dataset.

Figure S6: 10-fold cross-validated binomial deviance from elastic-net regression on JIA phenotype using our JIA GRS model in combination with the 16 autoimmune GRSs.

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1T1

D (c

c)T1

D (m

eta)

RA

(Oka

da)

RA

(Sta

hl)

RA

(Eyr

e)JI

A (H

inks

)JI

A (o

ur G

RS)

PBC

(Liu

)PB

C (C

orde

ll)AS U

CPS

ON

ARM

SM

S (ic

)C

ELSL

E

T1D (cc)T1D (meta)RA (Okada)

RA (Stahl)RA (Eyre)

JIA (Hinks)JIA (our GRS)

PBC (Liu)PBC (Cordell)

ASUC

PSONAR

MSMS (ic)

CELSLE

−8 −7 −6 −5 −4 −3 −2

1.20

1.25

1.30

1.35

log(Lambda)

Bino

mia

l Dev

ianc

e

●●

●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

17 17 17 16 15 13 13 13 12 11 11 8 6 5 3 2 2 2 2 2 1

Page 8: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

8

Figure S7: Comparion of the effect size (95% CI) of each of the GRSs in (i) logistic regression (considering each GRS separately) vs (ii) the conditional effect size in elastic-net logistic regression (used to create the metaGRS individually), in the CLARITY dataset. Confidence intervals are not available for the odds ratios estimated via elastic-net.

Table S2: Performane of the JIA GRS and metaGRS in external validation on the CHOP dataset. Based on logistic regression, optionally adjusting for sex and top 10 genetic principal components (PCs).

AUC (95% CI) OR (95% CI)

CHOP (US) Sex + PCs GRS MetaGRS GRS + Sex + PCs MetaGRS + Sex + PCs

0.677 (0.654−0.701) 0.657 (0.631−0.683) 0.684 (0.659−0.709) 0.735 (0.712−0.758) 0.748 (0.725−0.771)

-- 1.831 (1.685−1.991) 2.051 (1.870−2.252) 1.838 (1.686−2.007) 2.042 (1.857−2.250)

Enthesitis-related JIA HLA-B27 model The human leukocyte antigen B27 gene (HLA-B27) has a strong association with the enthesitis-related arthritis JIA subtype14,17–20. We created a model to predict enthesitis-related arthritis by using the most commonly tested tag SNPs for HLA-B27 (rs13202464, rs116488202 and rs4349859) to classify each individual as HLA-B27 positive or negative. Then we run a logistic regression model between the case/control status and the presence of the HLA-B27 in the UK dataset.

●●

●●

●●

●●

●●

● ●

●●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

MSMS (ic)

PSONAR

UCRA (Stahl)

RA (Okada)CEL

RA (Eyre)AS

T1D (meta)SLE

T1D (cc)PBC (Liu)

PBC (Cordell)JIA (Hinks)

JIA (our GRS)

1.0 1.5 2.0Odds Ratio

IndividuallyMetaGRS

Page 9: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

9

Methods Genotype data and quality control

Figure S8: First five principal components plot for the UK dataset.

Figure S9: First five principal components plot for the CHOP dataset.

Page 10: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

10

Figure S10: First five principal components plot for the CLARITY dataset.

Page 11: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

11

Figure S11: First five principal components plot for the selected CHOP subset.

Figure S12: First five principal components plot for the selected CLARITY subset.

Page 12: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

12

Reference 1. Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear

models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).

2. Bentham, J. et al. Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus. Nat. Genet. 47, 1457–1464 (2015).

3. Sawcer, S. et al. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature vol. 476 214–219 (2011).

4. Beecham, A. H. et al. Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat. Genet. 45, 1353–1362 (2013).

5. Tsoi, L. C. et al. Identification of 15 new psoriasis susceptibility loci highlights the role of innate immunity. Nat. Genet. 44, 1341–1348 (2012).

6. Faraco, J. et al. ImmunoChip Study Implicates Antigen Presentation to T Cells in Narcolepsy. PLoS Genet. 9, e1003270 (2013).

7. Trynka, G. et al. Dense genotyping identifies and localizes multiple common and rare variant association signals in celiac disease. Nat. Genet. 43, 1193–1201 (2011).

8. Onengut-Gumuscu, S. et al. Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers. Nat. Genet. 47, 381–386 (2015).

9. Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).

10. Stahl, E. A. et al. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat. Genet. 42, 508–514 (2010).

11. Eyre, S. et al. High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis. Nat. Genet. 44, 1336–1340 (2012).

12. Liu, J. Z. et al. Dense fine-mapping study identifies new susceptibility loci for primary biliary cirrhosis. Nat. Genet. 44, 1137–1141 (2012).

13. Cordell, H. J. et al. International genome-wide meta-analysis identifies new primary biliary cirrhosis risk loci and targetable pathogenic pathways. Nat. Commun. 6, (2015).

14. Cortes, A. et al. Identification of multiple risk variants for ankylosing spondylitis through high-density genotyping of immune-related loci. Nat. Genet. 45, 730–738 (2013).

15. Anderson, C. A. et al. Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47. Nature Genetics vol. 43 246–252 (2011).

16. Hinks, A. et al. Dense genotyping of immune-related disease regions identifies 14 new susceptibility loci for juvenile idiopathic arthritis. Nat. Genet. 45, 664–669 (2013).

17. Evans, D. M. et al. Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat. Genet. 43, 761–767 (2011).

18. Berntson, L. et al. HLA-B27 predicts a more extended disease with increasing age at onset in boys with Juvenile idiopathic arthritis. J. Rheumatol. 35, 2055–2061 (2008).

Page 13: Genomic risk scores for juvenile idiopathic arthritis and ... · 20.02.2020  · The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs

13

19. Robinson, P. C. et al. Genetic dissection of acute anterior uveitis reveals similarities and differences in associations observed with ankylosing spondylitis. Arthritis Rheumatol. 67, 140–151 (2015).

20. Zuber, Z., Turowska-Heydel, D., Sobczyk, M. & Chudek, J. Prevalence of HLA-B27 antigen in patients with juvenile idiopathic arthritis. Reumatologia 53, 125–130 (2015).