Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
1
Genomic risk scores for juvenile idiopathic arthritis and its subtypes Supplementary Material Results Figure S1: Outline of the study design.
UKn: 7,505
SNPs: ~6M
CLARITYn: 940
SNPs: ~6M
CHOPn: 3,513
SNPs: ~6M
Datasets
TestingCovariables(PCs & Sex)
GRS Model
Compute Model
(SparSNP)
Test Sets
Training Set
Test Measurements
Getting SNPs
weights
AUCOR
JIA Subtypes Study
2
Figure S2: Cross-validation results within the UK dataset. The results are from 10x10 cross-validated AUC (LOESS-smoothed) as a function of the number of SNPs assigned a non-zero weight in the model. The best model was selected at 26 SNPs with predictive measure 0.671 with (0.668, 0.674) 95% CI.
0.58
0.60
0.62
0.65
0.68
8 64 512 4096Number of SNPs in model
AUC
3
Training the genomic risk scores Figure S3: A) Genome-wide association study (GWAS) of the UK dataset adjusted by its first ten principal components. The highlighted points correspond to the SNPs with non-zero weight selected by the GRS model. B) Weights associated for each of the SNPs selected for the model.
A)
B)
4
Extensions to the model MetaGRS We explored alternative approach to enhance the predictive power of our current JIA model. Given the strong pleiotropy across autoimmune diseases, we hypothesised that it may be possible to extract more predictive signal from GRSs generated for other autoimmune diseases via a metaGRS approach (cite our JACC paper), particularly given the limited sample sizes available for JIA GWAS. Figure S4 illustrates the pipeline followed to compute the metaGRS model. Table S1 shows 16 autoimmune disease summary statistic (GWAS) sets that were considered. We generated GRSs by LD-clumping (plink --clump) each summary statistic using the CLARITY dataset. Figure S5 shows the correlations between the GRSs including our JIA GRS model. Next, using CLARITY as a training set, we applied elastic-net regression in ten-fold cross-validation (using the glmnet R package1) to combine all the GRSs into a single weighted 'meta' GRS. Figure S6 shows the model performance (binomial deviance) across a range of values of the elastic net penalty, in cross-validation on the training set, where we selected the model with the lowest binomial deviance (the parameter s="lambda.min" in the glmnet package). Finally, the weights for this model were used to construct the JIA metaGRS model. Figure S7 compares the effect size (OR) of each of the GRSs individually (logistic regression) and their effect size as part of the JIA metaGRS (lasso logistic regression), in CLARITY. Finally, Table S2 presents the OR and AUC obtained by our JIA GRS and metaGRS model GRSs. The results showed improvement of >1% in AUC in compare with JIA GRS, however the uncertainty in estimates was large due to limited sample sizes.
5
Figure S4: Workflow of computing the MetaGRS.
SummaryStatistics
1
SummaryStatistics
16
Preparing the Data
GRS1
GRS 16
LD-Clumping (plink –clump)
Combining the GRSs
TestingAUCOR
CLARITY
Compute GRS (plink –score)
. . . Our JIA GRS
Phenotype(CLARITY)
Deriving the MetaGRS(R glmnet)
CHOP
JIA MetaGRS
6
Table S1: List of external summary statistics (GWAS) used to compute the JIA MetaGRS. Additionally, this table provides the final number of SNPs with non-zero weight from each GRS.
GRS label
Trait name
Number of SNPs after clumping
Study reference
SLE
Systemic lupus erythematosus
1,022
2
MS Multiple Sclerosis
546 3
MS (ic) Multiple Sclerosis
964 4
PSO Psoriasis 615 5 NAR Narcolepsy 44 6 CEL Celiac 1,141 7 T1D (cc) Type 1
Diabetes 380 8
T1D (meta) Type 1 Diabetes
457 8
RA (Okada) Rheumatoid Arthritis
1,872 9
RA (Stahl) Rheumatoid Arthritis
816 10
RA (Eyre) Rheumatoid Arthritis
658 11
PBC (Liu) Primary Biliary Cirrhosis
449 12
PBC (Cordell) Primary Biliary Cirrhosis
524 13
AS Ankylosing Spondylitis
378 14
UC Ulcerative Colitis
827 15
JIA (Hinks)
Juvenile Idiopathic Arthriris
256
16
7
Figure S5: Correlation matrix between the JIA GRS model and all the external GRSs generated to create the JIA MetaGRS model. All the GRSs were computed using the CLARITY dataset.
Figure S6: 10-fold cross-validated binomial deviance from elastic-net regression on JIA phenotype using our JIA GRS model in combination with the 16 autoimmune GRSs.
−1
−0.8
−0.6
−0.4
−0.2
0
0.2
0.4
0.6
0.8
1T1
D (c
c)T1
D (m
eta)
RA
(Oka
da)
RA
(Sta
hl)
RA
(Eyr
e)JI
A (H
inks
)JI
A (o
ur G
RS)
PBC
(Liu
)PB
C (C
orde
ll)AS U
CPS
ON
ARM
SM
S (ic
)C
ELSL
E
T1D (cc)T1D (meta)RA (Okada)
RA (Stahl)RA (Eyre)
JIA (Hinks)JIA (our GRS)
PBC (Liu)PBC (Cordell)
ASUC
PSONAR
MSMS (ic)
CELSLE
−8 −7 −6 −5 −4 −3 −2
1.20
1.25
1.30
1.35
log(Lambda)
Bino
mia
l Dev
ianc
e
●
●
●
●
●
●
●
●
●
●
●●
●●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
17 17 17 16 15 13 13 13 12 11 11 8 6 5 3 2 2 2 2 2 1
8
Figure S7: Comparion of the effect size (95% CI) of each of the GRSs in (i) logistic regression (considering each GRS separately) vs (ii) the conditional effect size in elastic-net logistic regression (used to create the metaGRS individually), in the CLARITY dataset. Confidence intervals are not available for the odds ratios estimated via elastic-net.
Table S2: Performane of the JIA GRS and metaGRS in external validation on the CHOP dataset. Based on logistic regression, optionally adjusting for sex and top 10 genetic principal components (PCs).
AUC (95% CI) OR (95% CI)
CHOP (US) Sex + PCs GRS MetaGRS GRS + Sex + PCs MetaGRS + Sex + PCs
0.677 (0.654−0.701) 0.657 (0.631−0.683) 0.684 (0.659−0.709) 0.735 (0.712−0.758) 0.748 (0.725−0.771)
-- 1.831 (1.685−1.991) 2.051 (1.870−2.252) 1.838 (1.686−2.007) 2.042 (1.857−2.250)
Enthesitis-related JIA HLA-B27 model The human leukocyte antigen B27 gene (HLA-B27) has a strong association with the enthesitis-related arthritis JIA subtype14,17–20. We created a model to predict enthesitis-related arthritis by using the most commonly tested tag SNPs for HLA-B27 (rs13202464, rs116488202 and rs4349859) to classify each individual as HLA-B27 positive or negative. Then we run a logistic regression model between the case/control status and the presence of the HLA-B27 in the UK dataset.
●●
●●
●●
●●
●●
● ●
●●
● ●
● ●
● ●
● ●
● ●
● ●
● ●
● ●
● ●
● ●
MSMS (ic)
PSONAR
UCRA (Stahl)
RA (Okada)CEL
RA (Eyre)AS
T1D (meta)SLE
T1D (cc)PBC (Liu)
PBC (Cordell)JIA (Hinks)
JIA (our GRS)
1.0 1.5 2.0Odds Ratio
●
●
IndividuallyMetaGRS
9
Methods Genotype data and quality control
Figure S8: First five principal components plot for the UK dataset.
Figure S9: First five principal components plot for the CHOP dataset.
10
Figure S10: First five principal components plot for the CLARITY dataset.
11
Figure S11: First five principal components plot for the selected CHOP subset.
Figure S12: First five principal components plot for the selected CLARITY subset.
12
Reference 1. Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear
models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
2. Bentham, J. et al. Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus. Nat. Genet. 47, 1457–1464 (2015).
3. Sawcer, S. et al. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature vol. 476 214–219 (2011).
4. Beecham, A. H. et al. Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat. Genet. 45, 1353–1362 (2013).
5. Tsoi, L. C. et al. Identification of 15 new psoriasis susceptibility loci highlights the role of innate immunity. Nat. Genet. 44, 1341–1348 (2012).
6. Faraco, J. et al. ImmunoChip Study Implicates Antigen Presentation to T Cells in Narcolepsy. PLoS Genet. 9, e1003270 (2013).
7. Trynka, G. et al. Dense genotyping identifies and localizes multiple common and rare variant association signals in celiac disease. Nat. Genet. 43, 1193–1201 (2011).
8. Onengut-Gumuscu, S. et al. Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers. Nat. Genet. 47, 381–386 (2015).
9. Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
10. Stahl, E. A. et al. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat. Genet. 42, 508–514 (2010).
11. Eyre, S. et al. High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis. Nat. Genet. 44, 1336–1340 (2012).
12. Liu, J. Z. et al. Dense fine-mapping study identifies new susceptibility loci for primary biliary cirrhosis. Nat. Genet. 44, 1137–1141 (2012).
13. Cordell, H. J. et al. International genome-wide meta-analysis identifies new primary biliary cirrhosis risk loci and targetable pathogenic pathways. Nat. Commun. 6, (2015).
14. Cortes, A. et al. Identification of multiple risk variants for ankylosing spondylitis through high-density genotyping of immune-related loci. Nat. Genet. 45, 730–738 (2013).
15. Anderson, C. A. et al. Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47. Nature Genetics vol. 43 246–252 (2011).
16. Hinks, A. et al. Dense genotyping of immune-related disease regions identifies 14 new susceptibility loci for juvenile idiopathic arthritis. Nat. Genet. 45, 664–669 (2013).
17. Evans, D. M. et al. Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat. Genet. 43, 761–767 (2011).
18. Berntson, L. et al. HLA-B27 predicts a more extended disease with increasing age at onset in boys with Juvenile idiopathic arthritis. J. Rheumatol. 35, 2055–2061 (2008).
13
19. Robinson, P. C. et al. Genetic dissection of acute anterior uveitis reveals similarities and differences in associations observed with ankylosing spondylitis. Arthritis Rheumatol. 67, 140–151 (2015).
20. Zuber, Z., Turowska-Heydel, D., Sobczyk, M. & Chudek, J. Prevalence of HLA-B27 antigen in patients with juvenile idiopathic arthritis. Reumatologia 53, 125–130 (2015).