26
Cancer-specific Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann atsch Lab, New York City, USA TCGA Symposium, May 13, 2014

Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Embed Size (px)

Citation preview

Page 1: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Cancer-specific Splicing and Splicing QTLs

Revealed By Pan-Cancer Genome Analysis

Kjong-Van Lehmann

Ratsch Lab, New York City, USA

TCGA Symposium, May 13, 2014

Page 2: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Splicing and Drug sensitivity

S. Bonnal et al. (2012); Nature Reviews Drug Discovery

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 1

Memorial Sloan-Kettering Cancer Center

Page 3: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Analysis Across Multiple Cancer Types

Goals1 Identify cancer-specific splicing patterns

2 Identify variants regulating splicing in same gene (cis)

3 Identify variants regulating splicing in other cancer genes (trans)

TCGA provides RNA-seq and matching exome data

RNA-seq Find & quantify splicing events

Exome Identify variants in exons & flanking intronic regions

Problem: Non-uniform processing (alignments & variant calling)

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 2

Memorial Sloan-Kettering Cancer Center

Page 4: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Analysis Across Multiple Cancer Types

Goals1 Identify cancer-specific splicing patterns

2 Identify variants regulating splicing in same gene (cis)

3 Identify variants regulating splicing in other cancer genes (trans)

TCGA provides RNA-seq and matching exome data

RNA-seq Find & quantify splicing events

Exome Identify variants in exons & flanking intronic regions

Problem: Non-uniform processing (alignments & variant calling)

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 2

Memorial Sloan-Kettering Cancer Center

Page 5: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Analysis Across Multiple Cancer Types

Goals1 Identify cancer-specific splicing patterns

2 Identify variants regulating splicing in same gene (cis)

3 Identify variants regulating splicing in other cancer genes (trans)

TCGA provides RNA-seq and matching exome data

RNA-seq Find & quantify splicing events

Exome Identify variants in exons & flanking intronic regions

Problem: Non-uniform processing (alignments & variant calling)

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 2

Memorial Sloan-Kettering Cancer Center

Page 6: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Re-analysis of Raw Sequencing Data

Computing at cluster colocated with CGHub

Scale: 9,000 exome & 4,500 RNA-seq libraries 400 TB data

⇒ Re-mapping (STAR): ≈ 6 CPU years

⇒ Variant Calling (GATK U.G. & MuTect): ≈ 12 CPU years

⇒ Splice variant quantification (SplAdder): ≈ 0.5 CPU years

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 3

Memorial Sloan-Kettering Cancer Center

Page 7: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Detecting Alternative Splice Events with SplAdder

Estimating Splice Index

Build and extend splice-graph using re-aligned reads

Spliced reads support either Isoform 1 or Isoform 2

Count reads supporting alternate event

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 4

Memorial Sloan-Kettering Cancer Center

Page 8: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Detecting Alternative Splice Events with SplAdder

Estimating Splice Index

Build and extend splice-graph using re-aligned reads

Spliced reads support either Isoform 1 or Isoform 2

Count reads supporting alternate event

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 4

Memorial Sloan-Kettering Cancer Center

Page 9: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Detecting Alternative Splice Events with SplAdder

Estimating Splice Index

Build and extend splice-graph using re-aligned reads

Spliced reads support either Isoform 1 or Isoform 2

Count reads supporting alternate event

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 4

Memorial Sloan-Kettering Cancer Center

Page 10: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Detecting Alternative Splice Events with SplAdder

Estimating Splice Index

Build and extend splice-graph using re-aligned reads

Spliced reads support either Isoform 1 or Isoform 2

Count reads supporting alternate event

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 4

Memorial Sloan-Kettering Cancer Center

Page 11: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Splicing Variation Across Cancer Types

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 5

Memorial Sloan-Kettering Cancer Center

Page 12: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Detection of Cancer-Specific Splicing

≈500ENCODE RNA-Seq

TCGA/SplAdder Splicing Events

≈400 TCGA Normal

RNA-Seq

≈4000 TCGA Tumor

RNA-Seq

≈4000 TCGA Tumor RNA-

SeqId

entif

y Sp

lice

varia

nts

Confi

rm

Splic

e va

riant

sSc

ore

Splic

e va

riant

s

Detected new splicing events that occur frequently in specificcancer typesNeeds independent confirmationPotential targets for treatment

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 6

Memorial Sloan-Kettering Cancer Center

Page 13: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Detection of Cancer-Specific Splicing

Scor

e Sp

lice

varia

nts

Detected new splicing events that occur frequently in specificcancer types

Needs independent confirmation

Potential targets for treatment

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 6

Memorial Sloan-Kettering Cancer Center

Page 14: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

QTL Analyses in Comparison

Challenges in Cancer Genomics

Opportunity: Understand tissue- & cancer specificity of splicing

Opportunity: Large sample size allows to find trans-associations

Problem: Heterogeneity and purity of sample

Problem: Germline vs. Somatic mutations, many rare variants

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 7

Memorial Sloan-Kettering Cancer Center

Page 15: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

QTL Analyses in Comparison

Challenges in Cancer Genomics

Opportunity: Understand tissue- & cancer specificity of splicing

Opportunity: Large sample size allows to find trans-associations

Problem: Heterogeneity and purity of sample

Problem: Germline vs. Somatic mutations, many rare variants

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 7

Memorial Sloan-Kettering Cancer Center

Page 16: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Common Variant Association Analysis

Modeling cancer and population structure

Y = Xβ + Pop. Structure + Cancer Structure +ε

Pop. Structure ∼ N(0, σ2pP) with P = XgermX

Tgerm

Cancer Structure ∼ N(0, σ2cC ) with C = XsomaX

Tsoma

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 8

Memorial Sloan-Kettering Cancer Center

Page 17: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Common Variant Association Analysis

Modeling cancer and population structure

Y = Xβ + Pop. Structure + Cancer Structure +ε

Pop. Structure ∼ N(0, σ2pP) with P = XgermX

Tgerm

Cancer Structure ∼ N(0, σ2cC ) with C = XsomaX

Tsoma

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 8

Memorial Sloan-Kettering Cancer Center

Page 18: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Common Variant Association Analysis

Modeling cancer and population structure

Y = Xβ + Pop. Structure + Cancer Structure +ε

Pop. Structure ∼ N(0, σ2pP) with P = XgermX

Tgerm

Cancer Structure ∼ N(0, σ2cC ) with C = XsomaX

Tsoma

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 8

Memorial Sloan-Kettering Cancer Center

Page 19: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Common Variant Association Analysis

Modeling cancer and population structure

Y = Xβ + Pop. Structure + Cancer Structure +ε

Pop. Structure ∼ N(0, σ2pP) with P = XgermX

Tgerm

Cancer Structure ∼ N(0, σ2cC ) with C = XsomaX

Tsoma

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 8

Memorial Sloan-Kettering Cancer Center

Page 20: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Example: cis-Associations in SNRP-C

TRPT1 - tRNA Phosphotransferase I

+ 34,725,183Position on Chr 6

-log1

0 p-

valu

e

Alt (345) Het (1101) Ref (971)

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 9

Memorial Sloan-Kettering Cancer Center

Page 21: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

cis-Associations across Multiple Cancer Types

cis-Associations in 45 genes at 5% FDR (in 900 considered genes)

Cancer Census Genes Splicing-related genes

Replicated in multiple cancer types:

KTN1 PRCC

NUP98SSXT COX6C

MMAB

ANM7

LSM1 PRP18

THOCSRU1CU2AF4

TRPT1

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 10

Memorial Sloan-Kettering Cancer Center

Page 22: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Splicing trans-Associations (FDR 5%)

NDRG1

NTRK2

PRP19CELF2

HDAC6

Cancer Census

Transcript. Factor

RNA Binding

other

Cancer spec. Splicing

SNV -> Splicing

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 11

Memorial Sloan-Kettering Cancer Center

Page 23: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Splicing trans-Associations (FDR 5%)

LSM1

LSM2DX398 RU1CU2AF4

TRPT1

ERR1NDS3

FLT3

NDRG1

NTRK2

PRP19

P53

CELF2

FRG1

HDAC6

Cancer Census

Transcript. Factor

RNA Binding

other

Cancer spec. Splicing

SNV -> Splicing

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 11

Memorial Sloan-Kettering Cancer Center

Page 24: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Splicing trans-Associations (FDR 5%)

KTN1 PRCC

NUP98

SSXT

COX6C

MMAB

ANM7

LSM1

LSM2

PRP18

DX398 THOCSRU1CU2AF4

TRPT1

ERR1

RNPS1 SON

HG2A

MERLNDS3

NTRK1

PK3CA

SRSF6

FLT3

NDRG1IGF2

NTRK2

PRP19

P53

ABI1

CBX3

CELF2

FRG1MAGI1

HDAC6GABPA SMC3MAX SMCA4

Cancer Census

Transcript. Factor

RNA Binding

other

Cancer spec. Splicing

SNV -> Splicing

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 11

Memorial Sloan-Kettering Cancer Center

Page 25: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Conclusions and future work

Developed resource of novel & known alternative splice events

Identified cancer-specific isoforms that appear rarely expressed innormal samples

Performed common variant association study to map splicingphenotypes

Sample size in TCGA data enables detection of transassociations

All of these associations still need validation (in particular trans)

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 12

Memorial Sloan-Kettering Cancer Center

Page 26: Cancer-speci c Splicing and Splicing QTLs Revealed By … · Cancer-speci c Splicing and Splicing QTLs Revealed By Pan-Cancer Genome Analysis Kjong-Van Lehmann ... Build and extend

Acknowledgements

Andre Kahles

Cyriac Kandoth

William Lee

Cancer Genome Atlas Network

Nikolaus Schultz

Robert Klein

Oliver Stegle

Gunnar Ratsch

Funded by MSKCC.

Kjong-Van Lehmann (MSKCC, New York) TCGA Symposium, 2014 13

Memorial Sloan-Kettering Cancer Center