1
Analytical Validation of Ion AmpliSeq Cancer Panel 2.0 Weihua Liu, Zhenyu Yan, Peng Fang, Agus Darwanto, Stephane Wong, Chad Galderisi, and Jin Li, MolecularMD Corp., Cambridge, MA and Portland, OR Introduction Results continued Bli d d S d F h 37 bli d d FFPE l f i (l l GIST d il b i d l ) 222 i ll d f TS2 2 f h h h Materials and Methods The Ion AmpliSeq Cancer Panel 2.0 (Ion Torrent) allows for targeted sequencing of 189 regions of interest in 46 cancerrelated genes. We used well characterized cancer cell lines harboring clinically relevant variants as positive controls and HapMap samples NA12878 and NA19240 as wild type control samples. A total of 37 FFPE samples of a variety of cancer types were analyzed in a blinded fashion to evaluate the diagnostic sensitivity and specificity. With no gold standard available as a reference method to detect mutations with comparable sensitivity, we analyzed Blinded Study : For the 37 blinded FFPE samples from various cancer types (lung, colon, GIST , endometrial, breast, ovarian and larynx), 222 raw variants called from TS2.2 were further run through MolecularMD’s proprietary pipeline to allow for reporting 105 variants (Figure 3), 95 of which were cross confirmed by Illumina panel. Three mutations, which were evident when reviewed in IGV but missed by Torrent Suite variant caller were confirmed as false negatives by Illumina cancer panel (Table 3) and Sanger sequencing (data not shown). The 10 variants identified by Ion, but not confirmed by Illumina (Table 4), were further evaluated either by Sanger sequencing (for those with >10% frequency; data not shown) or a restriction endonucleasebased fragment analysis method described by Liu and colleagues and run on CaliperGx (ref. 3; Figure 4 contains representative data for the FGFR2 A389T variant, additional data not shown). Nine of the ten variants were confirmed as true positives. The variant in SMARCB1 remains to be confirmed, and is defined as false positive at this stage. Genomic DNA (10ng) was amplified using Ion AmpliSeq™ Primer Pools (ref.1). The resulting amplicons were treated with FuPa Reagent to partially digest primer sequences and ligated to Ion sequencing adapters. For sample multiplexing, libraries were barcoded using the Ion Xpress™ Barcode Adapters 116. The ligated amplicons were purified by Ampure XP and quantified by qPCR using Ion Library Quantitation kit. Quantified libraries were qualified with Caliper GX (optional) and amplified on OneTouch system using OneTouch template preparation available as a reference method to detect mutations with comparable sensitivity, we analyzed the same sets of samples with both the Ion AmpliSeq and the Illumina TruSeq Amplicon Cancer Panel for comparison. These two cancer panels use distinct library preparation methods, as well as different sequencing chemistries and analysis pipelines. They share largely overlapping regions of interest and have comparable sensitivity of minor allele detection. Variants detected by both panels were considered as true positive variants. While most variants can be cross lid t d bt I d Ill i i t th t l d b l bt t th Sample name Gene name Chromosome coordinate & cDNA change Amino acids Mut Freq (%) by Illumina R090060 KIT 4:55595550_c.2028delT p.Leu678X 22.8 R070690 PDGFRA 4:5515209255152103_c.2524 2535d lGACATCATGCAT p.Ile843Asp846del(84 2 845DIMH> ) 26.0 Table 3. False negatives in blind study of 37 FFPE samples Caliper GX (optional) and amplified on OneTouch system using OneTouch template preparation v2 kit and sequenced on PGM using Ion PGM 200 bp sequencing kit. Sequence alignment was performed by Torrent Suite 2.2 (ref. 2) and variants were called by TS 2.2 variant caller with modified parameters to allow optimal sensitivity and specificity. MolecularMD proprietary scripts were used for filtering recurring false positives, variant annotation and reporting. In addition to reporting hotspot variants, we report all variants at frequency >=2% with potential impact on protein Figure 3. 3D projection of variants detected; genes and variants spread on x and y axes, number of variants on z axis validated between Ion and Illumina, variants that were only covered by one panel but not the other were confirmed by a third method. _2535delGACATCATGCAT 2845DIMH>) R120320 PIK3CA 3:178936092_c.1634A>G p.Glu545Gly(545E>G) 36.0 Table 4. Mutations in need of confirmation by a third method Sample name Gene name Chromosome coordinate & cDNA change Amino acids Mut Freq (%) by Ion Torrent Figure 4. Confirmation of FGFR2 A389T mutation WT digested MT digested K iti (6 ll li i) FFPE l 8% ll li i C i ith Ill i lt impact on protein. Validation Flowchart R08_0312 TP53 17:7578208_c.641A>G p.His214Arg(214H>R) 3.5 R08_0312 TP53 17:7577121_c.817C>T p.Arg273Cys(273R>C) 2.8 R070017 FGFR2 10:123274753_c.1165G>A p.Ala389Thr(389A>T) 4.6 R070017 NRAS 1:115258723_c.59C>T p.Thr20Ile(20T>I) 4.8 R070017 SMARCB1 22:24176339_c.1103G>A p.Arg368His(368R>H) 2.5 R070017 PIK3CA 3:178916860_c.247T>G p.Phe83Val(83F>V) 40.5 R070017 PIK3CA 3:178916876 c 263G>A p Arg88Gln(88R>Q) 37 9 Mutant peak Known positives (6 cancer cell line mix) + Known negatives (HapMap samples) Serial dilution Analytical Sensitivity FFPE samples with known mutations (n=4) Robustness in FFPE Samples 8% cell line mix and FFPE samples (n=10) Intraand Interassay HorizonDx samples (n=4) Mut Freq Accuracy Blinded Study Clinical FFPE Samples (n=37, assorted tumors) MolecularMD data analysis pipeline Comparison with Illumina results Confirmation by a third method Diagnostic Sensitivity 3 4 5 6 7 8 9 10 R07 0017 PIK3CA 3:178916876_c.263G>A p.Arg88Gln(88R>Q) 37.9 R070017 KDR 4:55962485_c.2639G>A p.Arg880Gln(880R>Q) 3.1 R070017 MET 7:116340225_c.1087T>C p.Cys363Arg(363C>R) 3.7 MMD_A2012 PIK3CA 3:178921549_c.1031T>G p.Val344Gly(344V>G) 53 Reproducibility and Mut Freq Precision and Specificity Analytical Sensitivity : Equimolar mixture of gDNA from six cell lines (G361, H1650, H1395, Results Analytical Accuracy : AmpliSeq cancer panel reported mutation frequencies very close to those Mutation Spectrum : We compared the mutation spectrum from Uniformity of Coverage : For 72 distinct libraries analyzed, the average and standard deviation of relative coverage (coverage of a 0 1 2 ABL1 ATM BRAF CTNNB1 FBXW7 FGFR2 FGFR3 HRAS JAK3 KDR KIT KRAS MET NRAS PIK3CA PTEN RB1 RET SMAD4 SMARCB1 SMO STK11 TP53 Catalog# Sample Description Cell line AmpliSeq Mut Freq Chromosome di t DNA h G A i id M i f (%) R dD h Table 2. Mutation frequency determination for HorizonDx samples Figure 6. Relative coverage of 189 ROIs in Ion AmpliSeq Cancer Panel for 72 libraries Table 1. Analytical sensitivity defined by serial dilution of cancer cell line DNAs H1975, DU145 and MDAMB231) was diluted into NA12878 gDNA with final concentrations of 16%, 8% and 4% cancer cell line DNA (Table 1). The LOD for single base substitutions and indels was determined to be 25%. expected for samples provided by HorizonDx (Table 2), with the exception of the EGFR T790M sample which had two overlapping amplicons that diluted the true mutation frequency. different cancer types. For the five endometrial cancer samples, in addition to most frequent mutations reported in endometrial cancer, e.g. PTEN, PIK3CA, CTNNB1, TP53, FBXW7, and FGFR2 (ref. 4), we also observed variations in NRAS, MET, STK11, SMO and other genes (Figure 5). One sample, R070017, showed significant heterogeneity with minor clones containing NRAS ROI/mean coverage of the library) for the 189 ROIs was determined (Figure 6). The data showed the coverage for the ROIs was generally uniform. For each ROI, the percentage of libraries with relative coverage <0.2X showed inverse correlation with relative coverage, indicating uniformity is higher in the high coverage regions relative to low coverage regions. 1 35 Figure 5. Variants detected in endometrial cancer HD 10300302005 3.5% BRAF V600E SW48 3.6% HD125 6.5% EGFR T790M RKO 3.65 HD129 6% EGFR L858R RKO 5.7% HD132 4.5% EGFR L861Q RKO 4.8% coordinate_cDNA change Gene name Amino_acids 16% 8% 4% 16% 8% 4% 11:108204681_c.7996A>G ATM p.Thr2666Ala(2666T>A) 23.8 10 5 1535 1412 1412 12:25398281_c.38G>A KRAS p.Gly13Asp(13G>D) 7.6 2031 13:49037903_c.2143A>T RB1 p.Lys715X(715K>*) 18 5.7 278 439 17:7577099_c.839G>A TP53 p.Arg280Lys(280R>K) 20.3 9 3.8 6291 5426 6524 17:7577118_c.820G>T TP53 p.Val274Phe(274V>F) 13.8 5.2 2.9 3781 3238 3693 17:7577120_c.818G>A TP53 p.Arg273His(273R>H) 16 7.5 2.8 5612 5060 6127 17:7578181_c.668C>T TP53 p.Pro223Leu(223P>L) 4.8 4230 Mutation f requency(%) Read Depth A389T C363R N375S F83V G34V T41I H168R T170M R342X M541L P38S R173H Q298X A328T R368H R393Q R579X R880Q T20I V722I W537X CTNNB1 FBXW7 R88Q T319X significant heterogeneity with minor clones containing NRAS and FGFR2 and other mutations with frequencies <15%, which is below the Sanger detection limit. 0.5 0.6 0.7 0.8 0.9 1 2 2.5 3 3.5 Mean relative coverage of libraries Percentage of libraries with relative avg_cov <0.2 (total 72ibraries) Linear (Mean relative coverage of libraries) 19:1220438- 1220442_c.531_535delCAA GC STK11 p.Lys178GlyfsX86 4.4 500 4:55593464_c.1621A>C KIT p.Met541Leu(541M>L) 20 8 3.4 8487 4504 4629 4:55972974_c.1416A>T KDR p.Gln472His(472Q>H) 11.5 4.1 2583 2359 5:112175363_c.4072G>A APC p.Ala1358Thr(1358A>T) 6.5 3.6 5482 3994 7:140453136_c.1799T>A BRAF p.Val600Glu(600V>E) 18.9 10.3 4.2 7324 5466 6142 7:140481402_c.1406G>C BRAF p.Gly469Ala(469G>A) 14 7.7 3.4 5274 4250 4333 7:140481417_c.1391G>T BRAF p.Gly464Val(464G>V) 12.8 5.9 6402 5123 7:55242465 FGFR2 Jak3 KDR Kit Met Nras PIK3CA PTEN RB1 Smo STK11 Determination of Phasing of Compound Mutations : The 260 flows (~100 bp) of sequencing used here allows for identifying the phasing status of compound mutations within the range of 100 bp. For example, we observed PIK3CA cis and trans compound mutations in two samples (Figure 2). Figure 2A shows an IGV view of phasing of coexisting mutations E542K (37.1%, A) and E545K (2 7% A) in the PIK3CA gene as trans mutations in a colon cancer sample while y= 0.0073x + 1.7011 R² = 0.9768 0 0.1 0.2 0.3 0.4 0 0.5 1 1.5 Figure 2. Identification of phasing of compound mutations C l i 7:55242465- 55242479_c.2235_2249del GGAATTAAGAGAAGC EGFR p.Glu746_Ala750del(745- 750KELREA>K) 12.9 5.6 3.8 498 496 496 7:55249071_c.2369C>T EGFR p.Thr790Met(790T>M) 6.3 3.6 7074 2951 7:55259515_c.2573T>G EGFR p.Leu858Arg(858L>R) 9.4 4.7 3.1 8744 6513 6809 9:21971153_c.205G>T CDKN2A p.Glu69X(69E>*) 19.7 4.4 870 1600 Mut Freq<15% TP53 R080604 R080914 R090677 R120292 R070017 Analytical Reproducibility : Among 10 samples with known mutations (cell line DNA and FFPE), 60 of 63 expected mutations were reproducibly detected in duplicate and triplicate runs Of and E545K (2.7%, A) in the PIK3CA gene as trans mutations in a colon cancer sample, while Figure 2B shows an IGV view of phasing of coexisting mutations F83V (40.5%, G) and R88Q (37.9%, A) in the PIK3CA gene as cis mutations in an endometrial cancer sample. Rf ROI Conclusions Our validation demonstrates that the Ion AmpliSeq Cancer Panel combined with MolecularMD’s proprietary analysis pipeline provides a robust and accurate test to profile clinically relevant gene mutations with analytical sensitivity of 25% minor allele frequency. In addition to reporting hotspot mutations, we also report other variants that would impact 60 of 63 expected mutations were reproducibly detected in duplicate and triplicate runs. Of the three nonreproducible variants, two were lowlevel (frequencies close to 2%), and one was a highfrequency variant in a homopolymer region. The precision of variant frequency detection was determined using 30 mutations in 6 samples with triplicates or duplicates (Figure 1). In general, the CV is between 020%, showing inverse correlation with variant frequency. A References 1. Life Technologies; Ion AmpliSeq™ Library Kit 2.0 manual 2. Life Technologies; Torrent Suite 2.2 manual 3. Liu WH, Kaur M, Makrigiorgos GM. Detection of hotspot mutations and polymorphisms using an enhanced PCRRFLP approach. Hum. Mutat. 2003; 21(5):53541 protein (all except synonymous and intronic mutations). The diagnostic sensitivity and specificity calculated based on a blinded study of 37 FFPE samples is 97% and 99%, respectively. In this study, we developed a unique crossvalidation strategy using two different NGS platforms. We also established a rapid and accurate restriction endonuclease based fragment analysis method for confirming variants at frequencies below 10%. O lid ti td 102 i t ti i 23 ith 7 idl 7 d 88 Figure 1. Assay precision for detection of mutation frequency Variant Frequency (%) 4. Bamford S, Dawson E, Forbes S, Clements J, Pettett R, Dogan A, Flanagan A, Teague J, Futreal PA, Stratton MR, Wooster R. The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website. Br J Cancer. 2004: 91;3558 Our validation study covers 102 unique mutations in 23 genes, with 7 indels, 7 nonsense and 88 missense mutations. Ion AmpliSeq paired with the MolecularMD proprietary pipeline has excellent sensitivity for single base substitutions with only one mutation (PIK3CA E545G) missed due to the pseudogene interference. This can be resolved by manual review of Bam files in IGV for this genomic locus in the future. B For Further Information Please contact [email protected] or visit www.molecularmd.com . genomic locus in the future. One weakness of this cancer panel in its current version is the inability to detect single base indels in homopolymer regions. However, it tolerates other indels fairly well; indels ranging from 121 bp were successfully detected in this study. Ion AmpliSeq is a cancer panel wellsuited for small clinical samples with only 10ng DNA input. There was a zero failure rate for 41 FFPE samples that we tested. 10 We are currently exploring methods to improve the variant calling for indels in homopolymer regions by using TS 3.0 and other pipelines. 0

Analytical Validation of Ion AmpliSeq Cancer Panel 2molecularmd.com/wp-content/uploads/2017/03/Analytical-Validation... · Analytical Validation of Ion AmpliSeq Cancer ... scripts

Embed Size (px)

Citation preview

Page 1: Analytical Validation of Ion AmpliSeq Cancer Panel 2molecularmd.com/wp-content/uploads/2017/03/Analytical-Validation... · Analytical Validation of Ion AmpliSeq Cancer ... scripts

Analytical Validation of Ion AmpliSeq Cancer Panel 2.0Weihua Liu, Zhenyu Yan, Peng Fang, Agus Darwanto, Stephane Wong, Chad Galderisi, and Jin Li, MolecularMD Corp., Cambridge, MA and Portland, OR

Introduction Results continuedBli d d S d F h 37 bli d d FFPE l f i (l l GIST d i l b i d l ) 222 i ll d f TS2 2 f h h h

Materials and MethodsThe Ion AmpliSeq Cancer Panel 2.0 (Ion Torrent) allows for targeted sequencing of 189 regions ofinterest in 46 cancer‐related genes. We used well characterized cancer cell lines harboringclinically relevant variants as positive controls and HapMap samples NA12878 and NA19240 aswild type control samples. A total of 37 FFPE samples of a variety of cancer types were analyzedin a blinded fashion to evaluate the diagnostic sensitivity and specificity. With no gold standardavailable as a reference method to detect mutations with comparable sensitivity, we analyzed

Blinded Study: For the 37 blinded FFPE samples from various cancer types (lung, colon, GIST, endometrial, breast, ovarian and larynx), 222 raw variants called from TS2.2 were further run throughMolecularMD’s proprietary pipeline to allow for reporting 105 variants (Figure 3), 95 of which were cross confirmed by Illumina panel. Three mutations, which were evident when reviewed in IGVbut missed by Torrent Suite variant caller were confirmed as false negatives by Illumina cancer panel (Table 3) and Sanger sequencing (data not shown). The 10 variants identified by Ion, but notconfirmed by Illumina (Table 4), were further evaluated either by Sanger sequencing (for those with >10% frequency; data not shown) or a restriction endonuclease‐based fragment analysismethod described by Liu and colleagues and run on CaliperGx (ref. 3; Figure 4 contains representative data for the FGFR2 A389T variant, additional data not shown). Nine of the ten variants wereconfirmed as true positives. The variant in SMARCB1 remains to be confirmed, and is defined as false positive at this stage.

Genomic DNA (10ng) was amplified using Ion AmpliSeq™ Primer Pools (ref.1). The resultingamplicons were treated with FuPa Reagent to partially digest primer sequences and ligated toIon sequencing adapters. For sample multiplexing, libraries were barcoded using the IonXpress™ Barcode Adapters 1‐16. The ligated amplicons were purified by Ampure XP andquantified by qPCR using Ion Library Quantitation kit. Quantified libraries were qualified withCaliper GX (optional) and amplified on OneTouch system using OneTouch template preparationavailable as a reference method to detect mutations with comparable sensitivity, we analyzed

the same sets of samples with both the Ion AmpliSeq and the Illumina TruSeq Amplicon CancerPanel for comparison. These two cancer panels use distinct library preparation methods, as wellas different sequencing chemistries and analysis pipelines. They share largely overlappingregions of interest and have comparable sensitivity of minor allele detection. Variants detectedby both panels were considered as true positive variants. While most variants can be crosslid t d b t I d Ill i i t th t l d b l b t t th

Sample name Gene name Chromosome coordinate & cDNA change Amino acids Mut Freq (%) 

by Illumina

R09‐0060 KIT 4:55595550_c.2028delT p.Leu678X 22.8

R07‐0690 PDGFRA4:55152092‐55152103_c.25242535d lGACATCATGCAT

p.Ile843Asp846del(842 845DIMH> )

26.0

Table 3. False negatives in blind study of 37 FFPE samples

p p gCaliper GX (optional) and amplified on OneTouch system using OneTouch template preparationv2 kit and sequenced on PGM using Ion PGM 200 bp sequencing kit. Sequence alignment wasperformed by Torrent Suite 2.2 (ref. 2) and variants were called by TS 2.2 variant caller withmodified parameters to allow optimal sensitivity and specificity. MolecularMD proprietaryscripts were used for filtering recurring false positives, variant annotation and reporting. Inaddition to reporting hotspot variants, we report all variants at frequency >=2% with potentialimpact on protein

Figure 3. 3D projection of variants detected; genes and variants spread on x and y axes, number of variants on z axis 

validated between Ion and Illumina, variants that were only covered by one panel but not theother were confirmed by a third method.

_2535delGACATCATGCAT 2‐845DIMH>‐)R12‐0320 PIK3CA 3:178936092_c.1634A>G p.Glu545Gly(545E>G) 36.0

Table 4. Mutations in need of confirmation by a third method

Sample name Gene name Chromosome coordinate & cDNA change Amino acids Mut Freq (%) 

by Ion Torrent

Figure 4. Confirmation of FGFR2 A389T mutation

WT digested MT digested

K iti (6 ll li i ) FFPE l 8% ll li i C i ith Ill i lt

impact on protein.

Validation Flowchart

R08_0312 TP53 17:7578208_c.641A>G p.His214Arg(214H>R) 3.5R08_0312 TP53 17:7577121_c.817C>T p.Arg273Cys(273R>C) 2.8R07‐0017 FGFR2 10:123274753_c.1165G>A p.Ala389Thr(389A>T) 4.6R07‐0017 NRAS 1:115258723_c.59C>T p.Thr20Ile(20T>I) 4.8R07‐0017 SMARCB1 22:24176339_c.1103G>A p.Arg368His(368R>H) 2.5R07‐0017 PIK3CA 3:178916860_c.247T>G p.Phe83Val(83F>V) 40.5R07‐0017 PIK3CA 3:178916876 c 263G>A p Arg88Gln(88R>Q) 37 9

Mutant peak

Known positives (6 cancer cell line mix)+

Known negatives (HapMap samples)

Serial dilution

Analytical Sensitivity

FFPE sampleswith known mutations

(n=4)

Robustness in FFPE Samples

8% cell line mixand

FFPE samples (n=10)

Intra‐ and Inter‐assay

HorizonDx samples(n=4)

Mut Freq Accuracy

Blinded StudyClinical FFPE Samples

(n=37, assorted tumors)

MolecularMDdata analysis pipeline

Comparison with Illumina results

Confirmation by a third method

Diagnostic Sensitivity  3

4

5

6

7

8

9

10

R07 0017 PIK3CA 3:178916876_c.263G>A p.Arg88Gln(88R>Q) 37.9R07‐0017 KDR 4:55962485_c.2639G>A p.Arg880Gln(880R>Q) 3.1R07‐0017 MET 7:116340225_c.1087T>C p.Cys363Arg(363C>R) 3.7MMD_A2012 PIK3CA 3:178921549_c.1031T>G p.Val344Gly(344V>G) 53

Reproducibility  andMut Freq Precision

and Specificity

Analytical Sensitivity: Equimolar mixture of gDNA from six cell lines (G361, H1650, H1395,Results

Analytical Accuracy: AmpliSeq cancer panel reported mutation frequencies very close to those Mutation Spectrum:We compared the mutation spectrum from Uniformity of Coverage: For 72 distinct libraries analyzed, the average and standard deviation of relative coverage (coverage of a

0

1

2

ABL1

ATM

BRAF

CTNNB1

FBXW

7FG

FR2

FGFR3

HRA

SJAK3

KDR

KIT

KRAS

MET

NRA

SPIK3

CAPTEN RB1

RET

SMAD

4SM

ARCB

1SM

OSTK1

1TP53

Catalog# Sample Description Cell line AmpliSeq Mut FreqChromosome

di t DNA h G A i id M i f (%) R d D h

Table 2. Mutation frequency determination for HorizonDx samplesFigure 6. Relative coverage of 189 ROIs in Ion AmpliSeq Cancer Panel for 72 librariesTable 1. Analytical sensitivity defined by serial dilution of cancer cell line DNAs                             

H1975, DU145 and MDAMB231) was diluted into NA12878 gDNA with final concentrations of16%, 8% and 4% cancer cell line DNA (Table 1). The LOD for single base substitutions and indelswas determined to be 2‐5%.

expected for samples provided by HorizonDx (Table 2), with the exception of the EGFR T790Msample which had two overlapping amplicons that diluted the true mutation frequency.

different cancer types. For the five endometrial cancer samples,in addition to most frequent mutations reported in endometrialcancer, e.g. PTEN, PIK3CA, CTNNB1, TP53, FBXW7, and FGFR2(ref. 4), we also observed variations in NRAS, MET, STK11, SMOand other genes (Figure 5). One sample, R07‐0017, showedsignificant heterogeneity with minor clones containing NRAS

ROI/mean coverage of the library) for the 189 ROIs was determined (Figure 6). The data showed the coverage for the ROIs wasgenerally uniform. For each ROI, the percentage of libraries with relative coverage <0.2X showed inverse correlation with relativecoverage, indicating uniformity is higher in the high coverage regions relative to low coverage regions.

13 5

Figure 5. Variants detected in endometrial cancer

HD 103‐003‐02‐005 3.5% BRAF V600E SW48 3.6%

HD125 6.5% EGFR T790M RKO 3.65

HD129 6% EGFR L858R RKO 5.7%

HD132 4.5% EGFR L861Q RKO 4.8%

coordinate_cDNA change Gene name Amino_acids

16% 8% 4% 16% 8% 4%

11:108204681_c.7996A>G ATM p.Thr2666Ala(2666T>A) 23.8 10 5 1535 1412 1412

12:25398281_c.38G>A KRAS p.Gly13Asp(13G>D) 7.6 2031

13:49037903_c.2143A>T RB1 p.Lys715X(715K>*) 18 5.7 278 439

17:7577099_c.839G>A TP53 p.Arg280Lys(280R>K) 20.3 9 3.8 6291 5426 6524

17:7577118_c.820G>T TP53 p.Val274Phe(274V>F) 13.8 5.2 2.9 3781 3238 3693

17:7577120_c.818G>A TP53 p.Arg273His(273R>H) 16 7.5 2.8 5612 5060 6127

17:7578181_c.668C>T TP53 p.Pro223Leu(223P>L) 4.8 4230

Mutation frequency(%) Read Depth

A389T C363R N375S F83V G34V T41I H168R T170M R342X M541L P38S R173H Q298X A328T R368H R393Q R579X R880Q T20I V722I W537X

CTNNB1

FBXW7

R88Q T319X

significant heterogeneity with minor clones containing NRASand FGFR2 and other mutations with frequencies <15%, which isbelow the Sanger detection limit.

0.5

0.6

0.7

0.8

0.9

1

2

2.5

3

3.5

Mean relative coverage of libraries Percentage of libraries with relative avg_cov <0.2 (total 72ibraries) Linear (Mean relative coverage of libraries)

19:1220438-1220442_c.531_535delCAAGC STK11 p.Lys178GlyfsX86 4.4 500

4:55593464_c.1621A>C KIT p.Met541Leu(541M>L) 20 8 3.4 8487 4504 4629

4:55972974_c.1416A>T KDR p.Gln472His(472Q>H) 11.5 4.1 2583 2359

5:112175363_c.4072G>A APC p.Ala1358Thr(1358A>T) 6.5 3.6 5482 3994

7:140453136_c.1799T>A BRAF p.Val600Glu(600V>E) 18.9 10.3 4.2 7324 5466 6142

7:140481402_c.1406G>C BRAF p.Gly469Ala(469G>A) 14 7.7 3.4 5274 4250 4333

7:140481417_c.1391G>T BRAF p.Gly464Val(464G>V) 12.8 5.9 6402 5123

7:55242465

FGFR2

Jak3

KDR

Kit

Met

Nras

PIK3CA

PTEN

RB1

Smo

STK11

Determination of Phasing of Compound Mutations: The 260 flows (~100 bp) of sequencingused here allows for identifying the phasing status of compound mutations within the range of100 bp. For example, we observed PIK3CA cis and trans compound mutations in two samples(Figure 2). Figure 2A shows an IGV view of phasing of co‐existing mutations E542K (37.1%, A)and E545K (2 7% A) in the PIK3CA gene as trans mutations in a colon cancer sample while

y = ‐0.0073x + 1.7011R² = 0.9768

0

0.1

0.2

0.3

0.4

0

0.5

1

1.5

Figure 2. Identification of phasing of compound mutations

C l i

7:55242465-55242479_c.2235_2249delGGAATTAAGAGAAGC EGFR

p.Glu746_Ala750del(745-750KELREA>K) 12.9 5.6 3.8 498 496 496

7:55249071_c.2369C>T EGFR p.Thr790Met(790T>M) 6.3 3.6 7074 2951

7:55259515_c.2573T>G EGFR p.Leu858Arg(858L>R) 9.4 4.7 3.1 8744 6513 6809

9:21971153_c.205G>T CDKN2A p.Glu69X(69E>*) 19.7 4.4 870 1600

Mut Freq<15%

TP53

R080604R080914R090677R120292R070017

Analytical Reproducibility: Among 10 samples with known mutations (cell line DNA and FFPE),60 of 63 expected mutations were reproducibly detected in duplicate and triplicate runs Of

and E545K (2.7%, A) in the PIK3CA gene as trans mutations in a colon cancer sample, whileFigure 2B shows an IGV view of phasing of co‐existing mutations F83V (40.5%, G) and R88Q(37.9%, A) in the PIK3CA gene as cismutations in an endometrial cancer sample.

R f

ROI

Conclusions• Our validation demonstrates that the Ion AmpliSeq Cancer Panel combined with

MolecularMD’s proprietary analysis pipeline provides a robust and accurate test to profileclinically relevant gene mutations with analytical sensitivity of 2‐5% minor allele frequency.

• In addition to reporting hotspot mutations, we also report other variants that would impact

60 of 63 expected mutations were reproducibly detected in duplicate and triplicate runs. Ofthe three non‐reproducible variants, two were low‐level (frequencies close to 2%), and onewas a high‐frequency variant in a homopolymer region. The precision of variant frequencydetection was determined using 30 mutations in 6 samples with triplicates or duplicates(Figure 1). In general, the CV is between 0‐20%, showing inverse correlation with variantfrequency.

A References1. Life Technologies; Ion AmpliSeq™ Library Kit 2.0 manual2. Life Technologies; Torrent Suite 2.2 manual3. Liu WH, Kaur M, Makrigiorgos GM. Detection of hotspot mutations and polymorphisms

using an enhanced PCR‐RFLP approach. Hum. Mutat. 2003; 21(5):535‐41protein (all except synonymous and intronic mutations). The diagnostic sensitivity andspecificity calculated based on a blinded study of 37 FFPE samples is 97% and 99%, respectively.

• In this study, we developed a unique cross‐validation strategy using two different NGSplatforms. We also established a rapid and accurate restriction endonuclease based fragmentanalysis method for confirming variants at frequencies below 10%.O lid ti t d 102 i t ti i 23 ith 7 i d l 7 d 88

Figure 1. Assay precision for detection of mutation frequency

Variant Frequency (%)

g pp4. Bamford S, Dawson E, Forbes S, Clements J, Pettett R, Dogan A, Flanagan A, Teague J,

Futreal PA, Stratton MR, Wooster R. The COSMIC (Catalogue of Somatic Mutations inCancer) database and website. Br J Cancer. 2004: 91;355‐8

• Our validation study covers 102 unique mutations in 23 genes, with 7 indels, 7 nonsense and 88missense mutations.

• Ion AmpliSeq paired with the MolecularMD proprietary pipeline has excellent sensitivity forsingle base substitutions with only one mutation (PIK3CA E545G) missed due to thepseudogene interference. This can be resolved by manual review of Bam files in IGV for thisgenomic locus in the future.

BFor Further InformationPlease contact [email protected] or visit www.molecularmd.com.

genomic locus in the future.• One weakness of this cancer panel in its current version is the inability to detect single base

indels in homopolymer regions. However, it tolerates other indels fairly well; indels rangingfrom 1‐21 bp were successfully detected in this study.

• Ion AmpliSeq is a cancer panel well‐suited for small clinical samples with only 10ng DNA input.There was a zero failure rate for 41 FFPE samples that we tested.

10

• We are currently exploring methods to improve the variant calling for indels in homopolymerregions by using TS 3.0 and other pipelines.

0