Gonzalo Gómez, PhD. ggomez@cnio.es
Madrid, Feb 16th, 2009.

::: Gene Set Enrichment Analysis - GSEA -

Course on Functional Analysis
Bioinformatics Unit, CNIO

::: Contents.

1. Introduction
2. GSEA Software
3. Data Formats
4. Using GSEA
5. GSEA Output
6. GSEA Results
7. Leading Edge Analysis

::: Introduction.

GSEA (Subramanian et al. PNAS. 2005.)

MIT, Broad Institute

- v 2.0 available since Jan 2007
- v 2.0.1 available since Feb 16th 2007

Version 2.0 includes BioCarta, Broad Institute, GenMAPP, KEGG annotations and more...

Platforms: Affymetrix, Agilent, CodeLink, custom...

::: How does GSEA work?

GSEA applies a Kolmogorov-Smirnov-like test to find asymmetrical distributions of predefined blocks of genes within the distribution of the whole dataset.

Is this particular gene set enriched in my experiment?

A gene set can be: genes selected by the researcher, BioCarta pathways, GenMAPP sets, genes sharing a cytoband, genes targeted by common miRNAs... up to you.

::: K-S test

The Kolmogorov–Smirnov test is used to determine whether two underlying one-dimensional probability distributions differ, or whether an underlying probability distribution differs from a hypothesized distribution, in either case based on finite samples.

The one-sample KS test compares the empirical distribution function with the cumulative distribution function specified by the null hypothesis. The main applications are testing goodness of fit with the normal and uniform distributions.

The two-sample KS test is one of the most useful and general nonparametric methods for comparing two samples, as it is sensitive to differences in both location and shape of the empirical cumulative distribution functions of the two samples.

[Figure: dataset distribution (number of genes vs. gene expression level), with the distributions of gene set 1 and gene set 2.]
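As a rough illustration of the two-sample KS statistic described above, the sketch below computes the largest gap between two empirical cumulative distribution functions. The function name and the example values are hypothetical, and this is not the code GSEA itself runs.

```python
from bisect import bisect_right


def ks_two_sample_statistic(sample_a, sample_b):
    """Return D = max |F_a(x) - F_b(x)| over all observed values x."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    d_max = 0.0
    for x in a + b:
        # Empirical CDF: fraction of observations less than or equal to x.
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d_max = max(d_max, abs(cdf_a - cdf_b))
    return d_max


# Hypothetical expression levels for two gene sets.
gene_set_1 = [1.0, 2.0, 3.0, 4.0]
gene_set_2 = [3.5, 4.5, 5.5, 6.5]
print(ks_two_sample_statistic(gene_set_1, gene_set_2))
```

A value near 0 means the two samples follow similar distributions; a value near 1 means they barely overlap.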

::: How does GSEA work?

[Figure: genes ranked between Class A and Class B, with a t-test cut-off (FDR < 0.05) at each end of the list.]

...testing genes independently... Biological meaning?

[Figure: gene sets 1, 2 and 3 mapped onto the list of genes ranked between Class A and Class B, with the t-test cut-offs and the ES/NES statistic (+/-) alongside.]

- Gene set 2 enriched in Class A
- Gene set 3 enriched in Class B

ES examples :::

The Enrichment Score :::

- NES
- pval
- FDR

Benjamini-Hochberg

::: GSEA software.

Download :::

http://www.broad.mit.edu/gsea/

Main Window :::

Loading data :::

Running GSEA :::

Leading Edge Analysis :::

Chip to Chip Mapping :::

MSigDB :::

::: Data Formats.

Expression datasets :::

*.gct
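To make the *.gct layout concrete, here is a minimal sketch that serialises a small expression matrix, assuming the usual tab-delimited layout: a `#1.2` version line, a line with the number of rows and samples, a header row starting with `NAME` and `Description`, then one row per feature. The function name and the sample data are made up for illustration.

```python
def to_gct(sample_names, rows):
    """Build GCT text. rows is a list of (name, description, [values])."""
    lines = ["#1.2", f"{len(rows)}\t{len(sample_names)}"]
    lines.append("\t".join(["NAME", "Description"] + list(sample_names)))
    for name, description, values in rows:
        if len(values) != len(sample_names):
            raise ValueError(f"{name}: expected {len(sample_names)} values")
        lines.append("\t".join([name, description] + [str(v) for v in values]))
    return "\n".join(lines) + "\n"


# Hypothetical two-sample dataset with a single feature.
print(to_gct(["tumor_1", "control_1"], [("TP53", "na", [1.5, 2.0])]))
```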

*.res

*.pcl

*.txt

Phenotype datasets :::

*.cls

For categorical phenotypes (e.g. Tumor vs Control)
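A categorical *.cls file can be sketched as three space-delimited lines: the number of samples, the number of classes and a literal 1; a `#` line naming the classes; and one label per sample in the same order as the expression dataset columns. The helper below is a hypothetical illustration of that layout and assumes class names contain no spaces.

```python
def to_cls(labels):
    """Build categorical CLS text from one class label per sample."""
    # Classes in order of first appearance (dict preserves insertion order).
    classes = list(dict.fromkeys(labels))
    header = f"{len(labels)} {len(classes)} 1"
    names = "# " + " ".join(classes)
    return "\n".join([header, names, " ".join(labels)]) + "\n"


print(to_cls(["Tumor", "Tumor", "Control", "Control"]))
```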

For continuous phenotypes (e.g. gene correlated to gene set)

For continuous phenotypes (e.g. gene vs time series)

[Figure: time series (one sample every 30 minutes) and the desired peak profile.]

Gene Set Database :::

*.gmx

*.gmt
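The two gene set formats above hold the same information in different orientations: in *.gmt each row is one gene set (name, description, then its genes), while in *.gmx each column is one gene set. The sketch below parses GMT text and rewrites it as GMX; the function names and the example sets are hypothetical.

```python
from itertools import zip_longest


def parse_gmt(text):
    """Return {set_name: [genes]} from tab-delimited GMT text."""
    gene_sets = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        name, _description, *genes = line.split("\t")
        gene_sets[name] = [g for g in genes if g]
    return gene_sets


def to_gmx(gene_sets, description="na"):
    """Write gene sets column-wise: names, descriptions, then genes."""
    names = list(gene_sets)
    rows = [names, [description] * len(names)]
    # Shorter sets are padded with empty cells so every row has all columns.
    rows += zip_longest(*(gene_sets[n] for n in names), fillvalue="")
    return "\n".join("\t".join(row) for row in rows) + "\n"


gmt_text = "SET_A\tna\tTP53\tMYC\nSET_B\tna\tEGFR\n"
print(to_gmx(parse_gmt(gmt_text)))
```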

Ranked list format :::

*.rnk
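A ranked list can be sketched as two tab-delimited columns, a gene identifier and its ranking score. The helper below is a hypothetical example that writes the genes from the highest to the lowest score; the gene names and scores are invented.

```python
def to_rnk(scores):
    """Build RNK text (gene<TAB>score) sorted by descending score."""
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return "".join(f"{gene}\t{score}\n" for gene, score in ordered)


print(to_rnk({"GENE_A": -1.5, "GENE_B": 2.0, "GENE_C": 0.3}))
```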

::: Using GSEA.

Loading data :::

Running GSEA :::

MSigDB :::

gsea_home

Running GSEA :::

1. Choose true (default) to have GSEA collapse each probe set in your expression dataset into a single gene vector, which is identified by its HUGO gene symbol. In this case, you are using HUGO gene symbols for the analysis. The gene sets that you use for the analysis must use HUGO gene symbols to identify the genes in the gene sets.

2. Choose false to use your expression dataset "as is." In this case, you are using the probe identifiers that are in your expression dataset for the analysis. The gene sets that you use for the analysis must also use these probe identifiers to identify the genes in the gene sets.
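The collapse option in point 1 can be pictured with the sketch below, which merges all probes that map to the same gene symbol. Keeping the per-sample maximum is an assumption made for this illustration; the slide does not say which rule GSEA applies, and the function name, probe IDs and annotation table are hypothetical.

```python
def collapse_to_symbols(expression, probe_to_symbol):
    """Collapse {probe: [values]} to {symbol: [values]} using the per-sample max."""
    collapsed = {}
    for probe, values in expression.items():
        symbol = probe_to_symbol.get(probe)
        if symbol is None:
            continue  # probes without a gene symbol are dropped
        if symbol in collapsed:
            # Assumed rule: keep the highest value seen for each sample.
            collapsed[symbol] = [max(a, b) for a, b in zip(collapsed[symbol], values)]
        else:
            collapsed[symbol] = list(values)
    return collapsed


expression = {"probe_1": [1.0, 5.0], "probe_2": [3.0, 2.0], "probe_3": [7.0, 7.0]}
annotation = {"probe_1": "TP53", "probe_2": "TP53"}
print(collapse_to_symbols(expression, annotation))
```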

- Phenotype
- Gene Sets (few samples)

Chip2Chip mapping :::

Chip2Chip translates the gene identifiers in a gene set from HUGO gene symbols to the probe identifiers for a selected DNA chip.

Enrichment statistic :::

To calculate the enrichment score, GSEA first walks down the ranked list of genes, increasing a running-sum statistic when a gene is in the gene set and decreasing it when it is not. The enrichment score is the maximum deviation from zero encountered during that walk. This parameter affects the running-sum statistic used for the analysis.
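The walk described above can be sketched as follows. This is a simplified illustration in the spirit of Subramanian et al. (2005), not the GSEA implementation: hits add a step proportional to |score| raised to `weight`, misses subtract a constant step, and both kinds of step are normalised so the walk ends at zero. The function and parameter names are hypothetical; `weight=0` gives equal-sized steps for every hit.

```python
def enrichment_score(ranked_genes, scores, gene_set, weight=1.0):
    """Return the maximum deviation from zero of the running sum."""
    in_set = [gene in gene_set for gene in ranked_genes]
    hit_weights = [abs(s) ** weight if hit else 0.0 for s, hit in zip(scores, in_set)]
    total_hit = sum(hit_weights)
    n_miss = len(ranked_genes) - sum(in_set)
    if total_hit == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")
    running, es = 0.0, 0.0
    for w, hit in zip(hit_weights, in_set):
        # Step up on a hit, step down on a miss.
        running += w / total_hit if hit else -1.0 / n_miss
        if abs(running) > abs(es):
            es = running
    return es


ranked = ["g1", "g2", "g3", "g4"]  # already sorted by ranking metric
metric = [4.0, 3.0, -1.0, -2.0]
print(enrichment_score(ranked, metric, {"g1", "g2"}, weight=0.0))
print(enrichment_score(ranked, metric, {"g3", "g4"}, weight=0.0))
```

A positive score indicates a set concentrated at the top of the ranked list, a negative score a set concentrated at the bottom.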

Ranking Metric :::

Categorical phenotypes:

- Signal2Noise
- tTest
- Ratio_of_Classes
- Diff_of_Classes
- log2_Ratio_of_Classes

Continuous phenotypes:

- Pearson (time series)
- Cosine
- Euclidean
- Manhattan
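As an example of a ranking metric for categorical phenotypes, the signal-to-noise ratio is commonly defined as the difference of the class means divided by the sum of the class standard deviations. The sketch below uses that plain definition; any adjustments GSEA applies to the standard deviations are left out, and the function name and expression values are hypothetical.

```python
from statistics import mean, stdev


def signal_to_noise(class_a, class_b):
    """(mean_A - mean_B) / (sd_A + sd_B) for one gene; needs >= 2 samples per class."""
    return (mean(class_a) - mean(class_b)) / (stdev(class_a) + stdev(class_b))


# Hypothetical expression values of two genes in Class A and Class B.
genes = {
    "GENE_A": ([2.0, 4.0, 6.0], [1.0, 2.0, 3.0]),
    "GENE_B": ([1.0, 2.0, 3.0], [5.0, 6.0, 7.0]),
}
ranking = sorted(genes, key=lambda g: signal_to_noise(*genes[g]), reverse=True)
print(ranking)
```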

More parameters :::

Genes can be sorted by the real or the absolute value of the ranking metric:

- real: 8.2, 8.1, 8.0, ..., -7.5, -7.7, -7.9
- abs: 8.2, 8.1, 8.0, 7.9, 7.7, 7.5, ...

A further parameter determines whether to sort the genes in descending (default) or ascending order.
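The effect of these two sorting options can be sketched with a small helper. The function name, its arguments and the gene labels are hypothetical; only the real/abs and descending/ascending behaviour comes from the slide.

```python
def sort_genes(scores, mode="real", descending=True):
    """Order genes by their score ('real') or by its absolute value ('abs')."""
    if mode not in ("real", "abs"):
        raise ValueError("mode must be 'real' or 'abs'")
    if mode == "abs":
        key = lambda item: abs(item[1])
    else:
        key = lambda item: item[1]
    return [gene for gene, _ in sorted(scores.items(), key=key, reverse=descending)]


scores = {"a": 8.2, "b": -7.9, "c": 8.0, "d": -7.5}
print(sort_genes(scores, mode="real"))
print(sort_genes(scores, mode="abs"))
```

With real values, strongly negative genes end up at the bottom of the list; with absolute values they sit next to the strongly positive ones.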

Launching Analysis :::

::: GSEA output.

Accessing Results :::

By default, results are stored in gsea_home:

- /Users/yourhome/gsea_home
- C:\Documents and settings\username\gsea_home

::: GSEA results.

Index.html :::

Heat map of the top 50 features for each phenotype and a plot showing the correlation between the ranked genes and the phenotypes. In a heat map, expression values are represented as colors, where the range of colors (red, pink, light blue, dark blue) shows the range of expression values (high, moderate, low, lowest).

Enrichment results in html :::

How do I decide which results to trust?

- FDR ≤ 0.25
- NOM p-val ≤ 0.05
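The thresholds above could be applied to a results table as in the sketch below. It assumes both cut-offs must hold at the same time, which the slide does not state, and the record fields (`name`, `fdr`, `nom_p`) and example values are hypothetical rather than the column names of a GSEA report.

```python
def significant_sets(results, max_fdr=0.25, max_nom_p=0.05):
    """Return names of gene sets passing both the FDR and nominal p-value cut-offs."""
    return [
        r["name"]
        for r in results
        if r["fdr"] <= max_fdr and r["nom_p"] <= max_nom_p
    ]


# Made-up results for three gene sets.
results = [
    {"name": "SET_A", "fdr": 0.10, "nom_p": 0.01},
    {"name": "SET_B", "fdr": 0.40, "nom_p": 0.03},
    {"name": "SET_C", "fdr": 0.20, "nom_p": 0.08},
]
print(significant_sets(results))
```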

Leading Edge Analysis :::

[Figure: Leading Edge Analysis window with four panels: Heat Map, Set-to-Set, Gene in Subsets and Histogram.]

Heat Map

The heat map shows the (clustered) genes in the leading edge subsets. In a heat map, expression values are represented as colors, where the range of colors (red, pink, light blue, dark blue) shows the range of expression values (high, moderate, low, lowest).

Set-to-Set

The graph uses color intensity to show the overlap between subsets: the darker the color, the greater the overlap between the subsets. When you compare a leading edge subset to itself, its members completely overlap, so the corresponding cell is dark green. When you compare two subsets that have no overlapping members, the corresponding cell is white.

Gene in Subsets

The graph shows each gene and the number of subsets in which it appears.

Histogram

The last plot is a histogram of the Jaccard index, which is the intersection divided by the union for a pair of leading edge subsets. Number of Occurrences is the number of leading edge subset pairs in a particular bin. In this example, most subset pairs have no overlap (Jaccard = 0).
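The overlap measure behind the histogram can be sketched as below: the Jaccard index of every pair of leading edge subsets. The function names and the example subsets are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Intersection size divided by union size; 0.0 when both sets are empty."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def pairwise_jaccard(subsets):
    """Return {(set_x, set_y): jaccard} for every pair of named subsets."""
    return {
        (x, y): jaccard(subsets[x], subsets[y])
        for x, y in combinations(sorted(subsets), 2)
    }


leading_edges = {
    "SET_A": {"TP53", "MYC"},
    "SET_B": {"MYC", "EGFR"},
    "SET_C": {"KRAS"},
}
print(pairwise_jaccard(leading_edges))
```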

::: GSEA & FatiScan.

Detects significant functions with Gene Ontology, InterPro motifs, Swiss-Prot keywords and KEGG pathways in lists of genes ordered according to different characteristics.

::: GSEA & Whichgenes.

http://www.whichgenes.org

- Retrieve miRNA targets for Gene Set Enrichment Analysis (miRBase, TargetScan)
- Always updated!

Log in if you want to download and store your gene sets.

Enter if you simply want to download gene sets.

Retrieving targets :::

1. Choose organism: Human, Mouse.
2. Select source: miRBase, TargetScan, other sources.
3. Copy and paste miRNA identifiers. Create set per items.
4. Job name.

Then click Create Sets.

Looking for examples? Try a preloaded example!

Gene Sets Cart :::

1. Choose gene sets for downloading.
2. Select output format, e.g. .CSV, .TSV, .gmt, .gmx
3. Select identifier, e.g. Agilent, Affy, Mgi…
4. Download gene sets.

Thanks!

ggomez@cnio.es