Upload
lesley-logan
View
214
Download
0
Tags:
Embed Size (px)
Citation preview
Pathway analysis
Daniel Hurley
Pathway analysis: summary
A popular buzzword…but what does it mean?
How do you do it?
How do you interpret the results?
First
What we mean by ‘pathway analysis’
• A ‘pathway’ implies causation, but don’t be fooled!
What we mean by ‘pathway analysis’
• A ‘pathway’ implies causation, but don’t be fooled!
• Most ‘pathway analysis’ actually identifies groups of functionally similar transcripts.
• Louis’ example:
(http://gather.genome.duke.edu/)
• A useful paper….
What we mean by ‘pathway analysis’
But the conclusion is: lots of tools, some quite different
approaches!
• Pathway analysis tools like GATHER, DAVID, and GeneSetDB typically rely on enrichment analyses to tell us things.
• This set of techniques asks the question ‘of this set of genes, how many share any particular function, and is that more than we would expect by chance?’
• Example: the top 200 most differentially-expressed genes by some ranking (e.g. adjusted p-value)
• Determination of ‘by chance’ is usually done using a permutation (= Monte Carlo) approach
• Other ‘pathway analyses’ involve signatures of groups of transcripts (e.g. using Principal Component Analysis)
What we mean by ‘pathway analysis’
What we mean by ‘pathway analysis’
But what do we mean by a ‘function’?
Lots of things:
Protein function
Hypothetical protein function
Chromosomal location
Metabolic pathway association
Disease association
Daunting
The key point
• Pathway analysis can identify common features present in a group of transcripts
• What the output means depends on the specific biology under study
• No such thing really as a ‘general’ pathway analysis
• A good place to start is by finding papers relevant to the specific biology
What we mean by ‘pathway analysis’
What can you do with it?Some answers:
Get a general picture of the
active functions in a condition (vs.
control)
Investigate whether a particular
function is active in a condition
Differentiate conditions by their active functions
Investigate the functions
associated with a particular gene
Identify conditions with
similar functions
Next
• Begin with a list of transcripts of interest
Pathway analysis: how you do it
• Choose a web-based tool: GATHER, DAVID and GeneSetDB are good ones to start
Pathway analysis: how you do it
But Pathguide.org has 325 pathway links at last count
• Enter the list of transcripts: with most tools, you will either paste in gene names or identifiers, or upload a file
Pathway analysis: how you do it
Finally
• Basic tools will produce ranked lists of the most ‘enriched’ categories:
Pathway analysis: interpreting results
GATHER
• More sophisticated ones will produce ‘network’ diagrams
Pathway analysis: interpreting results
DAVID
Ingenuity Pathways AnalysisBut the interpretation of these
is rather subjective
Summary• Pathway analysis should probably be called
information enrichment analysis – a more accurate term
• Used prudently, it is a useful tool for exploring the functional landscape of an experiment
• To make it meaningful, you need to interpret the results in the context of the specific biology under study
• There are a lot of web-based tools; start with one which is current and produces a result you value
• To start, you need a set of (transcripts) of interest
• To present the results, you can use a simple table, or a more complex ‘network’ diagram
• Risk: false-positives are very difficult to identify, and with enough data you can link any molecular species to any other species
FIN
Any questions?