65
Bridging the Gap Between Pathways and Experimental Data Alexander Lex

Bridging the Gap Between Pathways and Experimental Data Alexander Lex

Embed Size (px)

Citation preview

Page 1: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

Bridging the Gap Between Pathways and Experimental Data

Alexander Lex

Page 2: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

2

Experimental Data and Pathways

Pathways represent consensus knowledge for a healthy organism or specific disease

Cannot account for variation found in real-world data

Branches can be (in)activated due to mutation,

changed gene expression,

modulation due to drug treatment,

etc.Alexander Lex | Harvard University

Page 3: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

3

Why use Visualization?

Efficient communication of information

A -3.4

B 2.8

C 3.1

D -3

E 0.5

F 0.3Alexander Lex | Harvard University

C

B

D

F

A

E

Page 4: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

4

Experimental Data and Pathways

[Lindroos2002]

[KEGG]

Alexander Lex | Harvard University

Page 5: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

5

Visualization Approaches

On-Node Mapping Separate Linked Views Small Multiples

Layout Adaption Linearization

[Mey

er 2

010]

[Jun

ker 2

006]

[Lin

droo

s 20

02]

Alexander Lex | Harvard University

Path-Extraction

Page 6: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

Alexander Lex | Harvard University 6

REQUIREMENTS ANALYSIS

Teaser Picture

Page 7: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

7

What to Consider when Visualizing Experimental Data and Pathways

Conflicting GoalsPreserving topology of pathways

Showing lots of experimental data

Five RequirementsIdeal visualization technique addresses all

Alexander Lex | Harvard University

Page 8: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

8

R I: Data Scale

Large number of experimentsLarge datasets have more than 500 experiments

Multiple groups/conditions

Alexander Lex | Harvard University

Page 9: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

9

R II: Data Heterogeneity

Different types of data, e.g.,mRNA expression numerical

mutation statuscategorical

copy number variation ordered categorical

metabolite concentration numerical

Require different visualization techniques

Alexander Lex | Harvard University

Page 10: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

10

R III: Multi-Mapping

Pathways nodes are biomoleculesProteins, nucleic acids, lipids, metabolites

Experimental data often on a „gene“ level

Multiple genes can produce protein

Multiple genes encode one protein

Result: many „gene“ values map to one pathway nodeAlexander Lex | Harvard University

C

E

E1

E2

E3

E4

CA3

KJ2

RAF

Page 11: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

11

R IV: Preserving the Layout

Pathways are available in carefully designed layouts

e.g., KEGG, WikiPathways, Biocarta

Users are familiar with layouts

Goal: preserve layouts as much as possible

Two approaches: Emulate drawing conventions

Use original layoutsAlexander Lex | Harvard University

Page 12: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

12

R V: Supporting Multiple Tasks

Two central tasks:Explore topology of pathway

Explore the attributes of the nodes (experimental data)

Need to support both!

Alexander Lex | Harvard University

C

B

D

F

A

E

Page 13: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

Alexander Lex | Harvard University 13

VISUALIZATION TECHNIQUES

Teaser Picture

Page 14: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

14

Visualization Approaches

Separate Linked Views Small Multiples

Layout Adaption Linearization

[Mey

er 2

010]

[Jun

ker 2

006]

Alexander Lex | Harvard University

Path-Extraction

On-Node Mapping

[Lin

droo

s 20

02]

Page 15: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

15

On-Node Mapping

Alexander Lex | Harvard University [Lindroos2002]

Page 16: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

16

On-Node Mapping

Alexander Lex | Harvard University

[Westenberg 2008]

[Gehlenborg 2010]

Page 17: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

17

On-Node & Tooltip

Alexander Lex | Harvard University

[Streit 2008]

Page 18: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

18

On-Node Mapping

Not scalableespecially when used with „original“ layout

animation not an alternative

Good for overview with homogeneous data

Excellent for topology-based tasks

Bad for attribute-based tasks

Alexander Lex | Harvard University

Page 19: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

19

On-Node Mapping Reflection

R I (Scale)bad if working with static layouts

limited when working with layout adaption

R II (Heterogeneity)bad – can‘t encode multiple datasets

R III (Multi-Mapping)bad – can‘t encode multiple mappings

Alexander Lex | Harvard University

Page 20: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

20

On-Node Mapping Reflection

R IV (Layout-Preservation)excellent!

R V (Multiple Tasks)excellent for topology-based tasks

bad for attribute-based tasks

Alexander Lex | Harvard University

Page 21: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

21

[Lin

droo

s 20

02]

On-Node Mapping

Visualization Approaches

Small Multiples

Layout Adaption Linearization

[Mey

er 2

010]

[Jun

ker 2

006]

Alexander Lex | Harvard University

Path-Extraction

Separate Linked Views

Page 22: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

22

Separate Linked Views

Alexander Lex | Harvard University

[Shannon 2008]

Page 23: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

23

Separate Linked Views

Alexander Lex | Harvard University

Page 24: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

24

Separate Linked Views

Alexander Lex | Harvard University

Page 25: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

25

Separate Linked Views Reflection

R I (Scale)excellent for large numbers of attributes

R II (Heterogeneity)excellent for heterogeneous data

e.g., one view per data type

R III (Multi-Mapping)good – simple highlighting for multiple elements

Alexander Lex | Harvard University

Page 26: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

26

Separate Linked Views Reflection

R IV (Layout-Preservation)excellent!

R V (Multiple Tasks)good for topology-based tasks

good for attribute-based tasks

awful for combining them!Association node-attribute only one by one

Alexander Lex | Harvard University

Page 27: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

27

Separate Linked Views

[Lin

droo

s 20

02]

On-Node Mapping

Visualization Approaches

Layout Adaption Linearization

[Mey

er 2

010]

[Jun

ker 2

006]

Alexander Lex | Harvard University

Path-Extraction

Small Multiples

Page 28: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

28

Small Multiples

Alexander Lex | Harvard University

Page 29: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

29

Small Multiples

Alexander Lex | Harvard University [Barsky 2008]

Video!

Page 30: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

30

Small Multiples Reflection

R I (Scale)limited to a handful of conditions/experiments

differences don‘t „pop out“

R II (Heterogeneity)limited for heterogeneous data

e.g., one view per data type

R III (Multi-Mapping)bad – no obvious solution

Alexander Lex | Harvard University

Page 31: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

31

Small Multiples Reflection

R IV (Layout-Preservation)excellent!

R V (Multiple Tasks)good for topology-based tasks

limited for attribute-based tasks

limited for combining them!comparing one by one -> change blindness

Typically requires „focus duplicate“

Alexander Lex | Harvard University

Page 32: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

32

Separate Linked Views

[Lin

droo

s 20

02]

On-Node Mapping

Visualization Approaches

Small Multiples

Linearization

[Mey

er 2

010]

Alexander Lex | Harvard University

Path-ExtractionLayout Adaption

[Jun

ker 2

006]

Page 33: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

33

Layout Adaption

„Moderate“ Layout Adaptionmake space for on-node encoding

Alexander Lex | Harvard University

[Gehlenborg 2010][Junker 2006]

Page 34: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

34

Layout Adaption

„Extreme“ layout adaptionencode information throughposition

Alexander Lex | Harvard University [Bezerianos 2010]

Video: http://www.youtube.com/watch?v=NLiHw5B0Mco

Page 35: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

35

Layout Adaption Reflection

R I (Scale)limited to a handful of conditions/experiments

R II (Heterogeneity)limited for heterogeneous data

Different story for „extreme“ layout adaption

R III (Multi-Mapping)OK– give nodes with multi-mappings extra space

Alexander Lex | Harvard University

Page 36: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

36

Layout Adaption Reflection

R IV (Layout-Preservation)not possible

R V (Multiple Tasks)limited for topology-based tasks

limited for attribute-based tasks

limited for combining them!space for trade-off between topology and attribute tasks

Alexander Lex | Harvard University

Page 37: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

37

Layout Adaption

[Jun

ker 2

006]

Separate Linked Views

[Lin

droo

s 20

02]

On-Node Mapping

Visualization Approaches

Small Multiples

Alexander Lex | Harvard University

Path-ExtractionLinearization

[Mey

er 2

010]

Page 38: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

38

Linearization – Pathline

Alexander Lex | Harvard University

[Meyer 2010]

Combination oflayout adaption

separate linked views

Page 39: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

39

Linearization

Alexander Lex | Harvard University

[Meyer 2010]

Page 40: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

40

Linearization Reflection

R I (Scale)good for many experiments

R II (Heterogeneity)good for multiple datasets

R III (Multi-Mapping)good – give nodes with multi-mappings extra space

Alexander Lex | Harvard University

Page 41: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

41

Linearization Reflection

R IV (Layout-Preservation)not possible

R V (Multiple Tasks)limited for topology-based tasks

limited for attribute-based tasks

limited for combining them!

Manual creation of linearized version

Unclear if suitable for more complex pathwaysAlexander Lex | Harvard University

Page 42: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

42

Visualization Approaches

On-Node Mapping Separate Linked Views Small Multiples

Layout Adaption Linearization

[Mey

er 2

010]

[Jun

ker 2

006]

[Lin

droo

s 20

02]

Alexander Lex | Harvard University

Path-Extraction

Page 43: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

43

CALEYDO ENROUTE

Alexander Lex | Harvard University

Page 44: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

44

Pathway View

A

E

C

B

D

F

Pathway View

C

B

D

F

A

E

enRoute View

Concept

Group 1Dataset 1

Group 2Dataset 1

Group 1Dataset 2

B

C

F

A

D

E

Alexander Lex | Harvard University

Page 45: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

45

Pathway View

On-Node Mapping

Path highlighting with Bubble Sets [Collins2009]

SelectionStart- and end node

Iterative adding of nodes

IGF-1

low high

Alexander Lex | Harvard University

Page 46: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

46

enRoute View – Path Representation

• Design of KEGG [Kanehisa2008]

• Abstract branch nodes– Additional topological

information– Incoming vs. outgoing

branches– Expandable

• Branch switching

Alexander Lex | Harvard University

Page 47: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

47

Experimental Data Representation

Gene Expression Data (Numerical)

Copy Number Data (Ordered Categorical)

Mutation Data

Alexander Lex | Harvard University

Page 48: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

48

enRoute View – Putting All Together

Alexander Lex | Harvard University

Page 49: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

49

Video!

Alexander Lex | Harvard University

http://enroute.caleydo.org

Page 50: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

50

Glioblastoma Multiforme Example

Alexander Lex | Harvard University

Page 51: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

51

Glioblastoma Multiforme Example

Alexander Lex | Harvard University

Page 52: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

52

enRoute Reflection

R I (Scale)Excellent, can handle large amounts of data

R II (Heterogeneity)Excellent, can handle various datasets

R III (Multi-Mapping)Excellent, can resolve multi-mappings without ambiguity

Alexander Lex | Harvard University

Page 53: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

53

enRoute Reflection

R IV (Layout-Preservation)Excellent - preserves pathway layout

Not preserved in extracted path

R V (Multiple Tasks)Good for topology-based tasks

High-level topology through pathway view

Topology of path in enRoute view

Excellent for attribute-based tasksCan handle large, grouped and heterogeneous data

Alexander Lex | Harvard University

Page 54: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

54

Using enRoute

enRoute part of Caleydo Biomolecular Visualization Framework

http://caleydo.org

Caleydo is free for all – open source project

More in Marc‘s talk!

Alexander Lex | Harvard University

Page 55: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

55

SUMMARY & RECOMMENDATIONS

Alexander Lex | Harvard University

Teaser Picture

Page 56: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

56

Which to use?

On-Node Mapping Separate Linked Views Small Multiples

Layout Adaption Linearization

[Mey

er 2

010]

[Jun

ker 2

006]

[Lin

droo

s 20

02]

Alexander Lex | Harvard University

Path-Extraction

Page 57: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

57

Use Technique that fits your task

Topology is important

One experimental condition

Alexander Lex | Harvard University

On-Node Mapping

[Lindroos2002]

Page 58: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

58

Use Technique that fits your task

Topology is important

Size of graph is limited

Handful of conditions

Alexander Lex | Harvard University

Small Multiples

Page 59: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

59

Use Technique that fits your task

Experimental data is critical

Pathways are a “sideshow”

Alexander Lex | Harvard University

[Shannon 2008]

Separate Linked Views

Page 60: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

60

Use Technique that fits your task

Topology & experimental data is important

Data is heterogeneous

Alexander Lex | Harvard University

Path Extraction

Page 61: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

61

What’s Nice About That?

Caleydo supports all of them ;)

Alexander Lex | Harvard University

Page 62: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

62

FUTURE CHALLENGES

Alexander Lex | Harvard University

Teaser Picture

Page 63: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

63

Other Pathway-Related Challenges

Cross-connections between pathways

Alexander Lex | Harvard University [Klukas 2007]

Page 64: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

64

Other Pathway-Related Challenges

Effect of compounds (medication) on pathways

Alexander Lex | Harvard University

[Lounkine 2012]

Page 65: Bridging the Gap Between Pathways and Experimental Data Alexander Lex

65

Bridging the Gap

Between Pathways and Experimental Data

Alexander Lex, Harvard [email protected]://caleydo.org

?Marc Streit

Hans-Jörg Schulz Christian Partl

Dieter SchmalstiegPeter J. Park

Nils Gehlenborg

Alexander Lex | Harvard University