42
(Designing) Interactive Visualisations to Solve Analytical Problems (in biology) CAGATAY TURKAY, giCentre, City University London

Designing Interactive Visualisations to Solve Analytical Problems in Biology

Embed Size (px)

Citation preview

Page 1: Designing Interactive Visualisations to Solve Analytical Problems in Biology

(Designing)

Interactive

Visualisations to

Solve Analytical

Problems (in biology) CAGATAY TURKAY,

giCentre, City University London

Page 2: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Who?

• Lecturer in Applied Data Science @ the giCentre, CUL

• PhD @ VisGroup at Univ. of Bergen, Norway

• Research interests:– Integrating Computational Tools in Interactive Visual Analysis

Methods

– Perceptually Optimized Visualization

• Methods for several domains:– Biology, transport, intelligence, neuroscience

Page 3: Designing Interactive Visualisations to Solve Analytical Problems in Biology

giCentre (www.giCentre.net)

• 6 academics

• 2 researchers

• 5 PhDs

Page 4: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Data supported science

• Data analysis in almost all scientific fields

–Biology, medicine, astronomy, psychology,…

• Data driven science

• Research in several fields

–Visualization

–Data Mining

–Machine Learning

–Statistics

Page 5: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Visualization ?

“Computer-based visualization systems

provide visual representations of datasets

designed to help people carry out tasks more

effectively.” [Tamara Munzner, 2014]

“The use of computer-generated, interactive, visual

representations of data to amplify cognition”[Card,

Mackinlay, & Shneiderman 1999]

Page 6: Designing Interactive Visualisations to Solve Analytical Problems in Biology

VIS -- a mature field already

Page 7: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Biological data + VIS:

A good synergy

.. but why?

Page 8: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Why biology is interesting for VIS?

Datasets are large & heterogeneous

Yeast Protein interaction network, Barabási & Oltvai, 2004

Clustering miR expressions

http://gdac.broadinstitute.org/

Page 9: Designing Interactive Visualisations to Solve Analytical Problems in Biology
Page 10: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Why biology is interesting for VIS?

Things happen at multiple scales

[ by O’Donoghue et al., 2010]

[Nye, 2008]

Page 11: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Why biology is interesting for VIS?

Processes are dynamic (spatio-temporal complexity)

Neutrophil chasing a bacteria by David Rogers

Page 12: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Why biology is interesting for VIS?

• Computational methods are central in analysis

–Uncertainties hinder reliability

– Interpretation is a problem (black-box alg., little

context)

Comprehensive molecular portraits of human breast tumours, TCGA Network, Nature, 2012

Page 13: Designing Interactive Visualisations to Solve Analytical Problems in Biology

How can visualisation help?

• Ease of cognition & communication

• Relating multiple aspects

• Compare multiple computational outputs

• Investigate uncertainties

• Seamless integration of computation

and …

• Enable & foster hypothesis generation

Page 14: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Forms of visualisation support

VIS as a presentation medium

+

VIS with interaction

+

VIS with integrated computations

Page 15: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Visualisation as a

presentation medium

Page 16: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Cross-section of Escherichia coli cell, Illustration by David S. Goodsell, the Scripps Research Institute

Page 17: Designing Interactive Visualisations to Solve Analytical Problems in Biology

106 diffusing and reacting molecules in real-time, Muzic et al., 2014

Page 18: Designing Interactive Visualisations to Solve Analytical Problems in Biology

NATURE METHODS: POINTS OF VIEW, by Wong et al.

http://blogs.nature.nom/methagora/2013/07/data-visualization-points-of-view.html

Page 19: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Why is VIS good here?

• Analysts’ perceptual & cognitive capabilities

• Better interpretation

• Communication

Page 20: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Visualisation

with interaction

Page 21: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Example: MizBee - Synteny Browser

Meyer et al., MizBee: A Multiscale Synteny Browser, 2009

Page 22: Designing Interactive Visualisations to Solve Analytical Problems in Biology
Page 23: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Why is VIS good here?

• Linking multiple aspects

• Interactively varying the focus

• Display multiple-scales concurrently

Page 24: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Visualisation with

integrated computations

Page 25: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Combine the best of two worlds: human capabilities and

power

Facilitate the informed use of

computation through interactive visual methods

(a.k.a. Visual Analytics)

Page 26: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Example: StratomeX, Caleydo

http://caleydo.org

Page 27: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Pat

ien

ts (

sam

ple

s)

Genes

Candidate Subtype /Heat Map

Header /Summary of whole Stratification

Cancers have subtypes• different histology• different molecular alterations

Subtypes are identified by stratifying datasets, e.g.,

• based on an expression pattern• a mutation status• a copy number alteration• a combination of these

Case: Cancer Subtype Analysis

Page 28: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Multiple Stratifications

Many shared Patients

Clustering 1 Clustering 2

Sample Overlaps

Page 29: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Dependent PathwaysSlide by Alex Lex

Page 30: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Slide by Alex Lex

Page 31: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Multiple Stratifications (again)

Many shared Patients

Clustering 1 Clustering 2

Sample OverlapsG

en

e O

verl

ap

s ??

Page 32: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Finding distinctive genes

Characterizing cancer subtypes using dual analysis in Caleydo StratomeX, Turkay et al., IEEE CG&A, 2014

Page 33: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Finding distinctive genes (ex. BRCA types)

[*] Cancer Genome Atlas Network. (2012). Comprehensive molecular portraits of human breast tumours. Nature, 490(7418), 61-70.

Luminal-A

underexpressed genes

Luminal-A

overexpressed genes

Basal-like

overexpressed

Basal-like

underexpressed

Page 34: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Ex: Cavity analysis in molecular simulations

Cavities on molecular surfaces

• Important in ligand binding

• Drug design, etc.

Long molecular simulations

Cavities are dynamic, hard to track

Amino-acids to characterize the

cavity

• hydrophobicity (grey)

• polarity (green)

• positively charged (blue)

• negatively charged (red) Visual Cavity Analysis in Molecular Simulations

J. Parulek, C. Turkay, N. Reuter, I. Viola. BMC Bioinformatics, 2013.

Page 35: Designing Interactive Visualisations to Solve Analytical Problems in Biology

1. Run the simulation

2. Fit graphs cavities

3. Compute measures

4. Find touching amino-acids

5. Perform visual analysis

Analysis of Proteinase 3

Page 36: Designing Interactive Visualisations to Solve Analytical Problems in Biology
Page 37: Designing Interactive Visualisations to Solve Analytical Problems in Biology

A hydrophobic cavity

Page 38: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Why is VIS good here?

• Multiple linked data sets – improve interpretation

• Multiple computational results – deal with

uncertainty

• Integrate computation outputs, i.e., clusters, derived

data

• Allows a fast-paced iterative process

• Quick idea prototyping

Page 39: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Wrap up !

VIS as a presentation medium

+

VIS with interaction

+

VIS with integrated computations

Page 40: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Visualisation is very good to answer

HOW & WHY?questions ..

- How do these genomes overlap?

- Why is this a cluster?

....

Page 41: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Outlook

• Interaction and explorative analysis is key!

• Seamless support from integrated computation, i.e., t-tests

• Visual analysis as an everyday tool for analysts

Page 42: Designing Interactive Visualisations to Solve Analytical Problems in Biology

Thanks ! (& more biovis ?)

http://www.biovis.net

#biovis

Paper deadline: February 15, 2015

Data & Design Contests: May 1, 2015

• VisGroup (Helwig Hauser, Julius Parulek & Ivan Viola) and

Nathalie Reuter from University of Bergen

• Caleydo team (Alex Lex, Hanspeter Pfister, Nils Gehlenborg, Marc Streit)