42
Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Embed Size (px)

Citation preview

Page 1: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Visualization

Blaz Zupan

Faculty of Computer & Info ScienceUniversity of Ljubljana, Slovenia

Page 2: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Visual Data Mining

• Basic idea– visual presentation of the data– gain insight & generate hypothesis– draw conclusions– directly interact with data

• Include human in the data exploration process– use her/his flexibility– creativity– general knowledge

Page 3: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Benefits of Visualization

• involvement of the user• results are intuitive

– no need for understanding complex mathematical or statistical algorithms or parameters

• provision of qualitative overview of the data– can isolate specific patterns for further

quantitative analysis

• can deal with non-homogenous, noisy data

Page 4: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Visual Exploration Paradigm

Overview first, zoom & filter, and then details on demand.

Overview first, zoom & filter, and then details on demand.

Page 5: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Visual Exploration Paradigm

Overview first, zoom & filter, and then details on demand.

Overview first, zoom & filter, and then details on demand.

Page 6: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Classification

one-dimensional

two-dimensional

multi-dimensional

text, web content

networks

other (e.g. algorithms/software, ...)

Data Type

Interaction & DistortionTechnique

VisualizationTechnique

Standard

Projection

Filtering

Zoom

Distortion

Link & Brush

Standard 2D/3D Display

Geometrically Transformed Display

Iconic Display

Dense Pixel Display

Stacked Display

from D Keim & M Ward: Visualization, in Intelligent Data Analysis, M Berthold & DJ Hand (eds), Springer, 2003.

Page 7: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: One-Dimensional

R Bellazzi: Mining Biomedical Time Series by Combining Structural Analysis and Temporal Abstractions,In Proc. of AMIA 1998.

Page 8: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: Two-Dimensional

MineSet’s Map Visualizer.

Page 9: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: Multi-Dimensional

Page 10: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: Text • Galaxies visualization

• Uses the “night sky” visualization to represent a set of documents

• One document – one star

• Stars clustered together represent related documents

• Includes analytical tools to investigate groups and time-based trends, query contents

From Inspire (TM) Software, seewww.pnl.gov/infoviz/technologies.html

Page 11: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: Text • ThemeView (TM)• Topics or themes

of text documents shown in relief map of a natural terrain

• The height of a peek relates to the strength of the topic

From Inspire (TM) Software, seewww.pnl.gov/infoviz/technologies.html

Page 12: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: Text • Theme River (TM)

• Identification of time related trends and patterns

• Themes represented as colored streams

• The width of the stream relates to the collective strength of a theme

From Inspire (TM) Software, seewww.pnl.gov/infoviz/technologies.html

Page 13: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: Networks

E. coli metabolic network (colors denote predominant biochemical class of metabolites)Ravasz et al., Science 297, 30 Aug 2002.

S. cerevisiae gene interaction networkTong et al., Science 303, 6 Feb 2004.

V Batagelj, A Mrvar: Pajek @ vlado.fmf.uni-lj.si/pub/networks/pajek/

Page 14: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Data: Tree Hierarchies

Unix home directory Selected detail

Kleiberg et al.: Botanic Visualization of Huge Hierarchies, In InfoVis, 2001.

Page 15: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Classification

one-dimensional

two-dimensional

multi-dimensional

text, web content

networks

other (e.g. algorithms/software, ...)

Data Type

Interaction & DistortionTechnique

VisualizationTechnique

Standard

Projection

Filtering

Zoom

Distortion

Link & Brush

Standard 2D/3D Display

Geometrically Transformed Display

Iconic Display

Dense Pixel Display

Stacked Display

Page 16: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Standard 2D/3D

• x-y (x-y-z) plots

• bar charts• line graphs• histograms• maps

Page 17: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Standard 2D/3D

• x-y (x-y-z) plots

• bar charts• line graphs• histograms• maps

Page 18: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Standard 2D/3D

• x-y (x-y-z) plots

• bar charts• line graphs• histograms• maps

Page 19: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Geom.-Transformed Displays

• includes several classes of visualizations

• projection pursuit, finding “interesting transformations” of multi-dim data set

• scatterplot matrix

• parallel coordinates

Page 20: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Iconic Displays

W Horn et al.: Metaphor graphics to visualize ICU data over time, In IDAMAP 1998.

Page 21: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Dense Pixel Displays

DA Keim et al.: Recursive Pattern: A technique for visualizing very large amounts of dataProc. Visualization 95, pages 279-286, 1995.

Page 22: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Dense Pixel Displays

Ankerst et al.: Circle Segments: A technique for visually exploring large multidimensional data sets.In Proc. Visualization 96, Hot Topic Session, 1996.

Page 23: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Stacked Displays

J LeBlanc et al.: Exploring n-dimensional databases. In Proc. Visualization 90, pages 230-239, 1990.

• an example is dimensional stacking

• embed one coordinate system within the other

• e.g. two attributes in one system, then another two when drilling down

Page 24: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Stacked Displays

Decision table visualization from SGI’s MineSet

Page 25: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Stacked Displays

Mosaic display in Orange.

Page 26: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Classification

one-dimensional

two-dimensional

multi-dimensional

text, web content

networks

other (e.g. algorithms/software, ...)

Data Type

Interaction & DistortionTechnique

VisualizationTechnique

Standard

Dynamic Projection

Filtering

Zoom

Distortion

Link & Brush

Standard 2D/3D Display

Geometrically Transformed Display

Iconic Display

Dense Pixel Display

Stacked Display

Page 27: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Interaction Techniques

• Dynamic projection– dynamically change the

projections to explore multi-dimensional data sets

– projection pursuit, which finds well-separated clusters in scatterplot

• Interactive Filtering– browsing, can be

difficult for big data sets– querying, need to

specify a subset

• Zooming• Distortion

– e.g., fisheye view

• Brushing and linking– requires well-

integrated system for visualization

– selection from one visualization is fed into another one, selected instances highlighted in some way

Page 28: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Distortion

GW Furnas: Generalized Fisheye Views, Human Factors in Computing Systems CHI ‘86 Conference Proceedings, 16-23. 1986.

Page 29: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Distortion

From M Grobelnik, P Krese, D Mladenic: Project Intelligence (http://pi.ijs.si)

Page 30: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Distortion

From M Grobelnik, P Krese, D Mladenic: Project Intelligence (http://pi.ijs.si)

Page 31: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Distortion

From M Grobelnik, P Krese, D Mladenic: Project Intelligence (http://pi.ijs.si)

Page 32: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Brushing & Linking

Page 33: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Integration ofVisualization & Data Mining

1. Visualization techniques can be applied before (or independently) of DM

2. DM can be used to find patterns (or data subsets) that are further visualized

3. DM is interactive, users use visualization to guide the pattern search

4. Visualization of data mining models

Page 34: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Regression Tree

Regression tree visualization in SGI’s MineSet.

Page 35: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Classification Tree

Classification tree visualization in Orange.

Page 36: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Brushing: Trees & Scatter Plots

Page 37: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Sieve Diagram (Classification)

Page 38: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Nomograms

Page 39: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Intelligent Data Visualization

• Use an established visualization technique, but search for– interesting subset of attributes– interesting subset of data instances– interesting projection (how to use selected

attributes in visualization)

• All these to find “interesting” visualization

• Removes the burden for the user to find such visualizations by hand

Page 40: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Arrangement for Circle Segments

M Ankerst: Visual data mining with pixel-oriented techniques, In Proc. KDD, 2001.

Page 41: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

VizRank

Page 42: Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia

Conclusion

• Clarity of presentation• Aesthetics• Navigation & Interaction

• In data with many dimensions, tools are needed to find only “interesting” visualizations