41
1/41 Visualization and Analysis of Text Remco Chang, PhD Assistant Professor Department of Computer Science Tufts University December 17, 2010 Cologne, Germany

Visualization and Analysis of Text

  • Upload
    cyma

  • View
    54

  • Download
    0

Embed Size (px)

DESCRIPTION

Visualization and Analysis of Text. Remco Chang, PhD Assistant Professor Department of Computer Science Tufts University December 17, 2010 Cologne, Germany. Introduction. Information Visualization Novel visual representations Storytelling User-Driven Visual Analysis Data exploration - PowerPoint PPT Presentation

Citation preview

Page 1: Visualization and Analysis of Text

1/41

Visualization and Analysis of Text

Remco Chang, PhDAssistant Professor

Department of Computer ScienceTufts University

December 17, 2010Cologne, Germany

Page 2: Visualization and Analysis of Text

2/41

CMVVis Examples P Topics

Introduction

• Information Visualization– Novel visual representations– Storytelling– User-Driven

• Visual Analysis– Data exploration– Hypotheses generation– Interactive visualization + Computation

Page 3: Visualization and Analysis of Text

3/41

CMVVis Examples P Topics

Visualization

• Pre-attentive Processing

Examples courtesy of Chris Healey

Page 4: Visualization and Analysis of Text

4/41

CMVVis Examples P Topics

Visualization• This is helpful

because:

– It allows us to process more information quickly

– We can see trends and patterns

Page 5: Visualization and Analysis of Text

5/41

CMVVis Examples P Topics

Storytelling

• US Budget from 1961 - 2008

Page 6: Visualization and Analysis of Text

6/41

CMVVis Examples P Topics

Storytelling

• Minard’s Map:

• Napolean’s March to Moscow

Page 7: Visualization and Analysis of Text

7/41

CMVVis Examples P Topics

Visualization

• Influences the thought…

Images courtesy of Barbara Tversky

Page 8: Visualization and Analysis of Text

8/41

CMVVis Examples P Topics

Visual Encoding

• Affects the:

– Types of possible operations

– The user’s thinking process

Zhang and Norman. The Representation Of Numbers. Cognition. (1995)

Page 9: Visualization and Analysis of Text

9/41

CMVVis Examples P Topics

Classifying Numeric Systems

Page 10: Visualization and Analysis of Text

10/41

CMVVis Examples P Topics

Example: Arithmetic

Slide courtesy of Pat Hanrahan

Page 11: Visualization and Analysis of Text

11/41

CMVVis Examples P Topics

Example: Arithmetic

Page 12: Visualization and Analysis of Text

12/41

CMVVis Examples P Topics

Example: Arithmetic

Page 13: Visualization and Analysis of Text

13/41

CMVVis Examples P Topics

Example: Arithmetic

Page 14: Visualization and Analysis of Text

14/41

CMVVis Examples P Topics

Examples of Text Visualization

• Wordle

Images Courtesy of Many Eyes

Page 15: Visualization and Analysis of Text

15/41

CMVVis Examples P Topics

Examples of Text Visualization

• WordTree

Page 16: Visualization and Analysis of Text

16/41

CMVVis Examples P Topics

Examples of Text Visualization

• WordTree

Page 17: Visualization and Analysis of Text

17/41

CMVVis Examples P Topics

Examples of Text Visualization

• Phrase Net

Page 18: Visualization and Analysis of Text

18/41

CMVVis Examples P Topics

Examples of Text Visualization

• Google Auto-Complete

Page 19: Visualization and Analysis of Text

19/41

CMVVis Examples P Topics

Examples of Text Visualization

• Visualizing changes in Wikipedia

Images Courtesy of Info.fm

Page 20: Visualization and Analysis of Text

20/4120/37

CMVVis Examples P Topics

Examples of Text Visualization

• ThemeRiver

Page 21: Visualization and Analysis of Text

21/41

CMVVis Examples P Topics

Visual Exploration

• Coordinated Multi-Views (CMV)

Where

When

Who

What

Original Data

EvidenceBox

Page 22: Visualization and Analysis of Text

22/41

CMVVis Examples P Topics

WHY?

This group’s attacks are not bounded by geo-locations but instead, religious beliefs.

Its attack patterns changed with its developments.

Coordinated Multi-Views

Page 23: Visualization and Analysis of Text

23/41

CMVVis Examples P Topics

LIDAR Linked Feature Space

23/37

Page 24: Visualization and Analysis of Text

24/41

CMVVis Examples P Topics

LIDAR Change Detection

24/37

Page 25: Visualization and Analysis of Text

25/41

CMVVis Examples P Topics

Urban Model

25/37

Page 26: Visualization and Analysis of Text

26/41

CMVVis Examples P Topics

Urban Visualization

26/37

Page 27: Visualization and Analysis of Text

27/41

CMVVis Examples P Topics

Coordinated Multi-Views• Financial Wire Fraud

– With Bank of America– Discover suspicious

international wire transactions

• Bridge Maintenance – With US DOT– Exploring subjective

inspection reports

• Biomechanical Motion– With U. Minnesota and

Brown– Interactive motion

comparison methods

Page 28: Visualization and Analysis of Text

28/41

CMVVis Examples P Topics

Coordinated Multi-Views• Financial Wire Fraud

– With Bank of America– Discover suspicious

international wire transactions

• Bridge Maintenance – With US DOT– Exploring subjective

inspection reports

• Biomechanical Motion– With U. Minnesota and

Brown– Interactive motion

comparison methods

Page 29: Visualization and Analysis of Text

29/41

CMVVis Examples P Topics

Coordinated Multi-Views• Financial Wire Fraud

– With Bank of America– Discover suspicious

international wire transactions

• Bridge Maintenance – With US DOT– Exploring subjective

inspection reports

• Biomechanical Motion– With U. Minnesota and

Brown– Interactive motion

comparison methods

Page 30: Visualization and Analysis of Text

30/41

CMVVis Examples P Topics

CMV + Text Analysis

Page 31: Visualization and Analysis of Text

31/41

CMVVis Examples P Topics

Parallel Topics

• Task: Given the proposals submitted to the National Science Foundation (NSF), identify:

– Proposals that are interdisciplinary– Proposals that are potentially transformative– Proposals that are focused

Page 32: Visualization and Analysis of Text

32/41

CMVVis Examples P Topics

Parallel Topics

• Approach:

– Apply topic modeling algorithms to identify latent topics (David Blei, “Latent dirichlet allocation”, 2003)

– Visualize the distribution of proposals based on the topics

Page 33: Visualization and Analysis of Text

33/41

CMVVis Examples P Topics

Topic Modeling

• Given a set of k documents, find n number of topics– Each document then is described as:• (W1 * Topic1, W2 * Topic2, W3 * Topic3, …, Wn * Topicn)

• W1 + W2 + W3 + … + Wn = 1

Topic 1 Topic 2 … Topic NDocument 1 0.12 0.68 … 0.005Document 2 0.3 0.06 … 0.01…Document K

∑ = 1

∑ = 1...

Page 34: Visualization and Analysis of Text

34/41

CMVVis Examples P Topics

Topic Modeling

• A topic is a combination of keywords

Page 35: Visualization and Analysis of Text

35/41

CMVVis Examples P Topics

Parallel Topics

• Based on “Parallel Coordinates”– Each vertical axis is a topic– Each set of horizontal connected lines is a

document

Page 36: Visualization and Analysis of Text

36/41

CMVVis Examples P Topics

Visual Signatures

Single topic Bi-topic

No salient topic

• We identify different signatures for proposals:– Single Topic – focused research– Bi-Topic – Interdisciplinary research– No-Topic – Potentially transformative research

Page 37: Visualization and Analysis of Text

37/41

CMVVis Examples P Topics

Selecting Single Topic Proposals

Topic 1 Topic 2 … Topic N

Document 1 0.12 0.68 … 0.005

Document 2 0.3 0.06 … 0.01

Document K

SD = 0.14SD = 0.06

Max

SD

Page 38: Visualization and Analysis of Text

38/41

CMVVis Examples P Topics

Selecting Multi-Topic ProposalseducationtechnologyInteractive

environment

Page 39: Visualization and Analysis of Text

39/41

CMVVis Examples P Topics

Selecting No-Topic Proposals

Page 40: Visualization and Analysis of Text

40/41

CMVVis Examples P Topics

Recap

• Objective: To discover interdisciplinary and potentially innovative research proposals

• Parallel Topics – data-centric approach

• Approach: To support interactive selection of proposals based on their number of topics

Page 41: Visualization and Analysis of Text

41/41

CMVVis Examples P Topics

Questions and Comments?

Thank you!!

[email protected]://www.cs.tufts.edu/~remco