Exploring ENRON Email with NetLens
Catherine Plaisant, Benjamin B. Bederson, Hyunmo Kang, Bongshin Lee
Human-Computer Interaction Laboratory, University of Maryland
Joint Institute for Knowledge Discovery

Exploring ENRON Email with NetLens · TreePlus - Visualizing Graphs as Trees |Plant a seed and watch it grow |Faster, more accurate, preferred over traditional graphs for tasks that


Page 1

Exploring ENRON Email with NetLens

Catherine Plaisant, Benjamin B. Bederson, Hyunmo Kang, Bongshin Lee

Human-Computer Interaction Laboratory

University of Maryland

Joint Institute for Knowledge Discovery

Page 2

Our research focus: Alternative UIs to Graph Visualization

how to avoid this…

Node-link diagrams have many limitations: they are hard to read, may show clusters but not much else, and do not scale well.

Page 3

NetLens: Iterative Exploration of Content-Actor Network Data

User interface for exploratory search

Generalizable to a variety of data

Provide consistent interface

Easy to learn and use

Kang et al., Proc. of Visual Analytics Science and Technology Conference (VAST 06)

Kang et al., Poster/Demo at Joint Conference on Digital Libraries, 2006

Page 4

NetLens: Iterative Exploration of Content-Actor Network Data

Paired networks of Content and Actors, e.g.:

Paired networks of Papers and Authors
- Papers refer to other papers
- Authors have advisors

Paired networks of Emails and People
- Emails respond to or include emails
- People have assistants who send email for them

Paired networks of Products and Companies
- Products replace or integrate products
- Companies are bought or merge
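The paired-network idea can be illustrated with a small data structure. This is only a sketch under stated assumptions: the class name `ContentActorNetwork`, its fields, and the sample ids are hypothetical and are not taken from NetLens itself. It shows one cross relation between content and actors, one self-relation on each side, and the back-and-forth selection that iterative exploration relies on.

```python
from dataclasses import dataclass, field


@dataclass
class ContentActorNetwork:
    """Hypothetical model: two entity sets, one cross relation, two self relations."""
    # Cross relation: content id -> actor ids (e.g. email -> people involved)
    content_to_actors: dict[str, set[str]] = field(default_factory=dict)
    # Self relation on content (e.g. email responds to email)
    content_links: dict[str, set[str]] = field(default_factory=dict)
    # Self relation on actors (e.g. person has an assistant)
    actor_links: dict[str, set[str]] = field(default_factory=dict)

    def actors_of(self, contents):
        """Actors attached to any of the selected content items."""
        found = set()
        for content_id in contents:
            found |= self.content_to_actors.get(content_id, set())
        return found

    def contents_of(self, actors):
        """Content items attached to any of the selected actors."""
        wanted = set(actors)
        return {c for c, a in self.content_to_actors.items() if a & wanted}


# Made-up sample data for illustration only.
net = ContentActorNetwork(
    content_to_actors={"e1": {"ann", "bob"}, "e2": {"bob", "cy"}, "e3": {"dee"}},
    content_links={"e2": {"e1"}},   # e2 responds to e1
    actor_links={"ann": {"bob"}},   # bob sends email for ann
)

step1 = net.actors_of({"e1"})    # people involved in email e1
step2 = net.contents_of(step1)   # every email involving those people
```

Alternating `actors_of` and `contents_of` mirrors moving between the content side and the actor side, which is why a single pair of entity types can be swapped for papers and authors or products and companies.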

Page 5

Content-actor model

Entity E1 and Entity E2 are connected by a Relationship; each entity also has a Self-relationship.

Examples for scientific papers:

Page 6

Toward SCALABILITY

Total Enron email (non-duplicate): 249,760 emails, 87,673 people

Page 7

Email: overview by year

People (addresses): overview by domain

Page 8

Alternative overviews: emails by day of the week, grouped by year

People by connectance magnitude (low, medium, high)
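The overviews above amount to grouping and counting. A minimal sketch of that aggregation follows, assuming each email record carries a sender address and a sent date; the sample records and variable names are invented for illustration and do not come from the Enron data or from NetLens.

```python
from collections import Counter
from datetime import date

DAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
        "Friday", "Saturday", "Sunday"]

# Hypothetical sample records: (sender address, date sent)
emails = [
    ("alice@enron.com", date(2000, 5, 1)),
    ("bob@enron.com", date(2001, 3, 12)),
    ("carol@example.org", date(2001, 3, 13)),
    ("alice@enron.com", date(2001, 10, 19)),
]

# Email overview by year
by_year = Counter(sent.year for _, sent in emails)

# Alternative overview: day of the week, grouped by year
by_year_and_weekday = Counter(
    (sent.year, DAYS[sent.weekday()]) for _, sent in emails
)

# People (addresses) overview by domain; each address counts once
people = {address for address, _ in emails}
by_domain = Counter(address.split("@", 1)[1] for address in people)
```

Each counter maps a bin to its size, which is the information a bar-chart overview needs.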

Page 9

Multiple email search capabilities

1- Keyword Search: here, a search on “California”

2- Similarity Search: find emails similar to one or more selected emails

Result set loaded in “My list”

(with Doug Oard’s team)
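The two search modes above can be sketched as follows. The slides do not say how similarity is computed, so this example swaps in a plain bag-of-words cosine similarity as an assumption; the function names, the `top_n` parameter, and the sample messages are all hypothetical.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Lowercased word counts for one email body."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def keyword_search(emails, keyword):
    """Ids of emails whose body contains the keyword as a whole word."""
    wanted = keyword.lower()
    return [eid for eid, body in emails.items() if wanted in vectorize(body)]


def similar_to(emails, selected_ids, top_n=2):
    """Ids of the emails most similar to the combined selected emails."""
    query = Counter()
    for eid in selected_ids:
        query += vectorize(emails[eid])
    scored = [(cosine(query, vectorize(body)), eid)
              for eid, body in emails.items() if eid not in selected_ids]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [eid for score, eid in scored[:top_n] if score > 0]


# Made-up messages for illustration only.
emails = {
    "m1": "California power prices are rising",
    "m2": "Lunch on Friday?",
    "m3": "Rising power prices in California markets",
    "m4": "Power trading desk schedule",
}

keyword_hits = keyword_search(emails, "California")
my_list = similar_to(emails, {"m1"})   # result set, as loaded in "My list"
```

A production system would more likely weight terms (for example with TF-IDF) and use an index rather than scanning every message; the sketch only shows the shape of the two queries.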

Page 10

Social network analysis:

- Number of neighbors
- Connectance
- Centrality
- Average Path Length

Here, people with high connectance are selected

With Jen Golbeck

Page 11

Social network analysis:

- Number of neighbors
- Connectance
- Centrality
- Average Path Length

Here, people with high connectance are selected

With Jen Golbeck

Explanations of the meaning of the attributes
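Some of the listed attributes can be computed directly from who-emailed-whom pairs. The sketch below is an assumption-laden illustration, not the NetLens computation: it treats the network as undirected, uses degree centrality as one of several possible centrality definitions, and leaves out connectance because the slides do not define it. All function names and sample pairs are hypothetical.

```python
from collections import deque


def build_graph(pairs):
    """Undirected adjacency sets from (person, person) pairs."""
    graph = {}
    for a, b in pairs:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def number_of_neighbors(graph, person):
    return len(graph[person])


def degree_centrality(graph, person):
    """Fraction of all other people that this person is linked to."""
    n = len(graph)
    return len(graph[person]) / (n - 1) if n > 1 else 0.0


def average_path_length(graph, person):
    """Mean shortest-path distance from person to everyone reachable."""
    dist = {person: 0}
    queue = deque([person])
    while queue:                      # breadth-first search
        current = queue.popleft()
        for neighbor in graph[current]:
            if neighbor not in dist:
                dist[neighbor] = dist[current] + 1
                queue.append(neighbor)
    others = [d for p, d in dist.items() if p != person]
    return sum(others) / len(others) if others else 0.0


# Made-up correspondence pairs for illustration only.
graph = build_graph([("ann", "bob"), ("ann", "cy"),
                     ("ann", "dee"), ("dee", "eve")])

ann_neighbors = number_of_neighbors(graph, "ann")
ann_centrality = degree_centrality(graph, "ann")
ann_avg_path = average_path_length(graph, "ann")
eve_avg_path = average_path_length(graph, "eve")
```

Precomputing such values per person is what makes it possible to filter or bin people by them in an overview.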

Page 12

People bios

Using signatures and directory info

with Jen Golbeck

Page 13

Integrated Phone calls

Replay

Separate conversations

Direct access to mentions of: subject, names, keywords

(with Carol Espy’s team)

Page 14

Thread Summaries

- List of emails in same thread
- Access to thread
- Access to thread summary

With Bonnie Dorr and Doug Oard’s teams

Page 15

TreePlus to browse a subset of network connections

Page 16

Page 17

TreePlus - Visualizing Graphs as Trees

Plant a seed and watch it grow

Faster, more accurate, preferred over traditional graphs for tasks that involve reading and exploration of connections

To show hidden graph structure:
- Highlight and preview of adjacent nodes
- Animated change of tree structure
- Visual hints about graph structure

B. Lee, C.S. Parr, C. Plaisant, B.B. Bederson, V.D. Veksler, W.D. Gray, C. Kotfila (2006). TreePlus: Interactive Exploration of Networks with Enhanced Tree Layouts. To appear in TVCG Special Issue on Visual Analytics.

B. Lee, C.S. Parr, C. Plaisant, B.B. Bederson (2005). Visualizing Graphs as Trees: Plant a seed and watch it grow. Proceedings of GD 2005 (poster), LNCS, pp. 516-518.
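The "plant a seed and watch it grow" idea can be approximated by growing a tree outward from a chosen node. The sketch below assumes a breadth-first expansion with a depth limit and reports the graph links that the tree leaves out; it is an illustration only, and the actual TreePlus layout and interaction are described in the papers cited above. The function names and sample graph are hypothetical.

```python
from collections import deque


def grow_tree(graph, seed, max_depth):
    """Breadth-first tree from seed; returns a node -> parent mapping."""
    parent = {seed: None}
    depth = {seed: 0}
    queue = deque([seed])
    while queue:
        node = queue.popleft()
        if depth[node] == max_depth:
            continue                      # do not expand past the limit
        for neighbor in sorted(graph[node]):
            if neighbor not in parent:
                parent[neighbor] = node
                depth[neighbor] = depth[node] + 1
                queue.append(neighbor)
    return parent


def hidden_links(graph, parent):
    """Graph links between shown nodes that are not tree edges."""
    hidden = set()
    for node in parent:
        for neighbor in graph[node]:
            in_tree = neighbor in parent
            is_tree_edge = parent[node] == neighbor or parent.get(neighbor) == node
            if in_tree and not is_tree_edge:
                hidden.add(frozenset((node, neighbor)))
    return hidden


# Made-up undirected graph for illustration only.
graph = {
    "a": {"b", "c"},
    "b": {"a", "c", "d"},
    "c": {"a", "b"},
    "d": {"b"},
}

tree = grow_tree(graph, "a", max_depth=2)
extra = hidden_links(graph, tree)
shallow = grow_tree(graph, "a", max_depth=1)
```

The `extra` set is the kind of information that visual hints and adjacent-node previews would need to surface, since a tree layout alone cannot show those links.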

Page 18

Generalization to other datasets

e.g. NetLens for Scientific Publications (Papers and Authors)

Page 19

User evaluation

Heuristic review at NIST (5 people, self-trained with video)

Usability study: 9 people, training, debriefing

Other improvements
- Improved feedback
- Improvement of flow management (+++)
- Addition of My List
- Adaptive explanations of views
- Video training
- Documentation of source / processing of variables

Page 20

Implementation

C# (using Piccolo toolkit)

MS Access database

NetLens component code available on request

Page 21

Conclusions - Future Directions

Conclusions
- Simple content-actor model helpful
- Powerful yet simple
- Training about flow behavior

Continue integration with other JIKD data, e.g. entity resolution

Evaluation (case studies of analysis)

Needs for Proto Tool
- Facilitate code customization for different applications
- Flexible entity switching (to handle any choice of pairs)
- Usability

Page 22

Thank You

[email protected] (301)
[email protected] (301) 405-2764

NetLens: www.cs.umd.edu/hcil/netlens
TreePlus: www.cs.umd.edu/hcil/treeplus

Papers and video demonstrations available from website. Source code available on request.

Page 23

OTHER relevant HCIL projects

Page 24

Temporal Data (Categorical): PatternFinder for Patient History Search

Fails, Karlson, Shahamat & Shneiderman, VAST 2006

Page 25

Systematic & Flexible Network Exploration with SocialAction

Clustering shows grouping

Abstraction reveals relationships

Perer & Shneiderman, InfoVis 2006

Page 26

Network Visualization with Semantic Substrates

• Meaningful layout of nodes

• User-controlled visibility of links

Shneiderman & Aris, InfoVis 2006