57
NodeXL Network overview, discovery and exploration for Microsoft Excel 2007 http://www.codeplex.com/nodexl Dan Fay ( Microsoft Research - Redmond ) Cody Dunne ( U Maryland ) Marc Smith ( Telligent ) Vladimir Barash ( MSR Silicon Valley / Cornell ) Tony Capone ( Microsoft Research - Redmond ) Natasa Milic-Frayling ( Microsoft Research - Cambridge ) Eduarda Mendes Rodrigues ( Microsoft Research - Cambridge ) Eric Gleave ( U Washington ) Adam Perer ( U Maryland ) Ben Shneiderman ( U Maryland )

2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Embed Size (px)

DESCRIPTION

Overview of the NodeXL project (Network Overview, Discovery and Exploration) that adds social network metrics and visualization features to Excel 2007. Contains updated images from version .84 of the NodeXL project.

Citation preview

Page 1: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXLNetwork overview, discovery and exploration

for Microsoft Excel 2007http://www.codeplex.com/nodexl

Dan Fay (Microsoft Research - Redmond)Cody Dunne (U Maryland)

Marc Smith (Telligent)Vladimir Barash (MSR Silicon Valley/Cornell)

Tony Capone (Microsoft Research - Redmond)Natasa Milic-Frayling (Microsoft Research - Cambridge)

Eduarda Mendes Rodrigues (Microsoft Research - Cambridge)Eric Gleave (U Washington)Adam Perer (U Maryland)

Ben Shneiderman (U Maryland)

Page 2: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

The NodeXL Team

Page 3: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Problem: No network chart in Excel

Page 4: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Problem: No network chart in

Excel

Page 5: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Network analysis and visualization tool

• Cyclic Graph data structures have limited support in existing Office tools

• Network analysis is of growing importance in academic, commercial, and Internet social media contexts

• Existing network analysis tools have command line interfaces or demand steep learning curves

• Many network data sets already live in Excel!

Page 6: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Goal: Make SNA easier

• Existing Social Network Tools are challenging for many novice users

• Tools like Excel are widely used• Leveraging a spreadsheet as a host for SNA

lowers barriers to network data analysis and display

Page 7: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Social Network Analysis ToolkitTools to support the study of the social network structure of

social media and other directed graph structures

Sociologist

“What are the structures of

communication in scientific

discussions?”

User Experience Information Visualization

“What are the best UI/UX workflows for network

analysis tools?”

Computer Scientist Algorithmicist for Social

Network Measures

“What are the measures and algorithms needed for understanding networks?”

Page 8: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

8

Page 9: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

The Ties that Blind?

9

Page 10: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Reply-To NetworkNetwork at distance 2 for the most prolific author of the microsoft.public.internetexplorer.general newsgroup

The Ties that Blind?

Page 12: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Pajek without modification can sometimes reveal structures of great interest.

The Ties that Blind?

Page 13: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Two “answer people” with an emerging 3rd.

Mapping Newsgroup Social Ties

Microsoft.public.windowsxp.server.general13

Page 14: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

14

Page 15: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

• Answer person– Outward ties to local isolates– Relative absence of triangles– Few intense ties

• Reply Magnet– Ties from local isolates often

inward only– Sparse, few triangles– Few intense ties

Distinguishing attributes:

15

Page 16: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Distinguishing attributes:• Answer person

– Outward ties to local isolates– Relative absence of triangles– Few intense ties

• Discussion person– Ties from local isolates often

inward only– Dense, many triangles– Numerous intense ties

16

Page 17: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Clear and consistent signaturesof an “Answer Person”

• Light touch to numerous threads initiated by someone else

• Most ties are outward to local isolates• Many more ties to small fish than big fish

1

10

100

0 1 2 4 8 16 32 64

17

Page 18: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Roles Project• Using Netscan

data to derive social roles in Usenet

• Next steps: quantify & explore in more depth

Answer Person, microsoft.public.windows.server.general

Discussion, rec.kites

Flame, alt.flame

Social Support, alt.support.divorce

PUBLISHED in HICSS, JCMC, JoSS, IEEE Internet Communications (special issue on Social Networks) 18

Page 19: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Network Overview, Discovery and Exploration for Excel• Leverage spreadsheet for storage of edge and

vertex data

Page 20: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

The NodeXL project is

Available via the CodePlex

Open Source Project

Hosting Site:

http://www.codeplex.com/nodexl

Page 21: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL workflow

data importation > processing > calculation > refinement > a network graph that tells a useful story

These steps include:

• Import data from several sources and file formats• Scrub data: Merge duplicate edges• Calculate network metrics• Insert sub-graph images• Auto-fill columns (and map data to display attributes):

- Set shape, color, opacity, size, and label/tooltip• Create clusters• Show graph• Read workbook• Adjust layout• Layout Again• Dynamic Filters – selectively hide edges and nodes

Page 22: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Import data from multiple sources:

• Multiple network “spigots” provide edge lists from several common sources and data formats.

Page 23: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Social media platforms are A source of multiple Social network data sets:

“Friends”“Replies”“Follows”“Comments”“Reads”“Co-edits”“Co-mentions”“Hybrids”

Page 25: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Export data to alternate file formats:

Prepare data for analysis

Page 26: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Import edges from other spreadsheets

• Map data columns from existing spreadsheets

Page 27: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Merge Duplicate Edges (if any)

• Aggregate duplicate edges and add a “Tie Strength Column” to store the count of “duplicates” (edges could be from multiple time slices).

Page 28: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Calculate Network Analytics and Metrics

• Starter library of basic network measures

• Users may unselect resource intensive measures

Page 29: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Insert network sub-graph images

• Create “ego-centric” networks for each node in the network

• Select number of degrees out to include

Page 30: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL:

Display nodes with subgraph images sorted by network attributes using Excel Data|Sort

Page 31: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Get reports of global network metrics

Metric Value

Graph Type Directed

Unique Edges 7,852

Edges With Duplicates 0

Total Edges 7,852

Self-Loops 10

Vertices 174

Graph Density 0.260514259

Page 32: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Display whole graph

• Toggle display of whole graph display pane with Show/Hide Graph Pane

Page 33: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Create a new whole graph display

• Select “Read Workbook” to load the graph into the Display Pane.

• The title “Document Actions” is imposed by Excel

Page 34: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Viewing the whole graph

Page 35: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Using Dynamic Filters to simplify the graph

• Each data column (including dates) associated with an edge or vertex is exposed with a slider filter.

• Filtered nodes and edges turn gray or become invisible

Page 36: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Apply dynamic filters to the data

Page 37: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Map data to display attributes

• Map Edge and Vertex attributes to size, width, color, opacity, and shape

Page 38: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Decorated Network Graphs

Page 39: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: filtered, decorated, labeled networks

Page 40: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Clustered networks

Page 41: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Add URLS to Right-click menu of Nodes

Page 42: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Filtered clusters

Page 43: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Right click the canvass to control attributes of selected nodes

Page 44: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Import social networks from email

Page 45: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL: Import social networks from email

Page 46: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007
Page 47: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Systematic Yet Flexible Network Analysis Tasks1. Overall network metrics (e.g. number of nodes, number of

edges, density, diameter)2. Node rankings (e.g. degree, betweenness, closeness

centrality)3. Edge rankings (e.g. weight, betweenness centrality)4. Node rankings in pairs (e.g. degree vs. betweenness, plotted

on a scatter gram)5. Edge rankings in pairs6. Cohesive subgroups (e.g. finding communities in networks)7. Multiplexity (e.g. analyzing comparisons between different

edge types, such as friends vs. enemies)Shneiderman, Perer, Dunne

Page 48: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Micro ---- Scale ---- Macro

Micr

o ---

- Tim

e ---

- Mac

ro

Whole Graph

/ All Time

Sub Graph /

Narrow Time Slice

Sub Graph /

All Time

Whole Graph

/ Narrow

Time Slice

Filtering Network Diagrams

Page 49: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Random Layout

Page 50: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Fruchterman-Reingold Layout(Dense)

Page 51: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Fruchterman-Reingold Layout(Loose)

Page 52: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

Random Layout (Decorated)

Page 53: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

X = In-degree, Y = Out-degree

Page 54: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL Next Steps

• Enhanced layout controls– Smart selection of nodes

• Clustering and composite nodes– Add/remove a node to/from a cluster– Add/remove a node to/from a composite

• Add social network data sources:– Twitter, YouTube, Facebook, Outlook, Messenger,

etc.

Page 55: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL Partnerships and community

• University of Maryland• Northwestern University• Ohio University• Stanford University• University of Pennsylvania

7,000 + downloads on Codeplex

Page 56: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXL User tasks and goals

Page 57: 2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007

NodeXLNetwork overview, discovery and exploration

for Microsoft Excel 2007http://www.codeplex.com/nodexl

Dan Fay (Microsoft Research - Redmond)Cody Dunne (U Maryland)

Marc Smith (Telligent)Vladimir Barash (MSR Silicon Valley/Cornell)

Tony Capone (Microsoft Research - Redmond)Natasa Milic-Frayling (Microsoft Research - Cambridge)

Eduarda Mendes Rodrigues (Microsoft Research - Cambridge)Eric Gleave (U Washington)Adam Perer (U Maryland)

Ben Shneiderman (U Maryland)