46
Marc A. Smith Chief Social Scientist Connected Action Consulting Group [email protected] http://www.connectedaction.net http://www.codeplex.com/nodexl project from the Social Media Research Foundation : http:// www.smrfou Charting Collections of Connections in Social Media: Creating Maps and Measures with NodeXL

20111103 con tech2011-marc smith

Embed Size (px)

DESCRIPTION

Slides for talk at ConTech 2011 the International Symposium on Convergence Technology (ConTech 2011) – Smart & Humane World – on November 3rd in Seoul, South Korea.Date: 2011 November 3 (Thurs)Place: COEX Grand Ballroom, Seoul, KoreaOrganized by Advanced Institutes of Convergence Technologies (AICT), Seoul National University (SNU)In Cooperation with Ministry of Knowledge Economy, Ministry of Education, Science and Technology, National Research Foundation of Korea, Graduate School of Convergence Science and Technology (GSCST)

Citation preview

Page 1: 20111103 con tech2011-marc smith

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://www.codeplex.com/nodexl

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Charting Collections of Connections in Social

Media: Creating Maps and

Measures with NodeXL

Page 2: 20111103 con tech2011-marc smith

About Me

Introductions

Marc A. SmithChief Social ScientistConnected Action Consulting Group

[email protected]://www.connectedaction.nethttp://www.codeplex.com/nodexlhttp://www.twitter.com/marc_smithhttp://delicious.com/marc_smith/Paper http://www.flickr.com/photos/marc_smithhttp://www.facebook.com/marc.smith.sociologisthttp://www.linkedin.com/in/marcasmithhttp://www.slideshare.net/Marc_A_Smithhttp://www.smrfoundation.org

Page 3: 20111103 con tech2011-marc smith

Social Media (email, Facebook, Twitter, YouTube, and more) is all about connections

from people to people.

3

Page 4: 20111103 con tech2011-marc smith

Patterns are

left behind

4

Page 5: 20111103 con tech2011-marc smith

There are many kinds of ties….

http://www.flickr.com/photos/stevendepolo/3254238329

Like, Link, Reply, Rate, Review, Favorite, Friend, Follow, Edit, Tag, Comment…

Page 6: 20111103 con tech2011-marc smith

World Wide Web

Each contains one or more social networks

Page 7: 20111103 con tech2011-marc smith

Hubs

Page 8: 20111103 con tech2011-marc smith

Bridges

Page 9: 20111103 con tech2011-marc smith

http://www.flickr.com/photos/library_of_congress/3295494976/sizes/o/in/photostream/

Clusters

Page 10: 20111103 con tech2011-marc smith

http://www.flickr.com/photos/amycgx/3119640267/

Crowds

Page 11: 20111103 con tech2011-marc smith
Page 12: 20111103 con tech2011-marc smith
Page 13: 20111103 con tech2011-marc smith

• Central tenet – Social structure emerges from – the aggregate of relationships (ties) – among members of a population

• Phenomena of interest– Emergence of cliques and clusters – from patterns of relationships– Centrality (core), periphery (isolates), – betweenness

• Methods– Surveys, interviews, observations,

log file analysis, computational analysis of matrices

(Hampton &Wellman, 1999; Paolillo, 2001; Wellman, 2001)

Source: Richards, W. (1986). The NEGOPY network analysis program. Burnaby, BC: Department of Communication, Simon Fraser University. pp.7-16

Social Network Theoryhttp://en.wikipedia.org/wiki/Social_network

Page 14: 20111103 con tech2011-marc smith

SNA 101• Node

– “actor” on which relationships act; 1-mode versus 2-mode networks• Edge

– Relationship connecting nodes; can be directional• Cohesive Sub-Group

– Well-connected group; clique; cluster• Key Metrics

– Centrality (group or individual measure)• Number of direct connections that individuals have with others in the group (usually look at

incoming connections only)• Measure at the individual node or group level

– Cohesion (group measure)• Ease with which a network can connect• Aggregate measure of shortest path between each node pair at network level reflects

average distance– Density (group measure)

• Robustness of the network• Number of connections that exist in the group out of 100% possible

– Betweenness (individual measure)• # shortest paths between each node pair that a node is on• Measure at the individual node level

• Node roles– Peripheral – below average centrality– Central connector – above average centrality– Broker – above average betweenness

E

D

F

A

CB

H

G

I

CD

E

A B D E

Page 15: 20111103 con tech2011-marc smith

http://www.flickr.com/photos/marc_smith/sets/72157622437066929/

Page 16: 20111103 con tech2011-marc smith
Page 17: 20111103 con tech2011-marc smith

Welser, Howard T., Eric Gleave, Danyel Fisher, and Marc Smith. 2007. Visualizing the Signatures of Social Roles in Online Discussion Groups. The Journal of Social Structure. 8(2).

Experts and “Answer People”

Discussion starters, Topic setters

Discussion people, Topic setters

Page 18: 20111103 con tech2011-marc smith

Now Available

Page 19: 20111103 con tech2011-marc smith

Analogy: Clusters Are OccludedHard to count nodes, clusters

Page 20: 20111103 con tech2011-marc smith

Separate Clusters Are More Comprehensible

Page 21: 20111103 con tech2011-marc smith

Twitter Network for “Microsoft Research”*BEFORE*

Page 22: 20111103 con tech2011-marc smith

Twitter Network for “Microsoft Research”*AFTER*

Page 23: 20111103 con tech2011-marc smith

Goal: Make SNA easier

• Existing Social Network Tools are challenging for many novice users

• Tools like Excel are widely used• Leveraging a spreadsheet as a host for SNA

lowers barriers to network data analysis and display

Page 25: 20111103 con tech2011-marc smith

Social Media Research Foundationhttp://smrfoundation.org

Page 26: 20111103 con tech2011-marc smith

What we are trying to do:Open Tools, Open Data, Open Scholarship

• Build the “Firefox of GraphML” – open tools for collecting and visualizing social media data

• Connect users to network analysis – make network charts as easy as making a pie chart

• Connect researchers to social media data sources• Archive: Be the “Allen Very Large Telescope Array”

for Social Media data – coordinate and aggregate the results of many user’s data collection and analysis

• Create open access research papers & findings• Make “collections of connections” easy for users to

manage

Page 27: 20111103 con tech2011-marc smith

What we have done: Open Tools

• NodeXL• Data providers (“spigots”)

– ThreadMill Message Board– Exchange Enterprise Email– Voson Hyperlink– SharePoint– Facebook– Twitter– YouTube– Flickr

Page 28: 20111103 con tech2011-marc smith

What we have done: Open Data

• NodeXLGraphGallery.org– User generated collection of

network graphs, datasets and annotations

– Collective repository for the research community

– Published collections of data from a range of social media data sources to help students and researchers connect with data of interest and relevance

Page 29: 20111103 con tech2011-marc smith

What we have done: Open Scholarship• Webshop 2011: NSF, Google, Intel

– 4 Days, 45 Students, 20 Speakers– Great tweets!

• Webshop 2012!– Expand numbers of students and add a day– Support speakers and student workers

• Workshops: Purdue, Maryland, Cape Town, Yeungnam

Page 30: 20111103 con tech2011-marc smith

What we have done: Open Scholarship

Page 31: 20111103 con tech2011-marc smith

Facebook networkshttp://www.connectedaction.net/2010/04/25/bernie-hogans-facebook-social-network-data-provider-and-visualization-toolkit/

Page 32: 20111103 con tech2011-marc smith

Twitter Networks: connections among the people who tweeted the term “Kpop” on 24 October 2011

Page 33: 20111103 con tech2011-marc smith

NodeXL data import sources

Page 34: 20111103 con tech2011-marc smith

Example NodeXL data importer for Twitter

Page 35: 20111103 con tech2011-marc smith

NodeXL imports “edges” from social media data sources

Page 36: 20111103 con tech2011-marc smith

NodeXL Automation makes analysis simple and fast

Page 37: 20111103 con tech2011-marc smith

NodeXL Network Metrics

Page 38: 20111103 con tech2011-marc smith

NodeXL simplifies mapping data attributes to display attributes

Page 39: 20111103 con tech2011-marc smith

NodeXL displays subgraph images along with network metadata

Page 40: 20111103 con tech2011-marc smith

NodeXL enables filtering of networks

Page 41: 20111103 con tech2011-marc smith

NodeXL Generates Overall Network Metrics

Page 42: 20111103 con tech2011-marc smith

What we want to do: (Build the tools to) map the social web• Move NodeXL to the web:

– Node for Google Doc Spreadsheets!– WebGL Canvas

• Connect to more data sources of interest:– RDF, MediaWikis, Gmail, NYT, Citation Networks

• Solve hard network manipulation UI problems:– Modal transform, Time series, Automated layouts

• Grow and maintain archives of social media network data sets for research use.

• Improve network science education:– Workshops on social media network analysis– Live lectures and presentations– Videos and training materials

Page 43: 20111103 con tech2011-marc smith

Work ItemsAutofill Group AttributeMerge Edges by AttributeModal TransformMerge WorkbooksAutomated Dynamic Filters: Time Series Analysis, contrastCaptions and LegendsUpload to Graph Gallery++: captions, workbookGraph Gallery++

User Accounts, Reporting, RSS Feeds, Network Visualization Web Canvas

Import: RDF, Wiki, SharePoint, Keyword networks from textMetrics: Triad CensusLayouts:

Force Atlas 2, Lin Log, “Bakshy Plots”, Quality MeasuresQuery-by-example search for network structures

Page 44: 20111103 con tech2011-marc smith

How you can help

• Sponsor a feature• Sponsor Webshop 2012• Sponsor a student• Schedule training• Sponsor the foundation• Donate your money, code, computation, storage,

bandwidth, data or employee’s time• Help promote the work of the Social Media

Research Foundation

Page 45: 20111103 con tech2011-marc smith

Contact:

Marc A. SmithChief Social ScientistConnected Action Consulting Group

[email protected]://www.connectedaction.nethttp://www.codeplex.com/nodexlhttp://www.twitter.com/marc_smithhttp://delicious.com/marc_smith/Paper http://www.flickr.com/photos/marc_smithhttp://www.facebook.com/marc.smith.sociologisthttp://www.linkedin.com/in/marcasmithhttp://www.slideshare.net/Marc_A_Smithhttp://www.smrfoundation.org

Page 46: 20111103 con tech2011-marc smith

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://www.codeplex.com/nodexl

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Charting Collections of Connections in Social

Media: Creating Maps and

Measures with NodeXL