26

20120622 web sci12-won-marc smith-semantic and social network analysis of …

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: 20120622 web sci12-won-marc smith-semantic and social network analysis of …
Page 2: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://nodexl.codeplex.com/

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Semantic and Social Network

Analysis of Social Media with

NodeXL

Page 3: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Social Media Research Foundationhttp://smrfoundation.org

Page 5: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

About Me

Introductions

Marc A. SmithChief Social ScientistConnected Action Consulting Group

[email protected]://www.connectedaction.nethttp://www.codeplex.com/nodexlhttp://www.twitter.com/marc_smithhttp://delicious.com/marc_smith/Paper http://www.flickr.com/photos/marc_smithhttp://www.facebook.com/marc.smith.sociologisthttp://www.linkedin.com/in/marcasmithhttp://www.slideshare.net/Marc_A_Smithhttp://www.smrfoundation.org

Page 6: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Like MSPaint™ for graphs

Page 7: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

What we are trying to do:Open Tools, Open Data, Open Scholarship

• Build the “Firefox of GraphML” – open tools for collecting and visualizing social media data

• Connect users to network analysis – make network charts as easy as making a pie chart

• Connect researchers to social media data sources• Archive: Be the “Allen Very Large Telescope Array”

for Social Media data – coordinate and aggregate the results of many user’s data collection and analysis

• Create open access research papers & findings• Make “collections of connections” easy for users to

manage

Page 8: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

What we have done: Open Tools

• NodeXL• Data providers (“spigots”)– ThreadMill Message Board– Exchange Enterprise Email– Voson Hyperlink– SharePoint– Facebook– Twitter– YouTube– Flickr

Page 9: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

What we have done: Open Data

• NodeXLGraphGallery.org– User generated collection of

network graphs, datasets and annotations

– Collective repository for the research community

– Published collections of data from a range of social media data sources to help students and researchers connect with data of interest and relevance

Page 10: 20120622 web sci12-won-marc smith-semantic and social network analysis of …
Page 11: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Now Available

Page 12: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Group-in-a-box Layout

Page 14: 20120622 web sci12-won-marc smith-semantic and social network analysis of …
Page 15: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

This graph represents a directed network of 1,360 Twitter users

whose recent tweets contained "contraceptive OR contraception". The network was obtained

on Friday, 08 June 2012 at 13:22 UTC. There is

an edge for each follows relationship. There is an edge for each "replies-

to" relationship in a tweet. There is an edge

for each "mentions" relationship in a tweet.

There is a self-loop edge for each tweet that is not

a "replies-to" or "mentions". The tweets were made over the 2-

day period from Thursday, 07 June 2012 at 18:46 UTC to Friday, 08 June 2012 at 13:06

UTC. The graph's vertices were grouped by cluster using the Clauset-Newman-Moore cluster

algorithm. The edge colors are based on

relationship values. The vertex sizes are based on

each user’s number of followers. Table 1

reports the summary network metrics that describe the graph.

Page 16: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Summary network metrics

Page 17: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

The Vertices spreadsheet lists users who contributed a tweet containing the terms “contraception OR

contraceptives” over two days in early June 2012. Users are ranked by their computed betweenness centrality within the network of follows, replies, and mentions edges. The top 10 vertices, ranked by betweenness centrality are the accounts

at the center of the network. These include: @thinkprogress, @gatesfoundation, @SandraFluke,

@maleeek, @Change, @foxandfriends, @melindagates, @AshleyJudd, @cnalive, and @SOHLTC.

Page 18: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

NodeXL calculates network metrics and

word pairs

Page 19: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Contrasting groups

Page 20: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

The Content summary spreadsheet displays the most

frequently used URLs, hashtags, and user names within the

network as a whole and within each calculated sub-group.

Page 21: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Contrast hashtags in Groups 2 & 4

Page 22: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Contrasting URL references

Page 23: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Word Pair Contrasts

Page 24: 20120622 web sci12-won-marc smith-semantic and social network analysis of …
Page 25: 20120622 web sci12-won-marc smith-semantic and social network analysis of …

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://nodexl.codeplex.com/

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Semantic and Social Network

Analysis of Social Media with

NodeXL

Page 26: 20120622 web sci12-won-marc smith-semantic and social network analysis of …