China Williams, Daniel Whitehead · info.ils.indiana.edu/~dingying/Teaching/S636... · Social Tagging...

Social Tagging Introduction
- Upper Tag Ontology (UTO)
- Crawled from Flickr, Del.icio.us, YouTube
- Purpose of the data as a whole
- Our uses: queries

Our Data
- Small subset of UTO data
- Del.icio.us tagging information
- ~60 unique tagging events
- RDF format

Data Model – Hypothetical

[Diagram]
Nodes: OBJECT, SOURCE, TAG, TAGGER, TAGGING, COMMENT, DATE (data type: date), VOTE (data type: nonnegative integer)
Relations: has_Tag / is_Tag_of, has_Object / is_Object_of, has_Tagger / is_Tagger_of, has_Source / is_Source_of, has_Comment / is_Comment_of, has_Related_Tag
Legend: Classes, Data Properties, Inverse Relations, Domain Relations

Data Model – UTO Observed

[Diagram]
Classes and types: TAGGING, OBJECT (url), TAG (url), TAGGER (string), COMMENT (string), DATE (date), VOTE (nonnegative integer), SOURCE (url)
Predicates: has_object, has_tag, has_tagger, has_comment, has_Source

Sample Tagging Event

Use Scenarios
- Given a tag, return all resources associated with that tag, ordered by vote.
- Given a resource, return all tags associated with it, ordered by tagger.
- Given any tagger, return all resources that they have tagged, with the tags and date.
- Given any vote as an integer, return all resources with a vote greater than that value.
- Given a resource, return the resource and all comments, ordered by date.
- Return all resources that have comments, to see what people are talking about.
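The first scenario might be written along the following lines. This is only a sketch: it reuses the uto: predicates that appear in the deck's own queries (has_tag, has_object, has_vote), while the tag del:semanticweb is a hypothetical placeholder and the cast assumes votes are stored as literals that xsd:integer() can convert.

```sparql
# Sketch: given a tag, return the resources carrying it, highest vote first.
# Assumptions: del:semanticweb is a placeholder tag URL; uto:has_vote holds
# a literal that xsd:integer() can cast.
PREFIX uto: <http://uto.deri.at/>
PREFIX del: <http://del.icio.us/tag/>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

SELECT DISTINCT ?resource ?vote
WHERE {
  ?x uto:has_tag del:semanticweb .
  ?x uto:has_object ?resource .
  ?x uto:has_vote ?vote .
}
ORDER BY DESC(xsd:integer(?vote))
```

The vote-threshold scenario would likely follow the same shape, with a condition such as FILTER (xsd:integer(?vote) > 10) added inside the braces, where 10 stands in for the user-supplied integer.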

Tagger Query (“dielsy”)

# Given any tagger, return all resources that they have tagged, with the tags and date.
PREFIX uto: <http://uto.deri.at/>

SELECT DISTINCT ?has_tagger ?resource ?tag ?date
WHERE {
  ?x uto:has_tagger "dielsy" .
  ?x uto:has_object ?resource .
  ?x uto:has_tagger ?has_tagger .
  ?x uto:has_tag ?tag .
  ?x uto:has_date ?date
}
ORDER BY DESC(?date)

Tagger Query Results:

Comment Query

# Return all resources that have comments.
PREFIX uto: <http://uto.deri.at/>
PREFIX del: <http://del.icio.us/tag/>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

SELECT ?resource ?has_comment ?date ?vote
WHERE {
  ?x uto:has_object ?resource .
  ?x uto:has_comment ?has_comment .
  ?x uto:has_date ?date .
  ?x uto:has_vote ?vote .
  FILTER (?has_comment != "")
}

Comment Query Results:

Obstacles Encountered
- Shallow data set – incomplete picture of UTO
  - No instances of inverse relationship predicates, etc.
  - Assumption that it is UTO and not represented some other way
- Date format: “Jun 08”
  - Not easily parsed (not ISO-8601)
  - We couldn’t do a date-range query
- Short data set
  - Testing of queries required manipulation of data
  - Creating “dummy” entries
  - Ordering, distinct, etc.
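For comparison, a date-range query could look roughly like the sketch below if the dates were typed literals rather than strings such as “Jun 08”. It assumes uto:has_date holds xsd:dateTime values, which is not what the crawled data contained, and the two bounds are placeholder values. xsd:dateTime is used because it is the date type the SPARQL operator mapping is required to compare; support for comparing plain xsd:date values varies by engine.

```sparql
# Sketch: tagging events inside a date window.
# Assumption: uto:has_date holds xsd:dateTime literals (the observed data
# did not); the bounds below are placeholders.
PREFIX uto: <http://uto.deri.at/>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

SELECT ?resource ?date
WHERE {
  ?x uto:has_object ?resource .
  ?x uto:has_date ?date .
  FILTER (?date >= "2008-06-01T00:00:00Z"^^xsd:dateTime &&
          ?date <  "2008-07-01T00:00:00Z"^^xsd:dateTime)
}
ORDER BY ?date
```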

Going Further
- Widening query abilities / query refinement
  - Adding in other data sources (Flickr, YouTube, etc.)
- Integrating into MySQL database
- Web interface
- Automatic generation of queries
- Crawling more data / larger data set
- Information visualization / analysis tools

References

Ding, Ying; Toma, Ioan; Kang, Sin-Jae; Zhang, Zhixiong; Fried, Michael (2008). Mediating and Analyzing Social Data. OTM Conferences.

Ding, Ying; Toma, Ioan; Kang, Sin-Jae; Fried, Michael; Yan, Zhixian (2008). Data Mediation and Interoperation in Social Web: Modeling, Crawling and Integrating Social Tagging Data. Workshop on Social Web Search and Mining (WWW2008).
