Linked Data on the Web Olaf Hartig http://olafhartig.de/foaf.rdf#olaf Database and Information Systems Research Group Humboldt-Universität zu Berlin

A comprehensive overview of Linked Data, including an introduction, technical foundations, consuming Linked Data, and open research issues.


Outline

From a Web of Documents to a Web of Data

Technical Foundations of Linked Data

Consuming Linked Data

Current Research Issues

The Traditional Web

Traditional Web = Internet + Docs + Links

● HTML as shared content format
● HTTP to access documents on the Web
● URLs
  ● Globally unique identifiers for documents
  ● Retrieval mechanism
● Hyperlinks
● Single global information space

So what is the problem?

● Web content is only loosely structured
● Difficult for applications to do smart things

Solution:
● Increase the structure of Web content
● Publish data

But wait… don't we do that already?

The Traditional Web

● Content providers offer access via Web APIs
● Mashups combine this data

Shortcomings:
● APIs are proprietary
● Mashups are based on a fixed set of data sources
● You cannot set hyperlinks between data objects

● Use URIs as names for things
● Use HTTP URIs so that people can look up those names.
● When someone looks up a URI, provide useful information.
● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

[Figure: the data source "My Movie DB" with the URIs http://mymovie.db/movie0362, http://mymovie.db/movie1342, http://mymovie.db/movie2449 and http://mymovie.db/movie5112; a look-up ("?") of http://mymovie.db/movie2449; a second data source with the URIs http://geo.db/cityCJ, http://geo.db/cityXA, http://geo.db/country7 and http://geo.db/country21]

Linked Data – An Example

[Figure: an RDF graph built up step by step. Nodes: http://data.linkedmdb.org/.../2014, http://data.linkedmdb.org/.../film, http://sws.geonames.org/2635167/, http://www4.wi … /0743424425, http://www4.wi … /1571884029, http://www4.wi … /Fiction, and the literals "The Shining", "United Kingdom" and 60943000. Edge labels: rdf:type, dc:title, foaf:based_near, movie:relatedBook, gn:population, rdfs:label, skos:subject]

Properties of Linked Data

● Anyone can publish data to the Web of data
● Entities are connected by links
  ● Giant global data graph that spans data sources
● Data is self-describing
  ● Vocabulary terms are identified by URIs, too
  ● Look-up yields their RDFS or OWL definition
● The Web of data is open
  ● Applications can discover new data sources at run-time

Is this real?

W3C Linking Open Data Project

● Grassroots community effort
● Publish existing, open license datasets as Linked Data
● Interlink things between different data sources

As of July 2007: > 500M triples, ca. 120,000 links

ca. 6.7B triples, ca. 150M links

Media, User generated content, Publications, Life Sciences, Geographic, Cross-domain

Linked Data Publishers

● UK government
● US government
● Thomson Reuters (Open Calais)
● MetaWeb (Freebase)
● BBC
● NY Times
● Best Buy
● CNET
etc.

Can I become part?

Linked Data Publishing Tools

● Use HTTP URIs in your FOAF profile
● Legacy data in relational databases
  ● D2R Server, Triplify, Virtuoso, Ultrawrap, ...
● CMS
  ● Drupal
● Native RDF stores
  ● Sesame, AllegroGraph, Virtuoso
  ● Talis platform (Linked Data in the cloud)
● HTML with RDFa

Integrating the Traditional Web

● Annotate Web documents with Linked Data URIs
● Annotation services using named entity recognition
  ● Open Calais (Thomson Reuters) for news
  ● Zemanta for blog posts
  ● Epiphany

[Figure: http://data.semanticweb.org/ … /eswc/2007/paper-69 dc:subject http://dbpedia.org/resource/Machine_Learning]


Technical Foundations

● HTTP

● URI

● RDF

● RDFS / OWL

There is no magic – Linked Data is based on well-established (Semantic) Web technologies.


URIs

● Hash URIs

http://olafhartig.de/foaf.rdf#olaf

● Slash URIs

http://data.linkedmdb.org/resource/film/2014

Looking up URIs

Give me data about http://olafhartig.de/foaf.rdf#olaf

HTTP Request for http://olafhartig.de/foaf.rdf

    GET /foaf.rdf HTTP/1.1
    User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6 OpenSSL/0.9.8l zlib/1.2.3
    Host: olafhartig.de
    Accept: */*

Looking up URIs

HTTP Response:

    HTTP/1.1 200 OK
    Date: Thu, 11 Mar 2010 08:47:53 GMT
    Server: Apache/2.2.6 (Unix) mod_ssl/2.2.6 OpenSSL/0.9.8g
    Last-Modified: Fri, 05 Mar 2010 18:01:07 GMT
    ETag: "72a16-1946-7fe53ec0"
    Accept-Ranges: bytes
    Content-Length: 6470
    Content-Type: application/rdf+xml
    Content-Language: de

    <?xml version="1.0" encoding="UTF-8"?>
    <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
             xmlns:dc="http://purl.org/dc/elements/1.1/"
             xmlns:foaf="http://xmlns.com/foaf/0.1/">

      <foaf:PersonalProfileDocument rdf:about="">
        <foaf:maker rdf:resource="http://olafhartig.de/foaf.rdf#olaf"/>
        ...

HTTP Content Negotiation

● Request the resource in a specific format (representation)
● Use the HTTP header Accept to specify a media type

Example:

    GET /data/dbprofs HTTP/1.1
    Host: researchersmap.informatik.hu-berlin.de
    Accept: text/rdf+n3

HTTP Content Negotiation

HTTP Response:

    HTTP/1.1 200 OK
    Date: Thu, 11 Mar 2010 09:02:22 GMT
    Server: Apache/2.2.13 (Linux/SUSE)
    Content-Location: dbprofs.n3
    Vary: negotiate,accept
    TCN: choice
    Last-Modified: Tue, 05 Jan 2010 14:46:17 GMT
    ETag: "40e4d-2250-47c6be683f0e1;47c6be69482f5"
    Accept-Ranges: bytes
    Content-Length: 8784
    Content-Type: text/rdf+n3

    @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
    @prefix foaf: <http://xmlns.com/foaf/0.1/> .
    @prefix contact: <http://www.w3.org/2000/10/swap/pim/contact#> .

    <> a foaf:Document ;
       foaf:maker <http://www.informatik.hu-berlin.de/~hartig/foaf.rdf#olaf> .
    ...
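The look-up and content negotiation requests shown on these slides can be sketched in Python. This is only an illustration under stated assumptions: the helper name build_lookup_request is made up for this example, and the request object is built but never sent, so no network access is needed.

```python
from urllib.request import Request


def build_lookup_request(uri, media_type="application/rdf+xml"):
    """Build (but do not send) an HTTP GET request for looking up a URI.

    Hypothetical helper for illustration; not part of the original slides.
    """
    # The fragment of a hash URI is never sent to the server, so the
    # request targets the document URI in front of the '#'.
    document_uri = uri.split("#", 1)[0]
    # The Accept header names the representation the client prefers.
    return Request(document_uri, headers={"Accept": media_type})


request = build_lookup_request("http://olafhartig.de/foaf.rdf#olaf")
print(request.full_url)
print(request.get_header("Accept"))
```

Passing such a request object to urllib.request.urlopen would actually perform the look-up; whether a given server honours the Accept header depends on its configuration.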


Redirections

HTTP Request for http://data.linkedmdb.org/resource/film/2014

    GET /resource/film/2014 HTTP/1.1
    User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6
    Host: data.linkedmdb.org
    Accept: application/rdf+xml

Response:

    HTTP/1.1 303 See Other
    Date: Thu, 11 Mar 2010 08:15:50 GMT
    Server: Jetty(6.1.4)
    Location: http://data.linkedmdb.org/data/film/2014
    Content-Length: 0
    Via: 1.1 data.linkedmdb.org
    Content-Type: text/plain

HTTP Request for http://data.linkedmdb.org/resource/film/2014

    GET /resource/film/2014 HTTP/1.1
    User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6
    Host: data.linkedmdb.org
    Accept: text/html

Response:

    HTTP/1.1 303 See Other
    Date: Thu, 11 Mar 2010 08:15:50 GMT
    Server: Jetty(6.1.4)
    Location: http://data.linkedmdb.org/page/film/2014
    Content-Length: 0
    Via: 1.1 data.linkedmdb.org
    Content-Type: text/plain
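The two 303 responses suggest a simple server-side rule, sketched below in Python. This is an assumption drawn only from the two Location headers above (/resource/ becomes /data/ for RDF and /page/ for HTML); the function name and the list of RDF media types are hypothetical, and a real server would also parse q-values and wildcards in the Accept header.

```python
def redirect_target(resource_uri, accept):
    """Choose the Location of a 303 redirect for a /resource/ URI.

    Illustrative only: mirrors the pattern visible in the two example
    responses, not the actual server implementation.
    """
    # Assumed set of media types that count as "the client wants RDF".
    rdf_types = ("application/rdf+xml", "text/rdf+n3")
    prefix = "/data/" if accept in rdf_types else "/page/"
    # Replace only the first occurrence of the path segment.
    return resource_uri.replace("/resource/", prefix, 1)


film = "http://data.linkedmdb.org/resource/film/2014"
print(redirect_target(film, "application/rdf+xml"))
print(redirect_target(film, "text/html"))
```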

Vocabularies and Ontologies

● Defined using RDFS or OWL
● Plenty of vocabularies exist:
  ● People
  ● Social media
  ● Commerce
  ● Events
  ● Radio and TV programmes
  ● Music
  etc.


owl:sameAs

http://sws.geonames.org/2635167/

=

http://dbpedia.org/resource/United_Kingdom

=

http://rdf.freebase.com/ns/guid.9202a8c04000641f800000000003e30b

=

http://www4.wiwiss.fu-berlin.de/factbook/resource/United_Kingdom

=

http://www4.wiwiss.fu-berlin.de/eurostat/resource/countries/United_Kingdom
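An application that consumes such owl:sameAs links typically wants to treat all linked URIs as one entity. A minimal Python sketch follows, assuming the links arrive as pairs of URIs; the function name same_as_groups and the particular pairs in the example are illustrative and not taken from any of the datasets.

```python
def same_as_groups(links):
    """Group URIs that are connected by owl:sameAs links (union-find)."""
    parent = {}

    def find(uri):
        # Register unseen URIs as their own representative.
        parent.setdefault(uri, uri)
        while parent[uri] != uri:
            parent[uri] = parent[parent[uri]]  # path halving
            uri = parent[uri]
        return uri

    # owl:sameAs is symmetric and transitive, so each link merges two groups.
    for left, right in links:
        parent[find(left)] = find(right)

    groups = {}
    for uri in parent:
        groups.setdefault(find(uri), set()).add(uri)
    return list(groups.values())


# Assumed example links between three of the URIs listed above.
links = [
    ("http://sws.geonames.org/2635167/",
     "http://dbpedia.org/resource/United_Kingdom"),
    ("http://dbpedia.org/resource/United_Kingdom",
     "http://www4.wiwiss.fu-berlin.de/factbook/resource/United_Kingdom"),
]
print(same_as_groups(links))
```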

[Figure: the example RDF graph from "Linked Data – An Example", extended with owl:sameAs links. Nodes: http://data.linkedmdb.org/.../2014, http://data.linkedmdb.org/.../film, http://www4.wi … /0743424425, http://sws.geonames.org/2635167/, http://dbpedia.org/resource/United_Kingdom, http://dbpedia.org/resource/Gordon_Brown, and the literals "The Shining", "United Kingdom", 60943000 and 44. Edge labels: rdf:type, dc:title, foaf:based_near, movie:relatedBook, gn:population, rdfs:label, owl:sameAs, db:callingCode, dbp:leader]


Consuming Linked Data

… by Humans

● Linked Data browsers

● Faceted browsers

● On-the-fly Linked Data Mashups

● Linked Data based applications

Linked Data Browsers

● Provide a tabular view on retrieved RDF data
● Some integrate data from multiple sources
● Allow users to follow RDF links
● Multiple options:
  ● Tabulator
  ● Disco
  ● OpenLink Data Explorer
  ● Zitgist Data Viewer
  ● Marbles
  etc.

Faceted Browsers

http://dbpedia.neofonie.de

On-the-fly Mashups

http://sig.ma

Linked Data based Applications

[SFSW'09]


New Kind of Applications

● Users retain full control over their data

● Users manage and publish data on their own

● All that is needed for the application is a URI

<http://www.dbis.informatik.hu-berlin.de/ … /freytag.rdf#me> rdf:type :DBProfessor . …

http://researchersmap.informatik.hu-berlin.de/data/dbprofs

Users Really Own their Data

http://www.dbis.informatik.hu-berlin.de/ ... /freytag.rdf

    <http://www.dbis.informatik.hu-berlin.de/ … /freytag.rdf#me>
        contact:fullName "Prof. Johann-Christoph Freytag, Ph.D." ;
        contact:office [
            contact:address [
                contact:street "Rudower Chaussee 25" ;
                contact:city "Berlin"^^xsd:string ;
                contact:postalCode "12489"^^xsd:string
            ]
        ] ;
        foaf:topic_interest <http://dbpedia.org/resource/Query_optimization> ,
                            <http://dbpedia.org/resource/Privacy> ,
                            <http://dbpedia.org/resource/Data_quality> ,
                            <http://dbpedia.org/resource/Data_warehouse> ;
        owl:sameAs <http://dblp.l3s.de/d2r/resource/authors/Johann_Christoph_Freytag> .
    …


Consuming Linked Data

… in Applications

● Look up URIs and process the retrieved data

● Query with SPARQL

Brief Introduction to SPARQL

● Query language for RDF data
● Main idea: pattern matching
  ● Describe subgraphs of the queried RDF graph
  ● Subgraphs that match your description yield a result
  ● Means: graph patterns (i.e. RDF graphs with variables)

Graph pattern:

    ?v  rdf:type  http://.../Volcano

Queried graph:

    http://.../Mount_Baker  rdf:type        http://.../Volcano
    http://.../Mount_Baker  p:lastEruption  "1880"
    http://.../Mount_Etna   rdf:type        http://.../Volcano

Results:

    ?v
    http://.../Mount_Baker
    http://.../Mount_Etna
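The pattern matching idea can be made concrete with a few lines of Python. This is a sketch of matching a single triple pattern, not a SPARQL engine; the example.org URIs stand in for the elided URIs on the slide and, like the function name match_pattern, are assumptions made for illustration.

```python
def match_pattern(pattern, graph):
    """Return one variable binding (a dict) per triple matching the pattern."""
    solutions = []
    for triple in graph:
        binding = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                # A variable matches anything but must bind consistently.
                if binding.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            # No term mismatched, so the triple matches the pattern.
            solutions.append(binding)
    return solutions


EX = "http://example.org/"  # hypothetical namespace for the elided URIs
graph = [
    (EX + "Mount_Baker", "rdf:type", EX + "Volcano"),
    (EX + "Mount_Baker", "p:lastEruption", "1880"),
    (EX + "Mount_Etna", "rdf:type", EX + "Volcano"),
]
for solution in match_pattern(("?v", "rdf:type", EX + "Volcano"), graph):
    print(solution["?v"])
```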

Querying Linked Data with SPARQL

● Linked Data sources usually provide a SPARQL service
● Send your query, receive the result

    Data Source            Endpoint Address
    DBpedia                http://dbpedia.org/sparql
    Musicbrainz            http://dbtune.org/musicbrainz/sparql
    U.S. Census            http://www.rdfabout.com/sparql
    Semantic Crunchbase    http://cb.semsol.org/sparql

More complete list: http://esw.w3.org/topic/SparqlEndpoints
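"Send your query" usually means an HTTP request that carries the query text in a parameter named query, as defined by the SPARQL protocol. The Python sketch below only builds such a request URL; the helper name sparql_query_url is hypothetical, the example query is made up, and nothing is sent over the network.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def sparql_query_url(endpoint, query):
    """Build the GET URL that carries a SPARQL query to an endpoint."""
    # urlencode percent-encodes the query text for use in a URL.
    return endpoint + "?" + urlencode({"query": query})


query = "SELECT ?s WHERE { ?s ?p ?o } LIMIT 5"  # illustrative query
url = sparql_query_url("http://dbpedia.org/sparql", query)
print(url)
# Decoding the URL again recovers the original query text.
print(parse_qs(urlsplit(url).query)["query"][0])
```

Availability and supported result formats of the listed endpoints may have changed since these slides were written.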

Querying Linked Data with SPARQL

Querying a single dataset is quite boring compared to issuing SPARQL queries over multiple datasets.

How can you do this?

● Issue follow-up queries to different endpoints
● Query a central collection of datasets
● Build a store with copies of relevant datasets
● (Use a query federation system)
● Use a link traversal based query system

Querying Linked Data with SPARQL

Traditional approach 1: data centralization
● Querying a collection of copies from all relevant datasets

Traditional approach 2: federated query processing
● Querying a mediator which distributes subqueries to relevant sources and integrates the results

Main drawback:

You have to know the relevant data sources in advance.
You restrict yourself to the selected sources.
You do not tap the full potential of the Web!

A novel approach:

Link Traversal Based Query Execution [ISWC'09]

Main Idea

● Intertwine query evaluation with traversal of RDF links
● Alternately:
  ● Evaluate parts of the query on a continuously augmented set of data
  ● Look up URIs in intermediate solutions and add retrieved data to the queried data set

[Figure: animated example.

Query (graph pattern):

    http://.../movie2449  filmingLocation  ?loc
    ?loc                  statistics       ?stat
    ?stat                 unemp_rate       ?ur

Steps shown:
1. Look up http://.../movie2449 ("?") and add the retrieved data to the queried data.
2. The queried data now contains: http://.../movie2449 filmingLocation http://geo.../Italy
3. Intermediate solution: ?loc = http://geo.../Italy
4. Look up http://geo.../Italy ("?") and add the retrieved data to the queried data.
5. The queried data now contains: http://geo.../Italy statistics http://stat.db/.../it
6. Intermediate solution: ?loc = http://geo.../Italy, ?stat = http://stat.db/.../it]

In a Nutshell

● Link traversal based query execution:
  ● Evaluation on a continuously augmented dataset
  ● Discovery of potentially relevant data during execution
  ● Discovery driven by intermediate solutions
● Main advantage:
  ● No need to know all data sources in advance
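The interplay of evaluation and URI look-ups can be sketched in Python over a simulated Web. This is a deliberately simplified illustration and not the algorithm from [ISWC'09]: the dictionary WEB stands in for HTTP look-ups, all example URIs and the unemployment value are made up, and triple patterns are evaluated strictly one after another.

```python
def unify(pattern, triple):
    """Return the variable bindings if the triple matches, else None."""
    binding = {}
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            if binding.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return binding


def lookup(uri, web, dataset, seen):
    """Simulate looking up a URI and adding the retrieved triples."""
    if uri not in seen:
        seen.add(uri)
        dataset.update(web.get(uri, []))


def execute(patterns, web):
    """Evaluate the patterns in order, looking up URIs discovered on the way."""
    dataset, seen = set(), set()
    # Seed the queried data with look-ups of URIs mentioned in the query.
    for pattern in patterns:
        for term in pattern:
            if term.startswith("http"):
                lookup(term, web, dataset, seen)
    solutions = [{}]
    for pattern in patterns:
        extended = []
        for solution in solutions:
            # Substitute variables that are already bound.
            bound = tuple(solution.get(term, term) for term in pattern)
            for triple in list(dataset):  # snapshot; dataset may grow
                binding = unify(bound, triple)
                if binding is None:
                    continue
                extended.append({**solution, **binding})
                # Link traversal: look up URIs in the intermediate solution.
                for value in binding.values():
                    if value.startswith("http"):
                        lookup(value, web, dataset, seen)
        solutions = extended
    return solutions


# Hypothetical Web: each URI maps to the triples its look-up would return.
MOVIE = "http://movie.example/movie2449"
ITALY = "http://geo.example/Italy"
STATS = "http://stat.example/it"
WEB = {
    MOVIE: [(MOVIE, "filmingLocation", ITALY)],
    ITALY: [(ITALY, "statistics", STATS)],
    STATS: [(STATS, "unemp_rate", "7.9")],  # made-up value
}
QUERY = [
    (MOVIE, "filmingLocation", "?loc"),
    ("?loc", "statistics", "?stat"),
    ("?stat", "unemp_rate", "?ur"),
]
print(execute(QUERY, WEB))
```

Only the movie URI is known when execution starts; the other two sources are discovered through intermediate solutions, which is the main advantage named above. Data retrieved while a pattern is being evaluated becomes visible only to later patterns in this sketch, so it can miss solutions that a more careful implementation would find.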

Real-World Example

Return phone numbers of authors of ontology engineering papers at ESWC'09.

    SELECT DISTINCT ?author ?phone WHERE {
      ?pub swc:isPartOf <http://data.semanticweb.org/conference/eswc/2009/proceedings> .
      ?pub swc:hasTopic ?topic .
      ?topic rdfs:label ?topicLabel .
      FILTER regex( str(?topicLabel), "ontology engineering", "i" ) .
      ?pub swrc:author ?author .
      { ?author owl:sameAs ?authorAlt }
      UNION
      { ?authorAlt owl:sameAs ?author }
      ?authorAlt foaf:phone ?phone
    }

    # of query results        2
    # of retrieved graphs     297
    # of accessed servers     16
    avg. execution time       1min 30sec

Application

● Researchers Map implemented with SQUIN
● Query interface to the whole Web of Data

[Figure labels: SQUIN, SemWeb Client Lib]

    SELECT DISTINCT ?i ?label
    WHERE {
      ?prof rdf:type <http://res ... data/dbprofs#DBProfessor> ;
            foaf:topic_interest ?i .
      OPTIONAL {
        ?i rdfs:label ?label
        FILTER( LANG(?label)="en" || LANG(?label)="" )
      }
    }
    ORDER BY ?label

Application

● Implementation of Researchers Map was very easy due to:
  ● SQUIN / SemWeb Client Lib
● Approx. 700 LOC JavaScript (incl. 100 for the queries)
● Approx. 50 LOC PHP (mainly to set up a server side proxy due to the same origin policy)
● Convenient access to SQUIN with SQUIN PHP tools
● Try it: http://squin.org

    $s = 'http:// …'; // address of the SQUIN service
    $q = new SparqlQuerySock( $s, '… SELECT ...' );
    $res = $q->getJsonResult(); // or getXmlResult()


Consuming Linked Data

… getting started

Issues people have when they want to start:

● Finding URIs

● Finding additional data

● Finding SPARQL endpoints

Finding URIs

Problem: What URIs exist that identify the thing I'm interested in?

Two options:

● Data source specific solutions
  ● Some Linked Data sources provide a keyword based search for things in their dataset(s)
● Search engines for the Web of data

Finding URIs

What if there is no search possibility?

You may try a SPARQL query:

    SELECT DISTINCT ?s WHERE {
      ?s rdfs:label ?label .
      FILTER regex( str(?label), "Berlin", "i" ) .
    }

Finding URIs

● Search engines for the Web of data provide keyword based search for things in different datasets
  ● Falcons http://iws.seu.edu.cn/services/falcons/
  ● Sindice http://sindice.com
  ● SWSE http://www.swse.org
  ● Watson http://watson.kmi.open.ac.uk
● They also have APIs

Finding Additional Data

Problem: Given a URI, where do I find more data than what is available by looking it up?

Three options:

● Follow links (e.g. rdfs:seeAlso, owl:sameAs)
● Use a search engine for the Web of data
● Use a co-reference service
  ● Co-reference services find different URIs that refer to the same thing
  ● They may also provide an API

Finding SPARQL Endpoints

Problem:
What relevant endpoints exist?
Where is the SPARQL endpoint for a dataset?
What is the data provided via a SPARQL endpoint about?

● Look at: http://esw.w3.org/topic/SparqlEndpoints
● Still an open issue


Linked Data Fusion

Applications want an integrated view on all data that is available about a thing

Requirements:

● Schema mapping: map data into a single schema
● Identity resolution: smush data from all sources
● Conflict resolution: resolve inconsistencies in the data

User Interfaces and Interaction

● How do we build interfaces that operate over such a large amount of data?

● What will be their interaction paradigm?

● How to explain data provenance and data fusion?

Provenance, Quality, and Trust

● There are no facts on the Web – everything is a claim
● Increasing amount of research in this area
● W3C provenance incubator group
● Our contributions so far:
  ● A provenance model for the Web of data [LDOW'09]
  ● A provenance based Information Quality assessment method [SWPM'09]
  ● tSPARQL – a trust aware extension for SPARQL [ESWC'09]

Take-away Summary

● Entities are connected by data links
● Data is self-describing
● Anyone can publish data to the Web of data
● Linked Data holds an enormous potential: users may benefit from a virtually unbounded set of data sources
● Learn more about Linked Data:
  ● "Linked Data – The Story So Far" by C. Bizer, T. Heath, T. Berners-Lee
  ● On consuming Linked Data: http://consuminglinkeddata.org

The traditional Web of documents evolves into a Web of data.

These slides have been created by Olaf Hartig

http://olafhartig.de

Some slides are based on slide sets provided by
● Christian Bizer
● Juan Sequeda

This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License
(http://creativecommons.org/licenses/by-sa/3.0/)