Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Linked Data: opportunities and challenges Dr. Michael Hausenblas, DERI, NUI Galway Open Science Data Cloud NSF PIRE Workshop, Edinburgh, UK, 18 July 2012

Linked Data: opportunities and challenges

DESCRIPTION

Presentation about opportunities and challenges concerning Linked Data at the Open Science Data Cloud NSF PIRE Workshop [1] on 18 July 2012 in Edinburgh, UK. [1] http://www.opensciencedatacloud.org/osdc-edinburgh-workshop-71612-71712/

Page 1: Linked Data: opportunities and challenges

Copyright 2011 Digital Enterprise Research Institute. All rights reserved.

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Linked Data: opportunities and challenges

Dr. Michael Hausenblas, DERI, NUI Galway

Open Science Data Cloud NSF PIRE Workshop, Edinburgh, UK, 18 July 2012

Page 2: Linked Data: opportunities and challenges

Linked Data 101

Page 3: Linked Data: opportunities and challenges

Linked Data principles

① Use URIs to identify the “things” in your data

② Use HTTP URIs so people & machines can look them up

③ When a URI is looked up return a description of the thing in a structured format (RDF)

④ Link to related things to provide context

http://www.w3.org/DesignIssues/LinkedData.html
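
The four principles can be illustrated with a minimal sketch: things are named with HTTP URIs, described as subject-predicate-object statements, and linked to other things. The helper `to_ntriples` is hypothetical, handles only backslash and quote escaping, and treats any object that does not start with `http://` as a plain literal. The first triple is taken from the DBpedia example in this deck; the `rdfs:label` triple is an illustrative assumption.

```python
def to_ntriples(triples):
    """Serialise (subject, predicate, object) tuples as N-Triples lines.

    Assumption: objects starting with "http://" are URIs, everything
    else is a plain string literal.
    """
    lines = []
    for s, p, o in triples:
        if o.startswith("http://"):
            obj = f"<{o}>"
        else:
            escaped = o.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"'
        lines.append(f"<{s}> <{p}> {obj} .")
    return "\n".join(lines)


EDINBURGH = "http://dbpedia.org/resource/Edinburgh"

triples = [
    # Principle 4: link one thing to a related thing.
    ("http://dbpedia.org/resource/Murrayfield_Stadium",
     "http://dbpedia.org/ontology/location",
     EDINBURGH),
    # Illustrative literal, not taken from the deck.
    (EDINBURGH, "http://www.w3.org/2000/01/rdf-schema#label", "Edinburgh"),
]

print(to_ntriples(triples))
```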

Page 4: Linked Data: opportunities and challenges

HTTP URIs

http://sw-app.org/mic.xhtml#i
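
The URI above is a hash URI. An HTTP client removes the fragment (`#i`) before sending the request, so the URI names a thing while the part before the hash names the document that describes it. A small sketch using only the Python standard library:

```python
from urllib.parse import urldefrag

thing_uri = "http://sw-app.org/mic.xhtml#i"

# Split the URI into the retrievable document URL and the fragment
# that identifies the thing inside that document.
document_url, fragment = urldefrag(thing_uri)

print(document_url)  # http://sw-app.org/mic.xhtml
print(fragment)      # i
```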

Page 5: Linked Data: opportunities and challenges

HTTP URIs

http://dbpedia.org/resource/Edinburgh

curl -L -H "Accept: application/rdf+xml" http://dbpedia.org/resource/Edinburgh

<?xml version="1.0" encoding="utf-8" ?>
<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
    xmlns:owl="http://www.w3.org/2002/07/owl#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"
    xmlns:dbpedia-owl="http://dbpedia.org/ontology/"
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dbpprop="http://dbpedia.org/property/"
    xmlns:ns10="http://dbpedia.org/property/start/" >
  <rdf:Description rdf:about="http://dbpedia.org/resource/Firrhill_High_School">
    <dbpedia-owl:city rdf:resource="http://dbpedia.org/resource/Edinburgh" />
  </rdf:Description>
  <rdf:Description rdf:about="http://dbpedia.org/resource/Murrayfield_Stadium">
    <dbpedia-owl:location rdf:resource="http://dbpedia.org/resource/Edinburgh" />
    <dbpprop:location rdf:resource="http://dbpedia.org/resource/Edinburgh" />
  </rdf:Description>
  <rdf:Description rdf:about="http://dbpedia.org/resource/Stewart%27s_Melville_College">
    <dbpedia-owl:city rdf:resource="http://dbpedia.org/resource/Edinburgh" />
    <dbpprop:city rdf:resource="http://dbpedia.org/resource/Edinburgh" />
  </rdf:Description>
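
The same content negotiation can be sketched in Python with the standard library. The helper name `build_rdf_request` is hypothetical; it only builds the request, and the network call is left commented out so the sketch runs offline.

```python
import urllib.request


def build_rdf_request(uri, media_type="application/rdf+xml"):
    """Build (but do not send) a GET request that asks for an RDF format."""
    return urllib.request.Request(uri, headers={"Accept": media_type})


req = build_rdf_request("http://dbpedia.org/resource/Edinburgh")
print(req.get_method(), req.full_url)
print(req.get_header("Accept"))

# Sending the request needs network access; urlopen follows redirects,
# much like the -L flag of curl:
# with urllib.request.urlopen(req) as resp:
#     print(resp.geturl(), resp.headers.get("Content-Type"))
```

Passing `media_type="text/turtle"` corresponds to the Turtle requests on the following slides.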

Page 6: Linked Data: opportunities and challenges

HTTP URIs

http://data.ordnancesurvey.co.uk/id/7000000000017765

curl -L -H "Accept: text/turtle" http://data.ordnancesurvey.co.uk/id/7000000000017765

<http://data.ordnancesurvey.co.uk/doc/7000000000017765>
    rdf:type foaf:Document, dctype:Text ;
    foaf:primaryTopic <http://data.ordnancesurvey.co.uk/id/7000000000017765> ;
    dct:title "Linked Data for The County of Hampshire" ;
    dct:hasFormat <http://data.ordnancesurvey.co.uk/doc/7000000000017765.rdf> ,
        <http://data.ordnancesurvey.co.uk/doc/7000000000017765.html> ,
        <http://data.ordnancesurvey.co.uk/doc/7000000000017765.json> ,
        <http://data.ordnancesurvey.co.uk/doc/7000000000017765.ttl> .

<http://data.ordnancesurvey.co.uk/id/7000000000017636> rdfs:label "Tadley" ; skos:prefLabel "Tadley" .

<http://data.ordnancesurvey.co.uk/id/7000000000017510> rdfs:label "Newton Valence" ; skos:prefLabel "Newton Valence" .

<http://data.ordnancesurvey.co.uk/id/7000000000017817> rdfs:label "Ashmansworth" ; skos:prefLabel "Ashmansworth" .
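
The response above separates the thing (`/id/...`) from the document about it (`/doc/...`) and lists one document URL per format. The sketch below reproduces that naming pattern. It is inferred from the output shown here, not from Ordnance Survey documentation, and the pairing of media types with suffixes in `FORMAT_SUFFIXES` is an assumption.

```python
# Assumed pairing of media types with the suffixes listed under dct:hasFormat.
FORMAT_SUFFIXES = {
    "application/rdf+xml": ".rdf",
    "text/html": ".html",
    "application/json": ".json",
    "text/turtle": ".ttl",
}


def document_url(thing_uri, media_type):
    """Map an /id/ URI to the format-specific /doc/ URL (pattern assumed)."""
    if "/id/" not in thing_uri:
        raise ValueError(f"not an /id/ URI: {thing_uri}")
    doc_uri = thing_uri.replace("/id/", "/doc/", 1)
    return doc_uri + FORMAT_SUFFIXES[media_type]


hampshire = "http://data.ordnancesurvey.co.uk/id/7000000000017765"
print(document_url(hampshire, "text/turtle"))
```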

Page 7: Linked Data: opportunities and challenges

HTTP URIs

http://bio2rdf.org/genbank:AC008393

curl -L -H "Accept: text/turtle" http://bio2rdf.org/genbank:AC008393

@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix dc: <http://purl.org/dc/elements/1.1/> .

<http://bio2rdf.org/genbank:AC008393> a <http://bio2rdf.org/genbank_resource:Sequence> ;
    rdfs:label "Homo sapiens chromosome 5 clone CTC-241N9, complete sequence [genbank:AC008393]" ;
    owl:sameAs <http://bio2rdf.org/genbank:ac008393> ;
    dc:title "Homo sapiens chromosome 5 clone CTC-241N9, complete sequence" ;
    dc:modified "26-FEB-2002" ;
    <http://bio2rdf.org/bio2rdf_resource:length> "166847" ;
    <http://bio2rdf.org/bio2rdf_resource:linkedToFrom> <http://bio2rdf.org/iproclass:A8MPR0> , <http://bio2rdf.org/iproclass:B5MCC3> .

Page 8: Linked Data: opportunities and challenges

HTTP URIs

http://bnb.data.bl.uk/doc/resource/009468944

curl -L -H "Accept: text/turtle" http://bnb.data.bl.uk/doc/resource/009468944

@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix dct: <http://purl.org/dc/terms/> .
@prefix blterms: <http://www.bl.uk/schemas/bibliographic/blterms#> .
@prefix elements: <http://iflastandards.info/ns/isbd/elements/> .
@prefix bibo: <http://purl.org/ontology/bibo/> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix linked-data: <http://purl.org/linked-data/api/vocab#> .
@prefix void: <http://rdfs.org/ns/void#> .

<http://bnb.data.bl.uk/id/resource/009468944>
    dct:language <http://lexvo.org/id/iso639-3/eng> ;
    rdfs:seeAlso <http://www4.wiwiss.fu-berlin.de/bookmashup/books/0859761541> ;
    elements:P1053 "vii,147p."@en ;
    rdfs:label "William Wallace / Andrew Fisher" ;
    blterms:bnb "GB8714157" ;
    dct:creator <http://bnb.data.bl.uk/id/person/FisherAndrew1935-> ;
    bibo:isbn10 "0859761541" ;
    dct:title "William Wallace" ;
    rdf:type bibo:Book , dct:BibliographicResource ;
    dct:subject <http://bnb.data.bl.uk/id/concept/ddc/e19/941.1020924> .

Page 9: Linked Data: opportunities and challenges

Linked Open Data

Page 10: Linked Data: opportunities and challenges

[Figure: snapshots of the LOD cloud diagram for 2007, 2008, 2009 and 2010]

Linked Open Data cloud

Page 11: Linked Data: opportunities and challenges

Over 300 open data sets with 40 billion facts, interlinked by 500 million typed links.

http://lod-cloud.net/

Linked Open Data cloud
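
A rough back-of-the-envelope reading of the figures on this slide: because the slide says "over 300" data sets, the per-data-set average below is an upper estimate, and the real distribution is far from uniform. The variable names are illustrative.

```python
datasets = 300            # "over 300 open data sets"
facts = 40_000_000_000    # 40 billion facts
links = 500_000_000       # 500 million typed links

facts_per_dataset = facts / datasets
links_per_thousand_facts = 1000 * links / facts

print(f"about {facts_per_dataset / 1e6:.0f} million facts per data set on average")
print(f"about {links_per_thousand_facts} typed links per 1,000 facts")
```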

Page 12: Linked Data: opportunities and challenges

Linked Open Data cloud stats

triples distribution

links distribution

http://lod-cloud.net/state/

Page 13: Linked Data: opportunities and challenges

… cost and benefits

Page 14: Linked Data: opportunities and challenges

Linked Data life cycles

Page 15: Linked Data: opportunities and challenges

Linked Data life cycles

[Figure: life-cycle diagram labelled with LOD cloud, 5stardata.info, Neologism, Schema.org, Google Refine, D2RQ, FYN, LATC DSI, LATC 24/7 data-gov.ie]

http://linked-data-life-cycles.info

Page 16: Linked Data: opportunities and challenges

Modeling

Page 17: Linked Data: opportunities and challenges

Neologism

http://neologism.deri.ie/

Page 18: Linked Data: opportunities and challenges

Neologism

http://vocab.data.gov/

Page 19: Linked Data: opportunities and challenges

Schema.org – Linked Data

Page 20: Linked Data: opportunities and challenges

Publishing

Page 21: Linked Data: opportunities and challenges

Google Refine extension

http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/

Page 22: Linked Data: opportunities and challenges

RDB2RDF – D2RQ

http://d2rq.org/
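
The general idea behind relational-to-RDF mapping can be sketched as follows: each row becomes a subject URI, each column a predicate, and each cell a literal. This is not D2RQ's mapping language or API, only an illustration with Python's built-in `sqlite3`. The `http://example.org/` namespace, the `book` table, and the function `rows_to_ntriples` are hypothetical, and table and column names are assumed to be trusted identifiers.

```python
import sqlite3

BASE = "http://example.org/"  # hypothetical namespace for minted URIs


def rows_to_ntriples(conn, table, key, columns):
    """Yield one N-Triples line per non-NULL cell of the given table."""
    query = f"SELECT {key}, {', '.join(columns)} FROM {table}"
    for row in conn.execute(query):
        subject = f"<{BASE}{table}/{row[0]}>"
        for column, value in zip(columns, row[1:]):
            if value is None:
                continue  # NULL cells produce no triple
            escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
            yield f'{subject} <{BASE}vocab/{table}#{column}> "{escaped}" .'


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE book (id INTEGER PRIMARY KEY, title TEXT, isbn10 TEXT)")
conn.execute("INSERT INTO book VALUES (1, 'William Wallace', '0859761541')")

for line in rows_to_ntriples(conn, "book", "id", ["title", "isbn10"]):
    print(line)
```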

Page 23: Linked Data: opportunities and challenges

Discovery

Page 24: Linked Data: opportunities and challenges

Follow-Your-Nose
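
Follow-your-nose discovery means looking up a URI, reading the links in its description, and looking those up in turn. A minimal sketch, assuming a stand-in `dereference` function over an in-memory dictionary instead of real HTTP lookups; all `example.org` URIs and predicate names are hypothetical.

```python
from collections import deque

# Stand-in for the Web: URI -> list of (predicate, linked URI).
WEB = {
    "http://example.org/a": [("ex:related", "http://example.org/b"),
                             ("ex:related", "http://example.org/c")],
    "http://example.org/b": [("ex:related", "http://example.org/d")],
    "http://example.org/d": [("ex:related", "http://example.org/e")],
}


def dereference(uri):
    """Return the outgoing links found in the description of uri."""
    return WEB.get(uri, [])


def follow_your_nose(start, max_hops=2):
    """Breadth-first traversal of links, at most max_hops away from start."""
    seen = {start}
    queue = deque([(start, 0)])
    visited = []
    while queue:
        uri, hops = queue.popleft()
        visited.append(uri)
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for _predicate, target in dereference(uri):
            if target not in seen:
                seen.add(target)
                queue.append((target, hops + 1))
    return visited


print(follow_your_nose("http://example.org/a"))
```

The hop limit keeps the crawl bounded; without it, a traversal of real Linked Data could continue across the whole LOD cloud.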

Page 25: Linked Data: opportunities and challenges

http://dsi.lod-cloud.net/

Dataset discovery

Page 26: Linked Data: opportunities and challenges

Integration

Page 27: Linked Data: opportunities and challenges

Why linking?

Central Contractor Registration (CCR)

Geonames

http://webofdata.wordpress.com/2011/05/22/why-we-link/

Page 28: Linked Data: opportunities and challenges

Effort distribution

[Figure: a fixed overall data integration effort, split into the publisher’s effort, the consumer’s effort and third-party effort]

Page 29: Linked Data: opportunities and challenges

LATC – Interlinking Platform

http://latc-project.eu/platform
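
Interlinking boils down to proposing links between entities in different data sets that denote the same thing. The sketch below is a toy illustration based on label similarity; it is not the method used by the LATC platform, and the data sets, URIs, threshold, and function names are all hypothetical.

```python
from difflib import SequenceMatcher

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def similarity(a, b):
    """Case-insensitive string similarity between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def propose_links(source, target, threshold=0.9):
    """Link each source entity to its best-matching target label, if good enough.

    source and target map entity URIs to labels.
    """
    links = []
    for s_uri, s_label in source.items():
        best_uri, best_score = None, 0.0
        for t_uri, t_label in target.items():
            score = similarity(s_label, t_label)
            if score > best_score:
                best_uri, best_score = t_uri, score
        if best_uri is not None and best_score >= threshold:
            links.append((s_uri, OWL_SAME_AS, best_uri))
    return links


dataset_a = {
    "http://example.org/a/tadley": "Tadley",
    "http://example.org/a/ashmansworth": "Ashmansworth",
}
dataset_b = {
    "http://example.org/b/1": "tadley",
    "http://example.org/b/2": "Newton Valence",
}

for link in propose_links(dataset_a, dataset_b):
    print(link)
```

Label matching alone produces false positives on real data (different places can share a name), which is one reason link quality appears among the challenges in the conclusion.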

Page 30: Linked Data: opportunities and challenges

http://www4.wiwiss.fu-berlin.de/latc/toollibrary/screencast.html

Page 31: Linked Data: opportunities and challenges

Conclusion

Opportunities:

Use the LOD cloud as test-bed (experiments)

Benefit from LOD cloud in apps (context)

Contribute to make your data more valuable

Challenges:

Large-scale processing of Linked Data

Distributed/federated SPARQL queries

Quality of links and the data

Page 32: Linked Data: opportunities and challenges

Resources

Tutorials, technologies, specifications:

http://linkeddatabook.com

http://lod-cloud.net

http://linkeddata.org

http://linkeddata-specs.info

http://schema.rdfs.org

Videos:

http://ted.com/talks/tim_berners_lee_on_the_next_web.html - Tim Berners-Lee’s TED talk

http://www.youtube.com/watch?v=GKfJ5onP5SQ - Linked Data (and the Web of Data)

http://www.youtube.com/watch?v=4x_xzT5eF5Q - What is Linked Data?

http://vimeo.com/36752317 - Linked Open Data (by Europeana)