48
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data Baltic DB & IS 2014 http://eis.iai.uni-bonn.de Christoph Lange 1,2 and Sören Auer 1,2 1 Enterprise Information Systems, University of Bonn, Germany 2 Organized Knowledge, Fraunhofer IAIS, Sankt Augustin, Germany 2014-06-09 Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 1

Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Embed Size (px)

DESCRIPTION

The Linked Data paradigm has emerged as a powerful enabler for data and knowledge interlinking and exchange using standardised Web technologies. In this article, we discuss our vision how the Linked Data paradigm can be employed to evolve the intranets of large organisations -- be it enterprises, research organisations or governmental and public administrations -- into networks of internal data and knowledge. In particular for large enterprises data integration is still a key challenge. The Linked Data paradigm seems a promising approach for integrating enterprise data. Like the Web of Data, which now complements the original document-centred Web, data intranets may help to enhance and flexibilise the intranets and service-oriented architectures that exist in large organisations. Furthermore, using Linked Data gives enterprises access to 50+ billion facts from the growing Linked Open Data (LOD) cloud. As a result, a data intranet can help to bridge the gap between structured data management (in ERP, CRM or SCM systems) and semi-structured or unstructured information in documents, wikis or web portals, and make all of these sources searchable in a coherent way. Keynote at Baltic DB&IS 2014, 9 June 2014, Tallinn, Estonia

Citation preview

Page 1: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Interlinking Data and Knowledgein Enterprises, Research and Society

with Linked DataBaltic DB & IS 2014

http://eis.iai.uni-bonn.de

Christoph Lange1,2 and Sören Auer1,2

1Enterprise Information Systems, University of Bonn, Germany2Organized Knowledge, Fraunhofer IAIS, Sankt Augustin, Germany

2014-06-09Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 1

Page 2: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

„Tere, maailm!”

2011 PhD at Jacobs Univ. Bremen, Germany: EnablingCollaboration on Semiformal MathematicalKnowledge by Semantic Web Integration [Lan11]

2011/12 Univ. Bremen, Germany: Ontology Integrationand Interoperability (OntoIOp)↝ DistributedOntology Language (DOL)

2012/13 Univ. Birmingham, UK: Formal MathematicalReasoning in Economics (ForMaRE) [KLR]

2013– Univ. Bonn, Germany: Enterprise InformationSystems; Fraunhofer IAIS: Organized Knowledge

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 2

Page 3: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

EIS/OK Group in Bonn

Prof. Sören Auerpreviously at the University of Leipzig, AKSW group(DBpedia etc.)Christoph Lange: 1 of 3 postdocs∼ 15 members of scientific staff6 PhD students

Business areas:Enterprise Information IntegrationDigital Libraries (cultural heritage and otherapplications)Personalised medicine

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 3

Page 4: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Data in Today’s SocietyExample (Demographics in Bonn)

Statistics, e.g. frommunicipal office for integrationHousing (accessibility, availability, . . . ):municipal, commercial, self-organisedTransport (e.g. accessibility): bus/tramInfrastructure: e.g. accessible public toilets

Complex questions:Apartments that meet my requirements w.r.t.public transport, accessibility, care, co-residentsWhat bus takes me from A to B, with sufficientchanging time near an accessible toilet?

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 4

Page 5: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Example: Accessible Facilities

Collected by the Bonn Disableds’ Union. Now combinewith public transport, housing offers, . . . !

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 5

Page 6: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Data in Science

Example (Quality of Scientific Workshops)Indicators for the quality of a workshop:

part of a high-profile conferencelong history

many editionscontinuity in chairsnumber of papers not shrinking

contributions frommany institutions and countries

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 6

Page 7: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Data in Science: Datasets

Sources of complementary information:DBLP computer science publications (basics), author

name disambiguationCEUR-WS.org computer science workshops

→ ESWC 2014 Semantic Publishing ChallengeSpringer computer science conferencesWikiCFP calls for papers

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 7

Page 8: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Data Sources of Interest

(Open) Government Datageneral Open Data: Wikipedia, OpenStreetMap, . . .private datapersonal data

How can they be . . .1 published (licenses, privacy),2 described (for machines to understand),3 discovered,4 integrated,5 analysed?

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 8

Page 9: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Linked (Open) Data: Principles

http://5stardata.info

☀ make your stuff available on the Web(whatever format) under an open li-cense

☀☀ make it available as structured data(e.g., Excel instead of image scan of atable)

☀☀☀ use non-proprietary formats (e.g., CSVinstead of Excel)

☀☀☀☀ use URIs to denote things, so thatpeople can point at your stuff

☀☀☀☀☀ linkyourdata tootherdata toprovidecontext [12]

☀☀☀☀☀☀ further stars proposed for: quality[DLA14], explicit schema [Hyv+]

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 9

Page 10: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Linked (Open) Data: Datasets

As of September 2011

MusicBrainz

(zitgist)

P20

Turismo de

Zaragoza

yovisto

Yahoo! Geo

Planet

YAGO

World Fact-book

El ViajeroTourism

WordNet (W3C)

WordNet (VUA)

VIVO UF

VIVO Indiana

VIVO Cornell

VIAF

URIBurner

Sussex Reading

Lists

Plymouth Reading

Lists

UniRef

UniProt

UMBEL

UK Post-codes

legislationdata.gov.uk

Uberblic

UB Mann-heim

TWC LOGD

Twarql

transportdata.gov.

uk

Traffic Scotland

theses.fr

Thesau-rus W

totl.net

Tele-graphis

TCMGeneDIT

TaxonConcept

Open Library (Talis)

tags2con delicious

t4gminfo

Swedish Open

Cultural Heritage

Surge Radio

Sudoc

STW

RAMEAU SH

statisticsdata.gov.

uk

St. Andrews Resource

Lists

ECS South-ampton EPrints

SSW Thesaur

us

SmartLink

Slideshare2RDF

semanticweb.org

SemanticTweet

Semantic XBRL

SWDog Food

Source Code Ecosystem Linked Data

US SEC (rdfabout)

Sears

Scotland Geo-

graphy

ScotlandPupils &Exams

Scholaro-meter

WordNet (RKB

Explorer)

Wiki

UN/LOCODE

Ulm

ECS (RKB

Explorer)

Roma

RISKS

RESEX

RAE2001

Pisa

OS

OAI

NSF

New-castle

LAASKISTI

JISC

IRIT

IEEE

IBM

Eurécom

ERA

ePrints dotAC

DEPLOY

DBLP (RKB

Explorer)

Crime Reports

UK

Course-ware

CORDIS (RKB

Explorer)CiteSeer

Budapest

ACM

riese

Revyu

researchdata.gov.

ukRen. Energy Genera-

tors

referencedata.gov.

uk

Recht-spraak.

nl

RDFohloh

Last.FM (rdfize)

RDF Book

Mashup

Rådata nå!

PSH

Product Types

Ontology

ProductDB

PBAC

Poké-pédia

patentsdata.go

v.uk

OxPoints

Ord-nance Survey

Openly Local

Open Library

OpenCyc

Open Corpo-rates

OpenCalais

OpenEI

Open Election

Data Project

OpenData

Thesau-rus

Ontos News Portal

OGOLOD

JanusAMP

Ocean Drilling Codices

New York

Times

NVD

ntnusc

NTU Resource

Lists

Norwe-gian

MeSH

NDL subjects

ndlna

myExperi-ment

Italian Museums

medu-cator

MARC Codes List

Man-chester Reading

Lists

Lotico

Weather Stations

London Gazette

LOIUS

Linked Open Colors

lobidResources

lobidOrgani-sations

LEM

LinkedMDB

LinkedLCCN

LinkedGeoData

LinkedCT

LinkedUser

FeedbackLOV

Linked Open

Numbers

LODE

Eurostat (OntologyCentral)

Linked EDGAR

(OntologyCentral)

Linked Crunch-

base

lingvoj

Lichfield Spen-ding

LIBRIS

Lexvo

LCSH

DBLP (L3S)

Linked Sensor Data (Kno.e.sis)

Klapp-stuhl-club

Good-win

Family

National Radio-activity

JP

Jamendo (DBtune)

Italian public

schools

ISTAT Immi-gration

iServe

IdRef Sudoc

NSZL Catalog

Hellenic PD

Hellenic FBD

PiedmontAccomo-dations

GovTrack

GovWILD

GoogleArt

wrapper

gnoss

GESIS

GeoWordNet

GeoSpecies

GeoNames

GeoLinkedData

GEMET

GTAA

STITCH

SIDER

Project Guten-berg

MediCare

Euro-stat

(FUB)

EURES

DrugBank

Disea-some

DBLP (FU

Berlin)

DailyMed

CORDIS(FUB)

Freebase

flickr wrappr

Fishes of Texas

Finnish Munici-palities

ChEMBL

FanHubz

EventMedia

EUTC Produc-

tions

Eurostat

Europeana

EUNIS

EU Insti-

tutions

ESD stan-dards

EARTh

Enipedia

Popula-tion (En-AKTing)

NHS(En-

AKTing) Mortality(En-

AKTing)

Energy (En-

AKTing)

Crime(En-

AKTing)

CO2 Emission

(En-AKTing)

EEA

SISVU

education.data.g

ov.uk

ECS South-ampton

ECCO-TCP

GND

Didactalia

DDC Deutsche Bio-

graphie

datadcs

MusicBrainz

(DBTune)

Magna-tune

John Peel

(DBTune)

Classical (DB

Tune)

AudioScrobbler (DBTune)

Last.FM artists

(DBTune)

DBTropes

Portu-guese

DBpedia

dbpedia lite

Greek DBpedia

DBpedia

data-open-ac-uk

SMCJournals

Pokedex

Airports

NASA (Data Incu-bator)

MusicBrainz(Data

Incubator)

Moseley Folk

Metoffice Weather Forecasts

Discogs (Data

Incubator)

Climbing

data.gov.uk intervals

Data Gov.ie

databnf.fr

Cornetto

reegle

Chronic-ling

America

Chem2Bio2RDF

Calames

businessdata.gov.

uk

Bricklink

Brazilian Poli-

ticians

BNB

UniSTS

UniPathway

UniParc

Taxonomy

UniProt(Bio2RDF)

SGD

Reactome

PubMedPub

Chem

PRO-SITE

ProDom

Pfam

PDB

OMIMMGI

KEGG Reaction

KEGG Pathway

KEGG Glycan

KEGG Enzyme

KEGG Drug

KEGG Com-pound

InterPro

HomoloGene

HGNC

Gene Ontology

GeneID

Affy-metrix

bible ontology

BibBase

FTS

BBC Wildlife Finder

BBC Program

mes BBC Music

Alpine Ski

Austria

LOCAH

Amster-dam

Museum

AGROVOC

AEMET

US Census (rdfabout)

Media

Geographic

Publications

Government

Cross-domain

Life sciences

User-generated content

http://lod-cloud.net

datacatalogs.org: 285 data cataloguesoriginal data (= ground truth) still often missing

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 10

Page 11: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Data Integration in Large Organisations

Enterprise information integration: a key field ofbusiness of the OK department at Fraunhofer IAIS

production-critical information often maintained indedicated IS already:ERP, CRM, SCM, . . .challenge: integration of these systems (with eachother, and with external sources)Daimler, e.g., runs 3,000 independent IT systems(after a decade of consolidation!)

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 11

Page 12: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

XML, Web Services, SOA: Pros and Cons

Previous approaches to enterprise IT:XML syntactic data representationWeb services data exchange protocolsSOA holistic approach for distributed system

architecture and communicationStill insufficient for data integration

SOA is good for transaction processing, . . .. . . Linked Data is more efficient for networkingand integrating data: access to LOD Cloud,lightweight, flexible

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 12

Page 13: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

From Documents to Data

Web 1.0 static documents“Web 1.5” content management and e-commerce

systems, exposing databases in a user- andcontext-specific way

Web 2.0 user-generated content; mashups aggregatingdata from different sources

Web of Data popular examples: schema.org, GoogleKnowledge Graph, Facebook Open Graph

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 13

Page 14: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)

Example (Movie description)AvatarDirector: James Cameron (born August 16, 1954)Science fictionTrailer

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14

Page 15: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)

Example (Movie description)<div class="movie"><h1>Avatar</h1><div class="director">Director: James Cameron(born August 16, 1954)

</div><span class="genre">Science fiction</span><a href="../movies/avatar-theatrical-trailer.html"Trailer</a>

</div>

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14

Page 16: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)

Example (Movie description)<div itemscope itemtype="http://schema.org/Movie"><h1 itemprop="name">Avatar</h1><div itemprop="director" itemscope itemtype="http://schema.org/Person">Director: <span itemprop="name">James Cameron</span>

(born <span itemprop="birthDate">August 16, 1954</span>)

</div><span itemprop="genre">Science fiction</span>

<a href="../movies/avatar-theatrical-trailer.html"

itemprop="trailer">Trailer</a></div>

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14

Page 17: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)

Example (Movie description)Movie Avatar Person

James Cameron

August 16, 1954Science fiction../movies/. . .

type name

directorgenre

trailer

type

namebirthDate

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14

Page 18: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Social Data with schema.org

review or rating of a creative work, organization orproduct (written by a person)social network of a person: “knows”, “works for”, “iscolleague of”, “has parent/sibling/spouse/child/relative”

Example (Reviews of a movie)

Movie type

Avatar

name

reviews

authorreviewRat

ing

reviewsauthor

reviewRating

6ratingValue

8.5ratingValue

Pünktchenname

Antonname

Persontypetype

knows

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 15

Page 19: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

schema.org in a Search Engine

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 16

Page 20: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

The Web and Intranets: Evolution

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 17

Page 21: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Web and Intranet: Common Properties

Especially large organisations share these properties ofthe WWW:

Decentral organisationSelf-dependent units, often free to choose theirinformation architectureHeterogeneous information:

domain-specific applications,knowledge bases,document templates,data formats

. . . vary across organisational units

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 18

Page 22: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Linked Organisational Data Principles

The 5-star Linked Data principles (above), plus:evolve existing thesauri, taxonomies, wikis andmaster data management systems into corporateknowledge bases and knowledge hubsestablish an organisation-wide URI schemeextend existing information system in the intranetby linked data interfacesestablish links between sources of relatedinformation

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 19

Page 23: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Organisational Knowledge Bases

Organisation and Domain-specific knowledge is in:glossaries, taxonomies, internal documents, dataschemas.Large organisations often standardise terminologyin multilingual thesaury

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 20

Page 24: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Mercedes-Benz Search Demo I

Search before

Page 25: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Mercedes-Benz Search Demo II

OntoWiki with car model data

Page 26: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Mercedes-Benz Search Demo III

OntoWikiwith carmodel data

Page 27: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Mercedes-Benz Search Demo IV

Management of Enterprise Taxonomies with OntoWikibased on the W3C SKOS standardCorporate Language Management at Daimler:500K concepts in 20 languages

Page 28: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Mercedes-Benz Search Demo V

Search afterShowing recommen-dations from theknowledge base in-tegrating car modeldata and enterprisetaxonomy

Page 29: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Mercedes-Benz Search Demo VI

You can search for“Kombi” (stationwagon) and find“T-Models” (Daim-ler term for thesame)

Page 30: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Identification by anOrganisation-wide URI Schema

Unique identifiers are a key prerequisite forinformation integration:

general: persons, places, organisationsspecific: terms, data sources, products, contracts

On the Web: URI for identification, URLs for makinginformation accessible.In Linked Data: use URLs as URIs

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 27

Page 31: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Properties of URIs

Decentral maintenance: different levels,combinations possible (next slide)Dereferenceability (i.e. URIs = URLs)Provenance (URI reveals organisational unit↝authenticity and correctness of information)

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 28

Page 32: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Identifier Management Strategies

Management strategy + –Issue uniform URIs cen-trally

easy overview of re-sourcesuniform identifier struc-ture

single point of failurelow flexibilityhard to ensure derefer-enceability

Issue decentrally, regis-ter centrally

easy overview of re-sourcesresilient against techni-cal failure and organisa-tional changes

requires synchronisa-tion

Manage completely de-centrally

highly flexiblehighly resilient againsttechnical failure and or-ganisational changes

lack of central overviewlack of structural unifor-mity

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 29

Page 33: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Linked Organisational Data LifecycleOf particular interest:

RDF datamanagement:including relational sources

Authoring

Linking: detect linksbetween datasets

Classification, Enrichment

Quality Assessment: datafrom the Web “fit for use”?

Evolution, Repair

Search, ExplorationLange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 30

Page 34: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Interdependence of Lifecycle Stages

Lifecycle stages depend on each other⇒ addressing one also affects the others, e.g.:

1 enrich knowledge base with links to a newknowledge hub

2 auto-linker will find additional matchesSchema and instance levels influence each other, e.g.:

rich schema prevents instance-level problemscan learn schema-level matches from instances

LOD2.eu project has developed tools for thewhole life cycle (available for Debian andothers) [Aue+12]

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 31

Page 35: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Linked Data Quality: MetricsQuality = “fitness for use”. Subjective? There are objec-tive, even application-independent metrics! [Zav+13]

Accessibility: actually linked data; machine-readable license;performance of access?

Intrinsic aspects: no logical inconsistencies; no malformedliteral values; no redundancies?

Trust: provenance metadata; digital signature?

Dynamicity: recent data?

Contextual aspects: broad use of schema’s features; goodcoverage of domain?

Representation: existing terms reused; human-readabledocumentation?

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 32

Page 36: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Analysing Linked Data QualityJava library for quality metrics in progressWe support big datasets (streaming triples)Output once more as linked data [DLA14] – why?

complexity: data cube with dimensions metric, dataset,time, intended application, . . . (e.g. “completeness ofDBpedia 3.9 for a Tallinn tourist guide”)can→ browse datasets by quality

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 33

Page 37: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Relational Data to RDFMost existing information systems use relationaldatabases – choice between

1 materialising relational database into linked data2 expose it as virtual RDF graph by on-demand query

translation

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 34

Page 38: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

R2RMLR2RML (relational database to RDF mapping language),W3C RecommendationExample (mapping a thesaurus)SUBJECT CONCEPTS

+===+=========+=========+=========++===============+ |ID | SUBJECT | TERM_EN | TERM_ET ||ID | SUBJECT | +===+=========+=========+=========++===+===========+ | 1 | 1 | hammer | Vasar || 1 | tools | | 2 | 1 | file | Viil || 2 | chemistry | | 3 | 2 | oil | Õli |+===+===========+ +===+=========+=========+=========+

:ConceptsTriplesMaprr:logicalTable [ rr:tableName "CONCEPTS" ] ;rr:subjectMap [

rr:template "http://example.com/term/concept/{ID}" ;rr:class skos:Concept ;

] ;rr:predicateObjectMap [

rr:predicate skos:prefLabel ;rr:objectMap [ rr:column "TERM_ET" ; rr:language "et" ] ;

] .

<http://example.com/term/concept/1> a skos:Concept .<http://example.com/term/concept/1> skos:prefLabel "Vasar"@et .

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 35

Page 39: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

R2RMLExample (mapping a thesaurus)SUBJECT CONCEPTS

+===+=========+=========+=========++===============+ |ID | SUBJECT | TERM_EN | TERM_ET ||ID | SUBJECT | +===+=========+=========+=========++===+===========+ | 1 | 1 | hammer | Vasar || 1 | tools | | 2 | 1 | file | Viil |

| 2 | chemistry | | 3 | 2 | oil | Õli |+===+===========+ +===+=========+=========+=========+

:ConceptsTriplesMaprr:logicalTable [ rr:tableName "CONCEPTS" ] ;rr:subjectMap [

rr:template "http://example.com/term/concept/{ID}" ;rr:class skos:Concept ;

] ;rr:predicateObjectMap [

rr:predicate skos:prefLabel ;rr:objectMap [ rr:column "TERM_ET" ; rr:language "et" ] ;

] .

<http://example.com/term/concept/1> a skos:Concept .<http://example.com/term/concept/1> skos:prefLabel "Vasar"@et .

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 35

Page 40: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

R2RMLExample (mapping a thesaurus)SUBJECT CONCEPTS

+===+=========+=========+=========++===============+ |ID | SUBJECT | TERM_EN | TERM_ET ||ID | SUBJECT | +===+=========+=========+=========++===+===========+ | 1 | 1 | hammer | Vasar || 1 | tools | | 2 | 1 | file | Viil |

| 2 | chemistry | | 3 | 2 | oil | Õli |+===+===========+ +===+=========+=========+=========+

:ConceptsTriplesMaprr:logicalTable [ rr:tableName "CONCEPTS" ] ;rr:subjectMap [

rr:template "http://example.com/term/concept/{ID}" ;rr:class skos:Concept ;

] ;rr:predicateObjectMap [

rr:predicate skos:prefLabel ;rr:objectMap [ rr:column "TERM_ET" ; rr:language "et" ] ;

] .

<http://example.com/term/concept/1> a skos:Concept .<http://example.com/term/concept/1> skos:prefLabel "Vasar"@et .

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 35

Page 41: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Data PortalsHow to discover suitable open datasets?⇒ look into data catalogues, e.g. http://datahub.io

Quality-based filtering and ranking

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 36

Page 42: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Link Discovery Tools

1 Found a dataset that’s “fit for use”?2 Link them to existing organisational datasets!

LOD2 tools Silk and LIMES help with thisRule example: similar name, and ⋃︀price − price′ < 0.1⋃︀⇒create owl:sameAs link

3 <foo> owl:sameAs <bar>means: all propertiesof foo also hold for bar, and vice versa.

Linking particularly pays off on the terminology level;DBpedia serves as a common referencing target foralmost anything of interest.

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 37

Page 43: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Enterprise Knowledge Hub [Fri+13]

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 38

Page 44: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion

Take Home MessagesLinked Data: promising technology for closing thegap between SOA and unstructured informationmanagementwealth of LOD can be leveraged as backgroundknowledge for Enterprise applicationsapplication of Linked Data in large organisations (inenterprises, research and society) is still largelyunexplored (⇒ opportunity!)Linked Data will make Organisational InformationIntegration more

flexibleiterativecost effective

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 39

Page 45: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

References

References I

5 star Open Data. Apr. 3, 2012. url:http://5stardata.info/ (visited on 2013-09-18).

S. Auer, L. Bühmann, C. Dirschl, O. Erling,M. Hausenblas, R. Isele, J. Lehmann, M. Martin,P. N. Mendes, B. van Nuffelen, C. Stadler, S. Tramp,and H. Williams. “Managing the life-cycle of LinkedData with the LOD2 Stack”. In: Proceedings ofInternational Semantic Web Conference (ISWC 2012).22% acceptance rate. 2012. url:http://iswc2012.semanticweb.org/sites/default/files/76500001.pdf.

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 40

Page 46: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

References

References II

J. Debattista, C. Lange, and S. Auer. “daQ, an Ontologyfor Dataset Quality Information”. In: Linked Data ontheWeb (LDOW). (Seoul, Apr. 8, 2014). Ed. by C. Bizer,T. Heath, S. Auer, and T. Berners-Lee. 2014. url:http://events.linkeddata.org/ldow2014/.

P. Frischmuth, S. Auer, S. Tramp, J. Unbehauen,K. Holzweißig, and C.-M. Marquardt. “Towards LinkedData based Enterprise Information Integration”. In:Proceedings of theWorkshop on Semantic WebEnterprise Adoption and Best Practice (WASABI) 2013.2013. url: http://www.wasabi-ws.org/papers/wasabi03/paper.pdf.

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 41

Page 47: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

References

References III

E. Hyvönen, J. Tuominen, M. Alonen, and E. Mäkelä.“Linked Data Finland: A 7-star Model and Platform forPublishing and Re-using Linked Datasets”. In:

M. Kerber, C. Lange, and C. Rowat. ForMaRE. FormalMathematical Reasoning in Economics. url: http://cs.bham.ac.uk/research/projects/formare/(visited on 2013-02-10).

C. Lange. “Enabling Collaboration on SemiformalMathematical Knowledge by Semantic WebIntegration”. PhD thesis. Jacobs University Bremen,2011.

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 42

Page 48: Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data

References

References IV

A. Zaveri, A. Rula, A. Maurino, R. Pietrobon,J. Lehmann, and S. Auer. “Quality AssessmentMethodologies for Linked Open Data (UnderReview)”. In: Semantic Web Journal (2013). This articleis still under review. url: http://www.semantic-web-journal.net/content/quality-assessment-linked-open-data-survey.

Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 43