Upload
christoph-lange
View
404
Download
1
Embed Size (px)
DESCRIPTION
The Linked Data paradigm has emerged as a powerful enabler for data and knowledge interlinking and exchange using standardised Web technologies. In this article, we discuss our vision how the Linked Data paradigm can be employed to evolve the intranets of large organisations -- be it enterprises, research organisations or governmental and public administrations -- into networks of internal data and knowledge. In particular for large enterprises data integration is still a key challenge. The Linked Data paradigm seems a promising approach for integrating enterprise data. Like the Web of Data, which now complements the original document-centred Web, data intranets may help to enhance and flexibilise the intranets and service-oriented architectures that exist in large organisations. Furthermore, using Linked Data gives enterprises access to 50+ billion facts from the growing Linked Open Data (LOD) cloud. As a result, a data intranet can help to bridge the gap between structured data management (in ERP, CRM or SCM systems) and semi-structured or unstructured information in documents, wikis or web portals, and make all of these sources searchable in a coherent way. Keynote at Baltic DB&IS 2014, 9 June 2014, Tallinn, Estonia
Citation preview
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Interlinking Data and Knowledgein Enterprises, Research and Society
with Linked DataBaltic DB & IS 2014
http://eis.iai.uni-bonn.de
Christoph Lange1,2 and Sören Auer1,2
1Enterprise Information Systems, University of Bonn, Germany2Organized Knowledge, Fraunhofer IAIS, Sankt Augustin, Germany
2014-06-09Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 1
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
„Tere, maailm!”
2011 PhD at Jacobs Univ. Bremen, Germany: EnablingCollaboration on Semiformal MathematicalKnowledge by Semantic Web Integration [Lan11]
2011/12 Univ. Bremen, Germany: Ontology Integrationand Interoperability (OntoIOp)↝ DistributedOntology Language (DOL)
2012/13 Univ. Birmingham, UK: Formal MathematicalReasoning in Economics (ForMaRE) [KLR]
2013– Univ. Bonn, Germany: Enterprise InformationSystems; Fraunhofer IAIS: Organized Knowledge
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 2
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
EIS/OK Group in Bonn
Prof. Sören Auerpreviously at the University of Leipzig, AKSW group(DBpedia etc.)Christoph Lange: 1 of 3 postdocs∼ 15 members of scientific staff6 PhD students
Business areas:Enterprise Information IntegrationDigital Libraries (cultural heritage and otherapplications)Personalised medicine
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 3
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Data in Today’s SocietyExample (Demographics in Bonn)
Statistics, e.g. frommunicipal office for integrationHousing (accessibility, availability, . . . ):municipal, commercial, self-organisedTransport (e.g. accessibility): bus/tramInfrastructure: e.g. accessible public toilets
Complex questions:Apartments that meet my requirements w.r.t.public transport, accessibility, care, co-residentsWhat bus takes me from A to B, with sufficientchanging time near an accessible toilet?
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 4
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Example: Accessible Facilities
Collected by the Bonn Disableds’ Union. Now combinewith public transport, housing offers, . . . !
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 5
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Data in Science
Example (Quality of Scientific Workshops)Indicators for the quality of a workshop:
part of a high-profile conferencelong history
many editionscontinuity in chairsnumber of papers not shrinking
contributions frommany institutions and countries
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 6
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Data in Science: Datasets
Sources of complementary information:DBLP computer science publications (basics), author
name disambiguationCEUR-WS.org computer science workshops
→ ESWC 2014 Semantic Publishing ChallengeSpringer computer science conferencesWikiCFP calls for papers
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 7
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Data Sources of Interest
(Open) Government Datageneral Open Data: Wikipedia, OpenStreetMap, . . .private datapersonal data
How can they be . . .1 published (licenses, privacy),2 described (for machines to understand),3 discovered,4 integrated,5 analysed?
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 8
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Linked (Open) Data: Principles
http://5stardata.info
☀ make your stuff available on the Web(whatever format) under an open li-cense
☀☀ make it available as structured data(e.g., Excel instead of image scan of atable)
☀☀☀ use non-proprietary formats (e.g., CSVinstead of Excel)
☀☀☀☀ use URIs to denote things, so thatpeople can point at your stuff
☀☀☀☀☀ linkyourdata tootherdata toprovidecontext [12]
☀☀☀☀☀☀ further stars proposed for: quality[DLA14], explicit schema [Hyv+]
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 9
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Linked (Open) Data: Datasets
As of September 2011
MusicBrainz
(zitgist)
P20
Turismo de
Zaragoza
yovisto
Yahoo! Geo
Planet
YAGO
World Fact-book
El ViajeroTourism
WordNet (W3C)
WordNet (VUA)
VIVO UF
VIVO Indiana
VIVO Cornell
VIAF
URIBurner
Sussex Reading
Lists
Plymouth Reading
Lists
UniRef
UniProt
UMBEL
UK Post-codes
legislationdata.gov.uk
Uberblic
UB Mann-heim
TWC LOGD
Twarql
transportdata.gov.
uk
Traffic Scotland
theses.fr
Thesau-rus W
totl.net
Tele-graphis
TCMGeneDIT
TaxonConcept
Open Library (Talis)
tags2con delicious
t4gminfo
Swedish Open
Cultural Heritage
Surge Radio
Sudoc
STW
RAMEAU SH
statisticsdata.gov.
uk
St. Andrews Resource
Lists
ECS South-ampton EPrints
SSW Thesaur
us
SmartLink
Slideshare2RDF
semanticweb.org
SemanticTweet
Semantic XBRL
SWDog Food
Source Code Ecosystem Linked Data
US SEC (rdfabout)
Sears
Scotland Geo-
graphy
ScotlandPupils &Exams
Scholaro-meter
WordNet (RKB
Explorer)
Wiki
UN/LOCODE
Ulm
ECS (RKB
Explorer)
Roma
RISKS
RESEX
RAE2001
Pisa
OS
OAI
NSF
New-castle
LAASKISTI
JISC
IRIT
IEEE
IBM
Eurécom
ERA
ePrints dotAC
DEPLOY
DBLP (RKB
Explorer)
Crime Reports
UK
Course-ware
CORDIS (RKB
Explorer)CiteSeer
Budapest
ACM
riese
Revyu
researchdata.gov.
ukRen. Energy Genera-
tors
referencedata.gov.
uk
Recht-spraak.
nl
RDFohloh
Last.FM (rdfize)
RDF Book
Mashup
Rådata nå!
PSH
Product Types
Ontology
ProductDB
PBAC
Poké-pédia
patentsdata.go
v.uk
OxPoints
Ord-nance Survey
Openly Local
Open Library
OpenCyc
Open Corpo-rates
OpenCalais
OpenEI
Open Election
Data Project
OpenData
Thesau-rus
Ontos News Portal
OGOLOD
JanusAMP
Ocean Drilling Codices
New York
Times
NVD
ntnusc
NTU Resource
Lists
Norwe-gian
MeSH
NDL subjects
ndlna
myExperi-ment
Italian Museums
medu-cator
MARC Codes List
Man-chester Reading
Lists
Lotico
Weather Stations
London Gazette
LOIUS
Linked Open Colors
lobidResources
lobidOrgani-sations
LEM
LinkedMDB
LinkedLCCN
LinkedGeoData
LinkedCT
LinkedUser
FeedbackLOV
Linked Open
Numbers
LODE
Eurostat (OntologyCentral)
Linked EDGAR
(OntologyCentral)
Linked Crunch-
base
lingvoj
Lichfield Spen-ding
LIBRIS
Lexvo
LCSH
DBLP (L3S)
Linked Sensor Data (Kno.e.sis)
Klapp-stuhl-club
Good-win
Family
National Radio-activity
JP
Jamendo (DBtune)
Italian public
schools
ISTAT Immi-gration
iServe
IdRef Sudoc
NSZL Catalog
Hellenic PD
Hellenic FBD
PiedmontAccomo-dations
GovTrack
GovWILD
GoogleArt
wrapper
gnoss
GESIS
GeoWordNet
GeoSpecies
GeoNames
GeoLinkedData
GEMET
GTAA
STITCH
SIDER
Project Guten-berg
MediCare
Euro-stat
(FUB)
EURES
DrugBank
Disea-some
DBLP (FU
Berlin)
DailyMed
CORDIS(FUB)
Freebase
flickr wrappr
Fishes of Texas
Finnish Munici-palities
ChEMBL
FanHubz
EventMedia
EUTC Produc-
tions
Eurostat
Europeana
EUNIS
EU Insti-
tutions
ESD stan-dards
EARTh
Enipedia
Popula-tion (En-AKTing)
NHS(En-
AKTing) Mortality(En-
AKTing)
Energy (En-
AKTing)
Crime(En-
AKTing)
CO2 Emission
(En-AKTing)
EEA
SISVU
education.data.g
ov.uk
ECS South-ampton
ECCO-TCP
GND
Didactalia
DDC Deutsche Bio-
graphie
datadcs
MusicBrainz
(DBTune)
Magna-tune
John Peel
(DBTune)
Classical (DB
Tune)
AudioScrobbler (DBTune)
Last.FM artists
(DBTune)
DBTropes
Portu-guese
DBpedia
dbpedia lite
Greek DBpedia
DBpedia
data-open-ac-uk
SMCJournals
Pokedex
Airports
NASA (Data Incu-bator)
MusicBrainz(Data
Incubator)
Moseley Folk
Metoffice Weather Forecasts
Discogs (Data
Incubator)
Climbing
data.gov.uk intervals
Data Gov.ie
databnf.fr
Cornetto
reegle
Chronic-ling
America
Chem2Bio2RDF
Calames
businessdata.gov.
uk
Bricklink
Brazilian Poli-
ticians
BNB
UniSTS
UniPathway
UniParc
Taxonomy
UniProt(Bio2RDF)
SGD
Reactome
PubMedPub
Chem
PRO-SITE
ProDom
Pfam
PDB
OMIMMGI
KEGG Reaction
KEGG Pathway
KEGG Glycan
KEGG Enzyme
KEGG Drug
KEGG Com-pound
InterPro
HomoloGene
HGNC
Gene Ontology
GeneID
Affy-metrix
bible ontology
BibBase
FTS
BBC Wildlife Finder
BBC Program
mes BBC Music
Alpine Ski
Austria
LOCAH
Amster-dam
Museum
AGROVOC
AEMET
US Census (rdfabout)
Media
Geographic
Publications
Government
Cross-domain
Life sciences
User-generated content
http://lod-cloud.net
datacatalogs.org: 285 data cataloguesoriginal data (= ground truth) still often missing
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 10
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Data Integration in Large Organisations
Enterprise information integration: a key field ofbusiness of the OK department at Fraunhofer IAIS
production-critical information often maintained indedicated IS already:ERP, CRM, SCM, . . .challenge: integration of these systems (with eachother, and with external sources)Daimler, e.g., runs 3,000 independent IT systems(after a decade of consolidation!)
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 11
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
XML, Web Services, SOA: Pros and Cons
Previous approaches to enterprise IT:XML syntactic data representationWeb services data exchange protocolsSOA holistic approach for distributed system
architecture and communicationStill insufficient for data integration
SOA is good for transaction processing, . . .. . . Linked Data is more efficient for networkingand integrating data: access to LOD Cloud,lightweight, flexible
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 12
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
From Documents to Data
Web 1.0 static documents“Web 1.5” content management and e-commerce
systems, exposing databases in a user- andcontext-specific way
Web 2.0 user-generated content; mashups aggregatingdata from different sources
Web of Data popular examples: schema.org, GoogleKnowledge Graph, Facebook Open Graph
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 13
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)
Example (Movie description)AvatarDirector: James Cameron (born August 16, 1954)Science fictionTrailer
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)
Example (Movie description)<div class="movie"><h1>Avatar</h1><div class="director">Director: James Cameron(born August 16, 1954)
</div><span class="genre">Science fiction</span><a href="../movies/avatar-theatrical-trailer.html"Trailer</a>
</div>
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)
Example (Movie description)<div itemscope itemtype="http://schema.org/Movie"><h1 itemprop="name">Avatar</h1><div itemprop="director" itemscope itemtype="http://schema.org/Person">Director: <span itemprop="name">James Cameron</span>
(born <span itemprop="birthDate">August 16, 1954</span>)
</div><span itemprop="genre">Science fiction</span>
<a href="../movies/avatar-theatrical-trailer.html"
itemprop="trailer">Trailer</a></div>
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Web of Data: schema.orginitiative of major search engine operatorsannotation vocabulary for structuring web pages(creative works, events, organisations, persons,places, products)
Example (Movie description)Movie Avatar Person
James Cameron
August 16, 1954Science fiction../movies/. . .
type name
directorgenre
trailer
type
namebirthDate
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 14
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Social Data with schema.org
review or rating of a creative work, organization orproduct (written by a person)social network of a person: “knows”, “works for”, “iscolleague of”, “has parent/sibling/spouse/child/relative”
Example (Reviews of a movie)
Movie type
Avatar
name
reviews
authorreviewRat
ing
reviewsauthor
reviewRating
6ratingValue
8.5ratingValue
Pünktchenname
Antonname
Persontypetype
knows
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 15
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
schema.org in a Search Engine
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 16
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
The Web and Intranets: Evolution
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 17
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Web and Intranet: Common Properties
Especially large organisations share these properties ofthe WWW:
Decentral organisationSelf-dependent units, often free to choose theirinformation architectureHeterogeneous information:
domain-specific applications,knowledge bases,document templates,data formats
. . . vary across organisational units
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 18
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Linked Organisational Data Principles
The 5-star Linked Data principles (above), plus:evolve existing thesauri, taxonomies, wikis andmaster data management systems into corporateknowledge bases and knowledge hubsestablish an organisation-wide URI schemeextend existing information system in the intranetby linked data interfacesestablish links between sources of relatedinformation
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 19
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Organisational Knowledge Bases
Organisation and Domain-specific knowledge is in:glossaries, taxonomies, internal documents, dataschemas.Large organisations often standardise terminologyin multilingual thesaury
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 20
Mercedes-Benz Search Demo I
Search before
Mercedes-Benz Search Demo II
OntoWiki with car model data
Mercedes-Benz Search Demo III
OntoWikiwith carmodel data
Mercedes-Benz Search Demo IV
Management of Enterprise Taxonomies with OntoWikibased on the W3C SKOS standardCorporate Language Management at Daimler:500K concepts in 20 languages
Mercedes-Benz Search Demo V
Search afterShowing recommen-dations from theknowledge base in-tegrating car modeldata and enterprisetaxonomy
Mercedes-Benz Search Demo VI
You can search for“Kombi” (stationwagon) and find“T-Models” (Daim-ler term for thesame)
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Identification by anOrganisation-wide URI Schema
Unique identifiers are a key prerequisite forinformation integration:
general: persons, places, organisationsspecific: terms, data sources, products, contracts
On the Web: URI for identification, URLs for makinginformation accessible.In Linked Data: use URLs as URIs
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 27
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Properties of URIs
Decentral maintenance: different levels,combinations possible (next slide)Dereferenceability (i.e. URIs = URLs)Provenance (URI reveals organisational unit↝authenticity and correctness of information)
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 28
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Identifier Management Strategies
Management strategy + –Issue uniform URIs cen-trally
easy overview of re-sourcesuniform identifier struc-ture
single point of failurelow flexibilityhard to ensure derefer-enceability
Issue decentrally, regis-ter centrally
easy overview of re-sourcesresilient against techni-cal failure and organisa-tional changes
requires synchronisa-tion
Manage completely de-centrally
highly flexiblehighly resilient againsttechnical failure and or-ganisational changes
lack of central overviewlack of structural unifor-mity
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 29
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Linked Organisational Data LifecycleOf particular interest:
RDF datamanagement:including relational sources
Authoring
Linking: detect linksbetween datasets
Classification, Enrichment
Quality Assessment: datafrom the Web “fit for use”?
Evolution, Repair
Search, ExplorationLange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 30
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Interdependence of Lifecycle Stages
Lifecycle stages depend on each other⇒ addressing one also affects the others, e.g.:
1 enrich knowledge base with links to a newknowledge hub
2 auto-linker will find additional matchesSchema and instance levels influence each other, e.g.:
rich schema prevents instance-level problemscan learn schema-level matches from instances
LOD2.eu project has developed tools for thewhole life cycle (available for Debian andothers) [Aue+12]
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 31
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Linked Data Quality: MetricsQuality = “fitness for use”. Subjective? There are objec-tive, even application-independent metrics! [Zav+13]
Accessibility: actually linked data; machine-readable license;performance of access?
Intrinsic aspects: no logical inconsistencies; no malformedliteral values; no redundancies?
Trust: provenance metadata; digital signature?
Dynamicity: recent data?
Contextual aspects: broad use of schema’s features; goodcoverage of domain?
Representation: existing terms reused; human-readabledocumentation?
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 32
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Analysing Linked Data QualityJava library for quality metrics in progressWe support big datasets (streaming triples)Output once more as linked data [DLA14] – why?
complexity: data cube with dimensions metric, dataset,time, intended application, . . . (e.g. “completeness ofDBpedia 3.9 for a Tallinn tourist guide”)can→ browse datasets by quality
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 33
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Relational Data to RDFMost existing information systems use relationaldatabases – choice between
1 materialising relational database into linked data2 expose it as virtual RDF graph by on-demand query
translation
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 34
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
R2RMLR2RML (relational database to RDF mapping language),W3C RecommendationExample (mapping a thesaurus)SUBJECT CONCEPTS
+===+=========+=========+=========++===============+ |ID | SUBJECT | TERM_EN | TERM_ET ||ID | SUBJECT | +===+=========+=========+=========++===+===========+ | 1 | 1 | hammer | Vasar || 1 | tools | | 2 | 1 | file | Viil || 2 | chemistry | | 3 | 2 | oil | Õli |+===+===========+ +===+=========+=========+=========+
:ConceptsTriplesMaprr:logicalTable [ rr:tableName "CONCEPTS" ] ;rr:subjectMap [
rr:template "http://example.com/term/concept/{ID}" ;rr:class skos:Concept ;
] ;rr:predicateObjectMap [
rr:predicate skos:prefLabel ;rr:objectMap [ rr:column "TERM_ET" ; rr:language "et" ] ;
] .
<http://example.com/term/concept/1> a skos:Concept .<http://example.com/term/concept/1> skos:prefLabel "Vasar"@et .
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 35
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
R2RMLExample (mapping a thesaurus)SUBJECT CONCEPTS
+===+=========+=========+=========++===============+ |ID | SUBJECT | TERM_EN | TERM_ET ||ID | SUBJECT | +===+=========+=========+=========++===+===========+ | 1 | 1 | hammer | Vasar || 1 | tools | | 2 | 1 | file | Viil |
| 2 | chemistry | | 3 | 2 | oil | Õli |+===+===========+ +===+=========+=========+=========+
:ConceptsTriplesMaprr:logicalTable [ rr:tableName "CONCEPTS" ] ;rr:subjectMap [
rr:template "http://example.com/term/concept/{ID}" ;rr:class skos:Concept ;
] ;rr:predicateObjectMap [
rr:predicate skos:prefLabel ;rr:objectMap [ rr:column "TERM_ET" ; rr:language "et" ] ;
] .
<http://example.com/term/concept/1> a skos:Concept .<http://example.com/term/concept/1> skos:prefLabel "Vasar"@et .
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 35
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
R2RMLExample (mapping a thesaurus)SUBJECT CONCEPTS
+===+=========+=========+=========++===============+ |ID | SUBJECT | TERM_EN | TERM_ET ||ID | SUBJECT | +===+=========+=========+=========++===+===========+ | 1 | 1 | hammer | Vasar || 1 | tools | | 2 | 1 | file | Viil |
| 2 | chemistry | | 3 | 2 | oil | Õli |+===+===========+ +===+=========+=========+=========+
:ConceptsTriplesMaprr:logicalTable [ rr:tableName "CONCEPTS" ] ;rr:subjectMap [
rr:template "http://example.com/term/concept/{ID}" ;rr:class skos:Concept ;
] ;rr:predicateObjectMap [
rr:predicate skos:prefLabel ;rr:objectMap [ rr:column "TERM_ET" ; rr:language "et" ] ;
] .
<http://example.com/term/concept/1> a skos:Concept .<http://example.com/term/concept/1> skos:prefLabel "Vasar"@et .
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 35
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Data PortalsHow to discover suitable open datasets?⇒ look into data catalogues, e.g. http://datahub.io
Quality-based filtering and ranking
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 36
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Link Discovery Tools
1 Found a dataset that’s “fit for use”?2 Link them to existing organisational datasets!
LOD2 tools Silk and LIMES help with thisRule example: similar name, and ⋃︀price − price′ < 0.1⋃︀⇒create owl:sameAs link
3 <foo> owl:sameAs <bar>means: all propertiesof foo also hold for bar, and vice versa.
Linking particularly pays off on the terminology level;DBpedia serves as a common referencing target foralmost anything of interest.
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 37
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Enterprise Knowledge Hub [Fri+13]
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 38
Introduction Motivation Evolution Knowledge URIs Lifecycle Interfaces Discovery Conclusion
Take Home MessagesLinked Data: promising technology for closing thegap between SOA and unstructured informationmanagementwealth of LOD can be leveraged as backgroundknowledge for Enterprise applicationsapplication of Linked Data in large organisations (inenterprises, research and society) is still largelyunexplored (⇒ opportunity!)Linked Data will make Organisational InformationIntegration more
flexibleiterativecost effective
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 39
References
References I
5 star Open Data. Apr. 3, 2012. url:http://5stardata.info/ (visited on 2013-09-18).
S. Auer, L. Bühmann, C. Dirschl, O. Erling,M. Hausenblas, R. Isele, J. Lehmann, M. Martin,P. N. Mendes, B. van Nuffelen, C. Stadler, S. Tramp,and H. Williams. “Managing the life-cycle of LinkedData with the LOD2 Stack”. In: Proceedings ofInternational Semantic Web Conference (ISWC 2012).22% acceptance rate. 2012. url:http://iswc2012.semanticweb.org/sites/default/files/76500001.pdf.
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 40
References
References II
J. Debattista, C. Lange, and S. Auer. “daQ, an Ontologyfor Dataset Quality Information”. In: Linked Data ontheWeb (LDOW). (Seoul, Apr. 8, 2014). Ed. by C. Bizer,T. Heath, S. Auer, and T. Berners-Lee. 2014. url:http://events.linkeddata.org/ldow2014/.
P. Frischmuth, S. Auer, S. Tramp, J. Unbehauen,K. Holzweißig, and C.-M. Marquardt. “Towards LinkedData based Enterprise Information Integration”. In:Proceedings of theWorkshop on Semantic WebEnterprise Adoption and Best Practice (WASABI) 2013.2013. url: http://www.wasabi-ws.org/papers/wasabi03/paper.pdf.
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 41
References
References III
E. Hyvönen, J. Tuominen, M. Alonen, and E. Mäkelä.“Linked Data Finland: A 7-star Model and Platform forPublishing and Re-using Linked Datasets”. In:
M. Kerber, C. Lange, and C. Rowat. ForMaRE. FormalMathematical Reasoning in Economics. url: http://cs.bham.ac.uk/research/projects/formare/(visited on 2013-02-10).
C. Lange. “Enabling Collaboration on SemiformalMathematical Knowledge by Semantic WebIntegration”. PhD thesis. Jacobs University Bremen,2011.
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 42
References
References IV
A. Zaveri, A. Rula, A. Maurino, R. Pietrobon,J. Lehmann, and S. Auer. “Quality AssessmentMethodologies for Linked Open Data (UnderReview)”. In: Semantic Web Journal (2013). This articleis still under review. url: http://www.semantic-web-journal.net/content/quality-assessment-linked-open-data-survey.
Lange/Auer (Bonn) Interlinking Data and Knowledge in Enterprises, Research and Society with Linked Data 2014-06-09 43