View
217
Download
1
Tags:
Embed Size (px)
Citation preview
Exploiting the Semantic Web:Next Generation Semantic Web Applications in KMi
Watson, PowerMagpie, PowerAqua, …
Mathieu d’Aquin Laurian Gridinoc Vanessa Lopez
The Knowledge Media Institute, The Open [email protected]
The Semantic Web is growing…
0
5
10
15
20
25
30
35
40
45
2003 2004
#SW Pages
Lee, J., Goodwin, R. (2004) The Semantic Webscape: a View of the Semantic Web. IBM Research Report.
… really growinghttp://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
Ontologies
Metadata
<rdf:RDF><channel rdf:about=“http://watson.kmi.open.ac.uk/blog”><title>Elementaries - The Watson Blog</title><link>http://watson.kmi.open.ac.uk:8080/blog/</link><description>"Oh dear! Where the Semantic Web is going to go now?" -- imaginary user 23</description><language>en</language><copyright>Watson team</copyright><lastBuildDate>Thu, 01 Mar 2007 13:49:52 GMT</lastBuildDate><generator>Pebble (http://pebble.sourceforge.net)</generator><docs>http://backend.userland.com/rss</docs>…
<rdf:RDF><channel rdf:about=“http://watson.kmi.open.ac.uk/blog”><title>Elementaries - The Watson Blog</title><link>http://watson.kmi.open.ac.uk:8080/blog/</link><description>"Oh dear! Where the Semantic Web is going to go now?" -- imaginary user 23</description><language>en</language><copyright>Watson team</copyright><lastBuildDate>Thu, 01 Mar 2007 13:49:52 GMT</lastBuildDate><generator>Pebble (http://pebble.sourceforge.net)</generator><docs>http://backend.userland.com/rss</docs>…
<rdf:RDF> <foaf:Image rdf:about='http://static.flickr.com/132/400582453_e1e1f8602c.jpg'> <dc:title>Zen wisteria</dc:title> <dc:description></dc:description> <foaf:page rdf:resource='http://www.flickr.com/photos/xcv/400582453/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/vittelgarden/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/wisteria/'/> <dc:creator> <foaf:Person><foaf:name>Mathieu d'Aquin</foaf:name> …
<rdf:RDF> <foaf:Image rdf:about='http://static.flickr.com/132/400582453_e1e1f8602c.jpg'> <dc:title>Zen wisteria</dc:title> <dc:description></dc:description> <foaf:page rdf:resource='http://www.flickr.com/photos/xcv/400582453/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/vittelgarden/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/wisteria/'/> <dc:creator> <foaf:Person><foaf:name>Mathieu d'Aquin</foaf:name> …
<rdf:RDF> <owl:Ontology rdf:about=""> <owl:imports rdf:resource="http://usefulinc.com/ns/doap#"/> </owl:Ontology> <j.1:Organization rdf:ID="KMi"> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >The Knoledge Media Institute of the Open University, Milton Keynes UK</rdfs:comment> </j.1:Organization> <j.1:Document rdf:ID="KMiWebSite"> …
<rdf:RDF> <owl:Ontology rdf:about=""> <owl:imports rdf:resource="http://usefulinc.com/ns/doap#"/> </owl:Ontology> <j.1:Organization rdf:ID="KMi"> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >The Knoledge Media Institute of the Open University, Milton Keynes UK</rdfs:comment> </j.1:Organization> <j.1:Document rdf:ID="KMiWebSite"> …
DOAP
FOAFDC
RSS TAPWORDNET
NCI GalenMusic
…
…… …
…
…
UoD
Next Generation Semantic Web Applications
Next Generation Semantic Web Applications
NG SW Application Semantic WebSmart Features
• Able to exploit the SW at large – Dynamically retrieving the relevant semantic
resources – Combining several, heterogeneous Ontologies
Next Generation Semantic Web Applications
Need for a Gateway to the Semantic Web
Next Generation Semantic Web Applications
Architecture
Keyword Search
SPARQLQuery
Crawling Parsing(Jena)
Validation/Analysis
Indexing
RepositoryURLs Metadata Indexes
populates populates populates populatesusedextractedretrieved
OntologyExploration
queries queriesqueriesqueries queries
request
WWW
discovered
Collecting Analyzing
Querying
Design Principles Focused quality
– Provides quality information about the collected ontologies and semantic data– Provides valuable services for semantic applications, to discover, select, exploit and combine semantic resources
Provides a variety of query and access mechanisms– For both humans (web interface) and machines (web serv., API)
– To fit applications having different purposes and requirements– Ranging from Keyword search to ontology exploration and formal queries (SPARQL)
Support for relations between ontologies– Detecting redundancy, duplication, incompatibility (contradiction), modularization, versioning, etc.
Interfaces: WUI
Web User Interface: http://watson.kmi.open.ac.uk/WatsonWUI
Collection
Applications
• Existing next generation semantic web applications based on Watson:– PowerAqua, question answering– PowerMagpie, semantic browsing– Folksonomie enrichment– Scarlet, relation discovery, ontology
matching– Ontology building– …
PowerAqua• Bridge the gap between the user and the Semantic Web:
- Provide the user the capability to query the SW using Natural Language.
• Dynamically select and combine info drawn from the vast amount of heterogeneous semantic data to answer a user’s query.
PowerAqua
1. NL Question1. NL Question
2. Linguistic interpretation2. Linguistic interpretation
3. Ontology based interpretation3. Ontology based interpretation
4. Answer4. Answer
Magpie
Ontology concepts
Instances highlighted according to their type
Contextual access to web services
PowerMagpie
Key terms
Ontologies and semantic entities retrieved from Watson
Metadata about the ontologies
PowerMagpie
Semantic entities in relation with the text from Watson
Semantic definition of the entities in the ontologies
PowerMagpie
Watson(ontology selection)
Watson(ontology selection)
Google(term frequency)
Google(term frequency)
PowerMagpie Server(term ranking, term to ontological entity)
PowerMagpie Server(term ranking, term to ontological entity)
Google API Watson API
Javascript enabled Web Browser
Javascript enabled Web Browser
PowerMagpie Interface(visualization, interaction, navigation)
client
server
external
services
AJAX
SOAP SOAP
Folksonomy enrichment
Tags
{camera, digital slr, photograph} {damage, flooding, hurricane, katrina, Louisiana}
Clusters
Digital SLR
cameraphotographtakenWith
Ontologies
NLP/ClusteringNLP/Clustering
Find and combine Online ontologies +modularizaton
+matching+modularizaton
+matching
Discovering relations between tags
Folksonomy enrichment
Discovering relations between tags
Example of result.
Ontology Matching
ka2.rdf
Researcher AcademicStaff
Sem
anti
c W
eb
Researcher
AcademicStaff
⊆
⊆
ISWC SWRCHam SeaFood
Sem
anti
c W
eb
HamSeaFood
Meat
Meat
SeaFood
Agrovoc NALT
⊆
€
⊥
€
⊥
€
⊥
pizza-to-go
wine.owl
NALT
Ontology Building
Next Generation Semantic Web Applications
Take Home Message
• Next Generation Semantic Web applications dynamically exploit the semantic information available on the Web
• Watson is an infrastructure that supports the development of such applications
• We are already building this next generation of semantic Web applications, allowing – question answering (PowerAqua), – semantic browsing (PowerMagpie), – semantic tagging, etc.
• using the semantic web at large.
Thank you!Watson: http://watson.kmi.open.ac.uk,
[email protected]: http://powermagpie.open.ac.uk/,
http://kmi.open.ac.uk/technologies/poweraqua/, [email protected]
•Step 1: Linguistic Analysis-“Show me all cities of Spain” -> <what-is, cities, Spain>
•Step 2: Identify the set of relevant ontologies - Based on PowerMap: A run time knowledge matcher to produce semantically sound mappings across ontologies and domains. I.e. through Watson it searches for approximate mappings by using lexically related words obtained from WordNet and background ontologies. Then, the candidate mappings are semantically enriched by using a similarity measure based in WordNet to obtain its sense.
•Step 3: The triple similarity services analyzes the linguistic information and the ontology semantics (relationship and taxonomy) to return a small set of ontologies (represented as ontological triples) that jointly covers the user query.