Exploiting the Semantic Web: Next Generation Semantic Web Applications in KMi Watson, PowerMagpie,...

Preview:

Citation preview

Exploiting the Semantic Web:Next Generation Semantic Web Applications in KMi

Watson, PowerMagpie, PowerAqua, …

Mathieu d’Aquin Laurian Gridinoc Vanessa Lopez

The Knowledge Media Institute, The Open Universitym.daquin@open.ac.uk

The Semantic Web is growing…

0

5

10

15

20

25

30

35

40

45

2003 2004

#SW Pages

Lee, J., Goodwin, R. (2004) The Semantic Webscape: a View of the Semantic Web. IBM Research Report.

… really growinghttp://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData

Ontologies

Metadata

<rdf:RDF><channel rdf:about=“http://watson.kmi.open.ac.uk/blog”><title>Elementaries - The Watson Blog</title><link>http://watson.kmi.open.ac.uk:8080/blog/</link><description>"Oh dear! Where the Semantic Web is going to go now?" -- imaginary user 23</description><language>en</language><copyright>Watson team</copyright><lastBuildDate>Thu, 01 Mar 2007 13:49:52 GMT</lastBuildDate><generator>Pebble (http://pebble.sourceforge.net)</generator><docs>http://backend.userland.com/rss</docs>…

<rdf:RDF><channel rdf:about=“http://watson.kmi.open.ac.uk/blog”><title>Elementaries - The Watson Blog</title><link>http://watson.kmi.open.ac.uk:8080/blog/</link><description>"Oh dear! Where the Semantic Web is going to go now?" -- imaginary user 23</description><language>en</language><copyright>Watson team</copyright><lastBuildDate>Thu, 01 Mar 2007 13:49:52 GMT</lastBuildDate><generator>Pebble (http://pebble.sourceforge.net)</generator><docs>http://backend.userland.com/rss</docs>…

<rdf:RDF> <foaf:Image rdf:about='http://static.flickr.com/132/400582453_e1e1f8602c.jpg'> <dc:title>Zen wisteria</dc:title> <dc:description></dc:description> <foaf:page rdf:resource='http://www.flickr.com/photos/xcv/400582453/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/vittelgarden/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/wisteria/'/> <dc:creator> <foaf:Person><foaf:name>Mathieu d'Aquin</foaf:name> …

<rdf:RDF> <foaf:Image rdf:about='http://static.flickr.com/132/400582453_e1e1f8602c.jpg'> <dc:title>Zen wisteria</dc:title> <dc:description></dc:description> <foaf:page rdf:resource='http://www.flickr.com/photos/xcv/400582453/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/vittelgarden/'/> <foaf:topic rdf:resource='http://www.flickr.com/photos/tags/wisteria/'/> <dc:creator> <foaf:Person><foaf:name>Mathieu d'Aquin</foaf:name> …

<rdf:RDF> <owl:Ontology rdf:about=""> <owl:imports rdf:resource="http://usefulinc.com/ns/doap#"/> </owl:Ontology> <j.1:Organization rdf:ID="KMi"> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >The Knoledge Media Institute of the Open University, Milton Keynes UK</rdfs:comment> </j.1:Organization> <j.1:Document rdf:ID="KMiWebSite"> …

<rdf:RDF> <owl:Ontology rdf:about=""> <owl:imports rdf:resource="http://usefulinc.com/ns/doap#"/> </owl:Ontology> <j.1:Organization rdf:ID="KMi"> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >The Knoledge Media Institute of the Open University, Milton Keynes UK</rdfs:comment> </j.1:Organization> <j.1:Document rdf:ID="KMiWebSite"> …

DOAP

FOAFDC

RSS TAPWORDNET

NCI GalenMusic

…… …

UoD

Next Generation Semantic Web Applications

Next Generation Semantic Web Applications

NG SW Application Semantic WebSmart Features

• Able to exploit the SW at large – Dynamically retrieving the relevant semantic

resources – Combining several, heterogeneous Ontologies

Next Generation Semantic Web Applications

Need for a Gateway to the Semantic Web

Next Generation Semantic Web Applications

Architecture

Keyword Search

SPARQLQuery

Crawling Parsing(Jena)

Validation/Analysis

Indexing

RepositoryURLs Metadata Indexes

populates populates populates populatesusedextractedretrieved

OntologyExploration

queries queriesqueriesqueries queries

request

WWW

discovered

Collecting Analyzing

Querying

Design Principles Focused quality

– Provides quality information about the collected ontologies and semantic data– Provides valuable services for semantic applications, to discover, select, exploit and combine semantic resources

Provides a variety of query and access mechanisms– For both humans (web interface) and machines (web serv., API)

– To fit applications having different purposes and requirements– Ranging from Keyword search to ontology exploration and formal queries (SPARQL)

Support for relations between ontologies– Detecting redundancy, duplication, incompatibility (contradiction), modularization, versioning, etc.

Interfaces: WUI

Web User Interface: http://watson.kmi.open.ac.uk/WatsonWUI

Collection

Applications

• Existing next generation semantic web applications based on Watson:– PowerAqua, question answering– PowerMagpie, semantic browsing– Folksonomie enrichment– Scarlet, relation discovery, ontology

matching– Ontology building– …

PowerAqua• Bridge the gap between the user and the Semantic Web:

- Provide the user the capability to query the SW using Natural Language.

• Dynamically select and combine info drawn from the vast amount of heterogeneous semantic data to answer a user’s query.

PowerAqua

1. NL Question1. NL Question

2. Linguistic interpretation2. Linguistic interpretation

3. Ontology based interpretation3. Ontology based interpretation

4. Answer4. Answer

Magpie

Ontology concepts

Instances highlighted according to their type

Contextual access to web services

PowerMagpie

Key terms

Ontologies and semantic entities retrieved from Watson

Metadata about the ontologies

PowerMagpie

Semantic entities in relation with the text from Watson

Semantic definition of the entities in the ontologies

PowerMagpie

Watson(ontology selection)

Watson(ontology selection)

Google(term frequency)

Google(term frequency)

PowerMagpie Server(term ranking, term to ontological entity)

PowerMagpie Server(term ranking, term to ontological entity)

Google API Watson API

Javascript enabled Web Browser

Javascript enabled Web Browser

PowerMagpie Interface(visualization, interaction, navigation)

client

server

external

services

AJAX

SOAP SOAP

Folksonomy enrichment

Tags

{camera, digital slr, photograph} {damage, flooding, hurricane, katrina, Louisiana}

Clusters

Digital SLR

cameraphotographtakenWith

Ontologies

NLP/ClusteringNLP/Clustering

Find and combine Online ontologies +modularizaton

+matching+modularizaton

+matching

Discovering relations between tags

Folksonomy enrichment

Discovering relations between tags

Example of result.

Ontology Matching

ka2.rdf

Researcher AcademicStaff

Sem

anti

c W

eb

Researcher

AcademicStaff

ISWC SWRCHam SeaFood

Sem

anti

c W

eb

HamSeaFood

Meat

Meat

SeaFood

Agrovoc NALT

pizza-to-go

wine.owl

NALT

Ontology Building

Next Generation Semantic Web Applications

Take Home Message

• Next Generation Semantic Web applications dynamically exploit the semantic information available on the Web

• Watson is an infrastructure that supports the development of such applications

• We are already building this next generation of semantic Web applications, allowing – question answering (PowerAqua), – semantic browsing (PowerMagpie), – semantic tagging, etc.

• using the semantic web at large.

Thank you!Watson: http://watson.kmi.open.ac.uk,

m.daquin@open.ac.ukPowerMagpie: http://powermagpie.open.ac.uk/,

l.gridinoc@open.ac.ukPowerAqua:

http://kmi.open.ac.uk/technologies/poweraqua/, v.lopez@open.ac.uk

•Step 1: Linguistic Analysis-“Show me all cities of Spain” -> <what-is, cities, Spain>

•Step 2: Identify the set of relevant ontologies - Based on PowerMap: A run time knowledge matcher to produce semantically sound mappings across ontologies and domains. I.e. through Watson it searches for approximate mappings by using lexically related words obtained from WordNet and background ontologies. Then, the candidate mappings are semantically enriched by using a similarity measure based in WordNet to obtain its sense.

•Step 3: The triple similarity services analyzes the linguistic information and the ontology semantics (relationship and taxonomy) to return a small set of ontologies (represented as ontological triples) that jointly covers the user query.

Recommended