32
SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM - DL A OUTLOOK SIM - DL A :ANOVEL S EMANTIC S IMILARITY MEASURE FOR DESCRIPTION L OGICS R EDUCING I NTER -C ONCEPT TO I NTER -I NSTANCE S IMILARITY Krzysztof Janowicz and Marc Wilkes Münster Semantic Interoperability Lab (MUSIL) Institute for Geoinformatics University of Muenster, Germany {janowicz|marc.wilkes}@uni-muenster.de June 4, 2009 SIM - DL A KRZYSZTOF JANOWICZ AND MARC WILKES

SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

SIM − DLA: A NOVEL SEMANTIC

SIMILARITY MEASURE FOR DESCRIPTION

LOGICS REDUCING INTER-CONCEPT TO

INTER-INSTANCE SIMILARITY

Krzysztof Janowicz and Marc Wilkes

Münster Semantic Interoperability Lab (MUSIL)Institute for Geoinformatics

University of Muenster, Germany{janowicz|marc.wilkes}@uni-muenster.de

June 4, 2009

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 2: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

1 SEMANTICS IN GISCIENCEGeoinformationSpatial Data InfrastructureSemantic Enablement Layer

2 SIMILARITY (REASONING) IN GISCIENCEADL Gazetteer InterfaceSimCat Gazetteer InterfaceSIM-DL Server and Plugin

3 SIM − DLAWhy SIM − DLA?The Similarity Tableau for SHIA Simplified Example

4 DISCUSSION & OUTLOOK

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 3: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

GEOINFORMATION

GEOINFORMATION

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 4: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

GEOINFORMATION

GEOINFORMATION

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 5: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

GEOINFORMATION

GEOINFORMATION

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 6: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

SPATIAL DATA INFRASTRUCTURE (OGC SERVICES)

SPATIAL DATA INFRASTRUCTURE: OGC SERVICES

� Show me Hills, Forests, Rivers, [Feature Type] ... at [Loc] →WFS� Show me a map of this area→WMS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 7: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

SPATIAL DATA INFRASTRUCTURE (OGC SERVICES)

SPATIAL DATA INFRASTRUCTURE: OGC SERVICES

� Water gauge? [Stimulus Type] ... at [Loc][Time] → SOS→WPS� [Sensor] move to [Loc][Time] → SPS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 8: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

SPATIAL DATA INFRASTRUCTURE (OGC SERVICES)

SPATIAL DATA INFRASTRUCTURE: OGC SERVICES

� Flooding! [Event Type] ... at [Loc][Time] → SAS→WNS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 9: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

YET ANOTHER LAYER CAKE

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 10: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

1 SEMANTICS IN GISCIENCEGeoinformationSpatial Data InfrastructureSemantic Enablement Layer

2 SIMILARITY (REASONING) IN GISCIENCEADL Gazetteer InterfaceSimCat Gazetteer InterfaceSIM-DL Server and Plugin

3 SIM − DLAWhy SIM − DLA?The Similarity Tableau for SHIA Simplified Example

4 DISCUSSION & OUTLOOK

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 11: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

ADL GAZETTEER INTERFACE

ADL GAZETTEER INTERFACE

Based on a Feature Type Thesaurus (> 1000 types)SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 12: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

SIMCAT GAZETTEER INTERFACE

SIMCAT GAZETTEER INTERFACE

Based on a Feature Type OntologySIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 13: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

SIM-DL SERVER AND PLUGIN

SIM-DL SERVER AND PLUGIN

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 14: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

1 SEMANTICS IN GISCIENCEGeoinformationSpatial Data InfrastructureSemantic Enablement Layer

2 SIMILARITY (REASONING) IN GISCIENCEADL Gazetteer InterfaceSimCat Gazetteer InterfaceSIM-DL Server and Plugin

3 SIM − DLAWhy SIM − DLA?The Similarity Tableau for SHIA Simplified Example

4 DISCUSSION & OUTLOOK

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 15: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

WHY SIM − DLA?

THE CHALLENGE

� SIM-DL is a structural measure; it requires the normalizationof concept descriptions and recursively compares the (syntactic)structure of the normalized descriptions for similarity.

� Feature type ontologies require expressive description logicssuch as OWL-DL; a structural approach fails to support these DLs.

� Why not use an ABox-based similarity measure?

� Most feature type ontologies cannot be populated so far. Fea-tures are stored in shape-files, geo-databases, GML files, satelliteimages,...

−→ Can we learn from the history of subsumption reasoningand switch to a tableau-based approach?

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 16: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

WHY SIM − DLA?

THE IDEA

−→ Reduce the problem of inter-concept similarity to inter-instance similarity?

� How to get or compute the instances required for comparison?

� From the completion tree of a modified tableau algorithm asused for satisfiability checking. Instead of trying to find one (clash-free) model, generate a set of proxy individuals for comparison.

� How to compute similarity between proxy individuals?

� Reuse existing similarity framework, i.e., functions, contexts,similarity modes, and the alignment procedure from SIM-DL.

� Jaccard similarity coefficient of co-occurrence

� Network measures for role hierarchies and neighborhood� . . .

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 17: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

THE SIMILARITY TABLEAU FOR SHI

THE SIMILARITY TABLEAU FOR SHI. . .The t-rule:Condition: C1 t C2 ∈ L(x) and x is not indirectly blocked.Action: Create three t-successors w , y , z of x with:L(w) := (L(x)\{C1 t C2}) ∪ {C1}L(y) := (L(x)\{C1 t C2}) ∪ {C2}L(z) := (L(x)\{C1 t C2}) ∪ {C1, C2}

. . .The ∀-rule:Condition: (∀R.C) ∈ L(x), x is not indirectly blocked.Action:

If there is an R-neighbor y of x and C /∈ L(y):L(y) := L(y) ∪ {C}.

If there is no R-neighbor y of x , create two ∀-successors y , z of x with:L(y) := L(x) (y will then be blocked)L(z) := L(x) ∪ {∃R.C}.

. . .

≥ and ≤-rules . . .

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 18: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

A SIMPLIFIED EXAMPLE

SIM − DLA: A SIMPLIFIED EXAMPLE

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 19: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

A SIMPLIFIED EXAMPLE

SIM − DLA: ALIGNMENT MATRIX (FOR MAX. SIMILARITY)

� a) selecting individuals for comparison

� b) selecting assertions for comparison

� c) . . .

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 20: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

1 SEMANTICS IN GISCIENCEGeoinformationSpatial Data InfrastructureSemantic Enablement Layer

2 SIMILARITY (REASONING) IN GISCIENCEADL Gazetteer InterfaceSimCat Gazetteer InterfaceSIM-DL Server and Plugin

3 SIM − DLAWhy SIM − DLA?The Similarity Tableau for SHIA Simplified Example

4 DISCUSSION & OUTLOOK

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 21: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

DISCUSSION & OUTLOOK

Support more expressive DL (e.g., cardinality restrictions)Integration into existing reasoners (e.g. FaCT, Pellet)Approximation and optimization (thresholds –> roles)Evaluation of SIM − DLA (Human Participants Test)

Conceptualization of and reasoning about eventsConcepts change over time (ecology of concepts)

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 22: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

QUESTIONS?

Thank you!

SimCat Project Website: sim-dl.sourceforge.net

Münster Semantic Interoperability Lab: musil.uni-muenster.de

International Conference on Geospatial Semantics: www.geosco.org

Semantic Enablement Community at 52◦North: 52north.org/semantics

The SIM-DL server, plug-in and user interfaces are free and open source software andcan be downloaded at the SimCat project website.

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 23: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

NAVIGATION

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 24: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

QUERY-BY-EXAMPLE

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 25: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

SIMILARITY REASONING

1 Definition of application area and intended audience2 Selection of search (query) and target concepts3 Transformation of concepts to canonical form4 Definition of an alignment matrix for concept descriptors5 Application of constructor specific similarity functions6 Determination of standardized overall similarity7 Interpretation of the resulting similarity value(s)

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 26: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

COMPLEXITY?

� EXPTIME-complete, but

’[...]This shows that, at the beginning of the new millennium, evenan EXPTIME-algorithm is no longer automatically considered tobe impractical in the DL community.’(Baader & Sattler; An Overview of Tableau Algorithms for Description Logics (2000))

−→ Optimization and Approximation

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 27: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

THE ADL GAZETTEER WEB INTERFACE & THESAURUS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 28: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

THE ADL GAZETTEER WEB INTERFACE & THESAURUS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 29: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

THE ADL GAZETTEER WEB INTERFACE & THESAURUS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 30: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

THE ADL GAZETTEER WEB INTERFACE & THESAURUS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 31: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

THE ADL GAZETTEER WEB INTERFACE & THESAURUS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES

Page 32: SIM DLA: A N SEMANTIC IMILARITY M DESCRIPTION L R I -I Sjano/janowicz_wilkes_eswc_2009_slides.pdfSimCat Gazetteer Interface SIM-DL Server and Plugin 3 SIM DL A Why SIM DL A? The Similarity

SEMANTICS IN GISCIENCE SIMILARITY REASONING SIM − DLA OUTLOOK

THE ADL GAZETTEER WEB INTERFACE & THESAURUS

SIM − DLA KRZYSZTOF JANOWICZ AND MARC WILKES