16
ODBASE 04 26.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication Sciences Swiss Federal Institute of Technology (EPFL)

ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

  • View
    215

  • Download
    0

Embed Size (px)

Citation preview

Page 1: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

ODBASE 0426.10.04

A Necessary Condition for Semantic Interoperability in

the Large

Philippe Cudré-Mauroux and Karl AbererSchool of Computer and Communication Sciences

Swiss Federal Institute of Technology (EPFL)

Page 2: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Overview

• Peer DBMS

• Semantic graphs

• Semantic interoperability in large-scale networks

• Generatingfunctionologic model

• Results

• Use case scenarios

• Conclusions

Page 3: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Peer Data Management

Author

Author

Creator

Creator

Creator

WinFS Images peersAuthor, Keyword…

Photoshop Images peersCreator, Subject …

Familiy X peerEvent

Creator Event

Creator Author

Author Creator

• Local translations enabling global agreements

0.1

0.9

0.8

Page 4: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

On Translations

Q1=<GUID>$p/GUID</GUID> FOR $p IN /Photoshop_Image WHERE $p/Creator LIKE "%Robi%"

<Photoshop_Image> <GUID>178A8CD8865</GUID> <Creator>Robinson</Creator> <Subject> <Bag> <Item> Tunbridge Wells </Item> <Item>Royal Council</Item> </Bag> </Subject> …</Photoshop_Image>

Photoshop(own schema)

<WinFSImage> <GUID>178A8CD8866</GUID> <Author> <DisplayName> Henry Peach Robinson <DisplayName> <Role>Photographer</Role> <Author> <Keyword> Tunbridge </Keyword>…</WinFSImage>

WinFS (known schema)

T12 =<Photoshop_Image> <GUID>$fs/GUID</GUID> <Creator> $fs/Author/DisplayName </Creator></Photoshop_Image>FOR $fs IN /WinFSImage

Q2=<GUID>$p/GUID</GUID> FOR $p IN T12 WHERE $p/Creator LIKE "%Robi%"

Extending semantic integration techniques to decentralized settings!

Page 5: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

A Handful of Systems

• GridVine -- EPFL– P-Grid access structure– Principle of data independence– Semantic Web compliant– Semantic Gossiping

• Many other interesting systems– Hyperion -- univ. Toronto– Piazza -- univ. Washington– Edutella -- univ. Hannover– Mobile PDBMS -- univ. Trento– Semantic Link Networks -- ICT China– Etc.

Page 6: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Schema-to-Schema Graph

Inter-organization of the different schemas used by the peers - Logical model- Directed- Weighted- Redundant

Page 7: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

The Semantic Connectivity Graph

• Definition (Semantic Interoperability) Two peers are said to be semantically interoperable if

they can forward queries to each other in the Schema-to-Schema graph, potentially through series of semantic translation links

• Idea– As for physical network analyses, create a connectivity

layer to account for semantic interoperability

• The semantic connectivity Graph S– Unweighted, irreflexive and non-redundant version of the

Schema-to-Schema graph

Page 8: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Observations

• Theorem Peers in a set Ps are semantically interoperable iff Ss is

strongly connected, with Ss {s | p Ps, ps}

• Observation 1 A set of peers Ps cannot be semantically interoperable if

|Es| < |Vs|

• Observation 2 A set of peers Ps is semantically interoperable if

|Es| > |Vs| (|Vs|-1) - (|Vs|-1)

Page 9: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Semantic Interoperability in the Large

• Question– How can we analyze semantic interoperability in

large-scale Peer DBMS?

• Idea: use percolation theory to detect the emergence of a strongly connected component in S– Necessary condition for vertex-strong connectivity– Necessary condition for semantic interoperability

Page 10: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

The Model

• Adaptation of a recent graph-theoretic framework– Newman, Strogatz, Watts 2001

• Large-scale semantic graphs as random graphs with arbitrary degree distribution– Exponentially distributed, small-world, scale-free… graphs

• Specificities of our model– Strong clustering (clustering coefficient cc)– Bidirectionality (bidirectionality coefficient bc)

• Based on generatingfunctionality

• Percolation: z2 ≥ z1 (second and first order neighbors)

Page 11: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Main results (1)

• Necessary condition for semantic interoperability in the large: j,k (jk-j(bc+cc)-k)pjk ≥ 0– Apparition of a giant strongly-connected component

Page 12: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Main results (2)

• Size of the out-component– Propagation of a query through the semantic graph–

Page 13: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Main results (3)

Page 14: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Use Case Scenarios

• Analyzing semantic interoperability in Peer DBMS– Ways to determine statistical properties of the semantic

graph• Indexation using a common access structure (e.g., DHT)• Semantic crawlers• Semantic random walkers

• Fostering semantic interoperability in the large– Iterative approach:

• Determine the semantic connectivity state• Create semantic translations to reach super-critical phase

Page 15: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

Conclusions

• New focus on large-scale semantic systems– From bilateral to multilateral analyses

• One possible avenue: semantic connectivity graph– Percolation theory

• Initial analysis– Extensions to weighted models to predict quality of

query dissemination

• Implementation in real systems on its way– GridVine (EPFL)– PicShark (EPFL -- Microsoft Research)

Page 16: ODBASE 0426.10.04 A Necessary Condition for Semantic Interoperability in the Large Philippe Cudré-Mauroux and Karl Aberer School of Computer and Communication

ODBASE 0426.10.04

A Necessary Condition for Semantic Interoperability in

the Large

Philippe Cudré-Mauroux and Karl AbererSchool of Computer and Communication Sciences

Swiss Federal Institute of Technology (EPFL)