16
1 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.i e From OntoSelect to OntoSelect- SWSE Paul Buitelaar, Andreas Harth DERI – NUIG, Galway February 2009

From OntoSelect to OntoSelect-SWSE

  • Upload
    freya

  • View
    26

  • Download
    0

Embed Size (px)

DESCRIPTION

From OntoSelect to OntoSelect-SWSE. Paul Buitelaar, Andreas Harth DERI – NUIG, Galway February 2009. Outline. OntoSelect @ DFKI Recap of OntoSelect Functionality OntoSelect @ DERI SWSE: Semantic Web Search Engine Architecture OntoSelect-SWSE OntoSelect-SWSE Experiments - PowerPoint PPT Presentation

Citation preview

Page 1: From OntoSelect to OntoSelect-SWSE

1 Copyright 2008 Digital Enterprise Research Institute. All rights reserved.

Digital Enterprise Research Institute www.deri.ie

From OntoSelect to OntoSelect-SWSE

Paul Buitelaar, Andreas HarthDERI – NUIG, Galway

February 2009

Page 2: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

2

Outline

OntoSelect @ DFKI Recap of OntoSelect Functionality

OntoSelect @ DERI SWSE: Semantic Web Search Engine Architecture OntoSelect-SWSE

OntoSelect-SWSE Experiments Ranked List of Ontologies with Rich Metadata

Page 3: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

3

OntoSelect

Ontology Library and Ontology Search Service

http://olp.dfki.de/OntoSelect

OntoSelect monitors the web for ontologies (indexing/updates) Ontology browse and search (by keyword, topic, document) Class, property and (multilingual) label browse and search Ontology publishing (submit your ontology) Statistics on

– Formats– Human languages– Frequently used labels– Ontology publishing

Page 4: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

4

Browse Ontologies

Page 5: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

5

Ontology Search

Page 6: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

6

Expand Search with Wikipedia Topic

Page 7: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

7

Keyword Extraction & Ranked Ontologies

Page 8: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

8

OntoSelect Statistics - Multilinguality

English German French Spanish

Portugese Hungarian Italian Polish

Dutch Russian Japanese

Distribution of languages in 136 ontologies with multilingual labels - out of 1530 ontologies currently collected (~9%)

Page 9: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

9

OntoSelect Statistics - Labels

Most frequently used labels (‘words’, ‘terms’) in 1530 ontologies

Page 10: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

10

SWSE: Semantic Web Search Engine

Page 11: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

11

SWSE Architecture

Distributed shared-nothing architecture Implementation scales to billions of triples

Crawler Reasoner Indexer

HTTPQuery

Processor

Ranker

User Interface

Linked Data Raw

Data

Material-isedData

CompleteIndex

Ranks

Page 12: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

12

OntoSelect-SWSE

Page 13: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

13

OntoSelect-SWSE Experiments

Experiment Onto Seed set from Google & Yahoo (*.owl, *.daml, *.rdfs) 27,519 data sources (ontologies) 6.5m statements

Experiment Web Crawling six degrees from seed URI

http://www.w3.org/People/Berners-Lee/card 100,555 data sources (instance data + ontologies) 11.9m statements

Page 14: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

14

Ranked List of Ontologies - Onto

Page 15: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

15

Ranked List of Ontologies - Web

Page 16: From OntoSelect to OntoSelect-SWSE

Digital Enterprise Research Institute www.deri.ie

16

Conclusion

SWSE framework facilitates web data experiments

Taking into account real-world instance usage improves ontology ranking

Web data is noisy (more data providers == more noise)

Ontology registry should include rating facility to “vote out” data sources that do not provide consensus view