24
co-funded by the European Union Towards Integrating Ontologies An EDM-Based Approach Evelyn Dröge, Julia Iwanowa, Violeta Trkulja, Steffen Hennicke, Stefan Gradmann Berlin School for Library and Information Science, Humboldt-Universität zu Berlin Presentation on the 13 th International Symposium of Information Science Potsdam, 21.03.2013

Towards Integrating Ontologies An EDM-Based Approach

Embed Size (px)

Citation preview

Page 1: Towards Integrating Ontologies An EDM-Based Approach

co-funded by the European Union

Towards Integrating Ontologies

An EDM-Based Approach

Evelyn Dröge, Julia Iwanowa, Violeta Trkulja, Steffen Hennicke, Stefan GradmannBerlin School for Library and Information Science, Humboldt-Universität zu Berlin

Presentation on the 13th International Symposium of Information SciencePotsdam, 21.03.2013

Page 2: Towards Integrating Ontologies An EDM-Based Approach

DM2E project

Digitised Manuscripts to Europeana (DM2E) • EU-funded Europeana satelite project• Duration: Three years (2012 – 2015)• Partners from Germany, Austria, Norway, Greece, UK and Italy• Primary aims: To enable as many content providers as possible

to get their data into Europeana and to stimulate the creation of new tools and services for reuse of Europeana data in the Digital Humanities

IBI at the Humboldt-Universität zu Berlin• Coordinates the project• Is further involved in modeling and in evolving the technical

infrastructure

Towards Integrating Ontologies: An EDM-Based Approach 221.03.2013

Page 3: Towards Integrating Ontologies An EDM-Based Approach

DM2E: Interoperability approach

• Base: Semantic Web and Linked Data

– Enable and facilitate data interoperability

– Share and reuse ontologies

• Build on common data models

– EDM, DC and DCTerms, OAI-ORE, CIDOC-CRM, FOAF, SKOS

– BIBO, VOID, FABIO

• Uses W3C standards

– RDF(S), OWL

Enable data interoperability

Towards Integrating Ontologies: An EDM-Based Approach 321.03.2013

Page 4: Towards Integrating Ontologies An EDM-Based Approach

Why should we reuse ontologies?

• Nature of Linked Data

• Reduce multiple likewise resources

• Better visibility of the ontology

• Better quality of the ontology

• Better integration into the Linked Open Data Cloud

• Easy access for more applications

• Make it easier for others to reuse the vocabulary

Towards Integrating Ontologies: An EDM-Based Approach 421.03.2013

Page 5: Towards Integrating Ontologies An EDM-Based Approach

Reuse practice

• Three steps workflow

1. Ontology retrieval

2. Integration-oriented ontology evaluation Identification of suitable classes and properties

Analysis of missing elements

3. Ontology integration Different methods

Towards Integrating Ontologies: An EDM-Based Approach 521.03.2013

Simperl, 2010

Page 6: Towards Integrating Ontologies An EDM-Based Approach

Steps in the Reuse Workflow

Page 7: Towards Integrating Ontologies An EDM-Based Approach

Step 1: Ontology retrieval

Towards Integrating Ontologies: An EDM-Based Approach 721.03.2013

Page 8: Towards Integrating Ontologies An EDM-Based Approach

Step 2: Integration-oriented ontology evaluation

EDM

DC

DCTerms

SKOS

FOAF

ORE

rdaGr2

Towards Integrating Ontologies: An EDM-Based Approach 821.03.2013

• Which elements are

missing or can be

replaced?

• What are suitable

classes or properties

in other standards or

vocabularies?

Page 9: Towards Integrating Ontologies An EDM-Based Approach

Step 3: Ontology integration

Four different methods can be used for reusing ontologies:

Towards Integrating Ontologies: An EDM-Based Approach 921.03.2013

• External classes or properties are directly used in the ontology buildup.

• Can be used if the external class or property exactly matches the own resource.

• Definitions or labels should not be adjusted!

1. Direct adoption of external resources

• The name of a foreign property or class is used without its original namespace.

2. Reuse of external resources without original URIs

• Properties or classes are created in the own namespace.

• Subproperty or subclass relations are built between them and the external resource.

3. Reuse of external resources with integration into ontology hierarchy

•owl:equivalentClass or owl:equivalentProperty are used to refer to equivalent external resources.

4. Referring to external resources via equivalence properties

Page 10: Towards Integrating Ontologies An EDM-Based Approach

Integrating ontologies:Method comparison

Towards Integrating Ontologies: An EDM-Based Approach 1021.03.2013

Original URI New URI

IndirectIntegration

DirectIntegration

4

1 2

3

Page 11: Towards Integrating Ontologies An EDM-Based Approach

Further Insight into the Integration Methods

Page 12: Towards Integrating Ontologies An EDM-Based Approach

1. Method:Direct adoption of external resources

• Example:

Integrating bibo properties for describing text objects

• When should the method be used?– If the meaning and definition of the external resource is identical

to the meaning of the own resource.

• Advantages and Disadvantages+ Pro: Reduces the amount of resources that describe the same

thing in other words.− Con: The method is also used when the meaning is not exactly the

same, which can lead to conflicting descriptions.

Towards Integrating Ontologies: An EDM-Based Approach 1221.03.2013

dm2edata:exampleItem

edm:type text ;

bibo:isbn „978-3-86680-192-9“ .

Page 13: Towards Integrating Ontologies An EDM-Based Approach

1. Method:Direct adoption of external resources

Example: foaf:Document

Towards Integrating Ontologies: An EDM-Based Approach 1321.03.2013

Document

Dokument

Document@en

Documents

Physical Thing ?

Abstract Class ?

Electronic?

Page 14: Towards Integrating Ontologies An EDM-Based Approach

Example: Some properties of foaf:Document

Towards Integrating Ontologies: An EDM-Based Approach 1421.03.2013

1. Method:Direct adoption of external resources

rdfs:label DocumentDocument@enDocumentation@endocument@en

dc:title Document@endcterms:identifier foaf:Documentation @en-gb

rdfs:comment A document.An abstract class defining any kinds of publishing work.@enThe foaf:Document class fully represents the ADMS concept of documentation.@enSimilar to the Agent concept, we have again decided to include a concept from thepopular FOAF ontology. The FOAF Vocabulary Specification currently definesDocument in a very loose way: The foaf:Document class represents those thingswhich are, broadly conceived, 'documents'. ... We do not (currently) distinguishbetween physical and electronic documents, or between copies of a work and theabstraction those copies embody." […]

vann:usageNote Used in ADMS specifically for the class of documents that further describe a SemanticAsset or give guidelines for its use. ADMS expects all documents to have a title (usedcterms:title).^^

Retrieved from the Linked Open Vocabularies SPARQL endpoint:http://lov.okfn.org/endpoint/lov

Page 15: Towards Integrating Ontologies An EDM-Based Approach

2. Method: Reuse with new URIs

Towards Integrating Ontologies: An EDM-Based Approach 1521.03.2013

• Example: Different URIs for the class thing

Are they identical or equivalent?

• Advantages and Disadvantages+ Pro: Can be used as a first step in the construction process if other

vocabularies are not known.

− Con: It is not clear how the different classes can be set into relation and it makes it hard to query or reuse the vocabulary.

• This method should not be used!

owl:Thing

gold:Thing

Page 16: Towards Integrating Ontologies An EDM-Based Approach

3. Method: Reuse with subrelations

Towards Integrating Ontologies: An EDM-Based Approach 1621.03.2013

• Example:

New dm2e property and class as subelements

• When should this method be used?– If an existing class or property is found which is broader defined

than the resource that should be created.

• Advantages and Disadvantages+ Pro: The new property or class can have its own description.+ Pro: Broader properties or classes are easy to be found in upper

ontologies .

dm2e:Agent rdfs:subClassOf foaf:Agent .

dm2e:publishedAt rdfs:subPropertyOf

dcterms:spatial .

Page 17: Towards Integrating Ontologies An EDM-Based Approach

4. Method: Reuse with equivalence properties

Towards Integrating Ontologies: An EDM-Based Approach 1721.03.2013

• Example:

Reference to equivalent resources

• When should this method be used?– If exactly the same individuals can be part of all equivalent classes

or can be connected with all equivalent properties.

• Advantages and Disadvantages+ Pro: The elements can have different descriptions.− Con: Not every tool can interpret the equivalence properties.

edm:Event owl:equivalentClass

crm:E4.Period .

prism:issn owl:equivalentProperty

bibo:issn .

Page 18: Towards Integrating Ontologies An EDM-Based Approach

Ontology Reuse in DM2E

Page 19: Towards Integrating Ontologies An EDM-Based Approach

Background: The EDM and its specialisations

The EDM (Europeana Data Model)

… builds the backbone of Europeana

… unites several standards and vocabularies

… covers the representation of cultural heritage objects from libraries, archives and museums

… is as generic as possible

… can be specialised for different domains

Towards Integrating Ontologies: An EDM-Based Approach 1921.03.2013

Page 20: Towards Integrating Ontologies An EDM-Based Approach

Reusing practice in EDM and DM2E

Towards Integrating Ontologies: An EDM-Based Approach 2021.03.2013

• Reuse in EDM– Method 1: Direct integration

• Problem: Definitions may differ

– Method 4: Use of equivalence properties

• Reuse in DM2E– Method 1: Direct integration

• Here: Use of additional DM2E scope notes that add information aboutthe reuse of the integrated element in DM2E

– Method 3: Adding resources with subproperties• If the new resource has a narrower definition

– Method 4: Use of equivalence properties• Analogue to the EDM for additional classes or properties

Primarily used

Used for additional classes

Page 21: Towards Integrating Ontologies An EDM-Based Approach

Reusing practice in EDM and DM2E II

• Example: Methods 1 and 3

Towards Integrating Ontologies: An EDM-Based Approach 2121.03.2013

DC EDM DM2E

dc:contributor dc:contributor dm2e:contributor

rdfs:subPropertyOf

dc:contributor

Range: Not restricted. Range: Person,

organisation or service.

Range: Person as an URI

of type edm:Agent.

Definition: „An entity that

is responsible for making

contributions to the

resource.“

Definition: „An agent that

is responsible for making

contributions to the

resource.“

Definition: „A person that

is responsible for making

contributions to the

resource.“

Page 22: Towards Integrating Ontologies An EDM-Based Approach

Conclusion

• Reuse in DM2E: Mixture of all methods(excluding method 2)Making use of the method that suits best

• There is not one „best method“ for all reuse cases…

… but contradictions in descriptions can beavoided even if resources are reused!

• What we did not (yet) cover: „When owl:sameAs is not thesame…“ Problems that might occur when there are too many

different understandings of resource meanings

Towards Integrating Ontologies: An EDM-Based Approach 2221.03.2013

Page 23: Towards Integrating Ontologies An EDM-Based Approach

Thank you for your attention!

Evelyn DrögeJulia Iwanowa

Berlin School for Library andInformation Science

Humboldt-Universität zu Berlin

www.ibi.hu-berlin.de

Digitised Manuscripts to Europeana

www.dm2e.eu

[email protected]@ibi.hu-berlin.de

Towards Integrating Ontologies: An EDM-Based Approach 2321.03.2013

Page 24: Towards Integrating Ontologies An EDM-Based Approach

References

Literature• Berners-Lee, T. (2006). Linked Data - Design Issues. W3C Website.

http://www.w3.org/DesignIssues/LinkedData. [03.03.2013]• Halpin, H., & Hayes, P. J. (2010). When owl:sameAs isn’t the Same: An Analysis of Identity

Links on the Semantic Web. Proceedings of the WWW2010 Workshop on Linked Data onthe Web, LDOW 2010, Raleigh, USA, April 27, 2010, CEUR Workshop Proceedings (Bd.628).

• Heath, T., & Bizer, C. (2011). Linked Data: Evolving the Web into a Global Data Space.Synthesis Lectures on the Semantic Web: Theory and Technology (Bd. 1). Morgan &Claypool.

• Simperl, E. (2010). Guidelines for Reusing Ontologies on the Semantic Web. InternationalJournal of Semantic Computing, 04(02), 239–283.

Images• Footstep (Slide 6): http://openclipart.org/detail/left-footprint-by-anonymous• Magnifier (Slide 11): http://openclipart.org/detail/159469/web-search-grayscale-by-

sibskull• Document (Slide 13): http://info.docuvantage.com/Portals/61671/images/stack%20of%

20files%20photo_istock.jpg• Ontology (Slide 19): http://openclipart.org/detail/133363/ontology-by-imad• IBI (Slide 23): http://commons.wikimedia.org/wiki/File:Berlin,_Mitte,_Dorotheenstrasse,

_Handelskammer_Berlin_02.jpg

Towards Integrating Ontologies: An EDM-Based Approach 2421.03.2013