20
WP8 Case study: Cultural Heritage Dana Dann ´ ells, Aarne Ranta, Ramona Enache, Mariana Damova, Maria Mateva University of Gothenburg and Ontotext August 2013

WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

WP8 Case study: Cultural Heritage

Dana Dannells, Aarne Ranta, Ramona Enache,Mariana Damova, Maria Mateva

University of Gothenburg and Ontotext

August 2013

Page 2: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

The aim of this work

To build an ontology-based application forcommunication of museum content on the SemanticWeb and make it accessible in 15 languages.

Page 3: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Outline

Museum Reason-able View: Interoperable culturalheritage knowledge bases

Ontology-based multilingual grammar for retrievingand generating museum content:

RDF to NL – well-formed descriptionsNL to RDF – SPARQL queries linearization

Cross-language retrieval and representation systemusing Semantic Web technology

Page 4: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

The Museum Reason-able View (MRV)

Domain ontology: CIDOC Conceptual ReferenceModel (CRM) v. 5.0.1

Application ontologies: Painting ontology andMuseum Artifacts Ontology (MAO)

Ontology Classes PropertiesCIDOC-CRM 87 130Painting ontology 197 107MAO 10 20Total 836 440

Page 5: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

The museum Linked Open Data (LOD)

Gothenburg City Museum (GCM): Only twocollections (GSM and GIM)

DBpedia: Larger amount of painting entities

Source Amount of entitiesGCM 48DBpedia 614Total 662

Page 6: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

The application grammar overview

Page 7: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Supported languages

Bulgarian, Catalan, Danish, Dutch, English,Finnish, French, Hebrew, Italian, German,Norwegian, Romanian, Russian, Spanish, Swedish.

Page 8: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

The lexicon grammar

Covers a subset of the ontology classes and properties

Classes and properties were manually translated

Input:createdBy (Ophelia, Brynolf Wennerberg)isA (Brynolf Wennerberg, Painter).isA (Ophelia, Painting)

Direct verbalization:Ophelia is a painting. Ophelia was created by Brynolf

Wennerberg. Brynolf Wennerberg is a painter.

Our approach:Ophelia was painted by Brynolf Wennerberg.

Page 9: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

The data grammar

Contains ontology entities that were extracted from GCMand DBpedia

No adequate translations from DBpedia

Entities of museum names are automatically translatedfrom Wikipedia

Class EntitiesTitle 662Painter 116Museum 104Place 22Total 904

Page 10: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Automatic translation process overview

Page 11: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Available museum names translations

Language Translated (out of 104)Bulgarian 26Catalan 63Danish 33Dutch 81Finnish 40French 94Hebrew 46Italian 94German 99Norwegian 50Romanian 27Russian 87Spanish 89Swedish 58

Page 12: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Text grammar

Nine classes: Title, Painter, Type, Colour, Size, Year,Material, Museum, Place

Three are obligatory

Interior was painted by Edgar Degas.

One function, three sentences

Interior was painted on canvas by Edgar Degas in1868. It measures 81 by 114 cm and it is painted in redand white. This painting is displayed at thePhiladelphia Museum of Art.

Page 13: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Text grammar: RDF to NL

Page 14: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Word alignment example

Page 15: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Multilingual challenges

LexicalizationsClasses: compounds, multiword expressionsProperties: verbs, adverbs, prepositions

Order of semantic elementsMaterial, YearTense and voicePast, past participle, present, active/passive

AggregationConjunction, relative clause, punctuations

CoreferencePronoun, noun, empty reference

Page 16: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Multilingual examples

Page 17: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Query grammar: NL to SPARQL

Page 18: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Grammar advantage/disadvantages

+ Modular grammar design

+ The lexicon and the data are shared by all grammars

+ 16 text patterns from one function

+ Natural realization of the ontology content

+ Takes 30 minutes to implement a new language if thelanguage is in the RGL

- The texts can become artificial if the names aremissing translations

Page 19: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Grammar writing

It may take two to five days to add a new language if

the language is not in the RGL (e.g. Hebrew)the grammarian is familiar with GFand he/she acquires good knowledge of the

language

Page 20: WP8 Case study: Cultural Heritage - Grammatical Frameworkschool.grammaticalframework.org/2013/slides/dana... · 2013-08-27 · WP8 Case study: Cultural Heritage ... CIDOC-CRM 87 130

Demo

http://museum.ontotext.com/