
Incorporating AGROVOC in DSpace-based Agricultural Repositories

Dr. Devika P. Madalli & Nabonita Guha

Documentation Research & Training Centre, Indian Statistical Institute, Bangalore


Nov. 10, 2006 Seventh Agricultural Ontology Service (AOS) Workshop

Overview

Some observations on the present ontology plugin
AGROVOC Thesaurus Plugin for DSpace
AGROVOC in SKOS
Difference between Thesaurus and Ontology
Moving towards Ontology


Some Observations

The present ontology plugin integrated with DSpace 1.4 doesn’t support synonyms and related terms
    It shows only broader and narrower relationships
    It requires converting the OWL or SKOS representation of AGROVOC into the native plugin format every time AGROVOC is updated
Ideally, thesaurus input could be in either SKOS or OWL format (achievable with XSLT)
A cron job checks for updates on the AGROVOC site and downloads either the SKOS or the OWL file
Alternatively, the ontology should be displayed on the fly from the AGROVOC site (idea from Dr. Johannes Keizer)
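
The cron-driven update check could be sketched roughly as follows. This is a minimal sketch, assuming the job compares a checksum of the freshly downloaded SKOS or OWL file with the local copy. The names `refresh_thesaurus` and `fetch` are hypothetical, and no actual AGROVOC download URL is assumed: the caller supplies whatever download routine applies.

```python
import hashlib
from pathlib import Path
from typing import Callable


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 hex digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def refresh_thesaurus(local_file: Path, fetch: Callable[[], bytes]) -> bool:
    """Overwrite local_file when the fetched copy differs from it.

    `fetch` is a hypothetical callable that downloads the current
    SKOS or OWL file and returns its bytes.
    Returns True if the local copy was created or replaced.
    """
    remote = fetch()
    if local_file.exists() and sha256_of(local_file.read_bytes()) == sha256_of(remote):
        return False  # nothing changed, keep the local copy
    local_file.write_bytes(remote)
    return True
```

A scheduled job would call `refresh_thesaurus` and rerun the conversion into the plugin format only when it returns `True`.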


AGROVOC Thesaurus Plug-in

To provide controlled vocabulary support in subject description in DSpace and depict:
    Equivalence
    Homographs
    Broader/narrower/associated relationships
To …
    Provide standard access points to documents; and
    Thesaurus-based indexing and searching


Handling Synonyms

At the time of creating the metadata
    The synonymous terms are added along with the standard term
At the time of search
    If the user enters a non-standard term, it is replaced by the standard term, and the search is performed using the standard term
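
The search-time replacement could look roughly like this. It is a minimal sketch, assuming the thesaurus is available as a mapping from each standard term to its synonyms. The function names and the sample entry are illustrative and are not taken from AGROVOC or from the plugin.

```python
def build_lookup(concepts):
    """Map every label (standard or synonym), lower-cased, to the standard term.

    `concepts` is a dict of standard term -> list of synonymous terms.
    """
    lookup = {}
    for standard, synonyms in concepts.items():
        lookup[standard.lower()] = standard
        for synonym in synonyms:
            lookup[synonym.lower()] = standard
    return lookup


def normalise_query(term, lookup):
    """Return the standard term for a user query, or the query itself if unknown."""
    return lookup.get(term.strip().lower(), term)


# Illustrative entry only, not an AGROVOC record.
lookup = build_lookup({"Maize": ["Corn"]})
search_term = normalise_query("corn", lookup)
```

The search is then run with `search_term` rather than with the string the user typed.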


Handling Homographs

Homograph resolution (same spelling but different meaning)
    Can be resolved using the context of the given term
At the search stage, the system will retrieve the subject string attached to the document along with the associated documents
    Fish – agriculture
    Fish – cooking
    Fish – decoration
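
One way to present homograph results is to group the retrieved documents by the context part of their subject strings. The sketch below assumes subject strings follow the "term – context" pattern of the Fish examples. The function name and the sample documents are hypothetical.

```python
from collections import defaultdict


def group_by_context(term, documents):
    """Group document titles by the context qualifying a homograph.

    `documents` is an iterable of (title, subject_string) pairs, where a
    subject string is assumed to look like "Fish – agriculture".
    """
    groups = defaultdict(list)
    wanted = term.strip().lower()
    for title, subject in documents:
        head, sep, context = subject.partition(" – ")
        if sep and head.strip().lower() == wanted:
            groups[context.strip()].append(title)
    return dict(groups)


# Hypothetical documents for illustration.
docs = [
    ("Doc A", "Fish – agriculture"),
    ("Doc B", "Fish – cooking"),
    ("Doc C", "Fish – agriculture"),
]
by_context = group_by_context("fish", docs)
```

The user can then choose the intended sense from the keys of `by_context`.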


AGROVOC in SKOS


SKOS

SKOS = Simple Knowledge Organization System

Provides a model for expressing the basic structure and content of concept schemes [1] in RDF syntax

Concept Scheme = thesaurus, classification scheme, taxonomy, subject heading list, terminologies, etc


SKOS: Classes

Classes [2]
    ConceptScheme (AGROVOC, MeSH, AAT, etc.)
    Concept (actual terms, e.g. animals, crops, etc.)
    Collection (group of concepts, e.g. meat cattle)


SKOS: Properties

Properties [2]
    altLabel, broader, changeNote, narrower, prefLabel, scopeNote,
    hasTopConcept, historyNote, isPrimarySubjectOf, isSubjectOf, member, definition,
    note, primarySubject, related, subject, example, subjectIndicator


Thesaurus Vs Ontology


Thesaurus is …

Controlled vocabulary tool in which relationships between concept terms are shown as:
    Equivalence = synonyms
    Homographic = same spelling
    Hierarchical = broader -> narrower
    Associative relationships = related terms
Represents the terms denoting certain concepts


Ontology is …

An ontology defines the terms used to describe and represent an area of knowledge (subject matter) [3]

Model for the meaning of those terms

Definition of the vocabulary used


Related Terms in AGROVOC for Cow

COW (real world entity)
    BT: Cattle
    NT: Suckler Cow
    NT: Dairy Cow
    RT: Heifers
    RT: Cow Milk
    RT: Milk yielding cows


SKOS representation of AGROVOC Terms

Concept 1939 (http://www.fao.org/aos/agrovoc#c_1939)
    skos:prefLabel: Cows
    skos:broader: Cattle
    skos:narrower: Suckler Cow
    skos:narrower: Dairy Cow
    skos:related: Milk yielding cows
    skos:related: Heifers
    skos:related: Cow milk
Also shown in the diagram: Domestic Animal
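
A plugin that accepts any SKOS thesaurus needs to read these properties from the RDF file. The following is a minimal sketch using only the Python standard library, assuming the thesaurus is serialised as RDF/XML with one `skos:Concept` element per concept. The sample document is hand-written from the diagram; the `example.org` URIs are placeholders and not real AGROVOC identifiers, and `read_concepts` is a hypothetical helper rather than part of DSpace.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS = "http://www.w3.org/2004/02/skos/core#"

# Hand-written sample; only the c_1939 URI comes from the slide.
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://www.fao.org/aos/agrovoc#c_1939">
    <skos:prefLabel xml:lang="en">Cows</skos:prefLabel>
    <skos:broader rdf:resource="http://example.org/placeholder#cattle"/>
    <skos:narrower rdf:resource="http://example.org/placeholder#suckler_cow"/>
    <skos:narrower rdf:resource="http://example.org/placeholder#dairy_cow"/>
    <skos:related rdf:resource="http://example.org/placeholder#heifers"/>
    <skos:related rdf:resource="http://example.org/placeholder#cow_milk"/>
    <skos:related rdf:resource="http://example.org/placeholder#milk_yielding_cows"/>
  </skos:Concept>
</rdf:RDF>"""


def read_concepts(rdf_xml):
    """Return {concept URI: {prefLabel, altLabel, broader, narrower, related}}."""
    root = ET.fromstring(rdf_xml)
    concepts = {}
    for node in root.findall(f"{{{SKOS}}}Concept"):
        entry = {"prefLabel": None, "altLabel": [],
                 "broader": [], "narrower": [], "related": []}
        for child in node:
            # Element tags look like "{namespace}localname".
            namespace, _, name = child.tag[1:].partition("}")
            if namespace != SKOS:
                continue
            if name == "prefLabel":
                entry["prefLabel"] = child.text
            elif name == "altLabel":
                entry["altLabel"].append(child.text)
            elif name in ("broader", "narrower", "related"):
                entry[name].append(child.get(f"{{{RDF}}}resource"))
        concepts[node.get(f"{{{RDF}}}about")] = entry
    return concepts


cows = read_concepts(SAMPLE)["http://www.fao.org/aos/agrovoc#c_1939"]
```

Because the reader depends only on SKOS property names, the same code would apply to a thesaurus other than AGROVOC, provided it uses this serialisation.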


RDF Representation of AGROVOC Terms

[Diagram: nodes COW, Animals, Wild, Domestic Animal, Cattle, Cow milk, Heifers, and Milk yielding cows, connected by six subClassOf edges; Cow milk is attached through a hasByproduct property annotated with Domain and Range.]


Current Approach

To develop a plugin which can provide thesaural support in DSpace

Not only specific to AGROVOC, but can support any thesaurus in DSpace represented in SKOS


Moving Towards Ontology

Decision regarding the nature of Information Retrieval
    Document retrieval; or
    Retrieval of exact information extracted from the document
To develop a generic framework for Agriculture domain knowledge, with scope for further extension to narrower domains
To incorporate an inferencing mechanism by defining certain rules (SWRL)
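
To illustrate what rule-based inferencing adds over a thesaurus, here is a minimal sketch of one rule, the transitivity of subClassOf, written in plain Python rather than SWRL. The class hierarchy is an assumed reading of the RDF diagram and is not confirmed by the slides; the function name is hypothetical.

```python
def subclass_closure(direct):
    """Apply the rule: (a subClassOf b) and (b subClassOf c) => (a subClassOf c).

    `direct` is a set of (subclass, superclass) pairs; the result also
    contains every pair the rule can derive from them.
    """
    closure = set(direct)
    changed = True
    while changed:
        changed = False
        for a, b in list(closure):
            for c, d in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure


# Assumed hierarchy, for illustration only.
facts = {
    ("Cow", "Cattle"),
    ("Cattle", "Domestic Animal"),
    ("Domestic Animal", "Animals"),
}
inferred = subclass_closure(facts)
```

With these facts, a search for documents about Animals could also return documents indexed under Cow, although that link is never stated directly.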


Unstructured: Uncontrolled Vocabulary
Structured: Controlled Vocabulary in formal syntax (RDF)
Semantic: Ontology-based information search & retrieval


References

1. Alistair Miles & Dan Brickley. SKOS Core Guide. W3C Working Draft, 10 May 2005. http://www.w3.org/TR/swbp-skos-core-guide

2. Alistair Miles & Dan Brickley. SKOS Core Vocabulary Specification. W3C Working Draft, 2 November 2005. http://www.w3.org/TR/swbp-skos-core-spec/

3. Dr. Leo Obrst. Presentation on the ontology spectrum & semantic models. MITRE, Information Semantics Group, January 12 & 19, 2006. http://ontolog.cim3.net/file/resource/presentation/LeoObrst_20060112/OntologySpectrumSemanticModels--LeoObrst_20060112.ppt


Thank You

Devika P. Madalli


Nabonita Guha
