FAO http://www.fao.org/aims/ Semantic Technologies at FAO. International Society for Knowledge Organization (ISKO), 3 April 2009, Torino. Margherita Sini

Semantic Technologies at FAO

DESCRIPTION

Several projects related to semantic technologies, completed or ongoing at FAO

Page 1: Semantic Technologies at FAO

FAO

http://www.fao.org/aims/

Semantic Technologies at FAO

International Society for Knowledge Organization (ISKO), 3 April 2009, Torino

Margherita Sini

Page 2: Semantic Technologies at FAO

A few words about myself

Page 3: Semantic Technologies at FAO

Just a very rapid introduction

• What?
– semantic, semantic web, semantic technologies
– ontologies, Knowledge Organization Systems
– metadata

• Why?
– interoperability, exchange, share
– user orientation, precision and recall
– multilinguality, cultural views, context

• Who?
– everybody, all domains, all countries, all .org

• Which instruments?
– experts, NLP, methodologies and techniques

Page 4: Semantic Technologies at FAO

Outline

• Semantic projects involving FAO
– AOS
– IPFSAPH, FNA, CWR, Fisheries, Food & nutrition, Geopolitical ontology, AGROVOC Concept Server
– Thai Rice Onto, Agropedia Indica

• Methods and Methodologies
– Ontology models (AGROVOC Concept Server, LIR, ...)
– Methods

• What’s next
– registries: concepts, relationships
– guidelines
– networked ontologies
– ontology-based applications
– collaborations

Page 5: Semantic Technologies at FAO

Semantic projects involving FAO

Page 6: Semantic Technologies at FAO

Why AOS

vessel? craft? boat? bateaux? barco?

ship or container

Terminology brokering

Semantic navigation, Clustering, Ranking, ...

Intelligent query expansion

Inferencing, Reasoning

Machine learning

Interoperability

Page 7: Semantic Technologies at FAO

Agricultural Ontology Service

• An FAO initiative for more coherence in Agricultural Information Systems
• Need for a semantic approach
• AOS elements:
– AGROVOC Concept Server
– KOS registry
– Mapping registries
– Metadata standards
– Tools
– Publications (guidelines, ...)

AGROVOC Concept Server
• Built from AGROVOC
• Domain concepts
• Categories

Ontology registry
– Sub-domain ontologies
– Metadata ontologies

Page 8: Semantic Technologies at FAO

IPFSAPH

Page 9: Semantic Technologies at FAO

IPFSAPH

Page 10: Semantic Technologies at FAO

The Ontology

Page 11: Semantic Technologies at FAO

Creation of the core ontology

1600 concepts

Information Resources
• Brainstorming
• Codex Alimentarius
• SPS Agreement

Ontology

Ontology Editor (OI-Modeler)

subject specialists

AGROVOC

Food Safety Documents

Generic Documents

Page 12: Semantic Technologies at FAO

Concept Search

The same records will be retrieved regardless of the specific synonyms or singular/plural forms that the user uses to refer to a concept.

Related concepts
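
The concept search behaviour described on this slide can be pictured as a lookup from normalised labels to a single concept identifier. The following is only a minimal sketch under stated assumptions, not the IPFSAPH implementation: the concept ID `c_bse`, the label table, and the naive plural handling are all invented for illustration.

```python
# Hypothetical label index: every synonym or translation points to one concept ID.
LABELS = {
    "bovine spongiform encephalopathy": "c_bse",
    "bse": "c_bse",
    "mad cow disease": "c_bse",
    "encefalopatia espongiforme bovina": "c_bse",  # Spanish label (illustrative)
}


def normalise(text: str) -> str:
    """Lower-case, collapse whitespace, and drop a trailing plural 's'
    when the singular form is a known label (deliberately naive)."""
    text = " ".join(text.lower().split())
    if text.endswith("s") and text[:-1] in LABELS:
        return text[:-1]
    return text


def resolve(query: str):
    """Return the concept ID for a query string, or None if it is unknown."""
    return LABELS.get(normalise(query))
```

With this table, `resolve("Mad Cow Diseases")` and `resolve("BSE")` both return the same concept ID, so a search keyed on the concept would retrieve the same records for either wording.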

Page 13: Semantic Technologies at FAO

Multilinguality

The system is also able to understand a concept even when different languages are used.

Page 14: Semantic Technologies at FAO

Check spelling

Spelling errors are corrected, e.g. “desease” is changed to “disease”.

Page 15: Semantic Technologies at FAO

Paraphrasing

“mad cow disease symptoms” or

“clinical signs of bovine spongiform encephalopathy”

Page 16: Semantic Technologies at FAO

give the same results, which are ranked.

Page 17: Semantic Technologies at FAO

Semantic navigation of the bibliographical metadata (1)

Page 18: Semantic Technologies at FAO

Semantic navigation of the bibliographical metadata (2)

Page 19: Semantic Technologies at FAO

Semantic Navigation of Knowledge

parent concept(s)

child concept(s)

Page 20: Semantic Technologies at FAO

FNA

Page 21: Semantic Technologies at FAO

Page 22: Semantic Technologies at FAO

Creation of the core ontology

BIBLIOGRAPHIC DATABASE

CORPORATE DOCUMENT REPOSITORY DATABASE

MERGE RECORDS + TRANSFORM TO RDFS

Ontology Editor (OI-Modeler): maintain

Page 23: Semantic Technologies at FAO

Ontology Relationships

Page 24: Semantic Technologies at FAO

The ontology concepts

• Publication

• Issue

• Work

• Article

• Subject Term

• Category

• Author

• Region

• Language

• Year

Page 25: Semantic Technologies at FAO

The ontology instances

Page 26: Semantic Technologies at FAO

Features

• Multilingual concept resolution

• Get suggestions for the navigation (e.g. synonyms)

• Guided query formulation

• Easy navigation of the objects by following the semantic links

Page 27: Semantic Technologies at FAO

What is possible to do

• Improve browsing:
– e.g. continent / regions / countries
– e.g. link AGRIS/CARIS categories with keywords
– concept identification through natural language processing: spell checking, parsing (e.g. “Dietary guidelines for human nutrition” or “Anaemia in children”)

• Perform some inferencing:
– get the authors associated with specific keywords or vice versa (“what an author wrote about between two years”, “who wrote about famine in 1999”, etc.)
– get the co-authors
– show articles with the same set or related keywords
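
The inferencing questions listed on this slide can be imagined as simple queries over article metadata once authors, keywords, and years are linked. The sketch below is a hedged illustration only: the `Article` record, the sample data, and the function names are invented, and a real system would query an ontology store rather than a Python list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Article:
    title: str
    authors: tuple
    keywords: frozenset
    year: int


# Invented sample records, for illustration only.
ARTICLES = [
    Article("A", ("Rossi", "Khan"), frozenset({"famine", "food security"}), 1999),
    Article("B", ("Khan",), frozenset({"anaemia", "children"}), 2001),
    Article("C", ("Rossi", "Lee"), frozenset({"famine"}), 2003),
]


def who_wrote_about(keyword, year):
    """Authors of articles carrying `keyword` and published in `year`."""
    return sorted({a for art in ARTICLES
                   if keyword in art.keywords and art.year == year
                   for a in art.authors})


def topics_of(author, start, end):
    """Keywords an author wrote about between two years (inclusive)."""
    return sorted({k for art in ARTICLES
                   if author in art.authors and start <= art.year <= end
                   for k in art.keywords})


def co_authors(author):
    """Everyone who shares at least one article with `author`."""
    return sorted({a for art in ARTICLES if author in art.authors
                   for a in art.authors} - {author})
```

For example, `who_wrote_about("famine", 1999)` answers the slide's question “who wrote about famine in 1999” for the sample data.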

Page 28: Semantic Technologies at FAO

RDFa

Page 29: Semantic Technologies at FAO

CWR

Page 30: Semantic Technologies at FAO

Hierarchy

Page 31: Semantic Technologies at FAO

The project

• Undertaken by FAO
• Developed in harmony with the CWR descriptor list
• First version (English only) available by December 2006
• About 800 core terms + acronyms + spelling variants
• Clear definition of concepts (AGROVOC + other sources)
• Relationships: hierarchical + causative

Page 32: Semantic Technologies at FAO

• Knowledge sharing and reuse is now the primary goal of research communities worldwide.

• An ontology defines a common vocabulary; its potential is enormous. For researchers, scientists, extension workers, and decision- and policy-makers who need to share information about a specialized domain, an ontology allows for:

– sharing a common understanding of the structure of the information provided;
– formalizing and reusing the domain knowledge;
– analyzing it; and
– separating it from operational knowledge.

Page 33: Semantic Technologies at FAO

More semantics

Term: wild plants

subclass of plants

superclass of crop wild relatives

adapted by domestication

benefits from resource conservation

Page 34: Semantic Technologies at FAO

From unstructured data to formalized data

“Destruction of forests is leading to the loss of many populations of important wild relatives of fruit, nut and industrial crops such as mango and rubber.”

Page 35: Semantic Technologies at FAO

CWR

• Crop wild relatives ontology (CWR ontology)
• The ontology contains about 400 terms
– grouped into themes (different namespaces used)
• OWL Full

Page 36: Semantic Technologies at FAO

Properties (1/2)

Page 37: Semantic Technologies at FAO

Properties (2/2)

Page 38: Semantic Technologies at FAO

Overall Solution (1/2)

Value-added information services: Aggregated Database View, Subject specific Portals, Information System (n), News feed service, …

Shared layer of interoperability: Common exchange layer (Vocabularies, Ontologies, RDF/XML)

Distributed Datasets: dataset 1, dataset 2, …, dataset n

Page 39: Semantic Technologies at FAO

Overall Solution (2/2)

Data

Page 40: Semantic Technologies at FAO

Fisheries

Page 41: Semantic Technologies at FAO

Fisheries Ontologies

• The initial goal: making information interchangeable between ASFA, FIGIS, OneFish and AGROVOC

• The approach:
– creating an ontology, integrating or mapping the 3 different systems + AGROVOC
– linking the ontology through wrappers to the different information systems

• Evolution: NeOn

Page 42: Semantic Technologies at FAO

Fisheries Ontologies (2/2)

[Figure: concept map with the source systems OneFish, FIGIS, AGROVOC and ASFA]

Thesaurus terms around “Aquaculture”, linked by BROADER TERM and RELATED TERM relationships: Brackishwater Aquaculture, Marine Aquaculture, Freshwater Aquaculture, Aquaculture Development, Aquaculture Facilities, Aquaculture Economics, Aquaculture Engineering

Ontology concepts: Aquaculture Resource, Water Area, land, strains, Species, life cycle, Farming system, management system, Production center, Spawning technique, Breeding technique, Hatchery technique, Expl. form, Regulation, Farming technique, Environment, Institution, Health monitoring technique, diseases, suppliers

Page 43: Semantic Technologies at FAO

Features

1. Form versus meaning:
a) Traditional Search
b) Concept Search

2. Implemented functionalities:
a) synonym search
b) multilingual capability
c) terminology brokering
d) disambiguation
e) related concepts
f) query expansion

3. Basic natural language queries

4. Semantic navigation of bibliographical metadata

5. Semantic Navigation of Knowledge
a) Alphabetic list ...
b) Core Fishery Concepts ...

Page 44: Semantic Technologies at FAO

Ontology properties

|Fishing Grounds@ag| PARTICIPANT-IN |Fishing Operations@ag|
|Sponges@ag| PARTICIPANT-IN |Sponge Fisheries@ag|
|Pearls@ag| PARTICIPANT-IN |Pearl Culture@ag|
|Pearls@ag| PARTICIPANT-IN |Pearl Fisheries@ag|
|Silk@ag| PARTICIPANT-IN |Sericulture@ag|
|Small Enterprises@ag| PARTICIPANT-IN |Artisanal Fisheries@ag|
|Dairy Farms@ag| PARTICIPANT-IN |Milk Production@ag|
|Fish Farms@ag| PARTICIPANT-IN |Aquaculture@ag|
|Fish Farms@ag| PARTICIPANT-IN |Fish Feeding@ag|
|Fish Farms@ag| PARTICIPANT-IN |Fisheries@ag|
|Natural Resources@ag| PARTICIPANT-IN |Resource Conservation@ag|
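
Statements written in this pipe-delimited notation can be read as subject, property, object triples. The sketch below shows one possible way to parse them; it assumes the `@ag` suffix is a vocabulary tag that can be dropped, which the slide does not state, and the regular expression and function name are illustrative.

```python
import re

# One statement looks like: |Subject@tag| PROPERTY |Object@tag|
STATEMENT = re.compile(
    r"\|(?P<subj>[^|@]+)@\w+\|\s*(?P<pred>[A-Z-]+)\s*\|(?P<obj>[^|@]+)@\w+\|"
)


def parse(text):
    """Return (subject, property, object) triples found in `text`."""
    return [(m["subj"], m["pred"], m["obj"]) for m in STATEMENT.finditer(text)]


sample = (
    "|Fish Farms@ag| PARTICIPANT-IN |Aquaculture@ag|"
    "|Fish Farms@ag| PARTICIPANT-IN |Fisheries@ag|"
)
triples = parse(sample)
```

The parser works whether the statements are run together, as in `sample`, or placed one per line.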

Page 45: Semantic Technologies at FAO

Example

• "tell me what vessels from a nearby country are currently in the marine area 50N060W within Atlantic Ocean, provided that also some Thunnus alalunga stock can be fished by those vessels, through allowed techniques"

Page 46: Semantic Technologies at FAO

Using multilingual lexicalizations

ENGLISH

SPANISH

FRENCH

Page 47: Semantic Technologies at FAO

Using hierarchically related concepts

hierarchically related concept

Polyvalent Trawlers

Page 48: Semantic Technologies at FAO

Using non-hierarchically related concepts

non-hierarchically related concept

gears
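
The three expansion strategies shown on the last slides (multilingual lexicalizations, hierarchically related concepts, non-hierarchically related concepts) could be combined roughly as follows. This is a sketch under assumptions: the concept identifiers, the dictionary layout, and the `expand` function are invented, and only the labels themselves are taken from the slides.

```python
# Hypothetical concept records; only the labels come from the slides.
CONCEPTS = {
    "fishing_vessels": {
        "labels": {
            "en": ["fishing vessels", "fishing boat"],
            "fr": ["navire de pêche", "bateau de pêche"],
            "es": ["embarcaciones de pesca"],
        },
        "narrower": ["polyvalent_trawlers"],  # hierarchically related
        "related": ["gears"],                 # non-hierarchically related
    },
    "polyvalent_trawlers": {
        "labels": {"en": ["Polyvalent Trawlers"]}, "narrower": [], "related": [],
    },
    "gears": {"labels": {"en": ["gears"]}, "narrower": [], "related": []},
}


def expand(concept_id, languages=("en",), include_related=False):
    """Collect search terms for a concept, its narrower concepts and,
    optionally, its related concepts, in the requested languages."""
    ids = [concept_id] + CONCEPTS[concept_id]["narrower"]
    if include_related:
        ids += CONCEPTS[concept_id]["related"]
    terms = []
    for cid in ids:
        for lang in languages:
            terms.extend(CONCEPTS[cid]["labels"].get(lang, []))
    return terms
```

Calling `expand("fishing_vessels", languages=("en", "fr"), include_related=True)` yields English and French terms for the concept plus the labels of its narrower and related concepts, which a search front end could offer to the user as refinements.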

Page 49: Semantic Technologies at FAO

Help the user formulate queries

Original query: bateau de pêche

To refine your query, click on the concepts you are interested in. They will appear to the left.

Search:

Page 50: Semantic Technologies at FAO

Enhancement of terminologies

Page 51: Semantic Technologies at FAO

Reconcile different vocabularies

“navire de pêche”, “fishing vessel”, “embarcaciones de pesca”

AGROVOC or ASFA or other

“fishing vessels”, “fishing boat”

AGROVOC: “fishing vessels”, “barco”, etc.

ASFA: “fishing vessels”

Page 52: Semantic Technologies at FAO

Semantic Navigation of Knowledge: a) Thesaurus based

The originating thesaurus is highlighted. The user can select a specific thesaurus to search.

Page 53: Semantic Technologies at FAO

Geopolitical ontology

Page 54: Semantic Technologies at FAO

Geopolitical ontology

• Incorporate geopolitical data

• Will serve as a bridge to allow communication between the various systems.

Page 55: Semantic Technologies at FAO

Properties

isValidFrom

hasOfficialName

hasCode

isSuccessorOf

hasBorderWith

dependsOn

Page 56: Semantic Technologies at FAO

Nutrition Ontology

Page 57: Semantic Technologies at FAO

Page 58: Semantic Technologies at FAO

Procedure

=CONCATENATE("<owl:Class rdf:ID=""",J2,"""><rdfs:subClassOf><owl:Class rdf:ID=""c_",B2,"""/></rdfs:subClassOf><rdfs:label xml:lang=""en""><![CDATA[",D2,"]]></rdfs:label><code><![CDATA[",J2,"]]></code><TAGNAME><![CDATA[",J2,"]]></TAGNAME>",S2, T2,"</owl:Class>")

<?xml version="1.0"?>
<rdf:RDF
    xmlns="http://www.fao.org/aos/infoods#"
    xmlns:protege="http://protege.stanford.edu/plugins/owl/protege#"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#"
    xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
    xmlns:owl="http://www.w3.org/2002/07/owl#"
    xmlns:daml="http://www.daml.org/2001/03/daml+oil#"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xml:base="http://www.fao.org/aos/infoods">
  <owl:Ontology rdf:about="">
    <owl:imports rdf:resource="http://protege.stanford.edu/plugins/owl/protege"/>
    <owl:versionInfo rdf:datatype="http://www.w3.org/2001/XMLSchema#string">Revision 4.0</owl:versionInfo>
    <protege:defaultLanguage rdf:datatype="http://www.w3.org/2001/XMLSchema#string">en</protege:defaultLanguage>
    <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string">International Network of Food Data Systems (INFOODS) was established in 1984 on the basis of the recommendations of an international group convened under the auspices of the United Nations University (UNU). Its goal was to .....</rdfs:comment>
  </owl:Ontology>
  <owl:Class rdf:ID="c_0413">
    <code rdf:datatype="http://www.w3.org/2001/XMLSchema#string">0413</code>
    <rdfs:subClassOf>
      <owl:Class rdf:ID="c_041"/>
    </rdfs:subClassOf>
    <rdfs:label xml:lang="en">Vitamin D</rdfs:label>
  </owl:Class>
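
The spreadsheet formula above builds each owl:Class element by concatenating cell values. The same idea can be sketched in Python; this is an illustration rather than the procedure actually used. The mapping of spreadsheet columns to `code`, `parent_code` and `label` is an assumption, the function name is hypothetical, and the TAGNAME element and the extra columns S2 and T2 from the formula are left out.

```python
from xml.sax.saxutils import escape, quoteattr


def owl_class(code: str, parent_code: str, label: str, lang: str = "en") -> str:
    """Build one owl:Class element as text from a row of tabular data.

    Values are escaped so that characters such as '&' or '<' in a label
    cannot break the generated XML.
    """
    return (
        f"<owl:Class rdf:ID={quoteattr('c_' + code)}>"
        f"<code>{escape(code)}</code>"
        f"<rdfs:subClassOf><owl:Class rdf:ID={quoteattr('c_' + parent_code)}/>"
        f"</rdfs:subClassOf>"
        f"<rdfs:label xml:lang={quoteattr(lang)}>{escape(label)}</rdfs:label>"
        "</owl:Class>"
    )


# One row per class: (code, parent code, English label).
rows = [("0413", "041", "Vitamin D")]
fragments = [owl_class(*row) for row in rows]
```

The resulting fragments would still need to be wrapped in an rdf:RDF element with the namespace declarations shown in the excerpt above before they form a complete document.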

Page 59: Semantic Technologies at FAO

AGROVOC

Page 60: Semantic Technologies at FAO

AOS Core: the Concept Server

Terminology Workbench

AGROVOC OWL

Export: AGROVOC RDFS formats (e.g. SKOS) and TagText, ISO2709

Integration: other thesauri and terminologies
ABACA NT1 Food NT2 Apple
ANIMAL BT Organ NT ....

Mapping: other thesauri & terminologies
ABACA NT1 Food NT2 Apple
ANIMAL BT Organ NT ....

Page 61: Semantic Technologies at FAO

Concept Server project

• Refine semantics and enrich data pool and lexicon

• Develop a workbench for terminology and ontology development and maintenance.

• Support information management specialists in the development, maintenance, and quality assurance of the AOS/CS

• Global knowledge vs local knowledge

Page 62: Semantic Technologies at FAO

AGROVOC Concept Server

• AGROVOC cleaning and refinement

Current AGROVOC (MySQL)

Improved AGROVOC (MySQL)

AGROVOC OWL

Revision and Refinement

Page 63: Semantic Technologies at FAO

How to obtain more semantics

MAIZE
UF corn
NT flint maize
NT popcorn
NT sweet corn

MILK
NT Milk Fat
NT Colostrum
NT Cow Milk

International Fund for Agricultural Development
UF IFAD

MAIZE
synonym corn
superclass-of flint maize
used-to-make popcorn
hybridized-into sweet corn

MILK
ingredient Milk Fat
ingredient Colostrum
superclass-of Cow Milk

International Fund for Agricultural Development
acronym IFAD
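
The slide contrasts generic thesaurus relationships (UF, NT) with the more specific relationships of an ontology. One way to picture the refinement step is a default mapping plus manual overrides chosen by a subject specialist. The sketch below is an assumption about how this could be organised, not a description of the AGROVOC tooling; the relationship names and examples come from the slide, while the data structures and the `refine` function are invented.

```python
# Thesaurus statements from the slide: (term, relation, target).
THESAURUS = [
    ("MAIZE", "UF", "corn"),
    ("MAIZE", "NT", "flint maize"),
    ("MAIZE", "NT", "popcorn"),
    ("MAIZE", "NT", "sweet corn"),
    ("MILK", "NT", "Milk Fat"),
    ("MILK", "NT", "Colostrum"),
    ("MILK", "NT", "Cow Milk"),
]

# Generic fallbacks when no specialist decision exists (assumed defaults).
DEFAULTS = {"UF": "synonym", "NT": "superclass-of"}

# Manual refinements, as shown on the slide.
REFINEMENTS = {
    ("MAIZE", "popcorn"): "used-to-make",
    ("MAIZE", "sweet corn"): "hybridized-into",
    ("MILK", "Milk Fat"): "ingredient",
    ("MILK", "Colostrum"): "ingredient",
}


def refine(statements):
    """Replace each generic relation by its refined or default counterpart."""
    return [(s, REFINEMENTS.get((s, o), DEFAULTS[rel]), o)
            for s, rel, o in statements]
```

The design point is that the refinement table carries the human judgement, so it can be reviewed and extended independently of the thesaurus data.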

Page 64: Semantic Technologies at FAO

Modelling

– Conversion to UTF-8
– Migration to MySQL (from SQL Server)
– Migration to PostgreSQL (from MySQL)
– Incorporated AGRIS/CARIS classification scheme (multilingual) and the mapping with AGROVOC keywords
– Modified structure to store multiple classification schemes
– Revised RDBMS scheme for ontology representation
– Designed OWL models
– Export to OWL format (v0.8a)
– Export to SKOS format (v0.8a)

Page 65: Semantic Technologies at FAO

Methods

• Concepts from descriptors

• Synonym
<owl:DatatypeProperty rdf:ID="synonym">

• Acronyms
<owl:AnnotationProperty rdf:about="http://www.fao.org/aos/agrovoc/2005#acronym">

<owl:Class rdf:about="http://www.fao.org/aos/agrovoc/2005#c_3">
  <rdfs:label xml:lang="en">ABA</rdfs:label>
  <rdfs:label xml:lang="fr">ABA</rdfs:label>
  <rdfs:label xml:lang="es">ABA</rdfs:label>
  <rdfs:label xml:lang="ar">آبا</rdfs:label>
  <rdfs:label xml:lang="zh">脱落酸</rdfs:label>
  <synonym xml:lang="en">[8565] Abscisic acid</synonym>
  <rdfs:subClassOf rdf:resource="http://www.fao.org/aos/agrovoc/2005#c_3397"/>
  <rdfs:subClassOf rdf:resource="http://www.fao.org/aos/agrovoc/2005#c_32543"/>
</owl:Class>

Page 66: Semantic Technologies at FAO

SKOS

• SKOS export from AGROVOC Concept Server Workbench (WB)

• SKOS web services
– SKOS-services for DSpace plug-in

• SKOS for mapping projects

Page 67: Semantic Technologies at FAO

AGROVOC CS Workbench

triple store

SKOS

export

maintain

Web Services

access

access response

Page 68: Semantic Technologies at FAO

Ontology models (AGROVOC Concept Server, LIR, ...)

Concept level: Concept; Relationships between concepts

designated by

Term level: Lexicalization/Term; Relationships between terms

manifested as

String level: String; Relationships between strings

Other information: language/culture, subvocabulary/scope, audience, type, etc.

Note (annotation relationship)

Relationship; Relationships between Relationships

All terms are created as instances of the class o_terms, all at the same level. Only one language per term.
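
The three levels of this model (concept, term, string) can be sketched as plain data classes. The class and field names below are illustrative assumptions, as is the sample concept; only the level names, the “designated by” and “manifested as” links, and the one-language-per-term rule come from the slide.

```python
from dataclasses import dataclass, field


@dataclass
class StringForm:
    """String level: one concrete spelling of a term."""
    text: str


@dataclass
class Term:
    """Term level: a lexicalization in exactly one language."""
    language: str
    forms: list = field(default_factory=list)    # "manifested as" strings


@dataclass
class Concept:
    """Concept level: language-independent unit of meaning."""
    concept_id: str
    terms: list = field(default_factory=list)    # "designated by" terms
    related: list = field(default_factory=list)  # (relationship, concept_id)

    def labels(self, language):
        """All string forms of this concept in the given language."""
        return [f.text for t in self.terms if t.language == language
                for f in t.forms]


# Hypothetical example instance.
maize = Concept(
    "c_maize",
    terms=[
        Term("en", [StringForm("maize"), StringForm("Maize")]),
        Term("fr", [StringForm("maïs")]),
    ],
    related=[("used-to-make", "c_popcorn")],
)
```

Keeping relationships at each level separate is what lets the model say, for instance, that two strings are spelling variants without implying anything about the concepts they express.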

Page 69: Semantic Technologies at FAO

Agropedia Indica

Page 70: Semantic Technologies at FAO

References

• http://www.slideshare.net/marghe_rita/1-pantnagar

• http://www.slideshare.net/marghe_rita/2-pantnagar-w-guidelines

• http://www.slideshare.net/marghe_rita/3-pantnagar-w-exercices

• http://agropedia.iitk.ac.in/

Page 71: Semantic Technologies at FAO

Thai Rice Ontology

Page 72: Semantic Technologies at FAO

Plant ontology: Relationship types

Taxon <hasSuperclass> Taxon

Taxon <hasGrowthType> GrowthType

Taxon <hasPropagationMethod> PropagationMethod

Taxon <occursIn> Environment

Taxon <hasPest> Taxon

Taxon <hasDisease> Disease

Disease <causedBy> Taxon

TaxonPart <isa> AnatomicalPart

TaxonPart <isa> AnatomicalTypeOfFruit

TaxonPart <partOf> Taxon

TaxonPart <usedAs> Use

TaxonPart <usedToMake> ProductType

Taxon <hasDescription> Text

Page 73: Semantic Technologies at FAO

Thai plant ontology: Example

Mangifera indica Linn. <hasSuperclass> Mangifera

Mangifera indica Linn. <hasGrowthType> tree

Mangifera indica Linn. <hasPropagationMethod> seedling

Mangifera indica Linn. <hasDescription> "leaves ...., flower ......"

Mangifera indica Linn. <occursIn> dry soil

Mangifera indica Linn. <hasPest> Scirtothrips dorsalis Hood

Mangifera indica Linn. <hasPest> Oidium mangiferae

OR, instead of the last statement or in addition to it:

Mangifera indica Linn. <hasDisease> Powdery mildew

Powdery mildew <causedBy> Oidium mangiferae
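
The alternative modelling in the last two statements suggests that a pest relationship could be derived from a disease and its cause. The following sketch stores a few of the statements as triples and applies that rule; the rule itself and the helper names are assumptions made for illustration, not part of the Thai plant ontology as presented.

```python
# A few statements from the example, stored as (subject, property, object).
TRIPLES = {
    ("Mangifera indica Linn.", "hasSuperclass", "Mangifera"),
    ("Mangifera indica Linn.", "hasGrowthType", "tree"),
    ("Mangifera indica Linn.", "hasDisease", "Powdery mildew"),
    ("Powdery mildew", "causedBy", "Oidium mangiferae"),
}


def objects(subject, prop, triples=TRIPLES):
    """All objects linked to `subject` through `prop`."""
    return sorted(o for s, p, o in triples if s == subject and p == prop)


def infer_pests(triples):
    """Illustrative rule: if a taxon hasDisease D and D is causedBy X,
    add the statement that the taxon hasPest X."""
    inferred = {
        (taxon, "hasPest", cause)
        for taxon, p1, disease in triples if p1 == "hasDisease"
        for d2, p2, cause in triples if p2 == "causedBy" and d2 == disease
    }
    return triples | inferred
```

Applying `infer_pests(TRIPLES)` adds the hasPest statement for Oidium mangiferae, so the two modelling options on the slide lead to the same answer for a pest query.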

Page 74: Semantic Technologies at FAO

Conclusions

Page 75: Semantic Technologies at FAO

Ontology-based applications

• Networked Ontologies

• Better exploitation of the potentiality at the application level: powerful IR

• No more words but URIs in IS

• Ontology Web services (OWS)

Page 76: Semantic Technologies at FAO

Collaborations

• With AOS partners
• Within EU Projects
– NeOn
– SEMIC.EU
• With other initiatives
– GFIS
– Ecoterm
• Mapping projects
• GBIF (Global Biodiversity Information Facility) secretariat
• JRC + BGS + Biblioteca Nazionale di Firenze

Page 77: Semantic Technologies at FAO

Take-home message

• There are many uses for terminology & ontology systems in food and agriculture, both for information access and information processing

• FAO has several projects using such systems

• FAO is deploying the Agricultural Ontology Service (AOS) as a global resource

• SKOS and other knowledge representation standards play a key role

Page 78: Semantic Technologies at FAO

Questions?

Thanks

Margherita Sini: [email protected]

Keizer: [email protected]

Dagobert Soergel: [email protected]

Kawtrakul: [email protected]

But also: Gudrun Johannsen, Boris Lauser, Claudio Baldassarre, Gauri Salokhe, Marta Iglesias, Caterina Caracciolo, Sachit Rajbhandari, Jeetendra Singh, Mary Redahan, Shrestha, Prashanta, Ton, Imm, Thanapth, Trakul, and many others...