12
User-Friendly Ontology Authoring Using a Controlled Language Valentin Tablan , Tamara Polajnar Hamish Cunningham, Kalina Bontcheva NLP Research Group University of Sheffield Regent Court, 211 Portobello Street, Sheffield, S1 4DP, UK http://nlp.shef.ac.uk , http://gate.ac.uk

User-Friendly Ontology Authoring Using a Controlled Language

  • Upload
    istas

  • View
    24

  • Download
    1

Embed Size (px)

DESCRIPTION

User-Friendly Ontology Authoring Using a Controlled Language. Valentin Tablan , Tamara Polajnar Hamish Cunningham, Kalina Bontcheva NLP Research Group University of Sheffield Regent Court, 211 Portobello Street, Sheffield, S1 4DP, UK http://nlp.shef.ac.uk , http://gate.ac.uk. Motivation. - PowerPoint PPT Presentation

Citation preview

Page 1: User-Friendly Ontology Authoring Using a Controlled Language

User-Friendly Ontology Authoring Using a Controlled Language

Valentin Tablan, Tamara PolajnarHamish Cunningham, Kalina Bontcheva

NLP Research GroupUniversity of Sheffield

Regent Court, 211 Portobello Street,Sheffield, S1 4DP, UK

http://nlp.shef.ac.uk, http://gate.ac.uk

Page 2: User-Friendly Ontology Authoring Using a Controlled Language

2LREC 2006, Genoa, Italy http://gate.ac.uk

Motivation

Ontologies starting to be used in many NLP applications for: encoding system ‘knowledge’; storing results.

Current standards (RDF-S, OWL) are complex: Large number of features supported; Steep learning curve; Training required; Authoring tools (e.g. Protégé) complicated and difficult

to use by non-specialists.

Page 3: User-Friendly Ontology Authoring Using a Controlled Language

3LREC 2006, Genoa, Italy http://gate.ac.uk

Motivation (continued)

Ontological requirements for NLP applications usually simple:Taxonomy of classes;Hierarchy of properties; Instances.

Graphical tools difficult to embed in a text-based pipelines (e.g. wikis, existing NLP apps, other web set-ups).

Page 4: User-Friendly Ontology Authoring Using a Controlled Language

4LREC 2006, Genoa, Italy http://gate.ac.uk

Controlled Languages

Good compromise between structured data and natural language:Feels [almost] natural to humans;Can be ‘understood’ by machines.

People find it easy to ‘put into words’ ontological information (which they may find difficult to do with a specialised tool).

Used before for automating translation (e.g. Caterpillar and Boeing).

Page 5: User-Friendly Ontology Authoring Using a Controlled Language

5LREC 2006, Genoa, Italy http://gate.ac.uk

Round-Trip Authoring

CL Text

Very little or no training necessary (learning by example).

Can be used to extend existing ontologies or create new ones.

Limited number of syntactical constructs.

Open vocabulary.

CLIE

Generation

Page 6: User-Friendly Ontology Authoring Using a Controlled Language

6LREC 2006, Genoa, Italy http://gate.ac.uk

Controlled LanguageNew Class There are … There are pets and owners.

Subclass_of … is a type of … Cat is a type of pet.

Cats and dogs are types of pet.

New instance … is a … Tabatha is a pet.

New object property

… [can] have … Owners have pets.

New datatype property

… [can] have textual …

Pets can have textual nickname.

Property value … has … John has Tabatha.

… has <property> … with value …

Tabatha has nickname with value “Tabby”.

Page 7: User-Friendly Ontology Authoring Using a Controlled Language

7LREC 2006, Genoa, Italy http://gate.ac.uk

An Example

There are pets and owners. Cat is a type of pet.

Tabatha is a cat. John is an owner.

Owners have pets. Pets can have textual nickname.

John has Tabatha. Tabatha has nickname with value "Tabby".

Page 8: User-Friendly Ontology Authoring Using a Controlled Language

8LREC 2006, Genoa, Italy http://gate.ac.uk

From Text to Ontologies

CLText

Tokeniser POS Tagger Morph Quote Finder

Key-phrase NP Chunker CLIE Parser

Page 9: User-Friendly Ontology Authoring Using a Controlled Language

9LREC 2006, Genoa, Italy http://gate.ac.uk

Closing the Loop

Generating CL text from ontologies: Generate triples. Match triples to generation templates. Group similar triples. Generate sentences for each group of

triples.

Page 10: User-Friendly Ontology Authoring Using a Controlled Language

10LREC 2006, Genoa, Italy http://gate.ac.uk

<Pet, rdf:type, owl:Class> <in> <triple id="t1"> <property ns="rdf" name="type"/> <object ns="owl" name="Class"/> </triple> </in><out> <singular> <phrase>There are <ref ref="t1.subject" number="plural"/>.

</phrase> </singular> <plural> <phrase>There are <ref ref="t1.subject" number="plural"/>. </phrase> </plural></out>

Page 11: User-Friendly Ontology Authoring Using a Controlled Language

11LREC 2006, Genoa, Italy http://gate.ac.uk

Conclusions

Simple way of editing ontologies. Standards compliant (through GATE’s

ontology support I/O). No training required. Embeddable in text-only applications. Language could be extended to:

Better cover OWL features;Better cover natural ways of expression.

Page 12: User-Friendly Ontology Authoring Using a Controlled Language

12LREC 2006, Genoa, Italy http://gate.ac.uk

Thank you!

More information:

http://gate.ac.uk

http://nlp.shef.ac.uk