17
Experiences from the NCBO Experiences from the NCBO OBO-to-OWL Mapping Effort OBO-to-OWL Mapping Effort Dilvan A. Moreira, University of São Paulo Mark A. Musen, Stanford University

Experiences from the NCBO OBO-to-OWL Mapping Effort

  • Upload
    wynona

  • View
    28

  • Download
    0

Embed Size (px)

DESCRIPTION

Experiences from the NCBO OBO-to-OWL Mapping Effort. Dilvan A. Moreira, University of São Paulo Mark A. Musen, Stanford University. OBO Format. Evolved from DAG representation initially created for Gene Ontology Adopted by dozens of biomedical ontologies stored in the OBO repository - PowerPoint PPT Presentation

Citation preview

Page 1: Experiences from the NCBO  OBO-to-OWL Mapping Effort

Experiences from the NCBO Experiences from the NCBO OBO-to-OWL Mapping EffortOBO-to-OWL Mapping Effort

Dilvan A. Moreira, University of São Paulo

Mark A. Musen, Stanford University

Page 2: Experiences from the NCBO  OBO-to-OWL Mapping Effort

OBO FormatOBO Format

• Evolved from DAG representation initially created for Gene Ontology

• Adopted by dozens of biomedical ontologies stored in the OBO repository

• Used by most GO-based data analysis tools

• Designed for • human readability• extensibility•minimal redundancy

Page 3: Experiences from the NCBO  OBO-to-OWL Mapping Effort
Page 4: Experiences from the NCBO  OBO-to-OWL Mapping Effort

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

OWL – Web Ontology LanguageOWL – Web Ontology Language

• W3C “recommendation” • Offers important advantages

• A well defined, standardized syntax and semantics• Interoperability with ontologies created in other

domains of science • Growing support from open-source and

commercial tools• Editors• Parsers• Classifiers• Reasoning systems

Page 5: Experiences from the NCBO  OBO-to-OWL Mapping Effort

The NCI Thesaurus in Protégé-OWL

Page 6: Experiences from the NCBO  OBO-to-OWL Mapping Effort

Why map from OBO to OWL?Why map from OBO to OWL?

• Many Bio-Ontologies are modeled in OWL (e.g., NCI Thesaurus, BioPAX, SNOMED-CT)

• OBO format has not been adopted outside the Bio-ontology community, where OWL is the recognized standard

• Bio-Ontologies need to interoperate with other ontologies used throughout e-science

• The standardization of OWL is leading to many commercial ontology-oriented tools that biologists might wish to use

Page 7: Experiences from the NCBO  OBO-to-OWL Mapping Effort
Page 8: Experiences from the NCBO  OBO-to-OWL Mapping Effort

http://bioportal.bioontology.

org

Page 9: Experiences from the NCBO  OBO-to-OWL Mapping Effort

Requirements for mapping Requirements for mapping OBO to OWLOBO to OWL

• Map all OBO format constructs in use to OWL

• Make no assumptions other than those written in the OBO format specification itself

• Ensure no loss of information

• Enable “round trip” conversion (OBO to OWL and back again)

Page 10: Experiences from the NCBO  OBO-to-OWL Mapping Effort

Mapping clarifies the OBO Mapping clarifies the OBO specificationspecification

•OBO syntax is not uniformly documented•No complete BNF exists to define the grammar•Some new constructs lack clear definitions and

have not yet been used by the community

•OBO Semantics sometimes need be clarified using textual style guides that do not adopt formal representations

Page 11: Experiences from the NCBO  OBO-to-OWL Mapping Effort

Both languages support two types of Both languages support two types of elementselements

• Semantic informationdefines classes and relationships about which computers can reason automatically

• Textual properties offer the intended meanings of ontology elements for human consumption; such entries include names, textual definitions, descriptions, usage notes, and so on

Page 12: Experiences from the NCBO  OBO-to-OWL Mapping Effort

Example of MappingExample of Mapping

format-version: 1.0date: 28:11:2006 23:22saved-by: dilvanauto-generated-by: OBO-Edit 1.001default-namespace: test ontologyremark: Modified snipet of the cell ontology

[Term]id: CL:0000000name: celldef: "Minute protoplasmic masses that make up organized tissue." [MESH:A.11]

[Term]id: CL:0000003name: cell in vivois_a: CL:0000000 ! Cell

[Term]id: CL:0000026name: nurse cellrelated_synonym: "nurse cell" []xref_analog: FBbt:00004878is_a: CL:0000003 ! cell in vivorelationship: develops_from CL:0000000 ! Cell

[Typedef]id: develops_fromname: develops_fromis_transitive: true

<?xml version="1.0"?><!DOCTYPE rdf:RDF [ … ]><rdf:RDF … > <owl:Ontology rdf:about=""> <rdfs:comment …> Modified snipet of the cell ontology </…> <oboInOwl:hasDate …> 2006-11-28T23:22:00 </…> <oboInOwl:savedBy …> dilvan </…> <oboInOwl:hasDefaultNamespace …> test ontology </…> </owl:Ontology>

<owl:Class rdf:ID="CL_0000000"> <rdfs:label …> cell </…> <oboInOwl:hasDefinition> <rdf:Description> <rdf:type rdf:resource="&oboInOwl;Definition"/> <rdfs:label …> Minute protoplasmic … tissue.</…> <oboInOwl:hasDbXref> <rdf:Description> <rdf:type rdf:resource="&oboInOwl;DbXref"/> <rdfs:label …>MESH:A.11</…> </rdf:Description> </oboInOwl:hasDbXref> </rdf:Description> </oboInOwl:hasDefinition> </owl:Class>

<owl:Class rdf:ID="CL_0000003"> <rdfs:subClassOf rdf:resource="#CL_0000000"/> <rdfs:label …> cell in vivo </rdfs:label> </owl:Class> …

<owl:ObjectProperty rdf:ID="UNDEFINED_develops_from"> <rdf:type rdf:resource="&owl;TransitiveProperty"/> <rdfs:label …> develops_from </…> </owl:ObjectProperty>…</rdf:RDF>

OBO OWL

Page 13: Experiences from the NCBO  OBO-to-OWL Mapping Effort

OWL Converter Tab for OBO-EditOWL Converter Tab for OBO-Edit

Page 14: Experiences from the NCBO  OBO-to-OWL Mapping Effort

OBO Converter Tab for ProtégéOBO Converter Tab for Protégé

Page 15: Experiences from the NCBO  OBO-to-OWL Mapping Effort

ResultsResults

• We converted all 40 OBO Format files in the OBO repository (November 2006) to OWL with no errors detected by Protégé

• Of these 40 files, 30 were converted back to OBO Format without loss of information, as evidenced by running the obodiff tool

• Of the remaining 10 files that could not make the round trip

• Five had illegal characters in some of their OBO ids• Three used new OBO 1.2 constructs that did not pass

obodiff criteria for equality• Two had syntax errors in their original form

Page 16: Experiences from the NCBO  OBO-to-OWL Mapping Effort

ConclusionsConclusions

• A translator enables interconversion of OBO format and standard OWL

• Underspecification of OBO format has made the development of OBO-to-OWL translators difficult in the past

• Even when the formal specification is incomplete, languages such as OBO format often have de facto specifications based on usage patterns in the community

• Work with the OBO community enabled us to clarify and document underspecified elements of OBO format, enabling implementation of a robust translation system

Page 17: Experiences from the NCBO  OBO-to-OWL Mapping Effort

AcknowledgmentsAcknowledgments

• Stuart Aitkin

• John Day-Richter

• Suzanna Lewis

• Chris Mungall

• Nigam Shah

http://www.bioontology.org/wiki/index.php/OboInOwl:Main_Page

QuickTime™ and aTIFF (LZW) decompressor

are needed to see this picture.