61
Department of Biomedical Informatics Development of Ontology-anchored Grid-based Data Services to Facilitate Integrative Clinical and Translational Science Introduction Rakesh Dhaval The Ohio State University Medical Center

Department of Biomedical Informatics Development of Ontology-anchored Grid-based Data Services to Facilitate Integrative Clinical and Translational Science

Embed Size (px)

Citation preview

Department of Biomedical Informatics

Development of Ontology-anchored Grid-based Data Services to Facilitate Integrative Clinical and Translational

ScienceIntroduction

Rakesh Dhaval

The Ohio State University Medical Center

Department of Biomedical Informatics

Topics

• Use Case• TRIAD• openMDR• TRIAD – openMDR Workflow• Questions

Department of Biomedical Informatics

Use Case

A clinical researcher is conducting an outcomes study in which they want to utilize data collected during the course of standard clinical care, as well as bio-specimen data from the Tissue Bank.

For patients that have consented to have their tissue specimen stored in the Tissue Bank with specific sexGenotype, ethnicity and gender - The researcher is interested in knowing Pre-Operating Room Diagnosis and Procedure, as well as Post-Operating Room Diagnosis and Procedure.

The researcher would like to be able integrate and analyze all of this information as a single data set.

Department of Biomedical Informatics

Bedside-Bench

Bench-Bedside

Basic Sciences

Hypothesis Discovery

Analysis & Dissemination

Pre-Clinical Research

Clinical Practice

Observation &

Surveillance

Clinical Research

Hypothesis Formulation

Translational Research

Translational Research

Department of Biomedical Informatics

Topics

• Use Case• TRIAD• openMDR• TRIAD – openMDR Workflow• Questions

Department of Biomedical Informatics

The Translational Research Informatics and Data Management Grid (TRIAD)

TRI D

Department of Biomedical Informatics

TRIAD Introduction

• TRIAD = CTSA + caGrid Data Service Wizards Federated and Localized Metadata Management

• caBIG - Cancer Biomedical Informatics Grid• Grid - “Controlled and coordinated resource sharing and problem solving in

dynamic, scalable virtual organizations.”• caGrid - Provides the software infrastructure that underlies the tools and

applications of caBIG

TRIAD middleware

Department of Biomedical Informatics

• TRIAD is an instance of caGrid that leverages existing, core components: Service oriented architecture utilizing Axis, Globus, and caGrid extensions. Data model management via the Global Model Exchange. Customizable security infrastructure leveraging GAARDS. Graphical strongly typed grid service creation using Introduce. Extendable and customizable web service infrastructure.

• TRIAD extends caGrid: Additional options for data service creation, security policy, metadata management, etc. Federated and/or local metadata management utilized for semantically annotated grid

service creation “Out of the box” data services for popular tools such as I2B2 and RedCAP Integration with commercial and open-source CTMS platforms

TRIAD Design

TRI D

Department of Biomedical Informatics

Compatibility and Interoperability

• caBIG® provides standards-based compatibility guidelines for creating software systems that are syntactically and semantically interoperable.

Department of Biomedical Informatics

Knowledge Engineering Framework

Borlawsky TB, et al. Development of an Agile Knowledge Engineering Framework in Support of Multi-Disciplinary Translational Research. Proc AMIA Translational Bioinformatics Summit, 2009.

Department of Biomedical Informatics

Example Use Case

Department of Biomedical Informatics

Use Case: Technical Implementation

Department of Biomedical Informatics

Enhanced Data Services

• Data service generation wizards for common data centric applications

Class Model & Assertions

CDEs & Semantics

Department of Biomedical Informatics

Create Semantically Harmonized Data Model Generate Data

Resource

Grid-ify

TRIAD Development Process

Create an Information Model using a modeling

tool

Information Models

Perform Semantic

Integration using the EA Plugin

Vocabularies

Generate Code and Interfaces using the caCORE SDK

Code GeneratorAPIs

Transform the Model

into Metadata using the

Domain Model Gen.CDEs

Generate a Grid Service

using Introduce

Grid

Reference: Dr. Robert Freimuth, Vocabulary Knowledge Center Director

• caCORE is a robust set of tools and resources to support the development of caBIG®-compatible systems

• NCI offers comprehensive training for caCORE tools

Department of Biomedical Informatics

Topics

• Use Case• TRIAD• openMDR• TRIAD – openMDR Workflow• Questions

Department of Biomedical Informatics

OpenMDR-based Federated WorkFlow

openMDR - TRIAD based domain modeling and annotation workflow

Department of Biomedical Informatics

openMDR: Components

Department of Biomedical Informatics

• Metadata Registry - Capable of storing, versioning, and maintaining semantic and representational metadata.

• Utilizing/enhancing the UK cancergrid cgMDR ISO11179 database.

• Built on eXist database.» Metadata is annotated

with conceptual information from a local or remote lexEVS system(s).

» Web based front-end for creating and curating semantic metadata.

openMDR:mdrCORE

Department of Biomedical Informatics

• API and Grid Service for querying across many disparate semantic metadata repositories» caGrid grid service - capable of talking to caDSR and many openMDR systems enabling federated query of common data elements.

» Enhances and wraps the cgMDR mdrConnector in order to parse the received information into a common format.

» caGrid service enables it to be deployed in any local or production grid environment and can be configured and modified to talk to many semantic metadata systems.

openMDR:mdrQuery

Department of Biomedical Informatics

• Semantic Annotation Tool» Enables semantic annotation of federated semantics

» Utilizes MDRQuery service for locating and utilizing CDE’s from multiple semantic metadata sources

» Options for Search Filters» Enables Search within “Context”» Tags are created that identify CDE via local unique id

» No need to leave the EA environment

openMDR:mdrEAPlugin

Department of Biomedical Informatics

• Process an MDR Plugin annotated model and create service metadata which will enable the service to be a semantically annotated grid service and easily discovered» The metadata can be used to generate a data service using the Introduce Data Service Wizard.

openMDR:mdrDomainModelGenerator

Department of Biomedical Informatics

Topics

• Use Case• TRIAD• openMDR• TRIAD – openMDR WorkFlow• Questions

Department of Biomedical Informatics

TRIAD - openMDR Based Service Creation Workflow

openMDR - TRIAD based Service Creation workflow

Department of Biomedical Informatics

① CDE curation in Local instance of openMDR - using concepts from Local or Remote (NCI) LexEVS instance

② Create an information model - create logical model, data model and Map Object Model to Data Model in Enterprise Architect (EA)

③ Annotate logical model with CDE's in EA - using openMDR EA plugin.

④ Export annotated XMI

⑤ Create domain model file (XML file) with grid service metadata

⑥ Generate a system using caCORE SDK 4.X by supplying your UML model as input

⑦ Create Grid Service - Use Introduce grid Service Authoring Toolkit

OpenMDR Service Creation Phases

Department of Biomedical Informatics

Topics

• Use Case• TRIAD• openMDR• TRIAD – openMDR WorkFlow• Questions

Department of Biomedical Informatics

Questions/Comments?

Thank you for your time and attention

TRIAD: Dave Ervin [email protected]

http://wiki.triadcommunity.org

openMDR: Rakesh Dhaval [email protected]

http://cagrid.org/display/mdr

caGrid Knowledge Center: [email protected]

Department of Biomedical Informatics

Development of Ontology-anchored Grid-based Data Services to Facilitate Integrative Clinical and Translational

Science

TRIAD: Walkthrough and Demonstration

Rakesh Dhaval

Applications Architect, The Ohio State University Medical Center

Department of Biomedical Informatics

Topics

• Review of Use Case• Model Development in Enterprise Architect

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• Acknowledgements• Questions

Department of Biomedical Informatics

Use Case

A clinical researcher is conducting an outcomes study in which they are utilizing data collected during the course of standard clinical care, as well as bio-specimen data from the Tissue Bank.

For patients that have consented to have their tissue specimen stored in the Tissue Bank with specific sexGenotype, ethnicity and gender. The researcher is interested in knowing Pre-Operating Room Diagnosis and Procedure, as well as Post-Operating Room Diagnosis and Procedure.

The researcher would like to be able integrate and analyze all of this information as a single data set.

Department of Biomedical Informatics

Topics

• Review of Use Case• UML Model Development

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• TRIAD Portal • Acknowledgements• Questions

Department of Biomedical Informatics

UML Model Development

Create a Logical

Model (UML class

diagram) using

Enterprise ArchitectLogical Model

Create a Data Model (database schema) using

Enterprise Architect

Data Model

Semantically Annotate the UML Model using the openMDR EA Plugin

Semantics

Map the Logical

Model to the Data Model

using caAdapter

Mapping

Model is complete and ready for review and load into caDSR/ openMDR

Load Model

Department of Biomedical Informatics

Import Data Model

• Project > Database Engineering > Import DB Schema from ODBC ...

Department of Biomedical Informatics

Data Model

Department of Biomedical Informatics

UML Model Development

Create a Logical

Model (UML class

diagram) using

Enterprise ArchitectLogical Model

Create a Data Model (database schema) using

Enterprise Architect

Data Model

Semantically Annotate the UML Model using the openMDR EA Plugin

Semantics

Map the Logical

Model to the Data Model

using caAdapter

Mapping

Model is complete and ready for

compatibility review and load into caDSR/ openMDR

Load Model

Department of Biomedical Informatics

Logical Model : Add a Class

• Select Class• Click pointer in diagram workspace

Department of Biomedical Informatics

Logical Model : Enter Class Name

Department of Biomedical Informatics

Logical Model :Add Class Attributes

• Select ‘Details’ tab and click ‘Attributes’ button

Department of Biomedical Informatics

Logical Model : Add Class Attributes

• Enter ‘Name’• Select Java ‘Type’

– lang• Boolean• Character• Double• Float• Integer• Long• String

– util• Date

• Click ‘Save’

Department of Biomedical Informatics

Logical Model :Add Association

• Select ‘Associate’ tool from Class Tools menu

• Drag from Source to Target class

Source Target

Department of Biomedical Informatics

Logical Model :Association Properties

• Double-click association• Specify directionality under

General tab• Specify multiplicity under

Source and Target tabs

Department of Biomedical Informatics

Logical Model

Department of Biomedical Informatics

UML Model Development

Create a Logical

Model (UML class

diagram) using

Enterprise ArchitectLogical Model

Create a Data Model (database schema) using

Enterprise Architect

Data Model

Semantically Annotate the UML Model using the openMDR EA Plugin

Semantics

Map the Logical

Model to the Data Model

using caAdapter

Mapping

Model is complete and ready for review and load into caDSR/ openMDR

Load Model

Department of Biomedical Informatics

ORM : Export XMI

• Right-click on Logical View• Select Import/Export > Export Package to XMI file...

Department of Biomedical Informatics

ORM : Export XMI

Department of Biomedical Informatics

ORM : caAdapter

• Select File > New > Model Mapping Service > Object Model to Data Model Map Specification

Department of Biomedical Informatics

ORM : caAdapter

• Open the XMI file that was exported from EA

Department of Biomedical Informatics

ORM : caAdapter

• Drag and drop to associate objects and tables, attributes and columns

Department of Biomedical Informatics

Topics

• Review of Use Case• UML Model Development

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• TRIAD Portal • Acknowledgements• Questions

Department of Biomedical Informatics

UML Model Development

Create a Logical

Model (UML class

diagram) using

Enterprise ArchitectLogical Model

Create a Data Model (database schema) using

Enterprise Architect

Data Model

Semantically Annotate the UML Model using the openMDR EA Plugin

Semantics

Map the Logical

Model to the Data Model

using caAdapter

Mapping

Model is complete and ready for review and load into caDSR/ openMDR

Load Model

Department of Biomedical Informatics

MetaData:Common Data Elements

Department of Biomedical Informatics

Topics

• Review of Use Case• UML Model Development

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• TRIAD Portal • Acknowledgements• Questions

Department of Biomedical Informatics

Semantic Annotation

• Creating New Data Elements using openMDR Metadata Registry

Demo

Department of Biomedical Informatics

Topics

• Review of Use Case• UML Model Development

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• TRIAD Portal • Acknowledgements• Questions

Department of Biomedical Informatics

Semantic Annotation

• Annotate the Clinical Model in Enterprise Architect Tool

Demo

Department of Biomedical Informatics

Semantic Annotations

• New concept created using openMDR– Logical model:

• Object: Visit• Attribute: dischargeDisposition

– Data element: Discharge Disposition cagrid.org_0edff23f-c0fa-4a18-ac06-97e4893d42e8_0.1

• caTissue link– Logical model:

• Object: Patient• Attribute: studyNumber

– Data element: Participant Identifier java.lang.Long (public ID: 2513658)• Other

– Logical model:• Object: Patient• Attribute: Gender

– Data element: Participant Gender Person Gender Text Type (public ID: 2513661)

Department of Biomedical Informatics

Topics

• Review of Use Case• UML Model Development

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• TRIAD Portal • Acknowledgements• Questions

Department of Biomedical Informatics

TRIAD Portal Demo

• User Interface for Investigator to Query DataTRIAD Portal – Under Development

Demo

Department of Biomedical Informatics

Topics

• Review of Use Case• UML Model Development

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• TRIAD Portal • Acknowledgements• Questions

Department of Biomedical Informatics

Acknowledgements

• CTSA: http://ctsaweb.org/ • NCI caBIG

– caGrid toolset: https://cabig.nci.nih.gov/index_html/sharable/cagrid_overview

– caGrid Knowledge Center: https://cabig-kc.nci.nih.gov/CaGrid/KC/index.php/Main_Page

• Active caGrid Community

Department of Biomedical Informatics

Topics

• Review of Use Case• UML Model Development

– Data Model Development– Logical Model Development– Object Relational Mapping

• Semantic Annotation– Create Data Elements in openMDR– Annotate Model with Data Elements

• TRIAD Portal • Acknowledgements• Questions

Department of Biomedical Informatics

Questions/Comments?

Thank you for your time and attention

TRIAD: Dave Ervin [email protected]

http://wiki.triadcommunity.org

openMDR: Rakesh Dhaval [email protected]

http://cagrid.org/display/mdr

caGrid Knowledge Center: [email protected]