Building a Collaborative Informatics Platform for Translational Research: An IMI Project Experience

Prof. Yike Guo

Department of Computing

Imperial College London

Page 2

Living in the Era of BIG

• Big Data: Massive amounts of information derived from dry/wet lab investigations, feasibility studies and clinical trials.

• Big Science: Research silos are evaporating with the merging of scientific methods. Traditional hypothesis-testing studies will couple with data-driven research.

• Big Collaboration: As evidence accumulates, personalized medicine will become a reality, and patient-specific cancer interventions will become available. Teams of disease specialists, researchers and bioinformaticians will work in concert in a virtual frontier.

Page 3

Collaboration Platform is the Key for Doing Big Science with Big Data

Page 4

New Science Economy Model: Public-Private Partnership for Research

Page 5

The Requirements for PPP-based Translational Research

• IT Infrastructure: hosted and shared by a community

• Data management: shared with configurable access control by various subgroups of a community

• Software: PaaS for development community

• Analysis: collaborative analysis

• Knowledge management: dynamic integration of knowledge based on semantics

Page 6

PPP Example: EU IMI

Page 7

U-BIOPRED Project: An IMI Project

Innovative Medicines Initiative (IMI)

• World's largest public-private partnership

• 2 billion euro: 1 billion from the EU, 1 billion from EFPIA

U-BIOPRED: Unbiased Biomarkers for predicting respiratory disease outcomes

• 40-party collaboration between 10 pharma companies & 30 academic medical centres

Knowledge Management Work Package

• Imperial Data Co-ordinator, JnJ TranSMART

• AZ, Roche, UCB, CNRS

Page 8

UBIOPRED Knowledge Product: Handprints

1. Reaching international consensus on diagnostic criteria

2. Creating adult/pediatric cohorts and biobanks

3. Creating novel biology ‘handprints’ by combining molecular, histological, clinical and patient-reported data

4. Validating such ‘handprints’ in relation to exacerbations and disease progression

5. Refining the ‘handprints’ by using preclinical and human exacerbation models

6. Predicting efficacy of gold-standard and novel interventions

7. Refining the diagnostic criteria and phenotypes

8. Establishing a platform for exchange, education and dissemination

Page 9

A Typical Querying Process

Query

• Sub-set of « X » matched patients, e.g. age

• Specific « Y » criteria, e.g. diagnostic

• Sub-set of patients

• « Z » measurements originating from different partners, e.g. proteomics

Integrated analysis methods

• 3 pathways

• Search for connection to inflammation-regulating drugs

• Query for disease association

• Create own pathway

• Use public and proprietary omics data to infer network extensions and connect disjunct networks

Omics data export to analysis tool

• Export omics data matrix and context to tab. files

• Upload exported files into application (cloud/systems)

• Perform analysis

• Export analysis results into KR

• Select the patients who provided the samples associated with the results and check their physiologic data for significant differences
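The query-then-export flow on this slide can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Patient` record, the function names, the age/diagnosis criteria and the sample values are hypothetical and do not come from the U-BIOPRED platform or tranSMART.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class Patient:
    # Hypothetical patient record; field names are illustrative only.
    patient_id: str
    age: int
    diagnosis: str


def select_cohort(patients, min_age, max_age, diagnosis):
    """Match patients on an age range (X) and a diagnosis criterion (Y)."""
    return [
        p for p in patients
        if min_age <= p.age <= max_age and p.diagnosis == diagnosis
    ]


def build_matrix(cohort, measurements):
    """Collect the measurements (Z) available for the selected cohort.

    `measurements` maps patient_id -> {analyte: value}. Patients without
    data are skipped rather than filled with invented values.
    """
    rows = {
        p.patient_id: measurements[p.patient_id]
        for p in cohort if p.patient_id in measurements
    }
    analytes = sorted({a for values in rows.values() for a in values})
    return analytes, rows


def export_tsv(analytes, rows):
    """Write the data matrix as tab-delimited text for an analysis tool."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(["patient_id", *analytes])
    for patient_id in sorted(rows):
        writer.writerow(
            [patient_id, *(rows[patient_id].get(a, "") for a in analytes)]
        )
    return buffer.getvalue()


# Made-up example data, not real study data.
patients = [
    Patient("P1", 34, "severe asthma"),
    Patient("P2", 61, "severe asthma"),
    Patient("P3", 40, "healthy control"),
]
measurements = {
    "P1": {"protein_A": 1.2, "protein_B": 0.4},
    "P2": {"protein_A": 0.9},
}

cohort = select_cohort(patients, 18, 50, "severe asthma")
analytes, rows = build_matrix(cohort, measurements)
tsv_text = export_tsv(analytes, rows)
```

Keeping the patient identifier as the first column of the exported matrix is what makes the last step possible: results produced by the external tool can be traced back to the patients who provided the samples.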

Page 10

The Challenge in Building the KM for UBIOPRED

• The consortium is formed as a loosely coupled virtual organisation (VO)

• Such a VO conducts highly collaborative and multidisciplinary scientific activities

• Data is generated by many people in the VO with a wide range of connected modalities

• Knowledge is built through an iterative knowledge production process where data needs to be incrementally collected, flexibly integrated, systematically analysed and interactively reported

• The project has a fixed period, but the knowledge needs to be managed forever

• Very big task with very little money

Page 11

U-BIOPRED KM System Design Goal and KM Key Features

• Cohort-based information integration to support clinically driven integrative biomarker studies

• Cloud-based infrastructure to facilitate collaborative translational research

• An integrated framework that supports the management of a collaborative knowledge production process (curation, analysis, content enhancement, reporting, modelling and simulation)

• A sustainable environment with longevity and total ownership: workflow-based PaaS for integrative analytics:

– Curation ETL workbenches

– Analytics workbenches

• A scalable platform adaptable to future science/technology developments (such as NGS technology, medical imaging, etc.)

Page 12

UBIOPRED Collaborative TR Platform

• Analytical Register: Research Collaboration Management

• TranSMART: Cohort-based Omics Data Management

• IDBS InforSense Workflow: Analytical Engine + Development PaaS

Platform layers:

• Virtual Machines: Sample Tracking, GenePattern & R, Analytical Registry, EWB for Assay Data Management, InforSense KDE Workflow, TSmart DB

• IC Cloud (Virtualisation Layer)

• Physical Layer (Imperial Based Data Centre)

30/04/2012 Deploying TranSmart to UBIOPRED

Page 13

UBIOPRED TR Platform Components

• Content Building: Curation (ETL/Ontology)

• Omics Experiment Management (ELN/LIMS)

• Collaborative Biomarker Data Analysis (Analytical Workflow)

• Systems Biology Study (Modelling and Simulation)

• Collaborative Study Management (Collaboration Management)

• Study Data Management (tranSMART)

Page 14

Curation: Building Study Contents and Background Knowledge

tranSMART-based UBIOPRED Study Data Management System

Page 15

Omics Experiment Management

Omics experiments (proteomics, lipidomics) (WP7) and the pre-clinical study (WP6) are managed with the EWB of IDBS

Page 16

Study Data Management

Study-oriented curated data, together with patient information, are integrated and warehoused in tranSMART for analysis (WP8)

From John Shon et al., tranSMART, AMIA TBI 2012

Page 17

Workflow-based Analytics: A Proteomics Example

Workflow stages:

• Q-TOF mass spectrometry experiment

• Mass spectra

• MS data preprocessing

• m/z peak data

• Peak selection

• Selected peaks

• Classification model

• Cross-validation (CV training set, CV testing set)

• Label prediction

• Model performance

MS data preprocessing:

• Calibration

• Baseline Correction

• Noise Reduction and Smoothing

• Peak Detection

• Peak Alignment

• Intensity Normalization

Peak selection:

• Coefficient of variation test

• Statistical tests (e.g., KS test, permutation test)

• Supervised learning (e.g., SVM, discriminant analysis)

Classification model:

• Supervised learning (e.g., SVM, discriminant analysis)

Use Monte Carlo method to randomly select other biomarker detection models and alternative parameter settings

[Figures: mass spectra, intensity vs. m/z (0 to 3000); smoothed MS with detected peaks, intensity vs. m/z (1070 to 1085)]
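A few of the stages named on this slide (smoothing, peak detection, the coefficient of variation test, and cross-validation of a classifier) can be sketched in plain Python. This is a toy illustration under stated assumptions, not the InforSense workflow: the function names and data are made up, and a nearest-centroid classifier stands in for the SVM or discriminant analysis mentioned on the slide.

```python
import random
import statistics


def smooth(intensities, window=3):
    """Moving-average smoothing (one simple form of noise reduction)."""
    half = window // 2
    out = []
    for i in range(len(intensities)):
        segment = intensities[max(0, i - half): i + half + 1]
        out.append(sum(segment) / len(segment))
    return out


def detect_peaks(intensities, threshold):
    """Indices of local maxima whose intensity exceeds `threshold`."""
    return [
        i for i in range(1, len(intensities) - 1)
        if intensities[i] > threshold
        and intensities[i] > intensities[i - 1]
        and intensities[i] >= intensities[i + 1]
    ]


def select_by_cv(samples, min_cv):
    """Keep peak columns whose coefficient of variation is >= `min_cv`."""
    keep = []
    for j in range(len(samples[0])):
        column = [row[j] for row in samples]
        mean = statistics.mean(column)
        if mean > 0 and statistics.stdev(column) / mean >= min_cv:
            keep.append(j)
    return keep


def nearest_centroid_predict(train_x, train_y, x):
    """Stand-in classifier: label of the closest class centroid."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        members = [r for r, y in zip(train_x, train_y) if y == label]
        centroid = [statistics.mean(col) for col in zip(*members)]
        dist = sum((a - b) ** 2 for a, b in zip(x, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def cross_validate(samples, labels, k=4, seed=0):
    """k-fold cross-validation accuracy of the stand-in classifier."""
    order = list(range(len(samples)))
    random.Random(seed).shuffle(order)
    correct = 0
    for fold in range(k):
        test_idx = order[fold::k]
        train_idx = [i for i in order if i not in test_idx]
        train_x = [samples[i] for i in train_idx]
        train_y = [labels[i] for i in train_idx]
        for i in test_idx:
            predicted = nearest_centroid_predict(train_x, train_y, samples[i])
            correct += int(predicted == labels[i])
    return correct / len(samples)


# Made-up spectrum: smoothing followed by peak detection.
spectrum = [0, 1, 5, 1, 0, 2, 9, 2, 0]
peaks = detect_peaks(smooth(spectrum), threshold=2.1)

# Made-up peak-intensity matrix: 8 samples x 3 aligned peaks.
samples = [
    [10.0, 5.0, 1.0], [11.0, 5.1, 2.0], [9.0, 4.9, 1.5], [10.5, 5.0, 2.5],
    [2.0, 5.0, 1.2], [3.0, 5.1, 2.2], [2.5, 4.9, 1.8], [1.5, 5.0, 2.4],
]
labels = ["case"] * 4 + ["control"] * 4

selected = select_by_cv(samples, min_cv=0.1)
reduced = [[row[j] for j in selected] for row in samples]
accuracy = cross_validate(reduced, labels, k=4)
```

The Monte Carlo step on the slide would wrap this whole pipeline in a loop that randomly draws the selection method, the classifier and their parameters, then compares the cross-validated performance of each draw.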

Page 18

Workflows in InforSense

Page 19

Workflow Deployed as Cloud Applications

Diagram elements:

• Scientists: Consume Experiments on Cloud

• Scientist: Develop, Deploy and Share Experiments on Cloud

• Cloud: Experiments, Experiment Deployment, Elastic Scheduler

• VMs, each running an OS

• Hypervisor

• Physical Infrastructure

Page 20

A Development Cycle of Cloud Application Building via Workflow

Page 21

Analytical Register: Enables Collaborative Knowledge Management

Page 22

Mapping Research Collaboration in UBIOPRED

Entities linked to UBIOPRED in the diagram:

• Work Package

• Study

• Investigation

• Procedure

• Cohort

• Researcher

• Organisation
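One way to picture the entities on this slide is as a small data model. The sketch below is an assumption throughout: the slide names the entities but not how they relate, so the nesting chosen here (work package, study, investigation) and every class, field and function name are hypothetical and are not the Analytical Register schema.

```python
from dataclasses import dataclass, field


@dataclass
class Researcher:
    name: str
    organisation: str


@dataclass
class Investigation:
    title: str
    procedure: str
    cohort: str
    researchers: list = field(default_factory=list)


@dataclass
class Study:
    title: str
    investigations: list = field(default_factory=list)


@dataclass
class WorkPackage:
    code: str
    studies: list = field(default_factory=list)


def investigations_for_cohort(work_packages, cohort):
    """Cohort view: every investigation, in any work package, on `cohort`."""
    return [
        inv
        for wp in work_packages
        for study in wp.studies
        for inv in study.investigations
        if inv.cohort == cohort
    ]


def organisations_in(work_package):
    """Work package view: organisations contributing to a work package."""
    return sorted({
        r.organisation
        for study in work_package.studies
        for inv in study.investigations
        for r in inv.researchers
    })


# Made-up example records.
wp7 = WorkPackage("WP7", [
    Study("Proteomics study", [
        Investigation("Sputum proteomics", "Q-TOF MS", "adult cohort",
                      [Researcher("Researcher A", "Organisation 1")]),
    ]),
])
wp8 = WorkPackage("WP8", [
    Study("Integrated analysis", [
        Investigation("Biomarker classification", "Analytical workflow",
                      "adult cohort",
                      [Researcher("Researcher B", "Organisation 2")]),
        Investigation("Paediatric follow-up", "Analytical workflow",
                      "paediatric cohort",
                      [Researcher("Researcher A", "Organisation 1")]),
    ]),
])

adult_titles = [
    inv.title for inv in investigations_for_cohort([wp7, wp8], "adult cohort")
]
wp8_organisations = organisations_in(wp8)
```

Each register view (study, work package, cohort, investigation) then becomes a different traversal of the same linked records.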

Page 23

Analytical Register: A Study View

Page 24

Analytical Register: Work Package View

Page 25

Analytical Register: A Cohort View

Page 26

Analytical Register: Interfacing to Cohort Selection

Page 27

Analytical Register – Cohort View

Page 28

Analytical Register: An Investigation View

Page 29

Analytical Register: Investigation Drill Down

Page 30

Analytical Register Further Drill Down: Method

Page 31

Analytical Register: Interfacing to Sample Tracking

Page 32

Analytical Register: Fully Integrated with TranSMART

Page 33

UBIOPRED Systems Biology Study

1. We have developed a p38 pathway model in SBML and performed simulation through MATLAB.

Page 34

UBIOPRED Systems Biology Study

2. We have proposed a novel glucocorticoid signalling pathway model in SBML.

Page 35

UBIOPRED Systems Biology Study

3. We have proposed some hypotheses to suggest possible crosstalk between the GC and p38 pathways.

4. We then construct the integrated pathway model in SBML.

5. We test the stability of this integrated model.
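As an illustration of what a stability test on a pathway model can look like, the sketch below simulates a toy two-variable ODE system to its steady state, perturbs it, and checks that it returns. The equations, parameter values and function names are all assumptions chosen for the example; this is not the p38, GC or integrated model from the study, and it uses plain Python rather than SBML or MATLAB.

```python
def derivatives(state, params):
    """Toy kinetics: x is produced, decays, and is removed by y;
    y is activated by x and decays. Not a real pathway model."""
    x, y = state
    production, decay_x, inhibition, activation, decay_y = params
    dx = production - decay_x * x - inhibition * x * y
    dy = activation * x - decay_y * y
    return (dx, dy)


def rk4_step(state, params, dt):
    """One classical fourth-order Runge-Kutta integration step."""
    def shifted(base, slope, scale):
        return tuple(b + scale * s for b, s in zip(base, slope))

    k1 = derivatives(state, params)
    k2 = derivatives(shifted(state, k1, dt / 2), params)
    k3 = derivatives(shifted(state, k2, dt / 2), params)
    k4 = derivatives(shifted(state, k3, dt), params)
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def simulate(state, params, t_end, dt=0.01):
    """Integrate from `state` for `t_end` time units; return the final state."""
    for _ in range(round(t_end / dt)):
        state = rk4_step(state, params, dt)
    return state


def returns_to_steady_state(steady, params, kick=0.1, tol=1e-6):
    """Perturb each variable by `kick` (fractional) and check recovery."""
    for i in range(len(steady)):
        perturbed = list(steady)
        perturbed[i] *= 1 + kick
        final = simulate(tuple(perturbed), params, t_end=50.0)
        if max(abs(a - b) for a, b in zip(final, steady)) > tol:
            return False
    return True


# Assumed parameter values, chosen only so the toy system has a stable
# steady state.
params = (1.0, 0.5, 0.2, 0.3, 0.4)
steady = simulate((0.0, 0.0), params, t_end=50.0)
stable = returns_to_steady_state(steady, params)
```

A perturbation test like this only probes local stability around one steady state; a fuller analysis would also examine the eigenvalues of the Jacobian and the sensitivity to parameter changes.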

Page 36

Extending UBIOPRED to Support Multiple Projects

[Diagram: Project 1 ... Project n, with DB Instances and Virtual Containers]

Page 37

eTRIKS: European translational research and knowledge management services

Page 38

Conclusion: the end of the beginning

• The UBIOPRED knowledge management system supports cross-institutional, large-scale translational research

• The UBIOPRED knowledge management system supports the entire knowledge production process

• For PPP-based TR projects, collaborative research support in a VO environment is crucial

• Modern technologies such as cloud and workflow provide the technical foundation for its implementation

• The development is moving forward to support IMI translational projects in general through the eTRIKS project