Taverna Workflows for Systems Biology Katy Wolstencroft School of Computer Science University of Manchester


Taverna Workflows for Systems Biology

Katy Wolstencroft, School of Computer Science

University of Manchester

What is a Taverna Workflow?

Workflow management system

Sophisticated analysis pipelines

A set of services to analyse or manage data (either local or remote)

Data flow through services

Control of service invocation

Interoperability, Integration and Collaboration

Access to distributed and local resources

Iteration over data sets

Automation of data flow

Agile methods development

Extensible

Experimental protocols

Taverna Workflows
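
As a rough illustration of data flowing through a chain of services, with iteration over a data set, the following Python sketch passes each input through two stand-in functions. The function names and the toy sequences are hypothetical and exist only for this example; in Taverna the services would be wired together in the Workbench rather than in code.

```python
from typing import Callable, Iterable, List

# Hypothetical stand-ins for services; in a real workflow these would be
# web services, scripts, or library calls.
def fetch_sequence(gene_id: str) -> str:
    """Pretend lookup: return a toy sequence for a gene identifier."""
    toy_db = {"geneA": "ATGGCC", "geneB": "ATGTTTAAA"}
    return toy_db[gene_id]

def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence."""
    return sum(base in "GC" for base in sequence) / len(sequence)

def run_pipeline(inputs: Iterable[str], steps: List[Callable]) -> list:
    """Pass every input through each step in order (iteration over a list)."""
    results = []
    for item in inputs:
        for step in steps:
            item = step(item)  # output of one step feeds the next
        results.append(item)
    return results

values = run_pipeline(["geneA", "geneB"], [fetch_sequence, gc_content])
```

Swapping or adding a step only changes the list passed to `run_pipeline`, which is the property that makes pipelines of this kind easy to reuse.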

Workflows are ideal for…

High throughput analysis (transcriptomics, proteomics, Next Gen sequencing, etc.)

Data integration, data interoperation

Data management

Model construction

Data format manipulation

Systems Biology

Workflow engine to run workflows

List of services

Construct and visualise workflows

Taverna Workbench

Web Services, e.g. KEGG

Programming libraries, e.g. libSBML

Hull D, Wolstencroft K, Stevens R, Goble C, Pocock MR, Li P, Oinn T. Taverna: a tool for building and running workflows of services. Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W729-32.

Freely available, open source

Current Version 2.2

70,000+ downloads across versions

Part of the myGrid Toolkit

Taverna Workbench

myGrid Open Suite of Tools

Client User Interfaces

Workflow GUI Workbench

Workflow Repository

Service Catalogue

Third Party Tools

Programming and APIs

Web Portal

Activity and Service Plug-in Manager

Provenance Store

Workflow Server

Open Provenance Model

Secure Service Access

Examples of Workflows for Systems Biology

Escherichia coli : From cDNA Microarray Raw Data to Pathways and Published Abstracts

Identify differentially expressed genes using t-test with R

Peter Li, MCISB
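
To make the differential-expression step concrete, here is a minimal sketch of ranking genes by a two-sample t statistic. The workflow on the slide runs the test in R; this Python version is a stand-in that uses only the standard library, computes Welch's t statistic without p-values, and runs on made-up expression values that are purely illustrative.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(group_a, group_b):
    """Welch's t statistic for two independent samples."""
    standard_error = sqrt(
        variance(group_a) / len(group_a) + variance(group_b) / len(group_b)
    )
    return (mean(group_a) - mean(group_b)) / standard_error

# Made-up log-expression values: gene -> (control replicates, treated replicates)
toy_expression = {
    "gene1": ([1.0, 1.1, 0.9], [3.0, 3.1, 2.9]),
    "gene2": ([2.0, 2.1, 1.9], [2.0, 2.2, 1.8]),
}

t_scores = {
    gene: welch_t(control, treated)
    for gene, (control, treated) in toy_expression.items()
}

# Genes with the largest absolute t statistic come first.
ranked = sorted(t_scores, key=lambda gene: abs(t_scores[gene]), reverse=True)
```

A real analysis would also convert each statistic to a p-value and correct for multiple testing, which this sketch leaves out.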

SysMO SUMO: Systems Understanding of Microbial Oxygen responses

Afsaneh Maleki-Dizaji, University of Sheffield

High Throughput Experiments

Wei Tan, Univ. Chicago, caBIG

Using gene-expression patterns associated with two lymphoma types to predict the type of an unknown sample

Workflows for Model Building

Results from experiments in systems biology -> related to mathematical models in SBML

Workflows can link data and models

Workflows can create models

SBML

Location of components

Species

Reactions

Model construction workflow

Input: list of ORFs

Output: SBML file

1. Get reaction info

2. Create compartments

3. Create species

4. Create reactions

Get annotations

Peter Li et al, MCISB, myGrid
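
The four steps above can be sketched in a few lines of Python. This is not the workflow by Peter Li et al.: the reaction table, ORF names, and model id are hypothetical placeholders for what a remote annotation service would return, and the XML is built with the standard library rather than libSBML, so the output is not validated against the SBML specification.

```python
import xml.etree.ElementTree as ET

SBML_NS = "http://www.sbml.org/sbml/level2/version4"

# Hypothetical lookup standing in for a remote annotation service:
# ORF -> (reaction id, compartment, substrates, products)
TOY_REACTIONS = {
    "orf1": ("R1", "cytosol", ["A"], ["B"]),
    "orf2": ("R2", "cytosol", ["B"], ["C"]),
}

def build_model(orfs):
    """Return a minimal SBML-style document (as text) for the given ORFs."""
    # 1. Get reaction info (ORFs without an entry are skipped)
    reactions = [TOY_REACTIONS[orf] for orf in orfs if orf in TOY_REACTIONS]

    sbml = ET.Element("sbml", {"xmlns": SBML_NS, "level": "2", "version": "4"})
    model = ET.SubElement(sbml, "model", {"id": "toy_model"})

    # 2. Create compartments
    compartments = ET.SubElement(model, "listOfCompartments")
    for comp in sorted({r[1] for r in reactions}):
        ET.SubElement(compartments, "compartment", {"id": comp})

    # 3. Create species (each metabolite once, in first-seen order)
    species_list = ET.SubElement(model, "listOfSpecies")
    seen = {}
    for _, comp, substrates, products in reactions:
        for name in substrates + products:
            seen.setdefault(name, comp)
    for name, comp in seen.items():
        ET.SubElement(species_list, "species", {"id": name, "compartment": comp})

    # 4. Create reactions
    reaction_list = ET.SubElement(model, "listOfReactions")
    for rid, _, substrates, products in reactions:
        reaction = ET.SubElement(reaction_list, "reaction", {"id": rid})
        reactants = ET.SubElement(reaction, "listOfReactants")
        for name in substrates:
            ET.SubElement(reactants, "speciesReference", {"species": name})
        outputs = ET.SubElement(reaction, "listOfProducts")
        for name in products:
            ET.SubElement(outputs, "speciesReference", {"species": name})

    return ET.tostring(sbml, encoding="unicode")

sbml_text = build_model(["orf1", "orf2"])
```

In practice a library such as libSBML would handle consistency checks and required attributes that this sketch ignores.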

Integrating libSBML into Taverna

Peter Li et al, MCISB, myGrid

Workflows for Data Integration

Read enzyme names from SBML

Query maxd database using enzyme names

Calculate colours based on gene expression level

Create new SBML model with new colour nodes

Mapping transcriptomic data onto SBML models
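
A minimal sketch of this mapping, under several assumptions: enzyme names are taken from the `name` attribute of species elements, a hard-coded dictionary stands in for the maxd query, the colour thresholds are arbitrary, and the colour is stored in an annotation element under a made-up namespace (`http://example.org/colour`). None of these details come from the original workflow.

```python
import xml.etree.ElementTree as ET

SBML_NS = "http://www.sbml.org/sbml/level2/version4"
VIZ_NS = "http://example.org/colour"  # hypothetical annotation namespace
ET.register_namespace("", SBML_NS)
ET.register_namespace("viz", VIZ_NS)

TOY_MODEL = f"""<sbml xmlns="{SBML_NS}" level="2" version="4">
  <model id="toy">
    <listOfCompartments>
      <compartment id="c"/>
    </listOfCompartments>
    <listOfSpecies>
      <species id="s1" name="hexokinase" compartment="c"/>
      <species id="s2" name="enolase" compartment="c"/>
    </listOfSpecies>
  </model>
</sbml>"""

# Made-up log2 expression ratios standing in for a maxd database query.
TOY_EXPRESSION = {"hexokinase": 1.5, "enolase": -2.0}

def colour_for(log_ratio):
    """Red if up-regulated, green if down-regulated, grey otherwise."""
    if log_ratio >= 1.0:
        return "#ff0000"
    if log_ratio <= -1.0:
        return "#00ff00"
    return "#cccccc"

def colour_model(sbml_text, expression):
    """Return a new SBML document with a colour annotation per known enzyme."""
    root = ET.fromstring(sbml_text)
    for species in root.iter(f"{{{SBML_NS}}}species"):
        name = species.get("name")          # read enzyme name from SBML
        if name in expression:              # look up its expression level
            annotation = ET.SubElement(species, f"{{{SBML_NS}}}annotation")
            ET.SubElement(
                annotation,
                f"{{{VIZ_NS}}}colour",
                {"value": colour_for(expression[name])},
            )
    return ET.tostring(root, encoding="unicode")

coloured_sbml = colour_model(TOY_MODEL, TOY_EXPRESSION)
```

The input text is never modified; the function parses it and returns a new document, mirroring the final step of creating a new SBML model.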

HUMAN Microarray CEL file to candidate pathways

Reuse, Recycle, Repurpose Workflows

From cDNA Microarray Raw Data to Pathways and Published Abstracts

SUMO

Metware: Workflows for metabolomics, Netherlands/Germany

Steffen Neumann, Leibniz Institute of Plant Biochemistry

Workflows through web interface

Reuse, Recycle, Replay Workflows

Workflows in e-Laboratories

SysMO-SEEK: e-Laboratory for interlinking and sharing data, models, SOPs and workflows for Systems Biology in Europe

Workflows for data analysis

Summary

Informatics in Systems Biology relies on data integration and large-scale data analysis

Taverna workflows are a mechanism for linking together resources and analyses

myExperiment allows you to reuse workflows and benefit from others' work

More information

Taverna http://www.taverna.org.uk

myExperiment http://www.myexperiment.org http://wiki.myexperiment.org

BioCatalogue http://www.biocatalogue.org

SysMO-SEEK http://www.sysmo-db.org
