
OpenTox Europe 2013

Page 1: OpenTox Europe 2013

Alejandra González-Beltrán, Ph.D

University of Oxford e-Research Centre, UK

From experimental planning to data publication: the ISA infrastructure and case studies in toxicology

[email protected]

OpenTox Europe - Mainz, Germany - 30th September, 2013

Page 2: OpenTox Europe 2013

The data workflow

Data Scientist

Visualization

Analysis

Planning

Data Management

Data Collection

Publication

Use existing data

Perform new experiment

Page 3: OpenTox Europe 2013

The data workflow

[Figure: the workflow diagram of the previous slide (Data Scientist; Planning, Data Collection, Data Management, Analysis, Visualization, Publication; Use existing data, Perform new experiment), with a "metadata" label on each of the six stages and the caption "metadata tracking infrastructure".]

Page 4: OpenTox Europe 2013

[Figure: the same workflow diagram (Data Scientist; Planning, Data Collection, Data Management, Analysis, Visualization, Publication; Use existing data, Perform new experiment) with a "metadata" label on each stage, surrounded by the terms Traceability, Assessment, Accountability, Evidence, Reusability, Reproducibility, Storage, Mining and Provenance.]

Page 5: OpenTox Europe 2013

semantics

structure

Page 6: OpenTox Europe 2013

semantics

structure

investigation, study, assay
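
The investigation, study, assay structure named on this slide can be pictured as three nested levels. The following is a minimal Python sketch, assuming that an investigation groups studies and a study groups assays; the class names, field names and example values are illustrative assumptions and are not taken from the ISA specification or from any ISA tool.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Assay:
    # Hypothetical fields describing what was measured and how.
    measurement_type: str
    technology_type: str
    data_files: List[str] = field(default_factory=list)


@dataclass
class Study:
    identifier: str
    assays: List[Assay] = field(default_factory=list)


@dataclass
class Investigation:
    identifier: str
    studies: List[Study] = field(default_factory=list)

    def all_data_files(self) -> List[str]:
        """Collect the data files of every assay in every study, in order."""
        return [
            data_file
            for study in self.studies
            for assay in study.assays
            for data_file in assay.data_files
        ]


# Invented example values, for illustration only.
example = Investigation(
    identifier="INV-1",
    studies=[
        Study(
            identifier="STUDY-1",
            assays=[
                Assay("metabolite profiling", "mass spectrometry", ["FILE 1", "FILE 2"]),
                Assay("transcription profiling", "DNA microarray", ["FILE 3"]),
            ],
        )
    ],
)
print(example.all_data_files())
```

Keeping the three levels as separate types is one way to let the "structure" part of the slide be checked independently of the "semantics" (the vocabulary used for the field values).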

Page 7: OpenTox Europe 2013

Page 8: OpenTox Europe 2013

The ISA infrastructure

generic format for experimental description and data exchange

open source software tools

community engagement

Page 9: OpenTox Europe 2013
Page 10: OpenTox Europe 2013
Page 11: OpenTox Europe 2013

[Figure: numbered experimental workflow (steps 1 to 6) for an Arabidopsis thaliana example: experiment design, treatment groups (70%, 90%, 100%), sample collection (SAMPLE 1 to SAMPLE 11), assay runs producing data files (FILE 1 onwards), and analysis.]

Parses ISA-Tab datasets into R objects, which can be updated and saved after analysis.

Bridges the ISA-Tab metadata to the analysis pipelines of specific assay types by building objects for use in other R packages downstream: currently mass spectrometry (xcms package, xcmsSet) and DNA microarray (Biobase package, ExpressionSet).

Suggests Bioconductor packages that might be relevant for an assay type, according to their biocViews annotations.

Gonzalez-Beltran et al. The Risa R/Bioconductor package: integrative data analysis from experimental metadata and back again. In press.
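
The idea of reading tab-delimited experimental metadata into objects that an analysis can use might be sketched as follows. This is a Python illustration only, not the Risa API (Risa is an R package). The table content is invented, and the assignment of samples to the 70%, 90% and 100% treatment groups is an assumption. The column headers follow the ISA-Tab naming style ("Source Name", "Characteristics[...]", "Factor Value[...]", "Sample Name"), and the function names are hypothetical.

```python
import csv
import io
from collections import defaultdict

# Invented ISA-Tab-style study table (tab-delimited), for illustration only.
STUDY_TABLE = "\n".join([
    "Source Name\tCharacteristics[organism]\tFactor Value[treatment]\tSample Name",
    "plant1\tArabidopsis thaliana\t70%\tSAMPLE1",
    "plant2\tArabidopsis thaliana\t70%\tSAMPLE2",
    "plant3\tArabidopsis thaliana\t90%\tSAMPLE3",
    "plant4\tArabidopsis thaliana\t90%\tSAMPLE4",
    "plant5\tArabidopsis thaliana\t100%\tSAMPLE5",
    "plant6\tArabidopsis thaliana\t100%\tSAMPLE6",
])


def read_study_table(text):
    """Parse a tab-delimited table into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def group_samples_by_factor(rows, factor_column):
    """Map each value of the chosen factor column to its sample names."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[factor_column]].append(row["Sample Name"])
    return dict(groups)


rows = read_study_table(STUDY_TABLE)
groups = group_samples_by_factor(rows, "Factor Value[treatment]")
for level, samples in groups.items():
    print(level, samples)
```

Grouping by a factor column is the kind of step a downstream analysis needs before comparing treatment groups, which is why the metadata has to be available in machine-readable form.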

Page 12: OpenTox Europe 2013
Page 13: OpenTox Europe 2013
Page 14: OpenTox Europe 2013
Page 15: OpenTox Europe 2013

Data Publication with

Page 16: OpenTox Europe 2013

• New open-access, online-only publication for descriptions of scientifically valuable datasets

• Only content type: Data Descriptor, narrative + structured parts

• Initially focused on the life, environmental and biomedical sciences

• Data Descriptor will be complementary to traditional research journals and data repositories

• Designed to foster data sharing and reuse, and ultimately to accelerate scientific discovery

Data Publication with http://www.nature.com/scientificdata/

Page 17: OpenTox Europe 2013

Data Publication with http://www.nature.com/scientificdata/

http://gigasciencejournal.com

Page 18: OpenTox Europe 2013

Page 19: OpenTox Europe 2013
Page 20: OpenTox Europe 2013

A growing ecosystem of over 30 public and internal resources using the ISA metadata tracking framework (ISA-Tab and/or format) to facilitate standards-compliant collection, curation, management and reuse of investigations in an increasingly diverse set of life science domains, including:

• environmental health
• environmental genomics
• metabolomics
• metagenomics
• nanotechnology
• proteomics
• stem cell discovery
• systems biology
• transcriptomics
• toxicogenomics
• also by communities working to build a library of cellular signatures

Page 21: OpenTox Europe 2013

Toxicity data

http://xkcd.com/1260/

Page 22: OpenTox Europe 2013

Suter et al. 2011. EU Framework 6 Project: Predictive Toxicology (PredTox) - overview and outcome.

Boitier et al. 2011. A comparative integrated transcript analysis and functional characterization of differential mechanisms for induction of liver hypertrophy in the rat.

InnoMed PredTox Project

Goal: earlier pre-clinical safety evaluation by combining results from 'omics technologies and conventional toxicology methods

Page 23: OpenTox Europe 2013

2-week systemic rat study using male Wistar rats (N=15 per dose group)

14 proprietary drug candidates from participating companies and 2 reference toxic compounds

Page 24: OpenTox Europe 2013

Page 25: OpenTox Europe 2013

Page 27: OpenTox Europe 2013

Data Infrastructure for Chemical Safety

http://www.dixa-fp7.eu/about

Page 28: OpenTox Europe 2013

Kohonen et al. 2013 The ToxBank Data Warehouse: a research cluster of 7 EU FP7 Health systems toxicology and toxicogenomics projects.

Safety Evaluation Ultimately Replacing Animal Testing-1 (SEURAT-1): looking at improving safety assessment without the need for animal experiments

ToxBank: cross-cluster infrastructure project

http://toxbank.net

Page 29: OpenTox Europe 2013

https://wiki.nci.nih.gov/display/ICR/ISA-TAB-Nano

Nanotechnology Informatics Working Group

Thomas et al. 2013. ISA-TAB-Nano: A specification for sharing nanomaterial research data in spreadsheet-based format.

Baker et al. 2013 Standardizing data

ISA-TAB-Nano

Extension of the ISA-TAB format to represent nanomaterials, small molecules and biological specimens along with their assay characterisation data

Page 30: OpenTox Europe 2013

Data Scientist

Visualization

Analysis

Planning

Data Management

Data Collection

Publication

Page 31: OpenTox Europe 2013

Page 32: OpenTox Europe 2013

Questions?

You can email [email protected]

View our blog: http://isatools.wordpress.com

Follow us on Twitter: @isatools

View our website: http://www.isa-tools.org

View our Git repo & contribute: http://github.com/ISA-tools