TAVERNA PROVENANCE Stian Soiland-Reyes, University of Manchester https://github.com/wf4ever/taverna-prov This work is licensed under a Creative Commons Attribution 3.0 Unported License 2013-05-29

2013-05-29 Taverna Provenance

DESCRIPTION

Slide deck presenting the provenance support of the Taverna workflow system, detailing the architecture, the ontologies, and how results are exported as Research Object bundles that include the PROV-O provenance of the workflow run. This upload is the PDF version; for the PPTX source, see https://www.slideshare.net/soilandreyes/20130529-taverna-provenance-pptx-source/

Page 1: 2013-05-29 Taverna Provenance

TAVERNA PROVENANCE

Stian Soiland-Reyes, University of Manchester

https://github.com/wf4ever/taverna-prov

This work is licensed under a Creative Commons Attribution 3.0 Unported License

2013-05-29

Page 2: 2013-05-29 Taverna Provenance

ARCHITECTURE

Provenance

• Workflow
• Workflow run
• Process run (iteration)
• Parameter bindings

Data

• Lists
• Values
• References
• Errors

[Diagram: Process1 and Process2, each with ports A, B, C, D and E]

Processor dispatch stack

• Invoke
• Retry
• Failover
• Loop
• Error bounce
• Provenance (layer injected by plugin)
• Parallelise

P Missier, S Soiland-Reyes, S Owen, W Tan, A Nenadic, I Dunlop, C Goble (2010, January). Taverna, reloaded. In Scientific and Statistical Database Management (pp. 471-481). Springer Berlin Heidelberg. DOI 10.1007/978-3-642-13818-8_33

• The provenance layer captures the provenance trace during workflow execution
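
The dispatch stack idea can be illustrated with a small Python model. This is only a sketch of the pattern, not Taverna's Java API: the class names `Layer`, `InvokeLayer`, `RetryLayer` and `ProvenanceLayer`, the `build_stack` helper and the layer order used here are all hypothetical. It shows how an extra layer placed in the stack can record inputs, outputs and errors of each invocation without the service itself being aware of it.

```python
from typing import Callable, Dict, List, Optional


class Layer:
    """One layer of a (hypothetical) dispatch stack; forwards jobs to the layer below."""

    def __init__(self) -> None:
        self.below: Optional["Layer"] = None

    def dispatch(self, inputs: Dict[str, str]) -> Dict[str, str]:
        assert self.below is not None, "the bottom layer must override dispatch"
        return self.below.dispatch(inputs)


class InvokeLayer(Layer):
    """Bottom layer: calls the actual service."""

    def __init__(self, service: Callable[[Dict[str, str]], Dict[str, str]]) -> None:
        super().__init__()
        self.service = service

    def dispatch(self, inputs):
        return self.service(inputs)


class RetryLayer(Layer):
    """Re-dispatches a failed job up to max_attempts times (assumed >= 1)."""

    def __init__(self, max_attempts: int = 3) -> None:
        super().__init__()
        self.max_attempts = max_attempts

    def dispatch(self, inputs):
        last_error = None
        for _ in range(self.max_attempts):
            try:
                return self.below.dispatch(inputs)
            except Exception as error:
                last_error = error
        raise last_error


class ProvenanceLayer(Layer):
    """Injected layer: records what passes through it as a simple trace."""

    def __init__(self) -> None:
        super().__init__()
        self.trace: List[dict] = []

    def dispatch(self, inputs):
        record = {"inputs": dict(inputs), "outputs": None, "error": None}
        self.trace.append(record)
        try:
            outputs = self.below.dispatch(inputs)
        except Exception as error:
            record["error"] = repr(error)
            raise
        record["outputs"] = dict(outputs)
        return outputs


def build_stack(layers: List[Layer]) -> Layer:
    """Link each layer to the one after it and return the top of the stack."""
    for upper, lower in zip(layers, layers[1:]):
        upper.below = lower
    return layers[0]


# Usage: a toy service that upper-cases port A into port D.
provenance = ProvenanceLayer()
stack = build_stack([
    provenance,
    RetryLayer(),
    InvokeLayer(lambda inputs: {"D": inputs["A"].upper()}),
])
result = stack.dispatch({"A": "hello"})
```

Because the provenance layer is just another element of the list passed to `build_stack`, it can be added or removed without touching the other layers, which is the property the plugin mechanism on this slide relies on.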

Page 3: 2013-05-29 Taverna Provenance

ONTOLOGY STACK

tavernaprov (http://ns.taverna.org.uk/2012/tavernaprov/)

• Lists, errors, byte content, checksums

wfprov + wfdesc (http://purl.org/wf4ever/wfprov#)

• Workflow execution, parameters, processes

PROV-O (http://www.w3.org/ns/prov-o#)

• Activity start/stop, generation of values
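
To show how the three vocabularies are used side by side in one trace, here is a minimal Python sketch that builds Turtle `@prefix` declarations and expands prefixed names. The helper names `turtle_prefixes` and `expand` are hypothetical. Two namespaces are assumptions not taken from the slide: `wfdesc` is assumed to live at `http://purl.org/wf4ever/wfdesc#`, and the PROV-O terms are taken from `http://www.w3.org/ns/prov#`, the W3C namespace for PROV terms, rather than the `prov-o#` URI shown on the slide.

```python
# Prefix table; wfdesc and prov are assumptions, see the note above.
NAMESPACES = {
    "prov": "http://www.w3.org/ns/prov#",
    "wfprov": "http://purl.org/wf4ever/wfprov#",
    "wfdesc": "http://purl.org/wf4ever/wfdesc#",
    "tavernaprov": "http://ns.taverna.org.uk/2012/tavernaprov/",
}


def turtle_prefixes(namespaces):
    """Return Turtle @prefix lines, one per namespace, sorted by prefix."""
    return "\n".join(
        f"@prefix {prefix}: <{uri}> ." for prefix, uri in sorted(namespaces.items())
    )


def expand(curie, namespaces=NAMESPACES):
    """Expand a prefixed name such as 'wfprov:ProcessRun' to a full URI."""
    prefix, _, local_name = curie.partition(":")
    return namespaces[prefix] + local_name


header = turtle_prefixes(NAMESPACES)
process_run_class = expand("wfprov:ProcessRun")
started_property = expand("prov:startedAtTime")
```

In this layering, a process run would be typed with a wfprov class while its start and end times use PROV-O properties, with tavernaprov reserved for Taverna-specific details.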

Page 4: 2013-05-29 Taverna Provenance

INTERMEDIATE RESULTS

• Within the Taverna Workbench, the provenance database is used for showing intermediate results and previous runs

• Clicking a processor shows the inputs and outputs of individual invocations

Page 5: 2013-05-29 Taverna Provenance

WORKFLOW RESULTS (FOLDER)

Folder structure

• workflowrun.prov.ttl (RDF): provenance trace
• outputA.txt, outputC.jpg: workflow outputs, one file per value
• outputB/: 1.txt, 2.txt, 3.txt
• intermediates/: values from intermediate steps in workflow, e.g. de/def2e58b-50e2-4949-9980-fd310166621a.txt
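
A saved results folder with this layout can be inspected with a few lines of Python. The sketch below is an assumption-laden illustration, not part of Taverna: the function name `classify_results` is hypothetical, and it assumes that the provenance trace is a top-level file ending in `.prov.ttl`, that intermediate values sit under `intermediates/`, and that everything else is a workflow output.

```python
from pathlib import Path


def classify_results(folder):
    """Group the files of a results folder into provenance, intermediates and outputs."""
    folder = Path(folder)
    summary = {"provenance": [], "intermediates": [], "outputs": []}
    for path in sorted(p for p in folder.rglob("*") if p.is_file()):
        relative = path.relative_to(folder)
        if len(relative.parts) == 1 and relative.name.endswith(".prov.ttl"):
            kind = "provenance"      # e.g. workflowrun.prov.ttl
        elif relative.parts[0] == "intermediates":
            kind = "intermediates"   # values from intermediate steps
        else:
            kind = "outputs"         # one file per value, lists as subfolders
        summary[kind].append(relative.as_posix())
    return summary
```

Calling `classify_results("path/to/results")` returns a dictionary of relative paths, so a list output such as `outputB/` shows up as `outputB/1.txt`, `outputB/2.txt` and so on under `"outputs"`.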

Page 6: 2013-05-29 Taverna Provenance

WORKFLOW RESULTS (BUNDLE)

ZIP folder structure (RO Bundle), https://w3id.org/bundle

• mimetype: application/vnd.wf4ever.robundle+zip
• .ro/manifest.json
• workflowrun.prov.ttl (RDF)
• outputA.txt, outputC.jpg
• outputB/: 1.txt, 2.txt, 3.txt
• intermediates/: de/def2e58b-50e2-4949-9980-fd310166621a.txt
• Aggregating in Research Object: inputA.txt, workflow, URI references, attribution, execution environment
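
The bundle layout can be approximated with Python's standard `zipfile` module. This is a minimal sketch under stated assumptions, not the Taverna exporter: the function name `write_bundle` is hypothetical, the manifest keys (`@context`, `id`, `aggregates`, `uri`) and the context URL are assumptions about the RO Bundle manifest format that should be checked against the specification at https://w3id.org/bundle, and writing `mimetype` first and uncompressed follows the convention of similar ZIP-based formats.

```python
import io
import json
import zipfile

MIMETYPE = "application/vnd.wf4ever.robundle+zip"


def write_bundle(target, files):
    """Write a ZIP with a mimetype entry, the given files and .ro/manifest.json.

    target: a file path or a writable binary file object.
    files:  dict mapping a path inside the bundle to its content as bytes.
    """
    manifest = {
        # Key names and context URL are assumptions; verify against the spec.
        "@context": ["https://w3id.org/bundle/context"],
        "id": "/",
        "aggregates": [{"uri": "/" + name} for name in sorted(files)],
    }
    with zipfile.ZipFile(target, "w", zipfile.ZIP_DEFLATED) as bundle:
        # The mimetype entry goes first and is stored without compression.
        bundle.writestr("mimetype", MIMETYPE, compress_type=zipfile.ZIP_STORED)
        for name in sorted(files):
            bundle.writestr(name, files[name])
        bundle.writestr(".ro/manifest.json", json.dumps(manifest, indent=2))


# Usage: build a small bundle in memory.
buffer = io.BytesIO()
write_bundle(buffer, {
    "workflowrun.prov.ttl": b"# provenance trace goes here\n",
    "outputA.txt": b"example output\n",
})
```

Keeping the manifest as plain JSON inside `.ro/` means ordinary ZIP tools can still open the bundle and read the outputs directly.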

Page 7: 2013-05-29 Taverna Provenance

ACKNOWLEDGEMENTS

• Paolo Missier – initial provenance engine for Taverna 2

• Ian Dunlop – provenance capture execution layer

• Khalid Belhajjame – ontologies

• Alexandra Nenadic – intermediates, folder structure

• W3C Provenance working group – PROV-O

• Funded by the European Commission’s 7th Framework Programme (FP7-ICT-2007-6 270192) and EPSRC platform grant EP/G026238/1

Page 8: 2013-05-29 Taverna Provenance

QUESTIONS?

Twitter: @soilandreyes

Skype: soiland

Email: [email protected]

http://soiland-reyes.com/stian/work/

http://practicalprovenance.wordpress.com/