Processing of scientific data: from field capture to web delivery

Hector Quintero Casanova, Postgraduate in e-Science

DESCRIPTION

Short presentation on the lifecycle of scientific data and how it relates to the Glastir Monitoring and Evaluation Programme. The GMEP is effectively a "real-time" healthcheck system for the new Welsh agri-environment scheme Glastir.

Page 1: Processing of scientific data: from field capture to web delivery

Processing of scientific data: from field capture to web delivery

Hector Quintero Casanova, Postgraduate in e-Science

Page 2: Processing of scientific data: from field capture to web delivery

Why e-Science? Data-intensive

● GMEP ticks all the boxes:

✔ Highly multidisciplinary: social, landscape, water, birds, plants...

✔ Large volumes of data: covers the whole of Wales.

✔ Cross-organisational collaboration: 13 institutions.

Page 3: Processing of scientific data: from field capture to web delivery

Why e-Science? Metadata

● NERC's data policy says it all

– “It is essential that metadata are submitted”

● Metadata = context information about data

– Provenance = who, when, where, how

● Exposes data relationships → traceability

– Workflow = how. Essential if using models

● Enables reproducing outcome → repeatability

● Exactly what metadata is needed depends on the stage.
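The link between provenance and traceability can be illustrated with a minimal sketch. It assumes each dataset carries a who/when/where/how record plus the identifiers of the datasets it was derived from. The `Provenance` class, the `trace` function and every identifier and value in the catalogue are hypothetical and only serve the illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Provenance:
    """Context for one dataset: who, when, where, how (hypothetical layout)."""
    who: str
    when: str   # ISO 8601 date, kept as a string for simplicity
    where: str
    how: str    # method or workflow step that produced the data
    derived_from: List[str] = field(default_factory=list)  # parent dataset ids


def trace(dataset_id: str, catalogue: Dict[str, Provenance]) -> List[str]:
    """Return dataset_id followed by all of its ancestors, nearest first."""
    lineage, queue, seen = [], [dataset_id], set()
    while queue:
        current = queue.pop(0)
        if current in seen:
            continue
        seen.add(current)
        lineage.append(current)
        queue.extend(catalogue[current].derived_from)
    return lineage


# Made-up catalogue: a model output derived from cleaned data derived from raw data.
catalogue = {
    "raw-001": Provenance("field team A", "2013-06-01", "site 12", "soil core"),
    "clean-001": Provenance("analyst B", "2013-06-10", "lab", "unit check", ["raw-001"]),
    "model-001": Provenance("modeller C", "2013-07-02", "cluster", "model run", ["clean-001"]),
}

print(trace("model-001", catalogue))  # ['model-001', 'clean-001', 'raw-001']
```

Walking the `derived_from` links is what exposes the data relationships: any information product can be followed back to the raw data it came from.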

Page 4: Processing of scientific data: from field capture to web delivery

Data collection

● Raw data from the field

– Metadata: method, calibration, place, units...
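One way to enforce that metadata are submitted along with raw field data is a simple completeness check at capture time. The sketch below assumes a reading is a dictionary with a `metadata` entry. The field names come from the slide, while the `missing_metadata` function, the record layout and the sample values are hypothetical.

```python
# Metadata fields named on the slide; treating them all as mandatory is an assumption.
REQUIRED_FIELDS = ("method", "calibration", "place", "units")


def missing_metadata(record: dict) -> list:
    """List the required metadata fields that are absent or empty."""
    metadata = record.get("metadata", {})
    return [name for name in REQUIRED_FIELDS if not metadata.get(name)]


# Made-up reading that lacks calibration information.
reading = {
    "value": 6.4,
    "metadata": {"method": "pH probe", "place": "site 12", "units": "pH"},
}

print(missing_metadata(reading))  # ['calibration']
```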

Page 5: Processing of scientific data: from field capture to web delivery

Data analysis

● Information products: e.g. data from models

– Metadata: name, conditions, where it applies

Page 6: Processing of scientific data: from field capture to web delivery

Data analysis

● Workflow metadata avoids costly reruns

– Identify model output needed → reuse

● But not enough for cross-organisation collaboration.

– 13 institutions in Glastir.

– Differences in storage structure, metadata definitions...

● Need extra layer(s) for seamless access

– Web already offers tools needed.
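How workflow metadata can avoid costly reruns is sketched below, assuming a model run is fully described by the model name and its parameters. A fingerprint of that metadata identifies the output needed, so an existing output is reused instead of being recomputed. `workflow_key`, `OutputStore` and `toy_model` are hypothetical names, and the toy model stands in for an expensive run.

```python
import hashlib
import json
from typing import Callable, Dict


def workflow_key(model: str, params: dict) -> str:
    """Stable fingerprint of a model run built from its workflow metadata."""
    payload = json.dumps({"model": model, "params": params}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class OutputStore:
    """Keeps model outputs indexed by the metadata of the run that made them."""

    def __init__(self) -> None:
        self._outputs: Dict[str, object] = {}
        self.runs = 0  # number of times a model was actually executed

    def get_or_run(self, model: str, params: dict, run: Callable[[dict], object]):
        key = workflow_key(model, params)
        if key not in self._outputs:
            self.runs += 1
            self._outputs[key] = run(params)
        return self._outputs[key]


def toy_model(params: dict) -> float:
    # Stand-in for an expensive model; the formula is purely illustrative.
    return params["rainfall_mm"] * params["runoff_fraction"]


store = OutputStore()
first = store.get_or_run("toy-runoff", {"rainfall_mm": 100, "runoff_fraction": 0.5}, toy_model)
# Same metadata in a different key order: the stored output is reused.
second = store.get_or_run("toy-runoff", {"runoff_fraction": 0.5, "rainfall_mm": 100}, toy_model)

print(first, second, store.runs)  # 50.0 50.0 1
```

A store like this only works within one organisation's conventions, which is why the extra layers mentioned above are needed once several institutions are involved.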

Page 7: Processing of scientific data: from field capture to web delivery

Publication: linked data

● HTTP for generic retrieval of resources

● URIs for unique identification of those resources

– E.g. http://www.ceh.ac.uk

● Both can be used to build web services

– These amount to remote functions.

– E.g. seamless recording of workflows across institutions.

● Semantics for automated reasoning

– Acts as standardised metadata aimed at machines.
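The combination of URIs and machine-oriented semantics can be sketched as a small JSON-LD description of a dataset, using terms from the W3C PROV vocabulary. This is an illustration under assumptions, not the format used by GMEP: the `describe_dataset` function and the `example.org` URIs are hypothetical, and using http://www.ceh.ac.uk as the identifier of the responsible organisation is assumed for the example.

```python
import json


def describe_dataset(dataset_uri: str, source_uri: str, agent_uri: str) -> dict:
    """Build a JSON-LD description linking a dataset to its source and owner."""
    return {
        "@context": {"prov": "http://www.w3.org/ns/prov#"},
        "@id": dataset_uri,                        # URI uniquely identifies the resource
        "@type": "prov:Entity",
        "prov:wasDerivedFrom": {"@id": source_uri},    # traceability link
        "prov:wasAttributedTo": {"@id": agent_uri},    # who is responsible
    }


doc = describe_dataset(
    "http://example.org/data/model-001",   # hypothetical dataset URI
    "http://example.org/data/raw-001",     # hypothetical source URI
    "http://www.ceh.ac.uk",                # assumed here to identify the organisation
)

print(json.dumps(doc, indent=2))
```

Because every value is a URI and every property comes from a shared vocabulary, software at another institution can follow the links over HTTP and interpret them without knowing the local storage structure.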

Page 8: Processing of scientific data: from field capture to web delivery

… We've come full circle!

¿?

Page 9: Processing of scientific data: from field capture to web delivery

Hector Quintero Casanova, Postgraduate in e-Science

Thank you

www.hqcasanova.com