Advanced Climate Research Infrastructure for Data (ACRID)


Dr. Andrew Woolf (1), Dr. Tim Osborn (2), Dr. Arif Shaon (1), Dr. Colin Harpham (2)
(1) STFC e-Science Centre, RAL
(2) Climatic Research Unit, UEA


JISC 14/09

• Citing: “Agreed conventions for data citation and for data description are important for research data discovery. Persistent identification is required...”

• Linking: “A recent position paper written for JISC ... makes a case for the benefits of linking research data using semantic or linked data technology ... data on which a journal article is based are bi-directionally linked to other data, resources, articles and people.”

• Integrating: “Integrating heterogeneous data across distributed sources can enable effective and innovative reuse”

ACRID

• Advanced Climate Research Infrastructure for Data
• Collaboration between:
– Climatic Research Unit, University of East Anglia
– STFC e-Science Centre, Rutherford Appleton Laboratory
– Met Office (unfunded partner)
• Various inquiries following 2009 email hacking recommended greater access to data and workings
• Project aims:
– Information architecture, tools, infrastructure for managing climate data and processing workflows
– ‘linked-data’ approach for climate data publishing and citation
– Prototype using four high-profile climate datasets

Citing

• Convergence around DOI for linking publication to data in Earth science
– DataCite, Parsons and Duerr (2010), Wilson et al. (2010), UNESCO (2010), ESSD, etc.
• But “(w)hat is the citeable unit within a DOI?”
– file? set of files? OAIS AIP?
• Answer: linked-data graph
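
One way to read the "linked-data graph" answer: the identifier points at a root resource, and the citeable unit is the set of statements reachable from that root. The sketch below illustrates this in plain Python. It is a minimal illustration under stated assumptions, not the ACRID implementation: the example.org URIs, the sample triples, the `derivedFrom` predicate and the `citeable_graph` helper are all hypothetical. Only `ore:aggregates` (OAI-ORE) and `dcterms:title` are real vocabulary terms.

```python
from collections import deque

# Real vocabulary terms.
ORE_AGGREGATES = "http://www.openarchives.org/ore/terms/aggregates"
DCT_TITLE = "http://purl.org/dc/terms/title"
# Hypothetical predicate, made up for this sketch.
DERIVED_FROM = "http://example.org/terms/derivedFrom"

# Hypothetical (subject, predicate, object) triples; every example.org URI is invented.
TRIPLES = [
    ("http://example.org/dataset/1", DCT_TITLE, "Example temperature dataset"),
    ("http://example.org/dataset/1", ORE_AGGREGATES, "http://example.org/file/a"),
    ("http://example.org/dataset/1", ORE_AGGREGATES, "http://example.org/file/b"),
    ("http://example.org/file/a", DERIVED_FROM, "http://example.org/station/42"),
    ("http://example.org/dataset/2", ORE_AGGREGATES, "http://example.org/file/c"),
]


def citeable_graph(root, triples):
    """Return every triple reachable from `root` by following subject -> object links."""
    by_subject = {}
    for s, p, o in triples:
        by_subject.setdefault(s, []).append((s, p, o))

    seen = {root}
    queue = deque([root])
    graph = []
    while queue:
        node = queue.popleft()
        for triple in by_subject.get(node, []):
            graph.append(triple)
            obj = triple[2]
            if obj not in seen:  # visit each node once, so cycles terminate
                seen.add(obj)
                queue.append(obj)
    return graph


# The graph a citation of dataset/1 would cover in this toy example.
graph = citeable_graph("http://example.org/dataset/1", TRIPLES)
```

Under this reading, citing `dataset/1` covers its title, its two aggregated files and the station that one file derives from, but nothing belonging to `dataset/2`.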

Linking

Integrating

• An example information model for Observations and Measurements (ISO/DIS 19156)
– An observation is an event that estimates an observed property of a feature of interest, using a procedure, and generating a result

[Figure: UML class diagram "Figure 5 - Observation" (Package: AIP-3, Version: 1.0, Author: Simon Cox). OM_Observation has attributes phenomenonTime, resultTime, validTime [0..1], resultQuality [0..*] and parameter [0..*]. Its associations are observedProperty (1) to GFI_PropertyType, featureOfInterest (1) to GFI_Feature (inverse role propertyValueProvider, 0..*), procedure (1) to OM_Process (inverse role generatedObservation, 0..*), and result to Any via the Range association.]
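
The observation model in the figure can be sketched as a small data structure. The Python below is a rough illustration, not a conformant ISO 19156 encoding: the `Observation` class, its `describe` method, the use of plain strings for the associated classes, and the example.org values are all assumptions made for this sketch. The field names mirror the attributes and association roles shown in the figure.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Any, Optional


@dataclass
class Observation:
    """Simplified stand-in for OM_Observation; associated classes are reduced to URIs."""
    feature_of_interest: str          # featureOfInterest -> GFI_Feature, exactly 1
    observed_property: str            # observedProperty -> GFI_PropertyType, exactly 1
    procedure: str                    # procedure -> OM_Process, exactly 1
    result: Any                       # result -> Any
    phenomenon_time: datetime         # phenomenonTime
    result_time: datetime             # resultTime
    valid_time: Optional[tuple] = None                  # validTime [0..1]
    result_quality: list = field(default_factory=list)  # resultQuality [0..*]
    parameter: dict = field(default_factory=dict)       # parameter [0..*]

    def describe(self):
        """Restate the observation in the words of the slide's definition."""
        return (
            f"estimate of {self.observed_property} "
            f"for {self.feature_of_interest} "
            f"using {self.procedure}: {self.result}"
        )


# Hypothetical example values; none of these identifiers come from ACRID.
obs = Observation(
    feature_of_interest="http://example.org/station/42",
    observed_property="http://example.org/property/air_temperature",
    procedure="http://example.org/procedure/thermometer",
    result=11.5,
    phenomenon_time=datetime(2010, 1, 1, 12, 0),
    result_time=datetime(2010, 1, 2),
)
```

The optional and repeatable attributes default to empty values, which matches their [0..1] and [0..*] multiplicities; the three associations with multiplicity 1 are required arguments.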

Climate Science Modelling Language (CSML)

References

ACRID
• http://www.cru.uea.ac.uk/cru/projects/acrid
• http://www.jisc.ac.uk/whatwedo/programmes/mrd.aspx

Linked data
• http://linkeddata.org
• Tim Berners-Lee: Linked Data – Design Issues, http://www.w3.org/DesignIssues/LinkedData.html
• Bizer, C., T. Heath and T. Berners-Lee (2009): Linked Data – The Story So Far, http://tomheath.com/papers/bizer-heath-berners-lee-ijswis-linked-data.pdf

URI structure
• W3C: Cool URIs don’t change, http://www.w3.org/Provider/Style/URI
• Cabinet Office (2009): Designing URI Sets for the Public Sector, http://www.cabinetoffice.gov.uk/media/308995/public_sector_uri.pdf
• Cabinet Office (2010): Designing URI Sets for Location

CSML
• http://ndg.nerc.ac.uk/csml

OAI-ORE
• http://www.openarchives.org/ore

Questions?

andrew.woolf@stfc.ac.uk
