Page 1: Digital Curation Centre: tools and services under development David Giaretta Associate Director (Development) Funders: Digital Curation Centre a centre

Digital Curation Centre: tools and services under development

David Giaretta, Associate Director (Development)

Funders:

Digital Curation Centre: a centre of expertise in data curation and preservation

Page 2:

Organisation

Industry

research collaborators

standards bodies

testbeds & tools

communities of practice: users

UKOLN

U of Edinburgh

CCLRC

U of Glasgow

U of Edinburgh

curation organisations, e.g. DPC

Collaborative Associates Network of Data Organisations

Page 3:

Organisation

Industry

research collaborators

standards bodies

testbeds & tools

communities of practice: users

community support & outreach

research

development co-ordination

service definition & delivery

management & admin support

curation organisations, e.g. DPC

Collaborative Associates Network of Data Organisations

Page 4:

[Diagram of DCC partners and associated organisations. Group labels: Research Councils; HEIs & FE; Research Institutes; International Collaborations; Standards Bodies.]

Partners: CCLRC; UKOLN; UofG; UofE

Organisations named in the diagram: CMS-Bristol; NIEeS; RG; Durham; WT-CFG; Leicester; IC; Maastricht; Oxford; Dutch NA; Swiss NA; Urbino; UNC; Salzburg; SDSC; NEODC; CEH; RI; NCS; RLG; Innogen; NHS; Capri; NTUA; INRIA; HUJ; UPC; Max-Planck; MIMAS; IASSIST; LDC; ACM; Data Archive; EDG; GridPP; EGEE; Cambridge; Jodrell Bank; DLI (US); DPC; DELOS; ESA; NASA; NARA; CNES; BNSC; TU Vienna; UPenn; EBI; MRC HGU; Kyoto; USC; GSK; Roslin; IBM Almaden; JHU; CSIRO; Caltech; CDS; ESO; OCLC; AHDS; Microsoft; IBM; Oracle; BT; STK; BADC; BODC; IVOA; ILRT; Council for Museums, Archives & Libraries; RDN; So’ton; OAI; NOF; NLA; NeSC

Page 5:

Overview

• Developing tools and services which will be needed in the short to medium term
  – integrating tools from many sources
• Will be new DCC services as well as useable separately by other projects
• Strongly OAIS based
• Support automated processing & interoperability

Page 6:

OAIS Reference Model – Functional Model

[Figure: OAIS functional model. External entities: PRODUCER, CONSUMER, MANAGEMENT. Functional entities: Ingest, Archival Storage, Data Management, Access, Administration, Preservation Planning. Labelled flows: SIP, AIP, DIP, Descriptive Info, queries, result sets, orders.]
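
As a rough illustration of the functional model, the sketch below traces one object from Producer to Consumer: a SIP is ingested, stored as an AIP with its Descriptive Info recorded separately, found by a query, and delivered as a DIP. All class and method names (`ToyArchive`, `ingest`, `query`, `order`) are hypothetical and are not taken from OAIS or any DCC tool; Administration, Preservation Planning and Management are left out.

```python
from dataclasses import dataclass


@dataclass
class SIP:
    """Submission Information Package, sent by the Producer."""
    content: bytes
    description: str


@dataclass
class AIP:
    """Archival Information Package, held in Archival Storage."""
    aip_id: str
    content: bytes
    description: str


@dataclass
class DIP:
    """Dissemination Information Package, sent to the Consumer."""
    aip_id: str
    content: bytes


class ToyArchive:
    """Hypothetical in-memory archive; illustrative only."""

    def __init__(self) -> None:
        self.archival_storage = {}   # aip_id -> AIP
        self.data_management = {}    # aip_id -> Descriptive Info

    def ingest(self, sip: SIP) -> str:
        # Ingest turns the SIP into an AIP and records its description.
        aip_id = f"aip-{len(self.archival_storage) + 1}"
        self.archival_storage[aip_id] = AIP(aip_id, sip.content, sip.description)
        self.data_management[aip_id] = sip.description
        return aip_id

    def query(self, term: str) -> list:
        # Access: return a result set of matching AIP identifiers.
        return [i for i, d in self.data_management.items() if term in d]

    def order(self, aip_id: str) -> DIP:
        # Access: build a DIP from the stored AIP for the Consumer.
        aip = self.archival_storage[aip_id]
        return DIP(aip.aip_id, aip.content)
```

A Consumer would typically call `query` first and then `order` one of the returned identifiers.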

Page 7:

Representation Net

Page 8:

Representation Information Classification

Page 9:

Representation Information vs Format

• Format = Structure

• Omits important information, e.g.
  – Language, terminology
  – Encryption

• Need to know more than just Format in order to stand a chance of being in a position to use the information
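
The distinction can be sketched as a record in which the format is only one field. This is a minimal illustration, not a DCC schema: the class `RepresentationInfo`, its field names, and the helper `missing_beyond_format` are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class RepresentationInfo:
    """Illustrative record; field names are assumptions, not a DCC schema."""
    structure: str                                    # the "format", e.g. "CSV"
    language: Optional[str] = None                    # natural language of the content
    terminology: dict = field(default_factory=dict)   # meaning of terms or columns
    encryption: Optional[str] = None                  # scheme in use, or "none"; None = unknown


def missing_beyond_format(info: RepresentationInfo) -> list:
    """List what is still unknown when only the structure is recorded."""
    missing = []
    if info.language is None:
        missing.append("language")
    if not info.terminology:
        missing.append("terminology")
    if info.encryption is None:
        missing.append("encryption")
    return missing
```

With only `structure` filled in, every other item is reported as missing, which is the situation the slide describes when a file is labelled with a format and nothing else.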

Page 10:

Layered Model from OAIS

More easily applicable to Science data

Page 11:

Representation Information - High Level View

Example of use of Representation Information Labelling

Page 12:

Registry/Repository

• Interface and protocols
  – JAXR “standard”
  – freebXML implementation
  – many access methods
    • URL
    • Web Services
    • API
    • etc.
• Findability
  – Persistent IDs
• What can we rely on?
  – Labels (to support automated processing)
• Initial service this Summer
  – Hope to work with PRONOM 4 & GDFR
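
The roles of persistent IDs and labels can be illustrated with a toy in-memory registry. This is only a sketch under stated assumptions: it does not use the JAXR or freebXML APIs, and `RegistryEntry`, `InMemoryRegistry`, and the identifier and URL in the usage note are hypothetical.

```python
from typing import Dict, List, NamedTuple


class RegistryEntry(NamedTuple):
    persistent_id: str   # hypothetical identifier scheme
    labels: frozenset    # machine-readable labels for automated processing
    location: str        # where the Representation Information is held


class InMemoryRegistry:
    """Toy stand-in for a registry; not the JAXR or freebXML API."""

    def __init__(self) -> None:
        self._entries: Dict[str, RegistryEntry] = {}

    def register(self, entry: RegistryEntry) -> None:
        # Re-registering the same ID replaces the entry, e.g. after a move.
        self._entries[entry.persistent_id] = entry

    def resolve(self, persistent_id: str) -> RegistryEntry:
        # The identifier stays fixed even if the location changes.
        return self._entries[persistent_id]

    def find_by_label(self, label: str) -> List[RegistryEntry]:
        # Automated tools select entries by label rather than by free text.
        return [e for e in self._entries.values() if label in e.labels]
```

For example, after `register(RegistryEntry("ri:example:0001", frozenset({"structure"}), "https://example.org/ri/0001"))`, a client that holds only the ID can call `resolve("ri:example:0001")` to find the current location.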

Page 13:

Registry/ Repository

• Trusted repository of Rep. Info
  – Authenticity of info
  – Access control
  – Certificates/Digests: (are they trustable over the long term?)
• Extensibility
• Distributed
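
The Certificates/Digests point can be made concrete with a small fixity check. The sketch below uses Python's standard `hashlib`; the choice of SHA-256 and the function names are assumptions, since the slide names no algorithm. Storing the algorithm name next to the digest is one way to respond to the long-term question: if an algorithm comes to be considered weak, a new digest can be computed and recorded alongside or in place of the old one.

```python
import hashlib


def compute_digest(data: bytes, algorithm: str = "sha256") -> str:
    """Return the hex digest of `data` under the named hashlib algorithm."""
    return hashlib.new(algorithm, data).hexdigest()


def make_fixity_record(data: bytes, algorithm: str = "sha256") -> dict:
    # Keep the algorithm name with the digest so it can be replaced
    # if the algorithm is later considered too weak.
    return {"algorithm": algorithm, "digest": compute_digest(data, algorithm)}


def is_unchanged(data: bytes, record: dict) -> bool:
    """True if `data` still matches the digest stored in `record`."""
    return compute_digest(data, record["algorithm"]) == record["digest"]
```

A digest on its own only shows that the bytes have not changed since the record was made; it says nothing about who made the record, which is where certificates and access control come in.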

Page 14:

Certification

• RLG task force preparing draft standard
  – Based on OAIS (plus TDR)
  – Expect this to become an ISO standard
• Tool:
  – Checklist and reports
  – …
  – Awaiting release of draft (in May)

Page 15:

Archival Information Package

• METS

• XFDU Packaging

• Expect tools available by end of year

Page 16:

Preservation Description Info

Will be working with PREMIS on tools

Page 17:

DCC Development Roadmap for next 6-12 months

• Registry
  – Complete phase 1
  – Include links to TNA/PRONOM
  – Hand over to Services group
  – Start Phase 2: aim for “Trusted Repository” status
• Representation Information:
  – Data descriptions of science data using EAST (http://east.cnes.fr) & others
  – Import other Structure description tools and Data Dictionary tools
  – Develop Mapping to data object level
  – Work with other projects, e.g. Emulation, Processing
• Certification
  – Draft certification
    • Checklist
    • Proposed standard
• Additional Tools
  – Metadata extraction tool set
  – Ingest tool (based on PAIMAS standard)
• Testbeds, e.g. large scale data management tools

Page 18:

Research

• To draw together the various functions of curation, from the traditional archival functions to the maintenance and publication of evolving knowledge as seen in scientific databases.

• To identify through direct research collaboration, and through interaction with the service arm of DCC, the key projects in which research is needed.

• To conduct research in areas already identified by the partners as crucial to digital curation.

• To institute two-way conduits between research and service in which practical issues can be drawn to the attention of researchers and the products of research can be tested in practice.

Page 19:

Current research priorities

• Data integration and publication
• Performance and optimisation
• Annotation
• Appraisal and long-term preservation
• Socio-economic and legal context: rights, responsibilities and viability
• Cost-benefit analysis of the data curation process
• Security: safe and effective data analysis environments
• Automation of metadata extraction
• Visitors Programme and Seminar Series

Page 20:

Summary

• Developing and integrating OAIS based tools

• Reviewing other related tools
• See http://www.dcc.ac.uk
  – a Development Web site (http://dev.dcc.rl.ac.uk) with a Wiki and an associated open email list has also been set up
  – aim to encourage the widest possible collaboration with other projects
• In the medium to long term, expect tools from DCC Research activities, e.g. Annotation

