Upload
stephany-long
View
218
Download
0
Embed Size (px)
Citation preview
David Giaretta
Associate Director (Development)
Funders:
DCC Development
Digital Curation Centrea centre of expertise in data curation and preservation
Organisation to Engage & Collaborate
Industry
research collaborators
standards bodies
testbeds& tools
communities of practice: users
community support & outreach
research
development co-ordination
service definition & delivery
management & admin support
curation organisations eg DPC
Collaborative Associates Network of DataOrganisations
Development
3
Development – initial plans
• Registries/Repositories– offering a repository of tools and technical
information, a focal point for digital curators– metadata standards
• Testbeds– for testing and evaluating tools, methods,
standards and policies in realistic settings
• Certification– standards
Development
4
What can we rely on in the Long Term?
• The bits (original or migrated)– let us for the moment put to one side the issue
of BIT PRESERVATION (but it is an issue)
• Physical documents that people can read– e.g. ISO standards on paper
• Additional information we collect – either held by the DCC, its collaborators or successors
Development
5
Preservation “vs” Current Use
• There are already very many architectures to support immediate use of information– Aim to support these
• Therefore chose to be guided by– long-term preservation aspects
• try to ensure that components of the preservation architecture can supplement other “current use” architectures.
– to promote this we should emphasise “interoperability” and “automated use” as far as possible.
– based initially on OAIS Reference Model – but not limited to that
Development
6
OAIS Reference Model – Functional Model
4-1.
2
MANAGEMENT
Ingest
Data Management
SIP
AIPDIP
queries
result setsAccess
PRODUCER
CONSUMER
Descriptive Info
AIP
orders
Descriptive Info
Archival Storage
Administration
Preservation Planning
Development
7
OAIS – Preservation Planning - key aspects
• Designated Communities & Knowledge Base
• Representation Net
Development
8
Representation Net
Industry
research collaborators
standards bodies
testbeds& tools
communities of practice:
userscommunity support & outreach
research
development co-ordination
service definition
& delivery
management & admin support
Collaborative Associates Network of DataOrganisations
curation organisations eg DPC
Development
9
Representation Information vs File Format
• File Format provides only limited information– Knowing that a file is in Word 6.0 format does not allow one to
understand its contents e.g.• File contains French text • File has text with specialised terms
– Science data file (e.g. FITS) also has keywords and values• What do they mean?
• Representation Information is not limited in this way– N.B. includes File Format
• See DCC demo– Registry/Repository of Representation Info
• Low cost of “buy-in”
Development
10
Archival Information Package
Industry
research collaborators
standards bodies
testbeds& tools
communities of practice:
userscommunity support & outreach
research
development co-ordination
service definition
& delivery
management & admin support
Collaborative Associates Network of DataOrganisations
curation organisations eg DPC
Development
11
Testbeds
• Hardware used by “curators”
in the wild– Details from projects
• Hardware suppliers
• Software suppliers– Commercial– Non-commercial
Industry
research collaborators
standards bodies
testbeds& tools
communities of practice:
userscommunity support & outreach
research
development co-ordination
service definition
& delivery
management & admin support
Collaborative Associates Network of DataOrganisations
curation organisations eg DPC
Development
12
Standards and Audit & Certification
• How can people know to whom their information can be entrusted?
• OAIS follow-on standard(s) underway – on which a certification program can be based
• From the standards: – need to establish accreditation and certification bodies in
preparation for offering audit and certification services – audit, certification and accreditation are potential sources of
long term funding for the DCC – Testbeds and testing procedures
• for software certification• hardware and software systems will need to be purchased,
hired or borrowed. – we expect to work with hardware and software manufacturers to
certify hardware and software components
Industry
research collaborators
standards bodies
testbeds& tools
communities of practice:
userscommunity support & outreach
research
development co-ordination
service definition
& delivery
management & admin support
Collaborative Associates Network of DataOrganisations
curation organisations eg DPC
Development
13
Working with Others
• Digital Library Federation• The National Archives• Global Grid Forum• NARA• Library of Congress• Research Library Group• Digital Preservation Coalition• JISC community• E-Science Community• Associates Network• …and many more
Industry
research collaborators
standards bodies
testbeds& tools
communities of practice:
userscommunity support & outreach
research
development co-ordination
service definition
& delivery
management & admin support
Collaborative Associates Network of DataOrganisations
curation organisations eg DPC
Development info – see
http://dev.dcc.rl.ac.uk/twiki/bin/view
for details of Wiki and email list open to all
The Virtuous Circle
Industry
research collaborators
standards bodies
testbeds& tools
communities of practice: users
community support & outreach
research
development co-ordination
service definition & delivery
management & admin support
curation organisations eg DPC
Collaborative Associates Network of DataOrganisations