Ingest Levels and Persistent Identification Services for R & D and heritage organisations from the German National Library | Reinhard Altenhöner and Katarina Haage, German National Library | October 2014


Ingest Levels and Persistent Identification Services for R & D and heritage organisations

from the German National Library

Reinhard Altenhöner and Katarina Haage

German National Library


Content overview

- Some background: Online publications – how do they find their way into the repository?

- Ingest levels as a guiding principle in the German National Library's digital preservation (dp) workflow

- Persistent Identification – a basic service in the APARSEN community and how do we find what we are looking for?

- AREDO – Archiving and sustainable preservation of digital objects – a cooperation service

- DNB contribution in R&D


Background: Preservation at the German National Library

– Legal deposit for German online publications since 2006

– Archival system for digital preservation developed in the German project kopal (2004 – 2007)
  - Based on DIAS by IBM, which is multi-client capable and therefore geared towards cooperation
  - Support for file format migration: object versioning, query interface for technical metadata
  - Ingest and access via an in-house open source library (koLibRI)

– Infrastructure for online publications: local repository, DIAS (externally hosted), catalogue (bibliographic metadata, user access)


Online publications and their way into the repository – a new ingest workflow

– Need for a flexible ingest process that handles a variety of object types and file formats and has digital preservation integrated

– Three delivery interfaces for publishers: OAI-PMH, web form with upload, hotfolder (FTP)

– Cooperation service for other institutions (AREDO)
  - Traceable ingest workflow
  - Predictable preservation risks
  - Quality checks


Ingest workflow


[Figure: ingest workflow diagram. Labels: "TP"; Import service; Hotfolder manager (OAI-PMH, web form); SIP launcher; Repository; SIP; didigo; Bibl. metadata; Integrity checks; Ticket system; DIAS]


Ingest indicators

- File integrity (FI): Correct checksum

- Identifiability (ID): File format could be recognized

- No restrictions (NR): No technical protection measures

- Generation of technical metadata (MD): Tool or module is available for the characterization of the recognized file format

- Validity (V): File format is valid
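The file integrity (FI) indicator can be illustrated with a minimal sketch. The slides do not say which checksum algorithm the DNB uses, so SHA-256 is an assumption here, and the function names `file_checksum` and `has_file_integrity` are hypothetical.

```python
import hashlib
from pathlib import Path


def file_checksum(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of a file, read in chunks to limit memory use.

    The algorithm is an assumption; the slides only say "correct checksum".
    """
    digest = hashlib.new(algorithm)
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def has_file_integrity(path, expected_checksum, algorithm="sha256"):
    """FI indicator: True if the computed checksum matches the delivered one."""
    return file_checksum(path, algorithm) == expected_checksum.strip().lower()
```

A delivered object would pass the FI indicator only if `has_file_integrity` returns `True` for the checksum supplied with it.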


Ingest levels and error evaluation


Ingest level  FI  ID  NR  MD  V
Level 0       X   O   O   O   O
Level 1       X   X   O   O   O
Level 2       X   X   X   O   O
Level 3       X   X   X   X   O
Level 4       X   X   X   X   X
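The cumulative pattern in the table can be sketched in a few lines. This assumes that X marks an indicator that has been satisfied and that an object reaches the highest level for which every indicator up to that column is satisfied; the slides give no legend, so both points are assumptions, and `ingest_level` is a hypothetical name.

```python
# Indicator codes in the column order of the table.
INDICATOR_ORDER = ("FI", "ID", "NR", "MD", "V")


def ingest_level(results):
    """Map indicator results (code -> bool) to an ingest level 0..4.

    Assumption: levels are cumulative, so the first failed indicator
    caps the level. Returns None if even file integrity (FI) fails.
    """
    level = -1
    for code in INDICATOR_ORDER:
        if not results.get(code, False):
            break
        level += 1
    return level if level >= 0 else None


# A recognised format with technical protection measures stops at Level 1,
# even though later indicators would pass.
print(ingest_level({"FI": True, "ID": True, "NR": False, "MD": True, "V": True}))  # 1
```

Treating the indicators as an ordered chain keeps the error evaluation simple: the first failed indicator names the preservation risk that prevents a higher level.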


Persistent Identification – how do we find what we are looking for?

- A Uniform Resource Name (URN) is a persistent identifier (PI): a unique identifier for digital objects, enabling online resources to be permanently identified and reliably cited

- APARSEN's WP22 (Citability and Identification): development of an interoperability framework for PI systems

- All APARSEN deliverables are provided with a URN and can be found in the DNB catalogue: https://portal.dnb.de/opac.htm?query=%22aparsen%22&method=simpleSearch


The DNB and its URN Service

– The German National Library assigns and administers URNs from the namespace "urn:nbn:de" and offers a URN resolving service for Germany, Austria and Switzerland

– The URN service is a free service of the German National Library

See also:

– Policy for issuing URNs in the urn:nbn:de namespace (DNB)

– Catalogue of criteria for assessing the trustworthiness of PI systems (nestor)


AREDO

ARchivierung und dauerhafte Erhaltung Digitaler Objekte (Archiving and sustainable preservation of digital objects)

Long-term preservation cooperation service in the DNB:

- Basic offers

- Optional services

- Formal structure


AREDO


Basic offers

- Provision of a mass input and output interface

- Integrity check of all digital objects

- Check for duplicates in partner's preservation repository

- Quality checking of all digital objects

- Generation of technical metadata

- Reporting

- Provision of permanent access and search functions

- Backup storage of all archived objects (triple) / archiving at multiple locations

- Rights management

- Preservation planning: ongoing risk analysis and risk information


Statistics tool: Ingest


Optional services

- Ingest:
  - Risk handling based on jointly agreed quality rules for digital objects
  - Processing of descriptive metadata supplied in Dublin Core Metadata Element Set, Version 1.1
  - Processing of additional information

- Curation:
  - Regular and comprehensive status reports on all archived digital objects
  - Agreement on suitable migration, or other safeguarding, measures
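As an illustration of processing descriptive metadata in the Dublin Core Metadata Element Set, Version 1.1, here is a minimal sketch using only the Python standard library. The namespace URI is the standard one for DC 1.1 elements; the wrapper element `record`, the sample values and the function name `read_dublin_core` are hypothetical and do not describe the format AREDO partners actually deliver.

```python
import xml.etree.ElementTree as ET

# Standard namespace of the Dublin Core Metadata Element Set, Version 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"

# Hypothetical sample record; the <record> wrapper is an assumption.
SAMPLE_RECORD = """<record xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Example report</dc:title>
  <dc:creator>Example Author</dc:creator>
  <dc:creator>Second Author</dc:creator>
  <dc:date>2014-10-01</dc:date>
</record>"""


def read_dublin_core(xml_text):
    """Collect all DC 1.1 elements into a dict of element name -> list of values."""
    root = ET.fromstring(xml_text)
    fields = {}
    for element in root.iter():
        # ElementTree expands namespaced tags to "{uri}localname".
        if element.tag.startswith("{" + DC_NS + "}"):
            name = element.tag.split("}", 1)[1]
            fields.setdefault(name, []).append((element.text or "").strip())
    return fields


print(read_dublin_core(SAMPLE_RECORD)["creator"])  # ['Example Author', 'Second Author']
```

Values are kept as lists because every Dublin Core element is repeatable.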


Adaptation to partner's requirements

AREDO allows individualised long-term preservation workflows to be set up between the partners and the German National Library which meet the specific needs of the institution in question:

[Figure labels: German National Library; Libraries; Museums; Educational Institutes; Archives]


Service model

[Figure labels: Service provider; External Provider (DIAS); Service catalogue / Underpinning Contract; Service catalogue; SLA (six times); Service taker]


Formal structure

- Cooperation contract between both organisations

- The delivering institution ensures that the submitted objects meet a certain standard (see ingest levels)

- The cooperation provider ensures that the stored objects remain accessible over the long term and is certified for its long-term preservation process (nestor certification)

- DNB acts as a not-for-profit institution: no costs other than the actual costs are charged

For more details go to: http://www.dnb.de/aredo

or contact: Karlheinz Schmitt, [email protected]


DNB and R&D: action and principles 1

- Participation in projects with a strong relation to R&D topics:
  - KEEP, SHAMAN, KOPAL, DP4lib, …

- Evaluation and preparing-the-scene projects
  - ODE, LUKII, 4C

- Umbrella projects & services
  - Nestor, Parse.insight, APARSEN


DNB and R&D: action and principles 2

- Deliver requirements and transfer results from R&D into operation
  - New project: "Using emulation for the provision of multimedia objects", based on KEEP

- Sharing knowledge, OSS tools, regular exchange

- More than before on the horizon: research data
  - A topic for the DNB as a memory organisation if the data "is part" of a publication and is openly published on the web
  - Increased need to cooperate with data producers and data centres to make use of generated data
  - VCoE as a platform?!


Questions and remarks are welcome now or later:

Reinhard Altenhöner, [email protected]

and

Katarina Haage, [email protected]

German National Library
Information Infrastructure and Preservation
Adickesallee 1
D-60322 Frankfurt
www.dnb.de